    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH003622. Human spasmolytic...[gi:1477544] Links  


LOCUS       HUMSPAS1                1603 bp    DNA     linear   PRI 02-AUG-1996
DEFINITION  Human spasmolytic polypeptide (SP) gene, 5' region and exon 1.
ACCESSION   U47289
VERSION     U47289.1  GI:1477540
KEYWORDS    .
SEGMENT     1 of 4
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1603)
  AUTHORS   Tomasetto,C., Rio,M.C., Gautier,C., Wolf,C., Hareuveni,M.,
            Chambon,P. and Lathe,R.
  TITLE     hSP, the domain-duplicated homolog of pS2 protein, is co-expressed
            with pS2 in stomach but not in breast carcinoma
  JOURNAL   EMBO J. 9 (2), 407-414 (1990)
  MEDLINE   90151615
   PUBMED   2303034
REFERENCE   2  (bases 1 to 1603)
  AUTHORS   Seib,T., Hilgert,K., Seifert,M., Dooley,S. and Welter,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-JAN-1996) Thomas Seib, Inst. fuer Humangenetik,
            Universitaet des Saarlandes, Oskar-Orth-Str., Homburg, 66421,
            Germany
FEATURES             Location/Qualifiers
     source          1..1603
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="21"
                     /map="21q22.3"
     TATA_signal     1269..1273
                     /gene="SP"
     exon            1337..1415
                     /gene="SP"
                     /number=1
     intron          1416..>1603
                     /gene="SP"
                     /number=1
BASE COUNT      340 a    404 c    508 g    343 t      8 others
ORIGIN      
        1 tgaaagatgg gggcntggga ggatgaggtt ctggaggatg aggttctgga ggagcagggg
       61 cctggaggat gagggcctgg aggacaagag cctggaggac aggggcctga aagatggggg
      121 cctggaggat ggggcctgga ggatgggggc ctggaggacg ggggcctgga ggatgatggc
      181 ctgaaggatg aaacctggag gatggagcct ggaggatgga gcctggagga tgagggccta
      241 gaggatgggg gcctggagga caggggccta gaggatggag ccctggagaa tgggggcctg
      301 gaggatggag cctgggggac gagggcctgg aggatggggg cctggagtat ggggcctggc
      361 tctccagacc ctgaggacag ccccgtggtg gaccaaggat tttgggtttc ctgcgtctct
      421 gggcccctga ctgctcaacc atcagaaaca gactggcaac cccctgtcat ttccctggcg
      481 tggggaactt cgggtcccct ctgtccttcc caccacactt ttccctcttt ctttccgggt
      541 gtctactctc tggcttctgt cttctctgtc aggtccacag aatccttctc cagcacatcc
      601 taccccagga aggccatggg ctgggtccca ggtgccatct ttcagaagat gtagagcatt
      661 cccatggaac aaaaataacc catttcaggg gttggctgaa aatgaactta ttaaaacctg
      721 cctgtcacag gctactccgc tgaccctgtc agcctcatct ccatggagag cagcccctcc
      781 tgctgaagat gggacaaagg gcatcgtgct gcggttgggg aggctctaac cacagccctg
      841 ggagcagtct cttacctcct ctgagatgct tcccttcctc agggagggga cttttccatg
      901 ctatctgctg gcctgtacat tttccccagt aaacttggcc ctaatatttt ctaaattcct
      961 gtggtccctg cccactctat caatagaaat gcatagctta tcccttcctg ggtgtgaccc
     1021 tgtgtgtgcc cagccccaga cctgcacgtg gccggttttc cacgctggca gcctggcatg
     1081 acccaactct ctgtccaggg caggaagagg tatcaccgag cagggagaga gtcaccctgg
     1141 cccggaagcc tcgcctgcac agggcacagc tgcctcttgc ctcctcttcg cctccacggt
     1201 ggaagggctg gggccacggg gcagagaaga aaggttatct ctgcttgttg gacaaacaga
     1261 ggggagatta taaaacatac ccggcagtgg acaccatgca ttctgcaagc caccctgggg
     1321 tgcagctgag ctagacatgg gacggcgaga cgcccagctc ctggcagcgc tcctcgtcct
     1381 ggggctatgt gccctggcgg ggagtgagaa accctgtaag tgaaggagag ggtcttttta
     1441 tgtgctttct ttatttctct taaagaaaaa aaaaaagnac aaccataant taanttgaga
     1501 gggggaatgg ttataaaggn atctggaaat gtgtgttgtc canatgggat tggccactgn
     1561 tcaggagggt ggntccaaga agggcctccc tcctagggaa agg
//
LOCUS       HUMSPAS2                 726 bp    DNA     linear   PRI 02-AUG-1996
DEFINITION  Human spasmolytic polypeptide (SP) gene, exon 2.
ACCESSION   U47290
VERSION     U47290.1  GI:1477541
KEYWORDS    .
SEGMENT     2 of 4
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 726)
  AUTHORS   Tomasetto,C., Rio,M.C., Gautier,C., Wolf,C., Hareuveni,M.,
            Chambon,P. and Lathe,R.
  TITLE     hSP, the domain-duplicated homolog of pS2 protein, is co-expressed
            with pS2 in stomach but not in breast carcinoma
  JOURNAL   EMBO J. 9 (2), 407-414 (1990)
  MEDLINE   90151615
   PUBMED   2303034
REFERENCE   2  (bases 1 to 726)
  AUTHORS   Seib,T., Hilgert,K., Seifert,M., Dooley,S. and Welter,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-JAN-1996) Thomas Seib, Inst. fuer Humangenetik,
            Universitaet des Saarlandes, Oskar-Orth-Str., Homburg, 66421,
            Germany
FEATURES             Location/Qualifiers
     source          1..726
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="21"
                     /map="21q22.3"
     intron          <1..198
                     /gene="SP"
                     /number=1
     exon            199..348
                     /gene="SP"
                     /number=2
     intron          349..>726
                     /gene="SP"
                     /number=2
BASE COUNT      151 a    170 c    226 g    174 t      5 others
ORIGIN      
        1 tttaaactaa tataacaatt taagcaaagc ctatcggctt ctcaggagga aancgcattg
       61 cttaaatatg ggcaagataa gactttgtgt ttctctatgt ggcaacaaga cagtagaggc
      121 atcccctaga acctctgaga gaaggagcag tcgtggtctg gggtaccagg gtggggccga
      181 ctgagggtct ttccacagcc ccctgccagt gctccaggct gagcccccat aacaggacga
      241 actgcggctt ccctggaatc accagtgacc agtgttttga caatggatgc tgtttcgact
      301 ccagtgtcac tggggtcccc tggtgtttcc accccctccc aaagcaaggt aatcttccag
      361 ggaatcttcc tgggccagca gctggcaacc caggacccag cttcacaggc ggagcccaga
      421 gcaggggccg gaggaggccc agttgctagt ctagggttag cctgggtggg ttagtctcga
      481 gctagccccg gttggttagt ctggggctag cccaggttgg ttagtctaga gctagcccag
      541 gttggttagt ctggggctag cccaggttgg ttagtctggg gctancaggt tggttagtct
      601 agggctagtg taggctagtt agtctaaggc tagcccaggt tgggttagtt ttgagctacg
      661 caggttgggt nagctggggc tagtacnagg ttggtaagct ggagctatgg tnagtctggg
      721 gctacc
//
LOCUS       HUMSPAS3                1329 bp    DNA     linear   PRI 02-AUG-1996
DEFINITION  Human spasmolytic polypeptide (SP) gene, exon 3.
ACCESSION   U47291
VERSION     U47291.1  GI:1477542
KEYWORDS    .
SEGMENT     3 of 4
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1329)
  AUTHORS   Tomasetto,C., Rio,M.C., Gautier,C., Wolf,C., Hareuveni,M.,
            Chambon,P. and Lathe,R.
  TITLE     hSP, the domain-duplicated homolog of pS2 protein, is co-expressed
            with pS2 in stomach but not in breast carcinoma
  JOURNAL   EMBO J. 9 (2), 407-414 (1990)
  MEDLINE   90151615
   PUBMED   2303034
REFERENCE   2  (bases 1 to 1329)
  AUTHORS   Seib,T., Hilgert,K., Seifert,M., Dooley,S. and Welter,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-JAN-1996) Thomas Seib, Inst. fuer Humangenetik,
            Universitaet des Saarlandes, Oskar-Orth-Str., Homburg, 66421,
            Germany
FEATURES             Location/Qualifiers
     source          1..1329
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="21"
                     /map="21q22.3"
     intron          <1..946
                     /gene="SP"
                     /number=2
     exon            947..1093
                     /gene="SP"
                     /number=3
     intron          1094..>1329
                     /gene="SP"
                     /number=3
BASE COUNT      295 a    347 c    381 g    305 t      1 others
ORIGIN      
        1 tctagaggta acccaggtca gccaacagtg agatgaaaat ttcccaccta ccctgtttct
       61 acactgttag ttctttcaac agacatgtgt gtgtggagcc atcagtttta ctttagttga
      121 gaaaaaaata tatatatata tagtaggtct cctctagttt ttgaagtgtg acttctgaag
      181 aagcttccat ggggaaatga aggtatttaa taggacagca gtaacataag ggctgacagc
      241 cctcaaatgt tagggaagga agtgaagcct tctagggttc tttgggagtg agttttatgt
      301 tagtgcacgg gatcaggacc caagttgtaa cgccgacgag tgctcaaagg aaggttgtgt
      361 gtgtgtcgtg cacctgtgtg cgtggaacca ggcacgtcct ctggagaagg aggattcatc
      421 cccaagattg ttgctgggag gcttgctggg ccccgcaggg aaaccaggca gatggtggat
      481 tgttcacgag cgcccactga atggcagtgt ctttgggaat caataccatg tccaaacgct
      541 ttccatctta cccaggtgcc cacaaacctt ttctcatctt ggcccggggg accaacccca
      601 tttactgaga acactgagtc ccgagaggca aaatgatttc cccaaaggcg ggggactccc
      661 agagctcctg actgtgacca ccccacatgg gccccaactn cgcggaggac aggccagcca
      721 agcgtcgctg gggccgacac ttccacagtc cccgggggag gcggtcccag gggccgacac
      781 ttccacagtc cccgggggag gccgtcccgg gggatgctgc cccaggcagc acctcatgat
      841 ccacggaggc tgcaaatcag cgctgctctc agaggaggaa ggggtggagc tttccagggc
      901 acagcaggcc tgactgggtc tcggtgctgt gcctgtccca tggcagagtc ggatcagtgc
      961 gtcatggagg tctcagaccg aagaaactgt ggctacccgg gcatcagccc cgaggaatgc
     1021 gcctctcgga agtgctgctt ctccaacttc atctttgaag tgccctggtg cttcttcccg
     1081 aagtctgtgg aaggtaacgt cgctgtggga ctctctgtct ggttcccgga caccatgatt
     1141 cctcctccgt ccgtagaggt ggggtgcagg gaggggagct gcctcgctgc ctcagtgcca
     1201 tcgagccagg gcccctgcct cctatgggat tctgaaggca attccagaat gttcttggca
     1261 aagacagcgt ctattcaata agtttatagc ctccagcatt gccactgcgt catctgtgat
     1321 ggttctaga
//
LOCUS       HUMSPAS4                1057 bp    DNA     linear   PRI 02-AUG-1996
DEFINITION  Human spasmolytic polypeptide (SP) gene, exon 4 and complete cds.
ACCESSION   U47292
VERSION     U47292.1  GI:1477543
KEYWORDS    .
SEGMENT     4 of 4
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1057)
  AUTHORS   Tomasetto,C., Rio,M.C., Gautier,C., Wolf,C., Hareuveni,M.,
            Chambon,P. and Lathe,R.
  TITLE     hSP, the domain-duplicated homolog of pS2 protein, is co-expressed
            with pS2 in stomach but not in breast carcinoma
  JOURNAL   EMBO J. 9 (2), 407-414 (1990)
  MEDLINE   90151615
   PUBMED   2303034
REFERENCE   2  (bases 1 to 1057)
  AUTHORS   Seib,T., Hilgert,K., Seifert,M., Dooley,S. and Welter,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-JAN-1996) Thomas Seib, Inst. fuer Humangenetik,
            Universitaet des Saarlandes, Oskar-Orth-Str., Homburg, 66421,
            Germany
FEATURES             Location/Qualifiers
     source          1..1057
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="21"
                     /map="21q22.3"
     gene            join(U47289.1:1269..1603,U47290.1:1..726,U47291.1:1..1329,
                     1..1057)
                     /gene="SP"
     CDS             join(U47289.1:1337..1415,U47290.1:199..348,
                     U47291.1:947..1093,469..482)
                     /gene="SP"
                     /codon_start=1
                     /product="spasmolytic peptide"
                     /protein_id="AAB05397.1"
                     /db_xref="GI:1477545"
                     /translation="MGRRDAQLLAALLVLGLCALAGSEKPSPCQCSRLSPHNRTNCGF
                     PGITSDQCFDNGCCFDSSVTGVPWCFHPLPKQESDQCVMEVSDRRNCGYPGISPEECA
                     SRKCCFSNFIFEVPWCFFPKSVEDCHY"
     intron          <1..468
                     /gene="SP"
                     /number=3
     exon            469..>1057
                     /gene="SP"
                     /number=4
     3'UTR           483..>1057
                     /gene="SP"
     polyA_signal    634..639
                     /gene="SP"
BASE COUNT      247 a    237 c    256 g    309 t      8 others
ORIGIN      
        1 ggaggtccgc cggcaagagt gaacggtcca cttttcccac ccgcttagtg aatagtgtgt
       61 ccctgaactc ggagtgtgcg aggtaaaaaa aagaccagca agatccaaga aaatgggnaa
      121 agagctactg gcccttgaag gatgcctttt cttttccttt tgttaggata tcaaagcact
      181 ccaaagagcg aaatatttca tgttcaggat tttccgagtg atttttttta ttgtagccta
      241 aaggtccacc tagaaaatgt tcacttgtct ggggagaatg cgccccacag aggaaactct
      301 ggcctggggt gggaagattt ggtcccttta cacccctccc cgggaaagga gctccttctt
      361 cagtaggaag ctcctgggca aagtgatgca cgcccacccc agcttcgcag cctaggcact
      421 cccatttctg gggttccctt accaaccatc ttgcatttaa acttctagac tgccattact
      481 aagagaggct ggttccagag gatgcatctg gctcaccggg tgttccgaaa ccaaagaaga
      541 aacttcgcct tatcagcttc atatttcatg aaatcctggg ttttcttaac catcttttcc
      601 tcattttcaa tggtttaaca tataatttct ttaaataaaa cccttaaaat ctgctaaatt
      661 tctttttggt ttcattaaca cagaggtggt aatggtggtg tgcgtgtgta cacatgtatg
      721 ggtgtgtgta cccatgtatg ggtgtgtgta cccatgtatg ggtgtgtgta tatgtgcgaa
      781 tgtgcatatg tgtgagtgta tatgcagggt ttttttgtgt gtgttttgag acagagtctt
      841 gctctgtcac ccaggctaga gtgcaatggt gcaatcttgg ntcactgcaa cctcctcctc
      901 ccaggttcaa gctatcctcc tgnctcagcc tcccgagtag ntgggattac aggtgnccac
      961 catcacactg gntaattttg gggttttagn agagatgggg tttcacatgt gggncaggct
     1021 ggtctcgact cctgacatca agtggcctcc caagtgg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  

&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC004481. Homo sapiens, ute...[gi:13325339] Links  


LOCUS       BC004481                 481 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, uteroglobin, clone MGC:10583 IMAGE:3688615, mRNA,
            complete cds.
ACCESSION   BC004481
VERSION     BC004481.1  GI:13325339
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 481)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Rachel Dickhoff, Jessica Fahey, Stephanie Ford, Julia
            Greene, Mark Ketteman and Anuradha Madan
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 14 Row: a Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 457934.
FEATURES             Location/Qualifiers
     source          1..481
                     /organism="Homo sapiens"
                     /db_xref="LocusID:7356"
                     /db_xref="taxon:9606"
                     /clone="MGC:10583 IMAGE:3688615"
                     /tissue_type="Pancreas, adenocarcinoma"
                     /clone_lib="NIH_MGC_39"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             55..330
                     /codon_start=1
                     /product="uteroglobin"
                     /protein_id="AAH04481.1"
                     /db_xref="GI:13325340"
                     /translation="MKLAVTLTLVTLALCCSSASAEICPSFQRVIETLLMDTPSSYEA
                     AMELFSPDQDMREAGAQLKKLVDTLPQKPRESIIKLMEKIAQSSLCN"
BASE COUNT      159 a    144 c     94 g     84 t
ORIGIN      
        1 ggcacgaggc agagacggaa ccagagacag gccagagcat ccccctcctc caccatgaaa
       61 ctcgctgtca ccctcaccct ggtcacactg gctctctgct gcagctccgc ttctgcagag
      121 atctgcccga gctttcagcg tgtcatcgaa accctcctca tggacacacc ctccagttat
      181 gaggctgcca tggaactttt cagccctgat caagacatga gggaggcagg ggctcagctg
      241 aagaagctgg tggacaccct cccccaaaag cccagagaaa gcatcattaa gctcatggaa
      301 aaaatagccc aaagctcact gtgtaattag catttagaag ctgaagatcc ccaactgctc
      361 cagcctctgc cgctgccatg ctttgagtcc acgcccacca gccttgctct cttcaataaa
      421 ccacaagcat ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
      481 a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  

&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC012549. Homo sapiens, Sim...[gi:15214823] Links  


LOCUS       BC012549                1053 bp    mRNA    linear   PRI 20-AUG-2001
DEFINITION  Homo sapiens, Similar to LUNX protein; PLUNC (palate lung and nasal
            epithelium clone); tracheal epithelium enriched protein, clone
            MGC:13372 IMAGE:4246419, mRNA, complete cds.
ACCESSION   BC012549
VERSION     BC012549.1  GI:15214823
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1053)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CLONTECH
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Rachel Dickhoff, Jessica Fahey, Stephanie Ford, Julia
            Greene, Mark Ketteman and Anuradha Madan
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 19 Row: h Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 9081878.
FEATURES             Location/Qualifiers
     source          1..1053
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:13372 IMAGE:4246419"
                     /tissue_type="Skeletal Muscle"
                     /clone_lib="NIH_MGC_81"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             75..845
                     /codon_start=1
                     /product="Similar to LUNX protein; PLUNC (palate lung and
                     nasal epithelium clone); tracheal epithelium enriched
                     protein"
                     /protein_id="AAH12549.1"
                     /db_xref="GI:15214824"
                     /translation="MFQTGGLIVFYGLLAQTMAQFGGLPVPLDQTLPLNVNPALPLSP
                     TGLAGSLTNALSNGLLSGGLLGILENLPLLDILKPGGGTSGGLLGGLLGKVTSVIPGL
                     NNIIDIKVTDPQLLELGLVQSPDGHRLYVTIPLGIKLQVNTPLVGASLLRLAVKLDIT
                     AEILAVRDKQERIHLVLGDCTHSPGSLQISLLDGLGPLPIQGLLDSLTGILNKVLPEL
                     VQGNVCPLVNEVLRGLDITLVHDIVNMLIHGLQFVIKV"
BASE COUNT      244 a    289 c    276 g    244 t
ORIGIN      
        1 gggggagtgg gggagagaga ggagaccagg acagctgctg agacctctaa gaagtccaga
       61 tactaagagc aaagatgttt caaactgggg gcctcattgt cttctacggg ctgttagccc
      121 agaccatggc ccagtttgga ggcctgcccg tgcccctgga ccagaccctg cccttgaatg
      181 tgaatccagc cctgcccttg agtcccacag gtcttgcagg aagcttgaca aatgccctca
      241 gcaatggcct gctgtctggg ggcctgttgg gcattctgga aaaccttccg ctcctggaca
      301 tcctgaagcc tggaggaggt acttctggtg gcctccttgg gggactgctt ggaaaagtga
      361 cgtcagtgat tcctggcctg aacaacatca ttgacataaa ggtcactgac ccccagctgc
      421 tggaacttgg ccttgtgcag agccctgatg gccaccgtct ctatgtcacc atccctctcg
      481 gcataaagct ccaagtgaat acgcccctgg tcggtgcaag tctgttgagg ctggctgtga
      541 agctggacat cactgcagaa atcttagctg tgagagataa gcaggagagg atccacctgg
      601 tccttggtga ctgcacccat tcccctggaa gcctgcaaat ttctctgctt gatggacttg
      661 gccccctccc cattcaaggt cttctggaca gcctcacagg gatcttgaat aaagtcctgc
      721 ctgagttggt tcagggcaac gtgtgccctc tggtcaatga ggttctcaga ggcttggaca
      781 tcaccctggt gcatgacatt gttaacatgc tgatccacgg actacagttt gtcatcaagg
      841 tctaagcctt ccaggaaggg gctggcctct gctgagctgc ttcccagtgc tcacagatgg
      901 ctggcccatg tgctggaaga tgacacagtt gccttctctc cgaggaacct gccccctctc
      961 ctttcccacc aggcgtgtgt aacatcccat gtgcctcacc taataaaatg gctcttcttc
     1021 tgcgaaaaaa aaaaaaaaaa aaaaaaaaaa aaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  

&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC010998. Homo sapiens, clo...[gi:15012183] Links  


LOCUS       BC010998                2407 bp    mRNA    linear   PRI 25-JUL-2001
DEFINITION  Homo sapiens, clone MGC:15297 IMAGE:4039973, mRNA, complete cds.
ACCESSION   BC010998
VERSION     BC010998.1  GI:15012183
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2407)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Rachel Dickhoff, Jessica Fahey, Stephanie Ford, Julia
            Greene, Mark Ketteman and Anuradha Madan
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 25 Row: n Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4503720.
FEATURES             Location/Qualifiers
     source          1..2407
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:15297 IMAGE:4039973"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             218..1060
                     /codon_start=1
                     /product="Unknown (protein for MGC:15297)"
                     /protein_id="AAH10998.1"
                     /db_xref="GI:15012184"
                     /translation="MAEKFDCHYCRDPLQGKKYVQKDGHHCCLKCFDKFCANTCVECR
                     KPIGADSKEVHYKNRFWHDTCFRCAKCLHPLANETFVAKDNKILCNKCTTREDSPKCK
                     GCFKAIVAGDQNVEYKGTVWHKDCFTCSNCKQVIGTGSFFPKGEDFYCVTCHETKFAK
                     HCVKCNKAITSGGITYQDQPWHADCFVCVTCSKKLAGQRFTAVEDQYYCVDCYKNFVA
                     KKCAGCKNPITGFGKGSSVVAYEGQSWHDYCFHCKKCSVNLANKRFVFHQEQVYCPDC
                     AKKL"
BASE COUNT      650 a    587 c    545 g    625 t
ORIGIN      
        1 ggcacgaggc ggagggggct cagtccgcag ccgccgccgc caccgccgcg cctcggcctc
       61 ggtgcaggca gcggccgccg ccgccgagac agctgcgcgg gcgagcatcc ccacgcagca
      121 ccttggaagt tgttttcaac catatccagc ctttgccgaa tacatcctat ctgccacaca
      181 tccagcgtga ggtccctcca gctacaaggt gggcaccatg gcggagaagt ttgactgcca
      241 ctactgcagg gatcccttgc aggggaagaa gtatgtgcaa aaggatggcc accactgctg
      301 cctgaaatgc tttgacaagt tctgtgccaa cacctgtgtg gaatgccgca agcccatcgg
      361 tgcggactcc aaggaggtgc actataagaa ccgcttctgg catgacacct gcttccgctg
      421 tgccaagtgc cttcacccct tggccaatga gacctttgtg gccaaggaca acaagatcct
      481 gtgcaacaag tgcaccactc gggaggactc ccccaagtgc aaggggtgct tcaaggccat
      541 tgtggcagga gatcaaaacg tggagtacaa ggggaccgtc tggcacaaag actgcttcac
      601 ctgtagtaac tgcaagcaag tcatcgggac tggaagcttc ttccctaaag gggaggactt
      661 ctactgcgtg acttgccatg agaccaagtt tgccaagcat tgcgtgaagt gcaacaaggc
      721 catcacatct ggaggaatca cttaccagga tcagccctgg catgccgatt gctttgtgtg
      781 tgttacctgc tctaagaagc tggctgggca gcgtttcacc gctgtggagg accagtatta
      841 ctgcgtggat tgctacaaga actttgtggc caagaagtgt gctggatgca agaaccccat
      901 cactgggttt ggtaaaggct ccagtgtggt ggcctatgaa ggacaatcct ggcacgacta
      961 ctgcttccac tgcaaaaaat gctccgtgaa tctggccaac aagcgctttg ttttccacca
     1021 ggagcaagtg tattgtcccg actgtgccaa aaagctgtaa actgacaggg gctcctgtcc
     1081 tgtaaaatgg catttgaatc tcgttctttg tgtccttact ttctgcccta taccatcaat
     1141 aggggaagag tggtccttcc cttctttaaa gttctccttc cgtcttttct cccattttac
     1201 agtattactc aaataagggc acacagtgat catattagca tttagcaaaa agcaaccctg
     1261 cagcaaagtg aatttctgtc cggctgcaat ttaaaaatga aaacttaggt agattgactc
     1321 ttctgcatgt ttctcataga gcagaaaagt gctaatcatt tagccactta gtgatgtaag
     1381 caagaagcat aggagataaa acccccactg agatgcctct catgcctcag ctgggaccca
     1441 ccgtgtagac acacgacatg caagagttgc agcggctgct ccaactcact gctcaccctc
     1501 ttctgtgagc aggaaaagaa ccctactgac atgcatggtt taacttcctc atcagaactc
     1561 tgcccttcct tctgttcttt tgtgctttca aataactaac acgaacttcc agaaaattaa
     1621 catttgaact tagctgtaat tctaaactga cctttccccg tactaacgtt tggtttcccc
     1681 gtgtggcatg ttttctgagc gttcctactt taaagcatgg aacatgcagg tgatttggga
     1741 agtgtagaaa gacctgagaa aacgagcctg tttcagagga acatcgtcac aacgaatact
     1801 tctggaagct taacaaaact aaccctgctg tcctttttat tgtttttaat taatattttt
     1861 gttttaattg atagcaaaat agtttatggg tttggaaact tgcatgaaaa tattttagcc
     1921 ccctcagatg ttcctgcagt gctgaaattc atcctacgga agtaaccgca aaactctaga
     1981 gggggagttg agcaggcgcc agggctgtca tcaacatgga tatgacattt cacaacagtg
     2041 actagttgaa tcccttgtaa cgtagtagtt gtctgctctt tgtccatgtg ttaatgagga
     2101 ctgcaaagtc ccttctgttg tgattcctag gacttttcct caagaggaaa tctggatttc
     2161 cacctaccgc ttacctgaaa tgcaggatca cctacttact gtattctaca ttattatatg
     2221 acatagtata atgagacaat atcaaaagta aacatgtaat gacaatacat actaacattc
     2281 ttgtaggagt ggttagagaa gctgatgcct catttctaca ttctgtcatt agctattatc
     2341 atctaacgtt tcagtgtatc cttacagaaa taaagcagca tatgaaaaaa aaaaaaaaaa
     2401 aaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D88308. Homo sapiens mRNA...[gi:2653564] Links  


LOCUS       D88308                  2362 bp    mRNA    linear   PRI 29-NOV-1997
DEFINITION  Homo sapiens mRNA for very-long-chain acyl-CoA synthetase, complete
            cds.
ACCESSION   D88308
VERSION     D88308.1  GI:2653564
KEYWORDS    very-long-chain acyl-CoA synthetase.
SOURCE      Homo sapiens (strain:caucacian) adult male liver cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Uchiyama,A., Aoyama,T., Kamijo,K., Wakui,K., Fukushima,Y.,
            Shimozawa,N., Suzuki,Y., Kondo,N., Orii,T. and Hashimoto,T.
  TITLE     Molecular cloning of a possible human homolog of the rat
            very-long-chain acyl-CoA synthetase cDNA and its chromosomal
            localization
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2362)
  AUTHORS   Kamijo,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-OCT-1996) Keiju Kamijo, Shinshu University, School of
            Medicine, Department of Biochemistry; 3-1-1 Asahi, Matsumoto 390,
            Japan (E-mail:kkamijo@gipac.shinshu-u.ac.jp, Tel:+81-263-37-2603,
            Fax:+81-263-37-2604)
FEATURES             Location/Qualifiers
     source          1..2362
                     /organism="Homo sapiens"
                     /strain="caucacian"
                     /db_xref="taxon:9606"
                     /sex="male"
                     /tissue_type="liver"
                     /dev_stage="adult"
     gene            1..2362
                     /gene="vlacs"
     CDS             223..2085
                     /gene="vlacs"
                     /codon_start=1
                     /product="very-long-chain acyl-CoA synthetase"
                     /protein_id="BAA23644.1"
                     /db_xref="GI:2653565"
                     /translation="MLSAIYTVLAGLLFLPLLVNLCCPYFFQDIGYFLKVAAVGRRVR
                     SYGQRRPARTILRAFLEKARQTPHKPFLLFRDETLTYAQVDRRSNQVARALHDHLGLR
                     QGDCVALLMGNEPAYVWLWLGLVKLGCAMACLNYNIRAKSLLHCFQCCGAKVLLVSPE
                     LQAAVEEILPSLKKDDVSIYYVSRTSNTDGIDSFLDKVDEVSTEPIPESWRSEVTFST
                     PALYIYTSGTTGLPKAAMITHQRIWYGTGLTFVSGLKADDVIYITLPFYHSAALLIGI
                     HGCIVAGATLALRTKFSASQFWDDCRKYNVTVIQYIGELLRYLCNSPQKPNDRDHKVR
                     LALGNGLRGDVWRQFVKRFGDICIYEFYAATEGNIGFMNYARKVGAVGRVNYLQKKII
                     TYDLIKYDVEKDEPVRDENGYCVRVPKGEVGLLVCKITQLTPFNGYAGAKAQTEKKKL
                     RDVFKKGDLYFNSGDLLMVDHENFIYFHDRVGDTFRWKGENVATTEVADTVGLVDFVQ
                     EVNVYGVHVPDHEGRIGMASIKMKENHEFDGKKLFQHIADYLPSYARPRFLRIQDTIE
                     ITGTFKHRKMTLVEEGFNPAVIKDALYFLDDTAKMYVPMTEDIYNAISAKTLKL"
     misc_feature    898..924
                     /gene="vlacs"
                     /note="ATP-binding domain; putative"
     misc_feature    1540..1713
                     /gene="vlacs"
                     /note="hydrolysis domain; putative"
BASE COUNT      633 a    554 c    589 g    586 t
ORIGIN      
        1 ggaattccaa aaaaaaaaaa tacgactaca cctgctccgg agcccgcggc ggtacctgca
       61 gcggaggagc tctgtcttcc ccttcatctc acgcgagccc ggcgtcccgc cgcgtgcgcc
      121 ccggcgcagc ccgccagtcc gcccggagcc cgcccagtcg ccgcgctgca cgcccggggt
      181 gaaccctctg ccctcgctgg gacagagggc cccgcagccg tcatgctttc cgccatctac
      241 acagtcctgg cgggactgct gttcctgccg ctcctggtga acctctgctg cccatacttc
      301 ttccaggaca taggctactt cttgaaggtg gccgccgtgg gccggagggt gcgcagctac
      361 gggcagcggc ggccggcgcg caccatcctg cgggcgttcc tggagaaagc gcgccagacg
      421 ccacacaagc cttttctgct cttccgcgac gagactctca cctacgcgca ggtggaccgg
      481 cgcagcaatc aagtggcccg ggcgctgcac gaccacctcg gcctgcgcca gggagactgc
      541 gtggcgctcc ttatgggtaa cgagccggcc tacgtgtggc tgtggctggg gctggtgaag
      601 ctgggctgtg ccatggcgtg cctcaattac aacatccgcg cgaagtccct gctgcactgc
      661 ttccagtgct gcggggcgaa ggtgctgctg gtgtcgccag aactacaagc agctgtcgaa
      721 gagatactgc caagccttaa aaaagatgat gtgtccatct attatgtgag cagaacttct
      781 aacacagatg ggattgactc tttcctggac aaagtggatg aagtatcaac tgaacctatc
      841 ccagagtcat ggaggtctga agtcactttt tccactcctg ccttatacat ttatacttct
      901 ggaaccacag gtcttccaaa agcagccatg atcactcatc agcgcatatg gtatggaact
      961 ggcctcactt ttgtaagcgg attgaaggca gatgatgtca tctatatcac tctgcccttt
     1021 taccacagtg ctgcactact gattggcatt cacggatgta ttgtggctgg tgctactctt
     1081 gccttgcgga ctaaattttc agccagccag ttttgggatg actgcagaaa atacaacgtc
     1141 actgtcattc agtatatcgg tgaactgctt cggtatttat gcaactcacc acagaaacca
     1201 aatgaccgtg atcataaagt gagactggca ctgggaaatg gcttacgagg agatgtgtgg
     1261 agacaatttg tcaagagatt tggggacata tgcatctatg agttctatgc tgccactgaa
     1321 ggcaatattg gatttatgaa ttatgcgaga aaagttggtg ctgttggaag agtaaactac
     1381 ctacagaaaa aaatcataac ttatgacctg attaaatatg atgtggagaa agatgaacct
     1441 gtccgagatg aaaatggata ttgcgtcaga gttcccaaag gtgaagttgg acttctggtt
     1501 tgcaaaatca cacaacttac accatttaat ggctatgctg gagcaaaggc tcagacagag
     1561 aagaaaaaac tgagagatgt ctttaagaaa ggagacctct atttcaacag tggagatctc
     1621 ttaatggttg accatgaaaa tttcatctat ttccacgaca gagttggaga tacattccgg
     1681 tggaaagggg aaaatgtggc caccactgaa gttgctgata cagttggact ggttgatttt
     1741 gtccaagaag taaatgttta tggagtgcat gtgccagatc atgagggtcg cattggcatg
     1801 gcctccatca aaatgaaaga aaaccatgaa tttgatggaa agaaactctt tcagcacatt
     1861 gctgattacc tacctagtta tgcaaggccc cggtttctaa gaatacagga caccattgag
     1921 atcactggaa cttttaaaca ccgcaaaatg accctggtgg aggagggctt taaccctgct
     1981 gtcatcaaag atgccttgta tttcttggat gacacagcaa aaatgtatgt gcctatgact
     2041 gaggacatct ataatgccat aagtgctaaa accctgaaac tctgaatatt cccaggagga
     2101 taactcaaca tttccagaaa gaaactgaat ggacagccac ttgatataat ccaactttaa
     2161 tttgattgaa gattgtgagg aaattttgta ggaaatttgc atacccgtaa agggagactt
     2221 ttttaaataa cagttgagtc tttgcaagta aaaagattta gagattatta tttttcagtg
     2281 tgcacctact gtttgtattt gcaaactgag cttgttggag ggaaggcatt attttttaaa
     2341 atacttagta aattaaatga ac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U04270. Human putative po...[gi:487737] Links  


LOCUS       HSU04270                4070 bp    mRNA    linear   PRI 28-FEB-1995
DEFINITION  Human putative potassium channel subunit (h-erg) mRNA, complete
            cds.
ACCESSION   U04270
VERSION     U04270.1  GI:487737
KEYWORDS    .
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 184 to 3663)
  AUTHORS   Warmke,J.W. and Ganetzky,B.
  TITLE     A family of potassium channel genes related to eag in Drosophila
            and mammals
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 91 (8), 3438-3442 (1994)
  MEDLINE   94211879
   PUBMED   8159766
REFERENCE   2  (bases 1 to 4070)
  AUTHORS   Warmke,J.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-DEC-1993) Jeffrey W. Warmke, Genetics and Molecular
            Biology, Merck Research Laboratories, 126 East Lincoln Avenue, P.O.
            Box 2000, Rahway, NJ 07065, USA
FEATURES             Location/Qualifiers
     source          1..4070
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="7"
                     /clone="pBII+HH1, pBII+HH10, pBHH10-4.5"
                     /sex="female"
                     /tissue_type="hippocampus"
                     /clone_lib="Stratagene Number 936205 Human hippocampus
                     cDNA library"
                     /dev_stage="2 year old"
     gene            1..4070
                     /gene="h-erg"
     CDS             184..3663
                     /gene="h-erg"
                     /standard_name="human eag related gene"
                     /codon_start=1
                     /product="putative potassium channel subunit"
                     /protein_id="AAA62473.1"
                     /db_xref="GI:487738"
                     /translation="MPVRRGHVAPQNTFLDTIIRKFEGQSRKFIIANARVENCAVIYC
                     NDGFCELCGYSRAEVMQRPCTCDFLHGPRTQRRAAAQIAQALLGAEERKVEIAFYRKD
                     GSCFLCLVDVVPVKNEDGAVIMFILNFEVVMEKDMVGSPAHDTNHRGPPTSWLAPGRA
                     KTFRLKLPALLALTARESSVRSGGAGGAGAPGAVVVDVDLTPAAPSSESLALDEVTAM
                     DNHVAGLGPAEERRALVGPGSPPRSAPGQLPSPRAHSLNPDASGSSCSLARTRSRESC
                     ASVRRASSADDIEAMRAGVLPPPPRHASTGAMHPLRSGLLNSTSDSDLVRYRTISKIP
                     QITLNFVDLKGDPFLASPTSDREIIAPKIKERTHNVTEKVTQVLSLGADVLPEYKLQA
                     PRIHRWTILHYSPFKAVWDWLILLLVIYTAVFTPYSAAFLLKETEEGPPATECGYACQ
                     PLAVVDLIVDIMFIVDILINFRTTYVNANEEVVSHPGRIAVHYFKGWFLIDMVAAIPF
                     DLLIFGSGSEELIGLLKTARLLRLVRVARKLDRYSEYGAAVLFLLMCTFALIAHWLAC
                     IWYAIGNMEQPHMDSRIGWLHNLGDQIGKPYNSSGLGGPSIKDKYVTALYFTFSSLTS
                     VGFGNVSPNTNSEKIFSICVMLIGSLMYASIFGNVSAIIQRLYSGTARYHTQMLRVRE
                     FIRFHQIPNPLRQRLEEYFQHAWSYTNGIDMNAVLKGFPECLQADICLHLNRSLLQHC
                     KPFRGATKGCLRALAMKFKTTHAPPGDTLVHAGDLLTALYFISRGSIEILRGDVVVAI
                     LGKNDIFGEPLNLYARPGKSNGDVRALTYCDLHKIHRDDLLEVLDMYPEFSDHFWSSL
                     EITFNLRDTNMIPGSPGSTELEGGFSRQRKRKLSFRRRTDKDTEQPGEVSALGPGRAG
                     AGPSSRGRPGGPWGESPSSGPSSPESSEDEGPGRSSSPLRLVPFSSPRPPGEPPGGEP
                     LMEDCEKSSDTCNPLSGAFSGVSNIFSFWGDSRGRQYQELPRCPAPTPSLLNIPLSSP
                     GRRPRGDVESRLDALQRQLNRLETRLSADMATVLQLLQRQMTLVPPAYSAVTTPGPGP
                     TSTSPLLPVSPLPTLTLDSLSQVSQFMACEELPPGAPELPQEGPTRRLSLPGQLGALT
                     SQPLHRHGSDPGS"
BASE COUNT      713 a   1413 c   1255 g    689 t
ORIGIN      
        1 acgcggcctg ctcaggcctc cagcggccgg tcggagggga ggcgggaggc gagcgaggac
       61 ccgcgcccgc agtccagtct gtgcgcgccc gtgctcgctt ggcgcggtgc gggaccagcg
      121 ccggccaccc gaagcctagt gcgtcgccgg gtgggtgggc ccgcccggcg ccatgggctc
      181 aggatgccgg tgcggagggg ccacgtcgcg ccgcagaaca ccttcctgga caccatcatc
      241 cgcaagtttg agggccagag ccgtaagttc atcatcgcca acgctcgggt ggagaactgc
      301 gccgtcatct actgcaacga cggcttctgc gagctgtgcg gctactcgcg ggccgaggtg
      361 atgcagcgac cctgcacctg cgacttcctg cacgggccgc gcacgcagcg ccgcgctgcc
      421 gcgcagatcg cgcaggcact gctgggcgcc gaggagcgca aagtggaaat cgccttctac
      481 cggaaagatg ggagctgctt cctatgtctg gtggatgtgg tgcccgtgaa gaacgaggat
      541 ggggctgtca tcatgttcat cctcaatttc gaggtggtga tggagaagga catggtgggg
      601 tccccggctc atgacaccaa ccaccggggc ccccccacca gctggctggc cccaggccgc
      661 gccaagacct tccgcctgaa gctgcccgcg ctgctggcgc tgacggcccg ggagtcgtcg
      721 gtgcggtcgg gcggcgcggg cggcgcgggc gccccggggg ccgtggtggt ggacgtggac
      781 ctgacgcccg cggcacccag cagcgagtcg ctggccctgg acgaagtgac agccatggac
      841 aaccacgtgg cagggctcgg gcccgcggag gagcggcgtg cgctggtggg tcccggctct
      901 ccgccccgca gcgcgcccgg ccagctccca tcgccccggg cgcacagcct caaccccgac
      961 gcctcgggct ccagctgcag cctggcccgg acgcgctccc gagaaagctg cgccagcgtg
     1021 cgccgcgcct cgtcggccga cgacatcgag gccatgcgcg ccggggtgct gcccccgcca
     1081 ccgcgccacg ccagcaccgg ggccatgcac ccactgcgca gcggcttgct caactccacc
     1141 tcggactccg acctcgtgcg ctaccgcacc attagcaaga ttccccaaat caccctcaac
     1201 tttgtggacc tcaagggcga ccccttcttg gcttcgccca ccagtgaccg tgagatcata
     1261 gcacctaaga taaaggagcg aacccacaat gtcactgaga aggtcaccca ggtcctgtcc
     1321 ctgggcgccg acgtgctgcc tgagtacaag ctgcaggcac cgcgcatcca ccgctggacc
     1381 atcctgcatt acagcccctt caaggccgtg tgggactggc tcatcctgct gctggtcatc
     1441 tacacggctg tcttcacacc ctactcggct gccttcctgc tgaaggagac ggaagaaggc
     1501 ccgcctgcta ccgagtgtgg ctacgcctgc cagccgctgg ctgtggtgga cctcatcgtg
     1561 gacatcatgt tcattgtgga catcctcatc aacttccgca ccacctacgt caatgccaac
     1621 gaggaggtgg tcagccaccc cggccgcatc gccgtccact acttcaaggg ctggttcctc
     1681 atcgacatgg tggccgccat ccccttcgac ctgctcatct tcggctctgg ctctgaggag
     1741 ctgatcgggc tgctgaagac tgcgcggctg ctgcggctgg tgcgcgtggc gcggaagctg
     1801 gatcgctact cagagtacgg cgcggccgtg ctgttcttgc tcatgtgcac ctttgcgctc
     1861 atcgcgcact ggctagcctg catctggtac gccatcggca acatggagca gccacacatg
     1921 gactcacgca tcggctggct gcacaacctg ggcgaccaga taggcaaacc ctacaacagc
     1981 agcggcctgg gcggcccctc catcaaggac aagtatgtga cggcgctcta cttcaccttc
     2041 agcagcctca ccagtgtggg cttcggcaac gtctctccca acaccaactc agagaagatc
     2101 ttctccatct gcgtcatgct cattggctcc ctcatgtatg ctagcatctt cggcaacgtg
     2161 tcggccatca tccagcggct gtactcgggc acagcccgct accacacaca gatgctgcgg
     2221 gtgcgggagt tcatccgctt ccaccagatc cccaatcccc tgcgccagcg cctcgaggag
     2281 tacttccagc acgcctggtc ctacaccaac ggcatcgaca tgaacgcggt gctgaagggc
     2341 ttccctgagt gcctgcaggc tgacatctgc ctgcacctga accgctcact gctgcagcac
     2401 tgcaaaccct tccgaggggc caccaagggc tgccttcggg ccctggccat gaagttcaag
     2461 accacacatg caccgccagg ggacacactg gtgcatgctg gggacctgct caccgccctg
     2521 tacttcatct cccggggctc catcgagatc ctgcggggcg acgtcgtcgt ggccatcctg
     2581 gggaagaatg acatctttgg ggagcctctg aacctgtatg caaggcctgg caagtcgaac
     2641 ggggatgtgc gggccctcac ctactgtgac ctacacaaga tccatcggga cgacctgctg
     2701 gaggtgctgg acatgtaccc tgagttctcc gaccacttct ggtccagcct ggagatcacc
     2761 ttcaacctgc gagataccaa catgatcccg ggctcccccg gcagtacgga gttagagggt
     2821 ggcttcagtc ggcaacgcaa gcgcaagttg tccttccgca ggcgcacgga caaggacacg
     2881 gagcagccag gggaggtgtc ggccttgggg ccgggccggg cgggggcagg gccgagtagc
     2941 cggggccggc cgggggggcc gtggggggag agcccgtcca gtggcccctc cagccctgag
     3001 agcagtgagg atgagggccc aggccgcagc tccagccccc tccgcctggt gcccttctcc
     3061 agccccaggc cccccggaga gccgccgggt ggggagcccc tgatggagga ctgcgagaag
     3121 agcagcgaca cttgcaaccc cctgtcaggc gccttctcag gagtgtccaa cattttcagc
     3181 ttctgggggg acagtcgggg ccgccagtac caggagctcc ctcgatgccc cgcccccacc
     3241 cccagcctcc tcaacatccc cctctccagc ccgggtcggc ggccccgggg cgacgtggag
     3301 agcaggctgg atgccctcca gcgccagctc aacaggctgg agacccggct gagtgcagac
     3361 atggccactg tcctgcagct gctacagagg cagatgacgc tggtcccgcc cgcctacagt
     3421 gctgtgacca ccccggggcc tggccccact tccacatccc cgctgttgcc cgtcagcccc
     3481 ctccccaccc tcaccttgga ctcgctttct caggtttccc agttcatggc gtgtgaggag
     3541 ctgcccccgg gggccccaga gcttccccaa gaaggcccca cacgacgcct ctccctaccg
     3601 ggccagctgg gggccctcac ctcccagccc ctgcacagac acggctcgga cccgggcagt
     3661 tagtggggct gcccagtgtg gacacgtggc tcacccaggg atcaaggcgc tgctgggccg
     3721 ctccccttgg aggccctgct caggaggccc tgaccgtgga aggggagagg aactcgaaag
     3781 cacagctcct cccccagccc ttgggaccat cttctcctgc agtcccctgg gccccagtga
     3841 gaggggcagg ggcagggccg gcagtaggtg gggcctgtgg tccccccact gccctgaggg
     3901 cattagctgg tctaactgcc cggaggcacc cggccctggg ccttaggcac ctcaaggact
     3961 tttctgctat ttactgctct tattgttaag gataataatt aaggatcata tgaataatta
     4021 atgaagatgc tgatgactat gaataataaa taattatcct gaggagaaaa 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U50929. Human betaine:hom...[gi:1522682] Links  


LOCUS       HSU50929                2436 bp    mRNA    linear   PRI 05-SEP-1996
DEFINITION  Human betaine:homocysteine methyltransferase mRNA, complete cds.
ACCESSION   U50929
VERSION     U50929.1  GI:1522682
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2436)
  AUTHORS   Garrow,T.A.
  TITLE     Purification, kinetic properties, and cDNA cloning of mammalian
            betaine-homocysteine methyltransferase
  JOURNAL   J. Biol. Chem. 271 (37), 22831-22838 (1996)
  MEDLINE   96394355
   PUBMED   8798461
REFERENCE   2  (bases 1 to 2436)
  AUTHORS   Garrow,T.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-MAR-1996) Timothy A. Garrow, Food Science and Human
            Nutrition, University of Illinois, Urbana-Champaign, 905 South
            Goodwin Avenue, Urbana, IL 61801, USA
FEATURES             Location/Qualifiers
     source          1..2436
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="liver"
     CDS             27..1247
                     /EC_number="2.1.1.5"
                     /note="betaine-dependent methylation of homocysteine;
                     methyltransferase"
                     /codon_start=1
                     /product="betaine:homocysteine methyltransferase"
                     /protein_id="AAC50668.1"
                     /db_xref="GI:1522683"
                     /translation="MPPVGGKKAKKGILERLNAGEIVIGDGGFVFALEKRGYVKAGPW
                     TPEAAVEHPEAVRQLHREFLRAGSNVMQTFTFYASEDKLENRGNYVLEKISGQEVNEA
                     ACDIARQVADEGDALVAGGVSQTPSYLSCKSETEVKKVFLQQLEVFMKKNVDFLIAEY
                     FEHVEEAVWAVETLIASGKPVAATMCIGPEGDLHGVPPGECAVRLVKAGASIIGVNCH
                     FDPTISLKTVKLMKEGLEAAQLKAHLMSQPLAYHTPDCNKQGFIDLPEFPFGLEPRVA
                     TRWDIQKYAREAYNLGVRYIGGCCGFEPYHIRAIAEELAPERGFLPPASEKHGSWGSG
                     LDMHTKPWVRARARKEYWENLRIASGRPYNPSMSKPDGWGVTKGTAELMQQKEATTEQ
                     QLKELFEKQKFKSQ"
BASE COUNT      772 a    498 c    568 g    598 t
ORIGIN      
        1 cgaccacctg tctggacacc acaaagatgc cacccgttgg gggcaaaaag gccaagaagg
       61 gcatcctaga acgtttaaat gctggagaga ttgtgattgg agatggaggg tttgtctttg
      121 cactggagaa gaggggctac gtaaaggcag gaccctggac tcctgaagct gctgtggagc
      181 acccagaagc agttcgccag cttcatcgag agttcctcag agctggctca aacgtcatgc
      241 agaccttcac cttctatgcg agtgaagaca agctggagaa caggggcaac tatgtcttag
      301 agaagatatc tgggcaggaa gtcaatgaag ctgcttgcga catcgcccga caagtggctg
      361 atgaaggaga tgctttggta gcaggaggag tgagtcagac accttcatac cttagctgca
      421 agagtgaaac tgaagtcaaa aaagtatttc tgcaacagtt agaggtcttt atgaagaaga
      481 acgtggactt cttgattgca gagtattttg aacacgttga agaagctgtg tgggcagttg
      541 aaaccttgat agcatccggt aaacctgtgg cagcaaccat gtgcattggc ccagaaggag
      601 atttgcatgg cgtgcccccc ggcgagtgtg cagtgcgcct ggtgaaagca ggagcatcca
      661 tcattggtgt gaactgccac tttgacccca ccattagttt aaaaacagtg aagctcatga
      721 aggagggctt ggaggctgcc caactgaaag ctcacctgat gagccagccc ttggcttacc
      781 acactcctga ctgcaacaag cagggattca tcgatctccc agaattccca tttggactgg
      841 aacccagagt tgccaccaga tgggatattc aaaaatacgc cagagaggcc tacaacctgg
      901 gggtcaggta cattggcggg tgctgtggat ttgagcccta ccacatcagg gcaattgcag
      961 aggagctggc cccagaaagg ggctttttgc caccagcttc agaaaaacat ggcagctggg
     1021 gaagtggttt ggacatgcac accaaaccct gggttagagc aagggccagg aaggaatact
     1081 gggagaatct tcggatagcc tcaggccggc catacaaccc ttcaatgtca aagccagatg
     1141 gctggggagt gaccaaagga acagccgagc tgatgcagca gaaagaagcc acaactgagc
     1201 agcagctgaa agagctcttt gaaaaacaaa aattcaaatc acagtagcct cgatagaagc
     1261 tatttttgat gaatttctag gtgtttgggt cacagttcct acaaatacgg aaaagggggt
     1321 taaaaagcag tgctttcatg aatgccatcc tacacatatt attgctatta cctgaacaaa
     1381 atagaattac aaatagcact tgataatttt aaagtatgtt ttagaaattt tcttaggagc
     1441 aaaataagta caaagtaaat cttgaacagg ttcactaagc acccaccctg tgaaaagtat
     1501 tatggaaatc actgcagcac aggaaaagta attcagatgt taatgccact tgaagaagtt
     1561 ggtaggctag caaagaggat gagacatgaa ctgtcataaa ggactcagca accagccagg
     1621 gacagataaa gcgctatgga aaggggcttc caagttcttt tgaacatgac ccttagtaac
     1681 aaacacaatt tatataatga cccagcaaaa cacatcacat cttactgtcg aaattaaatg
     1741 tgtgatccat cctagtattt tctgttccat tccttttcat tctatttcat ttataaaaca
     1801 tgctagttga gacttttcaa atggattttt atgacccact actgggtttg gatccacagt
     1861 ttgaaaaata ttgctacaag acacttaagg agaccatcct gtttaagttt attcttataa
     1921 gtaggtcagt catatgagac ctgatcaata aatatccaat acccagagtc ctgctctcag
     1981 agttcttctg tttcgtgacc cacttttcta ccagtaaaag acatagacca atggggagga
     2041 ggggaggaga gatggatatt tcagccctct ccatcctagt caacactgga tccacctagt
     2101 gcctctgggc cataaggctg agcagagtga gcttgtatta gttggtagct tttaaaaaat
     2161 ataataaaaa aaaagtagag attctccaaa ctctagcctg gtttcctaga ttgagaacta
     2221 tgatattttt ctctgataat ttaatatcta ctctcctaca aaagctcaag cctgaagata
     2281 caagactatt agaagaaaca tgactaccct cagtgtatta gaaaagaggt catgcagctt
     2341 tctaaacatt attgaattgt ttgagctgtt ttgaaattgt aattcttttc agctattaaa
     2401 aagaagagca atgagaaaaa aaaaaaaaaa aaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  

&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X00371. Human myoglobin g...[gi:34607] Links  


LOCUS       HSMG01                  3768 bp    DNA     linear   PRI 18-MAR-1996
DEFINITION  Human myoglobin gene (exon 1) (and joined CDS).
ACCESSION   X00371
VERSION     X00371.1  GI:34607
KEYWORDS    direct repeat; myoglobin; tandem repeat.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3768)
  AUTHORS   Weller,P., Jeffreys,A.J., Wilson,V. and Blanchetot,A.
  TITLE     Organization of the human myoglobin gene
  JOURNAL   EMBO J. 3 (2), 439-446 (1984)
  MEDLINE   84182508
COMMENT     Data kindly reviewed (09-MAY-1985) by P. Weller.
FEATURES             Location/Qualifiers
     source          1..3768
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     repeat_region   831..1482
                     /rpt_type=TANDEM
     repeat_region   868..983
                     /rpt_type=DIRECT
     repeat_region   996..1112
                     /rpt_type=DIRECT
     repeat_region   1207..1264
                     /rpt_type=DIRECT
     repeat_region   1323..1380
                     /rpt_type=DIRECT
     TATA_signal     2548..2553
     mRNA            join(2581..2745,X00372.1:1264..1486,X00373.1:101..778)
                     /label=myo_mrna
     exon            2581..2745
                     /label=ex1
     misc_RNA        2581
                     /note="cap-site"
     CDS             join(2651..2745,X00372.1:1264..1486,X00373.1:101..247)
                     /codon_start=1
                     /label=myo_cds
                     /product="myoglobin"
                     /protein_id="CAA25109.1"
                     /db_xref="GI:1235527"
                     /db_xref="SWISS-PROT:P02144"
                     /translation="MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKF
                     DKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPV
                     KYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNYKELGFQG"
     misc_feature    2651..2745
                     /label=start
BASE COUNT     1030 a    730 c   1089 g    919 t
ORIGIN      
        1 cctctgaccc ttttggtcgc taggagtcag ccgactcagt acacaggact cactgaatgg
       61 agacacaagg ctcctccagg gagtggcggc tcatggcaat cctagaatgg tcaccagcca
      121 ggctttagag acccacacag agggcgttct gacccaaagt tgcactgggg aactccaagt
      181 ttggggattc tttgaattta actctttttc tagctacatt tcctattatt tgtccaattc
      241 ttaccaaaca tctctgttca cattctgaag ctgggatctg actggcagag ctagtagatg
      301 ctgactattc agatggagcc ctgacattgg ctttctcagc ttggctgtga ctggcagcag
      361 gtttgcggga gaactgtgtg tcccagaaca tgactggcta cacctgcacc tcagcaagat
      421 tggggcaggg cagttatctt caaaaagctg tgtaggtggg gcagtcatta ctgacaaatc
      481 cagtgcagac ccaggatggc ccaaacactg gcttatcctt tctgaatctc atctcccaca
      541 gctgtaaagc ggggtggtgc tcgctacctc acagaggtgt tgtaaagatt agatgtaatc
      601 ttgccaagca gccactttgt aaactgtata gtcttatgca gatggaagga agggcctgtg
      661 cctaccttga tcatagcact aaacaaactg tactgtattt tcattcctct tagttatctc
      721 cctaaaaaga ctctgagttc cttgaacaca ggaaggtgtt ttatttgatt ttgttatcct
      781 cagcatgtag cagtgtctga cacacagtag gtgctctatc actgtgagag ggatggatgg
      841 atgggtggag ttacagatgg atagaaggat agatggaggg atgggtggat gatggatgga
      901 tagatggatg gaggggggat gatgaatgga gggataatga gtggatgaat gagggaatgg
      961 gtggatggat ggatggaggg atggaggaac agatagatag atggagggat gggtgggtga
     1021 tggatggata gatggatgga gggagggatg atgaatggag ggataatgaa tggatgaatg
     1081 aggggatggg tggatggatg aatggaggga tgatgggtgg atgaatgaat tgagggatgg
     1141 atggatgaac acatggatgg atggatagat ggatagatgg aggaactggt ggattttgga
     1201 tggatgggtg gatggataga tgaatgaatg cctggataga caaagagatg atggatagat
     1261 gaatagatga attaagggat gtcggataga tggagggatt gatagatgtt ggatggatgg
     1321 gtggtggatg gatagatgag tgaatgcatg gatagacaaa gagatgatgg atggatgaat
     1381 taagggatga cagatggatg gatggatgag taactggatg gacaagtgga taaatggata
     1441 gatggttgaa tacctgaatg gattgaagga ggatgcatgg atgtaagata aggctaatca
     1501 tcctccactc tctttctttg caaaaccatc cacccattta ctcaataaac atttattcag
     1561 ttcaaacttg gcacaaagca ccatgtgagg cccaagagat acgtgggtta ataaaacaga
     1621 gctcctgccc tcctgaaaac tgcaaagaaa ggggcgtggc ttcctgagtt caaatcccaa
     1681 ctctgccagc gactagctgt acatcagtga tgtttcccta ctttctctca attaaatagg
     1741 gataatgtca gtacctatca cattgggagg tcttgcgggg attaaatgag ttaccaaatg
     1801 ccaagtgttt gggacagggc ctggcaccca gcaaagtctc ttgtgagtgc tggctgctat
     1861 tatcctaatg gagaagatgg catgaaaacc aggaaatagg atgccctttg ggaagcaatg
     1921 caacaggaac ttacacaaag aaaggaaagg aggaagcaat tagtggtgtc tcaaaggagt
     1981 atgtcaagaa aaacttttca gagggaaacc tttgagcagg gccatgaaaa caggagttct
     2041 ctaagagatt gtggacttgc ctgggaccac ctggctataa gcacaaaacc atccggttcc
     2101 tttctgtcac ttctggcggg tgaggggtct ctggcaaagg ggcagaaggt gcgtgagagg
     2161 ttgcgaatgg caggactgtc ctggccagcc ggggcacctg gtggccaagc ttagaaacat
     2221 gacaggtcct cttgggaggg ctgaccgcag ggagcgttgg gtttcaggct gctggcgtcg
     2281 gcttctgtgg tgccctttct gtcggctatg agagtccaga cagtgcccaa cctcctcccc
     2341 ttctttccac acgcacaacc accccacccc ctgtggcctg agctgtcctg cctcgccaca
     2401 atggcacctg ccctaaaata gcttcccatg tgagggctag agaaaggaaa agattagacc
     2461 ctccctggat gagagagaga aagtgaagga gggcagggga gggggacagc gagccattga
     2521 gcgatctttg tcaagcatcc cagaaggtat aaaaacgccc ttgggaccag gcagcctcaa
     2581 accccagctg ttggggccag gacacccagt gagcccatac ttgctctttt tgtcttcttc
     2641 agactgcgcc atggggctca gcgacgggga atggcagttg gtgctgaacg tctgggggaa
     2701 ggtggaggct gacatcccag gccatgggca ggaagtcctc atcaggtaaa aggaagagat
     2761 tccattgccc ctgccaccca caccctaaga tcaagggtgt tcagctgcaa ggtggaaagt
     2821 ttgcacgtgg ggtaggtcag ttggctgcat tagttaaggg tgttagaacg gtcacttgct
     2881 ttttctttgc ttttaagtgt cagggattgg actcaggaga gggaaaggag ccatttcagg
     2941 ctgatatcag cagctggagg aagcatgaga atcaaaccta ggatgctcag agtccaccag
     3001 gaagaatttt agaattatag acagtcagag ttaacaaggg tcctgagaga ttttgtacag
     3061 ccacctctct tacaggatga ggacaaaaag cgactgagaa ggggaggaca tttccagagt
     3121 cacagctcat taaatgctct taaagtgtca aggttaagac atgctcttca aggggagaca
     3181 gatctggttc tagacttggc tctgccactg agccactggg tgacctttgg gaaggtactc
     3241 aacctctcgg agcctcaatt tcctctcctg tacagtgagg ggatatccta atatctatat
     3301 cctagaggag atgtgagaat taaataaaat aatgcatgca agaggcctgg catggttcct
     3361 ggcatatact gagtcctaga aatgttagta gctattactg atgaagccca ggctagggac
     3421 ctttcaaagc attgcaatta gagaacagaa gatagaggct cattagtgac cttcgatgtt
     3481 gagtatgtct ctagtttgag aggtctgaat gatgtggtct gcaagtatat cctgccttct
     3541 accacaaggg attccagaat acaccaaaga aaacaaaatt ctgaggtttg taaatagagg
     3601 gtggctgtgg tttgtacata gaagctcatc tcctcgttgc cttctatccc aaaggtgata
     3661 cactcttctc ttggcccctt ccctcaccat tctgagctgg ttccctcaga agtctaatag
     3721 gttaagaatc aacgtttctg ccaacgggag gaaggaagtg ggcgccgg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


   


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  

&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U68233. Human farnesol re...[gi:1546083] Links  


LOCUS       HSU68233                2218 bp    mRNA    linear   PRI 15-SEP-1996
DEFINITION  Human farnesol receptor HRR-1 (HRR-1) mRNA, complete cds.
ACCESSION   U68233
VERSION     U68233.1  GI:1546083
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2218)
  AUTHORS   Papetti,M., Wood,N., Lohmar,P.D. and Bowman,M.R.
  TITLE     The Identification of the cDNA Coding for HRR-1, a Novel Human
            Farnesol Receptor
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2218)
  AUTHORS   Papetti,M., Wood,N., Lohmar,P.D. and Bowman,M.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-AUG-1996) Immunology and Hematopoiesis, Genetics
            Institute, 87 CambridgePark Drive, Cambridge, MA 02140, USA
FEATURES             Location/Qualifiers
     source          1..2218
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..2218
                     /gene="HRR-1"
     CDS             354..1772
                     /gene="HRR-1"
                     /note="FXR; retinoid receptor"
                     /codon_start=1
                     /product="farnesol receptor HRR-1"
                     /protein_id="AAB08107.1"
                     /db_xref="GI:1546084"
                     /translation="MGSKMNLIEHSHLPTTDEFSFSENLFGVLTEQVAGPLGQNLEVE
                     PYSQYSNVQFPQVQPQISSSSYYSNLGFYPQQPEEWYSPGIYELRRMPAETLYQGETE
                     VAEMPVTKKPRMGASAGRIKGDELCVVCGDRASGYHYNALTCEGCKGFFRRSITKNAV
                     YKCKNGGNCVMDMYMRRKCQECRLRKCKEMGMLAECLLTEIQCKSKRLRKNVKQHADQ
                     TVNEDSEGRDLRQVTSTTKSCREKTELTPDQQTLLHFIMDSYNKQRMPQEITNKILKE
                     EFSAEENFLILTEMATNHVQVLVEFTKKLPGFQTLDHEDQIALLKGSAVEAMFLRSAE
                     IFNKKLPSGHSDLLEERIRNSGISDEYITPMFSFYKSIGELKMTQEEYALLTAIVILS
                     PDRQYIKDREAVEKLQEPLLDVLQKLCKIHQPENPQHFACLLGRLTELRTFNHHHAEM
                     LMSWRVNDHKFTPLLCEIWDVQ"
     polyA_signal    2124..2129
                     /gene="HRR-1"
BASE COUNT      741 a    423 c    458 g    596 t
ORIGIN      
        1 acgagactct ctcctcctcc tcacctcatt gtctccccga cttatcctaa tgcgaaattg
       61 gattctgagc atttgtagca aaatcgctgg gatctggaga ggaagactca gtccagaatc
      121 ctcccagggc cttgaaagtc catctctgac ccaaaacaat ccaaggaggt agaagacatc
      181 gtagaaggag tgaaagaaga aaagaagact tagaaacata gctcaaagtg aacactgctt
      241 ctcttagttt cctggatttc ttctggacat ttcctcaaga tgaaacttca gacactttgg
      301 agtttttttt gaagaccacc ataaagaaag tgcatttcaa ttgaaaaatt tggatgggat
      361 caaaaatgaa tctcattgaa cattcccatt tacctaccac agatgaattt tctttttctg
      421 aaaatttatt tggtgtttta acagaacaag tggcaggtcc tctgggacag aacctggaag
      481 tggaaccata ctcgcaatac agcaatgttc agtttcccca agttcaacca cagatttcct
      541 cgtcatccta ttattccaac ctgggtttct acccccagca gcctgaagag tggtactctc
      601 ctggaatata tgaactcagg cgtatgccag ctgagactct ctaccaggga gaaactgagg
      661 tagcagagat gcctgtaaca aagaagcccc gcatgggcgc gtcagcaggg aggatcaaag
      721 gggatgagct gtgtgttgtt tgtggagaca gagcctctgg ataccactat aatgcactga
      781 cctgtgaggg gtgtaaaggt ttcttcagga gaagcattac caaaaacgct gtgtacaagt
      841 gtaaaaacgg gggcaactgt gtgatggata tgtacatgcg aagaaagtgt caagagtgtc
      901 gactaaggaa atgcaaagag atgggaatgt tggctgaatg cttgttaact gaaattcagt
      961 gtaaatctaa gcgactgaga aaaaatgtga agcagcatgc agatcagacc gtgaatgaag
     1021 acagtgaagg tcgtgacttg cgacaagtga cctcgacaac aaagtcatgc agggagaaaa
     1081 ctgaactcac cccagatcaa cagactcttc tacattttat tatggattca tataacaaac
     1141 agaggatgcc tcaggaaata acaaataaaa ttttaaaaga agaattcagt gcagaagaaa
     1201 attttctcat tttgacggaa atggcaacca atcatgtaca ggttcttgta gaattcacaa
     1261 aaaagctacc aggatttcag actttggacc atgaagacca gattgctttg ctgaaagggt
     1321 ctgcggttga agctatgttc cttcgttcag ctgagatttt caataagaaa cttccgtctg
     1381 ggcattctga cctattggaa gaaagaattc gaaatagtgg tatctctgat gaatatataa
     1441 cacctatgtt tagtttttat aaaagtattg gggaactgaa aatgactcaa gaggagtatg
     1501 ctctgcttac agcaattgtt atcctgtctc cagatagaca atacataaag gatagagagg
     1561 cagtagagaa gcttcaggag ccacttcttg atgtgctaca aaagttgtgt aagattcacc
     1621 agcctgaaaa tcctcaacac tttgcctgtc tcctgggtcg cctgactgaa ttacggacat
     1681 tcaatcatca ccacgctgag atgctgatgt catggagagt aaacgaccac aagtttaccc
     1741 cacttctctg tgaaatctgg gacgtgcagt gatggggatt acaggggagg ggtctagctc
     1801 ctttttctct ctcatattaa tctgatgtat aactttcctt tatttcactt gtacccagtt
     1861 tcactcaaga aatcttgatg aatatttatg ttgtaattac atgtgtaact tccacaactg
     1921 taaatattgg gctagataga acaactttct ctacattgtg ttttaaaagg ctccagggaa
     1981 tcctgcattc taattggcaa gccctgtttg cctaattaaa ttgattgtta cttcaattct
     2041 atctgttgaa ctagggaaaa tctcattttg ctcatcttac catattgcat atattttatt
     2101 aaagagttgt attcaatctt ggcaataaag caaacataat ggcaacagaa aaaaaaaaaa
     2161 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  

&&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X51957. H.sapiens mRNA fo...[gi:34788] Links  


LOCUS       HSMSER                  1398 bp    mRNA    linear   PRI 02-NOV-1992
DEFINITION  H.sapiens mRNA for muscle specific enolase (MSE) (EC 4.2.1.11).
ACCESSION   X51957
VERSION     X51957.1  GI:34788
KEYWORDS    beta-enolase gene; enolase; muscle specific enolase.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1398)
  AUTHORS   Cali,L., Feo,S., Oliva,D. and Giallongo,A.
  TITLE     Nucleotide sequence of a cDNA encoding the human muscle-specific
            enolase (MSE)
  JOURNAL   Nucleic Acids Res. 18 (7), 1893 (1990)
  MEDLINE   90245587
REFERENCE   2
  AUTHORS   Feo,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-1990) Feo S., Istituto di Biologia dello
            Sviluppo, del consiglio naziolae delle ricerche, via archirafi 20,
            90123 Palermo, Italy
  REMARK    revised by [3]
REFERENCE   3  (bases 1 to 1398)
  AUTHORS   Feo,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-OCT-1990)
COMMENT     For conflicting sequence see .
FEATURES             Location/Qualifiers
     source          1..1398
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="17pter-p11"
                     /clone="B700/B800"
                     /tissue_type="skeletal muscle"
                     /clone_lib="lambda gt10"
                     /dev_stage="adult"
     CDS             13..1317
                     /codon_start=1
                     /product="muscle-specific enolase"
                     /protein_id="CAA36216.1"
                     /db_xref="GI:34789"
                     /db_xref="SWISS-PROT:P13929"
                     /translation="MAMQKIFAREILDSRGNPTVEVDLHTAKGRFRAAVPSGASTGIY
                     EALELRDGDKGRYLGKGVLKAVENINNTLGPALLQKKLSVADQEKVDKFMIELDGTEN
                     KSKFGANAILGVSLAVCKAGAAEKGVPLYRHIADLAGNPDLILPVPAFNVINGGSHAG
                     NKLAMQEFMILPVGASSFKEAMRIGAEVYHHLKGVIKAKYGKDATNVGDEGGFAPNIL
                     ENNEALELLKTAIQAAGYPDKVVIGMDVAASEFYRNGKYDLDFKSPDDPARHITGEKL
                     GELYKSFIKNYPVVSIEDPFDQDDWATWTSFLSGVNIQIVGDDLTVTNPKRIAQAVEK
                     KACNCLLLKVNQIGSVTESIQACKLAQSNGWGVMVSHRSGETEDTFIADLVVGLCTGQ
                     IKTGAPCRSERLAKYNQLMRIEEALGDKAIFAGRKFRNPKAK"
     polyA_signal    1378..1383
     polyA_site      1398
BASE COUNT      340 a    367 c    420 g    271 t
ORIGIN      
        1 gtagtaaagg ccatggccat gcagaaaatc tttgcccggg aaatcttgga ctccaggggc
       61 aaccccacgg tggaggtgga cctgcacacg gccaagggcc gattccgagc agctgtgccc
      121 agtggggctt ccacgggtat ctatgaggct ctggaactaa gagacggaga caaaggccgc
      181 tacctgggga aaggagtcct gaaggctgtg gagaacatca acaatactct gggccctgct
      241 ctgctgcaaa agaaactaag cgttgcggat caagaaaaag ttgacaaatt tatgattgag
      301 ctagatggga ccgagaataa gtccaagttt ggggccaatg ccatcctggg cgtgtccttg
      361 gccgtgtgta aggcgggagc agctgagaag ggggtccccc tgtaccgcca catcgcagat
      421 ctcgctggga accctgacct catactccca gtgccagcct tcaatgtgat caacgggggc
      481 tcccatgctg gaaacaagct ggccatgcag gagttcatga ttctgcctgt gggagccagc
      541 tccttcaagg aagccatgcg cattggcgcc gaggtctacc accacctcaa gggggtcatc
      601 aaggccaagt atgggaagga tgccaccaat gtgggtgatg aaggtggctt cgcacccaac
      661 atcctggaga acaatgaggc cctggagctg ctgaagacgg ccatccaggc ggctggttac
      721 ccagacaagg tggtgatcgg catggatgtg gcagcatctg agttctatcg caatgggaag
      781 tacgatcttg acttcaagtc gcctgatgat cccgcacggc acatcactgg ggagaagctc
      841 ggagagctgt ataagagctt tatcaagaac tatcctgtgg tctccatcga agaccccttt
      901 gaccaggatg actgggccac ttggacctcc ttcctctcgg gggtgaacat ccagattgtg
      961 ggggatgact tgacagtcac caaccccaag aggattgccc aggccgttga gaagaaggcc
     1021 tgcaactgtc tgctgctgaa ggtcaaccag atcggctcgg tgaccgaatc gatccaggcg
     1081 tgcaaactgg ctcagtctaa tggctggggg gtgatggtga gccaccgctc tggggagact
     1141 gaggacacat tcattgctga ccttgtggtg gggctctgca caggacagat caagactggc
     1201 gccccctgcc gctcggagcg tctggccaaa tacaaccaac tcatgaggat cgaggaggct
     1261 cttggggaca aggcaatctt tgctggacgc aagttccgta acccgaaggc caagtgagaa
     1321 gctggaggct ccaggactcc actggacaga cccaggtctt ccagacctgc ttcctgaaat
     1381 aaacactggt gccaacca
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X77753. H.sapiens TROP-2 ...[gi:1524102] Links  


LOCUS       HSTROP2A                2805 bp    mRNA    linear   PRI 12-SEP-1996
DEFINITION  H.sapiens TROP-2 gene.
ACCESSION   X77753
VERSION     X77753.1  GI:1524102
KEYWORDS    gp50 protein; TROP-2 gene.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Fornaro,M., Dell'Arciprete,R., Stella,M., Bucci,C., Nutini,M.,
            Capri,M.G. and Alberti,S.
  TITLE     Cloning of the gene encoding Trop-2, a cell-surface glycoprotein
            expressed by human carcinomas
  JOURNAL   Int. J. Cancer 62 (5), 610-618 (1995)
  MEDLINE   95394524
REFERENCE   2
  AUTHORS   Alberti,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-1994) S. Alberti, Institute Mario Negri Sud, Via
            Nazionale, 66030 Santa Maria Imbaro, Chieti, ITALY
  REMARK    revised by [3]
REFERENCE   3
  AUTHORS   Alberti,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-AUG-1995) S. Alberti, Institute Mario Negri Sud, Via
            Nazionale, 66030 Santa Maria Imbaro, Chieti, ITALY
  REMARK    revised by [4]
REFERENCE   4  (bases 1 to 2805)
  AUTHORS   Alberti,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-SEP-1996) S. Alberti, Institute Mario Negri Sud, Via
            Nazionale, 66030 Santa Maria Imbaro, Chieti, ITALY
COMMENT     On Sep 6, 1996 this sequence version replaced gi:944831.
FEATURES             Location/Qualifiers
     source          1..2805
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="1"
                     /map="p32"
                     /cell_type="epithelial"
                     /germline
     gene            1..2805
                     /gene="TROP-2"
     CDS             616..1587
                     /gene="TROP-2"
                     /codon_start=1
                     /product="gp50/Trop-2"
                     /protein_id="CAA54799.1"
                     /db_xref="GI:606779"
                     /db_xref="SWISS-PROT:P09758"
                     /translation="MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVC
                     SPDGPGGRCQCRALGSGMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDG
                     LYDPDCDPEGRFKARQCNQTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRH
                     RPTAGAFNHSDLDAELRRLFRERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGEVD
                     IGDAAYYFERDIKGESLFQGRGGLDLRVRGEPLQVERTLIYYLDEIPPKFSMKRLTAG
                     LIAVIVVVVVALVAGMAVLVITNRRKSGKYKKVEIKELGELRKEPSL"
     sig_peptide     616..794
                     /gene="TROP-2"
                     /product="gp50/Trop-2"
BASE COUNT      618 a    755 c    801 g    631 t
ORIGIN      
        1 cgggtctgat agtccctacc tgtcaggact ggtgttagga tgagataatg tttgtgaact
       61 gtaaacatat ataaacgtgt gctactgtga gaactggaac aaagaagaga gggagtgaga
      121 gaaatcaagg gagggctggg gctgggaaag aacgaaaagg gagtcgcgta tagaggagag
      181 gcgacagtcg cgagccacac tttgcaatga aactctttag actttctgcc gggagagcgg
      241 cccagacgcg ccaggtctgt agcaggaggc cgcgagggcg ggtccccaga agcctacagg
      301 tgagtatcgg ttctcccctt cccggctttc ggtccggagg aggcgggagc agcttccctg
      361 ttctgatcct atcgcgggcg gcgcagggcc ggcttggcct tccgtgggac ggggaggggg
      421 gcgggatgtg tcacccaaat accagtgggg acggtcggtg gtggaaccag ccgggcaggt
      481 cgggtagagt ataagagccg gagggagcgg ccggggcgca gacgcctgca gaccatccca
      541 gacgccggag cccgagcccc gacgagtccc cgcgcctcat ccgcccgcgt ccggtccgcg
      601 ttcctccgcc ccaccatggc tcggggcccc ggcctcgcgc cgccaccgct gcggctgccg
      661 ctgctgctgc tggtgctggc ggcggtgacc ggccacacgg ccgcgcagga caactgcacg
      721 tgtcccacca acaagatgac cgtgtgcagc cccgacggcc ccggcggccg ctgccagtgc
      781 cgcgcgctgg gctcgggcat ggcggtcgac tgctccacgc tgacctccaa gtgtctgctg
      841 ctcaaggcgc gcatgagcgc ccccaagaac gcccgcacgc tggtgcggcc gagtgagcac
      901 gcgctcgtgg acaacgatgg cctctacgac cccgactgcg accccgaggg ccgcttcaag
      961 gcgcgccagt gcaaccagac gtcggtgtgc tggtgcgtga actcggtggg cgtgcgccgc
     1021 acggacaagg gcgacctgag cctacgctgc gatgagctgg tgcgcaccca ccacatcctc
     1081 attgacctgc gccaccgccc caccgccggc gccttcaacc actcagacct ggacgccgag
     1141 ctgaggcggc tcttccgcga gcgctatcgg ctgcacccca agttcgtggc ggccgtgcac
     1201 tacgagcagc ccaccatcca gatcgagctg cggcagaaca cgtctcagaa ggccgccggt
     1261 gaagtggata tcggcgatgc cgcctactac ttcgagaggg acatcaaggg cgagtctcta
     1321 ttccagggcc gcggcggcct ggacttgcgc gtgcgcggag aacccctgca ggtggagcgc
     1381 acgctcatct attacctgga cgagattccc ccgaagttct ccatgaagcg cctcaccgcc
     1441 ggcctcatcg ccgtcatcgt ggtggtcgtg gtggccctcg tcgccggcat ggccgtcctg
     1501 gtgatcacca accggagaaa gtcggggaag tacaagaagg tggagatcaa ggaactgggg
     1561 gagttgagaa aggaaccgag cttgtaggta cccggcgggg caggggatgg ggtggggtac
     1621 cggatttcgg tatcgtccca gacccaagtg agtcacgctt cctgattcct cggcgcaaag
     1681 gagacgttta tcctttcaaa ttcctgcctt ccccctccct tttgcgcaca caccaggttt
     1741 aatagatcct ggcctcaggg tctcctttct ttctcacttc tgtcttgagg gaagcatttc
     1801 taaaatgtat cccctttcgg tccaacaaca ggaaacctga ctggggcagt gaaggaaggg
     1861 atggcacagc gttatgtgta aaaaacaagt atctgtatga caacccggga tcgtttgcaa
     1921 gtaactgaat ccattgcgac attgtgaagg cttaaatgag tttagatggg aaatagcgtt
     1981 gttatcgcct tgggtttaaa ttatttgatg agttccactt gtatcatggc ctacccgagg
     2041 agaagaggag tttgttaact gggcctatgt agtagcctca tttaccatcg tttgtattac
     2101 tgaccacata tgcttgtcac tgggaaagaa gcctgtttca gctgcctgaa cgcagtttgg
     2161 atgtctttga ggacagacat tgcccggaaa ctcagtctat ttattcttca gcttgccctt
     2221 actgccactg atattggtaa tgttcttttt tgtaaaatgt ttgtacatat gttgtctttg
     2281 ataatgttgc tgtaattttt taaaataaaa cacgaattta ataaaatatg ggaaaggcac
     2341 aaaccagaag tcggcatttg tgaaaagtcc ctccagattt ctatcacttt ggtctctaat
     2401 ttcccaagac ttgtattttt tttttatttc aaattataac actttttttt cccccagaag
     2461 tgggtgtttc atgttgctac tctggtgtgt cccaagatat cctaactggc cagtgtaaat
     2521 gctattcttt ctaaataaga ttatttggaa acttccttca aactgcagga gggcgagctc
     2581 tgagggcacg agaagctaaa actagctgct tttgatgaaa aagagtgcca gtctttggtc
     2641 atctctaaac aaggcttatc accaatggag acagaaaact ctagttcaag agctgtacct
     2701 cctttgaatc ccagccctac tcgaaataag tggtactatt tccatttagc ctttgagcaa
     2761 atcacttaac tcaaaggcgt tgtggctcta agattaaacg acttt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AL109804. Human DNA sequenc...[gi:11121192] Links  


LOCUS       HS1009E24             185820 bp    DNA     linear   PRI 26-APR-2001
DEFINITION  Human DNA sequence from clone RP5-1009E24 on chromosome 20 Contains
            the SN gene encoding sialoadhesin, a novel gene similar to
            KIAA0417, the CENPB gene for centromere protein B, the CDC25B gene
            for Cell division cycle protein 25B, three novel genes, the 5' end
            of gene KIAA1271, nine CpG islands, ESTs, STSs and GSSs, complete
            sequence.
ACCESSION   AL109804
VERSION     AL109804.41  GI:11121192
KEYWORDS    HTG; CDC25B; CENPB; Centromere; CpG island; KIAA0417; KIAA1271;
            sialoadhesin; SN.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 185820)
  AUTHORS   Phillimore,B.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-APR-2001) Sanger Centre, Hinxton, Cambridgeshire,
            CB10 1SA, UK. E-mail enquiries: humquery@sanger.ac.uk Clone
            requests: clonerequest@sanger.ac.uk
COMMENT     On Nov 8, 2000 this sequence version replaced gi:10443466.
            During sequence assembly data is compared from overlapping clones.
            Where differences are found these are annotated as variations
            together with a note of the overlapping clone name. Note that the
            variation annotation may not be found in the sequence submission
            corresponding to the overlapping clone, as we submit sequences with
            only a small overlap as described above.
            The following abbreviations are used to associate primary accession
            numbers given in the feature table with their source databases:
            Em:, EMBL; Sw:, SWISSPROT; Tr:, TREMBL; Wp:, WORMPEP; Information
            on the WORMPEP database can be found at
            http://www.sanger.ac.uk/Projects/C_elegans/wormpep This sequence
            was generated from part of bacterial clone contigs of human
            chromosome 20, constructed by the Sanger Centre Chromosome 20
            Mapping Group.  Further information can be found at
            http://www.sanger.ac.uk/HGP/Chr20
            This sequence is the entire insert of clone RP5-1009E24 The true
            left end of clone RP11-119B16 is at 128095 in this sequence. The
            true right end of clone RP5-964F7 is at 20907 in this sequence.
            This sequence was finished as follows unless otherwise noted: all
            regions were either double-stranded or sequenced with an alternate
            chemistry or covered by high quality data (i.e., phred quality >=
            30); an attempt was made to resolve all sequencing problems, such
            as compressions and repeats; all regions were covered by at least
            one plasmid subclone or more than one M13 subclone; and the
            assembly was confirmed by restriction digest. RP5-1009E24 is from
            the library RPCI-5 constructed by the group of Pieter de Jong. For
            further details see
            http://www.chori.org/bacpac/home.htm
            VECTOR: pCYPAC2.
FEATURES             Location/Qualifiers
     source          1..185820
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="20"
                     /clone="RP5-1009E24"
                     /clone_lib="RPCI-5"
     repeat_region   644..695
                     /note="MIR repeat: matches 195..250 of consensus"
     repeat_region   1011..1288
                     /note="AluSx repeat: matches 37..312 of consensus"
     misc_feature    2195..2583
                     /note="match: GSS: Em:AQ790341"
     repeat_region   3333..3423
                     /note="MIR repeat: matches 79..169 of consensus"
     repeat_region   3643..3706
                     /note="MER69 repeat: matches 70..136 of consensus"
     repeat_region   3708..3789
                     /note="MER69 repeat: matches 2422..2512 of consensus"
     repeat_region   3708..3739
                     /note="16 copies 2 mer tt 84% conserved"
     misc_feature    6250..7126
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   6526..6555
                     /note="10 copies 3 mer agc 90% conserved"
     repeat_region   6702..6757
                     /note="28 copies 2 mer cc 73% conserved"
     repeat_region   7387..7686
                     /note="AluSc repeat: matches 1..300 of consensus"
     repeat_region   7710..8015
                     /note="AluSp repeat: matches 1..307 of consensus"
     repeat_region   8249..8302
                     /note="27 copies 2 mer tt 72% conserved"
     repeat_region   8361..8589
                     /note="AluJo repeat: matches 80..292 of consensus"
     repeat_region   8590..8899
                     /note="AluSq repeat: matches 1..310 of consensus"
     repeat_region   8900..8968
                     /note="AluJo repeat: matches 9..80 of consensus"
     repeat_region   9029..9471
                     /note="L1ME2 repeat: matches 5706..6155 of consensus"
     repeat_region   9484..9645
                     /note="MLT2CA repeat: matches 321..483 of consensus
                     MLT2CA repeat: matches 321..483 of consensus"
     repeat_region   9840..10070
                     /note="L2 repeat: matches 2417..2664 of consensus"
     repeat_region   10677..10756
                     /note="L1ME2 repeat: matches 5557..5637 of consensus"
     repeat_region   10754..10907
                     /note="L1ME3 repeat: matches 5901..6065 of consensus"
     gene            complement(11558..31714)
                     /gene="SN"
     mRNA            complement(join(11558..13205,13741..13813,14186..14288,
                     14548..14850,15926..16186,16489..16788,17107..17358,
                     17448..17717,18033..18284,18807..19118,19249..19509,
                     21172..21474,21671..21928,22384..22719,23788..24045,
                     25928..26227,27783..28037,28411..28677,30330..30626,
                     30933..31292,31666..31714))
                     /gene="SN"
                     /product="dJ1009E24.1.1 (Sialoadhesin (isoform 1) )"
                     /note="match: cDNAs: Em:Z36293 Em:Z36233 Em:AK024462
                     match: ESTs: Em:AI347674 Em:AI818995 Em:AI864580
                     Em:AI375186 Em:AI936668 Em:AI032935 Em:AI031641
                     Em:BE857558"
                     /evidence=not_experimental
     mRNA            complement(join(11558..14850,15926..16186,16489..16788,
                     17107..17358,17448..17665))
                     /gene="SN"
                     /product="dJ1009E24.1.2 (Sialoadhesin (isoform 2))"
                     /note="match: cDNAs: Em:AK024479 Em:AK024459"
                     /evidence=not_experimental
     polyA_site      complement(11558)
                     /gene="SN"
     polyA_signal    complement(11575..11580)
                     /gene="SN"
     repeat_region   12381..12733
                     /note="MLT1J repeat: matches 91..485 of consensus"
     CDS             complement(join(13146..13205,13741..13813,14186..14288,
                     14548..14850,15926..16186,16489..16788,17107..17358,
                     17448..17717,18033..18284,18807..19118,19249..19509,
                     21172..21474,21671..21928,22384..22719,23788..24045,
                     25928..26227,27783..28037,28411..28677,30330..30626,
                     30933..31292,31666..31714))
                     /gene="SN"
                     /note="match: proteins: Tr:Q62523 Sw:Q62230 Sw:P20273
                     Tr:Q08476"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.1.1 (Sialoadhesin (isoform 1) )"
                     /protein_id="CAC17543.1"
                     /db_xref="GI:11493365"
                     /translation="MGFLPKLLLLASFFPAGQASWGVSSPQDVQGVKGSCLLIPCIFS
                     FPADVEVPDGITAIWYYDYSGQRQVVSHSADPKLVEARFRGRTEFMGNPEHRVCNLLL
                     KDLQPEDSGSYNFRFEISEVNRWSDVKGTLVTVTEEPRVPTIASPVELLEGTEVDFNC
                     STPYVCLQEQVRLQWQGQDPARSVTFNSQKFEPTGVGHLETLHMAMSWQDHGRILRCQ
                     LSVANHRAQSEIHLQVKYAPKGVKILLSPSGRNILPGELVTLTCQVNSSYPAVSSIKW
                     LKDGVRLQTKTGVLHLPQAAWSDAGVYTCQAENGVGSLVSPPISLHIFMAEVQVSPAG
                     PILENQTVTLVCNTPNEAPSDLRYSWYKNHVLLEDAHSHTLRLHLATRADTGFYFCEV
                     QNVHGSERSGPVSVVVNHPPLTPVLTAFLETQAGLVGILHCSVVSEPLATLVLSHGGH
                     ILASTSGDSDHSPRFSGTSGPNSLRLEIRDLEETDSGEYKCSATNSLGNATSTLDFHA
                     NAARLLISPAAEVVEGQAVTLSCRSGLSPTPDARFSWYLNGALLHEGPGSSLLLPAAS
                     STDAGSYHCRARDGHSASGPSSPAVLTVLYPPRQPTFTTRLDLDAAGAGAGRRGLLLC
                     RVDSDPPARLQLLHKDRVVATSLPSGGGCSTCGGCSPRMKVTKAPNLLRVEIHNPLLE
                     EEGLYLCEASNALGNASTSATFNGQATVLAIAPSHTLQEGTEANLTCNVSREAAGSPA
                     NFSWFRNGVLWAQGPLETVTLLPVARTDAALYACRILTEAGAQLSTPVLLSVLYPPDR
                     PKLSALLDMGQGHMALFICTVDSRPLALLALFHGEHLLATSLGPQVPSHGRFQAKAEA
                     NSLKLEVRELGLGDSGSYRCEATNVLGSSNTSLFFQVRGAWVQVSPSPELQEGQAVVL
                     SCQVHTGVPEGTSYRWYRDGQPLQESTSATLRFAAITLTQAGAYHCQAQAPGSATTSL
                     AAPISLHVSYAPRHVTLTTLMDTGPGRLGLLLCRVDSDPPAQLRLLHGDRLVASTLQG
                     VGGPEGSSPRLHVAVAPNTLRLEIHGAMLEDEGVYICEASNTLGQASASADFDAQAVN
                     VQVWPGATVREGQLVNLTCLVWTTHPAQLTYTWYQDGQQRLDAHSIPLPNVTVRDATS
                     YRCGVGPPGRAPRLSRPITLDVLYAPRNLRLTYLLESHGGQLALVLCTVDSRPPAQLA
                     LSHAGRLLASSTAASVPNTLRLELRGPQPRDEGFYSCSARSPLGQANTSLELRLEGVR
                     VILAPEAAVPEGAPITVTCADPAAHAPTLYTWYHNGRWLQEGPAASLSFLVATRAHAG
                     AYSCQAQDAQGTRSSRPAALQVLYAPQDAVLSSFRDSRARSMAVIQCTVDSEPPAELA
                     LSHDGKVLATSSGVHSLASGTGHVQVARNALRLQVQDVPAGDDTYVCTAQNLLGSIST
                     IGRLQVEGARVVAEPGLDVPEGAALNLSCRLLGGPGPVGNSTFAWFWNDRRLHAEPVP
                     TLAFTHVARAQAGMYHCLAELPTGAAASAPVMLRVLYPPKTPTMMVFVEPEGGLRGIL
                     DCRVDSEPLASLTLHLGSRLVASSQPQGAPAEPHIHVLASPNALRVDIEALRPSDQGE
                     YICSASNVLGSASTSTYFGVRALHRLHQFQQLLWVLGLLVGLLLLLLGLGACYTWRRR
                     RVCKQSMGENSVEMAFQKETTQLIDPDAATCETSTCAPPLG"
     repeat_region   14212..14286
                     /note="25 copies 3 mer cag 65% conserved"
     CDS             complement(join(14492..14850,15926..16186,16489..16788,
                     17107..17358,17448..>17665))
                     /gene="SN"
                     /codon_start=2
                     /evidence=not_experimental
                     /product="dJ1009E24.1.2 (Sialoadhesin (isoform 2))"
                     /protein_id="CAC17542.1"
                     /db_xref="GI:11493364"
                     /translation="LALVLCTVDSRPPAQLALSHAGRLLASSTAASVPNTLRLELRGP
                     QPRDEGFYSCSARSPLGQANTSLELRLEGVRVILAPEAAVPEGAPITVTCADPAAHAP
                     TLYTWYHNGRWLQEGPAASLSFLVATRAHAGAYSCQAQDAQGTRSSRPAALQVLYAPQ
                     DAVLSSFRDSRARSMAVIQCTVDSEPPAELALSHDGKVLATSSGVHSLASGTGHVQVA
                     RNALRLQVQDVPAGDDTYVCTAQNLLGSISTIGRLQVEGARVVAEPGLDVPEGAALNL
                     SCRLLGGPGPVGNSTFAWFWNDRRLHAEPVPTLAFTHVARAQAGMYHCLAELPTGAAA
                     SAPVMLRVLYPPKTPTMMVFVEPEGGLRGILDCRVDSEPLASLTLHLGSRLVASSQPQ
                     GAPAEPHIHVLASPNALRVDIEALRPSDQGEYICSASNVLGSASTSTYFGVRGEGRGL
                     HLPGHSAQKPSS"
     repeat_region   14872..15301
                     /note="MLT1H repeat: matches 8..547 of consensus"
     repeat_region   15333..15405
                     /note="MIR repeat: matches 196..255 of consensus"
     repeat_region   15406..15711
                     /note="AluSx repeat: matches 6..311 of consensus"
     repeat_region   15712..15816
                     /note="MIR repeat: matches 47..196 of consensus"
     repeat_region   18761..18802
                     /note="21 copies 2 mer ac 100% conserved"
     repeat_region   19757..19875
                     /note="L2 repeat: matches 2617..2749 of consensus"
     repeat_region   20406..20683
                     /note="AluSp repeat: matches 1..296 of consensus"
     misc_feature    20673..21013
                     /note="match: STS: Em:HSB026XH5"
     repeat_region   20707..20752
                     /note="23 copies 2 mer ca 87% conserved"
     repeat_region   22819..22997
                     /note="MER91A repeat: matches 2..186 of consensus
                     MER91A repeat: matches 2..186 of consensus"
     repeat_region   24281..24402
                     /note="AluJo repeat: matches 1..124 of consensus"
     repeat_region   24403..24693
                     /note="AluJb repeat: matches 1..294 of consensus"
     repeat_region   24693..25003
                     /note="AluSx repeat: matches 1..311 of consensus"
     repeat_region   25004..25187
                     /note="AluJo repeat: matches 434..300 of consensus"
     repeat_region   25274..25584
                     /note="AluSc repeat: matches 1..309 of consensus"
     repeat_region   26485..26889
                     /note="L1MC4 repeat: matches 7188..7559 of consensus"
     repeat_region   26890..27187
                     /note="AluSx repeat: matches 1..296 of consensus"
     repeat_region   27188..27292
                     /note="L1MC4 repeat: matches 7559..7656 of consensus"
     repeat_region   27482..27599
                     /note="L1MC4 repeat: matches 7854..7977 of consensus"
     repeat_region   29202..29509
                     /note="AluJo repeat: matches 1..301 of consensus"
     repeat_region   29510..29569
                     /note="MIR repeat: matches 185..244 of consensus"
     repeat_region   29574..29882
                     /note="AluSx repeat: matches 1..310 of consensus"
     repeat_region   32272..32335
                     /note="L2 repeat: matches 2163..2229 of consensus"
     repeat_region   32934..33069
                     /note="L1MB5 repeat: matches 5022..5156 of consensus"
     repeat_region   33070..33350
                     /note="AluJo repeat: matches 1..296 of consensus"
     repeat_region   33352..33653
                     /note="AluSc repeat: matches 1..300 of consensus"
     repeat_region   33654..34656
                     /note="L1MB5 repeat: matches 5156..6164 of consensus"
     repeat_region   37748..38059
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   38241..38563
                     /note="AluJo repeat: matches 1..312 of consensus"
     repeat_region   39027..39221
                     /note="MLT1H repeat: matches 110..307 of consensus"
     repeat_region   39512..39809
                     /note="AluSx repeat: matches 1..294 of consensus"
     repeat_region   39860..40183
                     /note="AluJb repeat: matches 3..310 of consensus"
     repeat_region   40187..40253
                     /note="MIR repeat: matches 79..150 of consensus"
     repeat_region   40355..40547
                     /note="MLT1J repeat: matches 208..410 of consensus"
     repeat_region   40569..40724
                     /note="FRAM repeat: matches 8..163 of consensus"
     repeat_region   40736..40867
                     /note="MLT1J repeat: matches 37..195 of consensus"
     repeat_region   41138..41232
                     /note="MIR repeat: matches 5..107 of consensus"
     repeat_region   41793..41956
                     /note="AluY repeat: matches 136..299 of consensus"
     repeat_region   41965..42234
                     /note="AluSx repeat: matches 3..284 of consensus"
     repeat_region   42760..43063
                     /note="AluJb repeat: matches 1..299 of consensus"
     repeat_region   43075..43203
                     /note="MLT1J repeat: matches 37..195 of consensus"
     repeat_region   43221..43334
                     /note="AluSg/x repeat: matches 198..311 of consensus"
     repeat_region   43415..43726
                     /note="AluSx repeat: matches 5..305 of consensus"
     repeat_region   43730..43942
                     /note="MIR repeat: matches 2..221 of consensus"
     misc_feature    complement(43953..44223)
                     /note="match: STS: Em:L18465"
     repeat_region   43997..44096
                     /note="AluJo/FLAM repeat: matches 2..102 of consensus"
     repeat_region   44097..44383
                     /note="AluJb repeat: matches 1..282 of consensus"
     repeat_region   44384..44414
                     /note="FLAM_C repeat: matches 102..132 of consensus"
     repeat_region   44434..44598
                     /note="L1M4 repeat: matches 1199..1354 of consensus"
     repeat_region   44494..44609
                     /note="HAL1 repeat: matches 170..289 of consensus"
     repeat_region   45597..45902
                     /note="AluSg repeat: matches 3..308 of consensus"
     repeat_region   46249..46504
                     /note="AluJb repeat: matches 1..301 of consensus"
     repeat_region   46898..47239
                     /note="TIGGER2 repeat: matches 1..2717 of consensus
                     TIGGER2 repeat: matches 1..2717 of consensus"
     repeat_region   47351..47650
                     /note="AluSx repeat: matches 1..300 of consensus"
     repeat_region   47654..47808
                     /note="L1MC2 repeat: matches 5182..5342 of consensus"
     repeat_region   47809..48086
                     /note="AluSx repeat: matches 9..294 of consensus"
     repeat_region   48087..48432
                     /note="L1MC2 repeat: matches 5342..5675 of consensus"
     repeat_region   48434..48518
                     /note="AluJo repeat: matches 1..91 of consensus"
     repeat_region   48519..48832
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   48833..49043
                     /note="AluJo repeat: matches 91..273 of consensus"
     repeat_region   49044..49171
                     /note="64 copies 2 mer ga 61% conserved"
     repeat_region   49209..49921
                     /note="L1MC2 repeat: matches 5661..6326 of consensus"
     repeat_region   49952..50270
                     /note="AluSx repeat: matches 12..298 of consensus"
     repeat_region   50310..50476
                     /note="FRAM repeat: matches 1..165 of consensus"
     misc_feature    complement(50609..50813)
                     /note="match: STS: Em:G04713"
     repeat_region   50818..51120
                     /note="AluSx repeat: matches 1..303 of consensus"
     repeat_region   51155..51340
                     /note="AluJb repeat: matches 136..312 of consensus"
     repeat_region   51347..51645
                     /note="AluSg repeat: matches 1..299 of consensus"
     repeat_region   52013..52178
                     /note="L2 repeat: matches 1591..1754 of consensus"
     repeat_region   52189..52690
                     /note="MLT1J repeat: matches 13..513 of consensus"
     repeat_region   53124..53252
                     /note="L2 repeat: matches 2363..2493 of consensus"
     misc_feature    53729..54143
                     /note="match: GSS: Em:AQ071055"
     repeat_region   53834..53930
                     /note="MIR repeat: matches 92..191 of consensus"
     repeat_region   54064..54173
                     /note="55 copies 2 mer tg 73% conserved"
     repeat_region   55555..55726
                     /note="86 copies 2 mer ac 69% conserved"
     repeat_region   56198..56475
                     /note="AluJo repeat: matches 12..289 of consensus"
     repeat_region   57171..57398
                     /note="114 copies 2 mer gg 54% conserved"
     repeat_region   58617..58760
                     /note="AluJb repeat: matches 146..289 of consensus"
     repeat_region   59324..59623
                     /note="AluSg repeat: matches 1..300 of consensus"
     repeat_region   59830..59867
                     /note="MIR repeat: matches 107..144 of consensus"
     repeat_region   60048..60351
                     /note="AluSq repeat: matches 1..304 of consensus"
     repeat_region   60446..60505
                     /note="L2 repeat: matches 2643..2706 of consensus"
     repeat_region   60556..60761
                     /note="MER63A repeat: matches 1..209 of consensus
                     MER63A repeat: matches 1..209 of consensus"
     repeat_region   61277..61326
                     /note="MER5A repeat: matches 133..183 of consensus"
     repeat_region   61327..61670
                     /note="AluSx repeat: matches 1..311 of consensus"
     misc_feature    complement(61498..61660)
                     /note="match: GSS: Em:AQ750212"
     repeat_region   61671..61806
                     /note="MER5A repeat: matches 4..133 of consensus"
     repeat_region   61876..61980
                     /note="L2 repeat: matches 1129..1227 of consensus"
     repeat_region   61993..62054
                     /note="AluJ/FRAM repeat: matches 231..292 of consensus"
     repeat_region   62307..62607
                     /note="AluSx repeat: matches 1..301 of consensus"
     misc_feature    complement(62332..62490)
                     /note="match: GSS: Em:AQ015769"
     misc_feature    complement(62339..62515)
                     /note="match: GSS: Em:AQ028161"
     misc_feature    complement(62426..62603)
                     /note="match: GSS: Em:AQ891720"
     misc_feature    complement(62436..62607)
                     /note="match: GSS: Em:AQ317665"
     repeat_region   62744..62933
                     /note="L2 repeat: matches 2522..2735 of consensus"
     repeat_region   62938..63152
                     /note="L1MC5 repeat: matches 7558..7783 of consensus"
     gene            63244..77697
                     /gene="dJ1009E24.2"
     mRNA            join(63244..63303,65401..65498,66870..66994,69488..69674,
                     70079..70183,70501..70617,72803..72977,73818..73904,
                     74336..74440,74555..74813,75390..75493,76097..77697)
                     /gene="dJ1009E24.2"
                     /product="dJ1009E24.2 (novel protein similar to KIAA0417)"
                     /note="match: cDNAs: Em:AB007877 Em:AK012548 Em:AK006867"
                     /evidence=not_experimental
     CDS             join(63261..63303,65401..65498,66870..66994,69488..69674,
                     70079..70183,70501..70617,72803..72977,73818..73904,
                     74336..74440,74555..74813,75390..75493,76097..76752)
                     /gene="dJ1009E24.2"
                     /note="match: proteins: Tr:O43301"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.2 (novel protein similar to KIAA0417)"
                     /protein_id="CAC17544.3"
                     /db_xref="GI:13624748"
                     /translation="MLAVPEMGLQGLYIGSSPERSPVPSPPGSPRTQESCGIAPLTPS
                     QSPKPEVRAPQQASFSVVVAIDFGTTSSGYAFSFASDPEAIHMMRKWEGGDPGVAHQK
                     TPTCLLLTPEGAFHSFGYTARDYYHDLDPEEARDWLYFEKFKMKIHSATDLTLKTQLE
                     AVNGKTMPALEVFAHALRFFREHALQELREQSPSLPEKDTVRWVLTVPAIWKQPAKQF
                     MREAAYLAGLVSRENAEQLLIALEPEAASVYCRKLRLHQLLDLSGRAPGGGRLGERRS
                     IDSSFRQAREQLRRSRHSRTFLVESGVGELWAEMQAGDRYVVADCGGGTVDLTVHQLE
                     QPHGTLKELYKASGGPYGAVGVDLAFEQLLCRIFGEDFIATFKRQRPAAWVDLTIAFE
                     ARKRTAGPHRAGALNISLPFSFIDFYRKQRGHNVETALRRSSVNFVKWSSQGMLRMSC
                     EAMNELFQPTVSGIIQHIEALLARPEVQGVKLLFLVGGFAESAVLQHAVQAALGARGL
                     RVVVPHDVGLTILKGAVLFGQAPGVVRVRRSPLTYGVGVLNRFVPGRHPPEKLLVRDG
                     RRWCTDVFERFVAAEQSVALGEEVRRSYCPARPGQRRVLINLYCCAAEDARFITDPGV
                     RKCGALSLELEPADCGQDTAGAPPGRREIRAAMQFGDTEIKVTAVDVSTNRSVRASID
                     FLSN"
     repeat_region   63417..63452
                     /note="18 copies 2 mer tg 86% conserved"
     repeat_region   63631..63726
                     /note="48 copies 2 mer gt 61% conserved"
     repeat_region   65936..66232
                     /note="AluSg repeat: matches 1..294 of consensus"
     repeat_region   67275..67457
                     /note="AluSg/x repeat: matches 125..307 of consensus"
     repeat_region   67462..67770
                     /note="AluY repeat: matches 1..297 of consensus"
     repeat_region   67771..68082
                     /note="AluJo repeat: matches 1..299 of consensus"
     repeat_region   68220..68425
                     /note="L1ME repeat: matches 5620..5833 of consensus"
     repeat_region   68462..68513
                     /note="26 copies 2 mer aa 78% conserved"
     repeat_region   69072..69282
                     /note="L2 repeat: matches 1974..2184 of consensus"
     repeat_region   69390..69438
                     /note="L2 repeat: matches 2352..2400 of consensus"
     repeat_region   70848..70885
                     /note="LTR16A repeat: matches 408..444 of consensus"
     repeat_region   70886..71187
                     /note="AluYb8 repeat: matches 1..311 of consensus"
     repeat_region   71188..71523
                     /note="LTR16A repeat: matches 64..408 of consensus"
     misc_feature    74264..76883
                     /gene="dJ1009E24.2"
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   75669..75835
                     /note="MIR repeat: matches 82..251 of consensus"
     misc_feature    complement(77427..77688)
                     /note="match: STS: Em:G14972"
     repeat_region   77529..77610
                     /note="MIR repeat: matches 59..142 of consensus"
     polyA_signal    77672..77677
                     /gene="dJ1009E24.2"
     polyA_site      77697
                     /gene="dJ1009E24.2"
     gene            complement(78095..92330)
                     /gene="dJ1009E24.3"
     mRNA            complement(join(78095..78744,78983..79105,80055..80193,
                     83122..83264,84668..84772,92249..92330))
                     /gene="dJ1009E24.3"
                     /product="dJ1009E24.3 (novel protein)"
                     /note="match: cDNAs: Em:AK024220 Em:AK000557 Em:AK022713
                     match: ESTs: Em:BE795641 Em:BE786533 Em:BE793892
                     Em:BE536848"
                     /evidence=not_experimental
     polyA_site      complement(78095)
                     /gene="dJ1009E24.3"
     polyA_signal    complement(78112..78117)
                     /gene="dJ1009E24.3"
     CDS             complement(join(78644..78744,78983..79105,80055..80193,
                     83122..83264,84668..84686))
                     /gene="dJ1009E24.3"
                     /note="match: proteins: Tr:Q62523"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.3 (novel protein)"
                     /protein_id="CAC17545.1"
                     /db_xref="GI:11493367"
                     /translation="MAAANKGNKPRVRSIRFAAGHDAEGSHSHVHFDEKLHDSVVMVT
                     QESDSSFLVKVGFLKILHRYEITFTLPPVHRLSKDVREAPVPSLHLKLLSVVPVPEGY
                     SVKCEYSAHKEGVLKEEILLACEGGTGTCVRVTVQARVMDRHHGTPMLLDGVKCVGAE
                     LEYDSEHSDWHGFD"
     repeat_region   81655..81948
                     /note="AluSq repeat: matches 1..299 of consensus"
     repeat_region   82214..82427
                     /note="AluSg/x repeat: matches 87..303 of consensus"
     repeat_region   82771..82860
                     /note="MIR repeat: matches 29..122 of consensus"
     repeat_region   82976..83010
                     /note="MIR repeat: matches 110..144 of consensus"
     repeat_region   83267..83313
                     /note="L1PBa repeat: matches -1202..-1156 of consensus"
     misc_feature    83884..84178
                     /note="match: GSS: Em:AQ028334"
     repeat_region   84500..84575
                     /note="38 copies 2 mer ca 75% conserved"
     misc_feature    complement(85104..85666)
                     /gene="dJ1009E24.3"
                     /note="match: GSS: Em:AQ375029"
     misc_feature    complement(85341..85642)
                     /gene="dJ1009E24.3"
                     /note="match: GSS: Em:AQ092132"
     repeat_region   85441..85583
                     /note="L1MC4 repeat: matches 6879..7018 of consensus"
     repeat_region   85633..85880
                     /note="L1MC5 repeat: matches 7005..7228 of consensus"
     repeat_region   85917..86221
                     /note="AluSg repeat: matches 1..302 of consensus"
     repeat_region   86336..86415
                     /note="L1MC4 repeat: matches 7341..7420 of consensus"
     repeat_region   86364..86543
                     /note="L1MC5 repeat: matches 7268..7480 of consensus"
     repeat_region   86592..86885
                     /note="AluSx repeat: matches 1..310 of consensus"
     repeat_region   86970..87236
                     /note="AluJb repeat: matches 2..299 of consensus"
     repeat_region   87244..87416
                     /note="MER58A repeat: matches 50..224 of consensus"
     repeat_region   87561..87861
                     /note="AluSx repeat: matches 2..302 of consensus"
     repeat_region   87862..88162
                     /note="AluSp repeat: matches 1..302 of consensus"
     repeat_region   88958..89325
                     /note="L2 repeat: matches 2302..2708 of consensus"
     repeat_region   89944..90083
                     /note="MIR repeat: matches 25..173 of consensus"
     misc_feature    91907..93346
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   94050..94362
                     /note="AluSx repeat: matches 1..303 of consensus"
     repeat_region   94488..94564
                     /note="L2 repeat: matches 2616..2701 of consensus"
     repeat_region   95014..95314
                     /note="AluY repeat: matches 1..301 of consensus"
     repeat_region   95402..95539
                     /note="FLAM_C repeat: matches 1..133 of consensus"
     repeat_region   95604..95637
                     /note="L1MC/D repeat: matches 5604..5637 of consensus"
     repeat_region   95638..95989
                     /note="AluYb8 repeat: matches 1..303 of consensus"
     repeat_region   95992..96126
                     /note="AluJo/FLAM repeat: matches 1..133 of consensus"
     repeat_region   96127..96264
                     /note="L1MC/D repeat: matches 5474..5605 of consensus"
     repeat_region   96404..96429
                     /note="13 copies 2 mer ac 100% conserved"
     repeat_region   96433..96652
                     /note="AluSg/x repeat: matches 91..309 of consensus"
     repeat_region   96659..96885
                     /note="L1ME2 repeat: matches 5594..5826 of consensus"
     repeat_region   96908..97088
                     /note="AluSg repeat: matches 1..179 of consensus"
     repeat_region   97105..97539
                     /note="MLT2D repeat: matches 115..553 of consensus"
     repeat_region   97540..97847
                     /note="AluSx repeat: matches 1..308 of consensus"
     repeat_region   97848..97861
                     /note="MLT2D repeat: matches 103..115 of consensus"
     repeat_region   97876..98166
                     /note="AluSx repeat: matches 2..302 of consensus"
     repeat_region   98350..98472
                     /note="L2 repeat: matches 2570..2710 of consensus"
     repeat_region   98522..98673
                     /note="AluJb repeat: matches 1..124 of consensus"
     repeat_region   98674..98964
                     /note="AluSx repeat: matches 3..296 of consensus"
     repeat_region   98965..99123
                     /note="AluJb repeat: matches 124..292 of consensus"
     repeat_region   99133..99442
                     /note="AluSx repeat: matches 1..310 of consensus"
     misc_feature    complement(99437..99666)
                     /note="Single clone region. sequence from reads from a
                     short insert library derived from a clone PCR.Restriction
                     digest data confirm the assembly."
     repeat_region   99478..99646
                     /note="MLT1E repeat: matches 1..156 of consensus"
     repeat_region   100425..100800
                     /note="MLT1E repeat: matches 186..568 of consensus"
     repeat_region   100844..101151
                     /note="AluSx repeat: matches 1..308 of consensus"
     repeat_region   101270..101563
                     /note="AluSq repeat: matches 1..296 of consensus"
     gene            complement(102090..106022)
                     /gene="dJ1009E24.4"
     mRNA            complement(join(102090..102950,103007..103130,
                     103332..103576,103782..103938,104250..104361,
                     105765..106022))
                     /gene="dJ1009E24.4"
                     /product="dJ1009E24.4 (novel protein)"
                     /note="match: cDNAs: Em:AL080154
                     match: ESTs: Em:AL040853 Em:AA166662 Em:BE368954
                     Em:BE652481"
                     /evidence=not_experimental
     polyA_site      complement(102090)
                     /gene="dJ1009E24.4"
     polyA_signal    complement(102107..102112)
                     /gene="dJ1009E24.4"
     misc_feature    102146..103123
                     /note="CpG island"
                     /evidence=not_experimental
     CDS             complement(join(103466..103576,103782..103938,
                     104250..104361,105765..105873))
                     /gene="dJ1009E24.4"
                     /note="match: proteins: Tr:Q62523 Tr:Q9Y4P9"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.4 (novel protein)"
                     /protein_id="CAC17546.1"
                     /db_xref="GI:11493368"
                     /translation="MASSVDEEALHQLYLWVDNIPLSRPKRNLSRDFSDGVLVAEVIK
                     FYFPKMVEMHNYVPANSLQQKLSNWGHLNRKVLKRLNFSVPDDVMRKIAQCAPGVVEL
                     VLIPLRQRLEERQRRRKQGAGSLQELAPQDGSGYMDVGKVAFSISPSRLELSFCPSSC
                     HL"
     repeat_region   104648..104947
                     /note="AluSx repeat: matches 7..309 of consensus"
     repeat_region   105003..105044
                     /note="21 copies 2 mer ca 92% conserved"
     misc_feature    complement(105830..106256)
                     /note="match: GSS: Em:AQ705250"
     repeat_region   106540..106595
                     /note="MIR repeat: matches 88..147 of consensus"
     repeat_region   106672..106986
                     /note="AluSp repeat: matches 1..313 of consensus"
     repeat_region   107087..107188
                     /note="MIR repeat: matches 76..200 of consensus"
     gene            complement(108438..111069)
                     /gene="CENPB"
     mRNA            complement(108438..111069)
                     /gene="CENPB"
                     /product="dJ1009E24.5 (Centromere protein B (80KDa))"
                     /note="match: cDNAs: Em:U38847 Em:AB018254 Em:AF056312
                     Em:X05299
                     match: ESTs: Em:AA639474 Em:BE269163 Em:F01282"
                     /evidence=not_experimental
     polyA_site      complement(108438)
                     /gene="CENPB"
     polyA_signal    complement(108481..108486)
                     /gene="CENPB"
     misc_feature    108691..109111
                     /note="match: GSS: Em:AZ088838"
     CDS             complement(109270..111069)
                     /gene="CENPB"
                     /note="match: proteins: Tr:Q62523 Tr:Q13537 Tr:Q60976
                     Tr:Q9TT39"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.5 (Centromere protein B (80KDa))"
                     /protein_id="CAC17547.1"
                     /db_xref="GI:11493369"
                     /translation="MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLST
                     ILKNKRAILASERKYGVASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKE
                     KALRIAEELGMDDFTASNGWLDRFRRRHGVVSCSGVARARARNAAPRTPAAPASPAAV
                     PSEGSGGSTTGWRAREEQPPSVAEGYASQDVFSATETSLWYDFLPDQAAGLCGGDGRP
                     RQATQRLSVLLCANADGSEKLPPLVAGKSAKPRAGQAGLPCDYTANSKGGVTTQALAK
                     YLKALDTRMAAESRRVLLLAGRLAAQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVK
                     GHYRQAMLLKAMAALEGQDPSGLQLGLTEALHFVAAAWQAVEPSDIAACFREAGFGGG
                     PNATITTSLKSEGEEEEEEEEEEEEEEGEGEEEEEEGEEEEEEGGEGEELGEEEEVEE
                     EGDVDSDEEEEEDEESSSEGLEAEDWAQGVVEAGGSFGAYGAQEEAQCPTLHFLEGGE
                     DSDSDSEEEDDEEEDDEDEDDDDDEEDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDR
                     VQSHILHLEHDLVHVTRKNHARQAGVRGLGHQS"
     repeat_region   109457..109861
                     /note="135 copies 3 mer tcc 64% conserved"
     repeat_region   109719..109858
                     /note="70 copies 2 mer cc 61% conserved"
     misc_feature    110283..111575
                     /note="CpG island"
                     /evidence=not_experimental
     misc_feature    111113..111331
                     /note="Single clone region. Sequence from reads from a
                     short insert library derived from a single pUC clone.
                     Restriction digest data confirm the assembly."
     repeat_region   111190..111385
                     /note="98 copies 2 mer gg 57% conserved"
     repeat_region   112335..112438
                     /note="L2 repeat: matches 1554..1650 of consensus"
     repeat_region   112439..112750
                     /note="AluSg repeat: matches 1..308 of consensus"
     repeat_region   112751..112840
                     /note="L2 repeat: matches 1443..1554 of consensus"
     repeat_region   112871..113170
                     /note="L1M4 repeat: matches 5051..5389 of consensus"
     repeat_region   113294..113487
                     /note="97 copies 2 mer tt 61% conserved"
     repeat_region   113815..113913
                     /note="AluJo/FRAM repeat: matches 208..306 of consensus"
     repeat_region   113956..114256
                     /note="AluSx repeat: matches 1..300 of consensus"
     repeat_region   114281..114297
                     /note="AluJo repeat: matches 118..134 of consensus"
     repeat_region   114298..114588
                     /note="AluY repeat: matches 5..295 of consensus"
     repeat_region   114589..114765
                     /note="AluJo repeat: matches 133..312 of consensus"
     repeat_region   114782..114865
                     /note="AluSg/x repeat: matches 211..294 of consensus"
     repeat_region   114867..114962
                     /note="L1MB8 repeat: matches 6055..6170 of consensus"
     repeat_region   114963..115085
                     /note="AluSg/x repeat: matches 181..299 of consensus"
     repeat_region   115127..115191
                     /note="Alu repeat: matches 2..66 of consensus"
     repeat_region   115193..115455
                     /note="AluSq repeat: matches 1..264 of consensus"
     repeat_region   115456..115583
                     /note="AluJb repeat: matches 9..138 of consensus"
     repeat_region   115989..116286
                     /note="AluJb repeat: matches 3..303 of consensus"
     misc_feature    116336..117010
                     /note="match: GSS: Em:AQ313572"
     repeat_region   116489..116794
                     /note="AluSx repeat: matches 6..311 of consensus"
     repeat_region   116909..117046
                     /note="AluJo repeat: matches 157..301 of consensus"
     repeat_region   117047..117350
                     /note="AluSg repeat: matches 2..304 of consensus"
     repeat_region   117351..117524
                     /note="AluJo repeat: matches 6..157 of consensus"
     repeat_region   117529..117761
                     /note="MER33 repeat: matches 39..249 of consensus"
     repeat_region   117762..118061
                     /note="AluSx repeat: matches 1..301 of consensus"
     repeat_region   118129..118190
                     /note="31 copies 2 mer tt 75% conserved"
     repeat_region   118194..118970
                     /note="L1PA3 repeat: matches 5369..6146 of consensus"
     repeat_region   118196..118970
                     /note="L1PA2 repeat: matches 5369..6144 of consensus"
     repeat_region   119666..119974
                     /note="AluSg repeat: matches 1..310 of consensus"
     misc_feature    120138..121313
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   120393..120480
                     /note="MIR repeat: matches 55..145 of consensus"
     gene            120896..130701
                     /gene="CDC25B"
     mRNA            join(120896..121317,122250..122335,122998..123049,
                     124874..124915,125048..125084,125329..125451,
                     125557..125679,125840..125974,126308..126388,
                     126510..126686,126867..126962,127496..127558,
                     127694..127792,127990..128123,129155..129266,
                     129407..130701)
                     /gene="CDC25B"
                     /product="dJ1009E24.6.1 (Cell division cycle protein 25B,
                     isoform 1)"
                     /note="match: cDNAs: Em:S93521 Em:S78187 Em:Z68092
                     match: ESTs: Em:BE295234 Em:BE902692 Em:BE799234
                     Em:BE869880"
                     /evidence=not_experimental
     repeat_region   120930..121037
                     /note="54 copies 2 mer cc 66% conserved"
     mRNA            join(121038..121317,122208..122335,122998..123049,
                     124874..124915,125048..125084,125557..125679,
                     125840..125974,126308..126388,126510..126686,
                     126867..126962,127496..127558,127694..127792,
                     127990..128123,129155..129266,129407..129804)
                     /gene="CDC25B"
                     /product="dJ1009E24.6.2 (Cell division cycle protein 25B,
                     isoform 2)"
                     /note="match: cDNAs: Em:Z68092
                     match: ESTs: Em:BF310722 Em:W51993"
                     /evidence=not_experimental
     CDS             join(121118..121317,122208..122335,122998..123049,
                     124874..124915,125048..125084,125557..125679,
                     125840..125974,126308..126388,126510..126686,
                     126867..126962,127496..127558,127694..127792,
                     127990..128123,129155..129266,129407..129547)
                     /gene="CDC25B"
                     /note="match: proteins: Tr:Q13971"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.2 (Cell division cycle protein 25B,
                     isoform 2)"
                     /protein_id="CAC17548.1"
                     /db_xref="GI:11493370"
                     /translation="MEVPQPEPAPGSALSPAGVCGGAQRPGHLPGLLLGSHGLLGSPV
                     RAAASSPVTTLTQTMHDLAGLGSETPKSQVGTLLFRSRSRLTHLSLSRRASESSLSSE
                     SSESSDAGLCMDSPSPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPDGFVFKM
                     PWKPTHPSSTHALAEWASRREAFAQRPSSAPDLMCLSPDRKMEVEELSPLALGRFSLT
                     PAEGDTEEDDGFVDILESDLKDDDAVPPGMESLISAPLVKTLEKEEEKDLVMYSKCQR
                     LFRSPSMPCSVIRPILKRLERPQDRDTPVQNKRRRSVTPPEEQQEAEEPKARVLRSKS
                     LCHDEIENLLDSDHRELIGDYSKAFLLQTVDGKHQDLKYISPETMVALLTGKFSNIVD
                     KFVIVDCRYPYEYEGGHIKTAVNLPLERDAESFLLKSPIAPCSLDKRVILIFHCEFSS
                     ERGPRMCRFIRERDRAVNDYPSLYYPEMYILKGGYKEFFPQHPNFCEPQDYRPMNHEA
                     FKDELKTFRLKTRSWAGERSRRELCSRLQDQ"
     CDS             join(121118..121317,122250..122335,122998..123049,
                     124874..124915,125048..125084,125329..125451,
                     125557..125679,125840..125974,126308..126388,
                     126510..126686,126867..126962,127496..127558,
                     127694..127792,127990..128123,129155..129266,
                     129407..129547)
                     /gene="CDC25B"
                     /note="match: proteins: Sw:P30305 Sw:P30306 Tr:O43551"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.1 (Cell division cycle protein 25B,
                     isoform 1)"
                     /protein_id="CAC17549.1"
                     /db_xref="GI:11493371"
                     /translation="MEVPQPEPAPGSALSPAGVCGGAQRPGHLPGLLLGSHGLLGSPV
                     RAAASSPVTTLTQTMHDLAGLGSRSRLTHLSLSRRASESSLSSESSESSDAGLCMDSP
                     SPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPVRLLGHSPVLRNITNSQAPDG
                     RRKSEAGSGAASSSGEDKENDGFVFKMPWKPTHPSSTHALAEWASRREAFAQRPSSAP
                     DLMCLSPDRKMEVEELSPLALGRFSLTPAEGDTEEDDGFVDILESDLKDDDAVPPGME
                     SLISAPLVKTLEKEEEKDLVMYSKCQRLFRSPSMPCSVIRPILKRLERPQDRDTPVQN
                     KRRRSVTPPEEQQEAEEPKARVLRSKSLCHDEIENLLDSDHRELIGDYSKAFLLQTVD
                     GKHQDLKYISPETMVALLTGKFSNIVDKFVIVDCRYPYEYEGGHIKTAVNLPLERDAE
                     SFLLKSPIAPCSLDKRVILIFHCEFSSERGPRMCRFIRERDRAVNDYPSLYYPEMYIL
                     KGGYKEFFPQHPNFCEPQDYRPMNHEAFKDELKTFRLKTRSWAGERSRRELCSRLQDQ
                     "
     CDS             join(<121283..121317,122208..122335,122998..123049,
                     124874..124915,125048..125084,125329..125679,
                     125840..125974,126510..>126644)
                     /gene="CDC25B"
                     /note="match: proteins: Tr:O43550"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.5 (Cell division cycle protein 25B,
                     isoform 5)"
                     /protein_id="CAC32459.1"
                     /db_xref="GI:13159927"
                     /translation="TQTMHDLAGLGSETPKSQVGTLLFRSRSRLTHLSLSRRASESSL
                     SSESSESSDAGLCMDSPSPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPVRLL
                     GHSPVLRNITNSQAPDGRRKSEAGSGAASSSGEDKENVRFWKAGVGALREEEGACWGG
                     SLACEDPPLPSWLQDGFVFKMPWKPTHPSSTHALAEWASRREAFAQRPSSAPDLMCLS
                     PDRKMEVEELSPLALGRFSLTPAEGDTEEDDGFVDILESDLKDLVMYSKCQRLFRSPS
                     MPCSVIRPILKRLERPQDRDTPVQNKRRR"
     CDS             join(<121283..121317,122208..122335,122998..123049,
                     124874..124915,125048..125084,125329..125451,
                     125557..125679,125840..125974,126308..126388,
                     126510..>126644)
                     /gene="CDC25B"
                     /note="match: proteins: Tr:O43551"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.6 (Cell division cycle protein 25B,
                     isoform 6)"
                     /protein_id="CAC32458.1"
                     /db_xref="GI:13159926"
                     /translation="TQTMHDLAGLGSETPKSQVGTLLFRSRSRLTHLSLSRRASESSL
                     SSESSESSDAGLCMDSPSPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPVRLL
                     GHSPVLRNITNSQAPDGRRKSEAGSGAASSSGEDKENDGFVFKMPWKPTHPSSTHALA
                     EWASRREAFAQRPSSAPDLMCLSPDRKMEVEELSPLALGRFSLTPAEGDTEEDDGFVD
                     ILESDLKDDDAVPPGMESLISAPLVKTLEKEEEKDLVMYSKCQRLFRSPSMPCSVIRP
                     ILKRLERPQDRDTPVQNKRRR"
     repeat_region   122900..122923
                     /note="12 copies 2 mer tt 95% conserved"
     misc_feature    complement(123556..124203)
                     /note="match: GSS: Em:AQ382541"
     misc_feature    complement(123724..124197)
                     /note="match: GSS: Em:AQ762596"
     repeat_region   124071..124112
                     /note="21 copies 2 mer tg 100% conserved"
     misc_feature    124205..124409
                     /gene="CDC25B"
                     /note="match: GSS: Em:AQ198009"
     CDS             join(<126251..126388,126510..>126602)
                     /gene="CDC25B"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.4 (Cell division cycle protein 25B,
                     isoform 4)"
                     /protein_id="CAC17550.1"
                     /db_xref="GI:11493372"
                     /translation="EGFGLCLWVVTLSLLIWPQDDDAVPPGMESLISAPLVKTLEKEE
                     EKDLVMYSKCQRLFRSPSMPCSVIRPILKRLER"
     CDS             join(<127355..127558,127694..127792,127990..>128009)
                     /gene="CDC25B"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.3 (Cell division cycle protein 25B,
                     isoform 3)"
                     /protein_id="CAC17551.1"
                     /db_xref="GI:11493373"
                     /translation="LPWEPELNLKAKAGVASDFGNETSFFEAPGLTNLTFSPLPHPFR
                     SLQAFLLQTVDGKHQDLKYISPETMVALLTGKFSNIVDKFVIVDCRYPYEYEGGHIKT
                     AVNLPL"
     misc_feature    complement(127847..127955)
                     /note="match: GSS: Em:AQ548024"
     misc_feature    128101..128439
                     /gene="CDC25B"
                     /note="match: GSS: Em:AQ341279"
     repeat_region   128680..129071
                     /note="L2 repeat: matches 1821..2210 of consensus"
     misc_feature    129548..130698
                     /gene="CDC25B"
                     /note="match: STS: Em:G06651"
     misc_feature    complement(130195..130698)
                     /note="match: STS: Em:G22801"
     polyA_signal    130676..130681
                     /gene="CDC25B"
     polyA_site      130702
     repeat_region   130963..131020
                     /note="L2 repeat: matches 2163..2225 of consensus"
     repeat_region   131021..131316
                     /note="AluSg repeat: matches 1..296 of consensus"
     repeat_region   131317..131331
                     /note="L2 repeat: matches 2225..2238 of consensus"
     repeat_region   131597..131654
                     /note="MIR repeat: matches 117..168 of consensus"
     repeat_region   132314..132361
                     /note="L2 repeat: matches 2647..2695 of consensus"
     repeat_region   133102..133390
                     /note="L1ME3 repeat: matches 5780..6082 of consensus"
     repeat_region   133396..133575
                     /note="AluSq repeat: matches 134..313 of consensus"
     repeat_region   133576..133866
                     /note="AluSg repeat: matches 7..296 of consensus"
     misc_feature    133807..134133
                     /note="Single clone region. Assembly confirmed by
                     restriction digest data."
     repeat_region   133905..134185
                     /note="AluSq repeat: matches 1..292 of consensus"
     misc_feature    134053..134057
                     /note="weak data"
     repeat_region   134186..134207
                     /note="11 copies 2 mer ta 100% conserved"
     repeat_region   134208..134438
                     /note="AluJb repeat: matches 86..311 of consensus"
     repeat_region   134446..134769
                     /note="AluJo repeat: matches 1..312 of consensus"
     repeat_region   134840..134949
                     /note="L1ME1 repeat: matches 6040..6137 of consensus"
     repeat_region   134988..135145
                     /note="L1ME3 repeat: matches 5486..5652 of consensus"
     repeat_region   135147..135453
                     /note="AluSx repeat: matches 2..308 of consensus"
     repeat_region   135454..135756
                     /note="AluY repeat: matches 1..299 of consensus"
     repeat_region   136920..137062
                     /note="MIR repeat: matches 82..225 of consensus"
     repeat_region   137065..137192
                     /note="FLAM_C repeat: matches 1..128 of consensus"
     repeat_region   137197..137503
                     /note="AluY repeat: matches 1..306 of consensus"
     repeat_region   137705..137994
                     /note="AluJo repeat: matches 1..298 of consensus"
     repeat_region   138351..138660
                     /note="AluSg repeat: matches 1..304 of consensus"
     repeat_region   138772..138953
                     /note="FAM repeat: matches 1..175 of consensus"
     repeat_region   139041..139160
                     /note="AluSx repeat: matches 1..121 of consensus"
     repeat_region   139161..139441
                     /note="AluY repeat: matches 2..294 of consensus"
     repeat_region   139442..139621
                     /note="AluSx repeat: matches 121..298 of consensus"
     repeat_region   139917..139978
                     /note="31 copies 2 mer tt 71% conserved"
     repeat_region   140076..140371
                     /note="AluSx repeat: matches 1..298 of consensus"
     repeat_region   140568..140642
                     /note="MIR repeat: matches 185..253 of consensus"
     repeat_region   140981..141128
                     /note="L2 repeat: matches 2347..2497 of consensus"
     repeat_region   141578..141869
                     /note="AluYb8 repeat: matches 1..311 of consensus"
     repeat_region   141934..142386
                     /note="L1ME3A repeat: matches 5705..6159 of consensus"
     repeat_region   142387..142697
                     /note="AluSg repeat: matches 1..307 of consensus"
     repeat_region   142698..142823
                     /note="L1ME3A repeat: matches 5583..5705 of consensus"
     repeat_region   143764..143958
                     /note="AluSq repeat: matches 1..200 of consensus"
     repeat_region   143983..144006
                     /note="12 copies 2 mer aa 100% conserved"
     repeat_region   143984..144007
                     /note="12 copies 2 mer aa 100% conserved"
     repeat_region   144603..144681
                     /note="MER5A repeat: matches 92..171 of consensus"
     misc_feature    144701..145485
                     /note="CpG island"
                     /evidence=not_experimental
     misc_feature    144851..145589
                     /note="match: GSS: Em:AQ939180"
     misc_feature    144851..145539
                     /note="match: GSS: Em:AQ938678"
     gene            145142..149893
                     /gene="dJ1009E24.7"
     mRNA            join(145142..145344,146688..146879,148457..149893)
                     /gene="dJ1009E24.7"
                     /product="dJ1009E24.7.1 (novel protein, isoform 1)"
                     /note="match: cDNAs: Em:AK002030
                     match: ESTs: Em:BE907799 Em:AA247632 Em:R15304"
                     /evidence=not_experimental
     mRNA            join(145190..145440,146688..146757)
                     /gene="dJ1009E24.7"
                     /product="dJ1009E24.7.2 (putative novel protein, isoform
                     2)"
                     /note="match: ESTs: Em:Z44061"
                     /evidence=not_experimental
     repeat_region   145645..145951
                     /note="L1MC4 repeat: matches 7607..7922 of consensus"
     repeat_region   145952..146252
                     /note="AluSx repeat: matches 1..306 of consensus"
     repeat_region   146253..146437
                     /note="L1MC4 repeat: matches 7425..7607 of consensus"
     repeat_region   146441..146484
                     /note="22 copies 2 mer tt 84% conserved"
     CDS             join(146704..146879,148457..148883)
                     /gene="dJ1009E24.7"
                     /note="match: proteins: Tr:Q9R172 Tr:Q13577"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.7.1 (novel protein, isoform 1)"
                     /protein_id="CAC17552.1"
                     /db_xref="GI:11493374"
                     /translation="MVHAFLIHTLRAPNTEDTGLCRVLYSCVFGAEKSPDDPRPHGAE
                     RDRLLRKEQILAVARQVESMCRLQQQASGRPPMDLQPQSSDEQVPLHEAPRGAFRLAA
                     ENPFQEPRTVVWLGVLSLGFALVLDAHENLLLAEGTLRLLTRLLLDHLRLLAPSTSLL
                     LRADRIEGILTRFLPHGQLLFLNDQFVQGLEKEFSAAWPR"
     repeat_region   147088..147335
                     /note="MIR repeat: matches 20..258 of consensus"
     repeat_region   147841..148136
                     /note="AluY repeat: matches 1..293 of consensus"
     repeat_region   148293..148423
                     /note="MIR repeat: matches 71..202 of consensus"
     misc_feature    complement(148381..148792)
                     /note="match: STS: Em:G56165
                     match: GSS: Em:AQ322349"
     misc_feature    148861..149365
                     /gene="dJ1009E24.7"
                     /note="match: GSS: Em:AQ615523"
     repeat_region   149237..149533
                     /note="AluSx repeat: matches 1..298 of consensus"
     polyA_signal    149862..149867
                     /gene="dJ1009E24.7"
     polyA_site      149894
     repeat_region   150133..150201
                     /note="LTR16C repeat: matches 301..366 of consensus"
     misc_feature    complement(150573..151059)
                     /note="match: GSS: Em:AQ889255"
     misc_feature    151073..151469
                     /note="match: GSS: Em:AQ721082"
     repeat_region   151361..151437
                     /note="L2 repeat: matches 2672..2746 of consensus"
     repeat_region   151851..152146
                     /note="AluY repeat: matches 1..299 of consensus"
     repeat_region   152336..152647
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   152791..153673
                     /note="L1MC1 repeat: matches 5461..6327 of consensus"
     repeat_region   153674..153840
                     /note="AluSg/x repeat: matches 132..302 of consensus"
     repeat_region   153841..154141
                     /note="AluSx repeat: matches 1..301 of consensus"
     repeat_region   154142..154495
                     /note="L1MC1 repeat: matches 5059..5403 of consensus"
     repeat_region   154496..154795
                     /note="AluJo repeat: matches 1..301 of consensus"
     repeat_region   154796..154812
                     /note="L1MC1 repeat: matches 5044..5059 of consensus"
     repeat_region   154813..155113
                     /note="AluSp repeat: matches 1..303 of consensus"
     repeat_region   155114..155489
                     /note="L1MC1 repeat: matches 4678..5044 of consensus"
     repeat_region   155490..155790
                     /note="AluSx repeat: matches 1..303 of consensus"
     repeat_region   155791..155812
                     /note="L1MC1 repeat: matches 4766..4678 of consensus"
     repeat_region   155813..156125
                     /note="AluYa8 repeat: matches 1..308 of consensus"
     misc_feature    complement(155959..156295)
                     /note="match: GSS: Em:AQ629218"
     repeat_region   156126..156197
                     /note="L1MC1 repeat: matches 4698..4767 of consensus"
     misc_feature    complement(156184..156300)
                     /note="match: STS: Em:HSB022XB1"
     repeat_region   156198..156221
                     /note="12 copies 2 mer at 95% conserved"
     misc_feature    complement(156205..156336)
                     /note="match: GSS: Em:AZ016984"
     repeat_region   156266..156511
                     /note="AluJo repeat: matches 35..304 of consensus"
     repeat_region   156543..156841
                     /note="AluSx repeat: matches 1..301 of consensus"
     repeat_region   156842..157153
                     /note="AluY repeat: matches 1..311 of consensus"
     repeat_region   157201..157496
                     /note="AluSx repeat: matches 3..297 of consensus"
     repeat_region   157503..157799
                     /note="AluJb repeat: matches 1..299 of consensus"
     repeat_region   157803..157941
                     /note="AluJo/FLAM repeat: matches 1..133 of consensus"
     repeat_region   157944..158484
                     /note="L1 repeat: matches 3689..4328 of consensus"
     repeat_region   158591..158696
                     /note="AluSg/x repeat: matches 68..172 of consensus"
     repeat_region   158699..158934
                     /note="AluSx repeat: matches 7..242 of consensus"
     repeat_region   159172..159480
                     /note="AluSp repeat: matches 1..311 of consensus"
     repeat_region   159487..159793
                     /note="L1 repeat: matches 2839..3131 of consensus"
     repeat_region   159796..160091
                     /note="AluSq repeat: matches 1..295 of consensus"
     repeat_region   160200..160507
                     /note="AluSp repeat: matches 1..309 of consensus"
     misc_feature    complement(160512..160880)
                     /note="match: GSS: Em:AQ344470"
     repeat_region   160575..160838
                     /note="AluSc repeat: matches 36..299 of consensus"
     repeat_region   161648..161770
                     /note="FLAM_C repeat: matches 1..124 of consensus"
     repeat_region   161787..161973
                     /note="AluSg1 repeat: matches 161..298 of consensus"
     repeat_region   161974..162276
                     /note="AluSp repeat: matches 1..313 of consensus"
     repeat_region   162277..162422
                     /note="AluSg1 repeat: matches 1..161 of consensus"
     repeat_region   162432..162546
                     /note="FLAM_A repeat: matches 23..142 of consensus"
     repeat_region   162591..162887
                     /note="AluSx repeat: matches 1..296 of consensus"
     repeat_region   162895..162999
                     /note="L1MB4 repeat: matches 6081..6185 of consensus"
     repeat_region   163000..163308
                     /note="AluSq repeat: matches 1..309 of consensus"
     repeat_region   163309..163433
                     /note="L1MB4 repeat: matches 5957..6082 of consensus"
     repeat_region   163770..163937
                     /note="L2 repeat: matches 2524..2709 of consensus"
     repeat_region   164193..164495
                     /note="AluSg repeat: matches 1..307 of consensus"
     repeat_region   164625..164879
                     /note="AluSx repeat: matches 1..256 of consensus"
     repeat_region   164980..165275
                     /note="AluSx repeat: matches 1..294 of consensus"
     repeat_region   165313..165438
                     /note="AluJo repeat: matches 23..142 of consensus"
     repeat_region   165439..165735
                     /note="AluY repeat: matches 1..298 of consensus"
     repeat_region   165736..165804
                     /note="AluJo repeat: matches 142..212 of consensus"
     repeat_region   165844..166097
                     /note="AluJb repeat: matches 1..248 of consensus"
     repeat_region   166100..166187
                     /note="L1ME3 repeat: matches 6019..6114 of consensus"
     repeat_region   166188..166494
                     /note="AluSx repeat: matches 16..307 of consensus"
     repeat_region   166500..166800
                     /note="AluSc repeat: matches 1..301 of consensus"
     repeat_region   166801..166855
                     /note="L1ME3 repeat: matches 6118..6154 of consensus"
     repeat_region   167649..167811
                     /note="AluSg/x repeat: matches 124..288 of consensus"
     repeat_region   167930..168100
                     /note="FLAM_A repeat: matches 6..142 of consensus"
     repeat_region   168530..168584
                     /note="L1MC4 repeat: matches 7924..7977 of consensus"
     repeat_region   168537..168631
                     /note="L1MD3 repeat: matches 7647..7734 of consensus"
     repeat_region   168632..168936
                     /note="AluSq repeat: matches 1..305 of consensus"
     repeat_region   168937..169127
                     /note="L1MD3 repeat: matches 7471..7647 of consensus"
     repeat_region   169128..169175
                     /note="MER3 repeat: matches 1..48 of consensus"
     repeat_region   169176..169485
                     /note="AluSp repeat: matches 1..311 of consensus"
     repeat_region   169486..169550
                     /note="MER3 repeat: matches 48..112 of consensus"
     repeat_region   169551..169862
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   169863..169959
                     /note="MER3 repeat: matches 112..209 of consensus"
     repeat_region   169960..170041
                     /note="L1MD3 repeat: matches 7307..7471 of consensus"
     repeat_region   170422..170732
                     /note="AluSx repeat: matches 1..311 of consensus"
     repeat_region   171023..171056
                     /note="MIR repeat: matches 221..254 of consensus"
     misc_feature    171154..171737
                     /note="match: GSS: Em:AQ696969"
     misc_feature    171165..171970
                     /note="CpG island"
                     /evidence=not_experimental
     gene            171429..182395
                     /gene="bA119B16.1"
     mRNA            join(171429..171489,179144..179327,182221..>182395)
                     /gene="bA119B16.1"
                     /product="dJ1009E24.8 (KIAA1271)"
                     /note="match: cDNAs: Em:AK023799 Em:AB033097
                     match: ESTs: Em:AU140492 Em:BF204863"
                     /evidence=not_experimental
     repeat_region   172691..172984
                     /note="AluSx repeat: matches 1..295 of consensus"
     repeat_region   173297..173793
                     /note="L2 repeat: matches 1736..2191 of consensus"
     repeat_region   173794..174079
                     /note="AluSq repeat: matches 12..302 of consensus"
     repeat_region   174080..174463
                     /note="L2 repeat: matches 2191..2687 of consensus"
     repeat_region   174583..174889
                     /note="AluY repeat: matches 1..310 of consensus"
     repeat_region   174902..175190
                     /note="AluSp repeat: matches 2..298 of consensus"
     repeat_region   175191..175491
                     /note="AluY repeat: matches 1..301 of consensus"
     repeat_region   175519..175624
                     /note="AluSx repeat: matches 30..135 of consensus"
     repeat_region   175625..175942
                     /note="AluSp repeat: matches 1..312 of consensus"
     repeat_region   175943..176111
                     /note="AluSx repeat: matches 135..304 of consensus"
     repeat_region   176260..176560
                     /note="AluSx repeat: matches 1..299 of consensus"
     repeat_region   176577..176879
                     /note="AluY repeat: matches 1..303 of consensus"
     repeat_region   177277..177385
                     /note="FLAM_A repeat: matches 30..138 of consensus"
     repeat_region   177421..177722
                     /note="AluSg1 repeat: matches 3..302 of consensus"
     misc_feature    177660..178067
                     /gene="bA119B16.1"
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   177791..178089
                     /note="AluY repeat: matches 1..299 of consensus"
     repeat_region   178098..178374
                     /note="AluJo repeat: matches 1..268 of consensus"
     repeat_region   178375..178683
                     /note="AluSg repeat: matches 1..297 of consensus"
     repeat_region   178684..178725
                     /note="AluJo repeat: matches 268..310 of consensus"
     repeat_region   178727..179031
                     /note="AluSx repeat: matches 1..303 of consensus"
     misc_feature    complement(178764..178905)
                     /note="match: GSS: Em:AQ715943"
     CDS             join(179211..179327,182221..>182395)
                     /gene="bA119B16.1"
                     /note="Continues in Em:AL353194 as bA119B16.1
                     match: proteins: Tr:Q9ULE9"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.8 (KIAA1271)"
                     /protein_id="CAC17553.1"
                     /db_xref="GI:11493375"
                     /translation="MPFAEDKTYKYICRNFSNFCNVDVVEILPYLPCLTARDQDRLRA
                     TCTLSGNRDTLWHLFNTLQRRPGWVEYFIAALRGCELVDLADEVASVYQSYQP"
     repeat_region   179473..179770
                     /note="AluSc repeat: matches 5..302 of consensus"
     repeat_region   180351..180582
                     /note="AluSg/x repeat: matches 80..306 of consensus"
     repeat_region   180583..180643
                     /note="Alu repeat: matches 236..296 of consensus"
     repeat_region   180644..180938
                     /note="AluY repeat: matches 2..296 of consensus"
     repeat_region   180940..181228
                     /note="AluSx repeat: matches 1..302 of consensus"
     misc_feature    181311..182190
                     /gene="bA119B16.1"
                     /note="match: GSS: Em:AQ739964"
     repeat_region   181338..181651
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   181829..182002
                     /note="MIR repeat: matches 29..234 of consensus"
     repeat_region   182506..182728
                     /note="MIR repeat: matches 30..250 of consensus"
     repeat_region   182984..183283
                     /note="AluY repeat: matches 1..310 of consensus"
     repeat_region   183375..183675
                     /note="AluJo repeat: matches 1..311 of consensus"
     repeat_region   183812..184114
                     /note="AluSg repeat: matches 1..298 of consensus"
     repeat_region   184411..184702
                     /note="AluY repeat: matches 1..291 of consensus"
     repeat_region   184819..184938
                     /note="AluSg/x repeat: matches 176..295 of consensus"
     repeat_region   184951..185260
                     /note="AluSp repeat: matches 1..310 of consensus"
     repeat_region   185261..185560
                     /note="AluSg repeat: matches 1..300 of consensus"
     repeat_region   185570..185820
                     /note="AluY repeat: matches 59..308 of consensus"
BASE COUNT    43611 a  49688 c  48794 g  43727 t
ORIGIN      
        1 gatccacagg atcgggaggg gaggagtcag gagacactgc cgaagaatgg gacttggagt
       61 tggggaaatg cggtgacctc ccccagttcc cctgcctgct gccctccttt gttgggcatc
      121 tggtcgaccc tcttgccccc acctgcccta gatccttgaa atattttcct cagacttcta
      181 gaccccacat acctcccacc tgtccttcag tgattgatgc tcaccccctg cctccagaga
      241 aaacagaatc gccacctgcc cacgctgctt ccaccctccc tgccttctcc acacccactc
      301 cggtaatgat tccatcttca ggctccatct caacaggatc tttcccacac ggatggatca
      361 atcataagtc aatctgtctt ctttaaagaa aatccttaac ccaacctcac cttggcctca
      421 ttacctccag accacccgct aatgatggct gcttcccccc tcccaggcat tccaccacct
      481 gccccagctc tgccccctac ccctgcccca cacacacccg ccaccctagg aggtaggtga
      541 tgtgaccacc ccattgaaag ggtagggacg tcgggaaaat atggttgggc acagtggaac
      601 tagagtttgt tccctgtcca tccgactcca cgagggagaa taaaatacgt gtcaagtgct
      661 cagaacagcg cctgcggtca agcactcagt aggtgatata tactgataac ataatctggg
      721 tggttttaag agcctgcgct ccagcccgga cacccacccc acccagccca aggcaccttc
      781 ctgagcacag gtctgcctgt cccttcccca gctcaaaatc tttggctctg gatggtccag
      841 gacactgagc accaaaatgg ccttctgtga tctggccctc ctgggactct ccaaattcat
      901 tcccccctgc tcccctccct gtagatggag accttccagg caggtcagac aaactgctgc
      961 ccccaagtgt agccactgca tctttttctt ttttcttttc tttctttttt tttttctttt
     1021 tctttttttt ctcttttttt gagacggagt ctcactcttt gcccaggccg gagtggtgca
     1081 gtggtgtgat cttggctcac cacaacctct acctccgggg ttcaagcgat tctcccgtct
     1141 cagcccccta agtagctggg attacaggcg ccgccaccac gcctggctaa ttttttgtat
     1201 ttttagtaga gacggggttt caccatgttg gcaaggctgg tctcaaactc ctgacctcag
     1261 gtgagccgcc cgcctcggcc tcccaaagcc accgcatctt ggtccctgcc attcccttag
     1321 cctggggtgc cggctcatct tttccctcta ggatttcttt agactcagca tatcttgcaa
     1381 atgtccacta ggtggtgctc actcatcgcc agcagggagc taacaagccg ctcctggggt
     1441 tgggagggcg gaggtgcccc acagcggggc tgacagcctc agcggtcctc ttcagcctcc
     1501 agggagccaa ccacaggcct gcgtgactct ccctgtcatc tgcaccctct ctggggtcct
     1561 ctgcccatcc agccacccgc acagatctgt gtcagtccct gccccccaac actgatcccc
     1621 tcctcccagc cctaccccag cctggcactc actggttctt ctccagctca agcaggagct
     1681 cctggccttc agcctccagg gccaccagcc ccatgtctgg cttcgagacc tgggcaagaa
     1741 aatgtgtgga gctgagatgg tggcctccag gcctcctgcc tgccagggag taggtggcct
     1801 gtggagccgg ctggggagga agttcttggg gagaacgtgg gctggggagt cagcaggacc
     1861 ccccacatac tatggagggc gtggaggagg tgagaacata caaagatgtt cccaaactca
     1921 ggatgtttgc agtcctgaca acagccactt ggaagggcgt tggcacagcc tgccaggcac
     1981 accagcatcc tccctagaga ccagaggtcc cagaaaggtg cccctcccct ggcccgccct
     2041 cttctttcat gcccagaagg ggcatcaaaa gcaggggaag acagaggggt gctgaggaca
     2101 ttatgggggc atcgggtagc catggtcagg gcctcctcag agcctctgct acctgaggct
     2161 tgtttccaaa tgagctgctg ctcattccct atagaattca aatttgactc ctccacttcc
     2221 aattttggca aactgctccc tcttccaaag ttttcctggg cctccagcag cccccgtccc
     2281 tccggctccg acacctgctt cactggaccc acgaagtaaa catggacgcc attccagcca
     2341 agagagcaca ctggctctca gctaggtgtc aggaggctgg cttggacggc cagccctctc
     2401 tccttccccc accctcttgg cgtctcccac cctgtgggaa caccccactt cccccttgtc
     2461 cactcagcct ggctgggggc ccagagttgg agccggccca ggagcttcct gggaggctgc
     2521 tgcgccttcg gaatgtttaa cccccgactc cttttctcca aaaatgcact ggcctggggc
     2581 cctgtccaag ggtctcagag tctttggagg gagttcttcc ttcgcaagtg gggagcagat
     2641 ggtccttgcc tccctggcca caggccccac aaggcctcca gcatgagctc atgaggctgg
     2701 aatgccactt gctttattgg ggaaaggtct gcaccgggaa aaaggccata ctcgaggtcc
     2761 ctgttcctct gcagcccctg ctatctttac tcttgccctc ctggtaccct gccccttgat
     2821 atatacccct catcttgaaa tgtgagtgtt tcctgccttt tggaggggac acctagcctc
     2881 tactctttct tctgtaccat cttggcaggc ttcctggggg caggggccca ccggtggggg
     2941 aagcagagcc cctttggggc tctcctcttg gtcacagccc aggccagaca gacagggagg
     3001 cccagaggca gagtgacccc agtgtgtgtc cagccttccc ctcctgggga tggggagggc
     3061 aatctcaaag ctcaggccag tgccgtgctt gaccagtgga atgggggcct tatgggccta
     3121 ggggatccca gtgagggccc tgggttggga gctgctgggt ctctgggggc ctctcagcct
     3181 tcatggcaat gctcccctgc cttccctctt gctggatttg gacagtaggg ctgaaaattc
     3241 caaacaaaga gggctctcta ggaggggcag gggtgtagcc aatggtttaa aatcgttcag
     3301 accttagtgg gtctcaggct cccagcctaa agagctgtgt gaccatggac aatttcccca
     3361 agctctctgg gcttccgttt gcccctctgt aaaatgagca tatcaaggct actgccctct
     3421 tagtttgcag cacagatatt atggcacaaa cagatggggc atggttattc tggaagcgtg
     3481 tgaagagcgg gattgggaag aggctggggc agagcgtcct gcagaagaag cacatggggt
     3541 ggtcttacat ctgggggaca tcaggagagt gaccactgcc cccccgatac cagaagtgga
     3601 ttccacagga gccagtgagg ctgaaggttc aggccttcgt ggcagggccc tgagagggac
     3661 agcagtgtgt ccacagggtc acatgttctg gtcaactttg caaaaggttt tctttttggt
     3721 gctttttttt tttttttttt agaggctcct gaaaagcttc aggacccaca aactctggac
     3781 ccatttctgc ctggtggggg tgggggtggc ccagatcatc cagggaggga gggaaagagg
     3841 gaggtggggt ggagaaagct gaaatgactt ccatgtgtgc gggctcacga gatccagatg
     3901 tccaaacccc agtgccttct tctgcccact tgaggggcag gggaggcagg ggcctatagg
     3961 agtagtgact tggtggttct ggggacccca gcaaaactag aagctgtaat gtagggagag
     4021 acaaaagggc tgggaggttc agggcccctg tggagggcgg ggagacatgg cactgaccgg
     4081 ctcctccagg ctgacggtgc gccagggttg tccatccagg acccagtgcg gggtgactgg
     4141 ctgcccaggg atatgtcctg gagtaaagac agagcacagg gtgaggggga cctgaggaac
     4201 acaggggcat gggacaaagc agagggaggg gggtagagga catccccagg gaggcactgg
     4261 aggccttttg gggcagactt caccttcaac acgcgtgggc tcagcctgga gaaggaggga
     4321 cgcccgtggg catccttgga tctgaggagc tatcaaggag gaggaaaaga gaaggctgga
     4381 aagggacagc tcagctgggg acacgggagt cccctgacct ttgtcggggg gcaggcttgg
     4441 gctcggcgat cacaaggaag aggccaaggc cgccagtgca gaggggaagg aaagagcgcg
     4501 gcagccttag ggatttttag atgggcagca gatgccttta gggtgagaga tgtacgaaga
     4561 gaggacactt gtgccccccc catcatctga gaaaaacaac agccagatgt tgccttgcga
     4621 ggtccacctt gcccagagct ccctcgggga ctctgtcctg gtggcagggt tttggtaccc
     4681 tggcccagaa ggcccctcct catctcttca aggggagggg acgcttctgg acggagcctt
     4741 ggtgctccct ggccgggtgt gcctaagggg gctctaggag gaatcccaga gccaagcatt
     4801 actcagaggg cgcctggaat gttcccctgg aatgctccca gccctccact ggccccaacc
     4861 actctcacag gcccgccctg caggagccag gccccaggca cccagagcct gcagcagccc
     4921 tccttccccc ggtacccagt cccagctccc agaacagaca gcctcccccc tccacgcagc
     4981 cctggcctca gtcctgctgg gctgatggct gcctgtggaa gtgactcagc tcctgctagg
     5041 ccaccccaac tccttttttc tcctccacct tctctcccag actacaaaca tcaaagaccc
     5101 ttcctccaag aagccctcct tgattggatg agtgaattgc catcaggcag atgagggccg
     5161 agaggagtct gccaccttgg aaaggaggct agaggggcca gtgcagggag ggctctgagt
     5221 ggatgtgggg gaggggaagg aggggaggtc tctcagccca gagagcactt aactgagagt
     5281 agagaaccaa gctttgctgc tcctaggcct ctaagggttt ggggaagagg tagggtgggc
     5341 ccgggcacag gtgtggtgtg ggtgcagtgt ggtgtgtggg tgctgtccac atggccttgc
     5401 gcgcacgtgc tggccacggg caccctgacc ccaatgaggg agagaggggc agagctggag
     5461 ctggagctgg agctccggtg accgggtgaa tgggggtgga acccgaggga gccaggctgg
     5521 tattgggcac atagacgccc ctctcccagg ggtcccatca cctcccctga ccccaggata
     5581 gggctcagag gggagggagc agtggaccgc ctggggccct cccctggggc cagaacagac
     5641 caggcccctg tacctgtttg gtccccacac agtgctgtgg aagccaccgc ccagtctgca
     5701 tagcacagcc cagccccgca tgccccctcc ctggttgccc tccctgttcc cggccaggca
     5761 cttgctgtgc aggactggct aatcctccca cccgcttgca gaggttgttc cagccccatc
     5821 ttaacatctt tgtttggagg ggttaccccg aggagacagc tgcagtcttc ccagagcact
     5881 gctaaacaga caccttctat ctggagaggc ccttctctat ctcaccaaac aaggcaacaa
     5941 tataaacaac atacacactg ccctgctgcc ctgggaggag ggacgagggg tgagcagggt
     6001 ggaggccaca gctagttctg cagcctgaga gcaaagcagg gactctgggg gactcttggg
     6061 catgggggct tcctagagga cggagccccg ctgagtccta aggggtggag gagcaggagc
     6121 gggtcacacg gtggcctgcg gatggaagct ggttgtgaga gcgagaatcc aggaagaggg
     6181 ggctacggct atgggctggg ggctgggctg cccccgaggg ggagggagcc atgccctctg
     6241 ctttgccagc ggagtggcag ccgggcagtg tgggcaagtc cgggcccggg gccagcccaa
     6301 gcacacttga gcgtccctgg gcaggtccca cggagacccc cccaaagagt ccccacgccc
     6361 tgacctactg gccgtatggt gccggggccg tgagaccctc cgcgcgctga cccgagctct
     6421 gagcagaacc catccccgcc accaccaccg cgcctagcct gcccctcagg gcgcaccccg
     6481 cccgcgtcct caccttgaag caccccggcg cctggcactg gccagagcag cagcagtagt
     6541 agcagcagca gcaacggggt cccccgagct ctccggggcc tccagcccat agctgtgagc
     6601 tcctcggcct ctaggcagcg gctcgcaact ccggctccgc ccaggctgga ttgcggccga
     6661 cccgtgcccg gtgcagcctc aggccgccgc cttcggacct tcccgccccc acctcccacc
     6721 gcccgccctc gctcccgcct cccctccccg ccaaccccgc tcggagcctg gccaggggcc
     6781 ccgacggcgc gcgccatggg ggagccgggt cgccactccc ggaccgccgc ccctcgaggg
     6841 ggtggagctg ggcggaggag ggaatccgtg cggcccctcg gatgaccggc ccgagccgtc
     6901 cctccccgtc ggtctcagag ggcctctact cctgagagga ggagagaacc gctgggaagg
     6961 ttcttggagg accgcggcgt ggtgggatga ggcggtgggc aaaggccgcc tctcgctgct
     7021 gaagttggcc ccaggagcgc gatcttccgt ggtctcctgg ggccgatctc tgtcccctcc
     7081 ttgctacccg tcctgccccg agggtgccct ggcggaggtt gagtcgggtc atccacctgc
     7141 actgggtgcc cccaaggata ggaaggttca ggcaaccggc tgccgctgtc ttgggggctt
     7201 cattgctggg caaaggcgat gcagcagacg gagacaacct ttcttccctg gcggtggcca
     7261 gagggcagaa ttgcataaaa gctgcagact cccaggcctg ggagaccctt tcggcctcag
     7321 taacatctgt ttcatgtttt aaacttttgt tttcctactc ggtgcaaatt tggatgagat
     7381 gttaactttt tttttttttt tttttttgag atggagtctc cctctgtcgc caggctggag
     7441 tgcagcggcg cgatcttggc tcactgcaac ctccgactcc ctggttcaag cgattctcct
     7501 gcctcagcct cccgagtagc tgggactaca ggcgcgcgct accaccccca gctaactttt
     7561 gtatttttag cagagacgag gtttcaccat tttggccagg atggtctcaa tctcctgatc
     7621 tcgtgatcca cccgcctcgg cctctcaaag cgctgggatt acaggcatga gccaccgcgc
     7681 ccggccggag atgttaactt ttaagcaaat cttttttttt tttttttttt tttgagacag
     7741 agtttctctc ttgttaccca gactggagtg caatggcatg atctgggctc actgcaacct
     7801 ctgcctccca gattcaagtg attcttctgc ctcagcctcc cgagtagctg ggattacagg
     7861 cattcgccac cacgcctggc taattttgta tttttagtag agatggggtt tctccatgtt
     7921 ggtcaggctg gtctcgaact cccgacctca ggtgatctgc ccgcctcggc ctcccaaagc
     7981 gctggaatta caggcgtgag acaccgcacc cagcctactt ttaagtaaat ctatttgttt
     8041 ttgagaattt ggaatgtagt aatttggtta gtgaaagttc gagcagtgag agaaacctac
     8101 attcacatat ctcaaaatca aaaagtacag aaagcatagg gaaaagtctc cgtgctctta
     8161 gccctcctca ccaacaggaa accaatatga ttagtttctt tcataggctt ttagattatt
     8221 ttttcacact caagacaata cagacatatt tttttctctt attaacgttt ttctgcactt
     8281 tgattttctt tttttttttt ttggtcgctt aatacacctt agatatcagt gcgtttagag
     8341 ggtccttgtt gttcttatga ttattattta gagacagggt ctcactctgt cacccacgct
     8401 agaggacagt ggcctgatca tgcctcattg cagccttgaa atcctgggct caaggtatcc
     8461 tcccacctca gcctcctgag tagctggaac tacaggcaca cggcaccagg cccagctaaa
     8521 atttttaatt tttctgtaga caggggggtc tcactttgtt tcccaggctg gtctcaaact
     8581 cctggtcttg gccaggcgca gtgtctcatg cctgtaatcc cagcactttg ggaggccgag
     8641 gcgggcagat cactggaggt caggagttca agaccagtct ggccaacatg gtgaaacccc
     8701 atctctacta aaaatacaaa aattagccgg gcatggtggt gagtgcctgt agttccagct
     8761 acttgggagg ctgaggcagg aaaatcgctt gaactcagaa ggtggaggtt gcagcgagcc
     8821 gagatcatgc cattgcactc cagcctgggc aacaagagcg aaactccgtc tcaaaaaata
     8881 aaaataaaaa taaaaagaac tcctgatctt aagtgatcct cctgcctcag cttctcaaat
     8941 cgctggaatt acaggagtga gtcaccacag ctgtccagct acgagattat tacttattat
     9001 tactactttg gattttcaaa tcaacttcat taaggtataa tttacacaca ataaaatgca
     9061 cttattttaa gtggccagta agatgagttt cgataagtgt atataactac ataagcatca
     9121 ctataatgca gacacattcc ctcactcaca gaaagagccc tgtgcccttc cagccaaact
     9181 tgcccactcc caaccccaga cagccactga tctgttgttc tctgtctata gataagtttt
     9241 gcctgttcta gaatttcata taaatggaat catgcggcat gcactcttct gtgtctggct
     9301 tccttccctc tttccgatgt ttttgagatt catttacact attttgcata tcaatagttt
     9361 gttccttcgt attgctgaat agtgttcggt ggtttgaggg aaccacagtt tctctactca
     9421 ccagtgcacc atagggttat tttccagtta ggggctctta taattggaac tatatttgca
     9481 cagagagaga gagaggaaga aagagggaga gagatattta ttatagcaat tggctcacgt
     9541 gattatggag gccaaaaagt tcccgaatct gccatctgca agctggagaa cgaggaaagc
     9601 cagtggtgtg attcagtttg agttcaaagg cctgagaacc aggagcacca gtatggaggt
     9661 ggctcgagct cagaacaagt tggggacagg aaagcagagc agcgccccag agcagcccct
     9721 cagcgacacc tcttcagtaa agcaaggctg aacacagagg ggctggcttc agtgtggatg
     9781 tcaggtacag aaggcagctc gaggagctac tctggcgttc ttgcttactg gtattcttac
     9841 ctcgaactgg ccaactccta cttaaactgc aggccatggc tttaatgtcc tgtcattcag
     9901 aggctgtccc ttacccaaag ccaggttagc atcccctgac tgacacttct ccctgcaaca
     9961 cgtttcagaa ggccctgtag tcgtccactt ccctgtctct ctccccaagc tcctgagctc
    10021 catgtggtct gggaatatgt gtgttgctca cttcctagca cagtcagtgc taataactga
    10081 ctgtagaggg gacacagtcg aaaagccaca tggggatcag agtcatcctt acacagttga
    10141 cacctcccaa acccagatga gctgtgtcca agtgcaggtc agaggaattt tctgccgaag
    10201 tctctgagaa agggtttatt tacattttga ggttgcaggg gaggagatga ggccatcaaa
    10261 ccaaagctga ggaagaggga tcctaggatg caccgagcag ctccgggggc gcctgacagc
    10321 acctgggaaa gatggcttct ccactggctt gttggcgtca ccctccagag gggcatcagg
    10381 aaatgtcctg ggaaccaggc aaaccagtga gcattaaccc ttaggagtgc ttggcatggg
    10441 tgacacccac catctgtaaa cacgacttct cccaaggagt gacgcagaac aggatgtctg
    10501 agggaggcac tccgactcca gccttcagag atcgccaggg tggcacctgg tgacgacagg
    10561 ctgatgcttg ggtgccccgg aaaaagtcat gtgtgtgaat gggggcccca aagccaacgc
    10621 ttcatccctg acagcctggt gcatttagag gggaactttt tgtcccttgg caaggtgggt
    10681 ggaatttcag gttcataggg caagggtatt ttagctttaa tagatattgt caaacagttt
    10741 tccaaagtca ttgtacacac tctgtgattc tacttaagta aagtttaaaa acaggcaaat
    10801 caaatctatg gtgttcgaag tcaagacagt agttaccctt gtgggggctg caactggtac
    10861 agagtgtaag gggggactgt aggatggtct atttcttgat ctgggtgtgt tcgcttttgg
    10921 aaaagtcctt gagttgcatt tataatgtgt gaacttttct gtatgttaca ctttaattga
    10981 atgtacaaaa agtctcagga ggcctcagac cactggaagc ggacacaact aacccctctg
    11041 agagcctcca atccaagatg gacatatgtc cccttggaag tatgcagaag caggtgaaga
    11101 ctcctaagcc ggatattccc aaatcccccc agtagctgca gcttcagcag ctgcttatgg
    11161 tcctccctac accctctctt ccccagacag cccccaaaca tctggctgca tttgacttgc
    11221 tctctccctg tcccacctct ggatttagtc catgttctcc accctcccca ctgtcagcaa
    11281 tgtagacaag acaaacgctt agttcacgtg cccacctact gcgtgccatg cacggggctg
    11341 gtcattgtgg gtggcaaatg tgagcaacac acgaagcctc aaggagcaga aagggacaca
    11401 aatcacttca gcgtaaggta atttgtgata aatgtcatgt aacttgcagc ccctggcccc
    11461 ctcctacaga tggtgtctaa gaataaaccc cactaacatg tgactcctct gttctagccc
    11521 agctgtttgg gttgcaagaa agagactcac tccagttgcg tcataggatg gagttttatt
    11581 gggaggacat tctggacggg ctcccagcaa gagtctggca atggagcatg aaaatgaatg
    11641 agcctggaat gagggagggt gcggcctcgg ctacacaaag ttcacccgcc cccaactgcc
    11701 tcccagcgtg ttagctcctg tggcaactcc ccacttctct ctctgttttt caacccaaat
    11761 cctagagcag agggctcttc gctcctcaca ccatctccac caggctgcag ggggagtcac
    11821 tagcccactc aagagctctc tcctgttggt ccctgattcg caggggcacg tcattccttc
    11881 ttggtaactc attctcttta acagcccgac acagggtctc caggaaaagg acaggagctc
    11941 acacagtcat gaacagagat gcatctggag caggaataag gatgctgaca acatctgtgt
    12001 ctgccctcca ctcttgtaag ggccagtaag gtagtaccca gtccctgggg gtaggggggg
    12061 tggtggtggg aatgagggca tctcttcttt caggggccaa atctggcaca aggatgtctg
    12121 cttaggcaac tccactgcct ggcatgctct ctccctagga aaacaccaaa ccttttttat
    12181 ttcctcagtc ttagtgtgag agtgatcacc tcttccagga agcccacaac cagaatggcc
    12241 aggggcacac cctggccctc agtagatggg cactgagaca aaacatggcc tgctggtggc
    12301 attcccaaca atgtcaaaag tctcagggat gagctcacag ttgggagtcc tttggaagtc
    12361 caattccaaa tgtcctctgg actcaaaata aaaactacat ttcccagtct cccttgtagc
    12421 ttgcagtagc catgtgacaa tactccagcc aatgggaagt gagtggaaaa gtcctgtgca
    12481 gcttctgagt tgcgtccacc atgggcatgg gtgtgcttct acctccactc ttccttcttc
    12541 tggctggagg gcatgtggac aaagcggggc catggtgagc ctcacaatca aaggcatcat
    12601 tttagggata gagaaaagga agatagaagg atcctgagcc ccaacatcat aaaccatcgt
    12661 aacagccaga ctagcgtggg agagaaaaat aaaattctat catgttgaag tcactgtatt
    12721 tggtctcttg ttagagcagg ctgactgata ctctgttgaa tacagaatgc cttggtgagc
    12781 ctctgaggat gcaggatgct gcaatccaac caaagtccag ggtgactttc gaggagaagg
    12841 atgtgaaagg gcagggcttc ctttgaggga gaaatggaga gaagggagga gatccagaga
    12901 agagagagcg agatcagctc agtgctcttc agtgtcctcc agggtcaaca ccctcctggg
    12961 cagacctctc accacccatt tttggggggg caagaggctt ggggccaggt cataaaaagt
    13021 cagatgtcac agagctgttt tcgtagaggc gggcaggact caacactgcc tcattcacat
    13081 tcataggctg gagtcatcac agattctggc cactttctcc tccggagggc aggcaacacc
    13141 actggtcagc ccaggggtgg ggcacaggtt gaggtctcac atgtggctgc atcaggatca
    13201 atgagcttcc cacagagaaa accctggtca caggaggcct ctgggagcca gggcaggccc
    13261 ccaggccccc agcctcaggt ggggtcccaa gaggattggt gggagtgtgc atggctggaa
    13321 gtgtgacttg gagccctggc ctaccaggtg aatcagctgc agaagggaca caccacaggg
    13381 gaggagaaac tgactttcac ttctgcccag gtgacgggca ggcttgggag tggggaagag
    13441 gcctcagaac aaaggtgggg gagtcagaac atacgtgtct tggaaatgga ggggatgctg
    13501 ccaagtccag aaagtctccc ctagtcctct aagccagccg ccagctgaca cagggattcc
    13561 ctctgccccg agcagaacag ggctctccta tctcctgagg ggctcctggt tccctgcaca
    13621 ctctccccaa cctccccttg gtcccaggca ccatcctcta aatgacagca gtctccagat
    13681 gccagttggc actgtgggtt agctgggtat tatatctgcc cgtgtgtgtt gaatggatac
    13741 ctgcgtggtc tctttctgaa aagccatctc caccgaattc tcgcccatgc tctgcttaca
    13801 aacacgcctc cttctgcaag gcccggagtg gactcagaga gcctctcccc tccccacctc
    13861 ctgcccccat tcctctcccg ctccccacag gctggctcca gaccacaaga gcgtgggaac
    13921 ctcatggaga gtgggtgctt cctctgtctt cctccctagc ccttgcttta gcacagccag
    13981 ggcagagagg aggaaagaag gaaactacag caaggaggat ttagggacaa ttctagaagg
    14041 gatttccaat ttggaggggc agtctgggaa gtaggcagcc tgtgaactta ataggaggtc
    14101 ctaaaggacc tgggggaatt tggggagaag acgaggtggg ctgccttacc ccaatcttct
    14161 ggcccactcc cagctccaga cccacctcca ggtgtagcag gcccccaggc ccaacagcag
    14221 gagcaggagg cccaccagca gtcccaggac ccagagcagc tgctggaact gatgcaggcg
    14281 gtgcagggct ggaacacaga gcgggactca gagcagccac agctgcaggc ccccagtgct
    14341 tccttccaag gggacccacc caagatgaca ccttctaact tttgctttat cgctgaagct
    14401 ctccacacaa gggtgccccg agcaatcagt ttagagtcga caggcaaatc atgctgcagc
    14461 aggagggata cagggaagga agcctttggg cctaagagct gggcttttgt gcagagtggc
    14521 ctgggagatg gagccccctc ccctcacctc tgaccccaaa gtaggtggag gtagaggcag
    14581 agcccaggac atttgaggca gaacagatgt attccccttg gtcgctgggc ctcagcgcct
    14641 cgatgtccac cctcagggca ttgggggaag ccaggacatg gatgtgtggc tctgcaggag
    14701 caccctgggg ctgactggag gccaccagtc gactgccaag gtggagagtc aggctggcga
    14761 gcggctcgct gtccactcgg caatccagga tgccccggag gccaccctca ggctccacga
    14821 agaccatcat ggtgggcgtc ttgggagggt ctgtggggag gaagggagtg tggtgattgc
    14881 agacaatgac cacaaatttc tccctccctg tacccatgca cagtgacttt gctgctcctc
    14941 ccattaagag gcagaaccta tttcctcact ccttgaatct gtgactcagt ttgaccaatt
    15001 gaagaaagaa gtgatgttgt gcaacttcag agccagacct caagaggcct tgtagcttct
    15061 gctctacttg ttggaacaca acaatgtggg aagcccaggc cagcctgctg ggcacatgtg
    15121 gcccaactgg cagccaggag cgtgcagtca tctgagacca ctgggcaact gcagccacat
    15181 gaggtgtgaa caacagaaga actccccagc tgagcccagc ccacagaatt gagcagataa
    15241 catgactttt gtttgaagca gtaagttttg ttgtggtcaa tagcaatagc tgactgatac
    15301 agtgtaggcc ttggtgaagg ctggagtggg agaccaagac tgatgagggc ttatcgtgtg
    15361 ccagacacac tcagcacgct ttatttcttt tggttttttc tcttcttttc ttttcttttt
    15421 tttttttttt tcttgagaca gagtctcagt ctgtcaccca ggctggagtg cgttgtggtg
    15481 atcttggctc actgcaacct ccacctccca ggctcaagtg agtctcctgc ctcagcctcc
    15541 tgaatagctg ggattacagg tgtgtgcctt catgccgtgc taattcttgt atttttagta
    15601 gagacggagt ttcgccatat tgaccaggct agtcttgaac tcctgacctc aagtgatccg
    15661 cccacctcag cctctcaaag tgctgggatt acaggcataa gcccctgcac cagccattta
    15721 tttcttatac aacttgcagg atgaggcgaa tgatgtattc cccattttac aaggtcaccc
    15781 aaccagacag cgggagagcc aggattccaa cccaggactc aatatccgtg aagccttcac
    15841 tcttggcaga cagaagagga ggaagaactg aggtgggcgt gtgagatgtg gggagactaa
    15901 ggcggaggag ggggttcaca ctcacagagc acacggagca tgactggagc agaggcagca
    15961 gccccagtgg ggagctcagc caggcagtgg tacatcccag cttgagcacg agccacgtgg
    16021 gtgaaggcga gagtgggcac aggctccgcg tgcagccgcc ggtcattcca gaaccatgca
    16081 aaggtggagt tgcccacagg cccagggcca cccaggaggc ggcagctgag gttcagggca
    16141 gcgccctcag gcacgtccag gccaggctct gccaccacgc gtgcacctgc gggcggagga
    16201 tagagagatg attggggatc tgtaggcctt gggggctgga gcctctgggt gggacctctg
    16261 aggggcctct gcacattgtg ggaaggtctc agtctgactt ggtatagggc tttccaaggt
    16321 ggaaccatgc cccctccagt gggagctgtg ccccctccac cagattaggc ttctcaaggc
    16381 agagctttct cagaccagac agcccgctct agtgggagct ccagcccctc tcgtggactt
    16441 ggacctgaga gcatcagagc ccctcctcct cccctctctt ccactcacct tctacctgca
    16501 accgcccgat ggtgctgatt gagcccagca agttttgggc tgtgcaaaca taggtgtcat
    16561 cacctgcagg cacatcttgc acctgcagcc gtagggcgtt tcgggccacc tggacatggc
    16621 ctgtccctga tgccaagctg tggaccccgc tgctcgtggc cagcaccttg ccatcatgag
    16681 atagggccag ctcagcaggt ggctcactgt ccacagtgca ctgtatcaca gccatggatc
    16741 tggccctgga gtcccggaag gaggacagga cagcgtcctg aggggcatct gtgggcaggc
    16801 agggcacaga tggggacctt gcttaggcac cctgtactgc tcattgccta gctgcctggg
    16861 gttggtcagg ctgaggcctc tggaccaatg gccacttcct agaagtgaca ccttcccagg
    16921 agtcctgtgg gcagagtgct ccaggcaggg cttataggac tgtctctttc tctagcacct
    16981 atgtcctctt tccccaactg cccattcctg ggcccactgc agtcctttgc ccactcccac
    17041 cagagcccag aagcaccctc caccaccctc ctcctgggca gccctggcca aggaccctct
    17101 gctcacagag gacttgcagg gcagcaggac gggagctgcg ggtgccctgg gcatcctggg
    17161 cctggcaaga gtaggcgcct gcatgagccc gcgtggccac caggaatgag agtgaggcag
    17221 ctggaccctc ctgcagccaa cgaccgttgt ggtaccaagt atagagtgtg ggtgcgtggg
    17281 cagcagggtc cgcacaggtc actgtgatgg gggcaccttc aggcacggca gcctccggag
    17341 ccaggatcac ccgcacacct gtgccaagga ggccaggatc agccgggccc agccggacac
    17401 cacagtgggc ttccatccca gtgggcttgg cctaggccct gccttaccct ccagccgcag
    17461 ctccagggac gtgttggcct ggcccagagg gctgcgggca gagcagctgt agaaaccctc
    17521 atccctgggc tgtggccctc gcagctccag gcgcagggtg ttggggacag aggctgctgt
    17581 cgaggaggcc aagaggcgac cggcgtggct gagggccagc tgggcgggcg ggcggctgtc
    17641 cacagtgcac agtaccaggg ccagctgccc gccatggctc tccaggaggt aggtcaggcg
    17701 caggttgcgg ggcgcgtctg cagggcatga gaggcttaca cggagtcaca ggcagcagcc
    17761 tgcaggaggc cagtttgcag gtggcattca aactcaggag ggccctggct ccccacccct
    17821 atggctccca accacccagt ctgaggcact gctcctctgc ccagacagga ctcaccccag
    17881 cgccccctac tcttaatgct ccttgcccag tgctccacta gctactgggt accgagttcc
    17941 ccactggaca cctgtcgggt tccggccatg ccgtgatccc tgtgggcttc tgtcattcct
    18001 gtgagtccca ccctgcatcc agccagactc acagaggacg tccaaggtga taggtctgga
    18061 gaggcggggt gcccgaccag gggggcccac accgcagcgg taggaggtgg catccctgac
    18121 tgtgacgttg ggcaggggga tggagtgggc atccaggcgc tgctgcccat cctggtacca
    18181 tgtgtaggtg agctgggccg ggtgagtggt ccacacaagg caggtcaggt tcaccagctg
    18241 cccctcccgc acggtagccc cgggccacac ctgcacattc acagctgggg agagggaggg
    18301 cacaggacac cagtgagggt cttgaggctg tgtacagcag gccacctgtc tccatgtggg
    18361 tgagctactc ctactgctgg ggagcccctt ggcctttggg agccttgggt ggcccaccta
    18421 taaatgtggc ttagaagtca agcttgggaa tagcccactg gctcctgagt ggtcccaagt
    18481 agagttccac agagtcctgg aaagccctga gcccctccct gaccctggct gggggcaaat
    18541 ggggagacca cagacctccc ctcccctgct gcactttctc ctgaaattcc ctgtttagtt
    18601 ttatccactg ggcatacttg taaaatttgg tgtttataaa aggttctcct aagaaactca
    18661 agtctggagt ccaccatcgt taggaagggt gggaatgggg actattcccg tgcccccagg
    18721 cttctagaac cctgtcccac ctcctctcct cttttaaagg acacacacac acacacacac
    18781 acacacacac acacacacac actgaccttg agcgtcgaag tcagctgagg ccgaggcctg
    18841 gcccagggtg ttggaggcct cacagatata gacaccctca tcctccagca tagccccgtg
    18901 gatctccaga cgcagtgtgt tgggggccac agccacatgc agcctgggag agctgccttc
    18961 gggtcccccc acaccttgta gggtggaggc cacaaggcga tccccgtgga gcagccgcag
    19021 ctgggccgga gggtcactgt ccacacggca caggaggagg cccagtcgtc cagggcctgt
    19081 gtccatcagg gtagtgagtg tgacgtggcg tggggcatct acacagggtg gggagtggtc
    19141 aggacccagc caggggggtc cctcatccag ggctctgtca tgtgacacag cccaggactg
    19201 ggggactgca ttccttcccc cacacctgga tgggacaaaa gtccttacag gacacgtgga
    19261 ggctgatggg tgcagctagg ctcgtggtgg ctgagcctgg ggcctgggct tggcaatgat
    19321 aggccccagc ttgtgtcaaa gttatggctg caaagcggag cgtggccgag gtcgactcct
    19381 ggaggggctg gccatcccga taccaacgat atgaggtccc ctctgggact cctgtgtgta
    19441 cctggcagct caggaccaca gcctggccct cttggagctc aggtgatggt gacacctgga
    19501 cccaggctcc tgcaggggaa aaccaagagc aggtgagggc tctccaccac acctctcaca
    19561 gtctgggacc atgtgcgtgt ccacctagga ctacacaacc caaccccatg tgctggagcc
    19621 gcagcccctt ccagaccacc gccagggccc actcagccct atgccttggc ccctcctgct
    19681 ccccactctc ctgcacaggc tccatcctgg catttacctc ccaccacctg gtgggatggt
    19741 tttactgtgt cagtcatatc tcgcctacca aagcgtaagc cttatgggca ctggacttgt
    19801 gtgacctgta accccagttc caggcactgt gcctgcatgt gttagctcct gacaaatgtt
    19861 ggttcaaccc ataaagtact gaaaagggag ggatttagat catcccgggg ccattgtcct
    19921 aactaggatc tggactcagc atgggtgacc gagggcttgg aagggaatgt tagcagctat
    19981 gttatcttga gcgctctcgc caggccagcc ctgcatccag actccaggcc acaagagcac
    20041 aggacctggg gaccaggcaa agggcagctc ctcagtggcc cctgggtgtg aagcaggaga
    20101 accccaggtt gcagggagtg agatggaggg acagctggaa tgctaagcaa gaacacagac
    20161 catgcctgag accacagcta gcagggtcat gcaactgggg aaaagctcct aaaactcaaa
    20221 ccttcaattt cttcctgcga aataggtatg ctaagtaaga aaagatacac agaaccaagg
    20281 ctcaaagtaa atgtcccaaa attggtgctg tcggtcaccg tggtggaaga aacactcatg
    20341 ttcctttgac cttcgcatta tgggcattat ctttcctttt tcctggttta tttgtttagg
    20401 gagaattttt ttttttttaa gacagagtct cactctgtca tccaggctag agtgaagtac
    20461 aattttggct cactgcaacc tccatctccc aggttcaaac gattcgcctg ccttagcctc
    20521 ccgagtaact gggattacaa atgcccacca ccacacctag ctaatttttg tatttttagt
    20581 agagtgttgg ccaggctggt cttgaactcc tgacctcaag tgatccgccc gcctcagcct
    20641 cccaaagtgc tgggattaca ggcatgagcc atggtgcccg gcctgaggga ggatttttat
    20701 agagactata tatatacaca cacacaaaca cacacacaca cacacacaca cacgtatgtg
    20761 tatataaata aagaatatga atatattatg cataagaata cacataatgt atatttgtat
    20821 gtgtatatac tcatacataa acatatatgc ataatataaa tatacatatg tatgagaatg
    20881 tttactatgt aacaaaatta tatgatcaca aaacaagtta tttcagaaag tgctcccacc
    20941 aagcaagcca ctctgtgaat tgccattgcc tcctagtggc tgctgcagtt attacaggtg
    21001 aaaatatcta gctggaggca aagggaggcc ttctgctggc tggaaactgc ccttaccagg
    21061 actgaaaaag acatatttct gcacaaaatt cacagcaccc agcccaaagt ctgaccagag
    21121 tagtccccag ggcatcagag tctaccatat cttcagaaga ctgacactca cctcggacct
    21181 ggaagaagag tgaggtgttg gatgatccaa gaacatttgt ggcctcacag cggtagctgc
    21241 cagagtcccc aaggcccagt tctcggacct ctaacttcag ggagttggcc tcagctttag
    21301 cctggaaccg accatgggat gggacctggg gacccaggct ggtggccagg aggtgctccc
    21361 catggaacaa ggccagcaag gccagggggc ggctgtccac agtgcagatg aacagagcca
    21421 tgtggccctg gcccatgtct aggagggctg acagctttgg acggtccggg ggatctgcag
    21481 gaacagaggg agctgaggcc acccagccta gctcaccaca ttacaactgc cacactatgt
    21541 cccccacctc ctgccacaca cacagcctaa gccatgctct ttcctcccca aaccccagcc
    21601 cggcccctgc ttctccccag accacccctc caccctccaa gtccccaggc tgcccatgcc
    21661 agtcacccac agagtacact caggagcacg ggagtggaga gctgggcacc agcctcagtc
    21721 aggatgcggc aggcgtaaag ggcagcatca gttctggcca cgggcagcag tgtcacggtc
    21781 tccaggggac cctgggccca cagcacccca tttcggaacc aggagaagtt agcagggctg
    21841 ccagcagctt cccggctcac gttgcaagtc aagttggctt ctgtgccctc ctgaagtgtg
    21901 tgtgatggtg caatggccag gacagtggct ggagagcagg cggcacagct tactgaccac
    21961 ccccggcccc caggagccag gggtccagcc gcccagcctg gaaggagcag gaaggaggcc
    22021 ccctggggtg cccacagggt tggggagaaa agcaagctag ctcacagggc agcctaggag
    22081 aagtgggccc tggggactgt gggggccctg ggcagagagg gttgcagtac agggaaggag
    22141 gacaaggcag cccaactgtc ccagctgggg agtccttcct cagagaccag ctgtgttgcc
    22201 tatccccggc ctgaacagag atcatgggaa ggagcagctc cgctcttcct gaatgtggag
    22261 gagggcagcg aagggccccc actgctgcac agagtggttt tgcctgatta gatcctcctc
    22321 ggagcagaac agtccagagc tccagcccct gcccccaggc cacccattcc ctgcctgact
    22381 caccctggcc attgaaggtg gctgaggtgg aggcgttgcc cagggcattg ctggcctcac
    22441 agaggtacaa gccctcctct tccagcaaag ggttgtgaat ctccacacgc agcaagttgg
    22501 gggctttggt gaccttcatg cgtggggaac agcccccaca ggtgctgcag ccaccccctg
    22561 atggcaggga agtggccaca acacggtcct tgtggagcag ctgcagcctg gcgggggggt
    22621 cgctgtccac acggcacaaa aggaggcctc gccgtccagc cccggcccca gcggcatcaa
    22681 ggtccagcct ggtggtgaat gttggttgtc gaggggggtc tgcagggagg aagaacatgg
    22741 gcactcatcc cacggatgct ccagggcccc acaagcctgg ctaggctccc agaatacact
    22801 ggacaaaggc agatcccgaa ggctgcccca agcagctgtg cagactgtgc actgcacaag
    22861 ggcagccaga ccagggtggg tggaactcaa gctaggctca tgctcctcaa gctaagctgt
    22921 gccctgatgc ctactgagtc ttcccagaag aaggagcacc attttctagc tcagacaaag
    22981 gtgctgtatg ggctggctgc caccttgcat aagtcccatc tgctgctgag cagggtgccc
    23041 acctagcaat ccctgggggt catgctgtct gctcccaccc tgcacccaat gccttaccac
    23101 ggagcctccc ctcaagatct gccctgcagc ttatctgtga tctccaagcc agctcctgtg
    23161 atactagcga atgcatagaa tggcctcttc ctgaaggctt ctggagacag ttggccaaaa
    23221 gagagcaatc agccaaggcc cccatgtgat ctcactcaga caccagaaaa ctttgcagag
    23281 gcagctgccc tctgacagtt ccgggacgtg gctccggcac acatgtgtca ggggaagcct
    23341 ctggaatcag gcccttggtc ttacacggag cctccccgct tggcggatgc agaggggctg
    23401 tctgtccctg tgctgagggg ctttgccttg gtgtctatgg gagcccagca cagcctgtgg
    23461 ggcatgtgca caaatagagg agtgtggtgt cccagggctc acccatgagg acaaagcatg
    23521 gatgtggtgc ccatgctgag cttgaggctg ggacagagga gcccacaggg ctggcattgg
    23581 agggtaaaga catggggagg tagagaaggc cccagcctca tttcctaaac ttcagcccca
    23641 catggaaaga cccactgtgg ccaggtctgt gacccagccc tcccgatata gaccgtgggg
    23701 cagggcagga cagggctggg gggcctagag gaggtggggt ggctggtggg tcccggcagg
    23761 aggctgcaac aggtggtggc tgctcacaga gcacagtgag aacagctggc gaagaggggc
    23821 cactggcact gtggccgtcc cgggcccggc agtggtatga gccggcgtca gtgctggagg
    23881 ccgcggggag caggaggctg ctgccgggac cctcgtgaag cagggctcca ttcaggtacc
    23941 aggagaagcg ggcatcaggt gtggggctta ggccgcttct gcagctcagt gtcactgcct
    24001 gtccttccac cacctcggct gccgggctga tgaggagacg ggcggctgcg gggagaggaa
    24061 gaggctggga agggtccctc ctctcaaccc cacatgctgc cctatataga aagccttcca
    24121 gggttctcct ttccccacac tactgtaggt agctctgggc tttgtggtgc atcaggaggg
    24181 tttgcccttt aatgccaaat cagatctatt ggtagtagat ggctgcagca tggttgaaga
    24241 ttcagagcca gagcccaggc ttatgtccaa cacctggctt ggctgggcat ggtagctcat
    24301 gcctgtaatc ttagcacttt gggagactga ggcagaagga tcgtttgagg ccaggagttc
    24361 cagaccagcc taggcaacat agtgagactc catctcaaac aatttttttt ttttgagact
    24421 gagtctcact ctgtctccca ggctggagtg cagtggtgtg atcctggctc actgcaacct
    24481 ctgcctccca ggttcaagcg attctcctgc ctcagcctcc caagtagctg ggattacagg
    24541 tgtgccacca cacccagcta atgttataca tgtagtagag atagggtttc accatgttag
    24601 ccaggcagat ctcgaactcc cgacctctgg tgatccaccc gcctcagcct cccaaagtgc
    24661 tgggattaca ggtgtgagcc actgtggcca gctttttttt tttttttttt tttttttttt
    24721 tgagacggag tcttgctccg tcagccaggc tggagtgcag tggcgcaatc tcagctcgct
    24781 acaacctctg cctcccaggt tcaagcaatt atcctgcctc agcctcccta gtagctggga
    24841 ccacaggtgt gcgccaccac acccggctaa tttttgtatt tttagtggag acggggtttc
    24901 accacgttgg ccaggctgat ctcaagctcc tgacctcagg tgatctgcct gcctcggcct
    24961 cccaaagtgc tggaattaca ggcatgagcc accatgcctg gccacaattt ttttttttta
    25021 attagctggg tgtggtgaca tgggccgtag tctcagctat ttgggaggct gagatgggag
    25081 gatggcttga gcccaggagt ttgaggctgc agtgagccat gaacatacca ttgcactccg
    25141 gcctgggcaa cagagcaaca ccctatctca aaaaacaaaa agaaaaacct ggcttgatca
    25201 attagctacc atgccctcag gaggagggaa ggacagtgca cataccgaaa gttggaagac
    25261 cgtacttttc ttttttcttt ttcttttttt tttttttttt tttgagacag agtctcactc
    25321 ttgtcaccca ggctggagtg cagtggcgct atcttggccc actacaactt ccacctcctg
    25381 ggttcaagcg attctcctgc cttagcctcc caggtagctg ggactacagg aactcaccac
    25441 catgcccagc taatttttgt atttttagta gagatgggat ttcaccatgt tggccaggat
    25501 ggtctcgatc tcttgacctc gtgatccacc cacctcggcc tcccaaagtg ctgggattac
    25561 aggagtgagc caccacgccc agccagaaga ccctactttt ctatttggct tcccacatct
    25621 gactgctagc atagagcctg ctcccagagt ttcataatta aaaaacaatg aatgcttctg
    25681 agggactctc caagtttagg gtcagggtag gtgcaaaagg aatgatgtga cctgttgtgt
    25741 ttcccttttt cccttgactt ccaggaagct ctgccgttgg gtcactgcac agcccctgtc
    25801 ttttatgtgg cgtagccagt tagctcagtc ctgcggttga gtccactaga cttctagaag
    25861 gaacagactg gagcaggctc ctcctcaggc tccctccact ctccctggct gccagtgccc
    25921 atcttaccat tggcatggaa gtccagggtg gaggttgcat ttccaaggga gttggtggct
    25981 gagcacttgt actccccact gtcagtttcc tccaggtctc ggatctccag gcgcagggag
    26041 ttgggaccag aggtaccact gaagcgtggg ctgtgatcac tgtccccgga ggtggaggcc
    26101 aggatatgac ccccatgtga cagcaccagt gtggccaggg gctcactgac cacagagcag
    26161 tgaaggatgc ccacaagtcc cgcctgggtc tccaggaagg ctgtcaggac tggagtgaga
    26221 ggcgggtctg tgtggagacg agaggtgggc ctgtcaccct cagacaaggg cattccctgg
    26281 ataccctgat acaacccgtg acctctgcac cgctttgtcc cacactgccc tgtgagaagg
    26341 gggtgatccc aaagtgcctg agtgcctgct gaatacactt ttgtccttgg ctgggctggg
    26401 tacgtcactc tgttgtccca gtcagtgttc aaggccaccc tgcaagtggg gataaaagcc
    26461 ccacttcaaa gatgagaaaa ctaatagaga gacctggtga gggacagcac ctcagccagg
    26521 tgaccaaagt gagcatcatc agcaatggga caagctgaca cagagtgcct cctgacaggg
    26581 cagagcagca tgtccgcggc cttcccaccc agagggcatg gtctcaggcc attcaagtca
    26641 ggcaaatgtg gagtgaggga cctcctcaga acagcaggcc tgcactctgc aagtttcaat
    26701 gtcaagaatg acaaggaaag actgaggaac agtctcagac taacggagaa tgaagagacg
    26761 caacgaccca atgcaatatg tgaactgtga ttgtattctg gaccagaaaa aaaatggcta
    26821 caaaagacag tattaggtcc actggtaaaa tgtgaatata gattatagct tagataacag
    26881 tcttctatca gccaggtgta gtggctcata cctgtaatcc cagtactttg ggaggccgag
    26941 gtaggtggat cacgtgaggt caggagttca agaccatcct ggccaacatg gtgaaaccct
    27001 gtctctacaa aaacacaaaa attagccagg caggatggtg ggtgcctgta atcccagcta
    27061 ctcaggaggc tgaggcagga ggagaatcgc ttgagcccca gaggcggaca ttgcagtgag
    27121 ccaagatagc accattgcac tccagcctgg gcaacacagt gagactccat ctcaaaaaaa
    27181 aaaaaaagtc ttctatcaat gctaattttc ctgatcatga tcattgcatt gtggatatgt
    27241 aagagaatga tcttaggaat ttagcggtaa agaagcctca tgtctgcaac ttcaggaata
    27301 taaatatgta ggtagataca taaataacat atgcatatgt atatagagtg tccatatatg
    27361 tataagtgca catgtccaca tagagtggat gtgcatacac acaagtgcac ctgtatatat
    27421 gcaagtctat atccatacat ttatatgtat gtatgtgcgt gtgtgtgtct gtgtagacag
    27481 cgagaaagat taagcaaatg tgccaaaatg ttaataaatg ggaaatctag gttaaaggta
    27541 tataatcatt atttgtcttt ctctgcaact tttcataagt ttaaacttcc aaatataaaa
    27601 taggagggaa ctagggaggt caaagcattg cccaggtgtg cagagctggg acatgagctc
    27661 aaggccacct ccaggtaagt ggccttgaag ttcccatgcc caggacccca acccttcctt
    27721 ggggctccac catctggtcc tgctctgact gtgtccagtg ccaccccacc ctggcctctt
    27781 acggttgact accacgctga cagggcccga gcgctcgctg ccatggacgt tctgcacctc
    27841 acagaagtag aagccagtat cagccctagt ggccaagtgc agccggaggg tatgggagtg
    27901 ggcatcctcc agcaggacat ggttcttgta ccagctgtag cggagatcac tgggtgcctc
    27961 attgggtgtg ttgcagacta gtgtcactgt ctggttctcc aggatgggac ctgctgggct
    28021 cacctggacc tcagccactg caagggcagc atagggagtg ctggggggtc ccagccaact
    28081 tccagcccca gccccataca gtccccgggt ctcagccaat cagcctcaat ctctgcacat
    28141 cctacaccac ccgcctgctg ctacagggga gcctcctgga gtgctctgca tctctttgtc
    28201 ctgctcacca ggatccccct gacaccccac ccttcgtggg ggtattgtca cctgtctcat
    28261 ggtaaggcag ggctgggact ccacacctgc atccaggaag cactgggtta tgccagttgc
    28321 tggccctgcc ctgccctgtc tcccctccgt ccctgaggcc tgagctcctc cctattctct
    28381 ggccaatacg caaccttccc aagaactcac tgaagatgtg gaggctgatg gggggtgaga
    28441 ccaaagagcc cacgccgttc tcagcttggc aggtgtagac gccagcatcg ctccaggctg
    28501 cctggggcag gtgcagcaca ccagtcttgg tttggaggcg taccccatcc ttgagccact
    28561 taatggaact gactgcaggg tagctgctgt tcacctggca ggtgagtgtg accagctcac
    28621 ctggaaggat gttcctcccc gaggggctga ggaggatctt cacacccttg ggggcatctg
    28681 caagtcacag tagggggtat tgggtaaggt gcttggggag ggcagaggat ggcacacttc
    28741 ttcttgcccc cctttaaaag ctcagtccta aggaagtatg cccagataaa acagcagtcc
    28801 ccttcaatcc tcacccaggg catgctctgt ctgcccatgt ccgtcccttt cccgccccct
    28861 gcgcctgatg cattcctatg cccattgaga gctgatcatg tgacgcttgg ccagcgtcca
    28921 gcctacccca ccagctgtag ttttctgctt cccaatgctg tccattgctc ctgctctatt
    28981 tgggatgagc tccacacaca gggtcatggg taccccactg ctagctttag tggcctgtgg
    29041 aagctattgg taagggacca actacctagt gggagggggc caaaggcagc atcaaactag
    29101 ctctgaaaat agttacccag tttgtaagca agaggccaac aacacaaaga actgcatttc
    29161 cttaaatctt gttccaaagc ctctctctgt gtattcaagt gttttattct tattttttta
    29221 gaaagaggat ctggctcagt cagccaagct ggagtgcaga ggcacaatca tagctcactg
    29281 cagcctgaaa ctcctgggct caagtgatcc tcctgcctca gcctcccgag gaactgggac
    29341 tacaggtgca agccaccaca tccagctaat ttttgttttt taattttttt gtagagacag
    29401 ggtctcacta tgttgcccag gctggtctcg aactccttgc ctcaagcaat tctcctgcct
    29461 tggcctccca aagcactggg gttacaggtg tgagccactg tatccggcct caagtgttta
    29521 atatgtgcca ggcactcttc taaatccttg acctgggtca tctcctttat ttgtttttgt
    29581 tgttgttgtt gttttttgtt tgagacacag tctcactctg tcacccaggc tggagtgcag
    29641 tggcacaatc acaactcaat acaactccac ccccggggtt caagcaatcc tagtgcctca
    29701 gcctccttag taactgggat tacaggcata tgccaccaca cccggctaat ttttgtattt
    29761 ttagtagaga cgaggtttca ccatgttggc caagctggtc ttgaatccct ggcctcaagt
    29821 gatccacccg cctcagcctc ccaaagtgat gggattacag gcataagcca ccgcgcccgg
    29881 cccatctcct ttaattttta tcataaatct gtgagacagg aaccatctat tgttatcttc
    29941 gtcatagact gagaaaacag agccaggcag gaaagggata aatcccacct ccccagcatc
    30001 ctccagcacg gggtaaatcc cggggcactt gctgccaacc ctgaagctgc atgggagctc
    30061 ctgttcccca accctttgtg tctccctttc tctgcccagc ctggcctcaa ggacaagccc
    30121 cttcgaagca gtaggaggtg gagggaacca tttaacgaag tccttggggc tcagcagtgt
    30181 ctctccactt gccctaccct ggaggcccag gagcagcctg gttttgcatc aggagcaagg
    30241 gttccgtttc tgtgggctgg agaggggctg gtttctgtca ggagcaacag atgcgctcag
    30301 ccacaagggt gtgtccccag acaactcaca cttcacttgg aggtgaatct cgctctgagc
    30361 cctgtgattg gccacggaga gctggcagcg caggatccgg ccgtggtcct gccaggacat
    30421 ggccatgtgg agggtctcca ggtggccgac gccggtgggc tcaaacttct ggctgttgaa
    30481 ggtgacagag cgagcagggt cctggccttg ccactgcagt ctgacctgct cctgcaggca
    30541 tacgtaggga gtggagcagt tgaagtccac ctctgtgccc tcgagaagct ccaccgggga
    30601 ggcaatggtg ggcaccctgg gctcctctga ggacagagac agcagtgctc aggacccgct
    30661 tttgccaccc ctgagatccc tcgccctgga aaccccagct gaggagagag ccctggggag
    30721 ggggctttta ggggaaggaa agacgctttc tactccagcc ccacttgggt gaatacaagg
    30781 gagaaccagg cctgggccta cgccggggct ggaacagagg ctgagactgg ctggggttag
    30841 attcaggaca agggctgggg ctgagagcca aggggtccag aagcagcttg ggaatccctc
    30901 ccggggggca gccaggccac cccacttatc acctgttact gtgaccaagg tgcctttcac
    30961 atctgaccag cggttgacct cactgatctc gaagcggaag ttgtaggaac cagagtcctc
    31021 gggctgcagg tccttcagca gcaggttgca caccctgtgc tcggggttcc ccatgaactc
    31081 ggtgcggccg cggaagcggg cctccaccag cttggggtcc gccgagtggc tcaccacctg
    31141 ccgctggccc gagtagtcgt agtaccagat ggccgtgatg ccgtcgggca cctccacgtc
    31201 ggcagggaag ctgaagatgc aggggataag caggcaagac cccttcacac cctgcacgtc
    31261 ctggggactg gagacgcccc atgaggcctg gcctggggga agaacggcag ggggacagag
    31321 gggagggtga tacaggcctc agggtgccac agagccctgc gacctgcccc agagaaggtg
    31381 ccccagctgg gctcccaaat tctgccctgc cccgagaaat gcacacttag agcagccctt
    31441 ctcagtgccc caggggtcag tccactgccc gaggctgctc agaggtttgg tagggtggct
    31501 cgaagacaga tggcagcttc ctgcccagct cctggccact gcccgattgg gccctccctt
    31561 gacctcagca accaaacatg tgacccaggc atagtctaat ctgccaaggg ctggacttat
    31621 gaaggctgga ccctggaaga caggcactag gcccgcaaag cttacctgct gggaagaatg
    31681 aggccaggag gagaagcttg ggcaagaagc ccatagcagg ttcttgtgct gctcctgttg
    31741 cctaagaggg tggtgcgcac tgcgctggct gggctcacag gggcctccag ggacacctct
    31801 gggcacttta gccccagcac ctgctagaag tccgagcctg tgtccccacc tcctctgctg
    31861 gccaacccaa taagagggca gggctcttaa agacctctga gtcagacacc agcagagagc
    31921 caggaggcca cgttcccagc tcaggctgtg cccaggaatg ccctcacttg gtggcctgcc
    31981 tcagaaaagc ctgtgtgtcc cttggcccta tcccaagttc tgctttccca gcccctcaag
    32041 gataccaccc taaggcagat gaacagctgt ttctccctct tccccacttc cctgccccct
    32101 ccccaccacc caaaccaaca ggaactggag cccagagatg cccagttact cactccaaga
    32161 cacccagcta gaatgatggt ttcttcctga ggcttgtctc ctaccacctg ccttactaac
    32221 tatagaccat aatggggctt tactgaactt gccgaagtgc tgcctttaac agtcactccc
    32281 ctgctcaaaa accttctgtg gctccccatt gcccgtgaga tgtgaaaagt cctcatttcc
    32341 tgcccccagc tctgtcccca ttccctgctc tccgcagacc cactctgggg gcagttctgt
    32401 ctgctcaagg gctccctagc tgcccagctc tatctccacc acagataatc tttgcctgct
    32461 gaaaccttat tcaaccttca aggagcagca tgaatttggc ttccagctgg aacttctcct
    32521 ttgaggttcc tgtagctacc ccagagctat ctctattttc cttgccttgt tttttacagc
    32581 ttgtgagagc ccatttctca ggacctagaa ctgaagatat gtgccccata gcagtgcgga
    32641 gcctaccagg cattcagcaa accccttagt gactaagaga ggggtgaggt ctttaggggt
    32701 tcagagctga ggttcagagt tggagtgggg aggtggcaag gcaagtctag gtttgaaagg
    32761 tagcatgaga gcgctgtgga acacataccc cacaaatatg agctcaatgt gtgcggagtg
    32821 taccatatcc aaaaaggcag gccctcaacc atggagtgcc cctggtcagg gagtgtctaa
    32881 ggggtaccat agacctgagc ccaaaaggaa gagatgccag aaacacatat aagtgaaact
    32941 ataaaactct tagaagaaaa caggtgaaaa tcttcatgac cttggattag gcaatgcttt
    33001 cttaagtatg aaattaaaag cacgaaaaaa aaaaataggt aagttggact tcatcaaaat
    33061 ttaaaacttg gccagacaca gtggctcatg cctgtgaacc caacactttg ggaggctgag
    33121 gcaggaggat cactttagcc caggagttca agacgagcct gggcaatact gcaagactct
    33181 gtctctacga aaaattaaaa aacaggcctg tggtcccagc tactctggag gctgaggtgg
    33241 gaagatcgct tgagcccagg ggaggggtcg aggctgcagt gagccatgat tgcaccactg
    33301 cactccagcc tgggtgacag agcaagaccc tgtctgaaaa gaacaaacaa cagctgggtg
    33361 cggtggctca cgcctgtaat cccagcactt tgggaggctg aggcatgcag atcacaaggt
    33421 caagagattg agaccatcct ggccaacatg gtgaaacccc gtctctacta aaaatacaaa
    33481 attagctggg tgtggtggtg tgcacctcta gtcctagcta ctcgggaggc tgaggcagga
    33541 gaatcacctg aacccaggag gcggaggtca cggtgagcca ggatcacgcc actgtactcc
    33601 agcctggtga cagagtgaga ctcttctgtc tcaaaaaaaa aaaaaaaaaa aaaggatatg
    33661 aatcaaagga cactatccag agagtgaata aaaggaggac aacccacaga atgggagaaa
    33721 atatttgtaa atcatttatc tgataaagga ctaatatcca gaatatataa agaattccta
    33781 caatgaacaa caacaaccac caaaaaaacc atgaaatcca actccaaaaa tgggcaaaac
    33841 acttgaataa acatttcttc aaagaagata tataactggc tgataaggac atgaaaagat
    33901 gctcaacatc actaggcatt aggaaatgca aatcaaaacc aaaccacagt gagataccac
    33961 ttcacatccg ttagaatggc tattcacaaa caaacaaagc aacacagaaa acaataaata
    34021 ttggtgagga tgcgaagttg aaattcttgt gtattgctgg tgggaatata aaatggttcc
    34081 gtcactgtgg aaaacaattg ggtcattcct caaaaagtca acataggatt accatatgat
    34141 ccagcaattc cactcctagg tatataccca aaagtactga aaacagggac tctaacagag
    34201 tacaccaatg ttcacggcag cactattcca ctaaaaggtg gaaacaggtc aagtgtccat
    34261 cagtgaatgg atgtggataa acaaactgtg gtatgtacat acaatggaat atcaatcggc
    34321 cataaggagg aatgaattct aacatatgtg aaccttgaaa acattatgct cagtgaaacc
    34381 agccagacac aaaagggcaa atattgtagg gttccaatta catgaaatat ctagactacg
    34441 tatattcaga gactgaaagt agaataggat agaggtaacc aggggctgca gggaggggga
    34501 gctaatgttt aatgattgct gagtctctca gataatgaaa aagttctgga aatagtggtg
    34561 atggttacac aatattgcaa atgtacctaa tgtcatgggg tgtacactta aagacagtta
    34621 aaacagtaaa ttttacatta tgtatatttc accacataca cacccatgtt gccaactttg
    34681 caatcctccc tggtcctaaa tgctgacttg gccaagtgaa cgaggaggct ggaagtgggg
    34741 acaggaaact catgacctcc cagctcccag cccatccgcc tcaggggctg ggctcagcag
    34801 attccaatga ctaccagggg tcacacctgg gaagggggtg agccgaggcc cagggccagt
    34861 caggctgacc aggtgggact tagcctgctg cagaaggcag aaggtgcccc agcagggggc
    34921 acagtacagg gcgggattgg gacaggaagg acaccgctcc ccaggggacc cagccctctc
    34981 gcaggctgct ggagtggact gatctggcca tttatggagg cccaagggct catctccagt
    35041 tctctaggaa gccctaggcc tcctcctctt ctgggaagat gcacccccag cctccacacc
    35101 aggttcttgg ccactggaga atgatatagc tggggccctg ggacctggac acctcaccgt
    35161 gaagaaaagc agcctgctgg gcacactggg ggtcagatgt gtccctggcc acaggggatg
    35221 tcagggtcag ctctgctatg gccagggcag ctatcttgtc ccagctcccc tgttcctccc
    35281 atttggggtc ctgaaaaggg caatcgtgaa cctgatggaa gaatggtggg gttctggaca
    35341 cagcgaccct ggaacagggc gcgggggagg accctttcca ggaccactcc catcacataa
    35401 tgtagaggcc acctatgctt agcccggccc taaccccaaa ggggtcagcc ccaccggaat
    35461 ccagcctatt ggctcagcct gtcaccacaa aggccagctt cagcccagat aactgttctg
    35521 gaaacagaaa gagcagggac cgctcagaaa ggagatctct gtccctgttt gaaagcctgg
    35581 agttgagggg acagtgcccc gccccccgcc ccccgcaact tgggttgcag ctgtggccta
    35641 gtgagcacgc agcgccccct ggtggtcgag ggggaattgc gggtcccggg aaagggggcg
    35701 gtgtgccagc aacagggagc aggcagctct gcagccctga accatccctc ccttgggtga
    35761 ctcttttgga aatcattgtt ccccagacag gaggttcctg aggttcatac ttgggtctcc
    35821 aagtcttggg tgctctgaag acaggatttt aaatccccac tcctactatt ggtatgtgtt
    35881 gcatcagggt ggcttgagct ggcctggtac acagtgggca ctcacgtttg ctgcctgttt
    35941 gagaccaagt gcctcaggag gtcttggagg gttggctggg gccccaagtc cctgacctct
    36001 gattccagag gccaagttta gctggggaag aaagggcaga ggcagtttcc ctatggacag
    36061 ctaggcccgg gtgtaggatt cagtttctgt ttcctgacac caaggcttct ccccaacttc
    36121 cccattgggc tagagaagga agaacacagg gtgacatggc cagctggagg ttactggccc
    36181 acagataggg agtcagggta cggatgggca attcctggag cagattatgg tcaaaatagt
    36241 ggaaatcccc aatcacaggc caaatgttta attctcagag ccatagaatc cataactaat
    36301 gcttattggc tttgacatgg gcagtagaaa tttcacactt cattccttaa tctggctcta
    36361 aatgcttctg gctggagtgc tcactctcca aactgtgctg gacagcacca gaagccctcc
    36421 tgtggacgga cgaagtggtg atggatgaag tggaattgtg ctggggttag agtaaggaga
    36481 gattatcgtt gggctgggct ggggctgagg tcagggtgat gatcaggttg gaattctgga
    36541 agcaaattaa ggctgggatt ggggtggaca ttgaggttga gtgatgggtg ggggtaaggg
    36601 tgaaggttgg agttgggtgc aggtgatggt taagatacgg tggaggctgg gttgagatga
    36661 ggatgataga gtcggagttg tggttagggt tgggatgatg gatggttggg attagatgag
    36721 atgagtaaat ggttagggtt ggggtgcaag tgaggtgagg gtgagataag gttggaatag
    36781 gggctggggc tggggctgag gtagggtcag agcagggtga tggtcgggac tgggattggg
    36841 atgcaggttg gaaggatttg ggggtggtgg aagggtttca gttgagtcca tctttgatta
    36901 gtgtcttgga cttgggttgg tttggggtcc accactcgca cccagatgga gccccccccc
    36961 gacccctgcc cctatcccgc tcagccagtt tcagcccagc cccgctccta atgctccact
    37021 caccctctgg gcccagggac cagggacagg ggtacctgct ccacagaagg aagtggctgc
    37081 ggcggtgctg gacctgcgga gaggagaaca ggaaggacgg ccaagagctc ctggtgcagc
    37141 tggctcccca gggctctgcc ggctcaagag agaaggatcc cgtatcaggg gctgcttcct
    37201 ctttcccaaa gcctcagctc tactgtccaa cccagaggct ggtcagggag gcagctgcag
    37261 gccttgcgca atgccaaaac gggaaagacc tccatagggg aaggccctcg gcaaggccag
    37321 ggacttaggg actccagcaa gcagaagtgg gaccgctgca acgctggagc ctccccaggc
    37381 aaagtgagaa atggagtggg ggactcccat tcaccagcca aatccacacc ccactctctc
    37441 tgagcccctt agggaggctg ggggaggtgg aagggggctt cctgcacaca gctctcctcc
    37501 ctgacacctg agagggaggc gcgcccagcc tggggtgggg atgcactaca cgatgcccag
    37561 accaaggcag ttattctcca agccattcag gagcctccct gtaccactga gtactcttaa
    37621 gaacccccaa gaggcagtcc cgtgttgtgg gggacataag gccccaatgt ccaggacact
    37681 cctcaggttt tctctttccc ctctcactgt cctgtacgtt ttttggtttt tggcttcttg
    37741 ttgttggttt ttttcttttt gttttgtttt tgcttttgag atggagtctc actctgtcgt
    37801 ccaggctgga gtgcggtggc acaatctcag ctcacggcaa cctccacctc ccgagttcaa
    37861 gtgatcatct tgcctcagcc tctcgagtag ctgggattac aggcatgcac caccacatcc
    37921 ggctaatttt tgtatttttg gtagagacgg ggtttcacca tgttgtccag gttggtcttg
    37981 aactcctgac cccaagtaat ctgcctgcct cggcctccca aagttctggg attacagagg
    38041 tgagccgctg cacccggcca ctatcctata ccttcacccc caccttggga caggaaagga
    38101 aagcccccac cacagctagt tactgttaca ttactgcagc ccatcttatt gagggcctag
    38161 tttgtgcagg cacttgacat gtgaagcaga tgttaattgc tccaagcaat cccagataca
    38221 agcaacaaga ctttttttgg tttttgtttg tttgtttgtt tttgtttttt gagacagtgt
    38281 cttgctctgt cgcccaggct ggggtgtcct ggtgcaatct tggctcactt tggctcactg
    38341 cagcttcgaa attctaggct taagcaatcc tcctgcttca gcctcctaag aagccgggat
    38401 tacaggcgct tgccactatg cacggctaat tttttaagtt attttgtaga gacggagtct
    38461 ccctatgttc ccaggctggt ctcaaacacc tgggctcaag tgactctccc acctcgacct
    38521 cccaaagtgc tggaactaca ggcgtaagcc accactcctg gccatttttt tttttaattt
    38581 cacaactcaa acttaattca actcagtccc tgcctacctg tactggtggg gagctggcca
    38641 cagcaaatta ggcccctgta ccctgagggc aaggccacgt gtgcaggtgt aaaaggatgt
    38701 gaaaccctaa atcagggtgg atccagaatc gcaggccatg gtgccccaaa gcagatgtct
    38761 ggtgacattc caccctgaaa tgctcaggct acagagatat taggtctcta tcactctgtt
    38821 cctctttata gctcctgtgt ccactcctat gcttgggcca tttcctcttt cggccaaaac
    38881 acaaagggtt catcccatta cttcctccct caacagctgg tccggagaca cccagctcta
    38941 ggcctgtggg gttgtgacac atgggtacca atccttcagt ccactgggac tctatacatc
    39001 caccccttgg cttcatggtg ggaaagcatc ccttgacttt gaccttagtc atatgacttg
    39061 ctctggccaa tggatgtgca cggacatgac acaaactaag tcttgaaacg tacttaagca
    39121 gtttgccttt ccccctgaat ttctgtcatt gccatgcaag gggcaggctc caactaacct
    39181 gctggtccaa agaggatgac gaacacatgc agatgacctg aatcagaccc atggattgaa
    39241 acaaaactca gctgagccca gcctacgtcc accagaccag tcaacctgtg gatgcgtgaa
    39301 tttattgctg gatgctgctg agaattttgt ggctacatta gcaagatgat acaaggccta
    39361 agtcccagaa caacacaccc agaacttgct tacctttcct tagcatgagg agagcaaaga
    39421 cttgtctacc ttgattagtc agggagcact gcttcctgtc atttccttga gtatacagca
    39481 aactaggtaa ataaataaaa ataactaggt aggctgggca cggtggctca cgcctgtaat
    39541 ctcagcactt taggaggccg aggtgggcag atcgcttgag gccaggagtt caagaccagc
    39601 ctggtcaacg tggcgaaacc ctgtctctac gaaaaataca aaaattagct gggcctggtg
    39661 gcaggcgcct gtagtctcag ctactcagga ggctgaggca cgagaatcga ttgaacccgg
    39721 gaggtggaga ttgcagtgag ccgagatcac actactgcac tccagcctgg atgacagagc
    39781 gagactctgc ctcaaaatat tttaaaaaat gtaatttctc agtaggtcac actgttacac
    39841 attcatctaa taacaattat tctttttctt tttttttttt ttttttggag atagggtctc
    39901 actgtcaccc aggctggaca ggctggagtg caatggcaca atctcggctc actgcaactt
    39961 tgacctccta ggctcaagtg atcctcctgc ctcagcctcc caagttgctg ggactacagg
    40021 tgagtaccac cagacgcagc caatttttgt atttttttgt agagatgggg cttcaccatg
    40081 ttgcccaggc tggtctcaca ctcctgagag ttcccagtca agtgatccac ccaccttggc
    40141 ctcccaaagt gctgggatta caggtatgag ccaccatacc cagtgaatta ttctcgttcc
    40201 agataggaaa accaagtcat agggaggtta gagaatttgc caaagacaaa actttttggt
    40261 tgaaaaaaaa taagttttgc tacaagtata gaaaacacca aataacggtg ttttaaataa
    40321 aatagaagtg tttcaccctc tccctcaagt aagtgttggc atccaagatg atatgacaac
    40381 tccacaatca tgaaactaga tcccttttta ttttttggct cagtcattgt caatgggctg
    40441 cttccagtca ttgtccaaag tggctgcttg cgctccagcc actgtatctg cattcgggaa
    40501 agcaggatgg aggaaaggac aaagtaggcc atgccctcta ctttaagata actccttaaa
    40561 ttgctgatgt ggtgggtcac tcctgcaata ccagccactc aggaggctga agcaggaggt
    40621 tcacttgaac ccaggagctt gaggctgcag tgaactataa ttgtgtcact gcattccagc
    40681 ctgagtgaca gagtgagatc ttgtctctta acaacaacaa aaaaggtaat tctttaaatt
    40741 gtacacttta ttgctgctta tattccacta gaagctagaa gttagtcaca tggtcttaag
    40801 tctttattct aacaggtgat gtgcccgagt gaatcctggg ggcccagtgc taagaagtag
    40861 agggagatga agccctgccc tcctgtggag caaccaggac cttaattcag tcattcattc
    40921 cttggaccta cccagattcc agcctcaggc ccctgcccac ctccttgcca gtatctctgc
    40981 agagcctcct gcctccccca gagggtgcca gagccctctg cttcctcagc cttcagaccc
    41041 acttgctgtg tctcaggtgg ggacaactca gtctatagag gcccaaggga gatctttgag
    41101 acccatactg tagtcagctt ggggcagatg ggagtcatgt agctcactgg ctaaaagcct
    41161 aggtcctgca tagagctggg ttcaaatcca ggcatatcca tcacctgctc tacgatccca
    41221 ggagtcactc aatgactcag aagctagagt cctctggaaa gcttgtatga agagttccca
    41281 gggcctggaa catcaaaaga gctctgagaa tgttggctct gctgttgctt ctcttattgg
    41341 gtatctgaag gccgtactct cctctctcct ccacccgagg tgacctctac tctcatagcc
    41401 acaccctgga ccctcactca ttcagtcacc tcacccagtc tgtcatctgt ccctctggct
    41461 ctcctctcct gttcatgcat cttgtactca cccacttctg tccctgagtg tttcaaggct
    41521 ctggggtttt cagagacact gaagggacct ccctcctcaa cacaaccaca aggtctaggt
    41581 ggaatgaccc actaagggac cggctctgaa gccagagact tccagggaaa gtcaacaagc
    41641 ccaaggatgc ccgttacaag aaagttaaaa gacccatgta catgtcctcc cgttttattc
    41701 cctgctcagg gtctgggcat acagtggaac acatgcagtc cccaaaggac accgtctgta
    41761 cagagtcaga tggagttaag aacatttagc cggctgggcg cggtggctca cgcctgtagt
    41821 cccagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc ggagcttgca
    41881 gtgagccgag atcgcgccgc tgcactccag cctgggccac agagacagac tctgtctcaa
    41941 aaaaaaaaaa aaagaatatt tagtctgggc acggaagctt atgcctgtaa tcccagcact
    42001 ttgggaggct gaggtgggcg gataacctga ggtcagaagt ttgagaccag cctggccaac
    42061 atggtgaaac cccatctcta ctaaaaatac aaaaattagc caggcatctg taatcccagc
    42121 tactcaggag gctgaggcag gaaaatcact tgaacccagg aggtggcggt cacagtgagc
    42181 caagattgtg ccagtgcact ccagcctggg tgacagagca agactccatc taaacacaca
    42241 cacgcacacg cacacatatt taaaccacca caccaacatc tagttcaaga tggtggactg
    42301 agaacttgtc tctgccattc ctggcccatc caataccact gagagcacag taagcaaagg
    42361 gaaaaggaga cagaagggct gggaacagga tggctggggg atgggaagta tccactgcac
    42421 aggattttga tttaattcta gaagatagaa agagggagga tcacgttcag gaacagatgc
    42481 gggtaaagga aaccagagcc aaagcacgct gagagaaagc tgccccagag gccggagcag
    42541 aagtggactc tctacaggga ctcaatacac cccaaagggt tggtagctgg cacacgtacc
    42601 tctccacccc cacatgtaac actgcagggc agaggaaata ccctaggtga gttccagtat
    42661 caatcaacct gccctttgtt cataaagatg aattggttac caggtattac cagacaggtg
    42721 aggaagacca acacaaagag aaagatccag aaacaaacag gccaggcgca ttggctcacg
    42781 cctgtaatcc cagcactctg ggaggccaag atgggtggat cacctgaggt caggagttca
    42841 agaccagcct tgccaacatg gtgaaaccct gtctctacta aaaatacaaa aattagctgg
    42901 gtgtggtggg gttctcctgc ctcagcctcc cgagtagctg ggaggctgaa gtaggaggtt
    42961 cacttgaacc caggagctcg aggctgcatt gaactatgat tatgtcacta cattccagcc
    43021 tgagtgacag agtgagatct tgtctcttaa caacaacaaa aaaggtaatt ctttaaattg
    43081 cacactacat tgctgcttat attccactgg ctagaggtta gtcacatggt ctcatgtctt
    43141 tattctaaca ggtgatgtgc ccgagtaaat cctggggccc cagtgctaag aagtaggggg
    43201 agatgaagcc ctgccctcct gcttgaaccc tagaggtaga ggttgcagag agcggagatc
    43261 atgccactgc actccagcct gggtgacaga gtgagactcc atctcaaaaa ataaaaaata
    43321 aataaataaa taaataaata aataaataaa aacctacagc aaagaacaaa gagctattgc
    43381 accgttactg tgggcctggc agtattatga caactttttt tttttttttt ttttttggag
    43441 atggagtatt tttctgtcac ccaggctgga gtgcagtggc tcaatctcag ctcactgcaa
    43501 cctctgcctc ctgggttcaa gcgattctcc tgcttcagac tcccgagtac ctggtattat
    43561 aggcacatcc caccacaccc ggctaatttt tgtattttta gtagagaccg ggttacaaca
    43621 tggtttcacc atgttggcca gtctagtatc aaactcctga cctcaggtga tccacccgcc
    43681 tcagcctccc aaagtgccag gattacaggc atgagccacc acacccatcc agtgttctga
    43741 caattttata cttattaact catttagcaa ccctctgagc tagatgctat tgttatctcc
    43801 acttacagga aactgagtca cagaatggta cagtaaaata accttgaccg aggtttacat
    43861 ggctggtaag tggcagagaa atgaaacttg aacccaggca acctgtttcc tgagttggac
    43921 tctgaactac tcagccatac tgcctattta tttatttatt tatttactta cttacttatt
    43981 tattgagatg gggtcagccg ggcatggtgg cttacgcctg taatcccacc actttgggag
    44041 gctgagggga acagatcact tgaggccagc agttcgagac cagcctggac aacaaggaga
    44101 cagggtctcg ctctgtcacc caggctggag tgcaatggtg caatcatggt tcactgcagc
    44161 ttcgacctcc caggctcaag cgatcctccc acctcagcct cccaagtagc tgggaccata
    44221 ggcacccact atcatgtcca gctaatcgaa aaaaaaaaat tatatagaga tgggggtctc
    44281 actatattgc ctagactggt ctctaactcc tgggcttaag caatccaccc accttgccct
    44341 cccaaagtac tagaattaca ggtgtgagcc accgtgcctg accaatactt cctctttaat
    44401 aaactaaaca aaaagtgaac aaaccaatcc cggagggaac agataattca aggaatacaa
    44461 gaaaacttat gaaaagaaag attctagtgc caatatgttc ataaaacaaa aaaaaagcag
    44521 ctatgaaaag aaagcaaaaa ataattcttg gaaatttaaa atagaattgc tgaaataaaa
    44581 agaaattcaa tagaaaaggg tgtgaacatc tgtggttttg gtttccaaaa ttagttcact
    44641 cttctttggg taaaaatacc ctgattttcc tgctgtattc tcaagccctg tgggttggtg
    44701 gaattgacac ctcttccatt tactgcacga gagaagcatg ttgctgctca cagttcatta
    44761 tgaggagcca gcaggtaaag acaacactga gggcagatga aagatatata ctgaccctgg
    44821 gtccttgtgg acatcattga gtccctggat caagccttac ctgaagctaa agggatgagt
    44881 gcttctcact gttaatagcc actgatcaaa ttccatagaa ggactccaat tatccttgcc
    44941 tcagttgtgt gtccagccct cggatgagct accatcaagg gaaactgcca gaccttggtg
    45001 acaagcccac ccactgaaaa tggggaaaaa acccaagtca catcattgaa ctgaaagagc
    45061 tgcagttccc cgaaagacag gagagaagga atgctggata aacacaacaa ctacttcatc
    45121 tcactgcacc tgcaaagaat ccaggcacag gaaggtgggc acagcaagta gtctttctgg
    45181 ttgacataac aaagacaaca tctgtttctt ctctcagaaa gaagtaagaa aggcaattgg
    45241 agaaccagaa acgacaaaat ctattccagg accaaataga aagaaaacac atctgaagca
    45301 tgattttgaa caattagtgg ggtgtaagaa aagagaatcc atttgacctt gacatctgaa
    45361 acagcaaata ttctccatca aggcacttta gggtagagga aatgatacta tgtttccttc
    45421 tcaaaggagc tatctggtta cgtggttctg cagtaaatga cgtttataca atcataacat
    45481 ccgttggttt tcaattttca gaatcaacct gaagataaag catggaagat ttcattataa
    45541 ttaccgaaca gcatgcaaat gttacaaact ttgaccatga aaatgtaaaa aagagcccag
    45601 gcgcagtggc tcacgcctgt aatcctagca ctttcggagg ccgaggcggg tggatcacaa
    45661 ggtcaggagt ttgagaccag cttggccaac atagtgaaac cccatctcta ctaaaaatac
    45721 aaacattagc taggcatggt ggcacgcacc tgtagtccca gctacatggg aggctgaggc
    45781 aggagaatca cttgaaccca ggaggcggag tttgtggtga gccaagatca cgccgcggca
    45841 ctacagcctg gtcaacagag caagactcca tctcaaaaaa aagaaaagaa aagaaaagaa
    45901 aagtagacag cagaaattag agggagatgg caggtgacct ggggctgggg gaggcacaat
    45961 ttttcttaca tagtgatagg gctcaagaaa ttctttctaa aattgatgta tgaggaacga
    46021 aattttaaat atagattgtt tagaactata agtagcacca acacaggact ccagtaatca
    46081 cacagcaaac aaaactgaga aaacagacat gggcggggag atgggaatgg gtgatttcca
    46141 ttccctgctt taatgatgag gtgtcagtag atgctctctt gagttactaa attgagaaac
    46201 aaagataaag tatattgttt agaggagtga tgttcaaact ccagaaaagg ctgggcgcgg
    46261 tggctcatac ctgtaatcct ggcactttgg gaggccgagg tgggcagatc acttaagccc
    46321 aggagtttga gaccagccta ggcaacatga tgaagcccta tctctactaa aaaaaaaaaa
    46381 aatacaaaaa attagcaaag catggtggtg cacacctgta gtcccagtta ttcaggatct
    46441 cactactgta ctccagcctg ggtaatgaga gtgagaccct ctctcaaaaa aaaaaaaaaa
    46501 aaaagaagaa gaagaagaaa actccagaat atataaaaat gaaaagaggg cggaagagtg
    46561 gaactgcggg gtagaaaatt ttgcattttg tttcatctct ctgtatatgt tgaacttttt
    46621 ttttaaccta catgtactac gtcatatcca cagcccccaa ccatccacca ggcatcaact
    46681 gctgtgttta tatggagggt tgagcaatca ttcctgcctc cttccagttc gtcttcctgt
    46741 actgcagagg ctagaaaact aaatttatat cacccagatt cccttccagc taccttaatg
    46801 ctaccacctc attcagccat cattgattct ccacttcacc aaactggtca ataattgtct
    46861 cttctaaaaa tattagaatt gggactaaga gacactacag ctgaccctta aacaacatgg
    46921 gtttgaactg tgctagtctg cttatacaca gatttttaaa aaaataaaca tattgaaaaa
    46981 aattttggag agatgtgaca atttgaaaaa actcacaaac cacatagcta gaaatatcaa
    47041 aaaaaaaaaa actgaaacag agttaggtag gtcaggaatg cataaatata tatgttaatt
    47101 ggctgtttat attattagta gggcttccag tcaacagaag gctattagta attaagtatt
    47161 tgagtcaaaa gttatacaca gatttttgat gatgaggagg attagtgccc ccaaccccca
    47221 tgttgttcaa gggttagcta tatttgaaat ctgctggtag ctggaattgg gcaactaaat
    47281 tcacgagatg tggaacggtc gtatacatgg agaataaata aataaataaa taaataaata
    47341 agtatttgtt ggcctggtgc agtggctcat acctgtaatc ccaacacttt gggaggccaa
    47401 ggcaggcaga tgacttgagg tcaggagttc gagaccagcc tggccaacat ggtgaaatac
    47461 cgtctctact aaaaatacaa aaatcagctg ggcgtggtgg tgtgcacctg taatcccagc
    47521 tgcttgggag gctgagggat gagaattgct tgaacctggg agtcagaggt tgcagtgagc
    47581 cgagattgca ccactgcact ccagcctggg cgaaagagtg ggactctgtc tcaacaacaa
    47641 caacaaaaaa gcaattgtta aaagaataag aaaacaagcc acagattggg agaatatatt
    47701 tgcaaaacaa atatctgatg acccaaaata cacaaagaac tcttaaaacc caacactaag
    47761 aaatcaaaca accccattaa aaaatggaca aacgggccag gcatggtggc ggtggctcac
    47821 atctgtaatg ccagcacttt gggaggccga gcagggcaga tcaccttagg tcaggagttc
    47881 tcaactagcc tggccaacat agtgaaaccg tctctactga aaatacaaaa attagccgac
    47941 cgtggtgacg tgcatctgtg gtcccagcta cttgggtgtc tgcggcagga aaatagcttg
    48001 aaccagggag gttgcaatga gctaagatcg cgccactgca ctccagcctg ggcaacacag
    48061 tgagactccg tctcaaaaaa aaaaaatggg caaaagatgt gaacagacac ctcatcaaag
    48121 aagatataca gatggaagat aagcatatga aaatatgctt aacctgatgt cattaaggaa
    48181 ttacaaattg aagcaacaag gtaccactaa acccctatta aaatggtcaa aatccagaat
    48241 gctaaaaaca ccaaatgctg gtgaggatgt ggagtgacag gactctcatt cgttgctggt
    48301 ggaaatgtaa aatggtatgg ccatttttaa gacagtttgg cagtttctta caaaactaaa
    48361 catagtctta ccatacaatc taacaattac actcctagat agttacccaa ttgaggtgaa
    48421 aacttatatc cagggccgag cacagtggct cacacctgta atcccatcac tttgggaggc
    48481 caaagaggaa ggattgcttg aggccaggag ttctttcttt ttatttattt atttatttat
    48541 tttcttttga gacagagttt cgctctgtca cccaggctgg agtgcagtgg agtgatctca
    48601 tctcactgca atctctgcct cccggcttca agcgattctc ctgtctcggc ctctgagtag
    48661 ctgctgggat tacaggtgca cgccaccatg cccagctaat ttttatattt ttagtagaga
    48721 cagggtttca ccatgtcagc tagactggtc ttgaactcct gacctcaagt gacctgcctg
    48781 cctcggtctc ccaaagtgct gggattacag gtgtgagcta ctgtgctcag ccaaggccag
    48841 gagttccaga ccagtcctgg caacacagtg atactctgtc tctacaaaaa aaaatttttt
    48901 taattagtca catgtggtgg cacacacttg tagtgtcagc tactggggag gctgaggctg
    48961 aggatcactt gagccccagg agtttgaggt tgcagtgagc cacgattatg cctctgcact
    49021 ccagcgtggg caacagagta agagaaagaa agagagagag agagagagag agaggaagga
    49081 aggaaggaag gagaaaagaa acgaaaagaa aagaaggaag gaagggaagg aagggaggga
    49141 aggaagggag gaaggaagga agaaagaaag aaaggaaaga aataaaaaag gaaaagaaaa
    49201 gagagagaga aaccttatat ccatacaaaa ccctgcgcaa gaatgtttat agctgcttta
    49261 ttcataattg ccaaaaactg gaagcaacta agaagctctc caatgggtga aaggataaac
    49321 aaactgtggt atatctgtgc aatggagtat cattaagtaa taaaaagaag acttagcaaa
    49381 ccacaaaaag taatatgcat actaaatatg catttagtaa tattaatatg cattaagcaa
    49441 taaaaaaaga cgtgtcaagc cacaaaaagt aatatccata ttactaaatg aaagaagcca
    49501 gtctgaaaag gctacgtgct gtgtgatttc aactatttga ctttctggaa aaggcaaaac
    49561 tacagacagt aaaaagatca agggttgccg ggggttctgg ggacaggaaa ggatgaatag
    49621 gtggagcacg ggaattttag gtcagtgata ttattctata tgatactata atggtggatc
    49681 catgatatgc ttttgtcaaa acccatagaa agtacaacac aaagagtgat tcttaatgta
    49741 aactatgtgc tttagtcaac aatgtattga tattagttca ttaattttaa caatgtacca
    49801 cactaatcca agatgttaat aatggggaaa ctgcgtgtga gggaggggat acatgggaac
    49861 tctgtactac agctcaataa ttttataaat ctaaaacttt ttttttaaat aaagtctctt
    49921 agaaaacaat aaaaataaaa taaaaagtca attttttttt tttttttgag agggaatctt
    49981 gctctgtcac ccaggctgga gtgcagtggc acaatctctg ctcactgcaa cctctgcctc
    50041 cttagttcaa gcgattctcc tgcctctacc tcccaggttc aatcaattct cccacctcag
    50101 cctcccaagt agctgggact acaggcatgc gccaccacgc ctggctaatg tttgtatttt
    50161 tagtagagat ggggtttcgc catgttggcc aggctggtca agaactcctg acctcaggtg
    50221 atccgcccgc ctcagccttt caaagtgctg ggattacagg gttgaaccac atgcctggtc
    50281 taaaatgtcc atttttaaaa ggcagtctgg ctggggcagt ggctcacgcc tgtaatccca
    50341 gcaccatggg agaaggctga gcagggtgta tctcttgagc ccaggagttc gaggctgtag
    50401 tgtgctatga tggtgccact gcactccagc ctgggtgaca gagtgagact ctgtctctaa
    50461 aataaataaa taaaaataca ataaaatttt aaaaggcaat ctgcagcaag agaagatgaa
    50521 gttagcagga gagcaggaca gaggagagtc catgaacaac agggagagga aagtggtcat
    50581 ggtggcagct cccaggctcc tggcatatgt tcaattccct gttcccagcc ctcaggaagc
    50641 ccagatgtcc ccacctgccc ctacatacaa accctggatc cttgacatca acttctcctt
    50701 tccttcatgt gctctaatga gtttttgtta cttgtggaac atttaactaa catcattaac
    50761 caagtggacc tgtctcagag tggtgttatg agaagacagc agccaaaaga cagctgcagc
    50821 caagcacagt ggctcatgcc tgtaatccta gcattttggg aggccgaggt gggtggatca
    50881 cctgaggtca ggagttcgag accagcatgg ccaacatggc gaaaccccct ctccactaaa
    50941 aatataaaaa ttagccgggt gtggtggcga gcgcctataa tcccagctac ttgggaggtt
    51001 gaggcaggag aattgcttga acccaggggg cagaggtggc agtgagccgg gatcatgcca
    51061 cttcactcca gcctgggtga aagagcaaaa ctctgtctca aaaaaaaaaa aaaaaaaaga
    51121 cagctgcaac aaatgtcaag ttctgtgtgt tttcttttct tttctttttt ttctatttaa
    51181 ttaatttatt ttagagtcag agcctcccta tgtcacccag gctggagtgc agtggcacag
    51241 tcacagctca ctgtagcctc aacctcctgg gctcaggcga tccttccacc tcagcctcct
    51301 tcctagctgg gactacaggt gtgtgccacg acatctggct tgtgtgtttt cttttctttt
    51361 tttttgagac ggagtcttgc tctgccaccc aggctggagt gcagtggcgc gatcttggct
    51421 cactgcaacc tctgcctcct gggctcaagc aattctcctg cctccgcctc ctgagtagct
    51481 gggaatacag gcgcacacca ccatgcccag ctaatttttg tatttttagt agagacgggg
    51541 tttcaccatg ttggccagga tggtctcgat ctcctgacct tgtgatccac ctgactcggc
    51601 atcccaaagt gctgggatta caggcgtgag ccactgcacc ctgcctggct tgcatgtttt
    51661 ctgacatact gtcaaaagga tactcatact aaatggcaac acattctcaa gccccttcct
    51721 tttcttctcc tatctgcttt accacacacc atagctgctt ttaccgtttt ctccttaaaa
    51781 actcaaaaaa accttccccc aacctatcta tccacttctt gtcctcccct caaacccact
    51841 ccattttgat tctgcatcat aaaaaccaag cagagttctg gggaggcaga agcctcctct
    51901 tatgatattg gaagggggtg gatgaaattg agcacacaaa gccagaatcc tcttttgtgg
    51961 aaagatgggg gcagtgggca gagggaagca ggctcatctc tttccctctc ttcccttccc
    52021 ttctctttca caccacacgc tccgcctggg tgagctcatc gtcctcgtgt cttcaatttc
    52081 cacctccagg agatcgactg ccaaatttct acccgcaatc ccaaactcca ctctgagctc
    52141 cagacccatc tactaattgc ctagtcaaca ttttctctgt gggatcgaat cagataggat
    52201 tatattctgc tgtgactaac agagactgaa aattcagagt cctaaacaaa agactttctt
    52261 tttcttgtgc tgaaaagtct ggaagtacac agccctaggc tggaataggg atactgctgg
    52321 ggggctccgc ctccttgcag gctgcccctc tgctatctcc agggtgtgtc cctcaaccac
    52381 actgtccaaa gtggtgactg aaggaccagt cctcccactg acattccagg caacagaatg
    52441 gagaagaaca ggctgaggaa ggaggaacca caggcgccct ccagctgttt caaggaaggt
    52501 tcctggaaga tgctaaataa caattccgtt tatatctcat tagccaggct tagtcacgtg
    52561 gcaacaccta gaagcaagaa aggctgggaa atatactgtt tctgctgggt taccgtatgc
    52621 atgactaaaa agagagcgct ctgttcttag gaaaatgaga atagactggg ttgggggagg
    52681 ataaccagca ttggctacac tgcagcccca caacctcctc atgcttgatg tggctgaaat
    52741 aaaaagagca tctcccccta acatttcctt tcccttagtc ttgttccttt tctgtacccc
    52801 ataccctagc agaggggcaa ttctgtgggg tgggctcact ggcaagggga ggtctcaccc
    52861 aagctcagtc aagggtgagc agagagaatt gtacaaggaa gctgaatttc cagctggtct
    52921 ggaggaaaca ctgaaaaccc caaacagaag atgtccaggt ggagcaaatg cagctgagag
    52981 ttcttgtcca aggggacaga atgtcagcag agtaggacac aagggcaggt ctctactgaa
    53041 agcaccaggg gcaaagtcac ccagcccttc aaccggctgg ccaccaagca catcggctcc
    53101 actctgcctc ctgcccgtgc caccctttgc acttttgtct gtaccatctc ctccaccagg
    53161 agtgcccttc ccgctcctcc atgtggtata ctctcagcca cccctcaggc tcagatccag
    53221 agccacctcc tccaagaagt ctccacctgt ctgctgggaa ccctcacatc acactgggag
    53281 acgaagccaa aagtgggggg gtcagtacag aggtgtggga ggtaggtcca atcctggctg
    53341 gacctggctt tgggtgccat ttgggacagg gtattgggga ggttcagcag gaaggtggga
    53401 cctcaaggct ttctttttat gaaggaggag gaaccagctg ttgacagagc agcaacagca
    53461 gggagccacc aaggctgagc cttgaaccct gggccaccag gctcctgcag gcaactggga
    53521 gcagctcagc ctctcacagt cccatttcca gggaggaaga tgatgcagcc tgaggcaatt
    53581 ggccacctgg aggaagttca gctgggtggg gcaggcagag gaattcctgg aaggaagctc
    53641 acccctcagc actggggtag agggacacta gagacagggg tggcccagga gctgcccacc
    53701 tcctatcccc atgaataagc aggtgggccc gggtgaccag ctgggagcca cccctgggtg
    53761 ctcaggtacc atgcttctgg cccagcctcc tgagctgtgg cacaggcaag agcaccggcc
    53821 cacggggtct gcatggggca gtgacctgtt ctctctaagc ttcagctttc tcatctgtag
    53881 aaagagggtg aagagtcctt ccccaccatg gttgtggtga gggtaactga agtgtcaggg
    53941 taggggaccc agtggggcac cagcacacct gaggttcagt caacaaggcc cactggtgaa
    54001 gaaaacctcg cccaccctcc cgtccctcct gaggcaggtg gtggaggcac atgccctctc
    54061 cattgtgtgt gcatgagtgt gtgggtgcat gcatgtgtgt gcatgcaagt gtgtgtgcat
    54121 gtgtgtgcat gcatgtgtgt gcgaatgtgt gtgtgtgcgc acatgcatgt gtgctaagct
    54181 ctggagaaca agcagccagg cttgggcagg agtccggggg caaggggggg acatgcaaag
    54241 ttctctccac agcccctcag tgctaacagg ctgaggtggg ggatgacatg tcttagaaac
    54301 aaacttcacc ctctgcccta aaactgcctg ctggtctgcc acatcgggcc aagctgtggg
    54361 tacttggaaa gagggccctg ctcctgcctt ccaaacccag gaagtcttga aatccccatg
    54421 gaggggacct ggggccaggg gactccccaa gcagacacag cacccacaca agggtctaag
    54481 ggaccccttt ctccccaacc gcctgaggta gggatggatg tggacaagca gctcagggct
    54541 ggggctggcc taggtggaag gttcagaagc acagaaggca ggacgtgccc aaggggctgg
    54601 acatggggtt ggaggctagg ggctggaaaa agagtttggg aggtcagcca agggcagtcc
    54661 acgccttggc tcttctctgg gtcatgggga gccacggaga gtataggagt ggtgatggga
    54721 tggccccaga gttgtgcctt tgaaagtcac ctctggttgc agatagagat ggactggggg
    54781 cagggctgag aggaagggca tctcatgaga ggggaaatcc ctgaccacag tccagaggaa
    54841 aacagcgggc ctggggagag gctgctcctg gggaggacct actcagaggc tgattcttcc
    54901 tcgtgctacc ccacccaggg agatgcagga agctggtgag gtcagtgggt gggcccacaa
    54961 ccatgctgga gaggagcagg agggagacgg gagggcacag agaggtagag ggcagggact
    55021 ctgaggaccc tcatccatgc ccacctggac aattgaatcg agcttcctct agtgtggaat
    55081 gtgggctcac tggggctatt tgctatcctt ctctcatgtc agactggatg cggcccaagg
    55141 gtggggtctg tgactccctc atcagactga ggggggccga aatgtggggg tgggggaagg
    55201 agggagcttc caaacaggaa ggtttggagg gcgctggcac tgaggacagg ggatacgccc
    55261 cccatgagtc ttcacccctg ctcagagcct gctgccgtgg ccctcagaga cgagcccatc
    55321 tttggagctg gagttgggga cagggaaacc aggtggaagg gaaaaagaaa aaaggggcag
    55381 gcatgagtgt gaggggccag accctgcaga ccccaggaca tgcacagcag acacacactc
    55441 caatccctcc tggctctgct cagcctcaaa cgaggtgcag gcctgcaggc cccaaccaag
    55501 gacccctgtt acacacagac acaaagacac aagatgcagc aacacacaca cgtaatacag
    55561 aaacacagtc acacactagc gcacacacag acacagtcag acaatccaag aaacacactc
    55621 acacatgaac agacatataa gcaaagtctt acacatacaa agacacactt aagcacagac
    55681 acaaacacac caacagatgc acagacggac acacatagac gcacactggg aggcggcctg
    55741 tggagactgc tgattggata ccatgttggc ttggtatcca gaattgttgg tgcagcgcac
    55801 tcaccatgcc tgacctcatt aaggaggccc tctggggcag tggctctgag aagaaaggca
    55861 gcaagcctag ggaggtgaca ctgtagctgg ggagcatttt ggctgggcag agaggaggaa
    55921 ggagagcatt tcagggaggg aaaatagccc ttgcaaaggt ttggagtgga agagcaaaag
    55981 ggatgtccca agaactgtgc ctcatttgat ggggctgtgg ggcccttctc tgacgggcag
    56041 ccttcccagg cacaaaggga ggtggactga gaagcatctg gaacttctgt gggggcacat
    56101 ggggcagagg ggaaggggaa caggcagaac gagtcagctg cctttgaaag ctccacgagg
    56161 gacctagcag aaattcattc attcatccgt tcattcattt atttatgaca ggttctctct
    56221 ctgtcaccca agctggagtg cggtggtgtg atcacactta ctgcagcctc aaactcttgg
    56281 gctcaagcaa tcctccaccc cagcctcctg agtacctagg actacagacg tgagctacta
    56341 cacctggcta atttttaaaa ctttttagag acagggtctc cctttgttgc ccaggctggt
    56401 ctttgttgcc cagccagctt caagccatcc tcccacctca gactcccaaa gcgctggatt
    56461 acaggcatga gccacgggtt cccagcccag aaagagacct tcaccccaca aaccccctgt
    56521 ctggaggcca agtcccagca tccccagaag ggccccggtt aggacaagaa aaaggctgtg
    56581 tgcctcggag gagaggctgg gtgccccctg gtgcggagtg gtccccttgc cgggcatggg
    56641 gcctcagtct gtgagactcc gcgcctccgt cgtctgaact tgagcaaaga cagagctagg
    56701 cctagcctgg ggacccgagg gcctgccctc atccgcaatt ctgcctcttt tctggccact
    56761 gaggaccggg ctagggaagc tgtaggggca gtaggggtgg gaggagagag ggttccgggt
    56821 caagggactg gaggggtctg gggcagggaa gtggaggggt ctagggcggg cccttccccc
    56881 tccgctgttg ggccctcgct gaaggggccg aggctgagcc ccaaggtggg cctgtggggg
    56941 cggggcgggg gagcccctca gcagctggac cgcaggaggc cgaccagggt ctggaactcc
    57001 tcttcccctc cctccgccgc cccctctccc ttctccca



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH005190. Human Ca2+ ATPase...[gi:2052520] Links  


LOCUS       HSATP2A1S1              2315 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exons 1, 2 and
            3.
ACCESSION   U96773
VERSION     U96773.1  GI:2052511
KEYWORDS    .
SEGMENT     1 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2315)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 2315)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..2315
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            1140..1257
                     /gene="ATP2A1"
                     /number=1
     exon            1565..1582
                     /gene="ATP2A1"
                     /number=2
     exon            1963..2045
                     /gene="ATP2A1"
                     /number=3
BASE COUNT      471 a    667 c    683 g    494 t
ORIGIN      
        1 aggcctctgg ggctggccac agaaaccccg ttggttagag cacagtgtgg gatgaggtga
       61 ccctcagtgc acgacttggg gtgacccctg cccccatcct gagacagtta cccctccccc
      121 tctgccatca gcacattctg tagcctcttg ggttacttgg ctgccttggt gtcccatttt
      181 cttgggggtg gggtggggat tccctatcca ggatgggggg gccctcaggg ctctgttccc
      241 agaggctgag ttagagcgat ggggaagggg gggggcagtt ttggggagag acaggcagtg
      301 ctggctttgc tcaccagggc ctggacacta aatcccttgt tgatggctgt ggcaacccct
      361 ccctagggta gggttaccat cttcggccct gtccccttga ctctctcccc tcacttcccc
      421 ttgtccctct aggagccact cacttcctct agcccccaaa agatgttctc ccttcatcag
      481 tcccccaaag gcttggggta tctctgccac tgcttcagca aatggggtga ggaggaagga
      541 gactgcggca atggaaacag gctccgggca gatgaggcag gaaggggggt gtgaggaaag
      601 ggacaggtga ggccggggat ggaagagggc tcagggaaga actgggggga tgagtttgga
      661 atgggaaatt caatgcagct ggggaagtcg aggcaatggg ggggcagggt cagtagcaga
      721 tgacagaaag tgaagtctct ccctacccca cttccctggg gctggggcta cctttgcgtc
      781 cctcatgagt gacatctcag gctgcagccc cactgttccc cctctgtcag cagaaatatc
      841 tctctttttc tgacccctcc tgctggagtc tcagccagcc aatccctgat ctggtggagg
      901 ggggagcccg gcctccccct gctccctcat aaggaccagc tgggggccgg ggggtggccg
      961 gctgctcaag tgggacgggg gtcagagctt tgtggaggga agaaaaacct ggagggggca
     1021 ggagagtaaa aagaagaaac ccaggcagac aggcagttgg acacactgag gaagaccccc
     1081 cacgagtggg aaccccctgg aaggaacaca ccggccccgg cccccaggaa gggagcacaa
     1141 tggaggccgc tcatgctaaa accacggagg aatgtttggc ctattttggg gtgagtgaga
     1201 ccacgggcct caccccggac caagttaagc ggaatctgga gaaatacggc ctcaatggta
     1261 agtgtccctt ggaagagagc tggtaattaa tgccctcctg cacccccaaa acacacgcac
     1321 agccatgcac gcgtttctcc ttcagggttt cttaaggaag agctgggccg ttgtccaatg
     1381 ctcgcagggg gaagaagata ctgagaaaac agagtcccga gatccagagt tttgggaggt
     1441 tttaatggga tgaacttgat gaacctcaag gtggcctgat tctcttagcc acaaagtctt
     1501 gggtgtgggg ggcatgaggc tgaaccccaa atgaatctgt ccttttcttc tttttcctgg
     1561 gtagagctcc ctgctgagga aggtaagtta ctggaatccc tgaactctca taaatgacca
     1621 ccccccaccc cgccctgtcc cactccctcc tccctgcctc tttagattct ttgagcaaat
     1681 atcccttccc aaaaggcaaa tctccctccc taaaggttag agtcctgtcc ggggcagaag
     1741 tctcccagga ggcgctttct ccttgaagca gccaaccctt gaactgcccc ccactttgcc
     1801 gagtctgttt ctggactcca ggcgagcttc ttagcccttc tctgggcacc aagctgtctg
     1861 cccaccaccc tagagcctcc ccactgcagc cgagtccagg cgctccatcc cagaccttca
     1921 cccactagac cttaaccggg ccctcccctt gcctctcccc agggaagacc ctgtgggagc
     1981 tggtgataga gcagtttgaa gacctcctgg tgcggattct cctcctggcc gcatgcattt
     2041 ccttcgtaag tgtgggaggg tctctggggg ctggctgggg gtgtgaggct gggatcgggc
     2101 gaatgcgggg ctcgcagtca ctggatcctc ccgtccgagt cccgagcatc ccattgtaca
     2161 gactggggcg ggctggcggc cagcagcggg tgtgattcgc gtcctcctct ctcctcccct
     2221 gcaccccaga ggcaggtttt attttaagct ttaagggtgt tctcagccaa aacaccgaag
     2281 ctaagccacc ctcgcggctt caagagcttg gagct
//
LOCUS       HSATP2A1S2               631 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exon 4.
ACCESSION   U96774
VERSION     U96774.1  GI:2052512
KEYWORDS    .
SEGMENT     2 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 631)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 631)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..631
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            214..318
                     /gene="ATP2A1"
                     /number=4
BASE COUNT      111 a    202 c    151 g    167 t
ORIGIN      
        1 acgcccatcc gctgagtaag gcggtggcag gacctgcagt ggatggacag accctcagac
       61 ggatggtggg ccacagcgcc ccgacggtgc ccggcccctc ctgctggctc ctgcactctc
      121 ctgcacagtt ctcccctttg cagtggtcca cttcctttct ccatctgttt tggggcctca
      181 ttacctgtca ttctcctttc ccctgctccc caggtgctgg cctggtttga ggaaggtgaa
      241 gagaccatca ctgcctttgt tgaacccttt gtcatcctct tgatcctcat tgccaatgcc
      301 atcgtggggg tttggcaggt tagcgttgac ccttccttac cccttcatgt cccaacagtg
      361 aagaagaggc caaccctccc tccagtctcc tcctcctcca tcacctcccc catacttgcc
      421 tcttcctctg gtcctatccc ctggtctgga atgggatgga gtgtggggaa gaggtgggag
      481 actgtgaccc actgtcactt cctggctatg tgaccctgag caagttcctt catccctctg
      541 agcctcagtt tcttcatcca taaaatgggg ctagcaatcc agtgtgaaat cgactaagat
      601 gatgcatgtt cctggcacac agtaggaatt c
//
LOCUS       HSATP2A1S3               555 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exon 5.
ACCESSION   U96775
VERSION     U96775.1  GI:2052513
KEYWORDS    .
SEGMENT     3 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 555)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 555)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..555
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            334..472
                     /gene="ATP2A1"
                     /number=5
BASE COUNT       97 a    163 c    162 g    133 t
ORIGIN      
        1 tttttttttt gtactttcgt agagacggac ggggtttcac catgttgctc caggctggtc
       61 tcgaactcct gacttcaagt gatcccgccc tgcctcggcc tcccagagtg ctgggattac
      121 agcgtgagtc accacgcccg gcctgatttt ctttgattct tctttgttcc ctccccgggg
      181 ccctctcacc tgttttcacc tgtaggtgac agtttcctca acatacacac acccctgcct
      241 gtgtggggtt ttgttgcctc ccccgtgcca ggagccacaa ctccataact gcctcctgtg
      301 tataaccctg cctcctccac cctgtctcct caggagcgga acgcagagaa cgccatcgag
      361 gccctgaagg agtatgagcc agagatgggg aaggtctacc gggctgaccg caagtcagtg
      421 caaaggatca aggctcggga catcgtccct ggggacatcg tggaggtggc tggtgagtga
      481 cagggacggc tggtccagat gggaggcctt ggggctgagg cctaggagat gccgggggct
      541 ggtcaggctc ggatc
//
LOCUS       HSATP2A1S4               919 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exon 6.
ACCESSION   U96776
VERSION     U96776.1  GI:2052514
KEYWORDS    .
SEGMENT     4 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 919)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 919)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..919
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            304..384
                     /gene="ATP2A1"
                     /number=6
BASE COUNT      247 a    231 c    270 g    171 t
ORIGIN      
        1 cacctgccca tcccttctca gggctcacag ccagtgggaa gagatgtggg gagttcaaga
       61 ccagcctgag caacagagcg agactccatc tcgccgtgga cacgggggca gaagggatgg
      121 gtgtcacgat gaggggttgc cagcttcagg agctcaagcc aagggcctca tgcaagccct
      181 gggagcctcc ctgagacctg aaggatggat gaggagaacg agtcagacac agattgggtt
      241 tttctttgga aggaagaaaa ttccattccc aagtgacctc cctcttccct actctctcca
      301 cagtggggga caaagtccct gcagacatcc gaatcctcgc catcaaatcc accacgctgc
      361 gggttgacca gtccatcctg acaggtctgc tggcctgggt gggaagatgc atgggggtgg
      421 gatgtgggga ggaagaggca acaaggggga ggtgagtgga aagacagaga accctcactg
      481 tctgagcaac ataaagagac cctgtctcta caaaaaaatt ttaaaaacta gctgagcatg
      541 ttgttgagtg cctgtagtcc tgactactca ggatgctgag gcaggaggat cgcctgggta
      601 caggaggtca aggcagcagt gagctgtgat cacaccactg cactccatcc tgggccaccc
      661 tgtctctgtc aaaataaaaa caaaaggctg tgcgaggtgg ctcatgcctg taatcccagc
      721 actttgggag gccgaggtgg gtggatcacc tgaggtcagg agttagagac cagcctggcc
      781 aacatggtga aaccccatct ctaccaaaaa atacaaaaaa cacaaaaatt agccaggcgt
      841 ggtggcaggc acctgtagtc ccagctactc aggaggctga tacagtagaa ctgcatgaac
      901 ccaggaggct gaggctgca
//
LOCUS       HSATP2A1S5              2290 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exons 7, 8 and
            9.
ACCESSION   U96777
VERSION     U96777.1  GI:2052515
KEYWORDS    .
SEGMENT     5 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2290)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 2290)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..2290
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            181..266
                     /gene="ATP2A1"
                     /number=7
     exon            414..711
                     /gene="ATP2A1"
                     /number=8
     exon            1499..1665
                     /gene="ATP2A1"
                     /number=9
BASE COUNT      467 a    689 c    556 g    578 t
ORIGIN      
        1 aaaaaaaagg agcaatggac ttcctagctg ccttcagtgg cttcttccag agtccagaca
       61 tccccaggat ggctacttgg tcttaccagg agctgttcgc tgagcaggga gagaggttag
      121 gcacggaagc ctgggagaag gacatgatgt catccgaaac ccttggcccc ttctccacag
      181 gcgagtctgt atctgtcatc aaacacacgg agcccgttcc tgacccccga gctgtcaacc
      241 aggacaagaa gaacatgctt ttctcggtga gcaatccggg accagccatc acacactcag
      301 tcaagccagg tgcccgggtt ggagaaacat ggcgtgtgag aagagatggc gtggggagat
      361 gcggcatgag ggtcacctct tgcctgattc cctgcctcct ctttccttcc cagggcacca
      421 acattgcagc cggcaaggcc ttgggcatcg tggccaccac cggtgtgggc accgagattg
      481 ggaagatccg agaccaaatg gctgccacag aacaggacaa gacccccttg cagcagaagc
      541 tggatgagtt tggggagcag ctctccaagg tcatctccct catctgtgtg gctgtctggc
      601 ttatcaacat tggccacttc aacgaccccg tccatggggg ctcctggttc cgcggggcca
      661 tctactactt taagattgcc gtggccttgg ctgtggctgc catccccgaa ggtatgaaag
      721 cctttctttt ctcctcccat ttgctaatcc catctgcaaa gacccctttt cttttctttt
      781 cttttctttt ttcttttttg ttttgagatg agtcttgctc tgtcacccag gctggagtgc
      841 agtggcgtga tctcggctca ctgcaagctc gcctcgcggg gttcacgcca ttctcctgcc
      901 tcagcctccc tagcagctgg gactacaggt gcacgccgcc atgcctggct aattttttta
      961 tatttttagt agatacgggg tttcaccgtg ttagccagga tggtctcgat ctcctgacct
     1021 cctgatctgc ccgccttggc ctcccaaagt gctgggatta caggcgtgag ccaccgcacc
     1081 cggccttgct cagtgcaacc tctgcctcct gggttcaagt gattcttctg cctctgcctc
     1141 ccaagtagct gggattacag gtgcacgcca ccatgcctgg ctaatttttg tgtttttgta
     1201 atctcgaact cctggattca aacaatcctc ttgtcccagc ctcctgggta actggtgcca
     1261 caggcaatca ccaccacacc cagctaattt ttaaatattt tcagaggagt ctcactgtgt
     1321 tttccaggct ggtctcaaac tcctggcctc aagtgatcct cccgcctcag cctccaaaag
     1381 tgctaggatt ataggtgtgc accaacgtgc cagctggggg ctactattta gccccttgca
     1441 ggttccctca caccctcccc ttggcaggtt ccctcacacc ctccctccct ccccacaggt
     1501 cttcctgcag tcatcaccac ctgcctggcc ctgggtaccc gtcggatggc aaagaagaat
     1561 gccattgtaa gaagcttgcc ctccgtagag accctgggct gcacctctgt catctgttcc
     1621 gacaagacag gcaccctcac caccaaccag atgtctgtct gcaaggtcag gagcagtgtg
     1681 ggcagcgcgc tcagtcagaa ggctgcctgt gggggttaaa tgggcctcca aagatagagg
     1741 tctcccttca tcactgctgg cttctcatct gggcctgcaa aatgctcaaa gggcagtaga
     1801 acgccatcct gtctgccatg aacgggatca gagaggaccc ttgtgcccca gccagacagc
     1861 tcactctgac caccatccac gcaccaggca ctgccacaca tcgtctctca tccctcacaa
     1921 tagcccattc agaggatggt ctcattatct tcattttttt taattaaaaa aaattattat
     1981 tatttttgag acagagtctc actctgtcgc ccaggctgga gtgcagtggt gtgatcttgg
     2041 ctcactgcaa cctctgcctc ccgggttcaa gcgattcatc tgcctcagcc tccctagtag
     2101 ctgggactac aggtgcctgc caccacgcct ggctaatttt tttgcatttg taatagagac
     2161 agggttcgcc atgttggcca ggctggtctt gaactcctgg cctcaagtga tgtgcccctg
     2221 ttcgcttccc aaagtgctgg gatgataggt gtgagccacg gcacccagcc taacttcctt
     2281 ttttagagga 
//
LOCUS       HSATP2A1S6              1186 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exons 10, 11
            and 12.
ACCESSION   U96778
VERSION     U96778.1  GI:2052516
KEYWORDS    .
SEGMENT     6 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1186)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 1186)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..1186
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            199..287
                     /gene="ATP2A1"
                     /number=10
     exon            550..652
                     /gene="ATP2A1"
                     /number=11
     exon            866..997
                     /gene="ATP2A1"
                     /number=12
BASE COUNT      217 a    352 c    340 g    277 t
ORIGIN      
        1 tgtttaaggc tgttgataga gaatgattgc aacagcttgg tgggaagtgg gtagggtata
       61 cgggggggat gaggaggggt gagtaggagg gaaatggtgg gaaggtaggt gttggcagtg
      121 caggcctccc ttgatgcagg tgcccacctt gggatgggaa gaggtggaca tctgtgtgcc
      181 tgcccttctc ccctgcagat gtttatcatt gacaaggtgg atggggacat ctgcctcctg
      241 aatgagttct ccatcaccgg ctccacttac gctccagagg gagaggtgta agtcacccag
      301 gcatcttctc cccagctcct cacgcccctt cccacacccc ctttctccct ggggccctcc
      361 atgtgagttt tgctctcctt tcactctcct cttcctcccc gaccttgttc tctctcctga
      421 tatccgagtt ggctctcccc actgtccttc cttaccctct gctgcctgct cttcctgtgc
      481 cctctctgca tctcattccc tgttcttctc cactgtctct gtcccttcct cccctgaccc
      541 tgctgccagc ttgaagaatg ataagccagt ccggccaggg cagtatgacg ggctggtgga
      601 gctggccacc atctgtgccc tctgcaatga ctcctccttg gacttcaacg aggtaacctc
      661 tccttcccct tccagttggc tcagagtctg gcctcctccg aaggccagga ggaaagggtt
      721 ggaggaaggg gacccagtac acccagccct ctgccagggt gcaagggagg cagtggtttg
      781 cttccttctt acgctaggtg gaaggacggt atgacaggta ggagcctggg gcaccgactt
      841 cctcttcctc ctctgcccat ctcaggccaa aggtgtctat gagaaggtcg gcgaggccac
      901 cgagacagca ctcaccaccc tggtggagaa gatgaatgtg ttcaacacgg atgtgagaag
      961 cctctcgaag gtggagagag ccaacgcctg caactcggtg agcctgcgga cgccctgcca
     1021 cagggccgtc tccactctat gctgcataga ctagaggaag gcagaggccc tggttgggca
     1081 gagcctagtg cctgtgctgg gaccccaccc aggcagcaga cagccccctg cagcttctcc
     1141 acaggctgat gtgggtgggt accattgtaa tggggatgaa tggcgg
//
LOCUS       HSATP2A1S7               772 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exons 13 and
            14.
ACCESSION   U96779
VERSION     U96779.1  GI:2052517
KEYWORDS    .
SEGMENT     7 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 772)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 772)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..772
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            103..228
                     /gene="ATP2A1"
                     /number=13
     exon            336..554
                     /gene="ATP2A1"
                     /number=14
BASE COUNT      160 a    227 c    213 g    172 t
ORIGIN      
        1 agacttatgt tagtctcagc cttagcttgg gtctgctctg acctccagat tcttttgacc
       61 attttaagag gactggtctc ccctccctgt ctcctctccc aggtgatccg ccagctaatg
      121 aagaaggaat tcaccctgga gttctcccga gacagaaagt ccatgtctgt ctattgctcc
      181 ccagccaaat cttcccgggc tgctgtgggc aacaagatgt ttgtcaaggt cagaaatcgg
      241 aatgtgcctc agccccctct tcttcctact cctagccacc tgtcactgcc ctggaaggaa
      301 agtggtggtc tctgaatgct gttctggtct cctagggtgc ccctgagggc gtcatcgacc
      361 gctgtaacta tgtgcgagtt ggcaccaccc gggtgccact gacggggccg gtgaaggaaa
      421 agatcatggc ggtgatcaag gagtggggca ctggccggga caccctgcgc tgcttggccc
      481 tggccacccg ggacaccccc ccgaagcgag aggaaatggt cctggatgac tctgccaggt
      541 tcctggagta tgaggtaagc agctgggagc ctcccactgt cgtgagctgg tgaagggccg
      601 ggtcccagcc atccactcac agctccacca cccggatcat ttcctacctc gtcagtcaag
      661 gggtgcagtg actcacacct gtaatctcag cactttggga agctgagatg ggaaaagcgc
      721 ttgagcccag gagttcaaga ccagcctggg caatgaagtg agaaccccat ct
//
LOCUS       HSATP2A1S8              1035 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exon 15.
ACCESSION   U96780
VERSION     U96780.1  GI:2052518
KEYWORDS    .
SEGMENT     8 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1035)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 1035)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..1035
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            432..767
                     /gene="ATP2A1"
                     /number=15
BASE COUNT      232 a    290 c    309 g    204 t
ORIGIN      
        1 aaaaattagc tggacatgtt ggtgcatgcc tgtagtccca gctacttggg agggtgaggt
       61 aggaggatcg cttgagcctg gtagattgag gctgcagtga gctatgatcg caccgttgta
      121 ccactgcact ccagcctggg tgacagagca ataccctgtc tcaaaaaaaa aaagatacga
      181 tctgacccat gatgggcctg gcaccataac ccgtgcaggt gttagatgaa tgagatgttt
      241 cctgcctctg agcttccaag gccccactga ggtctgacac caggccctga ggcgcaagcc
      301 cagggttcag gcttcccacc cctccccacc acttcctgac ctttcacccc atccccaccc
      361 cccaccagct tcctccaggg gagttttcca gatccccacc tgacctgtgg ctctctgctg
      421 tatctcccca gacggacctg acattcgtgg gtgtagtggg catgctggac cctccgcgca
      481 aggaggtcac gggctccatc cagctgtgcc gtgacgccgg gatccgggtg atcatgatca
      541 ctggggacaa caagggcaca gccattgcca tctgccggcg aattggcatc tttggggaga
      601 acgaggaggt ggccgatcgc gcctacacgg gccgagagtt cgacgacctg cccctggctg
      661 aacagcggga agcctgccga cgtgcctgct gcttcgcccg tgtggagccc tcgcacaagt
      721 ccaagattgt ggagtacctg cagtcctacg atgagatcac agccatggtg agagggccag
      781 gcagctgcag ccttagtgtc cacggagatg accagatgac tgtgctgggg agagtggggc
      841 ccagggaccc agagggctta agatgcaaga aggggtgggg attcagaccc caaggaagag
      901 tctgaaggag gatttgtggg ctgggctggc actttgggag gctgaggtgg gcggatcact
      961 tgagcaggat tcgagaccag cctggcaaca tagcaagacc tcatctctac tcaaacaaac
     1021 ttaaaataaa tttag
//
LOCUS       HSATP2A1S9              3040 bp    DNA     linear   PRI 10-JUL-2001
DEFINITION  Human Ca2+ ATPase of fast-twitch skeletal muscle sarcoplasmic
            reticulum adult and neonatal isoforms (ATP2A1) gene, exons 16 to 23
            and complete cds.
ACCESSION   U96781
VERSION     U96781.1  GI:2052519
KEYWORDS    .
SEGMENT     9 of 9
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3040)
  AUTHORS   Zhang,Y., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G., Yee,W.C.,
            Schrank,B., Cornblath,D.R., Boylan,K.B. and MacLennan,D.H.
  TITLE     Characterization of cDNA and genomic DNA encoding SERCA1, the
            Ca(2+)-ATPase of human fast-twitch skeletal muscle sarcoplasmic
            reticulum, and its elimination as a candidate gene for Brody
            disease
  JOURNAL   Genomics 30 (3), 415-424 (1995)
  MEDLINE   96423024
   PUBMED   8825625
REFERENCE   2  (bases 1 to 3040)
  AUTHORS   Zhang,Y.L., Fujii,J., Phillips,M.S., Chen,H.S., Karpati,G.,
            Yee,W.C., Schrank,B., Cornblath,D.R., Boylan,K.B. and
            MacLennan,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1997) Banting and Best Department of Medical
            Research, University of Toronto, 112 College Street, Toronto,
            Ontario M5G 1L6, Canada
FEATURES             Location/Qualifiers
     source          1..3040
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            join(U96773.1:1140..2315,U96774.1:1..631,U96775.1:1..555,
                     U96776.1:1..919,U96777.1:1..2290,U96778.1:1..1186,
                     U96779.1:1..772,U96780.1:1..1035,1..>2588)
                     /gene="ATP2A1"
     mRNA            join(U96773.1:1140..1257,U96773.1:1565..1582,
                     U96773.1:1963..2045,U96774.1:214..318,U96775.1:334..472,
                     U96776.1:304..384,U96777.1:181..266,U96777.1:414..711,
                     U96777.1:1499..1665,U96778.1:199..287,U96778.1:550..652,
                     U96778.1:866..997,U96779.1:103..228,U96779.1:336..554,
                     U96780.1:432..767,227..447,550..752,946..1031,1144..1277,
                     1395..1512,1687..1804,2065..2106,2529..>2588)
                     /gene="ATP2A1"
                     /product="Ca2+ ATPase of fast-twitch skeletal muscle
                     sacroplasmic reticulum, neonatal isoform"
                     /note="SERCA1b"
     mRNA            join(U96773.1:1140..1257,U96773.1:1565..1582,
                     U96773.1:1963..2045,U96774.1:214..318,U96775.1:334..472,
                     U96776.1:304..384,U96777.1:181..266,U96777.1:414..711,
                     U96777.1:1499..1665,U96778.1:199..287,U96778.1:550..652,
                     U96778.1:866..997,U96779.1:103..228,U96779.1:336..554,
                     U96780.1:432..767,227..447,550..752,946..1031,1144..1277,
                     1395..1512,1687..1804,2529..>2588)
                     /gene="ATP2A1"
                     /product="Ca2+ ATPase of fast-twitch skeletal muscle
                     sacroplasmic reticulum, adult isoform"
                     /note="SERCA1a"
     CDS             join(U96773.1:1140..1257,U96773.1:1565..1582,
                     U96773.1:1963..2045,U96774.1:214..318,U96775.1:334..472,
                     U96776.1:304..384,U96777.1:181..266,U96777.1:414..711,
                     U96777.1:1499..1665,U96778.1:199..287,U96778.1:550..652,
                     U96778.1:866..997,U96779.1:103..228,U96779.1:336..554,
                     U96780.1:432..767,227..447,550..752,946..1031,1144..1277,
                     1395..1512,1687..1804,2529..2554)
                     /gene="ATP2A1"
                     /note="SERCA1a"
                     /codon_start=1
                     /product="Ca2+ ATPase of fast-twitch skeletal muscle
                     sacroplasmic reticulum, adult isoform"
                     /protein_id="AAB53113.1"
                     /db_xref="GI:2052522"
                     /translation="MEAAHAKTTEECLAYFGVSETTGLTPDQVKRNLEKYGLNELPAE
                     EGKTLWELVIEQFEDLLVRILLLAACISFVLAWFEEGEETITAFVEPFVILLILIANA
                     IVGVWQERNAENAIEALKEYEPEMGKVYRADRKSVQRIKARDIVPGDIVEVAVGDKVP
                     ADIRILAIKSTTLRVDQSILTGESVSVIKHTEPVPDPRAVNQDKKNMLFSGTNIAAGK
                     ALGIVATTGVGTEIGKIRDQMAATEQDKTPLQQKLDEFGEQLSKVISLICVAVWLINI
                     GHFNDPVHGGSWFRGAIYYFKIAVALAVAAIPEGLPAVITTCLALGTRRMAKKNAIVR
                     SLPSVETLGCTSVICSDKTGTLTTNQMSVCKMFIIDKVDGDICLLNEFSITGSTYAPE
                     GEVLKNDKPVRPGQYDGLVELATICALCNDSSLDFNEAKGVYEKVGEATETALTTLVE
                     KMNVFNTDVRSLSKVERANACNSVIRQLMKKEFTLEFSRDRKSMSVYCSPAKSSRAAV
                     GNKMFVKGAPEGVIDRCNYVRVGTTRVPLTGPVKEKIMAVIKEWGTGRDTLRCLALAT
                     RDTPPKREEMVLDDSARFLEYETDLTFVGVVGMLDPPRKEVTGSIQLCRDAGIRVIMI
                     TGDNKGTAIAICRRIGIFGENEEVADRAYTGREFDDLPLAEQREACRRACCFARVEPS
                     HKSKIVEYLQSYDEITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKTASEMVLADDNF
                     STIVAAVEEGRAIYNNMKQFIRYLISSNVGEVVCIFLTAALGLPEALIPVQLLWVNLV
                     TDGLPATALGFNPPDLDIMDRPPRSPKEPLISGWLFFRYMAIGGYVGAATVGAAAWWF
                     LYAEDGPHVNYSQLTHFMQCTEDNTHFEGIDCEVFEAPEPMTMALSVLVTIEMCNALN
                     SLSENQSLLRMPPWVNIWLLGSICLSMSLHFLILYVDPLPMIFKLRALDLTQWLMVLK
                     ISLPVIGLDEILKFVARNYLEDPEDERRK"
     CDS             join(U96773.1:1140..1257,U96773.1:1565..1582,
                     U96773.1:1963..2045,U96774.1:214..318,U96775.1:334..472,
                     U96776.1:304..384,U96777.1:181..266,U96777.1:414..711,
                     U96777.1:1499..1665,U96778.1:199..287,U96778.1:550..652,
                     U96778.1:866..997,U96779.1:103..228,U96779.1:336..554,
                     U96780.1:432..767,227..447,550..752,946..1031,1144..1277,
                     1395..1512,1687..1804,2065..2069)
                     /gene="ATP2A1"
                     /note="SERCA1b"
                     /codon_start=1
                     /product="Ca2+ ATPase of fast-twitch skeletal muscle
                     sacroplasmic reticulum, neonatal isoform"
                     /protein_id="AAB53112.1"
                     /db_xref="GI:2052521"
                     /translation="MEAAHAKTTEECLAYFGVSETTGLTPDQVKRNLEKYGLNELPAE
                     EGKTLWELVIEQFEDLLVRILLLAACISFVLAWFEEGEETITAFVEPFVILLILIANA
                     IVGVWQERNAENAIEALKEYEPEMGKVYRADRKSVQRIKARDIVPGDIVEVAVGDKVP
                     ADIRILAIKSTTLRVDQSILTGESVSVIKHTEPVPDPRAVNQDKKNMLFSGTNIAAGK
                     ALGIVATTGVGTEIGKIRDQMAATEQDKTPLQQKLDEFGEQLSKVISLICVAVWLINI
                     GHFNDPVHGGSWFRGAIYYFKIAVALAVAAIPEGLPAVITTCLALGTRRMAKKNAIVR
                     SLPSVETLGCTSVICSDKTGTLTTNQMSVCKMFIIDKVDGDICLLNEFSITGSTYAPE
                     GEVLKNDKPVRPGQYDGLVELATICALCNDSSLDFNEAKGVYEKVGEATETALTTLVE
                     KMNVFNTDVRSLSKVERANACNSVIRQLMKKEFTLEFSRDRKSMSVYCSPAKSSRAAV
                     GNKMFVKGAPEGVIDRCNYVRVGTTRVPLTGPVKEKIMAVIKEWGTGRDTLRCLALAT
                     RDTPPKREEMVLDDSARFLEYETDLTFVGVVGMLDPPRKEVTGSIQLCRDAGIRVIMI
                     TGDNKGTAIAICRRIGIFGENEEVADRAYTGREFDDLPLAEQREACRRACCFARVEPS
                     HKSKIVEYLQSYDEITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKTASEMVLADDNF
                     STIVAAVEEGRAIYNNMKQFIRYLISSNVGEVVCIFLTAALGLPEALIPVQLLWVNLV
                     TDGLPATALGFNPPDLDIMDRPPRSPKEPLISGWLFFRYMAIGGYVGAATVGAAAWWF
                     LYAEDGPHVNYSQLTHFMQCTEDNTHFEGIDCEVFEAPEPMTMALSVLVTIEMCNALN
                     SLSENQSLLRMPPWVNIWLLGSICLSMSLHFLILYVDPLPMIFKLRALDLTQWLMVLK
                     ISLPVIGLDEILKFVARNYLEG"
     exon            227..447
                     /gene="ATP2A1"
                     /number=16
     exon            550..752
                     /gene="ATP2A1"
                     /number=17
     exon            946..1031
                     /gene="ATP2A1"
                     /number=18
     exon            1144..1277
                     /gene="ATP2A1"
                     /number=19
     exon            1395..1512
                     /gene="ATP2A1"
                     /number=20
     exon            1687..1804
                     /gene="ATP2A1"
                     /number=21
     exon            2065..2106
                     /gene="ATP2A1"
                     /number=22
     exon            2529..>2588
                     /gene="ATP2A1"
                     /number=23
BASE COUNT      505 a   1054 c    853 g    628 t
ORIGIN      
        1 tgtgccgttc tttgttctaa acctgagttt tttggccaag tgttggtggg cacatgcctg
       61 taagttccca aacactttgg gaggctgagg caggaggatt gcttgagccc aggagttcaa
      121 gaccagcctg ggcaacagag tgagacctca tccctaaaat aaaacctttt ttaaaaagga
      181 gggatgtgtg aaggtgccct aagcccacct tctcctcctc cctcagacag gtgatggcgt
      241 caatgacgcc cctgccctga agaaggctga gattggcatt gccatgggat ctggcactgc
      301 cgtggccaag actgcctctg agatggtgct ggctgacgac aacttctcca ccatcgtagc
      361 tgctgtggag gagggccgcg ccatctacaa caacatgaag cagttcatcc gctacctcat
      421 ttcctccaac gtgggcgagg tggtctggtg agcagctggg tgggcgtcca ggaggaagcc
      481 ggggttaggg tggggtggct gcaggtctgg gaggcaggac agaggcgtga ccacctcctt
      541 ccccaccagt atcttcctga ccgctgccct ggggctgcct gaggccctga tcccggtgca
      601 gctgctatgg gtgaacttgg tgaccgacgg gctcccagcc acagccctgg gcttcaaccc
      661 accagacctg gacatcatgg accgcccccc ccggagcccc aaggagcccc tcatcagtgg
      721 ctggctcttc ttccgctaca tggcaatcgg gggtgagctg gaggggttcc tcgatcctcc
      781 ccaccccttg ggactaaccc cctctctggg acaccagctc ccccatgcag gtgctgagag
      841 ggtcttcttc cttggccagc ctgtccatgg ccacatgagg ccctcaaccc tcgatgcccc
      901 ctatctcccc agccctgacc cccgactccc ctctctccac cacaggctat gtgggtgcag
      961 ccaccgtggg agcagctgcc tggtggttcc tgtacgctga ggatgggcct catgtcaact
     1021 acagccagct ggtaggggga ggccacaaag gaggggacca ggaggggggg gggatgcagg
     1081 agggtaccag gagggtggca tggaggtggc cctggacctc agtctcccgt accttccctg
     1141 cagactcact tcatgcagtg caccgaggac aacacccact ttgagggcat agactgtgag
     1201 gtcttcgagg cccccgagcc catgaccatg gccctgtccg tgctggtgac catcgagatg
     1261 tgcaatgcac tgaacaggtg ggggcccccc agctacaccc accaccctcc cctgaggcca
     1321 ctgcccacat cctccactgt gccgcccacc tccttctcct cactgttcct tctccctccc
     1381 cttcccctct gcagcctgtc cgagaaccag tccctgctgc ggatgccacc ctgggtgaac
     1441 atctggctgc tgggctccat ctgcctctcc atgtccctgc acttcctcat cctctatgtt
     1501 gaccccctgc cggtgaggtt tcttccgccc agggccgccc accccagcac tggggagccc
     1561 acgggcgccc atgaccactc ccaccagggg cgccgatgtg ggaggctggt gggagtgggc
     1621 tgggcagtgc tggtctctgg ctccctcccc accccctcct gagagggcgc ttgtcccctg
     1681 ccccagatga tcttcaagct ccgggccctg gacctcaccc agtggctcat ggtcctcaag
     1741 atctcactgc cagtcattgg gctcgacgaa atcctcaagt tcgttgctcg gaactaccta
     1801 gagggtaagg agtgccctct ctgtcccaag ccctggcccc accacagccc cttccccatg
     1861 acgtccgtcc cccgtccccg ccccgtactt tgcaggtggt aagtttctca gccctggcag
     1921 gacctgtgtc cgccccgttc cccctgcgcc tgcaggccac atctccgggg cagccccact
     1981 gcctcctcag cccccacagc ccctatagcc cccatgccac ctccctgcct tgataacagt
     2041 gcctcttgtc ctctctggcc ataggataac tgttccccct cctccatctc tgagcccgtg
     2101 tcacaggtat cacccccttc ttgccctcag cccagctgct gtgcccctgc cacccgcgcc
     2161 ccctcagccc ctttcgcgtc gcatccaagg tcacttgtgc tcgcagctcc acctggagcc
     2221 gttgccactg ctgctgctgc tcttccagtc agggtgggcc gctggctccc actcgggcgt
     2281 cagtttggct cccaggccct gggcagtgcc agcctctggg cccgtctgct gcgctgcgtt
     2341 gcgctggctg tgtgctgggc tgcagtgggg gggggcgggt gtctggggac gcaggtgagt
     2401 aggggagaaa ctggcagggt ggtaagcttc tgagcctcca ggtaagtgcg tgcctgggag
     2461 atgcacctgg gagggacctc gctgccctcc tgccgcctgc ctcatccctt ctcttttccc
     2521 ctttccagat ccagaagatg aaagaaggaa gtgagcatcc ttttgctctg tcctccccac
     2581 cccgatagtg acacatcttc aggcagagct gtggcacaga cccccgtcct gtcccccaca
     2641 cccgtgtcat gtgtctgttt ataaacatgt ccccttccct ttccttcccc ctcggccacc
     2701 cgcctccctc tcaaccttgt aaattcccct tcccaacccc gaggggcttg cagggacaag
     2761 gcgaccgact gcgctgagct gcttatttat tgaaaataaa cgacggaaaa gtctggcctt
     2821 gcctctgtgc aagcttggag gcctgggtcg ccgctgtgga caagcgtctt agtgtcatgc
     2881 agaccagaag gcagctgctg tcccagggcc ggggcacctc actgcctctg atggggactc
     2941 cagcccccat ggctccgctg ttgccctggg gcaggggacg ggctgggggg caggggaggg
     3001 ctggagccca ggaggcagca cagcagccag aaagcaggcc 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Z18948. H.sapiens mRNA fo...[gi:396712] Links  


LOCUS       HSS100E                  738 bp    mRNA    linear   PRI 11-MAY-1994
DEFINITION  H.sapiens mRNA for S100E calcium binding protein.
ACCESSION   Z18948
VERSION     Z18948.1  GI:396712
KEYWORDS    calcium binding protein; calcium binding protein S100E.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Engelkamp,D., Schafer,B.W., Mattei,M.G., Erne,P. and Heizmann,C.W.
  TITLE     Six S100 genes are clustered on human chromosome 1q21:
            identification of two genes coding for the two previously
            unreported calcium-binding proteins S100D and S100E
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 90 (14), 6547-6551 (1993)
  MEDLINE   93342029
REFERENCE   2  (bases 1 to 738)
  AUTHORS   Engelkamp,D.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-1992) Dieter Engelkamp, Pediatrics, Division of
            Clinical Chemistry, University of Zurich, Steinwiesstrasse 75,
            Zurich, CH-8032, Switzerland
FEATURES             Location/Qualifiers
     source          1..738
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="1q21"
                     /clone="S100E"
                     /tissue_type="heart"
     exon            1..78
     exon            79..224
     CDS             84..389
                     /codon_start=1
                     /product="S100E calcium binding protein"
                     /protein_id="CAA79471.1"
                     /db_xref="GI:396713"
                     /db_xref="SWISS-PROT:P33764"
                     /translation="MARPLEQAVAAIVCTFQEYAGRCGDKYKLCQAELKELLQKELAT
                     WTPTEFRECDYNKFMSVLDTNKDCEVDFVEYVRSLACLCLYCHEYFKDCPSEPPCSQ"
     exon            225..738
     polyA_signal    723..728
BASE COUNT      151 a    220 c    212 g    155 t
ORIGIN      
        1 agtctcagat tggtaaacac ccgaactggt caactctcaa gagaccatct ggttcaggtt
       61 cctgactggg ccagcgagtg aggatggcca ggcctctgga gcaggcggta gctgccatcg
      121 tgtgcacctt ccaggaatac gcagggcgct gtggggacaa atacaagctc tgccaggcgg
      181 agctcaagga gctgctgcag aaggagctgg ccacctggac cccgactgag tttcgggaat
      241 gtgactacaa caaattcatg agtgttctgg acaccaacaa ggactgcgag gtggactttg
      301 tggagtatgt gcgctcactt gcctgcctct gtctctactg ccacgagtac ttcaaggact
      361 gcccctcaga gcccccctgc tcccagtagc ctctgctcca gggggtgcgc tggctgtcgg
      421 gggctgggca tgtctcccac accccctcct accctctctc ctgtacccct ttcaatctgg
      481 acttgcccag gtcttctgcg atcagttaac ccattttacc taggaggccc agagatgtga
      541 gggctccttc ctcaggatgc ccagcgaatg aggggtagag ccactctggg gcccagcctg
      601 cctgccgcac ccctgtggcc tcccttgtgg atgggaggag gcgggatctg ctctgaggcc
      661 ctcgaggctc agcagagcgt gcaccaatga gaccacgatg ggaaagggcc tatttaactc
      721 ctaataaaaa actggcat
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L11690. Human bullous 230...[gi:402479] Links  


LOCUS       HUMBPAG1B               8684 bp    mRNA    linear   PRI 07-NOV-1994
DEFINITION  Human bullous 230 kDa pemphigoid antigen (BPAG1) mRNA, complete
            cds.
ACCESSION   L11690
VERSION     L11690.1  GI:402479
KEYWORDS    bullous pemphigoid antigen.
SOURCE      Homo sapiens cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Elgart,G.W. and Stanley,J.R.
  TITLE     Cloning of the 5' mRNA for the 230-kD bullous pemphigoid antigen by
            rapid amplification of cDNA ends
  JOURNAL   J. Invest. Dermatol. 101 (2), 244-246 (1993)
  MEDLINE   93346806
   PUBMED   8345227
COMMENT     Ref [1] reports bp 1-1822.
FEATURES             Location/Qualifiers
     source          1..8684
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="6p12-p11"
                     /cell_type="keratinocyte"
     gene            1..8684
                     /gene="BPAG1"
     CDS             103..8052
                     /gene="BPAG1"
                     /codon_start=1
                     /product="bullous pemphigoid antigen"
                     /protein_id="AAA52288.1"
                     /db_xref="GI:403124"
                     /db_xref="GDB:G00-125-207"
                     /translation="MHSSSYSYRSSDSVFSNTTSTRTSLDSNENLLLVHCGPTLINSC
                     ISFGSESFDGHRLEMLQQIANRVQRDSVICEDKLILAGNALQSDSKRLESGVQFQNEA
                     EIAGYILECENLLRQHVIDVQILIDGKYYQADQLVQRVAKLRDEIMALRNECSSVYSK
                     GRILTTEQTKLMISGITQSLNSGFAQTLHPSLTSGLTQSLTPSLTSSSMTSGLSSGMT
                     SRLTPSVTPAYTPGFPSGLVPNFSSGVEPNSLQTLKLMQIRKPLLKSSLLDQNLTEEE
                     INMKFVQDLLNWVDEMQVQLDRTEWGSDLPSVESHLENHKNVHRAIEEFESSLKEAKI
                     SEIQMTAPLKLTYAEKLHRLESQYAKLLNTSRNQERHLDTLHNFVSRATNELIWLNEK
                     EEEEVAYDWSERNTNIARKKDYHAELMRELDQKEENIKSVQEIAEQLLLENHPARLTI
                     EAYRAAMQTQWSWILQLCQCVEQHIKENTAYFEFFNDAKEATDYLRNLKDAIQRKYSC
                     DRSSSIHKLEDLVQESMEEKEELLQYKSTIANLMGKAKTIIQLKPRNSDCPLKTSIPI
                     KAICDYRQIEITIYKDDECVLANNSHRAKWKVISPTGNEAMVPSVCFTVPPPNKEAVD
                     LANRIEQQYQNVLTLWHESHINMKSVVSWHYLINEIDRIRASNVASIKTMLPGEHQQV
                     LSNLQSRFEDFLEDSQESQVFSGSDITQLEKEVNVCKQYYQELLKSAEREEQEESVYN
                     LYISEVRNIRLRLENCEDRLIRQIRTPLERDDLHESVFRITEQEKLKKELERLKDDLG
                     TITNKCEEFFSQAAASSSVPTLRSELNVVLQNMNQVYSMSSTYIDKLKTVNLVLKNTQ
                     AAEALVKLYETKLCEEEAVIADKNNIENLISTLKQWRSEVDEKRQVFHALEDELQKAK
                     AISDEMFKTYKERDLDFDWHKEKADQLVERWQNVHVQIDNRLRDLEGIGKSLKYYRDT
                     YHPLDDWIQQVETTQRKIQENQPENSKTLATQLNQQKMLVSEIEMKQSKMDECQKYAE
                     QYSATVKDYELQTMTYRAMVDSQQKSPVKRRRMQSSADLIIQEFMDLRTRYTALVTLM
                     TQYIKFAGDSLKRLEEEEIKRCKETSEHGAYSDLLQRQKATVLENSKLTGKISELERM
                     VAELKKQKSRVEEELPKVREAAENELRKQQRNVEDISLQKIRAESEAKQYRRELETIV
                     REKEAAERELERVRQLTIEAEAKRAAVEENLLNFRNQLEENTFTRRTLEDHLKRKDLS
                     LNDLEQQKNKLMEELRRKRDNEEELLKLIKQMEKDLAFQKQVAEKQLKEKQKIELEAR
                     RKITEIQYTCRENALPVCPITQATSCRAVTGLQQEHDKQKAEELKQQVDELTAANRKA
                     EQDMRELTYELNALQLEKTSSEEKARLLKDKLDETNNTLRCLKLELERKDQAEKGYSQ
                     QLRELGRQLNQTTGKAEEAMQEASDLKKIKRNYQLELESLNHEKGKLQREVDRITRAH
                     AVAEKNIQHLNSQIHSFRDEKELERLQICQRKSDHLKEQFEKSHEQLLQNIKAEKENN
                     DKIQRLNEELEKSNECAEMLKQKVEELTRQNNETKLMMQRIQAESENIVLEKQTIQQR
                     CEALKIQADGFKDQLRSTNEHLHKQTKTEQDFQRKIKCLEEDLAKSQNLVSEFKQKCD
                     QQNIIIQNTKKEVRNLNAELNASKEEKRRGEQKVQLQQAQVQELNNRLKKVQDELHLK
                     TIEEQMTHRKMVLFQEESGKFKQSAEEFRKKMEKLMESKVITENDISGIRLDFVSLQQ
                     ENSRAQENAKLCETNIKELERQLQQYREQMQQGQHMEANHYQKCQKLEDELIAQKREV
                     ENLKQKMDQQIKEHEHQLVLLQCEIQKKSTAKDCTFKPDFEMTVKECQHSGELSSRNT
                     GHLHPTPRSPLLRWTQEPQPLEEKWQHRVVEQIPKEVQFQPPGAPLEKEKSQQCYSEY
                     FSQTSTELQITFDETNPITRLSEIEKIRDQALNNSRPPVRYQDNACEMELVKVLTPLE
                     IAKNKQYDMHTEVTTLKQEKNPVPSAEEWMLEGCRASGGLKKGDFLKKGLEPETFQNF
                     DGDHACSVRDDEFKFQGLRHTVTARQLVEAKLLDMRTIEQLRLGLKTVEEVQKTLNKF
                     LTKATSIAGLYLESTKEKISFASAAERIIIDKMVALAFLEAQAATGFIIDPISGQTYS
                     VEDAVLKGVVDPEFRIRLLEAEKAAVGYSYSSKTLSVFQAMENRMLDRQKGKHILEAQ
                     IASGGVIDPVRGIRVPPEIALQQGLLNNAILQFLHEPSSNTRVFPNPNNKQALYYSEL
                     LRMCVFDVESQCFLFPFGERNISNLNVKKTHRISVVDTKTGSELTVYEAFQRNLIEKS
                     IYLELSGQQYQWKEAMFFESYGHSSHMLTDTKTGLHFNINEAIEQGTIDKALVKKYQE
                     GLITLTELADSLLSRLVPKKDLHSPVAGYWLTASGERISVLKASRRNLVDRITALRCL
                     EAQVSTGGIIDPLTGKKYRVAEALHRGLVDEGFAQQLRQCELVITGIGHPITNKMMSV
                     VEAVNANIINKEMGIRCLEFQYLTGGLIEPQVHSRLSIEEALQVGIIDVLIATKLKDQ
                     KSYVRNIICPQTKRKLTYKEALEKADFDFHTGLKLLEVSEPLMTGISSLYYSS"
BASE COUNT     3067 a   1571 c   1918 g   2128 t
ORIGIN      
        1 gagctgccac ttttcaccgt tagaagtaga gctttttcca gacctcctac cttttagtct
       61 actttgaaag gtgaaagaaa gaacatcgtt tcaggaataa aaatgcacag tagtagttat
      121 agttaccgta gcagtgattc tgtgtttagt aacactacca gcactcgaac cagtcttgat
      181 tcaaatgaaa atcttctctt ggttcattgt ggtccaacac tgatcaactc ttgcattagc
      241 ttcggtagtg aatcctttga tggacacagg ttagaaatgt tgcaacagat tgccaacaga
      301 gttcagaggg acagtgtcat ctgtgaagac aaactgattc ttgctggaaa tgctcttcag
      361 tctgattcta aaagattaga atcaggagtg cagtttcaga atgaagcaga aattgctggg
      421 tatatacttg aatgtgagaa ccttttacgc cagcatgtaa ttgatgtaca gattcttatt
      481 gatggaaaat actaccaggc agatcaattg gtacagaggg ttgcaaaact gcgtgacgaa
      541 attatggcct taaggaacga atgttcttct gtgtacagca aaggacgcat actgacaaca
      601 gaacagacaa agctcatgat atcaggaatc actcaaagtt taaactcagg atttgcacag
      661 accttacacc ctagtctgac ctcagggctg acccagagtt taacaccttc cctaacctct
      721 tctagtatga cttctggcct gtcatcaggg atgacttccc gcctgactcc atctgtcact
      781 ccagcttata cacctggttt cccatcagga ttagttccaa atttcagttc aggagtagag
      841 ccaaattcat tgcaaacttt gaagttgatg cagatccgaa aaccccttct aaagtcttct
      901 ttgctggatc aaaatttaac agaagaagaa atcaatatga aatttgttca ggatcttttg
      961 aattgggttg atgagatgca ggtacaactg gaccgcactg agtggggctc agatttgcca
     1021 agtgttgaaa gccatttaga aaatcataaa aatgttcata gagctattga agaatttgaa
     1081 tctagtctca aagaagctaa aatcagtgag attcaaatga cagcacctct taaactgact
     1141 tatgcagaaa agttgcacag attagagagt cagtatgcaa aactcttgaa tacatccagg
     1201 aatcaagaac ggcaccttga tacactccat aattttgtaa gtcgtgcgac taatgaactt
     1261 atttggttga atgaaaaaga agaggaggaa gttgcttatg actggagtga gagaaacacc
     1321 aacatagcta ggaaaaaaga ttatcatgct gaattaatga gagaacttga tcaaaaggaa
     1381 gaaaatatta aatcagttca ggagatagca gagcagctac ttctagaaaa tcatccagcc
     1441 cggttaacta ttgaggccta cagagcggca atgcagacgc agtggagctg gatcttacag
     1501 ctctgccagt gtgtggagca gcacataaag gagaacacag cgtatttcga gtttttcaat
     1561 gatgccaaag aagctactga ttacttaagg aatctaaaag atgccattca gcggaagtac
     1621 agctgtgata gatcaagcag cattcacaag ctagaagacc ttgttcagga atcaatggaa
     1681 gagaaagaag aacttctgca gtacaaaagc actatagcaa acctaatggg aaaagcaaaa
     1741 acaataattc aactgaagcc aaggaattct gactgtccac tcaaaacttc tattccgatc
     1801 aaagctatct gtgactacag acaaattgag ataaccattt acaaagacga tgaatgtgtt
     1861 ttggcgaata actctcatcg tgctaaatgg aaggtcatta gtcctactgg gaatgaggct
     1921 atggtcccat ctgtgtgctt caccgttcct ccaccaaaca aagaagcggt ggaccttgcc
     1981 aacagaattg agcaacagta tcagaatgtc ctgactcttt ggcatgagtc tcacataaac
     2041 atgaagagtg tagtatcctg gcattatctc atcaatgaaa ttgatagaat tcgagctagc
     2101 aatgtggctt caataaagac aatgctacct ggtgaacatc agcaagttct aagtaatcta
     2161 caatctcgtt ttgaagattt tctggaagat agccaggaat cccaagtctt ttcaggctca
     2221 gatataacac aactggaaaa ggaggttaat gtatgtaagc agtattatca agaacttctt
     2281 aaatctgcag aaagagagga gcaagaggaa tcagtttata atctctacat ctctgaagtt
     2341 cgaaacatta gacttcggtt agagaactgt gaagatcggc tgattagaca gattcgaact
     2401 cccctggaaa gagatgattt gcatgaaagt gtgttcagaa tcacagaaca ggagaaacta
     2461 aagaaagagc tggaacgact taaagatgat ttgggaacaa tcacaaataa gtgtgaggag
     2521 tttttcagtc aagcagcagc ctcttcatca gtccctaccc tacgatcaga gcttaatgtg
     2581 gtccttcaga acatgaacca agtctattct atgtcttcca cttacataga taagttgaaa
     2641 actgttaact tggtgttaaa aaacactcaa gctgcagaag ccctcgtaaa actctatgaa
     2701 actaaactgt gtgaagaaga agcagttata gctgacaaga ataatattga gaatctaata
     2761 agtactttaa agcaatggag atctgaagta gatgaaaaga gacaggtatt ccatgcctta
     2821 gaggatgagt tgcagaaagc taaagccatc agtgatgaaa tgtttaaaac gtataaagaa
     2881 cgggaccttg attttgactg gcacaaagaa aaagcagatc aattagttga aaggtggcaa
     2941 aatgttcatg tgcagattga caacaggtta cgggacttag agggcattgg caaatcactg
     3001 aagtactaca gagacactta ccatccttta gatgattgga tccagcaggt tgaaactact
     3061 cagagaaaga ttcaggaaaa tcagcctgaa aatagtaaaa ccctagccac acagttgaat
     3121 caacagaaga tgctggtgtc cgaaatagaa atgaaacaga gcaaaatgga cgagtgtcaa
     3181 aaatatgcag aacagtactc agctacagtg aaggactatg aattacaaac aatgacctac
     3241 cgggccatgg tagattcaca acaaaaatct ccagtgaaac gccgaagaat gcagagttca
     3301 gcagatctca ttattcaaga gttcatggac ctaaggactc gatatactgc cctggtcact
     3361 ctcatgacac aatatattaa atttgctggt gattcattga agaggctgga agaggaggag
     3421 attaaaaggt gtaaggagac ttctgaacat ggggcatatt cagatctgct tcagcgtcag
     3481 aaggcaacag tgcttgagaa tagcaaactt acaggaaaga taagtgagtt ggaaagaatg
     3541 gtagctgaac taaagaaaca aaagtcccga gtagaggaag aacttccgaa ggtcagggag
     3601 gctgcagaaa atgaattgag aaagcagcag agaaatgtag aagatatctc tctgcagaag
     3661 ataagggctg aaagtgaagc caagcagtac cgcagggaac ttgaaaccat tgtgagagag
     3721 aaggaagccg ctgaaagaga actggagcgg gtgaggcagc tcaccataga ggccgaggct
     3781 aaaagagctg ccgtggaaga gaacctcctg aattttcgca atcagttgga ggaaaacacc
     3841 tttaccagac gaacactgga agatcatctt aaaagaaaag atttaagtct caatgatttg
     3901 gagcaacaaa aaaataaatt aatggaagaa ttaagaagaa agagagacaa tgaggaagaa
     3961 ctcttgaagc tgataaagca gatggaaaaa gaccttgcat ttcagaaaca ggtagcagag
     4021 aaacagttga aagaaaagca gaaaattgaa ttggaagcaa gaagaaaaat aactgaaatt
     4081 cagtatacat gtagagaaaa tgcattgcca gtgtgtccga tcacacaggc tacatcatgc
     4141 agggcagtaa cgggtctcca gcaagaacat gacaagcaga aagcagaaga actcaaacag
     4201 caggtagatg aactaacagc tgccaataga aaggctgaac aagacatgag agagctgaca
     4261 tatgaactta atgccctcca gcttgaaaaa acgtcatctg aggaaaaggc tcgtttgcta
     4321 aaagataaac tagatgaaac aaataataca ctcagatgcc ttaagttgga gctggaaagg
     4381 aaggatcagg cggagaaagg gtattctcaa caactcagag agcttggtag gcaattgaat
     4441 caaaccacag gtaaagctga agaagccatg caagaagcta gtgatctcaa gaaaataaag
     4501 cgcaattatc agttagaatt agaatctctt aatcatgaaa aagggaaact acaaagagaa
     4561 gtagacagaa tcacaagggc acatgctgta gctgagaaga atattcagca tttaaattca
     4621 caaattcatt cttttcgaga tgagaaagaa ttagaaagac tacaaatctg ccagagaaaa
     4681 tcagatcatc taaaagaaca atttgagaaa agccatgagc agttgcttca aaatatcaaa
     4741 gctgaaaaag aaaataatga taaaatccaa aggctcaatg aagaattgga gaaaagtaat
     4801 gagtgtgcag agatgctaaa acaaaaagta gaggagctta ctaggcagaa taatgaaacc
     4861 aaattaatga tgcagagaat tcaggcagaa tcagagaata tagttttaga gaaacaaact
     4921 atccagcaaa gatgtgaagc actgaaaatt caggcagatg gttttaaaga tcagctacgc
     4981 agcacaaatg aacacttgca taaacagaca aaaacagagc aggattttca aagaaaaatt
     5041 aaatgcctag aagaagacct ggcgaaaagt caaaatttgg taagtgaatt taagcaaaag
     5101 tgtgaccaac agaacattat catccagaat accaagaaag aagttagaaa tctgaatgcg
     5161 gaactgaatg cttccaaaga agagaagcga cgcggggagc agaaagttca gctacaacaa
     5221 gctcaggtgc aagagttaaa taacaggttg aaaaaagtac aagacgaatt acacttaaag
     5281 accatagagg agcagatgac ccacagaaag atggttctgt ttcaggaaga atctggtaaa
     5341 ttcaaacaat cagcagagga gtttcggaag aagatggaaa aattaatgga gtccaaagtc
     5401 atcactgaaa atgatatttc aggcattagg cttgactttg tgtctcttca acaagaaaac
     5461 tctagagccc aagaaaatgc taagctttgt gaaacaaaca ttaaagaact tgaaagacag
     5521 cttcaacagt atcgtgaaca aatgcagcaa gggcagcaca tggaagcaaa tcattaccaa
     5581 aaatgtcaga aacttgagga tgagctgata gcccagaagc gtgaggttga aaacctgaag
     5641 caaaaaatgg accaacagat caaagagcat gaacatcaat tagttttgct ccagtgtgaa
     5701 attcaaaaaa agagcacagc caaagactgt accttcaaac cagattttga gatgacagtg
     5761 aaggagtgcc agcactctgg agagctgtcc tctagaaaca ctggacacct tcacccaaca
     5821 cccagatccc ctctgttgag atggactcaa gaaccacagc cattggaaga gaagtggcag
     5881 catcgggttg ttgaacagat acccaaagaa gtccaattcc agccaccagg ggctccactc
     5941 gagaaagaga aaagccagca gtgttactct gagtactttt ctcagacaag caccgagtta
     6001 cagataactt ttgatgagac aaaccccatt acaagactgt ctgaaattga gaagataaga
     6061 gaccaagccc tgaacaattc tagaccacct gttaggtatc aagataacgc atgtgaaatg
     6121 gaactggtga aggttttgac acccttagag atagctaaga acaagcagta tgatatgcat
     6181 acagaagtca caacattaaa acaagaaaag aacccagttc ccagtgctga agaatggatg
     6241 cttgaagggt gcagagcatc tggtggactc aagaaagggg atttccttaa gaagggctta
     6301 gaaccagaga ccttccagaa ctttgatggt gatcatgcat gttcagtcag ggatgatgaa
     6361 tttaaattcc aagggcttag gcacactgtg actgccaggc agttggtgga agctaagctt
     6421 ctggacatga gaacaattga gcagctgcga ctcggtctta agactgttga agaagttcag
     6481 aaaactctta acaagtttct gacgaaagcc acctcaattg cagggcttta cctagaatct
     6541 acaaaagaaa agatttcatt tgcctcagcg gccgagagaa tcataataga caaaatggtg
     6601 gctttggcat ttttagaagc tcaggctgca acaggtttta taattgatcc catttcaggt
     6661 cagacatatt ctgttgaaga tgcagttctt aaaggagttg ttgaccccga attcagaatt
     6721 aggcttcttg aggcagagaa ggcagctgtg ggatattctt attcttctaa gacattgtca
     6781 gtgtttcaag ctatggaaaa tagaatgctt gacagacaaa aaggtaaaca tatcttggaa
     6841 gcccagattg ccagtggggg tgtcattgac cctgtgagag gcattcgtgt tcctccagaa
     6901 attgctctgc agcaggggtt gttgaataat gccatcttac agtttttaca tgagccatcc
     6961 agcaacacaa gagttttccc taatcccaat aacaagcaag ctctgtatta ctcagaatta
     7021 ctgcgaatgt gtgtatttga tgtagagtcc caatgctttc tgtttccatt tggggagagg
     7081 aacatttcca atctcaatgt caagaaaaca catagaattt ctgtagtaga tactaaaaca
     7141 ggatcagaat tgaccgtgta tgaggctttc cagagaaacc tgattgagaa aagtatatat
     7201 cttgaacttt cagggcagca atatcagtgg aaggaagcta tgttttttga atcctatggg
     7261 cattcttctc atatgctgac tgatactaaa acaggattac acttcaatat taatgaggct
     7321 atagagcagg gaacaattga caaagccttg gtcaaaaagt atcaggaagg cctcatcaca
     7381 cttacagaac ttgctgattc tttgctgagc cggttagtcc ccaagaaaga tttgcacagt
     7441 cctgttgcag ggtattggct gactgctagt ggggaaagga tctctgtact aaaagcctcc
     7501 cgtagaaatt tggttgatcg gattactgcc ctccgatgcc ttgaagccca agtcagtaca
     7561 gggggcataa ttgatcctct tactggcaaa aagtaccggg tggccgaagc tttgcataga
     7621 ggcctggttg atgaggggtt tgcccagcag ctgcgacagt gtgaattagt aatcacaggg
     7681 attggccatc ccatcactaa caaaatgatg tcagtggtgg aagctgtgaa tgcaaatatt
     7741 ataaataagg aaatgggaat ccgatgtttg gaatttcagt acttgacagg agggttgata
     7801 gagccacagg ttcactctcg gttatcaata gaagaggctc tccaagtagg tattatagat
     7861 gtcctcattg ccacaaaact caaagatcaa aagtcatatg tcagaaatat aatatgccct
     7921 cagacaaaaa gaaagttgac atataaagaa gccttagaaa aagctgattt tgatttccac
     7981 acaggactta aactgttaga agtatctgag cccctgatga caggaatttc tagcctctac
     8041 tattcttcct aatgggacat gtttaaataa ctgtgcaagg ggtgatgcag gctggttcat
     8101 gccacttttt cagagtatga tgatatcggc tacatatgca gtctgtgaat tatgtaacat
     8161 actctatttc ttgagggctg caaattgcta agtgctcaaa atagagtaag ttttaaattg
     8221 aaaattacat aagatttaat gcccttcaaa tggtttcatt tagccttgag aatggttttt
     8281 tgaaacttgg ccacactaaa atgttttttt ttttttacgt agaatgtggg ataaacttga
     8341 tgaactccaa gttcacagtg tcatttcttc agaactcccc ttcattgaat agtgatcatt
     8401 tattaaatga taaattgcac tcgctgaaag agcacgtcat gaagcaccat ggaatcaaag
     8461 agaaagatat aaattcgttc ccacagcctt caagctgcag tgttttagat tgcttcaaaa
     8521 aatgaaaaag ttttgccttt ttcgatatag tgaccttctt tgcatattaa aatgtttacc
     8581 acaatgtccc atttctagtt aagtcttcgc acttgaaagc taacattatg aatattatgt
     8641 gttggaggag gggaaggatt ttcttcattc tgtgtatttt ccgg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U30521. Human P311 HUM (3...[gi:963091] Links  


LOCUS       HSU30521                2036 bp    mRNA    linear   PRI 27-AUG-1995
DEFINITION  Human P311 HUM (3.1) mRNA, complete cds.
ACCESSION   U30521
VERSION     U30521.1  GI:963091
KEYWORDS    .
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2036)
  AUTHORS   Studler,J.M., Glowinski,J. and Levi-Strauss,M.
  TITLE     An abundant mRNA of the embryonic brain persists at a high level in
            cerebellum, hippocampus and olfactory bulb during adulthood
  JOURNAL   Eur. J. Neurosci. 5 (6), 614-623 (1993)
  MEDLINE   94084289
   PUBMED   8261136
REFERENCE   2  (bases 1 to 1814)
  AUTHORS   Studler,J.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-JUN-1995) Jeanne-Marie Studler, INSERM U-114, College
            de France, 11 Place Marcelin Berthelot, Paris cedex 05, 75231,
            France
FEATURES             Location/Qualifiers
     source          1..2036
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /sex="female"
                     /tissue_type="cerebellum"
                     /dev_stage="2 year-old"
     gene            1..2036
                     /gene="3.1"
     CDS             203..409
                     /gene="3.1"
                     /note="putative"
                     /codon_start=1
                     /product="P311 HUM"
                     /protein_id="AAA74903.1"
                     /db_xref="GI:963092"
                     /translation="MVYYPELFVWVSQEPFPNKDMEGRLPKGRLPVPKEVNRKKNDET
                     NAASLTPLGSSELRSPRISYLHFF"
     polyA_signal    2005..2010
                     /gene="3.1"
BASE COUNT      532 a    446 c    406 g    652 t
ORIGIN      
        1 tttcctcttt ctctaagagt ctctctctcc ctttccctct ctctcccccc aatctgtctt
       61 tctagcatgt tgcccttttt caaccacatt tgtgtttcag gtgtagagag gagagagagt
      121 gaacagggag cggggctttt gtctgttggt ctccctggac tgaagagagg gagaatagaa
      181 gcccaagact aagattctca aaatggttta ttacccagaa ctctttgtct gggtcagtca
      241 agaaccattt ccaaacaagg acatggaggg aaggcttcct aagggaagac ttcctgtccc
      301 aaaggaagtg aaccgcaaga agaacgatga gacaaacgct gcctccctga ctccactggg
      361 cagcagtgaa ctccgctccc caagaatcag ttacctccac tttttttaat cgtaacacct
      421 ccatttgtat tacatatggt gtatgggtat tgatgaggtc atggtatcat atatgggatt
      481 tttttctgtg taaatcatca agtataagaa gaaactatgg gactctgagc cttgctttag
      541 agaatttaca gtggacaaat aggtgtcatc aaaccagttt ttaatcattc tgactcaagt
      601 gaaaacgctc agaatttcac actgtgaatc cacgtttaca acccttacag gtgggccttc
      661 aggcctggtt cgctacaaca atgtcttcca caactcaaac tcccaccgcg ctcacacaac
      721 cggtccactc ctgccttttc actcacacag ctcccgactg cttcttgcag aggctgagag
      781 tccccccccc cacctttttt tttcatttag atgtaacaaa cctagtagtt tatgttcatc
      841 aattgtctgt atatctctat attttatcca tgtactcttt tgatgtatag aagtagtttg
      901 aaactcattg tttccttgtg gtaagtgacc gagatgctgc cacaggacct gagacactga
      961 tgaatggtgc tattttggac tttcaacatg ctccttggcg aggtagctct gatggagtta
     1021 ttttttattt ccatgttcta agaaggtgtt ggtactctgt ttccctgaat gttgttctct
     1081 agactggatt gacttgtttt ccttgtgtct tcagtgtggc tttcttcctc agtgttgtag
     1141 gttgagcgaa tgctaccaga gtgtgagaga ccattgtctc gttggctggc gctcacggac
     1201 atgcagtcac ggtagcggga gcaatcacaa aactgtaatt tacttaccaa atctcttcct
     1261 ttccgtagcc tcgcctgcct gacttagaga aagaaaagca ataattttac aggcattttg
     1321 aggtgtctct ttgggttctt tctgtttgaa aggatatttg tcgaaaaaaa gagcaaaacc
     1381 gttttaaata aactccccct ggaaaaaaac ccaaaacact ggcatctgag taggaatatg
     1441 aaaatgacac cttttccaaa tattaaattg gaaaacaagg tctacaaaat catgatactt
     1501 ttttaaaagg cagagcattc ttttttcggc aattttgata agcaaggtgt agatttacat
     1561 ttttgtcctt gctcccaacg aaatggataa acaaaaataa attaccatct actcatggaa
     1621 tgttgttgtg ttagccagtc tgaaagccca ccttaatttt tatataactg tctttagctc
     1681 ttcttttgac agggcaggcc ttgttctgaa ctgtttcgct tctgactgtt aaacaccgat
     1741 gacgcatgca ctgcacttct tcgttttctt cttgctcccc cattggcctg agtttcttgt
     1801 gcattactcc tctccctcct tcgttagaat aggtatatca gctgtgtaaa tagagcaaga
     1861 aaacagtatt ctgcatctgt ggcatttatg tagagttgca gttgtgtact gctgaaaatg
     1921 caggcttttg taacagtgtg atctttactg atgcactcat gacaagtacc caatgtattt
     1981 tagctatttt agtagtattt gttcaataaa tacgcaagct gtaaggtaac tgtctg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X16863. Human Fc-gamma RI...[gi:31321] Links  


LOCUS       HSFCGR31                 887 bp    mRNA    linear   PRI 12-SEP-1993
DEFINITION  Human Fc-gamma RIII-1 cDNA for Fc-gamma receptor III-1 (CD 16).
ACCESSION   X16863 M31936
VERSION     X16863.1  GI:31321
KEYWORDS    Fc-gamma receptor; Fc-gamma receptor III-1; Fc-gamma RIII-1 gene.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 887)
  AUTHORS   Ravetch,J.V. and Perussia,B.
  TITLE     Alternative membrane forms of Fc gamma RIII(CD16) on human natural
            killer cells and neutrophils. Cell type-specific expression of two
            genes that differ in single nucleotide substitutions
  JOURNAL   J. Exp. Med. 170 (2), 481-497 (1989)
  MEDLINE   89328325
REFERENCE   2  (bases 1 to 887)
  AUTHORS   Ravetch,J.V.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-APR-1990) Ravetch J.V., Sloan Kettering Institute,
            Dept. 6008 RRL 921, 1275 York Ave., New York, NY. USA
COMMENT     See  for Human Fc-gamma RIII-2 receptor.
FEATURES             Location/Qualifiers
     source          1..887
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="primary-peripheral blood granulocytes"
                     /note="Allele: NA-2"
     CDS             34..735
                     /note="(AA 1-233)"
                     /codon_start=1
                     /product="Fc-gamma receptor III-1 (CD 16)"
                     /protein_id="CAA34753.1"
                     /db_xref="GI:31322"
                     /db_xref="SWISS-PROT:P08637"
                     /translation="MWQLLLPTALLLLVSAGMRTEDLPKAVVFLEPQWYSVLEKDSVT
                     LKCQGAYSPEDNSTQWFHNESLISSQASSYFIDAATVNDSGEYRCQTNLSTLSDPVQL
                     EVHIGWLLLQAPRWVFKEEDPIHLRCHSWKNTALHKVTYLQNGKDRKYFHHNSDFHIP
                     KATLKDSGSYFCRGLVGSKNVSSETVNITITQGLAVSTISSFSPPGYQVSFCLVMVLL
                     FAVDTGLYFSVKTNI"
     variation       141
                     /note="c is g in NA-1 allele"
     variation       147
                     /note="t is c in NA-1 allele"
     variation       227
                     /note="g is a in NA-1 allele"
     variation       277
                     /note="a is g in NA-1 allele"
     variation       349
                     /note="a is g in NA-1 allele"
BASE COUNT      228 a    236 c    206 g    217 t
ORIGIN      
        1 tctttggtga cttgtccact ccagtgtggc atcatgtggc agctgctcct cccaactgct
       61 ctgctacttc tagtttcagc tggcatgcgg actgaagatc tcccaaaggc tgtggtgttc
      121 ctggagcctc aatggtacag cgtgcttgag aaggacagtg tgactctgaa gtgccaggga
      181 gcctactccc ctgaggacaa ttccacacag tggtttcaca atgagagcct catctcaagc
      241 caggcctcga gctacttcat tgacgctgcc acagtcaacg acagtggaga gtacaggtgc
      301 cagacaaacc tctccaccct cagtgacccg gtgcagctag aagtccatat cggctggctg
      361 ttgctccagg cccctcggtg ggtgttcaag gaggaagacc ctattcacct gaggtgtcac
      421 agctggaaga acactgctct gcataaggtc acatatttac agaatggcaa agacaggaag
      481 tattttcatc ataattctga cttccacatt ccaaaagcca cactcaaaga tagcggctcc
      541 tacttctgca gggggcttgt tgggagtaaa aatgtgtctt cagagactgt gaacatcacc
      601 atcactcaag gtttggcagt gtcaaccatc tcatcattct ctccacctgg gtaccaagtc
      661 tctttctgct tggtgatggt actccttttt gcagtggaca caggactata tttctctgtg
      721 aagacaaaca tttgaagctc aacaagagac tggaaggacc ataaacttaa atggagaaag
      781 gaccctcaag acaaatgacc cccatcccat gggagtaata agagcagtgg cagcagcatc
      841 tctgaacatt tctctggatt tgcaacccca tcatcctcag gcctctc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  

&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U20157. Human platelet-ac...[gi:780132] Links  


LOCUS       HSU20157                1505 bp    mRNA    linear   PRI 21-APR-1995
DEFINITION  Human platelet-activating factor acetylhydrolase mRNA, complete
            cds.
ACCESSION   U20157
VERSION     U20157.1  GI:780132
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1505)
  AUTHORS   Tjoelker,L.W., Wilder,C., Eberhardt,C., Stafforini,D.M.,
            Dietsch,G., Schimpf,B., Hooper,S., Trong,H., Cousens,L.S.,
            Zimmerman,G.A., Yamada,Y., McIntyre,T.M., Prescott,S.M. and
            Gray,P.W.
  TITLE     Anti-inflammatory properties of a platelet-activating factor
            acetylhydrolase
  JOURNAL   Nature 374 (6522), 549-553 (1995)
  MEDLINE   95214779
   PUBMED   7700381
REFERENCE   2  (bases 1 to 1505)
  AUTHORS   Tjoelker,L.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-JAN-1995) Larry W. Tjoelker, ICOS Corporation, 22021
            20th Ave. S.E., Bothell, WA 98021, USA
FEATURES             Location/Qualifiers
     source          1..1505
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="sAH 406-3"
                     /cell_type="macrophage"
                     /tissue_type="myeloid"
                     /clone_lib="in vitro differentiated macrophage cDNA
                     library"
     CDS             162..1487
                     /codon_start=1
                     /product="platelet-activating factor acetylhydrolase"
                     /protein_id="AAC50126.1"
                     /db_xref="GI:780133"
                     /translation="MVPPKLHVLFCLCGCLAVVYPFDWQYINPVAHMKSSAWVNKIQV
                     LMAAASFGQTKIPRGNGPYSVGCTDLMFDHTNKGTFLRLYYPSQDNDRLDTLWIPNKE
                     YFWGLSKFLGTHWLMGNILRLLFGSMTTPANWNSPLRPGEKYPLVVFSHGLGAFRTLY
                     SAIGIDLASHGFIVAAVEHRDRSASATYYFKDQSAAEIGDKSWLYLRTLKQEEETHIR
                     NEQVRQRAKECSQALSLILDIDHGKPVKNALDLKFDMEQLKDSIDREKIAVIGHSFGG
                     ATVIQTLSEDQRFRCGIALDAWMFPLGDEVYSRIPQPLFFINSEYFQYPANIIKMKKC
                     YSPDKERKMITIRGSVHQNFADFTFATGKIIGHMLKLKGDIDSNVAIDLSNKASLAFL
                     QKHLGLHKDFDQWDCLIEGDDENLIPGTNINTTNQHIMLQNSSGIEKYN"
BASE COUNT      438 a    311 c    333 g    423 t
ORIGIN      
        1 gctggtcgga ggctcgcagt gctgtcggcg agaagcagtc gggtttggag cgcttgggtc
       61 gcgttggtgc gcggtggaac gcgcccaggg accccagttc ccgcgagcag ctccgcgccg
      121 cgcctgagag actaagctga aactgctgct cagctcccaa gatggtgcca cccaaattgc
      181 atgtgctttt ctgcctctgc ggctgcctgg ctgtggttta tccttttgac tggcaataca
      241 taaatcctgt tgcccatatg aaatcatcag catgggtcaa caaaatacaa gtactgatgg
      301 ctgctgcaag ctttggccaa actaaaatcc cccggggaaa tgggccttat tccgttggtt
      361 gtacagactt aatgtttgat cacactaata agggcacctt cttgcgttta tattatccat
      421 cccaagataa tgatcgcctt gacacccttt ggatcccaaa taaagaatat ttttggggtc
      481 ttagcaaatt tcttggaaca cactggctta tgggcaacat tttgaggtta ctctttggtt
      541 caatgacaac tcctgcaaac tggaattccc ctctgaggcc tggtgaaaaa tatccacttg
      601 ttgttttttc tcatggtctt ggggcattca ggacacttta ttctgctatt ggcattgacc
      661 tggcatctca tgggtttata gttgctgctg tagaacacag agatagatct gcatctgcaa
      721 cttactattt caaggaccaa tctgctgcag aaatagggga caagtcttgg ctctacctta
      781 gaaccctgaa acaagaggag gagacacata tacgaaatga gcaggtacgg caaagagcaa
      841 aagaatgttc ccaagctctc agtctgattc ttgacattga tcatggaaag ccagtgaaga
      901 atgcattaga tttaaagttt gatatggaac aactgaagga ctctattgat agggaaaaaa
      961 tagcagtaat tggacattct tttggtggag caacggttat tcagactctt agtgaagatc
     1021 agagattcag atgtggtatt gccctggatg catggatgtt tccactgggt gatgaagtat
     1081 attccagaat tcctcagccc ctctttttta tcaactctga atatttccaa tatcctgcta
     1141 atatcataaa aatgaaaaaa tgctactcac ctgataaaga aagaaagatg attacaatca
     1201 ggggttcagt ccaccagaat tttgctgact tcacttttgc aactggcaaa ataattggac
     1261 acatgctcaa attaaaggga gacatagatt caaatgtagc tattgatctt agcaacaaag
     1321 cttcattagc attcttacaa aagcatttag gacttcataa agattttgat cagtgggact
     1381 gcttgattga aggagatgat gagaatctta ttccagggac caacattaac acaaccaatc
     1441 aacacatcat gttacagaac tcttcaggaa tagagaaata caattaggat taaaataggt
     1501 ttttt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH011005. Homo sapiens...[gi:15341629] Links  


LOCUS       L11671S1                 979 bp    DNA     linear   PRI 29-AUG-2001
DEFINITION  Homo sapiens transmembrane glycoprotein (CD53) gene, exon 1.
ACCESSION   L11671
VERSION     L11671.1  GI:291896
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Angelisova,P., Vlcek,C., Stefanova,I., Lipoldova,M. and Horejsi,V.
  TITLE     The human leucocyte surface antigen CD53 is a protein structurally
            similar to the CD37 and MRC OX-44 antigens
  JOURNAL   Immunogenetics 32 (4), 281-285 (1990)
  MEDLINE   91055810
   PUBMED   1700763
REFERENCE   2  (bases 1 to 979)
  AUTHORS   Korinek,V. and Horejsi,V.
  TITLE     Genomic structure of the human CD53 gene
  JOURNAL   Immunogenetics 38 (4), 272-279 (1993)
  MEDLINE   93307785
   PUBMED   8319976
COMMENT     On Jun 12, 1993 this sequence version replaced gi:180144.
FEATURES             Location/Qualifiers
     source          1..979
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="Jurkat"
                     /cell_type="T-cell"
     exon            509..606
                     /gene="CD53"
                     /number=1a
                     /evidence=experimental
     exon            512..606
                     /gene="CD53"
                     /number=1b
                     /evidence=experimental
     exon            515..606
                     /gene="CD53"
                     /number=1c
                     /evidence=experimental
     intron          607..>979
                     /gene="CD53"
                     /number=1
BASE COUNT      266 a    218 c    210 g    285 t
ORIGIN      
        1 gaattcctag agactggctg aagccagaca gagggagtat gaggaggggg cagagaatct
       61 tcagaaagac attccgaagt ctcgggccta gaaaatgttt agaaatattg gttcctgtgt
      121 tcccctccag tctgtctacc tgaatgggaa actggtggtg gggccaggca tacttctttt
      181 tgaacaaata cacagttaat tctgagatgc caccttgatt gagaagcact gtccttttct
      241 aagcaaaaat cattgccaaa ttaccctaga atccaccact gctcacagat ctgtgcctga
      301 accaacttgg ttttgaagtt ttcttttgtt tcacaaaact agtttagggt gggctgtgag
      361 aggaagcttc taaaaagcaa ctgggaagta ctaaatctct gagactaaag agggcggact
      421 cagcctcact tcctccttct tctcatctgg gcttcctgtc gtcacagcat gatcatattt
      481 tttcaccctt cacttctcct tttacacaaa tagccccgga tatctgtgtt accagccttg
      541 tctcggccac ctcaaggata atcactaaat tctgccgaaa ggactgagga acggtgcctg
      601 gaaaaggtaa ggttgatgac gttttaactt ggcttgctgt ctaatgtggg gtagggcgta
      661 ttcttagggg tctccatata cctctctagt aaccataaaa tctccggcaa cttctctttc
      721 acaaatattt agtgcacaaa gtaaaacaga ttagcaagat cctagtaaca tccccaaagg
      781 caatcaggcc agaaatactt ttacttaatc aggttcccat tatctaaacc tctgagtcga
      841 ggcattcaca tgccttaatt gcttcagagc ttgaattact ctgtgtagta attcctgata
      901 gccctggagc ttctatgcga gatgacattg gggagatttc ctatgtgctc tggtcttcag
      961 aaaattggct gtgatttaa
//
LOCUS       L11671S2                9100 bp    DNA     linear   PRI 29-AUG-2001
DEFINITION  Homo sapiens transmembrane glycoprotein (CD53) gene, exons 2
            through 8.
ACCESSION   L11670
VERSION     L11670.1  GI:180145
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Angelisova,P., Vlcek,C., Stefanova,I., Lipoldova,M. and Horejsi,V.
  TITLE     The human leucocyte surface antigen CD53 is a protein structurally
            similar to the CD37 and MRC OX-44 antigens
  JOURNAL   Immunogenetics 32 (4), 281-285 (1990)
  MEDLINE   91055810
   PUBMED   1700763
REFERENCE   2  (bases 1 to 9100)
  AUTHORS   Korinek,V. and Horejsi,V.
  TITLE     Genomic structure of the human CD53 gene
  JOURNAL   Immunogenetics 38 (4), 272-279 (1993)
  MEDLINE   93307785
   PUBMED   8319976
FEATURES             Location/Qualifiers
     source          1..9100
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /sex="male"
                     /dev_stage="adult"
     gene            join(L11671.1:509..979,1..>8850)
                     /gene="CD53"
     mRNA            join(L11671.1:509..606,324..403,1275..1466,3259..3333,
                     3894..3989,5585..5665,6739..6823,8053..>8850)
                     /gene="CD53"
                     /product="transmembrane glycoprotein"
                     /note="alternatively spliced"
     mRNA            join(L11671.1:512..606,324..403,1275..1466,3259..3333,
                     3894..3989,5585..5665,6739..6823,8053..>8850)
                     /gene="CD53"
                     /product="transmembrane glycoprotein"
                     /note="alternatively spliced"
     mRNA            join(L11671.1:515..606,324..403,1275..1466,3259..3333,
                     3894..3989,5585..5665,6739..6823,8053..>8850)
                     /gene="CD53"
                     /product="transmembrane glycoprotein"
                     /note="alternatively spliced"
     exon            324..403
                     /gene="CD53"
                     /number=2
     exon            1275..1466
                     /gene="CD53"
                     /number=3
     exon            3259..3333
                     /gene="CD53"
                     /number=4
     exon            3894..3989
                     /gene="CD53"
                     /number=5
     exon            5585..5665
                     /gene="CD53"
                     /number=6
     exon            6739..6823
                     /gene="CD53"
                     /number=7
     exon            8053..>8850
                     /gene="CD53"
                     /number=8
BASE COUNT     2565 a   1862 c   1925 g   2745 t      3 others
ORIGIN      
        1 ctgcagttag agctaaagca ctggcctggt ctctgtcact gtgactcatc ctcttctaag
       61 aatggctcta ttttcttcta caatacttaa ttcaatgtta agcattgttg gcttgccgaa
      121 ttgggcaaaa actgaatgtc taagatgagc cagttggtat ttcagtttag aactaggaaa
      181 gcgcagagtt tggggaaact ttccagagga gagccacagc tcaaacccaa ggtaatgcta
      241 gagatccctg aacatttgtg cactctgatc tctggctacc ttacagagtg aggtaactta
      301 ctttcttctt ccttattcct tagggcaaga atatcacggc atgggcatga gtagcttgaa
      361 actgctgaag tatgtcctgt ttttcttcaa cttgctcttt tgggtaagtg tatctcttct
      421 gagcacggtt tagctcacct ttacccagct aagctctcct cattcctcta ctgaagaggc
      481 tggtgtgggg taaaatgcat gcgtgtttgg gtcatgtcca gcacaaccat cctagaaaca
      541 aaagagtaat aatttttccc ctcttcctgc cacagaataa acctgcaaaa ttgaaaggta
      601 ccagtgcagt tattaatgga ggtgctgttt ctctctgagg gctttagttt actactactc
      661 atagctaaca tatataggtg agaaacctga gactgtgctc tggatattga gtgcctctgt
      721 ttttcttgtc tttttttcta gattccttga aacaataagt gattgaaagg aaagtcaccc
      781 tttccaggga ggtttcactg tataaaattc agcaattgaa tttcagatat aggatgggtt
      841 ggtacttgtg tatatgagaa gattattata agtttggctt ctctagtttt ctgcctagca
      901 agatgagctt cctcaagcag ggcatcttct ctacatggtc atggtttcta cagagaaagc
      961 catgtgtgag cgactgaaag ataaggaaaa agaagttaga cctatatatg gtgaagatat
     1021 tgtgtggctc acagaagtga ctttccttgg tatctggaga tgggtaggaa gagttgtgag
     1081 taggtctcca catgagaaag ctgagattgc ctctccaatg taagcatgga aaagtttggt
     1141 aaagaaattg ttcatgagac tcttgattga gccagttaaa aataaagtga ttctgatctc
     1201 aaggaagtga gaatatgagg agatacccaa gaaccacatt ttgcttcagg atgttgtggg
     1261 attcttcctt tcagatctgt ggctgctgca ttttgggctt tgggatctac ctgctgatcc
     1321 acaacaactt cggagtgctc ttccataacc tcccctccct cacgctgggc aatgtgtttg
     1381 tcatcgtggg ctctattatc atggtagttg ccttcctggg ctgcatgggc tctatcaagg
     1441 aaaacaagtg tctgcttatg tcggtgagtc cttacagcag atgtggtgcc ccaagtcggg
     1501 gagatggtgc ctaaatccca ggcaagatgt ttcttggtag atcacttata ggcacagaaa
     1561 gatcatagga aaatgagatt ggtacctcag atcagccaag tctgcccaga ccaatagtat
     1621 attatgtatg actagggtcc catggacagg tgaaagaggg cattgctatc ccccttgtag
     1681 ccatttttat attgctaata atgacttttg tcattattgt aattcccctt tatagcattt
     1741 tattttttct ttatgtctgg atgaaatcag gtgtcatcca tcttccaaaa tcctcctcat
     1801 gctagctatt attatgtact ttcagattct gactgagagt cgctaactaa tggaaaataa
     1861 tttctaggct ctttcctccc caaatgccaa aggatgcttc cttaactcat gtctgcacaa
     1921 gatactccgt agagagatcc aacttaggtg agggctctgg ctgcagccat tgaggtctaa
     1981 tgtgagagat gctacagcct tttgcatgtt ccctatcagg tactatcctg aggagtacct
     2041 taaggcttag gttttgcctg taagcacagt gcctgtagag cactgagctc tacatttctg
     2101 tagtgttccc agaacagagc ttgttgtaac agtgccactc taccaatagg cgtgggtttc
     2161 taatcacatt ggttcttcct taaatcattt ggttaatctt ttttccatta tggttcaaca
     2221 ttttggtggt ggttttttgt tttgtttttt tcttcttgtt gttgttttgg agatggagtt
     2281 tcactcttgt tgcccaggct ggagtgcaat ggcatgatct cagctcacca cagcctccgc
     2341 ctcccgggtt caagccattc tcctgcctca gcctcccgag tagctgggat tacagtcatg
     2401 cgccaccatg tccggctaat tctgaatttt tagtagagac agggtttctc catattggtc
     2461 aagctggtct ctaactcccg actttaggtg atccacctgc ctcggcctcc caaagtgttg
     2521 ggattatagg cgtaacgact gcgcccagtc tgtatgggtc aactcttaaa ctggagccaa
     2581 gagaaagaat tttttaaaaa gtccctcttc tcagatagtt gtcagactaa tggcaaagga
     2641 tggaagatag cagacatggg gtaaggtaaa tgttttaagc agtcaaaatt aatgttgggg
     2701 taaaaaaggt gaaggagaag gaataagtga aaatgttttc tgttgtgtgt tattagcata
     2761 aaaggagtaa gcatcgaaag ctggaaacaa atatgagtta gaaccatggc taggaccctt
     2821 ctttctacca ttgtacaggg ctcctctcag ccactaccag cagtcctcca cccagaggta
     2881 tccctgatct tgtagaaagg accaggcccc aaatgatcta gtttaccaac cttttgctta
     2941 actgcttcaa taggcagatg tcaaaattgc ttaacttatt caacaggtgg tggtgtgtat
     3001 gtattatgca tgtagtgctc tgataggcac agcaggtagg agagagcaaa gaagacagtt
     3061 gatctgccct ctagagtcta attgtagaaa aagaacacat attttcacaa aacaaaagaa
     3121 ggcttacaac ggcctgagga ggcagaacag gaacaccctg tacgtgtgca aatgcccctg
     3181 gatgctccca gagctgagtg ggagtgggac gagaatgggg atcagtgctg tgagaatgta
     3241 tctgctttgt cccagttctt catcctgctg ctgattatcc tccttnctga ggtgaccttn
     3301 nccatcctcc tctttgtata tgaacagaag gtaagttata aagacaacaa cttattgtct
     3361 taattactga aagtggggag tatgcagtgg agaagttggt acaaagttac agaataagtt
     3421 ctataataga gatagacatg aagtggaagg atagaggaaa cagagagtaa ttgtaattgg
     3481 ggggaaaaat ttgtatgaaa gagattgtat ctgagtggta tcttgagggg tgcctggaaa
     3541 gagcagtaaa acaagtgtct cttcctctac ttgctttcct ctgtgtgttt ggcaggagag
     3601 aatgtctgcc tcagtgccta aggatagccc ttgctttaat tgctcctttt cctcccttgt
     3661 aaagccagag ctctagaagg aagcaagcct actaaatact tcttccttca atgccacctc
     3721 atgctcagca tgtatgccca tagataaaca ccctcccctc acccttagtt ctgaggagac
     3781 catttggaag ggaagcgcaa gtggaaccac taacctatac tggaaattcc ttattccttg
     3841 aactcacctg ctttttacca tgtctcctct gctggaatgt gcctgcccag ctgaatgagt
     3901 atgtggctaa gggtctgacc gacagcatcc accgttacca ctcagacaat agcaccaagg
     3961 cagcgtggga ctccatccag tcatttgtga gtacaggtgg aatcctcttc agatcagccc
     4021 agacttcatt ttcaagccta aatccttggg ggctagttcc tttttctgga agtttcagaa
     4081 tctaaggtcc acatccctga atcccagaat aatgccttgg ctatcacaaa catgggagcc
     4141 cagtaattag tctgattagt acaaagttct ctacattctc tctttcatcc ttttctaaca
     4201 tgaataggtt tattttataa gttctgctag gatgtgaaga agacccaaac acagcaaact
     4261 ggattagttt atgtattttc caaaatttta ctgaaaacag cattgtataa cacaagaaat
     4321 tgccattgag ttcccgagtt gcccaaatca ggcttgttac catgcccacc aacatcccat
     4381 tcctcatgtg ctgtttccac ccacaaacgt gtatatgtac agatatacaa gctctgcatt
     4441 cctgacatga tgtgttggga gataaagatg gagccttgca ccagtataat ctatttgtgt
     4501 ctcgaaacag taccactatg aaagcacgct ggcttagtgt ggagtagaag aaatggcaca
     4561 ggaattagag cctggagacc agaattgggt agccctgggg aagtcactta acttatttag
     4621 atctccaatt ctttattttt ttaattcagt ttagaaaact ttttattaca taatttttag
     4681 aaatatacaa aagagaaaaa cagaaaaatg aatctcccaa ctcctcaata gttaacattt
     4741 tccagtgtca tctcatctaa ttccccacac ttttttcgtt agaatatgtt aaagcaaata
     4801 ctagacaata tattatttca cccactaaat ataaaagaac ttctttttaa tacacctaaa
     4861 attttataat ttcttaatac cactaataaa gtctataatt aaatttccct aattttctca
     4921 aaaaatttta attgacttgt tcaaatcaag atcttaacaa ggcctatgtg ttatatttga
     4981 atgataggtc tctctattta tcttttaatc tataatagta cacctctttg ttcttgctct
     5041 ttgtggaaag aaattaggct atttgtcctt tagacttctc cactctctag atcaggctag
     5101 ttgcttcctc gggtattttg gtatcaggga gtctctgtct cccataagcc ccatgaactt
     5161 aaaagtttga tgtggttttt gttttgtggt aagactgcca tagctgtcac ttcctgttgc
     5221 atcgtgtaag caggtacata atgtctgatg gtcctccttt ctgtgatctt aagattggtt
     5281 ggtgggtcca ggggtggtca gcctaatccc tccattatgc agtccctggg tgccttattc
     5341 ttgatctgta aaatgtgact agattaagca caggggtctc tactccacag ggatcttatg
     5401 tgaatgaaat ggtataacag attagaaagc actttgtttt aaggagccca tagcaatcag
     5461 accaattctg gacttctgct atagagtcag atctgaaggg cacacctttt cctctaaggt
     5521 ccacagcttt ttttcactgt tgactttcta accatcatca ttttgggggt ttggctttta
     5581 gctgcagtgt tgtggtataa atggcacgag tgattggacc agtggcccac cagcatcttg
     5641 cccctcagat cgaaaagtgg aggtaatttt gtcggcaatg tttctgttat tgacctcttt
     5701 gtttaaatgt ttaattacct cggaaactgc agtcatagag gacctagacc ttctattgag
     5761 aaacagggga ccttgaataa aagagaggcc agggcaacaa ccttgggtaa ttagaaaagt
     5821 cagaaaaaca tacgaacaaa ctcatttaga ctagagacac tgtgattgat cttgctacac
     5881 tagactatta cattagaggg gaacagttac ttttgtgtga aagtaggaga gggttgtgtc
     5941 tagatatttc ttaagcaaga agtaggtctc cttatggtta aagtgaaatg tatagggttg
     6001 agatagaaaa gtttctccct ctccctcttt ctctgctctt cttccttgga gatggcagaa
     6061 tccagcccct taggaaatga atcataggtg aaggagtaag gagttgaggg agacagagtt
     6121 agtggaacta ctgaaacaac ctgtccaatt aatttggacc tccagaatag gctctgagaa
     6181 gaagccacaa ctatcttcca actagactga atccctgagg tcttgtctcc tcatgttatc
     6241 tgctcctgaa ggggtttgga aatctccagg gtttttcagg tttgtggaga aagactagga
     6301 caaccactga ccagcaactg ccctggcact tggtagggct atgatggatt tactgaatgt
     6361 tgaagcagaa agtgaaatgc aaaccaattt tagtattgca tgccctatgt taatctctgg
     6421 tcagcactga gtgttcaaag acagtaggac gtcggttgct gacctgcctc ttagaagcta
     6481 gtttaactca gcgggtaagg atctaggact tctacattag ttaccactgt aatgataaca
     6541 ccaccagaaa agtctgtagt ttaatatttc ccaccttatg cctgtttctt cattcacgca
     6601 aagaaaataa aaatataata cctaagcctc tttgtattac ataaagcaaa atgcaaagca
     6661 ctgtatcttc caaatacttc ctcttgatat ggtggaatta tagagtagta tcatttgtaa
     6721 ctgaaatgtc ttctagggtt gctatgcgaa agcaagactg tggtttcatt ccaatttcct
     6781 gtatatcgga atcatcacca tctgtgtatg tgtgattgag gtaagagctt aaccacaggg
     6841 ttattgtgag gattacatga gttaagtcag gtaagatttc agaataatac caggtacaca
     6901 gtatttacac aataaatgtt agctattttt actaatatat gaattccccc agccaagtag
     6961 caaataatgt aattaacaat ttgctttaag gtatatagaa aatgtgctat aagaacatct
     7021 cttggccggg cgtggtggct cacgcctgta atcccagcac tttgggaggc tgaggcaggc
     7081 agatcacgag gtcaggagat caagaccatc ctggctaaca tggtgaaacc ccgtctctac
     7141 taaaaataca aaaaattaac cagacgtagt ggcaggtgtc tgtagtccca gctacttggg
     7201 aggctaaggt aggagaatgg cgtgaacctg ggaggcggcg ttgtagtgag tcaagatcgt
     7261 gccactgcct cctgcctggg cgacagagcg agactctgtc tccagaaaaa aaaaaaaaaa
     7321 aaagaacatc tctctggtca attcattcct cagagatatg agtgattcac atgattcaca
     7381 gtcaaacaaa aaagccacca agcagaatcc cactgtgatc cctcctcagc tggtctgaaa
     7441 aatacgaatt gataaagtat ttcatttgaa aacctgatct tgcatgttaa ggggctgatg
     7501 acgaaaattg taatcaattt cctctttgtt tctgtgctta gtttgacaag tgatgggtga
     7561 attgagggta gttttttgtc ctttttaata aaaaaggaca aaaataatgt gatatttcta
     7621 acatttttct acccaagtgt tgggtatatc atagattagt taaaactcaa tcaggaagct
     7681 cagagaaaat gatttttctc tgtttggaaa caaaggaggc acagagcatg gatagagata
     7741 actcattgca cagtgttcag tgagcagaat atgaccatcc tagcagcagg gcttctgtga
     7801 cttcctcaga agataaaaca tgctccagta ctttacagcc tgttatcttg tcatcattga
     7861 ccccgtttct ctcctcactg ctattcttta accaaaggaa agagctcctg agagagactt
     7921 gaaagtaatg gttggaaggg tggtttcagt gtaatggata cattcttttt ctcagaggcc
     7981 aatcccaggc attgtgaaag aaatggcttt tcttatgaag acttcaaatt ttcccaactc
     8041 ttttcacagg tgttggggat gtcctttgca ctgaccctga actgccagat tgacaaaacc
     8101 agccagacca tagggctatg atctgcagta gttctgtggt gaagagactt gtttcatctc
     8161 cggaaatgca aaaccattta tagcatgaag ccctacatga tcactgcagg atgatcctcc
     8221 tcccatcctt tcccttttta ggtccctgtc ttatacaacc agagaagtgg gtgttggcca
     8281 ggcacatccc atctcaggca gcaagacaat ctttcactca ctgacggcag cagccatgtc
     8341 tctcaaagtg gtgaaactaa tatctgagca tcttttagac aagagaggca aagacaaact
     8401 ggatttaatg gcccaacatc aaagggtgaa cccaggatat gaatttttgc atcttcccat
     8461 tgtcgaatta gtctccagcc tctaaataat gcccagtctt ctccccaaag tcaagcaaga
     8521 gactagttga agggagttct ggggccaggc tcactggacc attgtcacaa ccctctgttt
     8581 ctctttgact aagtgccctg gctacaggaa ttacacagtt ctctttctcc aaagggcaag
     8641 atctcatttc aatttcttta ttagagggcc ttattgatgt gttctaagtc tttccagaaa
     8701 aaaactatcc agtgatttat atcctgattt caaccagtca cttagctgat aatcacagta
     8761 agaagacttc tggtattatc tctctatcag ataagatttt gttaatgtac tattttactc
     8821 ttcaataaat aaaacagttt attatctcaa tcacaacatt cctatatatc aaacactcct
     8881 tccatgaccc agcctgatta ccctgattaa tgcaccaaac caggtgtatt aattgtctcc
     8941 tgctgcataa aatattactc caaaatttag tggctgagga caacaaacat ttattatctc
     9001 atggtttttg tgggtcagga atctaggagc agcttagctg ggtgattctg gttcacagtc
     9061 tctcatgtaa ctgcaatcaa catgtcagcc tgggctgcag 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M27492. Human interleukin...[gi:186289] Links  


LOCUS       HUMIL1RA                4910 bp    mRNA    linear   PRI 06-JAN-1995
DEFINITION  Human interleukin 1 receptor mRNA, complete cds.
ACCESSION   M27492
VERSION     M27492.1  GI:186289
KEYWORDS    interleukin receptor.
SOURCE      Human adult T cell and dermal fibroblast, cDNA to mRNA, clones
            lambda-[4,16,3,9,12].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4910)
  AUTHORS   Sims,J.E., Grubin,C.E. and McMahan,C.J.
  JOURNAL   Unpublished (1989)
REFERENCE   2  (bases 29 to 2008)
  AUTHORS   Sims,J.E., Acres,R.B., Grubin,C.E., McMahan,C.J., Wignall,J.M.,
            March,C.J. and Dower,S.K.
  TITLE     Cloning the interleukin 1 receptor from human T cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86 (22), 8946-8950 (1989)
  MEDLINE   90046906
   PUBMED   2530587
COMMENT     Draft entry and computer-readable sequence for [2],[1] kindly
            submitted by J.E.Sims, 06-DEC-1989.
FEATURES             Location/Qualifiers
     source          1..4910
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="2q12"
     gene            1..4910
                     /gene="IL1R"
     CDS             83..1792
                     /gene="IL1R"
                     /note="interleukin 1 receptor precursor"
                     /codon_start=1
                     /protein_id="AAA59137.1"
                     /db_xref="GI:307046"
                     /db_xref="GDB:G00-125-254"
                     /translation="MKVLLRLICFIALLISSLEADKCKEREEKIILVSSANEIDVRPC
                     PLNPNEHKGTITWYKDDSKTPVSTEQASRIHQHKEKLWFVPAKVEDSGHYYCVVRNSS
                     YCLRIKISAKFVENEPNLCYNAQAIFKQKLPVAGDGGLVCPYMEFFKNENNELPKLQW
                     YKDCKPLLLDNIHFSGVKDRLIVMNVAEKHRGNYTCHASYTYLGKQYPITRVIEFITL
                     EENKPTRPVIVSPANETMEVDLGSQIQLICNVTGQLSDIAYWKWNGSVIDEDDPVLGE
                     DYYSVENPANKRRSTLITVLNISEIESRFYKHPFTCFAKNTHGIDAAYIQLIYPVTNF
                     QKHMIGICVTLTVIIVCSVFIYKIFKIDIVLWYRDSCYDFLPIKASDGKTYDAYILYP
                     KTVGEGSTSDCDIFVFKVLPEVLEKQCGYKLFIYGRDDYVGEDIVEVINENVKKSRRL
                     IIILVRETSGFSWLGGSSEEQIAMYNALVQDGIKVVLLELEKIQDYEKMPESIKFIKQ
                     KHGAIRWSGDFTQGPQSAKTRFWKNVRYHMPVQRRSPSSKHQLLSPATKEKLQREAHV
                     PLG"
     sig_peptide     83..133
                     /gene="IL1R"
                     /note="interleukin 1 receptor signal peptide; G00-125-254"
     mat_peptide     134..1789
                     /gene="IL1R"
                     /product="interleukin 1 receptor; G00-125-254"
BASE COUNT     1426 a   1044 c   1044 g   1396 t
ORIGIN      
        1 tagacgcacc ctctgaagat ggtgactccc tcctgagaag ctggacccct tggtaaaaga
       61 caaggccttc tccaagaaga atatgaaagt gttactcaga cttatttgtt tcatagctct
      121 actgatttct tctctggagg ctgataaatg caaggaacgt gaagaaaaaa taattttagt
      181 gtcatctgca aatgaaattg atgttcgtcc ctgtcctctt aacccaaatg aacacaaagg
      241 cactataact tggtataaag atgacagcaa gacacctgta tctacagaac aagcctccag
      301 gattcatcaa cacaaagaga aactttggtt tgttcctgct aaggtggagg attcaggaca
      361 ttactattgc gtggtaagaa attcatctta ctgcctcaga attaaaataa gtgcaaaatt
      421 tgtggagaat gagcctaact tatgttataa tgcacaagcc atatttaagc agaaactacc
      481 cgttgcagga gacggaggac ttgtgtgccc ttatatggag ttttttaaaa atgaaaataa
      541 tgagttacct aaattacagt ggtataagga ttgcaaacct ctacttcttg acaatataca
      601 ctttagtgga gtcaaagata ggctcatcgt gatgaatgtg gctgaaaagc atagagggaa
      661 ctatacttgt catgcatcct acacatactt gggcaagcaa tatcctatta cccgggtaat
      721 agaatttatt actctagagg aaaacaaacc cacaaggcct gtgattgtga gcccagctaa
      781 tgagacaatg gaagtagact tgggatccca gatacaattg atctgtaatg tcaccggcca
      841 gttgagtgac attgcttact ggaagtggaa tgggtcagta attgatgaag atgacccagt
      901 gctaggggaa gactattaca gtgtggaaaa tcctgcaaac aaaagaagga gtaccctcat
      961 cacagtgctt aatatatcgg aaattgaaag tagattttat aaacatccat ttacctgttt
     1021 tgccaagaat acacatggta tagatgcagc atatatccag ttaatatatc cagtcactaa
     1081 tttccagaag cacatgattg gtatatgtgt cacgttgaca gtcataattg tgtgttctgt
     1141 tttcatctat aaaatcttca agattgacat tgtgctttgg tacagggatt cctgctatga
     1201 ttttctccca ataaaagctt cagatggaaa gacctatgac gcatatatac tgtatccaaa
     1261 gactgttggg gaagggtcta cctctgactg tgatattttt gtgtttaaag tcttgcctga
     1321 ggtcttggaa aaacagtgtg gatataagct gttcatttat ggaagggatg actacgttgg
     1381 ggaagacatt gttgaggtca ttaatgaaaa cgtaaagaaa agcagaagac tgattatcat
     1441 tttagtcaga gaaacatcag gcttcagctg gctgggtggt tcatctgaag agcaaatagc
     1501 catgtataat gctcttgttc aggatggaat taaagttgtc ctgcttgagc tggagaaaat
     1561 ccaagactat gagaaaatgc cagaatcgat taaattcatt aagcagaaac atggggctat
     1621 ccgctggtca ggggacttta cacagggacc acagtctgca aagacaaggt tctggaagaa
     1681 tgtcaggtac cacatgccag tccagcgacg gtcaccttca tctaaacacc agttactgtc
     1741 accagccact aaggagaaac tgcaaagaga ggctcacgtg cctctcgggt agcatggaga
     1801 agttgccaag agttctttag gtgcctcctg tcttatggcg ttgcaggcca ggttatgcct
     1861 catgctgact tgcagagttc atggaatgta actatatcat cctttatccc tgaggtcacc
     1921 tggaatcaga ttattaaggg aataagccat gacgtcaata gcagcccagg gcacttcaga
     1981 gtagagggct tgggaagatc ttttaaaaag gcagtaggcc cggtgtggtg gctcacgcct
     2041 ataatcccag cactttggga ggctgaagtg ggtggatcac cagaggtcag gagttcgaga
     2101 ccagcccagc caacatggca aaaccccatc tctactaaaa atacaaaaat gagctaggca
     2161 tggtggcaca cgcctgtaat cccagctaca cctgaggctg aggcaggaga attgcttgaa
     2221 ccggggagac ggaggttgca gtgagccgag tttgggccac tgcactctag cctggcaaca
     2281 gagcaagact ccgtctcaaa aaaagggcaa taaatgccct ctctgaatgt ttgaactgcc
     2341 aagaaaaggc atggagacag cgaactagaa gaaagggcaa gaaggaaata gccaccgtct
     2401 acagatggct tagttaagtc atccacagcc caagggcggg gctatgcctt gtctggggac
     2461 cctgtagagt cactgaccct ggagcggctc tcctgagagg tgctgcaggc aaagtgagac
     2521 tgacacctca ctgaggaagg gagacatatt cttggagaac tttccatctg cttgtatttt
     2581 ccatacacat ccccagccag aagttagtgt ccgaagaccg aattttattt tacagagctt
     2641 gaaaactcac ttcaatgaac aaagggattc tccaggattc caaagttttg aagtcatctt
     2701 agctttccac aggagggaga gaacttaaaa aagcaacagt agcagggaat tgatccactt
     2761 cttaatgctt tcctccctgg catgaccatc ctgtcctttg ttattatcct gcattttacg
     2821 tctttggagg aacagctccc tagtggcttc ctccgtctgc aatgtccctt gcacagccca
     2881 cacatgaacc atccttccca tgatgccgct cttctgtcat cccgctcctg ctgaaacacc
     2941 tcccaggggc tccacctgtt caggagctga agcccatgct ttcccaccag catgtcactc
     3001 ccagaccacc tccctgccct gtcctccagc ttcccctcgc tgtcctgctg tgtgaattcc
     3061 caggttggcc tggtggccat gtcgcctgcc cccagcactc ctctgtctct gctcttgcct
     3121 cgacccttcc tcctcctttg cctaggaggc cttctcgcat tttctctagc tgatcagaat
     3181 tttaccaaaa ttcagaacat cctccaattc cacagtctct gggagacttt ccctaagagg
     3241 cgacttcctc tccagccttc tctctctggt caggcccact gcagagatgg tggtgagcac
     3301 atctgggagg ctggtctccc tccagctgga attgctgctc tctgagggag aggctgtggt
     3361 ggctgtctct gtccctcact gccttccagg agcaatttgc acatgtaaca tagatttatg
     3421 taatgcttta tgtttaaaaa cattccccaa ttatcttatt taatttttgc aattattcta
     3481 attttatata tagagaaagt gacctatttt ttaaaaaaat cacactctaa gttctattga
     3541 acctaggact tgagcctcca tttctggctt ctagtctggt gttctgagta cttgatttca
     3601 ggtcaataac ggtcccccct cactccacac tggcacgttt gtgagaagaa atgacatttt
     3661 gctaggaagt gaccgagtct aggaatgctt ttattcaaga caccaaattc caaacttcta
     3721 aatgttggaa ttttcaaaaa ttgtgtttag attttatgaa aaactcttct actttcatct
     3781 attctttccc tagaggcaaa catttcttaa aatgtttcat tttcattaaa aatgaaagcc
     3841 aaatttatat gccaccgatt gcaggacaca agcacagttt taagagttgt atgaacatgg
     3901 agaggacttt tggtttttat atttctcgta tttaatatgg gtgaacacca acttttattt
     3961 ggaataataa ttttcctcct aaacaaaaac acattgagtt taagtctctg actcttgcct
     4021 ttccacctgc tttctcctgg gcccgctttg cctgcttgaa ggaacagtgc tgttctggag
     4081 ctgctgttcc aacagacagg gcctagcttt catttgacac acagactaca gccagaagcc
     4141 catggagcag ggatgtcacg tcttgaaaag cctattagat gttttacaaa tttaattttg
     4201 cagattattt tagtctgtca tccagaaaat gtgtcagcat gcatagtgct aagaaagcaa
     4261 gccaatttgg aaacttaggt tagtgacaaa attggccaga gagtgggggt gatgatgacc
     4321 aagaattaca agtagaatgg cagctggaat ttaaggaggg acaagaatca atggataagc
     4381 gtgggtggag gaagatccaa acagaaaagt gcaaagttat tccccatctt ccaagggttg
     4441 aattctggag gaagaagaca cattcctagt tccccgtgaa cttcctttga cttattgtcc
     4501 ccactaaaac aaaacaaaaa acttttaatg ccttccacat taattagatt ttcttgcagt
     4561 ttttttatgg cattttttta aagatgccct aagtgttgaa gaagagtttg caaatgcaac
     4621 aaaaatattt aattaccggt tgttaaaact ggtttagcac aatttatatt ttccctctct
     4681 tgcctttctt atttgcaata aaaggtattg agccattttt taaatgacat ttttgataaa
     4741 ttatgtttgt actagttgat gaaggagttt tttttaacct gtttatataa ttttgcagca
     4801 gaagccaaat tttttgtata ttaaagcacc aaattcatgt acagcatgca tcacggatca
     4861 atagactgta cttattttcc aataaaattt tcaaactttg tactgttaaa 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X52425. Human IL-4-R mRNA...[gi:33833] Links  


LOCUS       HSIL4R                  3597 bp    mRNA    linear   PRI 26-MAY-1992
DEFINITION  Human IL-4-R mRNA for the interleukin 4 receptor.
ACCESSION   X52425
VERSION     X52425.1  GI:33833
KEYWORDS    B cell growth factor; IL-4-R gene; interleukin; interleukin 4
            receptor.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3597)
  AUTHORS   Idzerda,R.L., March,C.J., Mosley,B., Lyman,S.D., Bos,T.V.,
            Gimpel,S.D., Din,W.S., Grabstein,K.H., Widmer,M.B., Park,L.S.,
            Cosman,D. and Beckmann,M.P.
  TITLE     Human interleukin 4 receptor confers biological responsiveness and
            defines a novel receptor superfamily
  JOURNAL   J. Exp. Med. 171 (3), 861-873 (1990)
  MEDLINE   90171849
COMMENT     Data kindly reviewed (11-MAR-1991) by Beckmann M.P.
FEATURES             Location/Qualifiers
     source          1..3597
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_type="T cell, tissue=peripheral blood, clone=T22-8"
     gene            1..3597
                     /gene="IL-4-R gene"
     mRNA            <1..3597
                     /gene="IL-4-R gene"
     CDS             176..2653
                     /gene="IL-4-R gene"
                     /codon_start=1
                     /product="interleukin 4 receptor"
                     /protein_id="CAA36672.1"
                     /db_xref="GI:33834"
                     /db_xref="SWISS-PROT:P24394"
                     /translation="MGWLCSGLLFPVSCLVLLQVASSGNMKVLQEPTCVSDYMSISTC
                     EWKMNGPTNCSTELRLLYQLVFLLSEAHTCIPENNGGAGCVCHLLMDDVVSADNYTLD
                     LWAGQQLLWKGSFKPSEHVKPRAPGNLTVHTNVSDTLLLTWSNPYPPDNYLYNHLTYA
                     VNIWSENDPADFRIYNVTYLEPSLRIAASTLKSGISYRARVRAWAQCYNTTWSEWSPS
                     TKWHNSYREPFEQHLLLGVSVSCIVILAVCLLCYVSITKIKKEWWDQIPNPARSRLVA
                     IIIQDAQGSQWEKRSRGQEPAKCPHWKNCLTKLLPCFLEHNMKRDEDPHKAAKEMPFQ
                     GSGKSAWCPVEISKTVLWPESISVVRCVELFEAPVECEEEEEVEEEKGSFCASPESSR
                     DDFQEGREGIVARLTESLFLDLLGEENGGFCQQDMGESCLLPPSGSTSAHMPWDEFPS
                     AGPKEAPPWGKEQPLHLEPSPPASPTQSPDNLTCTETPLVIAGNPAYRSFSNSLSQSP
                     CPRELGPDPLLARHLEEVEPEMPCVPQLSEPTTVPQPEPETWEQILRRNVLQHGAAAA
                     PVSAPTSGYQEFVHAVEQGGTQASAVVGLGPPGEAGYKAFSSLLASSAVSPEKCGFGA
                     SSGEEGYKPFQDLIPGCPGDPAPVPVPLFTFGLDREPPRSPQSSHLPSSSPEHLGLEP
                     GEKVEDMPKPPLPQEQATDPLVDSLGSGIVYSALTCHLCGHLKQCHGQEDGGQTPVMA
                     SPCCGCCCGDRSSPPTTPLRAPDPSPGGVPLEASLCPASLAPSGISEKSKSSSSFHPA
                     PGNAQSSSQTPKIVNFVSVGPTYMRVS"
     sig_peptide     176..250
                     /gene="IL-4-R gene"
     mat_peptide     251..2650
                     /gene="IL-4-R gene"
                     /product="interleukin 4 receptor"
     misc_feature    872..943
                     /gene="IL-4-R gene"
                     /product="interleukin 4 receptor"
                     /note="putative transmembrane region"
     polyA_signal    3579..3584
                     /gene="IL-4-R gene"
BASE COUNT      794 a   1034 c   1039 g    730 t
ORIGIN      
        1 ggcgaatgga gcaggggcgc gcagataatt aaagatttac acacagctgg aagaaatcat
       61 agagaagccg ggcgtggtgg ctcatgccta taatcccagc acttttggag gctgaggcgg
      121 gcagatcact tgagatcagg agttcgagac cagcctggtg ccttggcatc tcccaatggg
      181 gtggctttgc tctgggctcc tgttccctgt gagctgcctg gtcctgctgc aggtggcaag
      241 ctctgggaac atgaaggtct tgcaggagcc cacctgcgtc tccgactaca tgagcatctc
      301 tacttgcgag tggaagatga atggtcccac caattgcagc accgagctcc gcctgttgta
      361 ccagctggtt tttctgctct ccgaagccca cacgtgtatc cctgagaaca acggaggcgc
      421 ggggtgcgtg tgccacctgc tcatggatga cgtggtcagt gcggataact atacactgga
      481 cctgtgggct gggcagcagc tgctgtggaa gggctccttc aagcccagcg agcatgtgaa
      541 acccagggcc ccaggaaacc tgacagttca caccaatgtc tccgacactc tgctgctgac
      601 ctggagcaac ccgtatcccc ctgacaatta cctgtataat catctcacct atgcagtcaa
      661 catttggagt gaaaacgacc cggcagattt cagaatctat aacgtgacct acctagaacc
      721 ctccctccgc atcgcagcca gcaccctgaa gtctgggatt tcctacaggg cacgggtgag
      781 ggcctgggct cagtgctata acaccacctg gagtgagtgg agccccagca ccaagtggca
      841 caactcctac agggagccct tcgagcagca cctcctgctg ggcgtcagcg tttcctgcat
      901 tgtcatcctg gccgtctgcc tgttgtgcta tgtcagcatc accaagatta agaaagaatg
      961 gtgggatcag attcccaacc cagcccgcag ccgcctcgtg gctataataa tccaggatgc
     1021 tcaggggtca cagtgggaga agcggtcccg aggccaggaa ccagccaagt gcccacactg
     1081 gaagaattgt cttaccaagc tcttgccctg ttttctggag cacaacatga aaagggatga
     1141 agatcctcac aaggctgcca aagagatgcc tttccagggc tctggaaaat cagcatggtg
     1201 cccagtggag atcagcaaga cagtcctctg gccagagagc atcagcgtgg tgcgatgtgt
     1261 ggagttgttt gaggccccgg tggagtgtga ggaggaggag gaggtagagg aagaaaaagg
     1321 gagcttctgt gcatcgcctg agagcagcag ggatgacttc caggagggaa gggagggcat
     1381 tgtggcccgg ctaacagaga gcctgttcct ggacctgctc ggagaggaga atgggggctt
     1441 ttgccagcag gacatggggg agtcatgcct tcttccacct tcgggaagta cgagtgctca
     1501 catgccctgg gatgagttcc caagtgcagg gcccaaggag gcacctccct ggggcaagga
     1561 gcagcctctc cacctggagc caagtcctcc tgccagcccg acccagagtc cagacaacct
     1621 gacttgcaca gagacgcccc tcgtcatcgc aggcaaccct gcttaccgca gcttcagcaa
     1681 ctccctgagc cagtcaccgt gtcccagaga gctgggtcca gacccactgc tggccagaca
     1741 cctggaggaa gtagaacccg agatgccctg tgtcccccag ctctctgagc caaccactgt
     1801 gccccaacct gagccagaaa cctgggagca gatcctccgc cgaaatgtcc tccagcatgg
     1861 ggcagctgca gcccccgtct cggcccccac cagtggctat caggagtttg tacatgcggt
     1921 ggagcagggt ggcacccagg ccagtgcggt ggtgggcttg ggtcccccag gagaggctgg
     1981 ttacaaggcc ttctcaagcc tgcttgccag cagtgctgtg tccccagaga aatgtgggtt
     2041 tggggctagc agtggggaag aggggtataa gcctttccaa gacctcattc ctggctgccc
     2101 tggggaccct gccccagtcc ctgtcccctt gttcaccttt ggactggaca gggagccacc
     2161 tcgcagtccg cagagctcac atctcccaag cagctcccca gagcacctgg gtctggagcc
     2221 gggggaaaag gtagaggaca tgccaaagcc cccacttccc caggagcagg ccacagaccc
     2281 ccttgtggac agcctgggca gtggcattgt ctactcagcc cttacctgcc acctgtgcgg
     2341 ccacctgaaa cagtgtcatg gccaggagga tggtggccag acccctgtca tggccagtcc
     2401 ttgctgtggc tgctgctgtg gagacaggtc ctcgccccct acaacccccc tgagggcccc
     2461 agacccctct ccaggtgggg ttccactgga ggccagtctg tgtccggcct ccctggcacc
     2521 ctcgggcatc tcagagaaga gtaaatcctc atcatccttc catcctgccc ctggcaatgc
     2581 tcagagctca agccagaccc ccaaaatcgt gaactttgtc tccgtgggac ccacatacat
     2641 gagggtctct taggtgcatg tcctcttgtt gctgagtctg cagatgagga ctagggctta
     2701 tccatgcctg ggaaatgcca cctcctggaa ggcagccagg ctggcagatt tccaaaagac
     2761 ttgaagaacc atggtatgaa ggtgattggc cccactgacg ttggcctaac actgggctgc
     2821 agagactgga ccccgcccag cattgggctg ggctcgccac atcccatgag agtagagggc
     2881 actgggtcgc cgtgccccac ggcaggcccc tgcaggaaaa ctgaggccct tgggcacctc
     2941 gacttgtgaa cgagttgttg gctgctccct ccacagcttc tgcagcagac tgtccctgtt
     3001 gtaactgccc aaggcatgtt ttgcccacca gatcatggcc cacgtggagg cccacctgcc
     3061 tctgtctcac tgaactagaa gccgagccta gaaactaaca cagccatcaa gggaatgact
     3121 tgggcggcct tgggaaatcg atgagaaatt gaacttcagg gagggtggtc attgcctaga
     3181 ggtgctcatt catttaacag agcttcctta ggttgatgct ggaggcagaa tcccggctgt
     3241 caaggggtgt tcagttaagg ggagcaacag aggacatgaa aaattgctat gactaaagca
     3301 gggacaattt gctgccaaac acccatgccc agctgtatgg ctgggggctc ctcgtatgca
     3361 tggaaccccc agaataaata tgctcagcca ccctgtgggc cgggcaatcc agacagcagg
     3421 cataaggcac cagttaccct gcatgttggc ccagacctca ggtgctaggg aaggcgggaa
     3481 ccttgggttg agtaatgctc gtctgtgtgt tttagtttca tcacctgtta tctgtgtttg
     3541 ctgaggagag tggaacagaa ggggtggagt tttgtataaa taaagtttct ttgtctc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: S57235. CD68=110kda trans...[gi:298664] Links  


LOCUS       S57235                  1722 bp    mRNA    linear   PRI 28-JUN-1993
DEFINITION  CD68=110kda transmembrane glycoprotein [human, promonocyte cell
            line U937, mRNA, 1722 nt].
ACCESSION   S57235
VERSION     S57235.1  GI:298664
KEYWORDS    .
SOURCE      Homo sapiens promonocyte cell line U937.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1722)
  AUTHORS   Holness,C.L. and Simmons,D.L.
  TITLE     Molecular cloning of CD68, a human macrophage marker related to
            lysosomal glycoproteins
  JOURNAL   Blood 81 (6), 1607-1613 (1993)
  MEDLINE   93200523
   PUBMED   7680921
  REMARK    GenBank staff at the National Library of Medicine created this
            entry [NCBI gibbsq 127492] from the original journal article.
            This sequence comes from Fig. 3.
FEATURES             Location/Qualifiers
     source          1..1722
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..1722
                     /gene="CD68"
     CDS             16..1080
                     /gene="CD68"
                     /note="110kda transmembrane glycoprotein; This sequence
                     comes from Fig. 3"
                     /codon_start=1
                     /product="CD68"
                     /protein_id="AAB25811.1"
                     /db_xref="GI:298665"
                     /translation="MRLAVLFSGALLGLLAAQGTGNDCPHKKSATLLPSFTVTPTVTE
                     STGTTSHRTTKSHKTTTHRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTATSQ
                     GPSTATHSPATTSHGNATVHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSKETIGDY
                     TWTNGSQPCVHLQAQIQIRVMYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHLLLSF
                     PYGHLSFGFMQDLQQKVVYLSYMAVEYNVSFPHAAKWTFSAQNASLRDLQAPLGQSFS
                     CSNSSIILSPAVHLDLLSLRLQAAQLPHTGVFGQSFSCPSDRSILLPLIIGLILLGLL
                     ALVLIAFCIIRRRPSAYQAL"
BASE COUNT      435 a    539 c    401 g    347 t
ORIGIN      
        1 gcgggcggtt cagccatgag gctggctgtg cttttctcgg gggccctgct ggggctactg
       61 gcagcccagg ggacagggaa tgactgtcct cacaaaaaat cagctacttt gctgccatcc
      121 ttcacggtga cacccacggt tacagagagc actggaacaa ccagccacag gactaccaag
      181 agccacaaaa ccaccactca caggacaacc accacaggca ccaccagcca cggacccacg
      241 actgccactc acaaccccac caccaccagc catggaaacg tcacagttca tccaacaagc
      301 aatagcactg ccaccagcca gggaccctca actgccactc acagtcctgc caccactagt
      361 catggaaatg ccacggttca tccaacaagc aacagcactg ccaccagccc aggattcacc
      421 agttctgccc acccagaacc acctccaccc tctccgagtc ctagcccaac ctccaaggag
      481 accattggag actacacgtg gaccaatggt tcccagccct gtgtccacct ccaagcccag
      541 attcagattc gagtcatgta cacaacccag ggtggaggag aggcctgggg catctctgta
      601 ctgaacccca acaaaaccaa ggtccaggga agctgtgagg gtgcccatcc ccacctgctt
      661 ctctcattcc cctatggaca cctcagcttt ggattcatgc aggacctcca gcagaaggtt
      721 gtctacctga gctacatggc ggtggagtac aatgtgtcct tcccccacgc agcaaagtgg
      781 acattctcgg ctcagaatgc atcccttcga gatctccaag cacccctggg gcagagcttc
      841 agttgcagca actcgagcat cattctttca ccagctgtcc acctcgacct gctctccctg
      901 aggctccagg ctgctcagct gccccacaca ggggtctttg ggcaaagttt ctcctgcccc
      961 agtgaccggt ccatcttgct gcctctcatc atcggcctga tccttcttgg cctcctcgcc
     1021 ctggtgctta ttgctttctg catcatccgg agacgcccat ccgcctacca ggccctctga
     1081 gcatttgctt caaaccccag ggcactgagg gggtttgggg tgtggtgggg gggtaccctt
     1141 atttcctcga cacgccgctg gctcaaagac aatgttattt tccttccctt tcttgaagaa
     1201 caaaaagaaa gccgggcatg acggctcatg cctgtaatcc cagcactttg ggaggctgag
     1261 gcaggtggat cactggaggt caggtctttg aggccagccc tagccaacat ggtgtaaaca
     1321 ctgtctctac taaaaataca attagccagg tgtggcggcg taatcccatg ctaacctgta
     1381 atcccagcta cttgggaggc tgaggcagag ctgcttgaac cctggaagtg gaggttgcag
     1441 tgagcctgtc atcgctccac tgagccaaga tcgctcccac tgcactccag cctgggcgac
     1501 agagccagac tgtctcaaat aaataaatat gagataatgc agtcgggaga agggagggag
     1561 agaattttat taaatgtgac gaactgcccc cccccccccc cccagcagga gagcagcaaa
     1621 atttatgtaa atctttgacg gggttttcct tgctcctgcc aggattaaaa gtccatgagt
     1681 ttcttgctca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U03882. Human monocyte ch...[gi:472555] Links  


LOCUS       HSU03882                2232 bp    mRNA    linear   PRI 22-JUN-1994
DEFINITION  Human monocyte chemoattractant protein 1 receptor (MCP-1RA)
            alternatively spliced mRNA, complete cds.
ACCESSION   U03882
VERSION     U03882.1  GI:472555
KEYWORDS    .
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2232)
  AUTHORS   Charo,I.F., Myers,S.J., Herman,A., Franci,C., Connolly,A.J. and
            Coughlin,S.R.
  TITLE     Molecular cloning and functional expression of two monocyte
            chemoattractant protein 1 receptors reveals alternative splicing of
            the carboxyl-terminal tails
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 91 (7), 2752-2756 (1994)
  MEDLINE   94195821
   PUBMED   8146186
REFERENCE   2  (bases 1 to 2232)
  AUTHORS   Myers,S.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-DEC-1993) Scott J. Myers, Cardiovascular, The
            Gladstone Institutes, 2550 23rd Street, San Francisco, CA 94110,
            USA
FEATURES             Location/Qualifiers
     source          1..2232
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="ccr2-9a"
                     /cell_line="MonoMac 6"
                     /clone_lib="MonoMac6-#3"
     CDS             40..1164
                     /standard_name="monocyte chemoattractant protein 1
                     receptor"
                     /note="alternatively spliced; MCP-1RA"
                     /codon_start=1
                     /product="MCP-1 receptor"
                     /protein_id="AAA19119.1"
                     /db_xref="GI:472556"
                     /translation="MLSTSRSRFIRNTNESGEEVTTFFDYDYGAPCHKFDVKQIGAQL
                     LPPLYSLVFIFGFVGNMLVVLILINCKKLKCLTDIYLLNLAISDLLFLITLPLWAHSA
                     ANEWVFGNAMCKLFTGLYHIGYFGGIFFIILLTIDRYLAIVHAVFALKARTVTFGVVT
                     SVITWLVAVFASVPGIIFTKCQKEDSVYVCGPYFPRGWNNFHTIMRNILGLVLPLLIM
                     VICYSGILKTLLRCRNEKKRHRAVRVIFTIMIVYFLFWTPYNIVILLNTFQEFFGLSN
                     CESTSQLDQATQVTETLGMTHCCINPIIYAFVGEKFRSLFHIALGCRIAPLQKPVCGG
                     PGVRPGKNVKVTTQGLLDGRGKGKSIGRAPEASLQDKEGA"
BASE COUNT      602 a    464 c    508 g    658 t
ORIGIN      
        1 ggattgaaca aggacgcatt tccccagtac atccacaaca tgctgtccac atctcgttct
       61 cggtttatca gaaataccaa cgagagcggt gaagaagtca ccaccttttt tgattatgat
      121 tacggtgctc cctgtcataa atttgacgtg aagcaaattg gggcccaact cctgcctccg
      181 ctctactcgc tggtgttcat ctttggtttt gtgggcaaca tgctggtcgt cctcatctta
      241 ataaactgca aaaagctgaa gtgcttgact gacatttacc tgctcaacct ggccatctct
      301 gatctgcttt ttcttattac tctcccattg tgggctcact ctgctgcaaa tgagtgggtc
      361 tttgggaatg caatgtgcaa attattcaca gggctgtatc acatcggtta ttttggcgga
      421 atcttcttca tcatcctcct gacaatcgat agatacctgg ctattgtcca tgctgtgttt
      481 gctttaaaag ccaggacggt cacctttggg gtggtgacaa gtgtgatcac ctggttggtg
      541 gctgtgtttg cttctgtccc aggaatcatc tttactaaat gccagaaaga agattctgtt
      601 tatgtctgtg gcccttattt tccacgagga tggaataatt tccacacaat aatgaggaac
      661 attttggggc tggtcctgcc gctgctcatc atggtcatct gctactcggg aatcctgaaa
      721 accctgcttc ggtgtcgaaa cgagaagaag aggcataggg cagtgagagt catcttcacc
      781 atcatgattg tttactttct cttctggact ccctataaca ttgtcattct cctgaacacc
      841 ttccaggaat tcttcggcct gagtaactgt gaaagcacca gtcaactgga ccaagccacg
      901 caggtgacag agactcttgg gatgactcac tgctgcatca atcccatcat ctatgccttc
      961 gttggggaga agttcagaag cctttttcac atagctcttg gctgtaggat tgccccactc
     1021 caaaaaccag tgtgtggagg tccaggagtg agaccaggaa agaatgtgaa agtgactaca
     1081 caaggactcc tcgatggtcg tggaaaagga aagtcaattg gcagagcccc tgaagccagt
     1141 cttcaggaca aagaaggagc ctagagacag aaatgacaga tctctgcttt ggaaatcaca
     1201 cgtctggctt cacagatgtg tgattcacag tgtgaatctt ggtgtctacg ttaccaggca
     1261 ggaaggctga gaggagagag actccagctg ggttggaaaa cagtattttc caaactacct
     1321 tccagttcct catttttgaa tacaggcata gagttcagac tttttttaaa tagtaaaaat
     1381 aaaattaaag ctgaaaactg caacttgtaa atgtggtaaa gagttagttt gagttgctat
     1441 catgtcaaac gtgaaaatgc tgtattagtc acagagataa ttctagcttt gagcttaaga
     1501 attttgagca ggtggtatgt ttgggagact gctgagtcaa cccaatagtt gttgattggc
     1561 aggagttgga agtgtgtgat ctgtgggcac attagcctat gtgcatgcag catctaagta
     1621 atgatgtcgt ttgaatcaca gtatacgctc catcgctgtc atctcagctg gatctccatt
     1681 ctctcaggct tgctgccaaa agccttttgt gttttgtttt gtatcattat gaagtcatgc
     1741 gtttaatcac attcgagtgt ttcagtgctt cgcagatgtc cttgatgctc atattgttcc
     1801 ctaatttgcc agtgggaact cctaaatcaa attggcttct aatcaaagct tttaaaccct
     1861 attggtaaag aatggaaggt ggagaagctc cctgaagtaa gcaaagactt tcctcttagt
     1921 cgagccaagt taagaatgtt cttatgttgc ccagtgtgtt tctgatctga tgcaagcaag
     1981 aaacactggg cttctagaac caggcaactt gggaactaga ctcccaagct ggactatggc
     2041 tctactttca ggccacatgg ctaaagaagg tttcagaaag aagtggggac agagcagaac
     2101 tttcaccttc atatatttgt atgatcctaa tgaatgcata aaatgttaag ttgatggtga
     2161 tgaaatgtaa atactgtttt taacaactat gatttggaaa ataaatcaat gctataacta
     2221 tgttgataaa ag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH002689. Human EV12 protei...[gi:182279] Links  


LOCUS       HUMEVI21                 604 bp    DNA     linear   PRI 08-NOV-1994
DEFINITION  Human EV12 protein gene, exon 1.
ACCESSION   M55266
VERSION     M55266.1  GI:182277
KEYWORDS    EVI2 protein.
SEGMENT     1 of 2
SOURCE      Human lymphoblastoid cell DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 604)
  AUTHORS   Cawthon,R.M., O'Connell,P., Buchberg,A.M., Viskochil,D.,
            Weiss,R.B., Culver,M., Stevens,J., Jenkins,N.A., Copeland,N.G. and
            White,R.
  TITLE     Identification and characterization of transcripts from the
            neurofibromatosis 1 region: the sequence and genomic structure of
            EVI2 and mapping of other transcripts
  JOURNAL   Genomics 7 (4), 555-565 (1990)
  MEDLINE   90353953
   PUBMED   2117566
FEATURES             Location/Qualifiers
     source          1..604
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="17q11.2"
                     /cell_type="lymphoblast"
     exon            187..383
                     /partial
                     /gene="EVI2A"
                     /note="G00-125-191"
                     /number=1
BASE COUNT      182 a     99 c    119 g    204 t
ORIGIN      
        1 aatgacacta aagcatgtag caggcctata gaatgagcat actgtcaaag gcattaagtg
       61 gaaaaccctc cctgctctgg aggcggggat tggttctaca aagcctcctg tcatgccagt
      121 ggccaactag agaagttgga gctttttgat tatttttttt tttttcctcc tgtgacatgt
      181 gcctttagtt gcagtggaaa gaaatgtgtc atctgtggtt tggtttttaa aagtggaaaa
      241 ctagctgcac atatcctttt ttactgcaga tttactttaa ggctcatatt ctccaagtct
      301 attctgcttt aaaaagaaga caagaaaaga agtggtttat caaaatcacg ttataatcag
      361 attttgacca agcattttgt aaggtaggtc atatatatac ggcttttctt atgccttttc
      421 acttctaacg tctgacagat tagtgttcga ggaaatgcta tcaaagtgtg taagaatctc
      481 actaacaaac tagaattgtg actgaaaatg tatacttttt ttttctgtcc tgttacataa
      541 attttaagtt agaaatgaaa aaatttgcca tcgccatgga gttttaaatt aggattacga
      601 agga
//
LOCUS       HUMEVI22                1798 bp    DNA     linear   PRI 08-NOV-1994
DEFINITION  Human EV12 protein gene, exon 1.
ACCESSION   M55267
VERSION     M55267.1  GI:182278
KEYWORDS    EVI2 protein.
SEGMENT     2 of 2
SOURCE      Human lymphoblastoid cell DNA, and cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1798)
  AUTHORS   Cawthon,R.M., O'Connell,P., Buchberg,A.M., Viskochil,D.,
            Weiss,R.B., Culver,M., Stevens,J., Jenkins,N.A., Copeland,N.G. and
            White,R.
  TITLE     Identification and characterization of transcripts from the
            neurofibromatosis 1 region: the sequence and genomic structure of
            EVI2 and mapping of other transcripts
  JOURNAL   Genomics 7 (4), 555-565 (1990)
  MEDLINE   90353953
   PUBMED   2117566
FEATURES             Location/Qualifiers
     source          1..1798
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="17q11.2"
                     /cell_type="lymphoblast"
     gene            join(M55266.1:187..604,1..1563)
                     /gene="EVI2A"
     mRNA            join(M55266.1:187..383,198..1563)
                     /partial
                     /gene="EVI2A"
                     /product="EVI2 protein"
                     /note="G00-125-191"
     intron          order(M55266.1:384..604,1..197)
                     /gene="EVI2A"
                     /note="G00-125-191"
     CDS             220..918
                     /gene="EVI2A"
                     /number=1
                     /codon_start=1
                     /product="EVI2 protein"
                     /protein_id="AAA52413.1"
                     /db_xref="GI:182280"
                     /db_xref="GDB:G00-125-191"
                     /translation="MEHTGHYLHLAFLMTTVFSLSPGTKANYTRLWANSTSSWDSVIQ
                     NKTGRNQNENINTNPITPEVDYKGNSTNMPETSHIVALTSKSEQELYIPSVVSNSPST
                     VQSIENTSKSHGEIFKKDVCAENNNNMAMLICLIIIAVLFLICTFLFLSTVVLANKVS
                     SLRRSKQVGKRQPRSNGDFLASGLWPAESDTWKRTKQLTGPNLVMQSTGVLTATRERK
                     DEEGTEKLTNKQIG"
     sig_peptide     220..297
                     /partial
                     /gene="EVI2A"
                     /note="G00-125-191"
     mat_peptide     298..915
                     /partial
                     /gene="EVI2A"
                     /product="EVI2 protein"
                     /note="G00-125-191"
BASE COUNT      658 a    310 c    303 g    527 t
ORIGIN      
        1 taatagaaat taaaatgctt cttcatacat agctgaatag aaaagaattt gttgagaagg
       61 aattcagggt agcgaatatt aggcataagc ttgtagttta cttgtaacat ctcaacacta
      121 tcttttaact acaattacca aaaactagga tccattattc tttcacaaac taacaaatta
      181 tattgctatc ccaacagatt gccaagtatg cccacggaca tggaacacac aggacattac
      241 ctacatcttg cctttctgat gacaacagtt ttttctttgt ctcctggaac aaaagcaaac
      301 tatacccgtc tgtgggctaa cagtacttct tcctgggatt cagttattca aaacaagaca
      361 ggcagaaacc aaaatgaaaa cattaacaca aaccctataa ctcctgaagt agattataaa
      421 ggtaattcta caaacatgcc tgaaacatct cacatcgtag ctttaacttc taaatctgaa
      481 caggagcttt atataccttc tgtcgtcagc aacagtcctt caacagtaca gagcattgaa
      541 aacacaagca aaagtcatgg tgaaattttc aaaaaggatg tctgtgcgga aaacaacaac
      601 aacatggcta tgctaatttg cttaattata attgcagtgc tttttcttat ctgtaccttt
      661 ctatttctat caactgtggt tttggcaaac aaagtctctt ctctcagacg atcaaaacaa
      721 gtaggcaagc gtcagcctag aagcaatggc gattttctgg caagcggtct atggcccgct
      781 gaatcagaca cttggaaaag aacaaaacag ctcacaggac ccaacctagt gatgcaatct
      841 actggagtgc tcacagctac aagggaaaga aaagatgaag aaggaactga aaaacttact
      901 aacaaacaga taggttagtg aagaaaaatg caaagtagca atgagaaggc ttatggagta
      961 aaaatgaagt cagttggtat ttaatcccaa agtgttgttc tgattatcta aaatttgaca
     1021 tggtagacct tgcaatttag aatcaagcag gtgagacagg gagaagtatg cctgcttaat
     1081 tatttaaact gtgtactttt gttttgacac tgaatatttt aaaaagcaaa taataaaata
     1141 actaagcatt tgaggaaaat tttaaggata aattgaggaa actgattaat agagatagca
     1201 agggataatt aaataaatat tccctatgta gcaacagtgg ttagatgatc tttgtctgaa
     1261 tgtaataaaa ctttgaatag ttttagtgtg tccttaaagc caagtatatg ctttaacatc
     1321 aaatggaagt caaattccta atgcatagat agagagagct aaactgtgta atttaatggt
     1381 atcttccttg ctggatgtgg cagaatccac accagcttat caaccaacac agctaatttt
     1441 agaataggtc ctttatcttt ccatatggca cacgtaagaa agtgtttttc tactattaat
     1501 attaaattaa aacctttact tttgtataat aaattaaaac tcagaataaa cctgtgacca
     1561 cgtatatttg cattcacttt attactttag agaacacatt gtaaagatca ataagaaata
     1621 gagcacaact aaaataaata agatttatag ccacaccaat aggctagtgt aaacgaaagt
     1681 atgtttcact gtttatgatt aataatattc atcttttcta taaatactac ttactggaac
     1741 attaacaaca agtccaaagg ttgattaatt ttgactcagg agcagagcta tgattata
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X13334. Human CD14 mRNA f...[gi:29740] Links  


LOCUS       HSCD14R                 1367 bp    mRNA    linear   PRI 31-MAR-1995
DEFINITION  Human CD14 mRNA for myelid cell-specific leucine-rich glycoprotein.
ACCESSION   X13334
VERSION     X13334.1  GI:29740
KEYWORDS    antigen; CD14 antigen; cell surface glycoprotein; glycoprotein.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1367)
  AUTHORS   Setoguchi,M., Nasu,N., Yoshida,S., Higuchi,Y., Akizuki,S. and
            Yamamoto,S.
  TITLE     Mouse and human CD14 (myeloid cell-specific leucine-rich
            glycoprotein) primary structure deduced from cDNA clones
  JOURNAL   Biochim. Biophys. Acta 1008 (2), 213-222 (1989)
  MEDLINE   89287330
REFERENCE   2  (bases 1 to 1367)
  AUTHORS   Yamamoto,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-OCT-1988) Yamamoto S., Dept of Pathology, Medical
            College of Oita, Idaigaoka 1-1506, Hazamamachi, Oita Gun, Oita,
            Japan
FEATURES             Location/Qualifiers
     source          1..1367
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="THP-1"
                     /cell_type="macrophage."
     CDS             120..1247
                     /codon_start=1
                     /product="leucine-rich preprotein (AA -19 to 356)"
                     /protein_id="CAA31711.1"
                     /db_xref="GI:29741"
                     /db_xref="SWISS-PROT:P08571"
                     /translation="MERASCLLLLLLPLVHVSATTPEPCELDDEDFRCVCNFSEPQPD
                     WSEAFQCVSAVEVEIHAGGLNLEPFLKRVDADADPRQYADTVKALRVRRLTVGAAQVP
                     AQLLVGALRVLAYSRLKELTLEDLKITGTMPPLPLEATGLALSSLRLRNVSWATGRSW
                     LAELQQWLKPGLKVLSIAQAHSPAFSCEQVRAFPALTSLDLSDNPGLGERGLMAALCP
                     HKFPAIQNLALRNTGMETPTGVCAALAAAGVQPHSLDLSHNSLRATVNPSAPRCMWSS
                     ALNSLNLSFAGLEQVPKGLPAKLRVLDLSCNRLNRAPQPDELPEVDNLTLDGNPFLVP
                     GTALPHEGSMNSGVVPACARSTLSVGVSGTLVLLQGARGFA"
     sig_peptide     120..176
                     /note="signal peptide (AA -19 to -1)"
     mat_peptide     177..1244
                     /product="leucine-rich glycoprotein (AA 1 - 356)"
     misc_feature    228..236
                     /note="pot. N-linked glycosylation site"
     misc_feature    270..278
                     /note="pot. N-linked glycosylation site"
     misc_feature    963..971
                     /note="pot. N-linked glycosylation site"
     misc_feature    1086..1094
                     /note="pot. N-linked glycosylation site"
BASE COUNT      269 a    441 c    392 g    265 t
ORIGIN      
        1 ccggccggcc gaagagttca caagtgtgaa gcctgaagcc gccgggtgcc gctgtgtaga
       61 aagaagctaa agcacttcca gagcctgctg agctcagagg ttcggaagac ttatcgacca
      121 tggagcgcgc gtcctgcttg ttgctgctgc tgctgccgct ggtgcacgtc tctgcgacca
      181 cgccagaacc ttgtgagctg gacgatgaag atttccgctg cgtctgcaac ttctccgaac
      241 ctcagcccga ctggtccgaa gccttccagt gtgtgtctgc agtagaggtg gagatccatg
      301 ccggcggtct caacctagag ccgtttctaa agcgcgtcga tgcggacgcc gacccgcggc
      361 agtatgctga cacggtcaag gctctccgcg tgcggcggct cacagtggga gccgcacagg
      421 ttcctgctca gctactggta ggcgccctgc gtgtgctagc gtactcccgc ctcaaggaac
      481 tgacgctcga ggacctaaag ataaccggca ccatgcctcc gctgcctctg gaagccacag
      541 gacttgcact ttccagcttg cgcctacgca acgtgtcgtg ggcgacaggg cgttcttggc
      601 tcgccgagct gcagcagtgg ctcaagccag gcctcaaggt actgagcatt gcccaagcac
      661 actcgcctgc cttttcctgc gaacaggttc gcgccttccc ggcccttacc agcctagacc
      721 tgtctgacaa tcctggactg ggcgaacgcg gactgatggc ggctctctgt ccccacaagt
      781 tcccggccat ccagaatcta gcgctgcgca acacaggaat ggagacgccc acaggcgtgt
      841 gcgccgcact ggcggcggca ggtgtgcagc cccacagcct agacctcagc cacaactcgc
      901 tgcgcgccac cgtaaaccct agcgctccga gatgcatgtg gtccagcgcc ctgaactccc
      961 tcaatctgtc gttcgctggg ctggaacagg tgcctaaagg actgccagcc aagctcagag
     1021 tgctcgatct cagctgcaac agactgaaca gggcgccgca gcctgacgag ctgcccgagg
     1081 tggataacct gacactggac gggaatccct tcctggtccc tggaactgcc ctcccccacg
     1141 agggctcaat gaactccggc gtggtcccag cctgtgcacg ttcgaccctg tcggtggggg
     1201 tgtcgggaac cctggtgctg ctccaagggg cccggggctt tgcctaagat ccaagacaga
     1261 ataatgaatg gactcaaact gccttggctt caggggagtc ccgtcaggac gttgaggact
     1321 tttcgaccaa ttcaaccctt tgccccacct ttattaaaat cttaaac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X52425. Human IL-4-R mRNA...[gi:33833] Links  


LOCUS       HSIL4R                  3597 bp    mRNA    linear   PRI 26-MAY-1992
DEFINITION  Human IL-4-R mRNA for the interleukin 4 receptor.
ACCESSION   X52425
VERSION     X52425.1  GI:33833
KEYWORDS    B cell growth factor; IL-4-R gene; interleukin; interleukin 4
            receptor.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3597)
  AUTHORS   Idzerda,R.L., March,C.J., Mosley,B., Lyman,S.D., Bos,T.V.,
            Gimpel,S.D., Din,W.S., Grabstein,K.H., Widmer,M.B., Park,L.S.,
            Cosman,D. and Beckmann,M.P.
  TITLE     Human interleukin 4 receptor confers biological responsiveness and
            defines a novel receptor superfamily
  JOURNAL   J. Exp. Med. 171 (3), 861-873 (1990)
  MEDLINE   90171849
COMMENT     Data kindly reviewed (11-MAR-1991) by Beckmann M.P.
FEATURES             Location/Qualifiers
     source          1..3597
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_type="T cell, tissue=peripheral blood, clone=T22-8"
     gene            1..3597
                     /gene="IL-4-R gene"
     mRNA            <1..3597
                     /gene="IL-4-R gene"
     CDS             176..2653
                     /gene="IL-4-R gene"
                     /codon_start=1
                     /product="interleukin 4 receptor"
                     /protein_id="CAA36672.1"
                     /db_xref="GI:33834"
                     /db_xref="SWISS-PROT:P24394"
                     /translation="MGWLCSGLLFPVSCLVLLQVASSGNMKVLQEPTCVSDYMSISTC
                     EWKMNGPTNCSTELRLLYQLVFLLSEAHTCIPENNGGAGCVCHLLMDDVVSADNYTLD
                     LWAGQQLLWKGSFKPSEHVKPRAPGNLTVHTNVSDTLLLTWSNPYPPDNYLYNHLTYA
                     VNIWSENDPADFRIYNVTYLEPSLRIAASTLKSGISYRARVRAWAQCYNTTWSEWSPS
                     TKWHNSYREPFEQHLLLGVSVSCIVILAVCLLCYVSITKIKKEWWDQIPNPARSRLVA
                     IIIQDAQGSQWEKRSRGQEPAKCPHWKNCLTKLLPCFLEHNMKRDEDPHKAAKEMPFQ
                     GSGKSAWCPVEISKTVLWPESISVVRCVELFEAPVECEEEEEVEEEKGSFCASPESSR
                     DDFQEGREGIVARLTESLFLDLLGEENGGFCQQDMGESCLLPPSGSTSAHMPWDEFPS
                     AGPKEAPPWGKEQPLHLEPSPPASPTQSPDNLTCTETPLVIAGNPAYRSFSNSLSQSP
                     CPRELGPDPLLARHLEEVEPEMPCVPQLSEPTTVPQPEPETWEQILRRNVLQHGAAAA
                     PVSAPTSGYQEFVHAVEQGGTQASAVVGLGPPGEAGYKAFSSLLASSAVSPEKCGFGA
                     SSGEEGYKPFQDLIPGCPGDPAPVPVPLFTFGLDREPPRSPQSSHLPSSSPEHLGLEP
                     GEKVEDMPKPPLPQEQATDPLVDSLGSGIVYSALTCHLCGHLKQCHGQEDGGQTPVMA
                     SPCCGCCCGDRSSPPTTPLRAPDPSPGGVPLEASLCPASLAPSGISEKSKSSSSFHPA
                     PGNAQSSSQTPKIVNFVSVGPTYMRVS"
     sig_peptide     176..250
                     /gene="IL-4-R gene"
     mat_peptide     251..2650
                     /gene="IL-4-R gene"
                     /product="interleukin 4 receptor"
     misc_feature    872..943
                     /gene="IL-4-R gene"
                     /product="interleukin 4 receptor"
                     /note="putative transmembrane region"
     polyA_signal    3579..3584
                     /gene="IL-4-R gene"
BASE COUNT      794 a   1034 c   1039 g    730 t
ORIGIN      
        1 ggcgaatgga gcaggggcgc gcagataatt aaagatttac acacagctgg aagaaatcat
       61 agagaagccg ggcgtggtgg ctcatgccta taatcccagc acttttggag gctgaggcgg
      121 gcagatcact tgagatcagg agttcgagac cagcctggtg ccttggcatc tcccaatggg
      181 gtggctttgc tctgggctcc tgttccctgt gagctgcctg gtcctgctgc aggtggcaag
      241 ctctgggaac atgaaggtct tgcaggagcc cacctgcgtc tccgactaca tgagcatctc
      301 tacttgcgag tggaagatga atggtcccac caattgcagc accgagctcc gcctgttgta
      361 ccagctggtt tttctgctct ccgaagccca cacgtgtatc cctgagaaca acggaggcgc
      421 ggggtgcgtg tgccacctgc tcatggatga cgtggtcagt gcggataact atacactgga
      481 cctgtgggct gggcagcagc tgctgtggaa gggctccttc aagcccagcg agcatgtgaa
      541 acccagggcc ccaggaaacc tgacagttca caccaatgtc tccgacactc tgctgctgac
      601 ctggagcaac ccgtatcccc ctgacaatta cctgtataat catctcacct atgcagtcaa
      661 catttggagt gaaaacgacc cggcagattt cagaatctat aacgtgacct acctagaacc
      721 ctccctccgc atcgcagcca gcaccctgaa gtctgggatt tcctacaggg cacgggtgag
      781 ggcctgggct cagtgctata acaccacctg gagtgagtgg agccccagca ccaagtggca
      841 caactcctac agggagccct tcgagcagca cctcctgctg ggcgtcagcg tttcctgcat
      901 tgtcatcctg gccgtctgcc tgttgtgcta tgtcagcatc accaagatta agaaagaatg
      961 gtgggatcag attcccaacc cagcccgcag ccgcctcgtg gctataataa tccaggatgc
     1021 tcaggggtca cagtgggaga agcggtcccg aggccaggaa ccagccaagt gcccacactg
     1081 gaagaattgt cttaccaagc tcttgccctg ttttctggag cacaacatga aaagggatga
     1141 agatcctcac aaggctgcca aagagatgcc tttccagggc tctggaaaat cagcatggtg
     1201 cccagtggag atcagcaaga cagtcctctg gccagagagc atcagcgtgg tgcgatgtgt
     1261 ggagttgttt gaggccccgg tggagtgtga ggaggaggag gaggtagagg aagaaaaagg
     1321 gagcttctgt gcatcgcctg agagcagcag ggatgacttc caggagggaa gggagggcat
     1381 tgtggcccgg ctaacagaga gcctgttcct ggacctgctc ggagaggaga atgggggctt
     1441 ttgccagcag gacatggggg agtcatgcct tcttccacct tcgggaagta cgagtgctca
     1501 catgccctgg gatgagttcc caagtgcagg gcccaaggag gcacctccct ggggcaagga
     1561 gcagcctctc cacctggagc caagtcctcc tgccagcccg acccagagtc cagacaacct
     1621 gacttgcaca gagacgcccc tcgtcatcgc aggcaaccct gcttaccgca gcttcagcaa
     1681 ctccctgagc cagtcaccgt gtcccagaga gctgggtcca gacccactgc tggccagaca
     1741 cctggaggaa gtagaacccg agatgccctg tgtcccccag ctctctgagc caaccactgt
     1801 gccccaacct gagccagaaa cctgggagca gatcctccgc cgaaatgtcc tccagcatgg
     1861 ggcagctgca gcccccgtct cggcccccac cagtggctat caggagtttg tacatgcggt
     1921 ggagcagggt ggcacccagg ccagtgcggt ggtgggcttg ggtcccccag gagaggctgg
     1981 ttacaaggcc ttctcaagcc tgcttgccag cagtgctgtg tccccagaga aatgtgggtt
     2041 tggggctagc agtggggaag aggggtataa gcctttccaa gacctcattc ctggctgccc
     2101 tggggaccct gccccagtcc ctgtcccctt gttcaccttt ggactggaca gggagccacc
     2161 tcgcagtccg cagagctcac atctcccaag cagctcccca gagcacctgg gtctggagcc
     2221 gggggaaaag gtagaggaca tgccaaagcc cccacttccc caggagcagg ccacagaccc
     2281 ccttgtggac agcctgggca gtggcattgt ctactcagcc cttacctgcc acctgtgcgg
     2341 ccacctgaaa cagtgtcatg gccaggagga tggtggccag acccctgtca tggccagtcc
     2401 ttgctgtggc tgctgctgtg gagacaggtc ctcgccccct acaacccccc tgagggcccc
     2461 agacccctct ccaggtgggg ttccactgga ggccagtctg tgtccggcct ccctggcacc
     2521 ctcgggcatc tcagagaaga gtaaatcctc atcatccttc catcctgccc ctggcaatgc
     2581 tcagagctca agccagaccc ccaaaatcgt gaactttgtc tccgtgggac ccacatacat
     2641 gagggtctct taggtgcatg tcctcttgtt gctgagtctg cagatgagga ctagggctta
     2701 tccatgcctg ggaaatgcca cctcctggaa ggcagccagg ctggcagatt tccaaaagac
     2761 ttgaagaacc atggtatgaa ggtgattggc cccactgacg ttggcctaac actgggctgc
     2821 agagactgga ccccgcccag cattgggctg ggctcgccac atcccatgag agtagagggc
     2881 actgggtcgc cgtgccccac ggcaggcccc tgcaggaaaa ctgaggccct tgggcacctc
     2941 gacttgtgaa cgagttgttg gctgctccct ccacagcttc tgcagcagac tgtccctgtt
     3001 gtaactgccc aaggcatgtt ttgcccacca gatcatggcc cacgtggagg cccacctgcc
     3061 tctgtctcac tgaactagaa gccgagccta gaaactaaca cagccatcaa gggaatgact
     3121 tgggcggcct tgggaaatcg atgagaaatt gaacttcagg gagggtggtc attgcctaga
     3181 ggtgctcatt catttaacag agcttcctta ggttgatgct ggaggcagaa tcccggctgt
     3241 caaggggtgt tcagttaagg ggagcaacag aggacatgaa aaattgctat gactaaagca
     3301 gggacaattt gctgccaaac acccatgccc agctgtatgg ctgggggctc ctcgtatgca
     3361 tggaaccccc agaataaata tgctcagcca ccctgtgggc cgggcaatcc agacagcagg
     3421 cataaggcac cagttaccct gcatgttggc ccagacctca ggtgctaggg aaggcgggaa
     3481 ccttgggttg agtaatgctc gtctgtgtgt tttagtttca tcacctgtta tctgtgtttg
     3541 ctgaggagag tggaacagaa ggggtggagt tttgtataaa taaagtttct ttgtctc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M15395. Human leukocyte a...[gi:186933] Links  


LOCUS       HUMLAP                  2776 bp    mRNA    linear   PRI 06-JAN-1995
DEFINITION  Human leukocyte adhesion protein (LFA-1/Mac-1/p150,95 family) beta
            subunit mRNA.
ACCESSION   M15395
VERSION     M15395.1  GI:186933
KEYWORDS    cell adhesion molecule; cell surface glycoprotein; glycoprotein;
            leukocyte adhesion protein.
SOURCE      Human tonsil, cDNA to mRNA, clones 18.1.1, 9.1.1 and 3.1.1.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2776)
  AUTHORS   Kishimoto,T.K., O'Connor,K., Lee,A., Roberts,T.M. and Springer,T.A.
  TITLE     Cloning of the beta subunit of the leukocyte adhesion proteins:
            homology to an extracellular matrix receptor defines a novel
            supergene family
  JOURNAL   Cell 48 (4), 681-690 (1987)
  MEDLINE   87131080
   PUBMED   3028646
COMMENT     Draft entry and computer-readable copy of sequence in [1] kindly
            provided by T.K.Kishimoto, 22-APR-1987.
FEATURES             Location/Qualifiers
     source          1..2776
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1q23-q25"
     gene            1..2776
                     /gene="LYAM1"
     CDS             73..2382
                     /gene="LYAM1"
                     /note="leukocyte adhesion protein beta-subunit precursor"
                     /codon_start=1
                     /protein_id="AAA59490.1"
                     /db_xref="GI:307113"
                     /db_xref="GDB:G00-120-157"
                     /translation="MLGLRPPLLALVGLLSLGCVLSQECTKFKVSSCRECIESGPGCT
                     WCQKLNFTGPGDPDSIRCDTRPQLLMRGCAADDIMDPTSLAETQEDHNGGQKQLSPQK
                     VTLYLRPGQAAAFNVTFRRAKGYPIDLYYLMDLSYSMLDDLRNVKKLGGDLLRALNEI
                     TESGRIGFGSFVDKTVLPFVNTHPDKLRNPCPNKEKECQPPFAFRHVLKLTNNSNQFQ
                     TEVGKQLISGNLDAPEGGLDAMMQVAACPEEIGWRNVTRLLVFATDDGFHFAGDGKLG
                     AILTPNDGRCHLEDNLYKRSNEFDYPSVGQLAHKLAENNIQPIFAVTSRMVKTYEKLT
                     EIIPKSAVGELSEDSSNVVHLIKNAYNKLSSRVFLDHNALPDTLKVTYDSFCSNGVTH
                     RNQPRGDCDGVQINVPITFQVKVTATECIQEQSFVIRALGFTDIVTVQVLPQCECRCR
                     DQSRDRSLCHGKGFLECGICRCDTGYIGKNCECQTQGRSSQELEGSCRKDNNSIICSG
                     LGDCVCGQCLCHTSDVPGKLIYGQYCECDTINCERYNGQVCGGPGRGLCFCGKCRCHP
                     GFEGSACQCERTTEGCLNPRRVECSGRGRCRCNVCECHSGYQLPLCQECPGCPSPCGK
                     YISCAECLKFEKGPFGKNCSAACPGLQLSNNPVKGRTCKERDSEGCWVAYTLEQQDGM
                     DRYLIYVDESRECVAGPNIAAIVGGTVAGIVLIGILLLVIWKALIHLSDLREYRRFEK
                     EKLKSQWNNDNPLFKSATTTVMNPKFAES"
     sig_peptide     73..138
                     /gene="LYAM1"
                     /note="leukocyte adhesion protein signal peptide;
                     G00-120-157"
     mat_peptide     139..2379
                     /gene="LYAM1"
                     /product="leukocyte adhesion protein beta-subunit;
                     G00-120-157"
BASE COUNT      610 a    817 c    834 g    515 t
ORIGIN      194 bp upstream of ApaI site; chromosoome 21.
        1 cagggcagac tggtagcaaa gcccccacgc ccagccagga gcaccgccgc ggactccagc
       61 acaccgaggg acatgctggg cctgcgcccc ccactgctcg ccctggtggg gctgctctcc
      121 ctcgggtgcg tcctctctca ggagtgcacg aagttcaagg tcagcagctg ccgggaatgc
      181 atcgagtcgg ggcccggctg cacctggtgc cagaagctga acttcacagg gccgggggat
      241 cctgactcca ttcgctgcga cacccggcca cagctgctca tgaggggctg tgcggctgac
      301 gacatcatgg accccacaag cctcgctgaa acccaggaag accacaatgg gggccagaag
      361 cagctgtccc cacaaaaagt gacgctttac ctgcgaccag gccaggcagc agcgttcaac
      421 gtgaccttcc ggcgggccaa gggctacccc atcgacctgt actatctgat ggacctctcc
      481 tactccatgc ttgatgacct caggaatgtc aagaagctag gtggcgacct gctccgggcc
      541 ctcaacgaga tcaccgagtc cggccgcatt ggcttcgggt ccttcgtgga caagaccgtg
      601 ctgccgttcg tgaacacgca ccctgataag ctgcgaaacc catgccccaa caaggagaaa
      661 gagtgccagc ccccgtttgc cttcaggcac gtgctgaagc tgaccaacaa ctccaaccag
      721 tttcagaccg aggtcgggaa gcagctgatt tccggaaacc tggatgcacc cgagggtggg
      781 ctggacgcca tgatgcaggt cgccgcctgc ccggaggaaa tcggctggcg caacgtcacg
      841 cggctgctgg tgtttgccac tgatgacggc ttccatttcg cgggcgacgg aaagctgggc
      901 gccatcctga cccccaacga cggccgctgt cacctggagg acaacttgta caagaggagc
      961 aacgaattcg actacccatc ggtgggccag ctggcgcaca agctggctga aaacaacatc
     1021 cagcccatct tcgcggtgac cagtaggatg gtgaagacct acgagaaact caccgagatc
     1081 atccccaagt cagccgtggg ggagctgtct gaggactcca gcaatgtggt ccatctcatt
     1141 aagaatgctt acaataaact ctcctccagg gtcttcctgg atcacaacgc cctccccgac
     1201 accctgaaag tcacctacga ctccttctgc agcaatggag tgacgcacag gaaccagccc
     1261 agaggtgact gtgatggcgt gcagatcaat gtcccgatca ccttccaggt gaaggtcacg
     1321 gccacagagt gcatccagga gcagtcgttt gtcatccggg cgctgggctt cacggacata
     1381 gtgaccgtgc aggttcttcc ccagtgtgag tgccggtgcc gggaccagag cagagaccgc
     1441 agcctctgcc atggcaaggg cttcttggag tgcggcatct gcaggtgtga cactggctac
     1501 attgggaaaa actgtgagtg ccagacacag ggccggagca gccaggagct ggaaggaagc
     1561 tgccggaagg acaacaactc catcatctgc tcagggctgg gggactgtgt ctgcgggcag
     1621 tgcctgtgcc acaccagcga cgtccccggc aagctgatat acgggcagta ctgcgagtgt
     1681 gacaccatca actgtgagcg ctacaacggc caggtctgcg gcggcccggg gagggggctc
     1741 tgcttctgcg ggaagtgccg ctgccacccg ggctttgagg gctcagcgtg ccagtgcgag
     1801 aggaccactg agggctgcct gaacccgcgg cgtgttgagt gtagtggtcg tggccggtgc
     1861 cgctgcaacg tatgcgagtg ccattcaggc taccagctgc ctctgtgcca ggagtgcccc
     1921 ggctgcccct caccctgtgg caagtacatc tcctgcgccg agtgcctgaa gttcgaaaag
     1981 ggcccctttg ggaagaactg cagcgcggcg tgtccgggcc tgcagctgtc gaacaacccc
     2041 gtgaagggca ggacctgcaa ggagagggac tcagagggct gctgggtggc ctacacgctg
     2101 gagcagcagg acgggatgga ccgctacctc atctatgtgg atgagagccg agagtgtgtg
     2161 gcaggcccca acatcgccgc catcgtcggg ggcaccgtgg caggcatcgt gctgatcggc
     2221 attctcctgc tggtcatctg gaaggctctg atccacctga gcgacctccg ggagtacagg
     2281 cgctttgaga aggagaagct caagtcccag tggaacaatg ataatcccct tttcaagagc
     2341 gccaccacga cggtcatgaa ccccaagttt gctgagagtt aggagcactt ggtgaagaca
     2401 aggccgtcag gacccaccat gtctgcccca tcacgcggcc gagacatggc ttggccacag
     2461 ctcttgagga tgtcaccaat taaccagaaa tccagttatt ttccgccctc aaaatgacag
     2521 ccatggccgg ccggtgcttc tgggggctcg tcggggggac agctccactc tgactggcac
     2581 agtctttgca tggagacttg aggagggctt gaggttggtg aggttaggtg cgtgtttcct
     2641 gtgcaagtca ggacatcagt ctgattaaag gtggtgccaa tttatttaca tttaaacttg
     2701 tcagggtata aaatgacatc ccattaatta tattgttaat caatcacgtg tatagaaaaa
     2761 aaaataaaac ttcaat
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF041262. Homo sapiens immu...[gi:4104892] Links  


LOCUS       AF041262                1395 bp    mRNA    linear   PRI 05-JAN-1999
DEFINITION  Homo sapiens immunoglobulin-like transcript 8 mRNA, complete cds.
ACCESSION   AF041262
VERSION     AF041262.1  GI:4104892
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1395)
  AUTHORS   Colonna,M.
  TITLE     Immunoglobulin-like transcript 8
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1395)
  AUTHORS   Colonna,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-JAN-1998) Basel Institute for Immunology, 487
            Grenzacherstrasse, Basel CH-4005, Switzerland
FEATURES             Location/Qualifiers
     source          1..1395
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q13.4"
                     /clone="3"
                     /cell_type="leukocyte"
     CDS             1..1395
                     /function="activating receptor"
                     /note="immunoglobulin superfamily member; ILT8"
                     /codon_start=1
                     /product="immunoglobulin-like transcript 8"
                     /protein_id="AAD02204.1"
                     /db_xref="GI:4104893"
                     /translation="MTPILTVLICLGLSLGPRTHVQAGPFPKPTLWAEPGSVISWGSP
                     VTIWCQGSLEAQEYRLDKEGSPEPWDRNNPLEPKNKARFSIPSITEHHAGRYRCHYYS
                     SAGWSEPSDALELVMTGAYSKPTLSALPSPVVASGGNMTLQCGSQKGYHQFVLMKEGE
                     HQLPRTLDSQQLHSGGFQALFPVGPVNPSHRWRFTCYYYYMNTPRVWSHPSDPLEILP
                     SGVSRKPSLLTLQGPVLAPGQSLTLQCGSDVGYDRFVLYKEGERDFLQRPGQQPQAGL
                     SQANFTLGPVSPSHGGQYRCYGAHNLSSEWSAPSDPLNILMAGQIYDTVSLSAQPGPT
                     VASGENVTLLCQSWWQFDTFLLTKEGAAHPPLRLRSMYGAHKYQAEFPMSPVTSAHAG
                     TYRCYGSYSSNPHLLSFPSEPLELMVSASHAKDYTVENLIRMGMAGLVLVFLGILLFE
                     AQHSQRNPQDAARR"
BASE COUNT      292 a    463 c    381 g    259 t
ORIGIN      
        1 atgaccccca tcctcacggt cctgatctgt ctcgggctga gtctgggccc caggacccac
       61 gtgcaggcag ggcccttccc caaacccacc ctctgggctg agccaggctc tgtgatcagc
      121 tgggggagcc ccgtgaccat ctggtgtcag gggagcctgg aggcccagga gtaccgactg
      181 gataaagagg gaagcccaga gccctgggac agaaataacc cactggaacc caagaacaag
      241 gccagattct ccatcccatc cataacagag caccatgcgg ggagataccg ctgccactat
      301 tacagctctg caggctggtc agagcccagc gacgccctgg agctggtgat gacaggagcc
      361 tatagcaaac ccaccctctc agccctgccc agccctgtgg tggcctcagg ggggaatatg
      421 accctccaat gtggctcaca gaagggatat caccaatttg ttctgatgaa ggaaggagaa
      481 caccagctcc cccggaccct ggactcacag cagctccaca gtggggggtt ccaggccctg
      541 ttccctgtgg gccccgtgaa ccccagccac aggtggaggt tcacatgcta ttactattat
      601 atgaacaccc cccgggtgtg gtcccacccc agtgaccccc tggagattct gccctcaggg
      661 gtgtctagga agccctccct cctgaccctg cagggccctg tcctggcccc tgggcagagc
      721 ctgaccctcc agtgtggctc tgatgtcggc tacgacagat ttgttctgta taaggagggg
      781 gaacgtgact tcctccagcg ccctggccag cagccccagg ctgggctctc ccaggccaac
      841 ttcaccctgg gccctgtgag cccctcccac gggggccagt acaggtgcta tggtgcacac
      901 aacctctcct ccgagtggtc ggcccccagc gaccccctga acatcctgat ggcaggacag
      961 atctatgaca ccgtctccct gtcagcacag ccgggcccca cagtggcctc aggagagaac
     1021 gtgaccctgc tgtgtcagtc atggtggcag tttgacactt tccttctgac caaagaaggg
     1081 gcagcccatc ccccgctgcg tctgagatca atgtacggag ctcataagta ccaggctgaa
     1141 ttccccatga gtcctgtgac ctcagcccac gcggggacct acaggtgcta cggctcatac
     1201 agctccaacc cccacctgct gtctttcccc agtgagcccc tggaactcat ggtctcagcc
     1261 tcacacgcca aggattacac agtggagaat ctcatccgca tgggcatggc aggcttggtc
     1321 ctggtgttcc tcgggattct gttatttgag gctcagcaca gccagagaaa cccccaagat
     1381 gcagcaagga ggtaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: AU076460. AU076460 Sugano c...[gi:7438889] Links  


IDENTIFIERS

dbEST Id:       4098193
EST name:       AU076460
GenBank Acc:    AU076460
GenBank gi:     7438889

CLONE INFO
Clone Id:       ColF0655
DNA type:       cDNA

PRIMERS
PolyA Tail:     Unknown

SEQUENCE
                ATCCCCTACACTGTTGTCCTACTGCAGCTATTTCAAGGCGCGCGCCTCGTGGTGGACTCA
                CCGCTAGCCCGCAGCGCTCGGCTTCCTGGTAATTCTTCACCTCTTTTCTCAGCTCCCTGC
                AGCATGGGTGCTGGGCCCTCCTTGCTGCTCGCCGCCCTCCTGCTGNTTCTCTC

Entry Created:  Apr 5 2000
Last Updated:   May 4 2000

COMMENTS
                Suzuki,Y., Yoshitomo-Nakagawa,K., Maruyama,K., Suyama,A. and
                Sugano,S. Construction and characterization of a full
                length-enriched and a 5'-end-enriched cDNA library. Gene 200
                (1-2), 149-156 (1997)
                This clone was obtained from a 'full length-enriched' cDNA
                library constructed by 'Oligo-Capping' method. The coding
                region starts from the 50 bp upstream to the 3'-end.

PUTATIVE ID     Assigned by submitter
                5'-end region of H.sapiens mRNA for cathepsin C

LIBRARY
Lib Name:       Sugano cDNA library
Organism:       Homo sapiens

SUBMITTER
Name:           Yutaka Suzuki
Lab:            Department of Virology
Institution:    Institute of Medical Science, University of Tokyo
Address:        4-6-1, Shirokanedai, Minatoku, Tokyo 108-8639, Japan
E-mail:         ysuzuki@ims.u-tokyo.ac.jp

CITATIONS
Medline UID:    20221373
Title:          Statistical analysis of the 5' untranslated region of human
                mRNA using 'Oligo-Capped' cDNA libraries
Authors:        Suzuki,Y., Ishihara,D., Sasaki,M., Nakagawa,H., Hata,H.,
                Tsunoda,T., Watanabe,M., Komatsu,T., Ota,T., Isogai,T.,
                Suyama,A., Sugano,S.
Citation:       Genomics 64 (3): 286-297 2000


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X87212. H.sapiens mRNA fo...[gi:1006656] Links  


LOCUS       HSCATHCGE               1838 bp    mRNA    linear   PRI 07-NOV-1995
DEFINITION  H.sapiens mRNA for cathepsin C.
ACCESSION   X87212
VERSION     X87212.1  GI:1006656
KEYWORDS    cathepsin C.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Paris,A., Strukelj,B., Pungercar,J., Renko,M., Dolenc,I. and
            Turk,V.
  TITLE     Molecular cloning and sequence analysis of human preprocathepsin C
  JOURNAL   FEBS Lett. 369 (2-3), 326-330 (1995)
  MEDLINE   95377428
REFERENCE   2  (bases 1 to 1838)
  AUTHORS   Gubensek,F.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-MAY-1995) F. Gubensek, J. Stefan Institute,
            Department of Biochemistry and Mol. Bio., Jamova 39, POB 100, 61111
            Ljubljana, SLOVENIA
FEATURES             Location/Qualifiers
     source          1..1838
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="C1"
                     /cell_line="E.Coli Y1090"
                     /tissue_type="ileum"
                     /clone_lib="cDNA in lambda gt11"
                     /dev_stage="adult"
     CDS             34..1425
                     /EC_number="3.4.14.1"
                     /codon_start=1
                     /product="cathepsin C"
                     /protein_id="CAA60671.1"
                     /db_xref="GI:1006657"
                     /db_xref="SWISS-PROT:P53634"
                     /translation="MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVG
                     SSGSQRDVNCSVMGPQEKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWF
                     AFFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQ
                     EKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPA
                     PLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTN
                     NSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMK
                     EDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG
                     LRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAI
                     ESIAVAATPIPKL"
     sig_peptide     34..105
     mat_peptide     724..1422
                     /product="ttg start"
     polyA_signal    1815..1820
     polyA_signal    1829..1835
BASE COUNT      511 a    402 c    411 g    514 t
ORIGIN      
        1 aattcttcac ctcttttctc agctccctgc agcatgggtg ctgggccctc cttgctgctc
       61 gccgccctcc tgctgcttct ctccggcgac ggcgccgtgc gctgcgacac acctgccaac
      121 tgcacctatc ttgacctgct gggcacctgg gtcttccagg tgggctccag cggttcccag
      181 cgcgatgtca actgctcggt tatgggacca caagaaaaaa aagtagtggt gtaccttcag
      241 aagctggata cagcatatga tgaccttggc aattctggcc atttcaccat catttacaac
      301 caaggctttg agattgtgtt gaatgactac aagtggtttg ccttttttaa gtataaagaa
      361 gagggcagca aggtgaccac ttactgcaac gagacaatga ctgggtgggt gcatgatgtg
      421 ttgggccgga actgggcttg tttcaccgga aagaaggtgg gaactgcctc tgagaatgtg
      481 tatgtcaaca cagcacacct taagaattct caggaaaagt attctaatag gctctacaag
      541 tatgatcaca actttgtgaa agctatcaat gccattcaga agtcttggac tgcaactaca
      601 tacatggaat atgagactct taccctggga gatatgatta ggagaagtgg tggccacagt
      661 cgaaaaatcc caaggcccaa acctgcacca ctgactgctg aaatacagca aaagattttg
      721 catttgccaa catcttggga ctggagaaat gttcatggta tcaattttgt cagtcctgtt
      781 cgaaaccaag catcctgtgg cagctgctac tcatttgctt ctatgggtat gctagaagcg
      841 agaatccgta tactaaccaa caattctcag accccaatcc taagccctca ggaggttgtg
      901 tcttgtagcc agtatgctca aggctgtgaa ggcggcttcc cataccttat tgcaggaaag
      961 tacgcccaag attttgggct ggtggaagaa gcttgcttcc cctacacagg cactgattct
     1021 ccatgcaaaa tgaaggaaga ctgctttcgt tattactcct ctgagtacca ctatgtagga
     1081 ggtttctatg gaggctgcaa tgaagccctg atgaagcttg agttggtcca tcatgggccc
     1141 atggcagttg cttttgaagt atatgatgac ttcctccact acaaaaaggg gatctaccac
     1201 cacactggtc taagagaccc tttcaacccc tttgagctga ctaatcatgc tgttctgctt
     1261 gtgggctatg gcactgactc agcctctggg atggattact ggattgttaa aaacagctgg
     1321 ggcaccggct ggggtgagaa tggctacttc cggatccgca gaggaactga tgagtgtgca
     1381 attgagagca tagcagtggc agccacacca attcctaaat tgtagggtat gccttccagt
     1441 atttcataat gatctgcatc agttgtaaag gggaattggt atattcacag actgtagact
     1501 ttcagcagca atctcagaag cttacaaata gatttccatg aagatatttg tcttcagaat
     1561 taaaactgcc cttaatttta atataccttt caatcggcca ctggccattt ttttctaagt
     1621 attcaattaa gtgggaattt tctggaagat ggtcagctat gaagtaatag agtttgctta
     1681 atcatttgta attcaaacat gctatatttt ttaaaatcaa tgtgaaaaca tagacttatt
     1741 tttaaattgt accaatcaca agaaaataat ggcaataatt atcaaaactt ttaaaataga
     1801 tgctcatatt tttaaaataa agttttaaaa ataactgc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X53064. Homo sapiens SPRR...[gi:3367692] Links  


LOCUS       HSX53064                2580 bp    DNA     linear   PRI 18-NOV-1998
DEFINITION  Homo sapiens SPRR2A gene encoding small proline rich protein.
ACCESSION   X53064 X53065
VERSION     X53064.1  GI:3367692
KEYWORDS    small proline rich protein; SPRR2A gene.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Gibbs,S., Lohman,F., Teubel,W., van de Putte,P. and Backendorf,C.
  TITLE     Characterization of the human spr2 promoter: induction after UV
            irradiation or TPA treatment and regulation during differentiation
            of cultured primary keratinocytes
  JOURNAL   Nucleic Acids Res. 18 (15), 4401-4407 (1990)
  MEDLINE   90356372
REFERENCE   2
  AUTHORS   Gibbs,S., Fijneman,R., Wiegant,J., van Kessel,A.G., van De Putte,P.
            and Backendorf,C.
  TITLE     Molecular characterization and evolution of the SPRR family of
            keratinocyte differentiation markers encoding small proline-rich
            proteins
  JOURNAL   Genomics 16 (3), 630-637 (1993)
  MEDLINE   93315153
REFERENCE   3
  AUTHORS   Fischer,D.F., Gibbs,S., van De Putte,P. and Backendorf,C.
  TITLE     Interdependent transcription control elements regulate the
            expression of the SPRR2A gene during keratinocyte terminal
            differentiation
  JOURNAL   Mol. Cell. Biol. 16 (10), 5365-5374 (1996)
  MEDLINE   96413286
REFERENCE   4
  AUTHORS   Fischer,D.F., van Drunen,C.M., Winkler,G.S., van de Putte,P. and
            Backendorf,C.
  TITLE     Involvement of a nuclear matrix association region in the
            regulation of the SPRR2A keratinocyte terminal differentiation
            marker
  JOURNAL   Nucleic Acids Res. 26 (23), 5288-5294 (1998)
  MEDLINE   99045591
REFERENCE   5
  AUTHORS   Backendorf,C.M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-MAY-1990) Backendorf C.M.P., Lab. Molecular Genetics,
            Dept. Biochemistry, University of Leiden, Gorlaeus Laboratories,
            Einsteinweg 5, 2300 RA Leiden, NL
  REMARK    Revised by [5]
REFERENCE   6  (bases 1 to 2580)
  AUTHORS   Backendorf,C.M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-JUL-1998) Backendorf C.M.P., Lab. Molecular Genetics,
            Dept. Biochemistry, University of Leiden, Gorlaeus Laboratories,
            Einsteinweg 5, 2300 RA Leiden, NL
COMMENT     On or before Oct 14, 2002 this sequence version replaced gi:35697,
            gi:35696.
            See M20030 for mRNA.
FEATURES             Location/Qualifiers
     source          1..2580
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="1"
                     /map="q21"
                     /tissue_type="blood"
                     /clone_lib="EMBL3"
     repeat_region   956..967
                     /note="direct repeat A1"
     repeat_region   1017..1028
                     /note="direct repeat A2"
     repeat_region   1078..1089
                     /note="direct repeat A3"
     misc_feature    1241..1247
                     /note="AP-1 box"
     protein_bind    1320..1335
                     /note="TDE-1"
                     /bound_moiety="Oct-11"
     protein_bind    1373..1387
                     /note="TDE-2; interferon- stimulated response element
                     (ISRE)"
                     /bound_moiety="IRF-1; IRF-2"
     protein_bind    complement(1382..1385)
                     /note="TDE-3; Ets binding site"
                     /bound_moiety="ESE-1"
     protein_bind    1389..1397
                     /note="TDE-4; zinc finger binding site"
                     /bound_moiety="Sp1-like proteins"
     gene            1408..2433
                     /gene="SPRR2A"
     TATA_signal     1408..1413
                     /gene="SPRR2A"
     exon            1438..1479
                     /gene="SPRR2A"
                     /number=1
     intron          1480..2195
                     /gene="SPRR2A"
                     /number=1
     exon            2196..>2433
                     /gene="SPRR2A"
                     /number=2
     CDS             2215..2433
                     /gene="SPRR2A"
                     /note="cornified envelope precursor; contains 3 internal
                     nonapeptide repeats; head and tail domains are substrates
                     for transglutaminase-mediated cross-linking"
                     /codon_start=1
                     /product="small proline-rich protein"
                     /protein_id="CAA37239.1"
                     /db_xref="GI:3367693"
                     /translation="MSYQQQQCKQPCQPPPVCPTPKCPEPCPPPKCPEPCPPPKCPQP
                     CPPQQCQQKYPPVTPSPPCQSKYPPKSK"
BASE COUNT      804 a    504 c    539 g    732 t      1 others
ORIGIN      
        1 aagcttcctt ggaattaatg gtcagataag gagctctagc aatatcactt taaatgctta
       61 atatacaata tttagaaaac cttatgattg taaagagctt aaaaaagatg tgaaagaaca
      121 atcactaaar gcattgacaa catatgtgtg ttagtgaaca atatgtacca aaatggacag
      181 atgagaggtg tacattgggg tgtgagttga taatccagca gactgtggga ctagagggtc
      241 tacgaaaaac aaaggaagaa catacaaaca aattttaaaa cactgtcttt caacactaaa
      301 actgttggaa tagagagcaa gaatacatat tggattcaca ttcagtttat ttctcagatt
      361 cagactagtg ctctgactct gcactctgca acctactcta atattacata tgataacatg
      421 agtctatgca gctgttctct atagatattc acagacacta cagaatgttt tggagattat
      481 gtaatattta aaatattcaa attatactaa aaatgtatgt aaaatgtatt gaacataggc
      541 aagtttcaat acatagattt tgagtgaatg cttgcaactt tggttccatt ctctctactt
      601 ccttaaggtg gtgtccaaga gtacattttt ataaataaaa agttatagta cactcctaag
      661 ggcagcaagt agaaaacgtg ctagggagac tcgatctcac tttggaatct atcctgggag
      721 acaaatgcct ctacaaatgg attagagaag acagttttaa agaggaagat aatcaggtaa
      781 aatctggggt tttatgagag aaagaaagag gtagaagaaa aaatttcaag ctcgaacatc
      841 ggatcaggtg gcacaatgcg gtcaatgcct gcaaactcag ggtaagtatt attctccctg
      901 tttacagttc cgtggaggag aagtgacttg cctgtggtca tacaacagag caaagaaaag
      961 gcttgagcta gaactcaggc ctttgttagg tctccccttc ctcctagcac attggcaaat
     1021 tgcatgagga aagtagaggt acagttgagt tcatgtacaa caataaggca ttcaggtaaa
     1081 gtgaatgagg gcagaagttt tatgatttag ggaaggtgta agacaggaaa atatctttgt
     1141 tcccaattaa gaaagagatc ccttgaccat cagttagaga ttcccccaag tccctctttg
     1201 ccataagtca ctgaaactga gatccaaggc atggcttctg tgagtcagga gagcttaacc
     1261 cagaggagag atttcagaac aggatatttc ctattttgag tatcctgctc atgccagtca
     1321 tggataaatt tgcatctggc ttaagaaatt actggatcag cattgttttg ggtagtttca
     1381 cttcctgctg ggtggggtag caggctctat aaagagatcc tctgctgcac gactcttaaa
     1441 cccctggtac ctgagcactg atctgccttg gagaacctgg tgagtcggct tccttgagtt
     1501 cctctgttct ttgtgccctg aaatgttgag tttaatctga atatggcaag tttggtggat
     1561 ccaatcctat gaaaattgac ttgatgctac ttagtggatg aaaatttaag attagagcac
     1621 aattatatgc tattttagct ttcttttgtt atacaggtag gtatccatat ggacagagaa
     1681 gttaaggggt aacctttgat atgaagaaga aaaaagaaca aagtattttt ctttattctc
     1741 ttgctttcta gtgtccttta caaaggtttg tgtcttagca ggtgtgaaag actacaattc
     1801 tccctgagca gccctttgct ctatgcccaa gtcagcccac ttggacttta taacagataa
     1861 tgatgatagg aatagcatat tagattgccc agggtgtctg aacttgtgac tgcctttctt
     1921 gaattggtta ttttcaggga aataagatgc ttgattcttt ataacagaga taatttattt
     1981 ggaaaaattg tatgagaaaa cacaggattt cctagggaca atgaagcaat ttgttaaagt
     2041 ggaagggaga aaccagaaag tcttgaaaag gtaattaaga atttaaataa tttcttggag
     2101 attggagaaa taatatgcca tggtattaca caagctttgg cttctctctc tggaggattc
     2161 ccttcccacg aacactgttg tatcatttct ttcagatcct gagactccag caggatgtct
     2221 tatcaacagc agcagtgcaa gcagccctgc cagccacctc ctgtgtgccc cacgccaaag
     2281 tgcccagagc catgtccacc cccgaagtgc cctgagccct gcccaccacc aaagtgtcca
     2341 cagccctgcc cacctcagca gtgccagcag aaatatcctc ctgtgacacc ttccccaccc
     2401 tgccagtcaa agtatccacc caagagcaag taacagcttc agaattcatc aggaccaaga
     2461 aaggataagg atatttggct cacctcgttc cacagctcca ccttcatctt ctcatcaaag
     2521 cctaccatgg atacacaggg agcttctttc tccttagcca gtaatctgcc catgatgatc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U60319. Homo sapiens haem...[gi:1469789] Links  


LOCUS       HSU60319                2727 bp    mRNA    linear   PRI 29-OCT-1997
DEFINITION  Homo sapiens haemochromatosis protein (HLA-H) mRNA, complete cds.
ACCESSION   U60319
VERSION     U60319.1  GI:1469789
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2727)
  AUTHORS   Feder,J.N., Gnirke,A., Thomas,W., Tsuchihashi,Z., Ruddy,D.A.,
            Basava,A., Dormishian,F., Domingo,R., Ellis,M.C., Fullan,A.,
            Hinton,L.M., Jones,N.L., Kimmel,B.E., Kronmal,G.S., Lauer,P.,
            Lee,V.K., Loeb,D.B., Mapa,F., McClelland,E., Meyer,N.C.,
            Mintier,G.A., Moeller,N., Moore,T., Morkang,E., Prass,C.E.,
            Quintana,L., Stranes,S.M., Schatzman,R.C., Brunke,K.J.,
            Drayna,D.T., Risch,N.J., Bacon,B.R. and Wolff,R.K.
  TITLE     A novel MHC class I-like gene is mutated in patients with
            hereditary haemochromatosis
  JOURNAL   Nat. Genet. 13 (4), 399-408 (1996)
  MEDLINE   96331279
   PUBMED   8696333
REFERENCE   2  (bases 1 to 2727)
  AUTHORS   Feder,J.N., Gnirke,A., Thomas,W., Tsuchihashi,Z., Ruddy,D.A.,
            Basava,A., Dormishian,F., Domingo,R., Ellis,M.C., Fullan,A.,
            Hinton,L.M., Jones,N.L., Kimmel,B.E., Kronmal,G.S., Lauer,P.,
            Lee,V.K., Loeb,D.B., Mapa,F., McClelland,E., Meyer,N.C.,
            Mintier,G.A., Moeller,N., Moore,T., Morkang,E., Prass,C.E.,
            Quintana,L., Stranes,S.M., Schatzman,R.C., Brunke,K.J.,
            Drayna,D.T., Risch,N.J., Bacon,B.R. and Wolff,R.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-JUN-1996) Mercator Genetics, 4040 Campbell Ave.,
            Menlo Park, CA 94025, USA
FEATURES             Location/Qualifiers
     source          1..2727
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
     gene            1..2727
                     /gene="HLA-H"
                     /note="HFE"
     CDS             222..1268
                     /gene="HLA-H"
                     /codon_start=1
                     /product="haemochromatosis protein"
                     /protein_id="AAC51823.1"
                     /db_xref="GI:1469790"
                     /translation="MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGL
                     SLFEALGYVDDQLFVFYDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDF
                     WTIMENHNHSKESHTLQVILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEP
                     RAWPTKLEWERHKIRARQNRAYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTS
                     SVTTLRCRALNYYPQNITMKWLKDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGE
                     EQRYTCQVEHPGLDQPLIVIWEPSPSGTLVIGVISGIAVFVVILFIGILFIILRKRQG
                     SRGAMGHYVLAERE"
BASE COUNT      702 a    606 c    660 g    759 t
ORIGIN      
        1 ggggacactg gatcacctag tgtttcacaa gcaggtacct tctgctgtag gagagagaga
       61 actaaagttc tgaaagacct gttgcttttc accaggaagt tttactgggc atctcctgag
      121 cctaggcaat agctgtaggg tgacttctgg agccatcccc gtttccccgc cccccaaaag
      181 aagcggagat ttaacgggga cgtgcggcca gagctgggga aatgggcccg cgagccaggc
      241 cggcgcttct cctcctgatg cttttgcaga ccgcggtcct gcaggggcgc ttgctgcgtt
      301 cacactctct gcactacctc ttcatgggtg cctcagagca ggaccttggt ctttccttgt
      361 ttgaagcttt gggctacgtg gatgaccagc tgttcgtgtt ctatgatcat gagagtcgcc
      421 gtgtggagcc ccgaactcca tgggtttcca gtagaatttc aagccagatg tggctgcagc
      481 tgagtcagag tctgaaaggg tgggatcaca tgttcactgt tgacttctgg actattatgg
      541 aaaatcacaa ccacagcaag gagtcccaca ccctgcaggt catcctgggc tgtgaaatgc
      601 aagaagacaa cagtaccgag ggctactgga agtacgggta tgatgggcag gaccaccttg
      661 aattctgccc tgacacactg gattggagag cagcagaacc cagggcctgg cccaccaagc
      721 tggagtggga aaggcacaag attcgggcca ggcagaacag ggcctacctg gagagggact
      781 gccctgcaca gctgcagcag ttgctggagc tggggagagg tgttttggac caacaagtgc
      841 ctcctttggt gaaggtgaca catcatgtga cctcttcagt gaccactcta cggtgtcggg
      901 ccttgaacta ctacccccag aacatcacca tgaagtggct gaaggataag cagccaatgg
      961 atgccaagga gttcgaacct aaagacgtat tgcccaatgg ggatgggacc taccagggct
     1021 ggataacctt ggctgtaccc cctggggaag agcagagata tacgtgccag gtggagcacc
     1081 caggcctgga tcagcccctc attgtgatct gggagccctc accgtctggc accctagtca
     1141 ttggagtcat cagtggaatt gctgtttttg tcgtcatctt gttcattgga attttgttca
     1201 taatattaag gaagaggcag ggttcaagag gagccatggg gcactacgtc ttagctgaac
     1261 gtgagtgaca cgcagcctgc agactcactg tgggaaggag acaaaactag agactcaaag
     1321 agggagtgca tttatgagct cttcatgttt caggagagag ttgaacctaa acatagaaat
     1381 tgcctgacga actccttgat tttagccttc tctgttcatt tcctcaaaaa gatttcccca
     1441 tttaggtttc tgagttcctg catgccggtg atccctagct gtgacctctc ccctggaact
     1501 gtctctcatg aacctcaagc tgcatctaga ggcttccttc atttcctccg tcacctcaga
     1561 gacatacacc tatgtcattt catttcctat ttttggaaga ggactcctta aatttggggg
     1621 acttacatga ttcattttaa catctgagaa aagctttgaa ccctgggacg tggctagtca
     1681 taaccttacc agatttttac acatgtatct atgcattttc tggacccgtt caacttttcc
     1741 tttgaatcct ctctctgtgt tacccagtaa ctcatctgtc accaagcctt ggggattctt
     1801 ccatctgatt gtgatgtgag ttgcacagct atgaaggctg tgcactgcac gaatggaaga
     1861 ggcacctgtc ccagaaaaag catcatggct atctgtgggt agtatgatgg gtgtttttag
     1921 caggtaggag gcaaatatct tgaaaggggt tgtgaagagg tgttttttct aattggcatg
     1981 aaggtgtcat acagatttgc aaagtttaat ggtgccttca tttgggatgc tactctagta
     2041 ttccagacct gaagaatcac aataattttc tacctggtct ctccttgttc tgataatgaa
     2101 aattatgata aggatgataa aagcacttac ttcgtgtccg actcttctga gcacctactt
     2161 acatgcatta ctgcatgcac ttcttacaat aattctatga gataggtact attatcccca
     2221 tttctttttt aaatgaagaa agtgaagtag gccgggcacg gtggctcgcg cctgtggtcc
     2281 cagggtgctg agattgcagg tgtgagccac cctgcccagc cgtcaaaaga gtcttaatat
     2341 atatatccag atggcatgtg tttactttat gttactacat gcacttggct gcataaatgt
     2401 ggtacaacca ttctgtcttg aagggcaggt gcttcaggat accatataca gctcagaagt
     2461 ttcttcttta ggcattaaat tttagcaaag atatctcatc tcttctttta aaccattttc
     2521 tttttttgtg gttagaaaag ttatgtagaa aaaagtaaat gtgatttacg ctcattgtag
     2581 aaaagctata aaatgaatac aattaaagct gttatttaat tagccagtga aaaactatta
     2641 acaacttgtc tattacctgt tagtattatt gttgcattaa aaatgcatat actttaataa
     2701 atgtacattg tattgtaaaa aaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF153606. Homo sapiens angi...[gi:5231136] Links  


LOCUS       AF153606                1860 bp    mRNA    linear   PRI 28-JUN-1999
DEFINITION  Homo sapiens angiopoietin-related protein mRNA, complete cds.
ACCESSION   AF153606
VERSION     AF153606.1  GI:5231136
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1860)
  AUTHORS   Kim,M.K., Kim,Y.H., Seo,J.M., Lee,H.M., Chung,H.J., Sohn,M.Y.,
            Hwang,S.Y., Im,S.U., Jung,E.J., Lee,J.H. and Kim,J.C.
  TITLE     A catalogue of genes in the human dermal papilla cells as
            identified by expressed sequence tags
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1860)
  AUTHORS   Kim,M.K., Kim,Y.H., Suh,J.M., Lee,H.M., Chung,H.J., Sohn,M.Y.,
            Hwang,S.Y., Im,S.U., Jung,E.J. and Kim,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-1999) Immunology, Kyungpook National University,
            School of Medicine, 101 Dongin Dong, Jung Gu, Taegu, Taegu 700-422,
            South Korea
FEATURES             Location/Qualifiers
     source          1..1860
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="primary culture"
                     /cell_type="hair dermal papilla"
     CDS             154..1371
                     /note="contains fibrinogen-like domain"
                     /codon_start=1
                     /product="angiopoietin-related protein"
                     /protein_id="AAD41088.1"
                     /db_xref="GI:5231137"
                     /translation="MSGAPTAGAALMLCAATAVLLSAQGGPVQSKSPRFASWDEMNVL
                     AHGLLQLGQGCANTGAHPQSAERAGARLSACGSACQGTEGSTDLPLAPESRVDPEVLH
                     SLQTQLKAQNSRIQQLFHKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEVAKPAR
                     RKRLPEMAQPVDPAHNVSRLHRLPRDCQELFQVGERQSGLFEIQPQGSPPFLVNCKMT
                     SDGGWTVIQRRHDGSVDFNRPWEAYKAGFGDPHGEFWLGLEKVHSITGDRNSRLAVQL
                     RDWDGNAELLQFSVHLGGEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFSTWDQDHDL
                     RRDKNCAKSLSGGWWFGTCSHSNLNGQYFRSIPQQRQKLKKGIFWKTWRGRYYPLQAT
                     TMLIQPMAAEAAS"
BASE COUNT      391 a    584 c    573 g    312 t
ORIGIN      
        1 gcggatcctc acacgactgt gatccgattc tttccagcgg cttctgcaac caagcgggtc
       61 ttacccccgg tcctccgcgt ctccagtcct cgcacctgga accccaacgt ccccgagagt
      121 ccccgaatcc ccgctcccag gctacctaag aggatgagcg gtgctccgac ggccggggca
      181 gccctgatgc tctgcgccgc caccgccgtg ctactgagcg ctcagggcgg acccgtgcag
      241 tccaagtcgc cgcgctttgc gtcctgggac gagatgaatg tcctggcgca cggactcctg
      301 cagctcggcc aggggtgcgc gaacaccgga gcgcacccgc agtcagctga gcgcgctgga
      361 gcgcgcctga gcgcgtgcgg gtccgcctgt cagggaaccg aggggtccac cgacctcccg
      421 ttagcccctg agagccgggt ggaccctgag gtccttcaca gcctgcagac acaactcaag
      481 gctcagaaca gcaggatcca gcaactcttc cacaaggtgg cccagcagca gcggcacctg
      541 gagaagcagc acctgcgaat tcagcatctg caaagccagt ttggcctcct ggaccacaag
      601 cacctagacc atgaggtggc caagcctgcc cgaagaaaga ggctgcccga gatggcccag
      661 ccagttgacc cggctcacaa tgtcagccgc ctgcaccggc tgcccaggga ttgccaggag
      721 ctgttccagg ttggggagag gcagagtgga ctatttgaaa tccagcctca ggggtctccg
      781 ccatttttgg tgaactgcaa gatgacctca gatggaggct ggacagtaat tcagaggcgc
      841 cacgatggct cagtggactt caaccggccc tgggaagcct acaaggcggg gtttggggat
      901 ccccacggcg agttctggct gggtctggag aaggtgcata gcatcacggg ggaccgcaac
      961 agccgcctgg ccgtgcagct gcgggactgg gatggcaacg ccgagttgct gcagttctcc
     1021 gtgcacctgg gtggcgagga cacggcctat agcctgcagc tcactgcacc cgtggccggc
     1081 cagctgggcg ccaccaccgt cccacccagc ggcctctccg tacccttctc cacttgggac
     1141 caggatcacg acctccgcag ggacaagaac tgcgccaaga gcctctctgg aggctggtgg
     1201 tttggcacct gcagccattc caacctcaac ggccagtact tccgctccat cccacagcag
     1261 cggcagaagc ttaagaaggg aatcttctgg aagacctggc ggggccgcta ctacccgctg
     1321 caggccacca ccatgttgat ccagcccatg gcagcagagg cagcctccta gcgtcctggc
     1381 tgggcctggt cccaggccca cgaaagacgg tgactcttgg ctctgcccga ggatgtggcc
     1441 aagaccacga ctggagaagc cccctttctg agtgcagggg ggctgcatgc gttgcctcct
     1501 gagatcgagg ctgcaggata tgctcagact ctagaggcgt ggaccaaggg gcatggagct
     1561 tcactccttg ctggccaggg agttggggac tcagagggac cacttggggc cagccagact
     1621 ggcctcaatg gcggactcag tcacattgac tgacggggac cagggcttgt gtgggtcgag
     1681 agcgccctca tggtgctggt gctgttgtgt gtaggtcccc tggggacaca agcaggcgcc
     1741 aatggtatct gggcggagct cacagagttc ttggaataaa agcaacctca gaacaaaaaa
     1801 aaaaaaaaaa aagcggagct cacagagttc ttggaataaa agcaacctca gaacaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF039290. Homo sapiens lysy...[gi:4204400] Links  


LOCUS       AF039290                2192 bp    DNA     linear   PRI 06-MAY-1999
DEFINITION  Homo sapiens lysyl oxidase gene, 5' flanking region, exon 1 and
            partial cds.
ACCESSION   AF039290
VERSION     AF039290.1  GI:4204400
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2192)
  AUTHORS   Contente,S., Kenyon,K., Sriraman,P., Subramanyan,S. and
            Friedman,R.M.
  TITLE     Epigenetic inhibition of lysyl oxidase transcription after
            transformation by ras oncogene
  JOURNAL   Mol. Cell. Biochem. 194 (1-2), 79-91 (1999)
  MEDLINE   99318011
   PUBMED   10391127
REFERENCE   2  (bases 1 to 2192)
  AUTHORS   Contente,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-DEC-1997) Pathology, USUHS, 4301 Jones Bridge Road,
            Bethesda, MD 20814-4799, USA
FEATURES             Location/Qualifiers
     source          1..2192
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="5"
                     /map="5q23.3-q31.2"
                     /cell_type="B-cells"
     repeat_region   1..260
                     /rpt_family="Alu"
                     /rpt_type=dispersed
     mRNA            <2074..>2192
                     /product="lysyl oxidase"
     CDS             2074..>2192
                     /EC_number="1.4.3.13"
                     /codon_start=1
                     /product="lysyl oxidase"
                     /protein_id="AAD10703.1"
                     /db_xref="GI:4204401"
                     /translation="MRFAWTVLLLGPLQLCALVHCAPPAAGQQQPPREPPAAPG"
     exon            <2074..>2192
                     /number=1
BASE COUNT      522 a    545 c    501 g    624 t
ORIGIN      
        1 gaattctcca gcctcagcct ccagagtagt ttcgggatta cagacatgtg cccaccatgc
       61 ccagctaatt ttttgctttg tttgtttgtt tgtttgtatt tttagtagag atcggggttt
      121 tgcccatttg gccaggcctg gtctcgaact cctgacctcg aatgataatg atccgcccgc
      181 ccttggcctc ccaaagtgct aggattacag gtgtgagcca ctgcgccagg cctgggcact
      241 ttctttagta gtttgaggag caacattttt gacagtgtcc ttctgctcaa gattcagatc
      301 ccagataaaa ttaaaccatc tagagagatg gcttgattgg ccaaacctgg atctcatgac
      361 cacttcttga agtgggtaag tctcataaat gctcagtcct tccactatgc aactgagtgg
      421 ggtgggtggg aagcccctca aaggaaaatc cggttgttct tactagaaag aaaaggaaaa
      481 tggatgtgag gcagtcaaaa tcagcagagg ccaccacacc accaaaatgt ggtgattaaa
      541 tatggagaga cagagactaa cagaggtatg tgaatattga agtatgtctg gacaatagcc
      601 caatgatgag accaataaaa tggttaccaa aatctggttt tgagtagtag tgttaaatca
      661 gaccatttgt aaccattttt tgttgcaaag tttctagcac tgcccaaacc ctgagtggta
      721 tatgaataac tcgtccatta tgtatctctt cccagtcagc ataatttatc ccccacctat
      781 attcttttct gaccactcct acttccttct ctttaccaaa atctaaactc taaggctgtt
      841 tcttcagcaa cttctttgtt tagattggaa gataaattaa acagcatgcg atgttttact
      901 gactttcagt atttaacaga ggtgatttaa ttttttttta aatccaaagt caaacttctt
      961 tataagatga aggagaaaaa tgtcttataa aatgcatatg tgaagatgcc ttctgagtgc
     1021 tttctcatgc agacttgttc tagtctttaa tgaatcttcc ttgtagacac tgtggagatg
     1081 aaagatggtt ctccacttct actcaaagta caaatcaggc cggcattttg aaaaagagac
     1141 aggtttattc atagctgcag cgttagctgg ctttgttccc tgtacaattt cacttttggt
     1201 tattaaaata ttcactgtag gaaataaatt tgtaacccat ttctcatatt acctacacac
     1261 agaaaaacaa aatttgatat cctggggttt atttgctgag ggcgcttccc ataaaagcga
     1321 gagagtgtgc gttgggaaat gtgtctggtt aactctttta tggataaact ttagtcacaa
     1381 tcctcccccg cccccctctc acccccagca ccctcccaac ctcccgactt cccgcctctc
     1441 aagggctggt gacctaatag catttttctt cgtgcatatt ttggcgtcgc cccatggcct
     1501 ggctgccttc gcctgtctga gttttttgaa attcctgcat gttcgcccca gattaagcca
     1561 gtgtgtctca ggatgtgtgt tccgttttgt tctttcccct taacgctccc tgtgcaacgt
     1621 gtctgggggg aggagggcag ggacgggaga gagggagggg cagaggcgag gagctgtccg
     1681 ccttgcacgt ttccaatcgc attacgtgaa caaatagctg aggggccgcc ccgccagaac
     1741 ggcttgtgta actttgcaaa cgtgccagaa agtttaaaat ctctcctcct tccttcactc
     1801 cagacactgc ccgctctccg ggactgccgc cgctccccgt tgccttccag gactgagaaa
     1861 ggggaaaggg aagggtgcca cgtccgagca gccgccttga ctggggaagg gtctgaatcc
     1921 cacccttggc attgcttggt ggagactgag atacccgtgc tccgctcgcc tccttggttg
     1981 aagatttctc cttccctcac gtgatttgag ccccgttttt attttctgtg agccacgtcc
     2041 tcctcgagcg gggtcaatct ggcaaaagga gtgatgcgct tcgcctggac cgtgctcctg
     2101 ctcgggcctt tgcagctctg cgcgctagtg cactgcgccc ctcccgccgc cggccaacag
     2161 cagcccccgc gcgagccgcc ggcggctccg gg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF039291. Homo sapiens lysy...[gi:4104738] Links  


LOCUS       AF039291                1935 bp    mRNA    linear   PRI 06-MAY-1999
DEFINITION  Homo sapiens lysyl oxidase mRNA, complete cds.
ACCESSION   AF039291
VERSION     AF039291.1  GI:4104738
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1935)
  AUTHORS   Contente,S., Kenyon,K., Sriraman,P., Subramanyan,S. and
            Friedman,R.M.
  TITLE     Epigenetic inhibition of lysyl oxidase transcription after
            transformation by ras oncogene
  JOURNAL   Mol. Cell. Biochem. 194 (1-2), 79-91 (1999)
  MEDLINE   99318011
   PUBMED   10391127
REFERENCE   2  (bases 1 to 1935)
  AUTHORS   Kenyon,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-DEC-1997) Pathology, USUHS, 4301 Jones Bridge Rd.,
            Bethesda, MD 20814-4799, USA
FEATURES             Location/Qualifiers
     source          1..1935
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="5"
                     /map="5q23.3-q31.2"
                     /cell_type="fibroblast"
     CDS             250..1503
                     /EC_number="1.4.3.13"
                     /codon_start=1
                     /product="lysyl oxidase"
                     /protein_id="AAD02130.1"
                     /db_xref="GI:4104739"
                     /translation="MRFAWTVLLLGPLQLCALVHCAPPAAGQQQPPREPPAAPGAWRQ
                     QIQWENNGQVFSLLSLGSQYQPQRRRDPGAAVPGAANASAQQPRTPILLIRDNRTAAA
                     RTRTAGSSGVTAGRPRPTARHWFQAGYSTSRAREAGASRAENQTAPGEVPALSNLRPP
                     SRVDGMVGDDPYNPYKYSDDNPYYNYYDTYERPRPGGRYRPGYGTGYFQYGLPDLVAD
                     PYYIQASTYVQKMSMYNLRCAAEENCLASTAYRADVRDYDHRVLLRFPQRVKNQGTSD
                     FLPSRPRYSWEWHSCHQHYHSMDEFSHYDLLDANTQRRVAEGHKASFCLEDTSCDYGY
                     HRRFACTAHTQGLSPGCYDTYGADIDCQWIDITDVKPGNYILKVSVNPSYLVPESDYT
                     NNVVRCDIRYTGHHAYASGCTISPY"
BASE COUNT      486 a    535 c    481 g    433 t
ORIGIN      
        1 ccgcgccgct ccccgttgcc ttccaggact gagaaagggg aaagggaagg gtgccacgtc
       61 cgagcagccg ccttgactgg ggaagggtct gaatcccacc cttggcattg cctggtggag
      121 actgagatac ccgtgctccg ctcgcctcct tggttgaaga tttctccttc cctcacgtga
      181 tttgagcccc gtttttattt tctgtgagcc acgtcctcct cgagcggggt caatctggca
      241 aaaggagtga tgcgcttcgc ctggaccgtg ctcctgctcg ggcctttgca gctctgcgcg
      301 ctagtgcact gcgcccctcc cgccgccggc caacagcagc ccccgcgcga gccgccggcg
      361 gctccgggcg cctggcgcca gcagatccaa tgggagaaca acgggcaggt gttcagcttg
      421 ctgagcctgg gctcacagta ccagcctcag cgccgccggg acccgggcgc cgccgtccct
      481 ggtgcagcca acgcctccgc ccagcagccc cgcactccga tcctgctgat ccgcgacaac
      541 cgcaccgccg cggcgcgaac gcggacggcc ggctcatctg gagtcaccgc tggccgcccc
      601 aggcccaccg cccgtcactg gttccaagct ggctactcga catctagagc ccgcgaagct
      661 ggcgcctcgc gcgcggagaa ccagacagcg ccgggagaag ttcctgcgct cagtaacctg
      721 cggccgccca gccgcgtgga cggcatggtg ggcgacgacc cttacaaccc ctacaagtac
      781 tctgacgaca acccttatta caactactac gatacttatg aaaggcccag acctgggggc
      841 aggtaccggc ccggatacgg cactggctac ttccagtacg gtctcccaga cctggtggcc
      901 gacccctact acatccaggc gtccacgtac gtgcagaaga tgtccatgta caacctgaga
      961 tgcgcggcgg aggaaaactg tctggccagt acagcataca gggcagatgt cagagattat
     1021 gatcacaggg tgctgctcag atttccccaa agagtgaaaa accaagggac atcagatttc
     1081 ttacccagcc gaccaagata ttcctgggaa tggcacagtt gtcatcaaca ttaccacagt
     1141 atggatgagt ttagccacta tgacctgctt gatgccaaca cccagaggag agtggctgaa
     1201 ggccacaaag caagtttctg tcttgaagac acatcctgtg actatggcta ccacaggcga
     1261 tttgcatgta ctgcacacac acagggattg agtcctggct gttatgatac ctatggtgca
     1321 gacatagact gccagtggat tgatattaca gatgtaaaac ctggaaacta tatcctaaag
     1381 gtcagtgtaa accccagcta cctggttcct gaatctgact ataccaacaa tgttgtgcgc
     1441 tgtgacattc gctacacagg acatcatgcg tatgcctcag gctgcacaat ttcaccgtat
     1501 tagaaggcaa agcaaaactc ccaatggata aatcagtgcc tggtgttctg aagtgggaaa
     1561 aaatagacta acttcagtag gatttatgta ttttgaaaaa gagaacagaa aacaacaaaa
     1621 gaatttttgt ttggactgtt ttcaataaca aagcacataa ctggattttg aacgcttaag
     1681 tcatcattac ttgggaaatt tttaatgttt attatttaca tcactttgtg aattaacaca
     1741 gtgtttcaat tctgtaatta catatttgac tctttcaaag aaatccaaat ttctcatgtt
     1801 ccttttgaaa ttgtagtgca aaatggtcag tattatctaa atgaatgagc caaaatgact
     1861 ttgaactgaa acttttctaa agtgctggaa ctttagtgaa acataataat aatgggttta
     1921 tacgacagca acgga
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH006687. Human lysosomal a...[gi:2209014] Links  


LOCUS       HSMANBS01               1086 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, 5' flanking region
            and exon 1.
ACCESSION   U60885
VERSION     U60885.1  GI:2208999
KEYWORDS    .
SEGMENT     1 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1086)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 1086)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..1086
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     misc_feature    1..591
                     /note="5' flanking region"
     exon            592..1056
                     /gene="manB"
                     /number=1
     intron          1057..>1086
                     /gene="manB"
                     /number=1
BASE COUNT      201 a    357 c    281 g    247 t
ORIGIN      
        1 ctcggctcac tgcaacctcc gcctcccagg ttcaagtgat tctcctgcct cagtctccca
       61 agtagctggg attacaggca tgtgccacca tgcccagcta attttttgta tttttacatt
      121 ttgtgtaaaa ctacaaattt tgtgttttca aacccgtctc tacaaaacaa cccatgttgc
      181 ccaggctggt ctcaaactcc caggcccaag tgattcacca gcctggccct cccaaagtgc
      241 tgggattgca ggcttgagcc actgcgcctg ccctgggctt tgttttctta tcagagaatg
      301 aactggtagg aattgggaaa ggcatgaaag actcggggtc cttccccact tgtcagaccc
      361 tcttttctct ccggaacacc cagggatcct tctcaaggag agctggatcc catccctggt
      421 ctccaggggt atttacccaa cgtcccaatc cccacagtat aggactcttc atcagatcct
      481 cctcttaggc aagctaggtc cttccaggac cctagcgctg aaggtccatg ggacggacac
      541 cctggattcc catggacaca ctacaccggc tgggaaaacc cggccccctt aggaaaagca
      601 cttctgctct accggcatta agaggcattc cgtcttggaa ttccggcatt aagaggcatt
      661 ccgtcttcat agcccgtgag acgccagtgt cacctttagc ccaaccagtg ccctgagggt
      721 ggcattttcc taccttcctg taacgacccc cgggattgcc cagggctaca gcctctctcc
      781 cgtgagcctc cagaccgccc ctggccccgc cccccacccc gataggcccg gccgggtctg
      841 ggggcggggc gtttgcggcc tttccagggc cggggaaccc caggaggaag ctgctgagcc
      901 atgggctacg cgcgggcttc gggggtctgc gctcgcggct gcctggactc agcaggcccc
      961 tggaccatgt cccgcgccct gcggccaccg ctcccgcctc tctgcttttt ccttttgttg
     1021 ctggcggctg ccggtgctcg ggccggggga tacgaggtga gtggggcctc cgagctgaaa
     1081 cgtaca
//
LOCUS       HSMANBS02                512 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exons 2 and 3.
ACCESSION   U60886
VERSION     U60886.1  GI:2209000
KEYWORDS    .
SEGMENT     2 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 512)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 512)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..512
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=1
     exon            31..133
                     /gene="manB"
                     /number=2
     intron          134..308
                     /gene="manB"
                     /number=2
     exon            309..482
                     /gene="manB"
                     /number=3
     intron          483..>512
                     /gene="manB"
                     /number=3
BASE COUNT      106 a    143 c    141 g    122 t
ORIGIN      
        1 aaggtatgtg tgtttggggt ccctgtgcag acatgcccca cagtgcagcc gaacatgctg
       61 aacgtgcacc tgctgcctca cacacatgat gacgtgggct ggctcaaaac cgtggaccag
      121 tacttttatg gaagtgagta gaggatgggg actggtcctg ggatccccat ggtccctgta
      181 atccctctgg tcctggacat tagggtgggg ccagtgctac cctaatatcc agggtttggg
      241 ctcctctgtc taggaataac ccccttggct ctgctgttcc ctgagagcct tatccctgtt
      301 atccacagtc aagaatgaca tccagcacgc cggtgtgcag tacatcctgg actcggtcat
      361 ctctgccttg ctggcagatc ccacccgtcg cttcatttac gtggagattg ccttcttctc
      421 ccgttggtgg caccagcaga caaatgccac acaggaagtc gtgcgagacc ttgtgcgcca
      481 gggtgagctt accccaagga agtgaaaaga gg
//
LOCUS       HSMANBS03                254 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exon 4.
ACCESSION   U60887
VERSION     U60887.1  GI:2209001
KEYWORDS    .
SEGMENT     3 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 254)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 254)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..254
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          1..30
                     /gene="manB"
                     /number=3
     exon            31..224
                     /gene="manB"
                     /number=4
     intron          225..254
                     /gene="manB"
                     /number=4
BASE COUNT       40 a     80 c     81 g     53 t
ORIGIN      
        1 ctgaccctga ccttgcctgt cctggcacag ggcgcctgga gttcgccaat ggtggctggg
       61 tgatgaacga tgaggcagcc acccactacg gtgccatcgt ggaccagatg acacttgggc
      121 tgcgctttct ggaggacaca tttggcaatg atgggcgacc ccgtgtggcc tggcacattg
      181 accccttcgg ccactctcgg gagcaggcct cgctgtttgc gcaggtgcga cccgggacct
      241 ctcttgggcc cact
//
LOCUS       HSMANBS04                579 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exons 5 and 6.
ACCESSION   U60888
VERSION     U60888.1  GI:2209002
KEYWORDS    .
SEGMENT     4 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 579)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 579)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..579
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=4
     exon            31..163
                     /gene="manB"
                     /number=5
     intron          164..403
                     /gene="manB"
                     /number=5
     exon            404..549
                     /gene="manB"
                     /number=6
     intron          550..>579
                     /gene="manB"
                     /number=6
BASE COUNT      112 a    162 c    172 g    133 t
ORIGIN      
        1 tcacactgcc ttttcctccc tcatccccag atgggcttcg acggcttctt ctttgggcgc
       61 cttgattatc aagataagtg ggtacggatg cagaagctgg agatggagca ggtgtggcgg
      121 gccagcacca gcctgaagcc cccgaccgcg gacctcttca ctggtagggg gcttggtgag
      181 ggcagggcca gccatggtgc cacacactca gaagggccct gggcttgata tctgctctgt
      241 tgtcactgcc ctggaattcc tatagtctgg gaacaaaggc cctgcatttc cttttgcatt
      301 gggacacaaa ttctgaagcc catcctgggt gggacatggc cggctttgaa accagggaag
      361 gtctgggtga tgggccaccc cttgaacttg gtgtgacctg caggtgtgct tcccaatggt
      421 tacaacccgc caaggaatct gtgctgggat gtgctgtgtg tcgatcagcc gctggtggag
      481 gaccctcgca gccccgagta caacgccaag gagctggtcg attacttcct aaatgtggcc
      541 actgcccagg taaccctggt gtccagaacc ttcgagtcc
//
LOCUS       HSMANBS05                178 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exon 7.
ACCESSION   U60889
VERSION     U60889.1  GI:2209003
KEYWORDS    .
SEGMENT     5 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 178)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 178)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..178
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=6
     exon            31..147
                     /gene="manB"
                     /number=7
     intron          148..>178
                     /gene="manB"
                     /number=7
BASE COUNT       39 a     53 c     45 g     41 t
ORIGIN      
        1 ctggcttaag gactcccctc ttgcctgcag ggccggtatt accgcaccaa ccacactgtg
       61 atgaccatgg gctcggactt ccaatatgag aatgccaaca tgtggttcaa gaaccttgac
      121 aagctcatcc ggctggtaaa tgcgcaggtc agtgcgccta ccctgtggta cccttgtg
//
LOCUS       HSMANBS06                508 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exons 8, 9 and 10.
ACCESSION   U60890
VERSION     U60890.1  GI:2209004
KEYWORDS    .
SEGMENT     6 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 508)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 508)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..508
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=7
     exon            31..113
                     /gene="manB"
                     /number=8
     intron          114..196
                     /gene="manB"
                     /number=8
     exon            197..317
                     /gene="manB"
                     /number=9
     intron          318..399
                     /gene="manB"
                     /number=9
     exon            400..478
                     /gene="manB"
                     /number=10
     intron          479..>508
                     /gene="manB"
                     /number=10
BASE COUNT       94 a    165 c    157 g     92 t
ORIGIN      
        1 gagggctcac tccgtcgcct cccccagcag caggcaaaag gaagcagtgt ccatgttctc
       61 tactccaccc ccgcttgtta cctctgggag ctgaacaagg ccaacctcac ctggtatttg
      121 gggaaactgg ggagcttggg ggggttggca tgccccgtgg gtcatgaccc tgccctcaat
      181 gcccctgccg ctgtaggtca gtgaaacatg acgacttctt cccttacgcg gatggccccc
      241 accagttctg gaccggttac ttttccagtc ggccggccct caaacgctac gagcgcctca
      301 gctacaactt cctgcaggtg ggtaggagcc gggctagagg gggcatgcag ccccgaggcc
      361 cgacaggctg ggcgccccaa catacccctc tgcctccagg tgtgcaacca gctggaggcg
      421 ctggtgggcc tggcggccaa cgtgggaccc tatggctccg gagacagtgc acccctcagt
      481 aagtgtcggg cccaagaggg gaagaggt
//
LOCUS       HSMANBS07                170 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exon 11.
ACCESSION   U60891
VERSION     U60891.1  GI:2209005
KEYWORDS    .
SEGMENT     7 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 170)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 170)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..170
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=10
     exon            31..140
                     /gene="manB"
                     /number=11
     intron          141..>170
                     /gene="manB"
                     /number=11
BASE COUNT       31 a     53 c     60 g     26 t
ORIGIN      
        1 aacctcactg gactcattgt ctatgagcag atgaggcgat ggctgtgctc cagcatcacg
       61 acgccgtcag cggcacctcc cgccagcacg tggccaacga ctacgcgcgc cagcttgcgg
      121 caggctgggg gccttgcgag gtgcgcgggg cgagacttgg gagacacggg 
//
LOCUS       HSMANBS08                546 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exon 12 and 13.
ACCESSION   U60892
VERSION     U60892.1  GI:2209006
KEYWORDS    .
SEGMENT     8 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 546)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 546)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..546
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=11
     exon            31..138
                     /gene="manB"
                     /number=12
     intron          139..399
                     /gene="manB"
                     /number=12
     exon            400..516
                     /gene="manB"
                     /number=13
     intron          517..>546
                     /gene="manB"
                     /number=13
BASE COUNT      114 a    145 c    173 g    114 t
ORIGIN      
        1 acccaccggt ccctttgcgc tcttccgcag gttcttctga gcaacgcgct ggcgcggctc
       61 agaggcttca aagatcactt caccttttgc caacagctaa acatcagcat ctgcccgctc
      121 agccagacgg cggcgcgcgt gacgcgggac gggaggggtg gatctagggc agatgggctt
      181 tagagggggt agttggaaaa tgtttttgga ggactataca ggagtgaaat tacgtgggct
      241 gcgaagctgg gtcagcagag agaacaacgc atctcgaggg ggcttggcct taagtgccgt
      301 gaccacacta ggaccagcca ggggtgtttc tgtgcaaagt gggtgggttt ggagaaggcc
      361 tgctgtgacc catgccctct ctgacccccg cctccccagt tccaggtcat cgtttataat
      421 cccctggggc ggaaggtgaa ttggatggta cggctgccgg tcagcgaagg cgttttcgtt
      481 gtgaaggacc ccaatggcag gacagtgccc agcgatgtga gcccaaacaa cgaatattcc
      541 cccgct
//
LOCUS       HSMANBS09                246 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exon 14.
ACCESSION   U60893
VERSION     U60893.1  GI:2209007
KEYWORDS    .
SEGMENT     9 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 246)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 246)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..246
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=13
     exon            31..216
                     /gene="manB"
                     /number=14
     intron          217..>246
                     /gene="manB"
                     /number=14
BASE COUNT       46 a     94 c     51 g     55 t
ORIGIN      
        1 tgactcagtt tccctcctcc tttatccgag gtggtaatat ttcccagctc agacagccag
       61 gcgcaccctc cggagctgct gttctcagcc tcactgcccg ccctgggctt cagcacctat
      121 tcagtagccc aggtgcctcg ctggaagccc caggcccgcg caccacagcc catccccaga
      181 agatcctggt cccctgcttt aaccatcgaa aatgaggtga gaccccattt caatcccctt
      241 tcctgc
//
LOCUS       HSMANBS10                368 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exon 15 and 16.
ACCESSION   U60894
VERSION     U60894.1  GI:2209008
KEYWORDS    .
SEGMENT     10 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 368)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 368)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..368
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=14
     exon            31..128
                     /gene="manB"
                     /number=15
     intron          129..220
                     /gene="manB"
                     /number=15
     exon            221..338
                     /gene="manB"
                     /number=16
     intron          339..>368
                     /gene="manB"
                     /number=16
BASE COUNT       86 a    104 c    101 g     77 t
ORIGIN      
        1 aaacccatct gtggaccctt ttctgcccag cacatccggg caacgtttga tcctgacaca
       61 gggctgttga tggagattat gaacatgaat cagcaactcc tgctgcctgt tcgccagacc
      121 ttcttctggt aagggaagat caccaggcct gagggtgggc tggtggtggt cggcatggag
      181 ctaggtcccc ttacctgact ctcacctgcc ccaactccag gtacaacgcc agtataggtg
      241 acaacgaaag tgaccaggcc tcaggtgcct acatcttcag acccaaccaa cagaaaccgc
      301 tgcctgtgag ccgctgggct cagatccacc tggtgaaggt cagggactag gaatgatgag
      361 tgggcagt
//
LOCUS       HSMANBS11                370 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exons 17 and 18.
ACCESSION   U60895
VERSION     U60895.1  GI:2209009
KEYWORDS    .
SEGMENT     11 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 370)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 370)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..370
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=16
     exon            31..149
                     /gene="manB"
                     /number=17
     intron          150..238
                     /gene="manB"
                     /number=17
     exon            239..340
                     /gene="manB"
                     /number=18
     intron          341..>370
                     /gene="manB"
                     /number=18
BASE COUNT       79 a     93 c    132 g     66 t
ORIGIN      
        1 cttacaccca cctcacccgt tgtcccgcag acacccttgg tgcaggaggt gcaccagaac
       61 ttctcagctt ggtgttccca ggtggttcgc ctgtacccag gacagcggca cctggagcta
      121 gagtggtcgg tggggccgat acctgtgggg tgagtggcac aggctgggag aggggtgtgg
      181 aagcaagggc agaggggttt atccaaggct cacaaccttg ccatcccatt gggtacagcg
      241 acacctgggg gaaggaggtc atcagccgtt ttgacacacc gctggagaca aagggacgct
      301 tctacacaga cagcaatggc cgggagatcc tggagaggag gtggggggtg actgagagca
      361 ctgagggggt 
//
LOCUS       HSMANBS12                353 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exons 19 and 20.
ACCESSION   U60896
VERSION     U60896.1  GI:2209010
KEYWORDS    .
SEGMENT     12 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 353)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 353)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..353
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=18
     exon            31..118
                     /gene="manB"
                     /number=19
     intron          119..242
                     /gene="manB"
                     /number=19
     exon            243..323
                     /gene="manB"
                     /number=20
     intron          324..>353
                     /gene="manB"
                     /number=20
BASE COUNT       81 a    118 c     89 g     65 t
ORIGIN      
        1 caagcctgat cagctcaccc ccaaccccag gcgggattat cgacccacct ggaaactgaa
       61 ccagacggag cccgtggcag gaaactacta tccagtcaac acccggattt acatcacggt
      121 agctctcccc catcctgcac ctccccacct cgatagaaag ggaatcaccc cttatctgca
      181 gcatctcaaa gctgcctggg gttggggttg actgccctct actttcaccc ttcaactccc
      241 aggatggaaa catgcagctg actgtgctga ctgaccgctc ccaggggggc agcagcctga
      301 gagatggctc gctggagctc atggtgagtg ggtcagagcc catccgaggc cag
//
LOCUS       HSMANBS13                288 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exon 21.
ACCESSION   U60897
VERSION     U60897.1  GI:2209011
KEYWORDS    .
SEGMENT     13 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 288)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 288)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..288
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=20
     exon            31..258
                     /gene="manB"
                     /number=21
     intron          259..>288
                     /gene="manB"
                     /number=21
BASE COUNT       45 a     95 c    111 g     37 t
ORIGIN      
        1 ctcaccccct cctggccact ctccccccag gtgcaccgaa ggctgctgaa ggacgatgga
       61 cgcggagtat cggagccact aatggagaac gggtcggggg cgtgggtgcg agggcgccac
      121 ctggtgctgc tggacacagc ccaggctgca gccgccggac accggctcct ggcggagcag
      181 gaggtcctgg cccctcaggt ggtgctggcc ccgggtggcg gcgccgccta caatctcggg
      241 gctcctccgc gcacgcaggt gaggggcagc ggggtaggca gagaggac
//
LOCUS       HSMANBS14                426 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exons 22 and 23.
ACCESSION   U60898
VERSION     U60898.1  GI:2209012
KEYWORDS    .
SEGMENT     14 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 426)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 426)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..426
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     intron          <1..30
                     /gene="manB"
                     /number=21
     exon            31..186
                     /gene="manB"
                     /number=22
     intron          187..293
                     /gene="manB"
                     /number=22
     exon            294..396
                     /gene="manB"
                     /number=23
     intron          397..>426
                     /gene="manB"
                     /number=23
BASE COUNT       90 a    136 c    132 g     68 t
ORIGIN      
        1 tccaactcag ccccgccctc tctcccgcag ttctcagggc tgcgcaggga cctgccgccc
       61 tcggtgcacc tgctcacgct ggccagctgg ggccccgaaa tggtgctgct gcgcttggag
      121 caccagtttg ccgtaggaga ggattccgga cgtaacctga gcgcccccgt taccttgaac
      181 ttgagggtga gaagggcaaa attgagaagg agatcggaga gaggcaagag agagggagag
      241 aagagaaacc tggctttgcc ccaactcatc tgggcccatc cccttccccg caggacctgt
      301 tctccacctt caccatcacc cgcctgcagg agaccacgct ggtggccaac cagctccgcg
      361 aggcagcctc caggctcaag tggacaacaa acacaggtgg ggccctggtc aggggtaggg
      421 aagggg
//
LOCUS       HSMANBS15                420 bp    DNA     linear   PRI 20-JUN-1997
DEFINITION  Human lysosomal alpha-mannosidase (manB) gene, exon 24, 3' flanking
            region and complete cds.
ACCESSION   U60899
VERSION     U60899.1  GI:2209013
KEYWORDS    .
SEGMENT     15 of 15
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 420)
  AUTHORS   Riise,H.M., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Genomic structure of the human lysosomal alpha-mannosidase gene
            (MANB)
  JOURNAL   Genomics 42 (2), 200-207 (1997)
  MEDLINE   97336044
   PUBMED   9192839
REFERENCE   2  (bases 1 to 420)
  AUTHORS   Riise,H.M.F., Berg,T., Nilssen,O., Romeo,G., Tollersrud,O.K. and
            Ceccherini,I.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUN-1996) Medical Biochemistry, University of Tromso,
            Breivika, Tromso N-9037, Norway
FEATURES             Location/Qualifiers
     source          1..420
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q; proximal to the centromere"
     gene            join(U60885.1:592..1056,U60886.1:31..133,
                     U60886.1:309..482,U60887.1:31..224,U60888.1:31..163,
                     U60888.1:404..549,U60889.1:31..147,U60890.1:31..113,
                     U60890.1:197..317,U60890.1:400..478,U60891.1:31..140,
                     U60892.1:31..138,U60892.1:400..516,U60893.1:31..216,
                     U60894.1:31..128,U60894.1:221..338,U60895.1:31..149,
                     U60895.1:239..340,U60896.1:31..118,U60896.1:243..323,
                     U60897.1:31..258,U60898.1:31..186,U60898.1:294..396,
                     31..244)
                     /gene="manB"
     mRNA            join(U60885.1:592..1056,U60886.1:31..133,
                     U60886.1:309..482,U60887.1:31..224,U60888.1:31..163,
                     U60888.1:404..549,U60889.1:31..147,U60890.1:31..113,
                     U60890.1:197..317,U60890.1:400..478,U60891.1:31..140,
                     U60892.1:31..138,U60892.1:400..516,U60893.1:31..216,
                     U60894.1:31..128,U60894.1:221..338,U60895.1:31..149,
                     U60895.1:239..340,U60896.1:31..118,U60896.1:243..323,
                     U60897.1:31..258,U60898.1:31..186,U60898.1:294..396,
                     31..244)
                     /gene="manB"
                     /product="lysosomal alpha-mannosidase"
     CDS             join(U60885.1:901..1056,U60886.1:31..133,
                     U60886.1:309..482,U60887.1:31..224,U60888.1:31..163,
                     U60888.1:404..549,U60889.1:31..147,U60890.1:31..113,
                     U60890.1:197..317,U60890.1:400..478,U60891.1:31..140,
                     U60892.1:31..138,U60892.1:400..516,U60893.1:31..216,
                     U60894.1:31..128,U60894.1:221..338,U60895.1:31..149,
                     U60895.1:239..340,U60896.1:31..118,U60896.1:243..323,
                     U60897.1:31..258,U60898.1:31..186,U60898.1:294..396,
                     31..143)
                     /gene="manB"
                     /codon_start=1
                     /product="lysosomal alpha-mannosidase"
                     /protein_id="AAC51362.1"
                     /db_xref="GI:2209015"
                     /translation="MGYARASGVCARGCLDSAGPWTMSRALRPPLPPLCFFLLLLAAA
                     GARAGGYETCPTVQPNMLNVHLLPHTHDDVGWLKTVDQYFYGIKNDIQHAGVQYILDS
                     VISALLADPTRRFIYVEIAFFSRWWHQQTNATQEVVRDLVRQGRLEFANGGWVMNDEA
                     ATHYGAIVDQMTLGLRFLEDTFGNDGRPRVAWHIDPFGHSREQASLFAQMGFDGFFFG
                     RLDYQDKWVRMQKLEMEQVWRASTSLKPPTADLFTGVLPNGYNPPRNLCWDVLCVDQP
                     LVEDPRSPEYNAKELVDYFLNVATAQGRYYRTNHTVMTMGSDFQYENANMWFKNLDKL
                     IRLVNAQQAKGSSVHVLYSTPACYLWELNKANLTWSVKHDDFFPYADGPHQFWTGYFS
                     SRPALKRYERLSYNFLQVCNQLEALVGLAANVGPYGSGDSAPLNEAMAVLQHHDAVSG
                     TSRQHVANDYARQLAAGWGPCEVLLSNALARLRGFKDHFTFCQQLNISICPLSQTAAR
                     FQVIVYNPLGRKVNWMVRLPVSEGVFVVKDPNGRTVPSDVVIFPSSDSQAHPPELLFS
                     ASLPALGFSTYSVAQVPRWKPQARAPQPIPRRSWSPALTIENEHIRATFDPDTGLLME
                     IMNMNQQLLLPVRQTFFWYNASIGDNESDQASGAYIFRPNQQKPLPVSRWAQIHLVKT
                     PLVQEVHQNFSAWCSQVVRLYPGQRHLELEWSVGPIPVGDTWGKEVISRFDTPLETKG
                     RFYTDSNGREILERRRDYRPTWKLNQTEPVAGNYYPVNTRIYITDGNMQLTVLTDRSQ
                     GGSSLRDGSLELMVHRRLLKDDGRGVSEPLMENGSGAWVRGRHLVLLDTAQAAAAGHR
                     LLAEQEVLAPQVVLAPGGGAAYNLGAPPRTQFSGLRRDLPPSVHLLTLASWGPEMVLL
                     RLEHQFAVGEDSGRNLSAPVTLNLRDLFSTFTITRLQETTLVANQLREAASRLKWTTN
                     TGPTPHQTPYQLDPANITLEPMEIRTFLASVQWKEVDG"
     intron          <1..30
                     /gene="manB"
                     /number=23
     exon            31..244
                     /gene="manB"
                     /number=24
     misc_feature    245..420
                     /note="3' flanking region"
BASE COUNT       86 a    123 c    111 g    100 t
ORIGIN      
        1 tcactcctcc ttccccctgc acctctccag gccccacacc ccaccaaact ccgtaccagc
       61 tggacccggc caacatcacg ctggaaccca tggaaatccg cactttcctg gcctcagttc
      121 aatggaagga ggtggatggt taggtctgct gggatgggcc ctccaagccc aagcctcctg
      181 ctccgggggc agaccagact ctgactctcc tcttggggct gctgcattaa aacgtactac
      241 taagactcag gtcgctctgt gactgagtgt gggttttttt tgtctgttat ttgcttgtta
      301 gagggggaca gaatttatga cccagagccg ggtgtggtgg cttgctcctg taatcctaac
      361 aactctggag gctgaggaga gagaatcact tgagcccagg agatctagac tagcctgggc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Y00062. Human mRNA for T2...[gi:34275] Links  


LOCUS       HSLCA                   4597 bp    mRNA    linear   PRI 23-MAR-1995
DEFINITION  Human mRNA for T200 leukocyte common antigen (CD45, LC-A).
ACCESSION   Y00062
VERSION     Y00062.1  GI:34275
KEYWORDS    alternative splicing; cell surface antigen; cell surface
            glycoprotein; leukocyte common antigen; phosphoprotein; T200
            glycoprotein.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4597)
  AUTHORS   Ralph,S.J., Thomas,M.L., Morton,C.C. and Trowbridge,I.S.
  TITLE     Structural variants of human T200 glycoprotein (leukocyte-common
            antigen)
  JOURNAL   EMBO J. 6 (5), 1251-1257 (1987)
  MEDLINE   87275816
REFERENCE   2  (bases 1 to 4597)
  AUTHORS   Trowbridge,I.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-JUN-1987) Trowbridge I.S. The Salk Institute, P.O.
            Box 85800 San Diego, CA 92138-926 USA
FEATURES             Location/Qualifiers
     source          1..4597
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="long arm of chromosome 1"
                     /clone="pHLC-1 and lambdaHLC-1"
                     /cell_line="IB4 (human B cell, human tonsil)"
                     /clone_lib="pcD and lambda gt10"
     CDS             147..3578
                     /note="precursor polypeptide (AA -23 to 1120)"
                     /codon_start=1
                     /protein_id="CAA68269.1"
                     /db_xref="GI:34276"
                     /db_xref="SPTREMBL:Q16614"
                     /translation="MYLWLKLLAFGFAFLDTEVFVTGQSPTPSPTDAYLNASETTTLS
                     PSGSAVISTTTIATTPSKPTCDEKYANITVDYLYNKETKLFTAKLNVNENVECGNNTC
                     TNNEVHNLTECKNASVSISHNSCTAPDKTLILDVPPGVEKFQLHDCTQVEKADTTICL
                     KWKNIETFTCDTQNITYRFQCGNMIFDNKEIKLENLEPEHEYKCDSEILYNNHKFTNA
                     SKIIKTDFGSPGEPQIIFCRSEAAHQGVITWNPPQRSFHNFTLCYIKETEKDCLNLDK
                     NLIKYDLQNLKPYTKYVLSLHAYIIAKVQRNGSAAMCHFTTKSAPPSQVWNMTVSMTS
                     DNSMHVKCRPPRDRNGPHERYHLEVEAGNTLVRNESHKNCDFRVKDLQYSTDYTFKAY
                     FHNGDYPGEPFILHHSTSYNSKALIAFLAFLIIVTSIALLVVLYKIYDLHKKRSCNLD
                     EQQELVERDDEKQLMNVEPIHADILLETYKRKIADEGRLFLAEFQSIPRVFSKFPIKE
                     ARKPFNQNKNRYVDILPYDYNRVELSEINGDAGSNYINASYIDGFKEPRKYIAAQGPR
                     DETVDDFWRMIWEQKATVIVMVTRCEEGNRNKCAEYWPSMEEGTRAFGDVVVKINQHK
                     RCPDYIIQKLNIVNKKEKATGREVTHIQFTSWPDHGVPEDPHLLLKLRRRVNAFSNFF
                     SGPIVVHCSAGVGRTGTYIGIDAMLEGLEAENKVDVYGYVVKLRRQRCLMVQVEAQYI
                     LIHQALVEYNQFGETEVNLSELHPYLHNMKKRDPPSEPSPLEAEFQRLPSYRSWRTQH
                     IGNQEENKSKNRNSNVIPYDYNRVPLKHELEMSKESEHDSDESSDDDSDSEEPSKYIN
                     ASFIMSYWKPEVMIAAQGPLKETIGDFWQMIFQRKVKVIVMLTELKHGDQEICAQYWG
                     EGKQTYGDIEVDLKDTDKSSTYTLRVFELRHSKRKDSRTVYQYQYTNWSVEQLPAEPK
                     ELISMIQVVKQKLPQKNSSEGNKHHKSTPLLIHCRDGSQQTGIFCALLNLLESAETEE
                     VVDIFQVVKALRKARPGMVSTFEQYQFLYDVIASTYPAQNGQVKKNNHQEDKIEFDNE
                     VDKVKQDANCVNPLGAPEKLPEAKEQAEGSEPTSGTEGPEHSVNGPASPALNQGS"
     sig_peptide     147..215
                     /note="signal peptide (AA -23 to -1)"
     mat_peptide     216..3575
                     /product="mature T200 glycoprotein (Aa 1-1120)"
     misc_feature    252..260
                     /note="N-glycosylation site"
     misc_feature    357..365
                     /note="N-glycosylation site"
     misc_feature    441..449
                     /note="N-glycosylation site"
     misc_feature    471..479
                     /note="N-glycosylation site"
     misc_feature    489..497
                     /note="N-glycosylation site"
     misc_feature    666..674
                     /note="N-glycosylation site"
     misc_feature    795..803
                     /note="N-glycosylation site"
     misc_feature    918..926
                     /note="N-glycosylation site"
     misc_feature    1065..1073
                     /note="N-glycosylation site"
     misc_feature    1125..1133
                     /note="N-glycosylation site"
     misc_feature    1248..1256
                     /note="N-glycosylation site"
     misc_feature    1389..1454
                     /note="put. transmembrane region"
     polyA_site      4574..4579
                     /note="region of polyA site"
BASE COUNT     1554 a    809 c    912 g   1322 t
ORIGIN      
        1 cgacatttta actgaactgc gggataaagt gaaatctttc cgtgcagctc tacgagagga
       61 ggaaattgtt cctcgtctga taagacaaca gtggagaaag gacgcatgct gtttcttagg
      121 gacacggctg acttccagat atgaccatgt atttgtggct taaactcttg gcatttggct
      181 ttgcctttct ggacacagaa gtatttgtga cagggcaaag cccaacacct tcccccactg
      241 atgcctacct taatgcctct gaaacaacca ctctgagccc ttctggaagc gctgtcattt
      301 caaccacaac aatagctact actccatcta agccaacatg tgatgaaaaa tatgcaaaca
      361 tcactgtgga ttacttatat aacaaggaaa ctaaattatt tacagcaaag ctaaatgtta
      421 atgagaatgt ggaatgtgga aacaatactt gcacaaacaa tgaggtgcat aaccttacag
      481 aatgtaaaaa tgcgtctgtt tccatatctc ataattcatg tactgctcct gataagacat
      541 taatattaga tgtgccacca ggggttgaaa agtttcagtt acatgattgt acacaagttg
      601 aaaaagcaga tactactatt tgtttaaaat ggaaaaatat tgaaaccttt acttgtgata
      661 cacagaatat tacctacaga tttcagtgtg gtaatatgat atttgataat aaagaaatta
      721 aattagaaaa ccttgaaccc gaacatgagt ataagtgtga ctcagaaata ctctataata
      781 accacaagtt tactaacgca agtaaaatta ttaaaacaga ttttgggagt ccaggagagc
      841 ctcagattat tttttgtaga agtgaagctg cacatcaagg agtaattacc tggaatcccc
      901 ctcaaagatc atttcataat tttaccctct gttatataaa agagacagaa aaagattgcc
      961 tcaatctgga taaaaacctg atcaaatatg atttgcaaaa tttaaaacct tatacgaaat
     1021 atgttttatc attacatgcc tacatcattg caaaagtgca acgtaatgga agtgctgcaa
     1081 tgtgtcattt cacaactaaa agtgctcctc caagccaggt ctggaacatg actgtctcca
     1141 tgacatcaga taatagtatg catgtcaagt gtaggcctcc cagggaccgt aatggccccc
     1201 atgaacgtta ccatttggaa gttgaagctg gaaatactct ggttagaaat gagtcgcata
     1261 agaattgcga tttccgtgta aaagatcttc aatattcaac agactacact tttaaggcct
     1321 attttcacaa tggagactat cctggagaac cctttatttt acatcattca acatcttata
     1381 attctaaggc actgatagca tttctggcat ttctgattat tgtgacatca atagccctgc
     1441 ttgttgttct ctacaaaatc tatgatctac ataagaaaag atcctgcaat ttagatgaac
     1501 agcaggagct tgttgaaagg gatgatgaaa aacaactgat gaatgtggag ccaatccatg
     1561 cagatatttt gttggaaact tataagagga agattgctga tgaaggaaga ctttttctgg
     1621 ctgaatttca gagcatcccg cgggtgttca gcaagtttcc tataaaggaa gctcgaaagc
     1681 cctttaacca gaataaaaac cgttatgttg acattcttcc ttatgattat aaccgtgttg
     1741 aactctctga gataaacgga gatgcagggt caaactacat aaatgccagc tatattgatg
     1801 gtttcaaaga acccaggaaa tacattgctg cacaaggtcc cagggatgaa actgttgatg
     1861 atttctggag gatgatttgg gaacagaaag ccacagttat tgtcatggtc actcgatgtg
     1921 aagaaggaaa caggaacaag tgtgcagaat actggccgtc aatggaagag ggcactcggg
     1981 cttttggaga tgttgttgta aagatcaacc agcacaaaag atgtccagat tacatcattc
     2041 agaaattgaa cattgtaaat aaaaaagaaa aagcaactgg aagagaggtg actcacattc
     2101 agttcaccag ctggccagac cacggggtgc ctgaggatcc tcacttgctc ctcaaactga
     2161 gaaggagagt gaatgccttc agcaatttct tcagtggtcc cattgtggtg cactgcagtg
     2221 ctggtgttgg gcgcacagga acctatatcg gaattgatgc catgctagaa ggcctggaag
     2281 ccgagaacaa agtggatgtt tatggttatg ttgtcaagct aaggcgacag agatgcctga
     2341 tggttcaagt agaggcccag tacatcttga tccatcaggc tttggtggaa tacaatcagt
     2401 ttggagaaac agaagtgaat ttgtctgaat tacatccata tctacataac atgaagaaaa
     2461 gggatccacc cagtgagccg tctccactag aggctgaatt ccagagactt ccttcatata
     2521 ggagctggag gacacagcac attggaaatc aagaagaaaa taaaagtaaa aacaggaatt
     2581 ctaatgtcat cccatatgac tataacagag tgccacttaa acatgagctg gaaatgagta
     2641 aagagagtga gcatgattca gatgaatcct ctgatgatga cagtgattca gaggaaccaa
     2701 gcaaatacat caatgcatct tttataatga gctactggaa acctgaagtg atgattgctg
     2761 ctcagggacc actgaaggag accattggtg acttttggca gatgatcttc caaagaaaag
     2821 tcaaagttat tgttatgctg acagaactga aacatggaga ccaggaaatc tgtgctcagt
     2881 actggggaga aggaaagcaa acatatggag atattgaagt tgacctgaaa gacacagaca
     2941 aatcttcaac ttataccctt cgtgtctttg aactgagaca ttccaagagg aaagactctc
     3001 gaactgtgta ccagtaccaa tatacaaact ggagtgtgga gcagcttcct gcagaaccca
     3061 aggaattaat ctctatgatt caggtcgtca aacaaaaact tccccagaag aattcctctg
     3121 aagggaacaa gcatcacaag agtacacctc tactcattca ctgcagggat ggatctcagc
     3181 aaacgggaat attttgtgct ttgttaaatc tcttagaaag tgcggaaaca gaagaggtag
     3241 tggatatttt tcaagtggta aaagctctac gcaaagctag gccaggcatg gtttccacat
     3301 tcgagcaata tcaattccta tatgacgtca ttgccagcac ctaccctgct cagaatggac
     3361 aagtaaagaa aaacaaccat caagaagata aaattgaatt tgataatgaa gtggacaaag
     3421 taaagcagga tgctaattgt gttaatccac ttggtgcccc agaaaagctc cctgaagcaa
     3481 aggaacaggc tgaaggttct gaacccacga gtggcactga ggggccagaa cattctgtca
     3541 atggtcctgc aagtccagcc ttaaatcaag gttcatagga aaagacataa atgaggaaac
     3601 tccaaacctc ctgttagctg ttatttctat ttttgtagaa gtaggaagtg aaaataggta
     3661 tacagtggat taattaaatg cagcgaacca atatttgtag aagggttata ttttactact
     3721 gtggaaaaat atttaagata gttttgccag aacagtttgt acagacgtat gcttatttta
     3781 aaattttatc tcttattcag taaaaaacaa cttctttgta atcgttatgt gtgtatatgt
     3841 atgtgtgtat gggtgtgtgt ttgtgtgaga gacagagaaa gagagagaat tctttcaagt
     3901 gaatctaaaa gcttttgctt ttcctttgtt tttatgaaga aaaaatacat tttatattag
     3961 aagtgttaac ttagcttgaa ggatctgttt ttaaaaatca taaactgtgt gcagactcaa
     4021 taaaatcatg tacatttctg aaatgacctc aagatgtcct ccttgttcta ctcatatata
     4081 tctatcttat atacttacta ttttacttct agagatagta cataaaggtg gtatgtgtgt
     4141 gtatgctact acaaaaaagt tgttaactaa attaacattg ggaaatctta tattccatat
     4201 attagcattt agtccaatgt ctttttaagc ttatttaatt aaaaaatttc cagtgagctt
     4261 atcatgctgt ctttacatgg ggttttcaat tttgcatgct cgattattcc ctgtacaata
     4321 tttaaaattt attgcttgat acttttgaca acaaattagg ttttgtacaa ttgaacttaa
     4381 ataaatgtca ttaaaataaa taaatgcaat atgtattaat attcattgta taaaaataga
     4441 agaatacaaa catatttgtt aaatatttac atatgaaatt taatatagct atttttatgg
     4501 aatttttcat tgatatgaaa aatatgatat tgcatatgca tagttcccat gttaaatccc
     4561 attcataact ttcattaaag catttacttt gaatttc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  

    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Y00638. Human mRNA for le...[gi:34280] Links  


LOCUS       HSLCAR                  4315 bp    mRNA    linear   PRI 23-MAR-1995
DEFINITION  Human mRNA for leukocyte common antigen (T200).
ACCESSION   Y00638
VERSION     Y00638.1  GI:34280
KEYWORDS    leukocyte common antigen.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4315)
  AUTHORS   Streuli,M., Hall,L.R., Saga,Y., Schlossman,S.F. and Saito,H.
  TITLE     Differential usage of three exons generates at least five different
            mRNAs encoding human leukocyte common antigens
  JOURNAL   J. Exp. Med. 166 (5), 1548-1566 (1987)
  MEDLINE   88061067
REFERENCE   2  (bases 1 to 4315)
  AUTHORS   Saito,H.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-OCT-1987) Saito H., Dana-Farber Cancer Institute, 44
            Binney Street, Boston MA 02115
COMMENT     The protein has various names: T200, B220 and CD45. Differential
            usage of three exons of the human LCA gene yields at least five
            different mRNAs thereby generating the diversity of LCA proteins.
FEATURES             Location/Qualifiers
     source          1..4315
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="chromosome 1"
                     /clone="LCA6/2"
                     /tissue_type="tonsil lymphocytes"
     CDS             87..4001
                     /note="LCA (AA -23 to 1281)"
                     /codon_start=1
                     /protein_id="CAA68669.1"
                     /db_xref="GI:34281"
                     /db_xref="SWISS-PROT:P08575"
                     /translation="MYLWLKLLAFGFAFLDTEVFVTGQSPTPSPTGLTTAKMPSVPLS
                     SDPLPTHTTAFSPASTFERENDFSETTTSLSPDNTSTQVSPDSLDNASAFNTTGVSSV
                     QTPHLPTHADSQTPSAGTDTQTFSGSAANAKLNPTPGSNAISDVPGERSTASTFPTDP
                     VSPLTTTLSLAHHSSAALPARTSNTTITANTSDAYLNASETTTLSPSGSAVISTTTIA
                     TTPSKPTCDEKYANITVDYLYNKETKLFTAKLNVNENVECGNNTCTNNEVHNLTECKN
                     ASVSISHNSCTAPDKTLILDVPPGVEKFQLHDCTQVEKADTTICLKWKNIETFTCDTQ
                     NITYRFQCGNMIFDNKEIKLENLEPEHEYKCDSEILYNNHKFTNASKIIKTDFGSPGE
                     PQIIFCRSEAAHQGVITWNPPQRSFHNFTLCYIKETEKDCLNLDKNLIKYDLQNLKPY
                     TKYVLSLHAYIIAKVQRNGSAAMCHFTTKSAPPSQVWNMTVSMTSDNSMHVKCRPPRD
                     RNGPHERYHLEVEAGNTLVRNESHKNCDFRVKDLQYSTDYTFKAYFHNGDYPGEPFIL
                     HHSTSYNSKALIAFLAFLIIVTSIALLVVLYKIYDLHKKRSCNLDEQQELVERDDEKQ
                     LMNVEPIHADILLETYKRKIADEGRPFLAEFQSIPRVFSKFPIKEARKPFNQNKNRYV
                     DILPYDYNRVELSEINGDAGSNYINASYIDGFKEPRKYIAAQGPRDETVDDFWRMIWE
                     QKATVIVMVTRCEEGNRNKCAEYWPSMEEGTRAFGDVVVKINQHKRCPDYIIQKLNIV
                     NKKEKATGREVTHIQFTSWPDHGVPEDPHLLLKLRRRVNAFSNFFSGPIVVHCSAGVG
                     RTGTYIGIDAMLEGLEAENKVDVYGYVVKLRRQRCLMVQVEAQYILIHQALVEYNQFG
                     ETEVNLSELHPYLHNMKKRDPPSEPSPLEAEFQRLPSYRSWRTQHIGNQEENKSKNRN
                     SNVIPYDYNRVPLKHELEMSKESEHDSDESSDDDSDSEEPSKYINASFIMSYWKPEVM
                     IAAQGPLKETIGDFWQMIFQRKVKVIVMLTELKHGDQEICAQYWGEGKQTYGDIEVDL
                     KDTDKSSTYTLRVFELRHSKRKDSRTVYQYQYTNWSVEQLPAEPKELISMIQVVKQKL
                     PQKNSSEGNKHHKSTPLLIHCRDGSQQTGIFCALLNLLESAETEEVVDIFQVVKALRK
                     ARLGMVSTFEQYQFLYDVIASTYPAQNGQVKKNNHQEDKIEFDNEVDKVKQDANCVNP
                     LGAPEKLPEAKEQAEGSEPTSGTEGPEHSVNGPASPALNQGS"
     sig_peptide     87..155
                     /note="signal peptide (AA -23 to -1)"
     mat_peptide     156..3998
                     /product="mature LCA (AA 1 - 1281)"
     misc_feature    156..1811
                     /note="extracellular domain"
     misc_feature    1812..1877
                     /note="transmembrane domain"
     misc_feature    1878..3998
                     /note="cytoplasmic domain"
BASE COUNT     1422 a    878 c    892 g   1123 t
ORIGIN      
        1 ggaaattgtt cctcgtctga taagacaaca gtggagaaag gacgcatgct gtttcttagg
       61 gacacggctg gcttccagat atgaccatgt atttgtggct taaactcttg gcatttggct
      121 ttgcctttct ggacacagaa gtatttgtga cagggcaaag cccaacacct tcccccactg
      181 gattgactac agcaaagatg cccagtgttc cactttcaag tgacccctta cctactcaca
      241 ccactgcatt ctcacccgca agcacctttg aaagagaaaa tgacttctca gagaccacaa
      301 cttctcttag tccagacaat acttccaccc aagtatcccc ggactctttg gataatgcta
      361 gtgcttttaa taccacaggt gtttcatcag tacagacgcc tcaccttccc acgcacgcag
      421 actcgcagac gccctctgct ggaactgaca cgcagacatt cagcggctcc gccgccaatg
      481 caaaactcaa ccctacccca ggcagcaatg ctatctcaga tgtcccagga gagaggagta
      541 cagccagcac ctttcctaca gacccagttt ccccattgac aaccaccctc agccttgcac
      601 accacagctc tgctgcctta cctgcacgca cctccaacac caccatcaca gcgaacacct
      661 cagatgccta ccttaatgcc tctgaaacaa ccactctgag cccttctgga agcgctgtca
      721 tttcaaccac aacaatagct actactccat ctaagccaac atgtgatgaa aaatatgcaa
      781 acatcactgt ggattactta tataacaagg aaactaaatt atttacagca aagctaaatg
      841 ttaatgagaa tgtggaatgt ggaaacaata cttgcacaaa caatgaggtg cataacctta
      901 cagaatgtaa aaatgcgtct gtttccatat ctcataattc atgtactgct cctgataaga
      961 cattaatatt agatgtgcca ccaggggttg aaaagtttca gttacatgat tgtacacaag
     1021 ttgaaaaagc agatactact atttgtttaa aatggaaaaa tattgaaacc tttacttgtg
     1081 atacacagaa tattacctac agatttcagt gtggtaatat gatatttgat aataaagaaa
     1141 ttaaattaga aaaccttgaa cccgaacatg agtataagtg tgactcagaa atactctata
     1201 ataaccacaa gtttactaac gcaagtaaaa ttattaaaac agattttggg agtccaggag
     1261 agcctcagat tattttttgt agaagtgaag ctgcacatca aggagtaatt acctggaatc
     1321 cccctcaaag atcatttcat aattttaccc tctgttatat aaaagagaca gaaaaagatt
     1381 gcctcaatct ggataaaaac ctgatcaaat atgatttgca aaatttaaaa ccttatacga
     1441 aatatgtttt atcattacat gcctacatca ttgcaaaagt gcaacgtaat ggaagtgctg
     1501 caatgtgtca tttcacaact aaaagtgctc ctccaagcca ggtctggaac atgactgtct
     1561 ccatgacatc agataatagt atgcatgtca agtgtaggcc tcccagggac cgtaatggcc
     1621 cccatgaacg ttaccatttg gaagttgaag ctggaaatac tctggttaga aatgagtcgc
     1681 ataagaattg cgatttccgt gtaaaagatc ttcaatattc aacagactac acttttaagg
     1741 cctattttca caatggagac tatcctggag aaccctttat tttacatcat tcaacatctt
     1801 ataattctaa ggcactgata gcatttctgg catttctgat tattgtgaca tcaatagccc
     1861 tgcttgttgt tctctacaaa atctatgatc tacataagaa aagatcctgc aatttagatg
     1921 aacagcagga gcttgttgaa agggatgatg aaaaacaact gatgaatgtg gagccaatcc
     1981 atgcagatat tttgttggaa acttataaga ggaagattgc tgatgaagga agaccttttc
     2041 tggctgaatt tcagagcatc ccgcgggtgt tcagcaagtt tcctataaag gaagctcgaa
     2101 agccctttaa ccagaataaa aaccgttatg ttgacattct tccttatgat tataaccgtg
     2161 ttgaactctc tgagataaac ggagatgcag ggtcaaacta cataaatgcc agctatattg
     2221 atggtttcaa agaacccagg aaatacattg ctgcacaagg tcccagggat gaaactgttg
     2281 atgatttctg gaggatgatt tgggaacaga aagccacagt tattgtcatg gtcactcgat
     2341 gtgaagaagg aaacaggaac aagtgtgcag aatactggcc gtcaatggaa gagggcactc
     2401 gggcttttgg agatgttgtt gtaaagatca accagcacaa aagatgtcca gattacatca
     2461 ttcagaaatt gaacattgta aataaaaaag aaaaagcaac tggaagagag gtgactcaca
     2521 ttcagttcac cagctggcca gaccacgggg tgcctgagga tcctcacttg ctcctcaaac
     2581 tgagaaggag agtgaatgcc ttcagcaatt tcttcagtgg tcccattgtg gtgcactgca
     2641 gtgctggtgt tgggcgcaca ggaacctata tcggaattga tgccatgcta gaaggcttgg
     2701 aagccgagaa caaagtggat gtttatggtt atgttgtcaa gctaaggcga cagagatgcc
     2761 tgatggttca agtagaggcc cagtacatct tgatccatca ggctttggtg gaatacaatc
     2821 agtttggaga aacagaagtg aatttgtctg aattacatcc atatctacat aacatgaaga
     2881 aaagggatcc acccagtgag ccgtctccac tagaggctga attccagaga cttccttcat
     2941 ataggagctg gaggacacag cacattggaa atcaagaaga aaataaaagt aaaaacagga
     3001 attctaatgt catcccatat gactataaca gagtgccact taaacatgag ctggaaatga
     3061 gtaaagagag tgagcatgat tcagatgaat cctctgatga tgacagtgat tcagaggaac
     3121 caagcaaata catcaatgca tcttttataa tgagctactg gaaacctgaa gtgatgattg
     3181 ctgctcaggg accactgaag gagaccattg gtgacttttg gcagatgatc ttccaaagaa
     3241 aagtcaaagt tattgttatg ctgacagaac tgaaacatgg agaccaggaa atctgtgctc
     3301 agtactgggg agaaggaaag caaacatatg gagatattga agttgacctg aaagacacag
     3361 acaaatcttc aacttatacc cttcgtgtct ttgaactgag acattccaag aggaaagact
     3421 ctcgaactgt gtaccagtac caatatacaa actggagtgt ggagcagctt cctgcagaac
     3481 ccaaggaatt aatctctatg attcaggtcg tcaaacaaaa acttccccag aagaattcct
     3541 ctgaagggaa caagcatcac aagagtacac ctctactcat tcactgcagg gatggatctc
     3601 agcaaacggg aatattttgt gctttgttaa atctcttaga aagtgcggaa acagaagagg
     3661 tagtggatat ttttcaagtg gtaaaagctc tacgcaaagc taggctaggc atggtttcca
     3721 cattcgagca atatcaattc ctatatgacg tcattgccag cacctaccct gctcagaatg
     3781 gacaagtaaa gaaaaacaac catcaagaag ataaaattga atttgataat gaagtggaca
     3841 aagtaaagca ggatgctaat tgtgttaatc cacttggtgc cccagaaaag ctccctgaag
     3901 caaaggaaca ggctgaaggt tctgaaccca cgagtggcac tgaggggcca gaacattctg
     3961 tcaatggtcc tgcaagtcca gctttaaatc aaggttcata ggaaaagaca taaatgagga
     4021 aactccaaac ctcctgttag ctgttatttc tatttttgta gaagtaggaa gtgaaaatag
     4081 gtatacagtg gattaattaa atgcagcgaa ccaatatttg tagaagggtt atattttact
     4141 actgtggaaa aatatttaag atagttttgc cagaacagtt tgtacagacg tatgcttatt
     4201 ttaaaatttt atctcttatt cagtaaaaaa caacttcttt gtaatcgtta tgagtgtata
     4261 tgtatgtgtg tatgggtgtg tgtttgtgtg agagacagag aaagagagag aattc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L49431. Homo sapiens TNFR...[gi:1160972] Links  


LOCUS       HUMSCPA                 2531 bp    mRNA    linear   PRI 18-JAN-1996
DEFINITION  Homo sapiens TNFR2-TRAF signalling complex protein mRNA, complete
            cds.
ACCESSION   L49431
VERSION     L49431.1  GI:1160972
KEYWORDS    cellular apoptosis inhibitor; signalling protein.
SOURCE      Homo sapiens cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2531)
  AUTHORS   Rothe,M., Pan,M.G., Henzel,W.J., Ayres,T.M. and Goeddel,D.V.
  TITLE     The TNFR2-TRAF signaling complex contains two novel proteins
            related to baculoviral inhibitor of apoptosis proteins
  JOURNAL   Cell 83 (7), 1243-1252 (1995)
  MEDLINE   96128127
   PUBMED   8548810
FEATURES             Location/Qualifiers
     source          1..2531
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             195..2051
                     /codon_start=1
                     /product="TNFR2-TRAF signalling complex protein"
                     /protein_id="AAC41942.1"
                     /db_xref="GI:1160973"
                     /translation="MHKTASQRLFPGPSYQNIKSIMEDSTILSDWTNSNKQKMKYDFS
                     CELYRMSTYSTFPAGVPVSERSLARAGFYYTGVNDKVKCFCCGLMLDNWKLGDSPIQK
                     HKQLYPSCSFIQNLVSASLGSTSKNTSPMRNSFAHSLSPTLEHSSLFSGSYSSLSPNP
                     LNSRAVEDISSSRTNPYSYAMSTEEARFLTYHMWPLTFLSPSELARAGFYYIGPGDRV
                     ACFACGGKLSNWEPKDDAMSEHRRHFPNCPFLENSLETLRFSISNLSMQTHAARMRTF
                     MYWPSSVPVQPEQLASAGFYYVGRNDDVKCFCCDGGLRCWESGDDPWVEHAKWFPRCE
                     FLIRMKGQEFVDEIQGRYPHLLEQLLSTSDTTGEENADPPIIHFGPGESSSEDAVMMN
                     TPVVKSALEMGFNRDLVKQTVQSKILTTGENYKTVNDIVSALLNAEDEKREEEKEKQA
                     EEMASDDLSLIRKNRMALFQQLTCVLPILDNLLKANVINKQEHDIIKQKTQIPLQARE
                     LIDTILVKGNAAANIFKNCLKEIDSTLYKNLFVDKNMKYIPTEDVSGLSLEEQLRRLQ
                     EERTCKVCMDKEVSVVFIPCGHLVVCQECAPSLRKCPICRGIIKGTVRTFLS"
BASE COUNT      786 a    436 c    522 g    787 t
ORIGIN      
        1 tctaagtagt atcttggaaa ttcagagaga tactcatcct acctgaatat aaactgagat
       61 aaatccagta aagaaagtgt agtaaattct acataagagt ctatcattga tttcttttgg
      121 tggtaaaaat cttagttcat gtgaagaaat ttcatgtgaa tgttttagct atcaaacagc
      181 actgtcacct actcatgcac aaaactgcct cccaaagact tttcccaggt ccctcgtatc
      241 aaaacattaa gagtataatg gaagatagca cgatcttgtc agattggaca aacagcaaca
      301 aacaaaaaat gaagtatgac ttttcctgtg aactctacag aatgtctaca tattcaactt
      361 tccccgccgg ggtgcctgtc tcagaaagga gtcttgctcg tgctggtttt tattatactg
      421 gtgtgaatga caaggtcaaa tgcttctgtt gtggcctgat gctggataac tggaaactag
      481 gagacagtcc tattcaaaag cataaacagc tatatcctag ctgtagcttt attcagaatc
      541 tggtttcagc tagtctggga tccacctcta agaatacgtc tccaatgaga aacagttttg
      601 cacattcatt atctcccacc ttggaacata gtagcttgtt cagtggttct tactccagcc
      661 tttctccaaa ccctcttaat tctagagcag ttgaagacat ctcttcatcg aggactaacc
      721 cctacagtta tgcaatgagt actgaagaag ccagatttct tacctaccat atgtggccat
      781 taactttttt gtcaccatca gaattggcaa gagctggttt ttattatata ggacctggag
      841 atagggtagc ctgctttgcc tgtggtggga agctcagtaa ctgggaacca aaggatgatg
      901 ctatgtcaga acaccggagg cattttccca actgtccatt tttggaaaat tctctagaaa
      961 ctctgaggtt tagcatttca aatctgagca tgcagacaca tgcagctcga atgagaacat
     1021 ttatgtactg gccatctagt gttccagttc agcctgagca gcttgcaagt gctggttttt
     1081 attatgtggg tcgcaatgat gatgtcaaat gcttttgttg tgatggtggc ttgaggtgtt
     1141 gggaatctgg agatgatcca tgggtagaac atgccaagtg gtttccaagg tgtgagttct
     1201 tgatacgaat gaaaggccaa gagtttgttg atgagattca aggtagatat cctcatcttc
     1261 ttgaacagct gttgtcaact tcagatacca ctggagaaga aaatgctgac ccaccaatta
     1321 ttcattttgg acctggagaa agttcttcag aagatgctgt catgatgaat acacctgtgg
     1381 ttaaatctgc cttggaaatg ggctttaata gagacctggt gaaacaaaca gttcaaagta
     1441 aaatcctgac aactggagag aactataaaa cagttaatga tattgtgtca gcacttctaa
     1501 atgctgaaga tgaaaaaaga gaggaggaga aggaaaaaca agctgaagaa atggcatcag
     1561 atgatttgtc attaattcgg aagaacagaa tggctctctt tcaacaattg acatgtgtgc
     1621 ttcctatcct ggataatctt ttaaaggcca atgtaattaa taaacaggaa catgatatta
     1681 ttaaacaaaa aacacagata cctttacaag cgagagaact gattgatacc attttggtta
     1741 aaggaaatgc tgcggccaac atcttcaaaa actgtctaaa agaaattgac tctacattgt
     1801 ataagaactt atttgtggat aagaatatga agtatattcc aacagaagat gtttcaggtc
     1861 tgtcactgga agaacaattg aggaggttgc aagaagaacg aacttgtaaa gtgtgtatgg
     1921 acaaagaagt ttctgttgta tttattcctt gtggtcatct ggtagtatgc caggaatgtg
     1981 ccccttctct aagaaaatgc cctatttgca ggggtataat caagggtact gttcgtacat
     2041 ttctctctta aagaaaaata gtctatattt taacctgcat aaaaaggtct ttaaaatatt
     2101 gttgaacact tgaagccatc taaagtaaaa agggaattat gagtttttca attagtaaca
     2161 ttcatgttct agtctgcttt ggtactaata atcttgtttc tgaaaagatg gtatcatata
     2221 tttaatctta atctgtttat ttacaaggga agatttatgt ttggtgaact atattagtat
     2281 gtatgtgtac ctaagggagt agtgtcactg cttgttatgc atcatttcag gagttactgg
     2341 atttgttgtt ctttcagaaa gctttgaata ctaaattata gtgtagaaaa gaactggaaa
     2401 ccaggaactc tggagttcat cagagttatg gtgccgaatt gtctttggtg cttttcactt
     2461 gtgttttaaa ataaggattt ttctcttatt tctcccccta gtttgtgaga aacatctcaa
     2521 taaagtgctt t
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U37547. Human IAP homolog...[gi:1145292] Links  


LOCUS       HSU37547                3532 bp    mRNA    linear   PRI 05-JUN-1996
DEFINITION  Human IAP homolog B (MIHB) mRNA, complete cds.
ACCESSION   U37547
VERSION     U37547.1  GI:1145292
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3532)
  AUTHORS   Uren,A.G., Pakusch,M., Hawkins,C.J., Puls,K.L. and Vaux,D.L.
  TITLE     Cloning and expression of apoptosis inhibitory protein homologs
            that function to inhibit apoptosis and/or bind tumor necrosis
            factor receptor-associated factors
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 93 (10), 4974-4978 (1996)
  MEDLINE   96209843
   PUBMED   8643514
REFERENCE   2  (bases 1 to 3532)
  AUTHORS   Uren,A.G. and Vaux,D.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-OCT-1995) Anthony G. Uren, The Walter and Eliza Hall
            Institute, Royal Parade, Parkville, Victoria 3050, Australia
FEATURES             Location/Qualifiers
     source          1..3532
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="liver"
                     /dev_stage="fetal"
     gene            1..3532
                     /gene="MIHB"
     CDS             1160..3016
                     /gene="MIHB"
                     /note="IAP homolog B; apoptosis inhibitor; Interacts with
                     TRAF1 and TRAF2 in yeast two hybrid system; Mammalian IAP
                     homolog B"
                     /codon_start=1
                     /product="MIHB"
                     /protein_id="AAC50508.1"
                     /db_xref="GI:1145293"
                     /translation="MHKTASQRLFPGPSYQNIKSIMEDSTILSDWTNSNKQKMKYDFS
                     CELYRMSTYSTFPAGVPVSERSLARAGFYYTGVNDKVKCFCCGLMLDNWKLGDSPIQK
                     HKQLYPSCSFIQNLVSASLGSTSKNTSPMRNSFAHSLSPTLEHSSLFSGSYSSLSPNP
                     LNSRAVEDISSSRTNPYSYAMSTEEARFLTYHMWPLTFLSPSELARAGFYYIGPGDRV
                     ACFACGGKLSNWEPKDDAMSEHRRHFPNCPFLENSLETLRFSISNLSMQTHAARMRTF
                     MYWPSSVPVQPEQLASAGFYYVGRNDDVKCFCCDGGLRCWESGDDPWVEHAKWFPRCE
                     FLIRMKGQEFVDEIQGRYPHLLEQLLSTSDTTGEENADPPIIHFGPGESSSEDAVMMN
                     TPVVKSALEMGFNRDLVKQTVQSKILTTGENYKTVNDIVSALLNAEDEKREEEKEKQA
                     EEMASDDLSLIRKNRMALFQQLTCVLPILDNLLKANVINKQEHDIIKQKTQIPLQARE
                     LIDTILVKGNAAANIFKNCLKEIDSTLYKNLFVDKNMKYIPTEDVSGLSLEEQLRRLQ
                     EERTCKVCMDKEVSVVFIPCGHLVVCQECAPSLRKCPICRGIIKGTVRTFLS"
     misc_feature    1295..1498
                     /gene="MIHB"
                     /note="encodes BIR repeat 1"
     misc_feature    1709..1909
                     /gene="MIHB"
                     /note="encodes BIR repeat 2"
     misc_feature    1964..2167
                     /gene="MIHB"
                     /note="encodes BIR repeat 3"
     misc_feature    2870..2977
                     /gene="MIHB"
                     /note="encodes RING finger motif"
BASE COUNT     1133 a    560 c    711 g   1128 t
ORIGIN      
        1 gaattctatg gagtgtaatt ttgtgtatga attatatttt taaaacattg aagagttttc
       61 agaaagaagg ctagtagagt tgattactga tactttatgc taagcagtac ttttttggta
      121 gtacaatatt ttgttaggcg tttctgataa cactagaaag gacaagtttt atcttgtgat
      181 aaattgatta atgtttacaa catgactgat aattatagct gaatagtcct taaatgatga
      241 acaggttatt tagtttttaa atgcagtgta aaaagtgtgc tgtggaaatt ttatggctaa
      301 ctaagtttat ggagaaaata ccttcagttg atcaagaata atagtggtat acaaagttag
      361 gaagaaagtc aacatgatgc tgcaggaaat ggaaacaaat acaaatgata tttaacaaag
      421 atagagttta cagtttttga actttaagcc aaattcattt gacatcaagc actatagcag
      481 gcacaggttc aacaaagctt gtgggtattg acttccccca aaagttgtca gctgaagtaa
      541 tttagcccac ttaagtaaat actatgatga taagctgtgt gaacttagct tttaaatagt
      601 gtgaccatat gaaggtttta attacttttg tttattggaa taaaatgaga ttttttgggt
      661 tgtcatgtta aagtgcttat agggaaagaa gcctgcatat aattttttac cttgtggcat
      721 aatcagtaat tggtctgtta ttcaggcttc atagcttgta accaaatata aataaaaggc
      781 ataatttagg tattctatag ttgcttagaa ttttgttaat ataaatctct gtgaaaaatc
      841 aaggagtttt aatattttca gaagtgcatc cacctttcag ggctttaagt tagtattact
      901 caagattatg aacaaatagc acttaggtta cctgaaagag ttactacaac cccaaagagt
      961 tgtgttctaa gtagtatctt ggtaattcag agagatactc atcctacctg aatataaact
     1021 gagataaatc cagtaaagaa agtgtagtaa attctacata agagtctatc attgatttct
     1081 ttttgtggta aaaatcttag ttcatgtgaa gaaatttcat gtgaatgttt tagctatcaa
     1141 acagtactgt cacctactca tgcacaaaac tgcctcccaa agacttttcc caggtccctc
     1201 gtatcaaaac attaagagta taatggaaga tagcacgatc ttgtcagatt ggacaaacag
     1261 caacaaacaa aaaatgaagt atgacttttc ctgtgaactc tacagaatgt ctacatattc
     1321 aactttcccc gccggggtgc ctgtctcaga aaggagtctt gctcgtgctg gtttttatta
     1381 tactggtgtg aatgacaagg tcaaatgctt ctgttgtggc ctgatgctgg ataactggaa
     1441 actaggagac agtcctattc aaaagcataa acagctatat cctagctgta gctttattca
     1501 gaatctggtt tcagctagtc tgggatccac ctctaagaat acgtctccaa tgagaaacag
     1561 ttttgcacat tcattatctc ccaccttgga acatagtagc ttgttcagtg gttcttactc
     1621 cagcctttct ccaaaccctc ttaattctag agcagttgaa gacatctctt catcgaggac
     1681 taacccctac agttatgcaa tgagtactga agaagccaga tttcttacct accatatgtg
     1741 gccattaact tttttgtcac catcagaatt ggcaagagct ggtttttatt atataggacc
     1801 tggagatagg gtagcctgct ttgcctgtgg tgggaagctc agtaactggg aaccaaagga
     1861 tgatgctatg tcagaacacc ggaggcattt tcccaactgt ccatttttgg aaaattctct
     1921 agaaactctg aggtttagca tttcaaatct gagcatgcag acacatgcag ctcgaatgag
     1981 aacatttatg tactggccat ctagtgttcc agttcagcct gagcagcttg caagtgctgg
     2041 tttttattat gtgggtcgca atgatgatgt caaatgcttt tgttgtgatg gtggcttgag
     2101 gtgttgggaa tctggagatg atccatgggt agaacatgcc aagtggtttc caaggtgtga
     2161 gttcttgata cgaatgaaag gccaagagtt tgttgatgag attcaaggta gatatcctca
     2221 tcttcttgaa cagctgttgt caacttcaga taccactgga gaagaaaatg ctgacccacc
     2281 aattattcat tttggacctg gagaaagttc ttcagaagat gctgtcatga tgaatacacc
     2341 tgtggttaaa tctgccttgg aaatgggctt taatagagac ctggtgaaac aaacagttca
     2401 aagtaaaatc ctgacaactg gagagaacta taaaacagtt aatgatattg tgtcagcact
     2461 tcttaatgct gaagatgaaa aaagagaaga ggagaaggaa aaacaagctg aagaaatggc
     2521 atcagatgat ttgtcattaa ttcggaagaa cagaatggct ctctttcaac aattgacatg
     2581 tgtgcttcct atcctggata atcttttaaa ggccaatgta attaataaac aggaacatga
     2641 tattattaaa caaaaaacac agataccttt acaagcgaga gaactgattg ataccatttt
     2701 ggttaaagga aatgctgcgg ccaacatctt caaaaactgt ctaaaagaaa ttgactctac
     2761 attgtataag aacttatttg tggataagaa tatgaagtat attccaacag aagatgtttc
     2821 aggtctgtca ctggaagaac aattgaggag gttgcaagaa gaacgaactt gtaaagtgtg
     2881 tatggacaaa gaagtttctg ttgtatttat tccttgtggt catctggtag tatgccagga
     2941 atgtgcccct tctctaagaa aatgccctat ttgcaggggt ataatcaagg gtactgttcg
     3001 tacatttctc tcttaaagaa aaatagtcta tattttaacc tgcataaaaa ggtctttaaa
     3061 atattgttga acacttgaag ccatctaaag taaaaaggga attatgagtt tttcaattag
     3121 taacattcat gttctagtct gctttggtac taataatctt gtttctgaaa agatggtatc
     3181 atatatttaa tcttaatctg tttatttaca agggaagatt tatgtttggt gaactatatt
     3241 agtatgtatg tgtacctaag ggagtagtgt cactgcttgt tatgcatcat ttcaggagtt
     3301 actggatttg ttgttctttc agaaagcttt gaatactaaa ttatagtgta gaaaagaact
     3361 ggaaaccagg aactctggag ttcatcagag ttatggtgcc gaattgtctt tggtgctttt
     3421 cacttgtgtt ttaaaataag gatttttctc ttatttctcc ccctagtttg tgagaaacat
     3481 ctcaataaag tgctttccaa aaaaaaaaaa aagtcgacgc ggccgcgaat tc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH007100. Human elastin gen...[gi:4033460] Links  


LOCUS       HUMEL01                  958 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 19.
ACCESSION   M16983 J02948
VERSION     M16983.1  GI:4033459
KEYWORDS    .
SEGMENT     1 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 958)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 958)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     On Dec 18, 1998 this sequence version replaced gi:479008.
            The numbering of the exons as given in [1] for the complete gene is
            given in the definition line.  The exons given in the Features
            table is as the exons appear in each particular protein. The
            alternate products elastin A, B, and C given in the Features table
            correspond to cDNA clones cHEL4, cHEL3 and cHEL2 respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..958
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     exon            <10..958
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=1
BASE COUNT      144 a    269 c    355 g    190 t
ORIGIN      
        1 ctccccgaga tggcgggtct gacggcggcg gccccgcggc ccggagtcct cctgctcctg
       61 ctgtccatcc tccacccctc tcggcctgga ggggtccctg gggccattcc tggtggagtt
      121 cctggaggag tcttttatcc aggggctggt ctcggagccc ttggaggagg agcgctgggg
      181 cctggaggca aacctcttaa gccagttccc ggagggcttg cgggtgctgg ccttggggca
      241 gggctcggcg ccttccccgc agttaccttt ccgggggctc tggtgcctgg tggagtggct
      301 gacgctgctg cagcctataa agctgctaag gctggcgctg ggcttggtgg tgtcccagga
      361 gttggtggct taggagtgtc tgcaggtgcg gtggttcctc agcctggagc cggagtgaag
      421 cctgggaaag tgccgggtgt ggggctgcca ggtgtatacc caggtggcgt gctcccagga
      481 gctcggttcc ccggtgtggg ggtgctccct ggagttccca ctggagcagg agttaagccc
      541 aaggctccag gtgtaggtgg agcttttgct ggaatcccag gagttggacc ctttggggga
      601 ccgcaacctg gagtcccact ggggtatccc atcaaggccc ccaagctgcc tggtggctat
      661 ggactgccct acaccacagg gaaactgccc tatggctatg ggcccggagg agtggctggt
      721 gcagcgggca aggctggtta cccaacaggg acaggggttg gcccccaggc agcagcagca
      781 gcggcagcta aagcagcagc aaagttcggt gctggagcag ccggagtcct ccctggtgtt
      841 ggaggggctg gtgttcctgg cgtgcctggg gcaattcctg gaattggagg catcgcaggc
      901 gttgggactc cagctgcagc tgcagctgca gcagcagccg ctaaggcagc caagtatg
//
LOCUS       HUMEL02                  211 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 18.
ACCESSION   M17265 J02948
VERSION     M17265.1  GI:181999
KEYWORDS    .
SEGMENT     2 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 211)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 211)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..211
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=1
     exon            55..201
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=18
     intron          202..>211
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=2
BASE COUNT       29 a     52 c     76 g     54 t
ORIGIN      
        1 ctagcccctc tgaggttccc ataggttagg ggaacaatgc tttttcttcc acaggagctg
       61 ctgcaggctt agtgcctggt gggccaggct ttggcccggg agtagttggt gtcccaggag
      121 ctggcgttcc aggtgttggt gtcccaggag ctgggattcc agttgtccca ggtgctggga
      181 tcccaggtgc tgcggttcca ggtgagctgg g
//
LOCUS       HUMEL03                  118 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 17.
ACCESSION   M17266 J02948
VERSION     M17266.1  GI:182000
KEYWORDS    .
SEGMENT     3 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 118)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..118
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=2
     exon            55..108
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=17
     intron          109..>118
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=3
BASE COUNT       23 a     31 c     34 g     30 t
ORIGIN      
        1 tgctgcctcc aatgctgctg cctgagcatg ttgtgtccct tttggtctct ccaggggttg
       61 tgtcaccaga agcagctgct aaggcagctg caaaggcagc caaatacggt gagtgcta
//
LOCUS       HUMEL04                  229 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 16.
ACCESSION   M17267 J02948
VERSION     M17267.1  GI:182001
KEYWORDS    .
SEGMENT     4 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 229)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 229)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..229
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=3
     exon            55..219
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=16
     intron          220..>229
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=4
BASE COUNT       26 a     63 c     79 g     61 t
ORIGIN      
        1 gcccagcctc tctcactgag gcttcttttc tacttggctc ccttccctct gcaggggcca
       61 ggcccggagt cggagttgga ggcattccta cttacggggt tggagctggg ggctttcccg
      121 gctttggtgt cggagtcgga ggtatccctg gagtcgcagg tgtccctagt gtcggaggtg
      181 ttcccggagt cggaggtgtc ccgggagttg gcatttcccg tgagcctta
//
LOCUS       HUMEL05                  106 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 15.
ACCESSION   M17268 J02948
VERSION     M17268.1  GI:182002
KEYWORDS    .
SEGMENT     5 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 106)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 106)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..106
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=4
     exon            55..96
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=5
     intron          97..>106
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=5
BASE COUNT       20 a     32 c     31 g     23 t
ORIGIN      
        1 gaggagaccc aggcacggct tctgagggtc tctatctttc tcgtttcctt gtagccgaag
       61 ctcaggcagc agctgccgcc aaggctgcca agtacggtaa gtgccc
//
LOCUS       HUMEL06                  151 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, intron M with a possible exon 14.
ACCESSION   M17269 J02948
VERSION     M17269.1  GI:182003
KEYWORDS    .
SEGMENT     6 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 151)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 151)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     Even though none of the mRNAs described in [1] contained the
            sequences in segment 6, this sequence resembles elastin exon 14 in
            other organisms, and could be included in a different variant of
            elastin mRNA by alternative splicing.
            The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.  ORF     +    55  +   141
            elastin precursor, possible exon.
FEATURES             Location/Qualifiers
     source          1..151
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..>151
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=5
BASE COUNT       29 a     39 c     60 g     23 t
ORIGIN      
        1 cccccaaaaa gtgagtactg gaggggcaag gctgaaagtt ctccactccc cgaggtgctg
       61 caggagcagg agtgctgggt gggctagtgc caggtcccca ggcggcagtc ccaggtgtgc
      121 cgggcacggg aggagtgcca ggtgagctgt g
//
LOCUS       HUMEL07                  121 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 13.
ACCESSION   M17270 J02948
VERSION     M17270.1  GI:182004
KEYWORDS    .
SEGMENT     7 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 121)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 121)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..121
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..>121
                     /gene="ELN"
                     /note="elastin A; G00-119-107"
                     /number=5
     intron          <1..54
                     /gene="ELN"
                     /note="elastin B and C; G00-119-107"
                     /number=5
     exon            55..111
                     /gene="ELN"
                     /note="elastin B and C; G00-119-107"
                     /number=13
     intron          112..>121
                     /gene="ELN"
                     /note="elastin B and C; G00-119-107"
                     /number=6
BASE COUNT       26 a     40 c     33 g     22 t
ORIGIN      
        1 agcagggagg ggtgtgagag attactctct caccccttct cttcacacct ccaggagtgg
       61 ggaccccagc agctgcagct gctaaagcag ccgccaaagc cgcccagttt ggtaagtccc
      121 c
//
LOCUS       HUMEL08                  226 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 12.
ACCESSION   M17271 J02948
VERSION     M17271.1  GI:182005
KEYWORDS    .
SEGMENT     8 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 226)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 226)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..226
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=6
     exon            55..216
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=12
     intron          217..>226
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=7
BASE COUNT       19 a     51 c     82 g     74 t
ORIGIN      
        1 tctgtcctct ttgatcaggt cttggttaat gatcagctct tctcaatctt gcagggttag
       61 ttcctggtgt cggcgtggct cctggagttg gcgtggctcc tggtgtcggt gtggctcctg
      121 gagttggctt ggctcctgga gttggcgtgg ctcctggagt tggtgtggct cctggcgttg
      181 gcgtggctcc cggcattggc cctggtggag ttgcaggtga gtttca
//
LOCUS       HUMEL09                  109 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 11.
ACCESSION   M17272 J02948
VERSION     M17272.1  GI:182006
KEYWORDS    .
SEGMENT     9 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 109)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 109)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..109
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=7
     exon            55..99
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=7
     intron          100..>109
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=8
BASE COUNT       18 a     48 c     21 g     22 t
ORIGIN      
        1 agcctccatg ggccccgcct ccatctctaa tccccctctc tctccctccc tcagctgcag
       61 caaaatccgc tgccaaggtg gctgccaaag cccagctccg tgagtgcct
//
LOCUS       HUMEL10                  190 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 10.
ACCESSION   M17273 J02948
VERSION     M17273.1  GI:182007
KEYWORDS    .
SEGMENT     10 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 190)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 190)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..190
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=9
     exon            55..180
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=10
     intron          181..>190
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=9
BASE COUNT       18 a     49 c     69 g     54 t
ORIGIN      
        1 tccttagggg catgctccct gcctgctgtc gccaccactg ccctctgtct gcaggagctg
       61 cagctgggct tggtgctggc atccctggac ttggagttgg tgtcggcgtc cctggacttg
      121 gagttggtgc tggtgttcct ggacttggag ttggtgctgg tgttcctggc ttcggggcag
      181 gtgcagatga 
//
LOCUS       HUMEL11                  109 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 10A.
ACCESSION   M17274 J02948
VERSION     M17274.1  GI:182008
KEYWORDS    .
SEGMENT     11 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 109)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 109)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..109
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..>109
                     /gene="ELN"
                     /note="elastin A and B; G00-119-107"
                     /number=9
     exon            1..99
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=10
     intron          100..>109
                     /gene="ELN"
                     /note="elastin C; G00-119-107"
                     /number=10
BASE COUNT       22 a     38 c     30 g     19 t
ORIGIN      
        1 gtgcagatga gggagttagg cggagcctgt cccctgagct cagggaagga gatccctcct
       61 cctctcagca cctccccagc accccctcat cacccagggg tgcatagta
//
LOCUS       HUMEL12                  103 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 9.
ACCESSION   M17275 J02948
VERSION     M17275.1  GI:182009
KEYWORDS    .
SEGMENT     12 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 103)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..103
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=10
     exon            55..93
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=9
     intron          94..>103
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=11
BASE COUNT       21 a     38 c     22 g     22 t
ORIGIN      
        1 tcccaggcac agagctcggc tcctgaccac tccccaactt ttctttctcc ccagtacctg
       61 gagccctggc tgccgctaaa gcagccaaat atggtgagtg cac
//
LOCUS       HUMEL13                  136 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 8.
ACCESSION   M17276 J02948
VERSION     M17276.1  GI:182010
KEYWORDS    .
SEGMENT     13 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 136)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 136)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..136
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=11
     exon            55..126
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=8
     intron          127..>136
                     /gene="ELN"
                     /number=12
BASE COUNT       23 a     35 c     48 g     30 t
ORIGIN      
        1 agggagaccc atcgttcaga aatggaacac tcattttccc tcctctcccc gcaggagcag
       61 cagtgcctgg ggtccttgga gggctcgggg ctctcggtgg agtaggcatc ccaggcggtg
      121 tggtgggtga gttgat
//
LOCUS       HUMEL14                  124 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 6.
ACCESSION   M17277 J02948
VERSION     M17277.1  GI:182011
KEYWORDS    .
SEGMENT     14 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 124)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 124)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..124
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=12
     exon            55..114
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=6
     intron          115..>124
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=13
BASE COUNT       24 a     43 c     37 g     20 t
ORIGIN      
        1 ggagggaatc taaccagtac agagtgcctc cctgaactcg gtctgtgttc ccaggagccg
       61 gacccgccgc cgccgctgcc gcagccaaag ctgctgccaa agccgcccag tttggtgagc
      121 actg
//
LOCUS       HUMEL15                  139 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 5.
ACCESSION   M17278 J02948
VERSION     M17278.1  GI:182012
KEYWORDS    .
SEGMENT     15 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 139)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 139)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..139
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=13
     exon            55..129
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=5
     intron          130..>139
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=14
BASE COUNT       21 a     34 c     52 g     32 t
ORIGIN      
        1 gcttcagtcc cacctttctg accagcggag tctaatgctc agctgtctcc acaggcctag
       61 tgggagccgc tgggctcgga ggactcggag tcggagggct tggagttcca ggtgttgggg
      121 gccttggagg tgagagttg
//
LOCUS       HUMEL16                  103 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 4.
ACCESSION   M17279 J02948 M20425
VERSION     M17279.1  GI:182013
KEYWORDS    .
SEGMENT     16 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 103)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..103
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=14
     exon            55..93
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=4
     intron          94..>103
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=15
BASE COUNT       21 a     34 c     25 g     22 t      1 others
ORIGIN      
        1 gcctgaccag gtggcattgg cattcctgag ccgtcatgtg cctcatctcc ccaggtatac
       61 ctccagctgc agccgctaaa gcagctaaat acggtgagtn ccc
//
LOCUS       HUMEL17                  118 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 4.
ACCESSION   M17280 J02948 M20426
VERSION     M17280.1  GI:182014
KEYWORDS    .
SEGMENT     17 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 118)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 118)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..118
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..>118
                     /gene="ELN"
                     /note="elastin A; G00-119-107"
                     /number=15
     intron          <1..54
                     /gene="ELN"
                     /note="elastin B and C; G00-119-107"
                     /number=15
     exon            55..108
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=15
     intron          109..>118
                     /gene="ELN"
                     /note="elastin B and C; G00-119-107"
                     /number=16
BASE COUNT       12 a     33 c     43 g     30 t
ORIGIN      
        1 agggcctctt cccgatgggg gtgtcttatc ctgaccccac ctgcctcttc tcaggtgctg
       61 ctggccttgg aggtgtccta gggggtgccg ggcagttccc acttggaggt aggggtgg
//
LOCUS       HUMEL18                  109 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 2 (4A).
ACCESSION   M17281 J02948
VERSION     M17281.1  GI:182015
KEYWORDS    .
SEGMENT     18 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 109)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 109)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..109
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=16
     exon            55..99
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=2
     intron          100..>109
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=17
BASE COUNT       23 a     37 c     24 g     25 t
ORIGIN      
        1 gctggagtca gtttccaccc ctaccaaccc accaacctga aatctctcct gcaggagtgg
       61 cagcaagacc tggcttcgga ttgtctccca ttttcccagg tatgccagg
//
LOCUS       HUMEL19                  352 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene intron A with an Alu repetitive sequence.
ACCESSION   M17283 M22741
VERSION     M17283.1  GI:182016
KEYWORDS    .
SEGMENT     19 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 352)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 352)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     Many Alu repetitive sequences are located in the introns of the
            human elastin gene, especially in the region where exons 2 and 3
            are located in other organisms.  Comparable exons 2 and 3 were not
            found in the human gene.  The exons in these entries are named as
            they are in [Connect. Tissue Res. 16, 197-211 (1987)]. The
            alternate products elastin A, B, and C given in the Features table
            correspond to cDNA clones cHEL4, cHEL3 and cHEL2 respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..352
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     intron          <1..>352
                     /gene="ELN"
                     /note="elastin"
                     /number=17
     repeat_region   1..352
                     /note="Alu repeat"
     repeat_region   1..13
                     /note="Alu 5' direct repeat"
     misc_feature    1
                     /gene="ELN"
                     /note="sequence not numbered in [Connect. Tissue Res. 16,
                     197-211 (1987)]"
     repeat_region   340..352
                     /note="Alu 3' direct repeat"
BASE COUNT      105 a     74 c    103 g     69 t      1 others
ORIGIN      
        1 tagtgagggg gattggctgg gcntggtggc ctcacgcctg taatcccagc actttgggag
       61 gcctaggtgg gtggatcaac ttgaggtcca aggagttcga gacccagtct ggtcaaacat
      121 ggtgaaccct gtctctacta aaaaaaatgg caaaaattag ccaaacgtgg tggacgcctg
      181 taatcccagc tactcgggag gctgaggcgg gagaatcact ggagcctggg aagcggaggt
      241 tgcagtgagc caagatcgca ccactgcact ccagcctggg tgacagagca agaccccatc
      301 tcaaaaaaat aataataaaa taaaatataa aaaattatat agtggggggg at
//
LOCUS       HUMEL20                 1448 bp    DNA     linear   PRI 08-MAR-2002
DEFINITION  Human elastin gene, exon 1.
ACCESSION   M17282 J02948 M20427 M20428
VERSION     M17282.1  GI:182017
KEYWORDS    .
SEGMENT     20 of 20
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1448)
  AUTHORS   Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J.,
            Rosenbloom,J. and Ornstein-Goldstein,N.
  TITLE     Structure of the 3' region of the human elastin gene: great
            abundance of Alu repetitive sequences and few coding sequences
  JOURNAL   Connect. Tissue Res. 16 (3), 197-211 (1987)
  MEDLINE   87274906
   PUBMED   3038460
REFERENCE   2  (bases 1 to 1448)
  AUTHORS   Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N.,
            Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J.
  TITLE     Alternative splicing of human elastin mRNA indicated by sequence
            analysis of cloned genomic and complementary DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5680-5684 (1987)
  MEDLINE   87289668
   PUBMED   3039501
COMMENT     Polyadenylation signals are located at positions 950-955 and
            1182-1187.
            The alternate products elastin A, B, and C given in the Features
            table correspond to cDNA clones cHEL4, cHEL3 and cHEL2
            respectively.
            Entry awaiting author review 18-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..1448
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7cen-q21.1"
     gene            join(M16983.1:10..958,M17265.1:1..211,M17266.1:1..118,
                     M17267.1:1..229,M17268.1:1..106,M17269.1:1..151,
                     M17270.1:1..121,M17271.1:1..226,M17272.1:1..109,
                     M17273.1:1..190,M17274.1:1..109,M17275.1:1..103,
                     M17276.1:1..136,M17277.1:1..124,M17278.1:1..139,
                     M17279.1:1..103,M17280.1:1..118,M17281.1:1..109,1..1285)
                     /gene="ELN"
     CDS             join(M16983.1:10..958,M17265.1:55..201,M17266.1:55..108,
                     M17267.1:55..219,M17268.1:55..96,M17270.1:55..111,
                     M17271.1:55..216,M17272.1:55..99,M17273.1:55..180,
                     M17274.1:1..99,M17275.1:55..93,M17276.1:55..126,
                     M17277.1:55..114,M17278.1:55..129,M17279.1:55..93,
                     M17280.1:55..108,M17281.1:55..99,55..98)
                     /gene="ELN"
                     /note="elastin C precursor"
                     /codon_start=1
                     /product="elastin"
                     /protein_id="AAC98395.1"
                     /db_xref="GI:182021"
                     /db_xref="GDB:G00-119-107"
                     /translation="MAGLTAAAPRPGVLLLLLSILHPSRPGGVPGAIPGGVPGGVFYP
                     GAGLGALGGGALGPGGKPLKPVPGGLAGAGLGAGLGAFPAVTFPGALVPGGVADAAAA
                     YKAAKAGAGLGGVPGVGGLGVSAGAVVPQPGAGVKPGKVPGVGLPGVYPGGVLPGARF
                     PGVGVLPGVPTGAGVKPKAPGVGGAFAGIPGVGPFGGPQPGVPLGYPIKAPKLPGGYG
                     LPYTTGKLPYGYGPGGVAGAAGKAGYPTGTGVGPQAAAAAAAKAAAKFGAGAAGVLPG
                     VGGAGVPGVPGAIPGIGGIAGVGTPAAAAAAAAAAKAAKYGAAAGLVPGGPGFGPGVV
                     GVPGAGVPGVGVPGAGIPVVPGAGIPGAAVPGVVSPEAAAKAAAKAAKYGARPGVGVG
                     GIPTYGVGAGGFPGFGVGVGGIPGVAGVPSVGGVPGVGGVPGVGISPEAQAAAAAKAA
                     KYGVGTPAAAAAKAAAKAAQFGLVPGVGVAPGVGVAPGVGVAPGVGLAPGVGVAPGVG
                     VAPGVGVAPGIGPGGVAAAAKSAAKVAAKAQLRAAAGLGAGIPGLGVGVGVPGLGVGA
                     GVPGLGVGAGVPGFGAGADEGVRRSLSPELREGDPSSSQHLPSTPSSPRVPGALAAAK
                     AAKYGAAVPGVLGGLGALGGVGIPGGVVGAGPAAAAAAAKAAAKAAQFGLVGAAGLGG
                     LGVGGLGVPGVGGLGGIPPAAAAKAAKYGAAGLGGVLGGAGQFPLGGVAARPGFGLSP
                     IFPGGACLGKACGRKRK"
     CDS             join(M16983.1:10..958,M17265.1:55..201,M17266.1:55..108,
                     M17267.1:55..219,M17268.1:55..96,M17270.1:55..111,
                     M17271.1:55..216,M17272.1:55..99,M17273.1:55..180,
                     M17275.1:55..93,M17276.1:55..126,M17277.1:55..114,
                     M17278.1:55..129,M17279.1:55..93,M17280.1:55..108,
                     M17281.1:55..99,55..98)
                     /gene="ELN"
                     /note="elastin B precursor"
                     /codon_start=1
                     /product="elastin"
                     /protein_id="AAC98394.1"
                     /db_xref="GI:182020"
                     /db_xref="GDB:G00-119-107"
                     /translation="MAGLTAAAPRPGVLLLLLSILHPSRPGGVPGAIPGGVPGGVFYP
                     GAGLGALGGGALGPGGKPLKPVPGGLAGAGLGAGLGAFPAVTFPGALVPGGVADAAAA
                     YKAAKAGAGLGGVPGVGGLGVSAGAVVPQPGAGVKPGKVPGVGLPGVYPGGVLPGARF
                     PGVGVLPGVPTGAGVKPKAPGVGGAFAGIPGVGPFGGPQPGVPLGYPIKAPKLPGGYG
                     LPYTTGKLPYGYGPGGVAGAAGKAGYPTGTGVGPQAAAAAAAKAAAKFGAGAAGVLPG
                     VGGAGVPGVPGAIPGIGGIAGVGTPAAAAAAAAAAKAAKYGAAAGLVPGGPGFGPGVV
                     GVPGAGVPGVGVPGAGIPVVPGAGIPGAAVPGVVSPEAAAKAAAKAAKYGARPGVGVG
                     GIPTYGVGAGGFPGFGVGVGGIPGVAGVPSVGGVPGVGGVPGVGISPEAQAAAAAKAA
                     KYGVGTPAAAAAKAAAKAAQFGLVPGVGVAPGVGVAPGVGVAPGVGLAPGVGVAPGVG
                     VAPGVGVAPGIGPGGVAAAAKSAAKVAAKAQLRAAAGLGAGIPGLGVGVGVPGLGVGA
                     GVPGLGVGAGVPGFGAVPGALAAAKAAKYGAAVPGVLGGLGALGGVGIPGGVVGAGPA
                     AAAAAAKAAAKAAQFGLVGAAGLGGLGVGGLGVPGVGGLGGIPPAAAAKAAKYGAAGL
                     GGVLGGAGQFPLGGVAARPGFGLSPIFPGGACLGKACGRKRK"
     CDS             join(M16983.1:10..958,M17265.1:55..201,M17266.1:55..108,
                     M17267.1:55..219,M17268.1:55..96,M17271.1:55..216,
                     M17272.1:55..99,M17273.1:55..180,M17275.1:55..93,
                     M17276.1:55..126,M17277.1:55..114,M17278.1:55..129,
                     M17279.1:55..93,M17281.1:55..99,55..98)
                     /gene="ELN"
                     /note="elastin A precursor"
                     /codon_start=1
                     /product="elastin"
                     /protein_id="AAC98393.1"
                     /db_xref="GI:182019"
                     /db_xref="GDB:G00-119-107"
                     /translation="MAGLTAAAPRPGVLLLLLSILHPSRPGGVPGAIPGGVPGGVFYP
                     GAGLGALGGGALGPGGKPLKPVPGGLAGAGLGAGLGAFPAVTFPGALVPGGVADAAAA
                     YKAAKAGAGLGGVPGVGGLGVSAGAVVPQPGAGVKPGKVPGVGLPGVYPGGVLPGARF
                     PGVGVLPGVPTGAGVKPKAPGVGGAFAGIPGVGPFGGPQPGVPLGYPIKAPKLPGGYG
                     LPYTTGKLPYGYGPGGVAGAAGKAGYPTGTGVGPQAAAAAAAKAAAKFGAGAAGVLPG
                     VGGAGVPGVPGAIPGIGGIAGVGTPAAAAAAAAAAKAAKYGAAAGLVPGGPGFGPGVV
                     GVPGAGVPGVGVPGAGIPVVPGAGIPGAAVPGVVSPEAAAKAAAKAAKYGARPGVGVG
                     GIPTYGVGAGGFPGFGVGVGGIPGVAGVPSVGGVPGVGGVPGVGISPEAQAAAAAKAA
                     KYGLVPGVGVAPGVGVAPGVGVAPGVGLAPGVGVAPGVGVAPGVGVAPGIGPGGVAAA
                     AKSAAKVAAKAQLRAAAGLGAGIPGLGVGVGVPGLGVGAGVPGLGVGAGVPGFGAVPG
                     ALAAAKAAKYGAAVPGVLGGLGALGGVGIPGGVVGAGPAAAAAAAKAAAKAAQFGLVG
                     AAGLGGLGVGGLGVPGVGGLGGIPPAAAAKAAKYGVAARPGFGLSPIFPGGACLGKAC
                     GRKRK"
     intron          <1..54
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=17
     exon            55..>98
                     /gene="ELN"
                     /note="G00-119-107"
                     /number=1
     misc_feature    55
                     /gene="ELN"
                     /note="numbered 2318 in [2]"
     polyA_signal    1049..1054
                     /gene="ELN"
                     /note="G00-119-107"
     polyA_signal    1280..1285
                     /gene="ELN"
                     /note="G00-119-107"
BASE COUNT      298 a    442 c    338 g    350 t     20 others
ORIGIN      
        1 agccgaaact gagaggggcc ggactcacag tgatgtgcac ctcctcccgt ccaggtgggg
       61 cctgcctggg gaaagcttgt ggccggaaga gaaaatgagc ttcctaggac ccctgactca
      121 cgacctcatc aacgttggtg ctactgcttg gtggagaatg taaacccttt gtatccccat
      181 cccatgcccc tccgattccc caccccagga gggaacgggc aggccgggcg gcttgcagat
      241 ccacagggca aggaaacaag aggggagcgg ccaagtgccc cgaccaggag gccccctact
      301 tcagaggcaa gggccatgtg gtcctggccc cccaacccca tcccttccca cctaggagct
      361 ccccctccac acagcctcca tctccagggg aacttggtgc tacacgctgg tgctcttatc
      421 ttcctggggg gagggaggag ggaagggtgg cccctcgggg aaccccctac ctggggctcc
      481 tctaaagatg gtgcagacac ttcctgggca gtcccagctc cccctgccca ccaggaccca
      541 ccgttggctg ccatccagtt ggtacccaag cacctgaagc ctcaaagctg gattcgctct
      601 agcatccctc ctctcctggg tccacttggc cgtctcctcc ccaccgatcg ctgttcccca
      661 catctggggc gcttttgggt tggaaaacca ccccacactg ggaatagcca ccttgcccct
      721 tgtaagaatc catccgccca tccgtccatt catccatcgg tccgtccatc catgtcccag
      781 ttgaccgccg gcaccattag ctggctgggt gcacccacca tcaacctggt tgacctgtca
      841 tggccgcctg tgccctncct nanccccatc ctacantccc ccagggcgtg cggggctgtg
      901 cagactgggg tgccaggcat ctcnnnccca cccggggtnt ccccanatgc agtactgtat
      961 annccccatc cctccctcgg tccactgaac ttcagagcag ttcccattcc tgccccgncc
     1021 atctttttgt gtctcgctgt gatagatcaa taaatatttt attttttgtc ctggatattt
     1081 ggggattatt tttgattgtt gatattctct tttggtttta ttgttgtggt tcattgaaaa
     1141 aaaaaagata attttttttt ctgatccggg gagctgtatc cccagtagaa aaaaaatttt
     1201 aatcactcta atatacctct ggatgannca nacctttttt tttattaaga aaagagattt
     1261 aactgcttca gaaatgacta ataaatgaaa accctttaaa ggaaactgtg tcttngcttc
     1321 cttggtatga tttaatctgc cttcaactgt tggcctggnt ggggnnangg gctctgcttc
     1381 agggaacctc caccacccaa attgtatttg agaggttgcc caaccaaaag cccctgctgc
     1441 ctggcttc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X14046. Human mRNA for le...[gi:29793] Links  


LOCUS       HSCD37                  1125 bp    mRNA    linear   PRI 12-SEP-1993
DEFINITION  Human mRNA for leukocyte antigen CD37.
ACCESSION   X14046
VERSION     X14046.1  GI:29793
KEYWORDS    antigen; CD37 antigen; cell surface glycoprotein; transmembrane
            protein.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Classon,B.J., Williams,A.F., Willis,A.C., Seed,B. and
            Stamenkovic,I.
  TITLE     The primary structure of the human leukocyte antigen CD37, a
            species homologue of the rat MRC OX-44 antigen
  JOURNAL   J. Exp. Med. 169 (4), 1497-1502 (1989)
  MEDLINE   89176904
  REMARK    revised by [3]
REFERENCE   2
  AUTHORS   Classon,B.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-JAN-1989) Classon B.J., MRC Cellular Immunology Unit,
            Sir William Dunn, School of Pathology, University of Oxford,
            Oxford, OX1 3RE, England
  REMARK    revised by [3]
REFERENCE   3  (bases 1 to 1125)
  AUTHORS   Classon,B.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-MAY-1990)
COMMENT     The human leukocyte antigen CD37 is a species homolog of the rat
            MRC OX-44 antigen.
            Data kindly reviewed (23-Jun-1989) by Classon B.J.
FEATURES             Location/Qualifiers
     source          1..1125
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="CD37.1"
                     /cell_line="CLL leukemia"
     CDS             64..909
                     /note="CD37 (AA 1-244)"
                     /codon_start=1
                     /protein_id="CAA32204.1"
                     /db_xref="GI:29794"
                     /db_xref="SWISS-PROT:P11049"
                     /translation="MSAQESCLSLIKYFLFVFNLFFFVLGSLIFCFGIWILIDKTSFV
                     SFVGLAFVPLQIWSKVLAISGIFTMGIALLGCVGALKELRCLLGLYFGMLLLLFATQI
                     TLGILISTQRAQLERSLRDVVEKTIQKYGTNPEETAAEESWDYVQFQLRCCGWHYPQD
                     WFQVLILRGNGSEAHRVPCSCYNLSATNDSTILDKVILPQLSRLGHLARSRHSADICA
                     VPAESHIYREGCAQGLQKWLHNNLISIVGICLGVGLLELGFMTLSIFLCRNLDHVYNR
                     LARYR"
     old_sequence    747..749
                     /note="ggg was gg in [1]"
                     /citation=[2]
BASE COUNT      191 a    388 c    291 g    255 t
ORIGIN      
        1 gtctccccca ctgtcagcac ctcttctgtg tggtgagtgg accgcttacc ccactaggtg
       61 aagatgtcag cccaggagag ctgcctcagc ctcatcaagt acttcctctt cgttttcaac
      121 ctcttcttct tcgtcctcgg cagcctgatc ttctgcttcg gcatctggat cctcatcgac
      181 aagaccagct tcgtgtcctt tgtgggcttg gccttcgtgc ctctgcagat ctggtccaaa
      241 gtcctggcca tctcaggaat cttcaccatg ggcatcgccc tcctgggttg tgtgggggcc
      301 ctcaaggagc tccgctgcct cctgggcctg tattttggga tgctgctgct cctgtttgcc
      361 acacagatca ccctgggaat cctcatctcc actcagcggg cccagctgga gcgaagcttg
      421 cgggacgtcg tagagaaaac catccaaaag tacggcacca accccgagga gaccgcggcc
      481 gaggagagct gggactatgt gcagttccag ctgcgctgct gcggctggca ctacccgcag
      541 gactggttcc aagtcctcat cctgagaggt aacgggtcgg aggcgcaccg cgtgccctgc
      601 tcctgctaca acttgtcggc gaccaacgac tccacaatcc tagataaggt gatcttgccc
      661 cagctcagca ggcttggaca cctggcgcgg tccagacaca gtgcagacat ctgcgctgtc
      721 cctgcagaga gccacatcta ccgcgagggc tgcgcgcagg gcctccagaa gtggctgcac
      781 aacaacctta tttccatagt gggcatttgc ctgggcgtcg gcctactcga gctcgggttc
      841 atgacgctct cgatattcct gtgcagaaac ctggaccacg tctacaaccg gctcgctcga
      901 taccgttagg ccccgccctc cccaaagtcc cgccccgccc ccgtcacgtg cgctgggcac
      961 ttccctgctg cctgtaaata tttgtttaat ccccagttcg cctggagccc tccgccttca
     1021 cattcccctg gggacccacg tggctgcgtg cccctgctgc tgtcacctct cccacgggac
     1081 ctggggcttt cgtccacagc ttcctgtccc catctgtcgg cctac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF096289. Homo sapiens unco...[gi:4457111] Links  


LOCUS       AF096289                8177 bp    DNA     linear   PRI 22-MAR-1999
DEFINITION  Homo sapiens uncoupling protein-2 (UCP2) gene, nuclear gene
            encoding mitochondrial protein, complete cds.
ACCESSION   AF096289
VERSION     AF096289.1  GI:4457111
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 8177)
  AUTHORS   Pecqueur,C., Cassard-Doulcier,A.M., Raimbault,S., Miroux,B.,
            Fleury,C., Gelly,C., Bouillaud,F. and Ricquier,D.
  TITLE     Functional organization of the human uncoupling protein-2 gene, and
            juxtaposition to the uncoupling protein-3 gene
  JOURNAL   Biochem. Biophys. Res. Commun. 255 (1), 40-46 (1999)
  MEDLINE   99185293
   PUBMED   10082652
REFERENCE   2  (bases 1 to 8177)
  AUTHORS   Pecqueur,C., Raimbault,S., Bouillaud,F. and Ricquer,D.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-OCT-1998) CEREMOD, CNRS, 9 rue jules hetzel, Meudon
            92190, France
FEATURES             Location/Qualifiers
     source          1..8177
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="11"
                     /map="11q13"
                     /tissue_type="placenta"
                     /clone_lib="Clontech HL1067j"
     gene            1..8177
                     /gene="UCP2"
     mRNA            join(1..124,1211..1367,4371..4595,4752..4962,5831..6025,
                     6106..6207,7177..7357,7727..8177)
                     /gene="UCP2"
                     /product="uncoupling protein-2"
     repeat_region   2999..3270
                     /rpt_family="AluJo"
                     /rpt_type=dispersed
     repeat_region   3273..3299
                     /rpt_type=tandem
     repeat_region   3300..3583
                     /rpt_family="AluSx"
                     /rpt_type=dispersed
     CDS             join(4470..4595,4752..4962,5831..6025,6106..6207,
                     7177..7357,7727..7841)
                     /gene="UCP2"
                     /note="mitochondrial carrier protein; transporter of the
                     inner mitochondrial membrane; member of the uncoupling
                     protein family"
                     /codon_start=1
                     /product="uncoupling protein-2"
                     /protein_id="AAD21151.1"
                     /db_xref="GI:4457112"
                     /translation="MVGFKATDVPPTATVKFLGAGTAACIADLITFPLDTAKVRLQIQ
                     GESQGPVRATASAQYRGVMGTILTMVRTEGPRSLYNGLVAGLQRQMSFASVRIGLYDS
                     VKQFYTKGSEHASIGSRLLAGSTTGALAVAVAQPTDVVKVRFQAQARAGGGRRYQSTV
                     NAYKTIAREEGFRGLWKGTSPNVARNAIVNCAELVTYDLIKDALLKANLMTDDLPCHF
                     TSAFGAGFCTTVIASPVDVVKTRYMNSALGQYSSAGHCALTMLQKEGPRAFYKGFMPS
                     FLRLGSWNVVMFVTYEQLKRALMAACTSREAPF"
     repeat_region   6466..6499
                     /rpt_type=tandem
     repeat_region   6526..7068
                     /rpt_family="AluSx"
                     /rpt_type=dispersed
     misc_feature    7899..7996
                     /gene="UCP2"
                     /note="low complexity sequence"
     polyA_signal    8157..8161
                     /gene="UCP2"
BASE COUNT     1708 a   2261 c   2020 g   2188 t
ORIGIN      
        1 cactgcgaag cccagctgcg cgcgccttgg gattgactgt ccacgctcgc ccggctcgtc
       61 cgacgcgccc tccgccagcc gacagacaca gccgcacgca ctgccgtgtt ctccctgcgg
      121 ctcggtgagc ctggccccag ccctgcgccc tttgcgcccc ccacgcttgt tctgcgtgcg
      181 ctgcccgctc ttccatttac cttctctccc acccaagttt gtactctttt ctttctctcg
      241 gttttatttt ttgtttttgt ttgtttgttt gagacaggct ttcgctctgt ctcccaggct
      301 ggagtgcagt ggcgcgatct cggctcactg cagcctccac ctcccaggtt caagcgatcc
      361 gcctgccgag tagctgggat tacaggcgcc cgccaccacg cctggctaat ttttgtgttt
      421 tgtagagatg gggtttcgcc atgttggcca ggctggcctc gaactgctga gctcaagcaa
      481 tccgcccgcc tcggcctcac aaagtcgtag aattttaggc atgagcctcc gggtccggcc
      541 tgtgctaatc ctttctgtcc ttggttcttt atttctcttc tctctttttc ttagtccctt
      601 ttgttctttc cctctcccgt tcaggggctg tcgtttgagc ctccaccttt tcactccctc
      661 ctttccacca cgatgccgag ccctgccttg gatggggacc atcagcgatg accacaatga
      721 ctctccctta ccaggcagct ccaggcagtg ttcctgcacc gcctttccca gggcttgggg
      781 gctttttcta gtgggctttg agctgctcaa tctggcctct gcagggccgg ctcccagccc
      841 ttccaacctc ctcacagccc gacctgggac ctagccaatt cccggagagt ctctgtccca
      901 tcgtgacccc ctcacaactc tcccactcac caaagtctga tgactgtgct agggggtgct
      961 tatatagagt actgagtgtt acaaaagcag aagtctggat gagaaccaat ttgtgatatt
     1021 aagcaggtgg ggtgggggtg gggagtgtac ctaggttcat tttccgccct gcttttcccc
     1081 tttccagtgt gtgcacttaa ccagtccctg ggccctgttc cccatccccc tccaaggcat
     1141 ggattgggtg ggcttgtgtg tcttggggca ggtggccctt tctaaactct ctgcctttgc
     1201 tcacccacag gacacatagt atgaccatta ggtgtttcgt ctcccaccca ttttctatgg
     1261 aaaaccaagg ggatcgggcc atgatagcca ctggcagctt tgaagaacgg gacaccttta
     1321 gagaagcttg atcttggagg cctcaccgtg agaccttaca aagccgggta agagtccagt
     1381 ccaaggaaga ggtctcttgc tgcctcctaa ccctgtggtc taggggcagg agtcagcagg
     1441 gcattaacaa aaataattac catccccacc cccgacagtg aagtggctct ttccagttca
     1501 cagagcactc tcacacctcc ccgctctcat tctggccctt cagctgactc ggacaagcca
     1561 aggatcttgg tccccatttt ataaaggaga aaactgaggc ccacgtgtaa cagtgattgg
     1621 ccccaagtca tcccgggagc cagcagaaga gctaggacag gaacctattg ttctaacttc
     1681 atattgatgc tagcttttga ctatccctga aaccgagatt ggtaatcagc ccggctctga
     1741 aactggttat ttgctgggga ctgtaaaata ggattaacta tttctagtcc tgcattttaa
     1801 ttgctgttag tagggccatc ttacccaccc tctgaaggac ctgacttggc aagcccaagg
     1861 caacattcag aatatggcag ctgaacctct gtgcacttgt ctttgggcag cagctgggtc
     1921 ttattcttct ctggccttca caacatcctg caacccagct caaggtcagg aatgtgacag
     1981 actcatgtca tcatatctct gatgcccaga gaagggatac catttgcctg agccttctca
     2041 gtactgttta atcagcctgt gagaactttc cttgtgaaag gccctgtctg tgcctggggc
     2101 tgataaaaca gcaagaacga actgaggagc tgggcagcag tgcaaagcaa atactaccag
     2161 ctttggtgcc tgtaagtgtg gctcttactc atctcacatg gaaataaggg cagccacctt
     2221 gcagggctgc tctgaggatt gagctaatac agtgccctgg gcgttggggt ggggaaagtt
     2281 gtggagcacc tcctggggga agggggtgtc agagcaggga atctggggag tccgagggca
     2341 ccttcatcaa cccaatctgt catttgagca ccagtcttca ctgagcctcg tgggcaagct
     2401 ggagggaaac aggaataagg tcaggccctg ttctataggt cccagtgtag ttgctatggt
     2461 gagtatcttc atttccctgc ttgccccagc cacctggagt gagaagccca agaggaagct
     2521 gggtgagctg tttgtttcca tgggtctctg tgttcacagc tgactccctt caccagccag
     2581 ccctttcacc tgagccccag caacaaaggc agtcaggcgg ggctcaaagc agctgctcca
     2641 atgaagtcaa agaaataagc tcaggggaag aagcaggtca ccctccccca ctagggtgct
     2701 gggctcactt cctcctgggg cagtggagga gggtgtggtt ccaactcaga acaaaatggg
     2761 gcttttggtt tactttatca ctcttcacag ctctgacctg gacccctcat gccctgcctg
     2821 tcttgtggtg taagtgcgga tccccctaag ttggaggaaa ggaaactggc ccaaacaaaa
     2881 aggagagcag ttttctctgc atcacatggt aggccaggag gagtctaatg ccccagagtt
     2941 tactctcagc ccccaaaatc acctagctaa atgttacctt atctaagaag tccttaggtt
     3001 ttttggggtt tgtttttttt ttttttgaga caaggtctca ctctctcacc cagactggag
     3061 cacagtggca caatcacagc tcactgcagc ctcaacctcc tgggctcaag caatcgtccc
     3121 aagtagctgg gactataggc ctgcaccacc atgtccagct aatttatttt tatttatatt
     3181 ttttagacag ggtctcatta tgttgccctg gctggtcttg aactcctggg ttcaagcagt
     3241 cctcccacct ctgcctccca aagtgctagg tttttttttg tttgtttgtt tgtttgtttt
     3301 ttgaaacaga gtcttgctct gtcgcctagg ctggagtgca gtggcacgat ctcagctact
     3361 gcaacctcca cctcctgggt tcaagtgatt ctcctgcctc agcctcctaa gtagttggga
     3421 atacaggcgt gtgccaacac acccagctca tttttgtatt tttagcggag atggggtttt
     3481 gccatgttgg ccaagctggt ctcaaactcc tgacctcagg tgattcgccc gcctcagcct
     3541 cccaaagtgc tgggtttaca ggcgtgagcc accacaccca gcccaagaag tcttttctga
     3601 tcacccactc ttccttctct cccaatggca ttagttgttc cctcctttgc attttgagag
     3661 tatgtcctgt aagccccaaa tgcagcttga atcatctgcc catccacccc ctgtgcccaa
     3721 cagtaagcct cctctagagt agatactatc tcctgcatct cagtgaacca ctgcccagca
     3781 aagcagtctt gctaaaacaa tgactctaga gatcctaagc tgtgtgagag ctggaggaga
     3841 gaattagact gatggtctgg gaagggattg aattagtcat cttgtacctt ttcttcttga
     3901 cttaagttcc agacctgtag caaccattcc tgcttagaca tccagaacat aagcctatgg
     3961 gtctgtgcct gttgggtctt agtctgggtg aaacttttct ctacttctgt cagctctcca
     4021 gatgaaccac agaagcagga atgtgggcat catcagtgaa atctctgcat acagcagaca
     4081 aagggctggt ccagtggctg tttatgaggc agcgctagga gagctctgat ccagactctc
     4141 cctgcagtga aagggaggga gcccttcatg aagtattgac tgcttgagca ggaattgctt
     4201 caccagcacc taactgagtg cctctcgagc tcacatcggt tttccctcat gaggccactt
     4261 ggagtcttgc tgagggactt ggttctatta gggaaggtga gtttggggat ggtgagcagg
     4321 gagggcctgg ggacattgtg gctaatgggg cttttctcct cttggcttag attccggcag
     4381 agttcctcta tctcgtcttg ttgctgatta aaggtgcccc tgtctccagt ttttctccat
     4441 ctcctgggac gtagcaggaa atcagcatca tggttgggtt caaggccaca gatgtgcccc
     4501 ctactgccac tgtgaagttt cttggggctg gcacagctgc ctgcatcgca gatctcatca
     4561 cctttcctct ggatactgct aaagtccggt tacaggtgag gggatgaagc ctgggagtct
     4621 tgatggtgtc tactctgttc cctccccaaa gacacagacc cctcaagggc cagtgtttgg
     4681 agcatcgaga tgactggagg tgggaagggc aacatgctta tccctgtagc taccctgtct
     4741 tggccttgca gatccaagga gaaagtcagg ggccagtgcg cgctacagcc agcgcccagt
     4801 accgcggtgt gatgggcacc attctgacca tggtgcgtac tgagggcccc cgaagcctct
     4861 acaatgggct ggttgccggc ctgcagcgcc aaatgagctt tgcctctgtc cgcatcggcc
     4921 tgtatgattc tgtcaaacag ttctacacca agggctctga gcgtgagtat ggagcaaggg
     4981 tgtaggcccc ttggcccttt tttctcagtg atgattgatc ttagttcatt cagccatata
     5041 gttttttagg ccccacgatc cctaggaaga tcaggggaac agagaactgg aaggggccct
     5101 ggtcctccac atagttccta agcacctggg ctataccagg ctctgagcag ggcgtcatcc
     5161 catcacagtc ttcaacacca ccttgggagt aggtagtatc atcccagtgt tatagaagaa
     5221 gagactgagg tgggaaggca gtgggtagag tggggacttg gccaggggca cacagtagag
     5281 agccagaaaa cacacagtag agagccagga cactcgtctc taaggccagc gttcttccct
     5341 ttcacctcct tagtatgcca tgccaaccct ccattttaca catgacgaaa cagagcccca
     5401 aacaaaaggt tgtctttccc agatcacatg gcaggaagaa gtaaagctga cctgagatcc
     5461 caagtcttag gaatcccagt cctcagaaag ccacttctct ctgagccttg gttttcacat
     5521 ttgtcagatg gaaatgattg tgatttctca gggctgttga gcaggtaaat gaaaatgttt
     5581 tatgaaagaa agcaccaagt ttcattttgg tcttagccct tgctatgtcc ctagcaagaa
     5641 gtagatattc atagggatat tttgtttgat gtgaggagtt cttacagcaa gagcttgtag
     5701 aaggccaaaa gcttctggat tctaatccca aaagcaggag atgacagtga cagggtggtt
     5761 ttggtgagga gagatgaggt agaaaatgag tgcaagcccg ctggccactg accccatggc
     5821 tcgcccacag atgccagcat tgggagccgc ctcctagcag gcagcaccac aggtgccctg
     5881 gctgtggctg tggcccagcc cacggatgtg gtaaaggtcc gattccaagc tcaggcccgg
     5941 gctggaggtg gtcggagata ccaaagcacc gtcaatgcct acaagaccat tgcccgagag
     6001 gaagggttcc ggggcctctg gaaaggtgtg taccagttgt tttcccttcc ccttttcctc
     6061 ctccccgata ctctggtctc acccaggatc ttcctcctcc tacagggacc tctcccaatg
     6121 ttgctcgtaa tgccattgtc aactgtgctg agctggtgac ctatgacctc atcaaggatg
     6181 ccctcctgaa agccaacctc atgacaggtg agtcatgagg tagacggtgc tgggtctcac
     6241 ccttccccca tgccaggagc aggtgcgggg gtctagctga caccagaaga ccacatcttt
     6301 tcatcctatt tgccctttgc agggagagta agatatctct tacttgccat attgaagcca
     6361 attgggatga agctcccact ttgcacattg aggaactgag gctagattgg caaaatgact
     6421 ctttcaggtc ctcagaagat gtctcagctg gagtccctgt ctgtttttgt ttttttgttt
     6481 gtttgttttt tgtttttttt gagatagagt ctcactctgt tacccgtgta atctcagctc
     6541 actgcaacct tctcctcctg ggttcaagcg attcttgtgc ctcagcctcc cgagtagctg
     6601 ggatgacagg tgtgcaccag cacactggct aatttttgta tttttagtag agatggagtt
     6661 tcaccatgtt agccaggctg gtctcgaact cctggcctca agtgatctgc ccaccttggc
     6721 ctcccaatgt gctgggatta caggtgtgag cctctgcgcc ccatcctctt gtttgttttt
     6781 tgagacaggg tcttgctcgg ttgcccaggc tggagtgcag tggggtgatt aatggctcat
     6841 tgcagcctcg acctccctga ctcaagcaat cctcccacct cagcctcctg agtagctggg
     6901 gctgactaca ggcatgcaca ctgtgcctgg ctaatttttg tattttgtag agacagggtt
     6961 tttgccatgt tacccagtct ggtcttgaac tcctgggctc aagtgatcca cccacctcgg
     7021 cctccaaaag aagtcctgga ttacaggcat gagacattgt gcccagcctc tctgtctctt
     7081 taaaatcatg aaaactcgta gctacttaag taattctcct gccttctgga atgatgggtg
     7141 aagatcttga ctgccttgcc tgctcctcct tggcagatga cctcccttgc cacttcactt
     7201 ctgcctttgg ggcaggcttc tgcaccactg tcatcgcctc ccctgtagac gtggtcaaga
     7261 cgagatacat gaactctgcc ctgggccagt acagtagcgc tggccactgt gcccttacca
     7321 tgctccagaa ggaggggccc cgagccttct acaaagggtg agcctctggt cctccccacc
     7381 cagttcaggc ctcttggcta tgcatgtcta ttatgggtgg gagagaacca cctggaagtg
     7441 agtagcagcc aagtgtgact atttctgatc ctggtcgtgg catttcacca gcattcacct
     7501 atccccttaa ttccttcctc ccagaattgc taccatcact gtttattagg tgttaaatgg
     7561 agactcaaag ggaattcatg cttatagcca agcagctgtg agctcagttc attgagtcct
     7621 cccagcctcc tttgggacag agcaactggg ttggattgaa taccaggccc agtgagggaa
     7681 gtgggaggtg gaggtgcccc catgacctgt gatttttctc ctctaggttc atgccctcct
     7741 ttctccgctt gggttcctgg aacgtggtga tgttcgtcac ctatgagcag ctgaaacgag
     7801 ccctcatggc tgcctgcact tcccgagagg ctcccttctg agcctctcct gctgctgacc
     7861 tgatcacctc tggctttgtc tctagccggg ccatgctttc cttttcttcc ttctttctct
     7921 tccctccttc ccttctctcc ttccctcttt ccccacctct tccttccgct cctttaccta
     7981 ccaccttccc tctttctaca ttctcatcta ctcattgtct cagtgctggt ggagttgaca
     8041 tttgacagtg tgggaggcct cgtaccagcc aggatcccaa gcgtcccgtc ccttggaaag
     8101 ttcagccaga atcttcgtcc tgcccccgac agcccagcct agcccacttg tcatccataa
     8161 agcaagctca accttgg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D11086. Human mRNA for in...[gi:303611] Links  


LOCUS       HUMIL2RG                1451 bp    mRNA    linear   PRI 01-FEB-2000
DEFINITION  Human mRNA for interleukin 2 receptor gamma chain.
ACCESSION   D11086
VERSION     D11086.1  GI:303611
KEYWORDS    interleukin 2 receptor; interleukin 2 receptor gamma chain.
SOURCE      Human Lymphocyte T cell, cell line MOLT4, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1451)
  AUTHORS   Takeshita,T., Asao,H., Ohtani,K., Ishii,N., Kumaki,S., Tanaka,N.,
            Munakata,H., Nakamura,M. and Sugamura,K.
  TITLE     Cloning of the gamma chain of the human IL-2 receptor
  JOURNAL   Science 257 (5068), 379-382 (1992)
  MEDLINE   92335883
REFERENCE   2  (bases 1 to 1451)
  AUTHORS   Asao,H.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-MAY-1992) Hironobu Asao, Tohoku Univ. School of
            Medicine, Dept. of Microbiology; 2-1 Seiryo-machi Aoba-ku, Sendai,
            Miyagi 980, Japan (Tel:022-273-9073, Fax:022-272-7273)
COMMENT     On Jul 26, 1993 this sequence version replaced gi:219889.
            Submitted (06-May-1992) to DDBJ by:
            Hironobu Asao               
            Department of Microbiology                             Tohoku
            University                         
            School of Medicine       
            2-1 Seiryo-machi
            Aoba-ku, Sendai 980                     
            Japan   
            Phone:  022-273-9073          
            Fax:    022-272-7273.
FEATURES             Location/Qualifiers
     source          1..1451
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="MOLT4"
                     /cell_type="T cell"
                     /tissue_type="Lymphocyte"
     gene            1..1451
                     /gene="hIL-2Rg"
     CDS             15..1124
                     /gene="hIL-2Rg"
                     /codon_start=1
                     /product="interleukin 2 receptor gamma chain"
                     /protein_id="BAA01857.1"
                     /db_xref="GI:219890"
                     /translation="MLKPSLPFTSLLFLQLPLLGVGLNTTILTPNGNEDTTADFFLTT
                     MPTDSLSVSTLPLPEVQCFVFNVEYMNCTWNSSSEPQPTNLTLHYWYKNSDNDKVQKC
                     SHYLFSEEITSGCQLQKKEIHLYQTFVVQLQDPREPRRQATQMLKLQNLVIPWAPENL
                     TLHKLSESQLELNWNNRFLNHCLEHLVQYRTDWDHSWTEQSVDYRHKFSLPSVDGQKR
                     YTFRVRSRFNPLCGSAQHWSEWSHPIHWGSNTSKENPFLFALEAVVISVGSMGLIISL
                     LCVYFWLERTMPRIPTLKNLEDLVTEYHGNFSAWSGVSKGLAESLQPDYSERLCLVSE
                     IPPKGGALGEGPGASPCNQHSPYWAPPCYTLKPET"
     sig_peptide     15..80
                     /gene="hIL-2Rg"
     mat_peptide     81..1121
                     /gene="hIL-2Rg"
                     /product="interleukin 2 receptor gamma chain"
     polyA_site      1451
                     /gene="hIL-2Rg"
BASE COUNT      347 a    422 c    313 g    369 t
ORIGIN      
        1 gaagagcaag cgccatgttg aagccatcat taccattcac atccctctta ttcctgcagc
       61 tgcccctgct gggagtgggg ctgaacacga caattctgac gcccaatggg aatgaagaca
      121 ccacagctga tttcttcctg accactatgc ccactgactc cctcagtgtt tccactctgc
      181 ccctcccaga ggttcagtgt tttgtgttca atgtcgagta catgaattgc acttggaaca
      241 gcagctctga gccccagcct accaacctca ctctgcatta ttggtacaag aactcggata
      301 atgataaagt ccagaagtgc agccactatc tattctctga agaaatcact tctggctgtc
      361 agttgcaaaa aaaggagatc cacctctacc aaacatttgt tgttcagctc caggacccac
      421 gggaacccag gagacaggcc acacagatgc taaaactgca gaatctggtg atcccctggg
      481 ctccagagaa cctaacactt cacaaactga gtgaatccca gctagaactg aactggaaca
      541 acagattctt gaaccactgt ttggagcact tggtgcagta ccggactgac tgggaccaca
      601 gctggactga acaatcagtg gattatagac ataagttctc cttgcctagt gtggatgggc
      661 agaaacgcta cacgtttcgt gttcggagcc gctttaaccc actctgtgga agtgctcagc
      721 attggagtga atggagccac ccaatccact gggggagcaa tacttcaaaa gagaatcctt
      781 tcctgtttgc attggaagcc gtggttatct ctgttggctc catgggattg attatcagcc
      841 ttctctgtgt gtatttctgg ctggaacgga cgatgccccg aattcccacc ctgaagaacc
      901 tagaggatct tgttactgaa taccacggga acttttcggc ctggagtggt gtgtctaagg
      961 gactggctga gagtctgcag ccagactaca gtgaacgact ctgcctcgtc agtgagattc
     1021 ccccaaaagg aggggccctt ggggaggggc ctggggcctc cccatgcaac cagcatagcc
     1081 cctactgggc ccccccatgt tacaccctaa agcctgaaac ctgaacccca atcctctgac
     1141 agaagaaccc cagggtcctg tagccctaag tggtactaac tttccttcat tcaacccacc
     1201 tgcgtctcat actcacctca ccccactgtg gctgatttgg aattttgtgc ccccatgtaa
     1261 gcaccccttc atttggcatt ccccacttga gaattaccct tttgccccga acatgttttt
     1321 cttctccctc agtctggccc ttccttttcg caggattctt cctccctccc tctttccctc
     1381 ccttcctctt tccatctacc ctccgattgt tcctgaaccg atgagaaata aagtttctgt
     1441 tgataatcat c
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X54941. H.sapiens ckshs1 ...[gi:29976] Links  


LOCUS       HSCKSHS1                 717 bp    mRNA    linear   PRI 30-APR-1992
DEFINITION  H.sapiens ckshs1 mRNA for Cks1 protein homologue.
ACCESSION   X54941 X55505
VERSION     X54941.1  GI:29976
KEYWORDS    Cdc28 protein kinase; Cks1 protein.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 717)
  AUTHORS   Richardson,H.E., Stueland,C.S., Thomas,J., Russell,P. and Reed,S.I.
  TITLE     Human cDNAs encoding homologs of the small p34Cdc28/Cdc2-associated
            protein of Saccharomyces cerevisiae and Schizosaccharomyces pombe
  JOURNAL   Genes Dev. 4 (8), 1332-1344 (1990)
  MEDLINE   91032985
REFERENCE   2  (bases 1 to 717)
  AUTHORS   Richardson,H.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-AUG-1990) Richardson H.E., Biochemistry Dept,
            University of Adelaide, P O Box 498, Adelaide 5001, South Australia
FEATURES             Location/Qualifiers
     source          1..717
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="ckshs1"
                     /cell_line="HeLa"
                     /clone_lib="lambda-ZAPII (strategene)"
     gene            1..717
                     /gene="ckshs1"
     CDS             10..249
                     /gene="ckshs1"
                     /codon_start=1
                     /product="Cks1 protein homologue"
                     /protein_id="CAA38702.1"
                     /db_xref="GI:29977"
                     /db_xref="SWISS-PROT:P33551"
                     /translation="MSHKQIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWR
                     NLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK"
     polyA_signal    696..701
                     /gene="ckshs1"
BASE COUNT      192 a    157 c    155 g    213 t
ORIGIN      
        1 agagcgatca tgtcgcacaa acaaatttac tattcggaca aatacgacga cgaggagttt
       61 gagtatcgac atgtcatgct gcccaaggac atagccaagc tggtccctaa aacccatctg
      121 atgtctgaat ctgaatggag gaatcttggc gttcagcaga gtcagggatg ggtccattat
      181 atgatccatg aaccagaacc tcacatcttg ctgttccggc gcccactacc caagaaacca
      241 aagaaatgaa gctggcaagc tacttttcag cctcaagctt tacacagctg tccttacttc
      301 ctaacatctt tctgataaca ttattatgtt gccttcttgt ttctcacttt gatatttaaa
      361 agatgttcaa tacactgttt gaatgtgctg gtaactgctt tgcttcttga gtagagccac
      421 caccaccata gcccagccag atgagtgctc tgtggaccca cagcctaagc tgagtgtgac
      481 cccagaagcc acgatgtgct ctgtatccag aacacacttg gcagatggag gaagcatctg
      541 agtttgagac catggctgtt acagggatca tgtaaacttg ctgtttttgt tttttctgcc
      601 gggtgttgta tgtgtggtga cttgcggatt tatgtttcag tgtactggaa actttccatt
      661 ttattcaaga aatctgttca tgttaaaagc cttgattaaa gaggaagttt ttataat
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AL136794. Homo sapiens mRNA...[gi:12053100] Links  


LOCUS       HSM801762               3237 bp    mRNA    linear   PRI 20-MAR-2002
DEFINITION  Homo sapiens mRNA; cDNA DKFZp434C011 (from clone DKFZp434C011);
            complete cds.
ACCESSION   AL136794
VERSION     AL136794.1  GI:12053100
KEYWORDS    .
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3237)
  AUTHORS   Poustka,A., Klein,M., Mewes,H.W., Gassenhuber,J. and Wiemann,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-MAR-2002) MIPS, Am Klopferspitz 18a, D-82152
            Martinsried, GERMANY
COMMENT     Clone from S. Wiemann, Molecular Genome Analysis, German Cancer
            Research Center (DKFZ); Email s.wiemann@dkfz-heidelberg.de;
            sequenced by DKFZ (German Cancer Research Center,
            Heidelberg/Germany) within the cDNA sequencing consortium of the
            German Genome Project.
            This clone (DKFZp434C011) is available at the RZPD in Berlin.
            Please contact the RZPD: Ressourcenzentrum, Heubnerweg 6, 14059
            Berlin-Charlottenburg, GERMANY; Email: clone@rzpd.de Further
            information about the clone and the sequencing project is available
            at http://mips.gsf.de/proj/cDNA/.
FEATURES             Location/Qualifiers
     source          1..3237
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="DKFZp434C011"
                     /tissue_type="testis"
                     /clone_lib="434 (synonym: htes3). Vector pSport1; host
                     DH10B; sites NotI + SalI"
                     /dev_stage="adult"
     gene            1..3237
                     /gene="DKFZp434C011"
     CDS             225..2123
                     /gene="DKFZp434C011"
                     /note="similarity to GTPase-activating proteins"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="CAB66728.1"
                     /db_xref="GI:12053101"
                     /db_xref="SPTREMBL:Q9H0H5"
                     /translation="MDTMMLNVRNLFEQLVRRVEILSEGNEVQFIQLAKDFEDFRKKW
                     QRTDHELGKYKDLLMKAETERSALDVKLKHARNQVDVEIKRRQRAEADCEKLERQIQL
                     IREMLMCDTSGSIQLSEEQKSALAFLNRGQPSSSNAGNKRLSTIDESGSILSDISFDK
                     TDESLDWDSSLVKTFKLKKREKRRSTSRQFVDGPPGPVKKTRSIGSAVDQGNESIVAK
                     TTVTVPNDGGPIEAVSTIETVPYWTRSRRKTGTLQPWNSDSTLNSRQLEPRTETDSVG
                     TPQSNGGMRLHDFVSKTVIKPESCVPCGKRIKFGKLSLKCRDCRVVSHPECRDRCPLP
                     CIPTLIGTPVKIGEGMLADFVSQTSPMIPSIVVHCVNEIEQRGLTETGLYRISGCDRT
                     VKELKEKFLRVKTVPLLSKVDDIHAICSLLKDFLRNLKEPLLTFRLNRAFMEAAEITD
                     EDNSIAAMYQAVGELPQANRDTLAFLMIHLQRVAQSPHTKMDVANLAKVFGPTIVAHA
                     VPNPDPVTMLQDIKRQPKVVERLLSLPLEYWSQFMMVEQENIDPLHVIENSNAFSTPQ
                     TPDIKVSLLGPVTTPEHQLLKTPSSSSLSQRVRSTLTKNTPRFGSKSKSATNLGRQGN
                     FFASPMLK"
     polyA_site      3227
                     /gene="DKFZp434C011"
BASE COUNT      938 a    668 c    749 g    882 t
ORIGIN      
        1 gcgaagtgaa gggtggccca ggtggggcca ggctgactga atgtatctcc tagctatgga
       61 ctaaataata catgggggga aataaacaag tattcatgag ggtgaaaatg tgacccagca
      121 ggaaaattac aactattttc aattgacgtt gaataggatg agtcatggaa tttaagtgat
      181 ttactgaaga ttatactact ggtagataga agagctaaag aaagatggat actatgatgc
      241 tgaatgtgcg gaatctgttt gagcagcttg tgcgccgggt ggagattctc agtgaaggaa
      301 atgaagtcca atttatccag ttggcgaagg actttgagga tttccgtaaa aagtggcaga
      361 ggactgacca tgagctgggg aaatacaagg atcttttgat gaaagcagag actgagcgaa
      421 gtgctctgga tgttaagctg aagcatgcac gtaatcaggt ggatgtagag atcaaacgga
      481 gacagagagc tgaggctgac tgcgaaaagc tggaacgaca gattcagctg attcgagaga
      541 tgctcatgtg tgacacatct ggcagcattc aactaagcga ggagcaaaaa tcagctctgg
      601 cttttctcaa cagaggccaa ccatccagca gcaatgctgg gaacaaaaga ctatcaacca
      661 ttgatgaatc tggttccatt ttatcagata tcagctttga caagactgat gaatcactgg
      721 attgggactc ttctttggtg aagactttca aactgaagaa gagagaaaag aggcgctcta
      781 ctagccgaca gtttgttgat ggtccccctg gacctgtaaa gaaaactcgt tccattggct
      841 ctgcagtaga ccaggggaat gaatccatag ttgcaaaaac tacagtgact gttcccaatg
      901 atggcgggcc catcgaagct gtgtccacta ttgagactgt gccatattgg accaggagcc
      961 gaaggaaaac aggtacttta caaccttgga acagtgactc caccctgaac agcaggcagc
     1021 tggagccaag aactgagaca gacagtgtgg gcacgccaca gagtaatgga gggatgcgcc
     1081 tgcatgactt tgtttctaag acggttatta aacctgaatc ctgtgttcca tgtggaaagc
     1141 ggataaaatt tggcaaatta tctctgaagt gtcgagactg tcgtgtggtc tctcatccag
     1201 aatgtcggga ccgctgtccc cttccctgca ttcctaccct gataggaaca cctgtcaaga
     1261 ttggagaggg aatgctggca gactttgtgt cccagacttc tccaatgatc ccctccattg
     1321 ttgtgcattg tgtaaatgag attgagcaaa gaggtctgac tgagacaggc ctgtatagga
     1381 tctctggctg tgaccgcaca gtaaaagagc tgaaagagaa attcctcaga gtgaaaactg
     1441 tacccctcct cagcaaagtg gatgatatcc atgctatctg tagccttcta aaagactttc
     1501 ttcgaaacct caaagaacct cttctgacct ttcgccttaa cagagccttt atggaagcag
     1561 cagaaatcac agatgaagac aacagcatag ctgccatgta ccaagctgtt ggtgaactgc
     1621 cccaggccaa cagggacaca ttagctttcc tcatgattca cttgcagaga gtggctcaga
     1681 gtccacatac taaaatggat gttgccaatc tggctaaagt ctttggccct acaatagtgg
     1741 cccatgctgt gcccaatcca gacccagtga caatgttaca ggacatcaag cgtcaaccca
     1801 aggtggttga gcgcctgctt tccttgcctc tggagtattg gagtcagttc atgatggtgg
     1861 agcaagagaa cattgacccc ctacatgtca ttgaaaactc aaatgccttt tcaacaccac
     1921 agacaccaga tattaaagtg agtttactgg gacctgtgac cactcctgaa catcagcttc
     1981 tcaagactcc ttcatctagt tccctgtcac agagagtccg ttccaccctc accaagaaca
     2041 ctcctagatt tgggagcaaa agcaagtctg ccactaacct aggacgacaa ggcaactttt
     2101 ttgcttctcc aatgctcaag tgaagtcaca tctgcctgtt acttcccagc attgactgac
     2161 tataagaaag gacacatctg tactctgctc tgcagcctcc tgtactcatt actactttta
     2221 gcattctcca ggcttttact caagtttaat tgtgcatgag ggttttatta aaactatata
     2281 tatctcccct tccttctcct caagtcacat aatatcagca ctttgtgctg gtcattgttg
     2341 ggagctttta gatgagacat ctttccaggg gtagaagggt tagtatggaa ttggttgtga
     2401 ttctttttgg ggaagggggt tattgttcct ttggcttaaa gccaaatgct gctcatagaa
     2461 tgatctttct ctagtttcat ttagaactga tttccgtgag acaatgacag aaaccctacc
     2521 tatctgataa gattagcttg tctcagggtg ggaagtggga gggcagggca aagaaaggat
     2581 tagaccagag gatttaggat gcctccttct aagaaccaga agttctcatt ccccattatg
     2641 aactgagcta taatatggag ctttcataaa aatgggatgc attgaggaca gaactagtga
     2701 tgggagtatg cgtagctttg atttggatga ttaggtcttt aatagtgttg agtggcacaa
     2761 ccttgtaaat gtgaaagtac aactcgtatt tatctctgat gtgccgctgg ctgaactttg
     2821 ggttcatttg gggtcaaagc cagtttttct tttaaaattg aattcattct gatgcttggc
     2881 ccccataccc ccaaccttgt ccagtggagc ccaacttcta aaggtcaata tatcatcctt
     2941 tggcatccca actaacaata aagagtaggc tataagggaa gattgtcaat attttgtggt
     3001 aagaaaagct acagtcattt tttctttgca ctttggatgc tgaaattttt cccatggaac
     3061 atagccacat ctagatagat gtgagctttt tcttctgtta aaattattct taatgtctgt
     3121 aaaaacgatt ttcttctgta gaatgtttga cttcgtattg acccttatct gtaaaacacc
     3181 tatttgggat aatatttgga aaaaaagtaa atagcttttt caaaatgaaa aaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AJ001015. Homo sapiens mRNA...[gi:3171911] Links  


LOCUS       HSRAMP2                  780 bp    mRNA    linear   PRI 29-MAY-1998
DEFINITION  Homo sapiens mRNA encoding RAMP2.
ACCESSION   AJ001015
VERSION     AJ001015.1  GI:3171911
KEYWORDS    RAMP2 gene.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   McLatchie,L.M., Fraser,N.J., Main,M.J., Wise,A., Brown,J.,
            Thompson,N., Solari,R., Lee,M.G. and Foord,S.M.
  TITLE     RAMPs regulate the transport and ligand specificity of the
            calcitonin-receptor-like receptor
  JOURNAL   Nature 393 (6683), 333-339 (1998)
  MEDLINE   98282119
REFERENCE   2  (bases 1 to 780)
  AUTHORS   Fraser,N.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-SEP-1997) Fraser N.J., Receptor Systems,
            GlaxoWellcome, Medicines Research Centre, Gunnels Wood Road,
            Stevenage, Herts. SG1 2NY, U.K
COMMENT     Co-expression of RAMP2 with the calcitonin-receptor-like-receptor
            (CRLR Acc. No. U17473) results in the production of a functional
            adrenomedullin (ADM) receptor.
FEATURES             Location/Qualifiers
     source          1..780
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="SK-N-MC"
                     /cell_type="Neuroblastoma"
     gene            1..780
                     /gene="RAMP2"
     CDS             69..596
                     /gene="RAMP2"
                     /function="CRLR receptor-activity-modifying-protein"
                     /codon_start=1
                     /evidence=experimental
                     /protein_id="CAA04473.1"
                     /db_xref="GI:3171912"
                     /db_xref="SPTREMBL:O60895"
                     /translation="MASLRVERAGGPRLPRTRVGRPAAVRLLLLLGAVLNPHEALAQP
                     LPTTGTPGSEGGTVKNYETAVQFCWNHYKDQMDPIEKDWCDWAMISRPYSTLRDCLEH
                     FAELFDLGFPNPLAERIIFETHQIHFANCSLVQPTFSDPPEDVLLAMIIAPICLIPFL
                     ITLVVWRSKDSEAQA"
     sig_peptide     69..173
                     /gene="RAMP2"
     mat_peptide     174..593
                     /gene="RAMP2"
                     /product="unnamed"
BASE COUNT      153 a    265 c    190 g    172 t
ORIGIN      
        1 ggatataggc gcccccacac ccgggcccgg ctaagcgccg ccgccgctcc tcgcctcctt
       61 gctgcacgat ggcctcgctc cgggtggagc gcgccggcgg cccgcgtctc cctaggaccc
      121 gagtcgggcg gccggcagcc gtccgcctcc tccttctgct gggcgctgtc ctgaatcccc
      181 acgaggccct ggctcagcct cttcccacca caggcacacc agggtcagaa ggggggacgg
      241 tgaagaacta tgagacagct gtccaatttt gctggaatca ttataaggat caaatggatc
      301 ctatcgaaaa ggattggtgc gactgggcca tgattagcag gccttatagc accctgcgag
      361 attgcctgga gcactttgca gagttgtttg acctgggctt ccccaatccc ttggcagaga
      421 ggatcatctt tgagactcac cagatccact ttgccaactg ctccctggtg cagcccacct
      481 tctctgaccc cccagaggat gtactcctgg ccatgatcat agcccccatc tgcctcatcc
      541 ccttcctcat cactcttgta gtatggagga gtaaagacag tgaggcccag gcctaggggg
      601 cacgagcttc tcaacaacca tgttactcca cttccccacc cccaccaggc ctccctcctc
      661 ccctcctact cccttttctc actctcatcc ccaccacaga tccctggatt gctgggaatg
      721 gaagccaggg ttgggcatgg cacaagttct gtaatcttca aaataaaact ttttttttga
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U60805. Human oncostatin-...[gi:1794210] Links  


LOCUS       HSU60805                4171 bp    mRNA    linear   PRI 23-JAN-1997
DEFINITION  Human oncostatin-M specific receptor beta subunit (OSMRB) mRNA,
            complete cds.
ACCESSION   U60805
VERSION     U60805.1  GI:1794210
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4171)
  AUTHORS   Mosley,B., De Imus,C., Friend,D., Boiani,N., Thoma,B., Park,L.S.
            and Cosman,D.
  TITLE     Dual oncostatin M (OSM) receptors. Cloning and characterization of
            an alternative signaling subunit conferring OSM-specific receptor
            activation
  JOURNAL   J. Biol. Chem. 271 (51), 32635-32643 (1996)
  MEDLINE   97115791
   PUBMED   8999038
REFERENCE   2  (bases 1 to 4171)
  AUTHORS   Mosley,B. and Cosman,D.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-JUN-1996) Bruce Mosley, Molecular Biology, Immunex
            Corporation, 51 University St., Seattle, WA 98101, USA
FEATURES             Location/Qualifiers
     source          1..4171
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..4171
                     /gene="OSMRB"
     CDS             368..3307
                     /gene="OSMRB"
                     /codon_start=1
                     /product="oncostatin-M specific receptor beta subunit"
                     /protein_id="AAC50946.1"
                     /db_xref="GI:1794211"
                     /translation="MALFAVFQTTFFLTLLSLRTYQSEVLAERLPLTPVSLKVSTNST
                     RQSLHLQWTVHNLPYHQELKMVFQIQISRIETSNVIWVGNYSTTVKWNQVLHWSWESE
                     LPLECATHFVRIKSLVDDAKFPEPNFWSNWSSWEEVSVQDSTGQDILFVFPKDKLVEE
                     GTNVTICYVSRNIQNNVSCYLEGKQIHGEQLDPHVTAFNLNSVPFIRNKGTNIYCEAS
                     QGNVSEGMKGIVLFVSKVLEEPKDFSCETEDFKTLHCTWDPGTDTALGWSKQPSQSYT
                     LFESFSGEKKLCTHKNWCNWQITQDSQETYNFTLIAENYLRKRSVNILFNLTHRVYLM
                     NPFSVNFENVNATNAIMTWKVHSIRNNFTYLCQIELHGEGKMMQYNVSIKVNGEYFLS
                     ELEPATEYMARVRCADASHFWKWSEWSGQNFTTLEAAPSEAPDVWRIVSLEPGNHTVT
                     LFWKPLSKLHANGKILFYNVVVENLDKPSSSELHSIPAPANSTKLILDRCSYQICVIA
                     NNSVGASPASVIVISADPENKEVEEERIAGTEGGFSLSWKPQPGDVIGYVVDWCDHTQ
                     DVLGDFQWKNVGPNTTSTVISTDAFRPGVRYDFRIYGLSTKRIACLLEKKTGYSQELA
                     PSDNPHVLVDTLTSHSFTLSWKDYSTESQPGFIQGYHVYLKSKARQCHPRFEKAVLSD
                     GSECCKYKIDNPEEKALIVDNLKPESFYEFFITPFTSAGEGPSATFTKVTTPDEHSSM
                     LIHILLPMVFCVLLIMVMCYLKSQWIKETCYPDIPDPYKSSILSLIKFKENPHLIIMN
                     VSDCIPDAIEVVSKPEGTKIQFLGTRKSLTETELTKPNYLYLLPTEKNHSGPGPCICF
                     ENLTYNQAASDSGSCGHVPVSPKAPSMLGLMTSPENVLKALEKNYMNSLGEIPAGETS
                     LNYVSQLASPMFGDKDSLPTNPVEAPHCSEYKMQMAVSLRLALPPPTENSSLSSITLL
                     DPGEHYC"
BASE COUNT     1169 a   1004 c    911 g   1087 t
ORIGIN      
        1 gggccgcctc tgcacgtccg ccccggagcc cgcacccgcg ccccacgcgc cgccgaggac
       61 tcggcccggc tcgtggagcc cttcgcccgc ggcgtgagta cccccgaccc gcccgtcccc
      121 gctctgctcg cgccctgccg ctgcgccgcc ctcggtggct tttccgacgg gcgagccccg
      181 tgctgtgcgg gaaagaatcc gacaacttcg cagcccatcc cggctggacg cgaccgggag
      241 tgcagcagcc cgttcccctc ctcggtgccg cctctgccca gcgtttgctt ggctgggcta
      301 ccacctgcgc tcggacggcg ctcggagggt cctcgccccc ggcctgccta cctgaaaacc
      361 agaactgatg gctctatttg cagtctttca gacaacattc ttcttaacat tgctgtcctt
      421 gaggacttac cagagtgaag tcttggctga acgtttacca ttgactcctg tatcacttaa
      481 agtttccacc aattctacgc gtcagagttt gcacttacaa tggactgtcc acaaccttcc
      541 ttatcatcag gaattgaaaa tggtatttca gatccagatc agtaggattg aaacatccaa
      601 tgtcatctgg gtggggaatt acagcaccac tgtgaagtgg aaccaggttc tgcattggag
      661 ctgggaatct gagctccctt tggaatgtgc cacacacttt gtaagaataa agagtttggt
      721 ggacgatgcc aagttccctg agccaaattt ctggagcaac tggagttcct gggaggaagt
      781 cagtgtacaa gattctactg gacaggatat attgttcgtt ttccctaaag ataagctggt
      841 ggaagaaggc accaatgtta ccatttgtta cgtttctagg aacattcaaa ataatgtatc
      901 ctgttatttg gaagggaaac agattcatgg agaacaactt gatccacatg taactgcatt
      961 caacttgaat agtgtgcctt tcattaggaa taaagggaca aatatctatt gtgaggcaag
     1021 tcaaggaaat gtcagtgaag gcatgaaagg catcgttctt tttgtctcaa aagtacttga
     1081 ggagcccaag gacttttctt gtgaaaccga ggacttcaag actttgcact gtacttggga
     1141 tcctgggacg gacactgcct tggggtggtc taaacaacct tcccaaagct acactttatt
     1201 tgaatcattt tctggggaaa agaaactttg tacacacaaa aactggtgta attggcaaat
     1261 aactcaagac tcacaagaaa cctataactt cacactcata gctgaaaatt acttaaggaa
     1321 gagaagtgtc aatatccttt ttaacctgac tcatcgagtt tatttaatga atccttttag
     1381 tgtcaacttt gaaaatgtaa atgccacaaa tgccatcatg acctggaagg tgcactccat
     1441 aaggaataat ttcacatatt tgtgtcagat tgaactccat ggtgaaggaa aaatgatgca
     1501 atacaatgtt tccatcaagg tgaacggtga gtacttctta agtgaactgg aacctgccac
     1561 agagtacatg gcgcgagtac ggtgtgctga tgccagccac ttctggaaat ggagtgaatg
     1621 gagtggtcag aacttcacca cacttgaagc tgctccctca gaggcccctg atgtctggag
     1681 aattgtgagc ttggagccag gaaatcatac tgtgacctta ttctggaagc cattatcaaa
     1741 actgcatgcc aatggaaaga tcctgttcta taatgtagtt gtagaaaacc tagacaaacc
     1801 atccagttca gagctccatt ccattccagc accagccaac agcacaaaac taatccttga
     1861 caggtgttcc taccaaatct gcgtcatagc caacaacagt gtgggtgctt ctcctgcttc
     1921 tgtaatagtc atctctgcag accccgaaaa caaagaggtt gaggaagaaa gaattgcagg
     1981 cacagagggt ggattctctc tgtcttggaa accccaacct ggagatgtta taggctatgt
     2041 tgtggactgg tgtgaccata cccaggatgt gctcggtgat ttccagtgga agaatgtagg
     2101 tcccaatacc acaagcacag tcattagcac agatgctttt aggccaggag ttcgatatga
     2161 cttcagaatt tatgggttat ctacaaaaag gattgcttgt ttattagaga aaaaaacagg
     2221 atactctcag gaacttgctc cttcagacaa ccctcacgtg ctggtggata cattgacatc
     2281 ccactccttc actctgagtt ggaaagatta ctctactgaa tctcaacctg gttttataca
     2341 agggtaccat gtctatctga aatccaaggc gaggcagtgc cacccacgat ttgaaaaggc
     2401 agttctttca gatggttcag aatgttgcaa atacaaaatt gacaacccgg aagaaaaggc
     2461 attgattgtg gacaacctaa agccagaatc cttctatgag tttttcatca ctccattcac
     2521 tagtgctggt gaaggcccca gtgctacgtt cacgaaggtc acgactccgg atgaacactc
     2581 ctcgatgctg attcatatcc tactgcccat ggttttctgc gtcttgctca tcatggtcat
     2641 gtgctacttg aaaagtcagt ggatcaagga gacctgttat cctgacatcc ctgaccctta
     2701 caagagcagc atcctgtcat taataaaatt caaggagaac cctcacctaa taataatgaa
     2761 tgtcagtgac tgtatcccag atgctattga agttgtaagc aagccagaag ggacaaagat
     2821 acagttccta ggcactagga agtcactcac agaaaccgag ttgactaagc ctaactacct
     2881 ttatctcctt ccaacagaaa agaatcactc tggccctggc ccctgcatct gttttgagaa
     2941 cttgacctat aaccaggcag cttctgactc tggctcttgt ggccatgttc cagtatcccc
     3001 aaaagcccca agtatgctgg gactaatgac ctcacctgaa aatgtactaa aggcactaga
     3061 aaaaaactac atgaactccc tgggagaaat cccagctgga gaaacaagtt tgaattatgt
     3121 gtcccagttg gcttcaccca tgtttggaga caaggacagt ctcccaacaa acccagtaga
     3181 ggcaccacac tgttcagagt ataaaatgca aatggcagtc tccctgcgtc ttgccttgcc
     3241 tcccccgacc gagaatagca gcctctcctc aattaccctt ttagatccag gtgaacacta
     3301 ctgctaacca gcatgccgat ttcatacctt atgctacaca gacattaaga agagcagagc
     3361 tggcaccctg tcatcaccag tggccttggt ccttaatccc agtacaattt gcaggtctgg
     3421 tttatataag accactacag tctggctagg ttaaaggcca gaggctatgg aacttaacac
     3481 tccccattgg agcaagcttg ccctagagac ggcaggatca tgggagcatg cttaccttct
     3541 gctgtttgtt ccaggctcac ctttagaaca ggagacttga gcttgaccta aggatatgca
     3601 ttaaccactc tacagactcc cactcagtac tgtacagggt ggctgtggtc ctagaagttc
     3661 agtttttact gaggaaatat ttccattaac agcaattatt atattgaagg ctttaataaa
     3721 ggccacagga gacattacta tagcatagat tgtcaaatgt aaatttactg agcgtgtttt
     3781 ataaaaaact cacaggtgtt tgaggccaaa acagatttta gacttacctt gaacggataa
     3841 gaatctatag ttcactgaca cagtaaaatt aactctgtgg gtgggggcgg ggggcatagc
     3901 tctaatctaa tatataaaat gtgtgatgaa tcaacaagat ttccacaatt cttctgtcaa
     3961 gcttactaca gtgaaagaat gggattggca agtaacttct gacttactgt cagttgtact
     4021 tctgctccat agacatcagt attctgccat catttttgat gactacctca gaacataaaa
     4081 aggaacgtat atcacataat tccagtcaca gtttttggtt cctcttttct ttcaagaact
     4141 atatataaat gacctgtttt cacgcggccg c
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AL356292. Human DNA sequenc...[gi:21530888] Links  


LOCUS       AL356292              154068 bp    DNA     linear   PRI 19-JUN-2002
DEFINITION  Human DNA sequence from clone RP11-363I22 on chromosome 1, complete
            sequence.
ACCESSION   AL356292
VERSION     AL356292.21  GI:21530888
KEYWORDS    HTG.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 154068)
  AUTHORS   Howden,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-JUN-2002) Wellcome Trust Sanger Institute, Hinxton,
            Cambridgeshire, CB10 1SA, UK. E-mail enquiries:
            humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk
COMMENT     On Jun 21, 2002 this sequence version replaced gi:20068401.
            During sequence assembly data is compared from overlapping clones.
            Where differences are found these are annotated as variations
            together with a note of the overlapping clone name. Note that the
            variation annotation may not be found in the sequence submission
            corresponding to the overlapping clone, as we submit sequences with
            only a small overlap as described above.
            This sequence was finished as follows unless otherwise noted: all
            regions were either double-stranded or sequenced with an alternate
            chemistry or covered by high quality data (i.e., phred quality >=
            30); an attempt was made to resolve all sequencing problems, such
            as compressions and repeats; all regions were covered by at least
            one plasmid subclone or more than one M13 subclone; and the
            assembly was confirmed by restriction digest. The following
            abbreviations are used to associate primary accession numbers given
            in the feature table with their source databases: Em:, EMBL; Sw:,
            SWISSPROT; Tr:, TREMBL; Wp:, WORMPEP; Information on the WORMPEP
            database can be found at
            http://www.sanger.ac.uk/Projects/C_elegans/wormpep This sequence
            was generated from part of bacterial clone contigs of human
            chromosome 1, constructed by the Sanger Centre Chromosome 1 Mapping
            Group.  Further information can be found at
            http://www.sanger.ac.uk/HGP/Chr1
            RP11-363I22 is from the library RPCI-11.2 constructed by the group
            of Pieter de Jong. For further details see
            http://www.chori.org/bacpac/home.htm
            VECTOR: pBACe3.6
            -------------- Genome Center
            Center: Wellcome Trust Sanger Institute
            Center code: SC
            Web site: http://www.sanger.ac.uk
            Contact: humquery@sanger.ac.uk
            --------------.
FEATURES             Location/Qualifiers
     source          1..154068
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="1"
                     /clone="RP11-363I22"
                     /clone_lib="RPCI-11.2"
BASE COUNT    46562 a  31289 c  31194 g  45023 t
ORIGIN      
        1 gaattcaaga ccagcctggg caacaaagcg agactcttgt ctaaaacaaa caaaaaaaag
       61 ctgagtatgg tggcacgtgc ctgtagtccc acttagttgg gagactgagg caggaggatc
      121 cttgagccct ggaggtcaag gctgcagtca gctgtgattg cactactgta ctcctgcctg
      181 ggcaagagca agaccctgtc tcaaaagaaa aaaaaaaaaa aagtggcctg ggggaaacag
      241 gacagtcata taccccaaga atggaaattt tcatgccacc ccccttcctt ccttacaaat
      301 tctaatgctg attgcatgaa cgaggtgttg agagagactt ttcttcattt agacatttgc
      361 cctagtgaat gagggagcaa gagaatgcta agagggatga gagaatatat ttctggctct
      421 aggactttga catccaggta attgaatctg ggacattctc attaggttag aaaaggaaag
      481 atgagaaaat cactagcagg atatgcaatt ccagagacca tctgctccaa aaattagttt
      541 tgtttctcat tgctccttat tcagcgacag ttatttcaat aagctacatt atttagcaga
      601 gtctgagaga caaaggctgt gaaaacatgt atttatgata gaaaacaaac tcagaaaaat
      661 ataatagata gttcactttt tatctcattc acattctata gctggagaac tcaaaatccc
      721 atttagatat ttctgcttat ccaaaagcac actgggtgaa actaaaccac caaatgggag
      781 aaaatgttca ttgggtgtgt gtggcttcac ccagtttatt tacctatgga gtggaagtgt
      841 agggagaaat aaggtctgct tataatggtc aaggtctatg gagaagccct gaagttgctc
      901 tccccaaatc acaagtctga ttcaagaaaa ggaaacaaaa atgatgaaaa catctcatca
      961 cacaaaactc agtgatgggg tctctgacag tcaccagcca gcaggggaaa aggagaaatc
     1021 cacctgccgg ctttaagatt tattgaaggc tgccagcaca gcccagatca tttctgtggc
     1081 actaggcttt gtcccttcca cttcagggtc cagttctact aagtccttgg ctcgattcat
     1141 tgccacatca tacttgtcat ctgtcagaga ggagaagaca ttctctagca catcagagga
     1201 gtgggctagc accaggagtg ctagtgttcg cttgtccata cgctgagggt catttaccca
     1261 ccgctctagt acactatctt gaagtttttt cactagtcgc tgtttctctg ttgtattggt
     1321 cactggatga gtagtcatgt caaatagcag gaaattctgc ttctcagtgg ttagaatacc
     1381 tttctctact aggttctttg cgatgcgctc tcgtacattt ctcagctggt actgtaattt
     1441 gaaggggttc caggtctcac ctatgagaaa aaagaaatag gacataatga actgaaagag
     1501 aaaagcaaaa tgaaacatgt agttgtttgg aaacttgaat ggctaaaatg tttaaaagca
     1561 taatcttcat tcctcaattc atttaatcct tagtccttta taatctggct ttgactccaa
     1621 ccacatttca cattaaacca gcttcaaaaa atgtcaccaa ggacttccat attgctagag
     1681 cttatagact ttatttctgt tcttatgtga cctctgtgac actgtaatta tttatgtatt
     1741 tttcaaattc tatttaggac accagtctct cccagatgat ttctactcta tattgtggaa
     1801 gttctatttg ctggtgaagc taagaggact gaggttccct accctatttg tagagcagaa
     1861 gctctatccc aggcctggta ggctgacaat atttagactt caattgctcc cattccagct
     1921 tgttcgtaga ttagagattc tgcaacagaa aaggcaagtc aagaagacca ggggctaaca
     1981 cctctttccc ccagcaccta ctcatacagc aggtgggtat cactgtagga gaagtgggcc
     2041 cttgtgcaga agttctgccc aggagggaag gaaggcctga agaatgaaga gctccaaagc
     2101 tctgcctgag aggactgact ttatttggaa cacagtgtgg agaattctat gcctaagggt
     2161 gttgaaaacc atagagatct tggtggagag caattacaag gtgctgctag ctctatggca
     2221 ctggtatgac aaacccaaat ggcaaagcct ccagaagttt aacaggcaga accagagaaa
     2281 cagaaagcca agaaaagtcc ttctggaatc atgaccaaca ctgggggttg ggaagactgt
     2341 gtgcttgcac caggctgcac ccactcagga gcaatcagag caggacgtgg ggcttgactt
     2401 gagagtattc cccagaccac acatagatct atcaaaaaag ggtggatgct tcactggcac
     2461 caagggctaa agcacaacca ctgaaccaac actggctgaa caataagcta ccctaactca
     2521 gaggcaactc ctagaaagct aagcttaaaa ataataataa tagccgggtg tggtggtaca
     2581 cgcctgtagt cccagctact tggaggctta ggcaggaaaa cccaggaggc ggaggtttca
     2641 gtcagctgag attgtaccac tgcactccag cctgagtgac agagtgagac ttcatctcaa
     2701 gaaatgaaaa ataaaaaata aaaataaaat taaaaaaatc acatacatcc ctggtgtatg
     2761 tccaaggatg tactctctcc gagaagaaat cagagaggga acttccaagc tgctagttcc
     2821 tcactgaatg tggggcaaaa tgtaaactcc ctgatttgtg atggctatgt caaaatcaca
     2881 cacacatatc caacagtaaa ggataaaaaa atttaattgg ctaaaaggtc ttaagcacaa
     2941 cctttgacta ataagtggct tctactgacc cactggcaac acacagggag ttaggccaaa
     3001 agataaaaac aagaaaatat gagcagtgac atcagaggct gcacactaca agagaaacag
     3061 acttcagaga gctgctgcag gcaagtcact aaggaaacaa attaccaagc aaacaacaca
     3121 atcaacctct agtgggaagg agaaaccagt atagcaggtg tccccaaccc ctggaccatg
     3181 gactggtact ggtctgcagc ctgttaggaa cccggctgca cagcaggagg tgagccatgg
     3241 gtgagtaaag ttgagctctg cctcccatca gatcagcagt ggcattaggt tcccacagga
     3301 gtgtgaaccc tattgtgaac tgcacatggg agggatctag gttgcatgtt ccttgtgaga
     3361 ctctaataaa tgcaatgcac ttgaatcata ccgaaaccat ccccccgtct cccaccccag
     3421 tggaaaaact gtcttccacg aaacctgtcc tggtgccaga aaagttgggg acagctgcag
     3481 tgtagtatcc agagttgcac aattatctaa aatgtccagt ttttgacaaa aaagtatgaa
     3541 gtatacaaag aataaattgt gacccattga cccattcaga agaaaaaaag caggcaatgg
     3601 aaactgcctt tgagaacgct aaaatgttgg actttacaaa gactataaag cagcttttat
     3661 aaatatgttc aaagacctaa gggaaaccat gacttatcat atagagaata taaataaaga
     3721 cagaaattat aaaaaagatc catttagacc aggtgcggtg gctcacgcct gtaatcccag
     3781 cattttgaga ggctgaggtg ggtggatcat ttgaggccag gagttcaaga caagcctggc
     3841 caacatggcg aaacgccatc tctactaaaa atacaaaaat tagccaggta tggtgggcac
     3901 ctgtaatccc agctgctcag gaggctgagg cagagaatcg cttgaacctg ggaggtggag
     3961 gctgcaatga gctgagatag tgccactgca ctccagcctg ggcaacaaag caagactcca
     4021 tctcaaaaaa acaaaaacaa aaaaaccatt tagaaatcct agagtttaaa agtacaacag
     4081 actgggcatg gtggctcacg tctgtaatcc cagcactttg ggaggctgag gcaggcggat
     4141 tacttgaagc caggagtttg agaccagcct ggccaatatg gtgaaaccct gtctctacta
     4201 aaaatacaaa attagccggg tatggtggtg tgtgcctgta gttccagcta ctcaggaggc
     4261 tgaggcagga gaatcacttg aacccaggag gcggaggttg caatgagcca agatcacacc
     4321 attacactcc agcctcagcg acagagcgaa actcaaaaaa aaaaaaaaaa aaaaaagtac
     4381 aacaaacaaa atgaaaaaat cactcaaggg atccaacaag agatctgagc tggcaaaata
     4441 aagaattagc aaacttagag atcaaagaaa ttatgtaata tgaagaacaa ggagaaaaga
     4501 gaatgaagaa aaatgaacag actcagagaa atataaagta tcatcaactt gatgaacata
     4561 cataaaatgg aagcaccaga aggaaaggag agaaagaggc agaaaaaata ctcaagaaag
     4621 taatggctaa aatcttgcca aacgtgatga aaaacactca accacacatt ggagaagttc
     4681 aacaaactcc aagttggatg aacacaaaga gacccatatc caacatcata gtaaaaatgt
     4741 tgaaaaccaa aaataagtgg aaaatcatga aagcagaaga gaaaaatgac ttatcacatc
     4801 acagtggaac tttgtaagct taataactga cttcccatta gaaaaaatgc aggccagaag
     4861 aatgagtaga cttattcaaa gtgagggtag aaactattag ccaagaatct tataactggc
     4921 aaactatctt tcagaaataa aagcaaagca ggtgtggtgg catgcacttg tagtcccagc
     4981 tactcaggag gctgaggcag gagaatcact tgaacccagg aagtggaggt tgcattgagc
     5041 tgagatcgtg ccaccgcact ccagactggg tgacagagtg agactctgtc tcaaaaaaaa
     5101 aaaaaaaaaa aaaaaaaaaa ccctaaaaaa aacaaaacaa aaatgaaggc aaagaaaaat
     5161 ttctaaaaag gagaaaaaat aaatacattc ctagatctga aagagttcac tgctagcaga
     5221 cctgcttata agaaatatta atactaaaac taaagggaag ttgttaaggc tgagagcagg
     5281 acacactaga cagtaatttg aatccacgtg aacaaacagt aaaagtaatt atgtaagtaa
     5341 taaaaaagat ggtataattg gatacttcat ctcaactgat ttataaagca attgcacaaa
     5401 acaatattta tataattgtt attgttgagt tataattttt aactttttaa acttttgcgc
     5461 aaggttataa cacatagaaa tgtaataata gcacaaagaa agagggtggg aaaaaagttg
     5521 tgttgaagta aggaaaatga caacagatgg taacttgaat ccacaggaag aaatgaagaa
     5581 aaccagaaat ggtaaataag gaggttaata taacaaactc tataagtata tatttgtgct
     5641 ttcttctctt ctttaataat tacaacaatg aattgttgaa tttgtaacac atagagatgt
     5701 aatacgtgta acaatattgg catggtgggc ccatgcctgt aatcccagaa ctttgggatg
     5761 ccaaggtggg cagattgctt gagcccagca gtttaagacc agtctgggta acatagtgaa
     5821 actttgtcta aaaaaacaaa acaaaacaaa aaacaacaac aacaaaaaac ccaaaagcca
     5881 aaaaaacaaa aaaaaaaaaa aagaaaagaa acatatagca atattaacac aaaaagaggg
     5941 agagggaata gagttatata gaagtaaagt ttccttggaa ttaaacttgg tataaatctt
     6001 ttttttttta tttttttttt tttttgagac ggagtctcac actgtcgccc tggctggagt
     6061 gcaatggcat gatctcagct cactgcaacc gctgcctgcc gggttcaagt gattctcctg
     6121 cctcagcctc ctatgttaga tttactggag aaagacttga aagcaacttt tttttttttt
     6181 tttttttttt ggagacaggg tcttgctttg ttgccaggct ggaatggagt ggtgtgatct
     6241 tggctcactg cagccaccac ctccccagcc caactgatcc tcccatctca gcctcctgag
     6301 tagctgggac tataggcatg taccaccatg ttcagctaat tttatgtttt gtagagatga
     6361 ggtctcactg ttgaccaggc tagtctcaaa ttcctggact caagtgatct tcccacctga
     6421 gtgtcccaaa gtactgggat tacaggtgtg agccaccaca cccactacaa ctatcttttg
     6481 tttttttttt tgagatggag tttcactctt gtcacccagg ctggagtgca gcggcacaat
     6541 cttgtctcac tgcaacctcc acctcccagg ttcaggtgat tctcctgcct cagcctccca
     6601 agtagctggg attacaggtg cccaccacca cgcccagcta atttttgtat tttcagtaca
     6661 gacggggttt caccatgttg gccaggctgg tgttgaactc ctgacctcca gtgatccacc
     6721 tgccttggcc ttccaaagtg ctgggattac aggtgtgagc caccgcaccc agccacaact
     6781 gtcttaaaga tgctcaaata tttaaaaaaa tggataaaga caggaaaata atgtagaagc
     6841 aaaatgataa tatcaataaa gagatagaga aagggaccaa aaggaaattc tggagctaaa
     6901 acttataata actgaaataa aaaaatatac taggggggct gggcatggtg gctcattcct
     6961 gtaatcccag cactttggga ggccaaggtg ggtgagtcac ttgaggtcag gagttcaaga
     7021 ccagcctggc cagtataggg aaaccctatc tctactgaaa atacaaaaat tagctagtgt
     7081 ggtggtggga gcctgtaatc cctgctactc gggaggctga ggcagggaga agtgcttgaa
     7141 cctaggaggc agaggttgca gcgagccaag attgtgccac tgcactccag cctgggaggc
     7201 agagcaagac attccgttta aaaaaaaaaa acaaaaaaac aaaaaacgaa aaacaaattt
     7261 gaagaagcag aagaaataat cagcaagcag ttccctgtta ggatacaccc tccagatcaa
     7321 cagggaaata ttatggctac atttgagcct tttcctttta aatttgggaa acgccatgag
     7381 gggcccgtct gggcctgttc caaaccgggg catttccagc ttaggccatt ccctcatccc
     7441 tgtacaatat ctgtcccccg ccatagcagg tagtgccgca gtagatttat gctgcacaaa
     7501 agctgtgagc cttctgcctg gggaactccc gcaaaaggtc ccaacaggag tctgtggacc
     7561 cttgccagcg ggaacgataa gattacttct aggagatcta gtttaagttt aaaaggagta
     7621 caaatacata caggagtcct tgattcagat tacagtgggg aaattcaaat tgttatatct
     7681 acttctgttc cctggaaagc agagccagga gagcgtatag cacagctcct gattgtaccg
     7741 tatgtggaaa tggggaaaag tgaaattaaa tgaacaggag gatttggaag cacaaataaa
     7801 caaggcaaag ctgcttattg gatgaatcaa attactgata aatgtcctac ctgtgaaata
     7861 actattcagg gaaagaaatt taaaggtttg gtagctatag gagcagacat ttcaatcatt
     7921 tctctacagc actggccgtc tgcgtggcca attcaacccg cccaatttaa catagttgga
     7981 gttggtaagg accctgaagt gtatcaaagt agttacattt tccattgtga agggcccaat
     8041 ggacaacctg ggactattca accagttata acttatgtac ctataaattt atggagaaga
     8101 gatttattac aacaatgggg agcacaagtt ctaattccag agcaattata tagccctcaa
     8161 aatcaacata tgatgcatga aatggggcat gtccctggta tgggactagg aaaaaatttg
     8221 caaggtttga aggaaccact tcaagtggaa agacaaagtt cctgccaagg tttaggatat
     8281 catttttgat ggcagccatt gttaagcctc cagagcctat acctttaaaa tggttaacag
     8341 ataagccaat ttggatagaa cgatggccac taagtaaaga gaaactagag gctttaaagg
     8401 acttcgttaa tgaacaatta gaaaacggac acagctccaa cattttcctc ttggaattct
     8461 ccagttttcg caattaagaa aaaatcaggt aaatggagaa tgttaacaga tcttagagcc
     8521 atcaattcag ttatacaacc tatgggagca ttacagccag gacggccttc tcctactaca
     8581 attctaaaaa ttcgccttta atagttatag atttaaaaga ctgtttcttt attatcccct
     8641 tagctgagca agactgtgaa tggtttgcat ttacaattcc tgcagtaaac aacctgcagc
     8701 ctgctaagcg ttttcactgg aaagtgttgc cacaaggcat gttaaacagt ccaacaattt
     8761 gccagactta tgtagggcaa acaattgaac ctactcgtaa aaaattttca cagtgttaca
     8821 ttattcacta tatggatgat gtatttgtgc tgcccccact caagaaatat tactccaatg
     8881 ttatgatcac ttgcaaaatt cgatttctcg tgctggttta attacagctc ctgacaaaat
     8941 tcagactact acaccttact cctacttggg gaccttagta aatgacacta ccattgtgcc
     9001 acaagaagta gccatacata gggatcaatt gaaaacatta aatgactttc aaaaattact
     9061 aggggacatt aattggatac gacctgctct aggcatccct acctatgcca tgagcaatct
     9121 attttctatc cttagaggag atcctagtct cactagccct cggcaattaa caaaagaagc
     9181 tgaggcagag ctgcagcaaa tcgaaaagca agtccataaa gctaaaacaa atagaataga
     9241 tccagagaag actctagatt tgctaatttt tccaactcag cattcaccta ctggtgttat
     9301 tgtccaagag caggacttag tagaatggct ttttctttca catactaatt cacagactct
     9361 aactccttat ttggatcaaa tcgctaccat gataggaaat gggagaactc ggattgttaa
     9421 actataggat atgatcctga aaaaattatt gtccctctca cgaaggcaca aatacagcag
     9481 gcttttataa ataatcttac ttggcaaacc cacttagcta actttgtggg tattctcgat
     9541 aatcattttc ctaaaatgaa actgtttcaa tttttgaaat taactaattg gattctccct
     9601 agaataacta aatttaaacc aattgaaggt gctgagaatg tttttacaga tgggtctggt
     9661 aatggtaaag cttcttattc tggctctaaa ggtaaagttt tccagacgcc ctatacttca
     9721 cctcaaaaag cagagcttgt agctgtaact gaggtactga ctgcttttaa tatgcctatt
     9781 tatgtgattt ctgattcttc atatgtggtt cattccacac agttaattga aaatgctcag
     9841 ttacgatttc atacagatga acaactgatg actttattta cccagttaga aacagcagtt
     9901 aggagtagaa tgcacccttt ttacatcact cacattaggg ctcataaacc tcttcccgga
     9961 catttaactg aagggaatca aatggctgat tgcctagctg ctaatgcaat atctaatgct
    10021 aaacactttc gcaatttaac ccatgttaat gcctctggtc tcaaaagcag atacagcatt
    10081 acctggaaag aagctaaagc tattatctag cgatgcccaa cttgtcaaat ggtatattcc
    10141 tcatctttta caggaggagt taatccttga ggactggaac ctaattctct ttggcaaatg
    10201 gatgtcacac atgttccctc atttgggaga ctagcttatg tacacgtatg tgtggacacc
    10261 ttttctcact ttgtctgggc tacatgccaa atcaggagag tcttctgcct gtgttaaacg
    10321 tcaccttttg cagtgtttgt ggtgatgagc attccagctt ctattaaaac agataatgcc
    10381 ccaggctata ctagccaagc tctagctaca tttttctcta tatggaatat taaacacaat
    10441 actggtatcc catataattc tcaaggacaa gccacagtgg aaagaatgaa tctctcccta
    10501 aaagagcagt tgcaaaaaca aaacatgggg gaaacaggga ttacaggaca ccccatatac
    10561 aactgaatct agcatcatta actttaaatt ttttgagcct gcctaaaggc cagatgttag
    10621 cagcagctga acagcatcta accagctgca aagacagaag caaaacaact ggtttggtgg
    10681 agagatccgg tagcaaaaag ttgggaaata ggtaaaataa taacttgggg tagaggttat
    10741 gcttgtgttt ctccaggacc gaatcaacag ctgatttgga taccatcaag acatctgaaa
    10801 ccttatcatg agccagatgc cgaggaagag attccgggag gatcccgagg accccccggt
    10861 tgcagccatg tcgagactga tgctgaggag gaccccaact gtcattagca acacccgttg
    10921 aacacagcca cccacctgag ggcagatcaa gaagctgtca cagatggagg aagaaaacct
    10981 gaggaaatcg ggacaaccag tcgcaatgag taatttaatg gtagctatga tagtggtgat
    11041 caccattgct gtgcgtattc cttcaacaag ggctgacaca gagaacaatt atactcattg
    11101 ggcatattta tcaatcttgg ctggcaataa tgcctggatg taatcactct atgatgcagt
    11161 tacacatgct ttctgacctc agtatttacc ataataaatc tgctcctata gttgaggcat
    11221 accaccctca gaaacctatg tgtaaacaaa attgaacctg gccagagaaa atgaacgtac
    11281 ttgtttagga agattgcatt gcagaacagg aagaggtgct gcgcaatgat tcttatggaa
    11341 tcattactga ttggtcccat aaggggatgt ttagcttgaa ttgcacctct cagtttgcgt
    11401 gtcatggcca cactatgttc agctggtttg aacaaaatgg tcagatggta gaaatggtaa
    11461 gaagtatgac aagagttcct attatctgga aacatggtgg tagagtggca cctcaatctc
    11521 aaatgatatg gcccgctgta ggagctaaac ataaggattt gtggaaacta ttaatggctc
    11581 ttaataagat ccaaatttgg gaaagaataa aaaaccatct aaaaggacac tctacaaact
    11641 tgtctttgga tattgcaaaa ttaaaagaac aaatatttaa agcatcccag gcacacctga
    11701 ccttaatgcc aggaactgga gtgcttgaag gagctgcaga cagattaaca gctagtaacc
    11761 cattaaaatg gataaaaaca cttggaagcc ctttgatttc aatgataatt gtgttattaa
    11821 tctgtgttgt ttgtctctgt atagtctgca ggtgtggatc ctgacttctg tgagaagtag
    11881 ctcaccttga caaagctgcc tttgctttta ttgctttgta aaacaaagaa gggggacgtg
    11941 ttgggaatag gcctcaaaaa tctggccata aactggcccc aaaactggcc ataaacaaaa
    12001 tctctgcagc actgtgacat gcttgtgatg gccttgatgc ccatgctgga aggttgtggg
    12061 tttactggaa tgagggcaag gaacacctgg cccacccagg gcggaaaact gcttaaggtg
    12121 ttcttaaacc acaaacaata gcatgagaga tccgtgcctt aaggacatgt tcatgctgca
    12181 gataactagc cagagcccgt ccctttattt cggcccatcc ctttatttcc cataaggaat
    12241 acttttagta aaccttatga ttggcttgct gtcaataaat atgtgggtaa atctctgttc
    12301 aaggctctca gctctgaagg ctgtgagacc cctgatttcc cactccacac tatatttctg
    12361 tgtgtgtgtg tctttaattc ctctagcgcc actgggttag ggtctccaca actgagctgg
    12421 tctcggcaag attaatatcc agaatatata taaagaacct ctggccaggc acagtggctc
    12481 atgcctgtaa tcccagcatt ttgggagacc aagatgggta gactgtctga gctcaggagt
    12541 tcagagacca gcctgggtga catagcaaaa ccccatctcc accaaaaaca caaaaaatta
    12601 gccaggtgtg atggtgcgca cccatagtcc cagttacttg ggaggctgag gtgggaggat
    12661 ccttgagccc gggagataga ggttgcagtg agccgagatt gcaccacggc actccatcct
    12721 gggtgacaga gtgagaccgc atctcataaa taaatacaaa aaaacctcaa caacaacaaa
    12781 acaatcaaac ttaaaaatgg gaaaagtatt ttatatttct ccaaagaaga tatataaatg
    12841 gtcagtaagc acataaaaat atgctcaata tttttagtca ttaggaatga catgaaaact
    12901 attttcattt gaaaataaaa atcaaaacca caatgagata acattttagg cctactcgga
    12961 tgtctataat taaaaaaaag aaaagaaaga gaagaacaag tggtgatcag catgtggaca
    13021 aattggaacc ctggtgcata ctggtgggaa tataaaatgg tgcagtctct gtggaaatgt
    13081 ttggtgattt ctcaaaaggt taaacataaa actgctatat gacccagcaa ttctattcct
    13141 aggtatatat ccaaaagaat taaaaacagg aacttgaaca gatatttgtg tatcaatgtt
    13201 cacagcagca ttattcacaa tagccaaaag gtggaaacaa ggcaagtgtc catcaacaga
    13261 tgaatggtta aacgaaagtg ttatatacat acaacggaat atcactcagc cataaacagg
    13321 aatgaaattt tgatatatgc tgtgacatga atggaccttg aaaacattat gcttggtgaa
    13381 taagctaggg aaagtaatga cagatattgt atgattctac ttatatgagg tacctagagt
    13441 aagcaaattc atagaaaaag aaagtagaat agaggttgcc agggactggg gggagagaga
    13501 aagaggaagt tatcatttaa taggtacaga gttttaatgg gtaatgatga ggaagctttg
    13561 ggtatagtat agatagaggt gacggttaca aaatattgca aatatattta atttcacaga
    13621 attgtacact taagaatggc taaaataata ataaatgttg gctgggcgcg gtggctcacg
    13681 cctgtaatcc cagcactttg ggaggccaag gcgggcagat cacgaagtca agagttcgag
    13741 acaagcctgg ccaatatggt gaaaccctgt ctctactaag aatacaaaaa ttagccaggc
    13801 gtggtggcac gtgcctgtaa tcccagctac ttgggaggct gaggcaggag aattgcttga
    13861 acccgggagg cagaggttgc agtgagctga gattgcatca ctgtactcca gcctggacga
    13921 cagagcaaga ctccatctcg aaataataat aataataata aatgttatgt atattttacc
    13981 acagtaagaa tgttactcgt tgatttacat atgacccagt tatctactcc tagctacata
    14041 cccacaagaa atgaaaacat gttcaaacaa aaacttgtcc gtgcgtgttc acagcagcat
    14101 tattcacaat acccaaaatg tggaaataac tcaaagattc atcaactgat gactggataa
    14161 acaaatgtgg tatggccatg caataaaata ttatttggca aattaaaact gggaaattct
    14221 aataagaata atagactgca tcaatgccaa tattctcatt gtcctctcaa taaagatgtt
    14281 ttaaaagttt cttgtgaaga atactattta agggcttttg gggtggggtg gcagaagata
    14341 aggtgaagct ttggaaaact ttaggtatca attaaccatg ctttttctgt ccctcattag
    14401 aattaaatag tgaaccacat ctatctcatc cagcacgatg aatttcggta catcttttac
    14461 taaatttcag ggtaggataa agataacaaa gtgtgaaatt catatattat tttaagtcat
    14521 cttaccagtg agtagctcta tccatgtttg gacagtttct gtgggttcag ttgctttgat
    14581 gtgtttcaga gtttcatcca gtaaaacatc acctgttggg ctgtctgact ttagcagtac
    14641 ctttgagaaa aggagaaaga aggaaccctg attccttcac ttcattcaag ttaaaatctt
    14701 tcaatatatc atgtatatgt tactggttaa aaattgcaag aatgtaagga agcataatat
    14761 actgtctctg tcctcaagaa gtttaaaata tagtatggag tgatagtgca aagacaaaaa
    14821 taacatttga agtgggccct agtgacaaac tggatatgaa tgataatgga gaaaagtttc
    14881 aataacaact cattttgagt ctggaaacca acttttgaaa ccattaataa aggtaggaaa
    14941 cagctgggct cagtggctca tgcctgtaat cttagcactt tgggaggctg aggcgggtgg
    15001 atcacctgag ctcaggagtt tgagaccagc ctgggcaaaa tggcaaaatc ttttctctat
    15061 taaaaaaata aaaatttaaa taaataaata aataaataaa caaaggcagg aaagtcagaa
    15121 aaggaagtca gttttgggga aaggttgata atttggtttt ctttcatttt tgttaagttt
    15181 ggttttaaat gatatgggat agccaagttg aaatttctag ccaatagtta aatatgtttc
    15241 tggaggtgga aaaacaagac ctgcctagaa atttgaaaat taatctcatg aaggtgacag
    15301 acgaagctat aaaggtaaat gatatctctg aaaggacaga agatcactca agtgatacaa
    15361 actgagactt ggatgttaag aaatggagaa gtaaggaaac agtgtatagt cagagaagca
    15421 agagaataag ataataacaa aggccaataa tgaagaaagt ttcaaagtga taaaaagtgt
    15481 caaatgacac atataggtta aaagagaaaa ataaatcaaa gactgaaaag aagctggtga
    15541 tacctaagaa tgtagtttaa gtggagtgct taggatgata aagaagttgt gagtggaaaa
    15601 attcagtaag aaaatggata caggagtatg gttggaaagg acttgcagta aatggaagga
    15661 aacagaatat tattggagaa taaccagggt taatcaatga ctgacctgac ccatatgagt
    15721 ccctctccct acatcatact ctatttcgta attttttttt tttagagacg gagtcttgca
    15781 ctgtccccca ggctggagtg cagtggcggg atcttggctc actgcaagct ccaccttccg
    15841 ggttcacgcc attctcctgc ttcagcctcc cgagtagctg tgactacagg cgcccgccac
    15901 aatgccccgc taattttttt tttaaatatt tctcgtagag atgggatttc accgtgttag
    15961 ccaggatggt ctcaatcgcc tgacctcatg atccgcccgc ctcggcctct caaagtgctg
    16021 ggattacaag cgtgagccac tgtgcccggc tatttcataa tgttatagct atagatatag
    16081 gaggaattta atcaacagca tcctgagtat gatttttctg cataaaaaag gaagacaaaa
    16141 ccaggagctt cctataactc tttggactat ggatactttg atttctccct cttgcaatag
    16201 ggtaattaac ctgtttagac aatggaactg ctgagtcagt gatatttatt ttatcttact
    16261 actcaaatct ttctattctt cccaaatgta ctgttggtat ttgcctttcc ataagccatg
    16321 gacgttcaca gaggattcag tacctttctg tctagtagtc gcttcttacg catggtcggg
    16381 ggttccagat agattcgacc ccgcatggcc agctctatca ggatgccccc tcgcaggcct
    16441 gatgatatgc agtcattcca gaaagatgtg tacccctagg aaaggagaaa agagaagaga
    16501 aagaatgttt tgagaggaat actgacttcc agcaggcacc ctaagaacag gccgctttgt
    16561 tgtgattcgt ttcacatcat actacatgaa ctcaaagtat agagtaagag gatgtttctt
    16621 taatcctctt aaacaccccc aaatatgatt ttctgtctga cttcaagatc tttagtgaat
    16681 ttccatggtt tgctgttagt caataaatga tgtgcaattt ttttttttga gacagggtct
    16741 cgctttttcc cccaggctga aatgcagtgg catgatcatg gctcactgca gccttgacct
    16801 cctgggctca agcaatcctt ccacctcagc ctcctgagta gctgggacta caggtatgtg
    16861 ccaccatgcc tggctgtttt taaaaaaaat ttttttgtag agacaaggtc ttgctatgtt
    16921 gcctacgctg gtctcgaact cctggcctca agctatcctc ccaccttggc ctccgaaagt
    16981 gttgggatta cagacatgag tgactgcacc tggtttgaca tgcaaaattt taatttttga
    17041 ggagaatttt ggcttttttg aagtctgttt ccatgacaca aaatttcaga gtggggtaaa
    17101 ataagtcaga aaaaagtaaa aagcatttga gcttttaatc ctcctagata tgtattaatt
    17161 tgaagttctt ttttcttttt tttgagatgg aggcttgctc tgtcgcccag gctggagtgc
    17221 agtgacgtga tcttggctca ctgcaacctc cgcctcccag gttcaagcga ttctcctgcc
    17281 tcagcctcct gagtagctgg gattacaggt gtgcactgcc acacccagct aatttttgta
    17341 tttttagtag agacgggatt tcactatgtt ggtcaagctg gtctcaaact cctgacctcg
    17401 tcatccacca gcctcagtct cccaaagtgc tgggaataca ggagtgagcc accgtgccca
    17461 gccaatttga agttcttttg atgggaggta taatagaaac ccactgtcat taagtgcctt
    17521 ggttatgtta cttaaacttt ctgaagtaca ttgtcctcat ctataaaatg cagacaccaa
    17581 tatatatttt cataagattg ttatgagggt tacatgagat acaatatata agaaagcatt
    17641 tagtatagtg cctggcatat agcaggcact taaaatgtta gctccccttc cctttgtata
    17701 gggaatatac gtataagaga gccactggta ttgtctaaca gcagtaacgt ttaattacta
    17761 atgtctattg aatacttatt atatactaga tatgtgctaa tacttattat gtactagatt
    17821 tgtgctaagc tttttacatg cattatctga ttatatcttc acaacaaccc tataaaggaa
    17881 gtactactac tacccacaca ttaaagatga gaaaactaag gcttacagag gttatacagc
    17941 ataacagagg tcacgaagct aaatagagaa atttttaagt ggttgttttt ttggtggtgt
    18001 tttactgata gatcatagtt gtgcatattt tggggttact tttttttttt aagagacagg
    18061 cggggtcttg ctccattatc caggctgaaa tgcagtgggg ccatcatagc tcactgtaga
    18121 cttgaactcc agggctcaaa caattgtcct gcctcagcct cccaagtagc tcagactaca
    18181 ggcatgagac actgcaccta gctataataa ctatttaaaa aattttttaa ttcaattttc
    18241 ctctttattg gcttggaagt tatgtacttt tagaagttat ccttgaaatt ttaacatgca
    18301 tgatacatta atttaaaatt aacataaaac ctaaatttta aatattaata ttttaaccac
    18361 tcttttagtt gtccagtatt tttttttagc ctcataaatt agatatcatc atcatcatta
    18421 ttattttata caggtagcac ttgtttagat ttacacaggt gcttacgaat ttctttgctc
    18481 accgttcctt cctgtatctc agacttcctt tctggcacca tttttcttct tcctaaagta
    18541 gaatttcctt aagtgaaagt ctgttggagg cagtttctat tttgcttatc tgaaatgtct
    18601 ttgtatttca ccatcatcct tgaaagatag tttctctgga cataaaattc caggttgatc
    18661 attccaggtt atcacattga agatagtatg ccctcttctt ccagcttcta atatagcagt
    18721 gagaaatcta tctttcttat tgattgtggt ttttttgtag gtggtgtatc ttttctttgg
    18781 tcctttaaga tcttctcttt tggtgttttg cagttttact ataacatgac tgtgcatgaa
    18841 tttatttatt ctgtttggta tatgttgtgc ttcccataac tgtggtttca tttcttagct
    18901 tcaaaaattc tcagctctta ggtctctaag cactttggct ctcctttatt ttcttttttc
    18961 tttccttcta ggacttcagt ctttatttta accccgtctt ttaacctttt atattttcca
    19021 tttgcttgtt ctctgtattg ctttctggag agtttttgtt tgtttgtttg tttctgagat
    19081 ggagtctcac ctgtcaccca ggctggagtg caatggcaca atctcggctc actgcaacct
    19141 ccgtctccca ggttcaagca attctcctgc ctcagcctcc caagtagctg ggattacagg
    19201 tgcacctcac cacgcctggc taattttttt gtggagatgg ggttttatca tgttggccag
    19261 gctggtcttg aactcctgac ctcaggtgat ctgcctgcct cagcctccca aagtgctgga
    19321 attacaagca tgagccacca gcctggcctt tggagagttt ttaaaaaata tccggcttcc
    19381 tatttactaa ttcaaactaa ccctgagatt ttgatttcaa caatttttca ttagtagaag
    19441 ttttatttag ttatttttca catttgtctg gtgatttgtg attttttgtt tattttgtga
    19501 ttattttatt tctttaaaaa tgtcatattt ttagtctgat tattttaata tgggacaata
    19561 tatgaggtcc ttgatacctg acaatatctg aggactttat ttgtcacttc ttccttgttt
    19621 gtatgtttga tgatctttga ttaagaggtt cacgtttaac gttttgtgaa tttacgagat
    19681 tctggaggcc taaattgagg acatcttcct tcagagggaa agatctgttt ctgccgagag
    19741 ccaagtacag agggattgtg ggagtgccac acaactggga gcacatttaa gcctagtttc
    19801 cccactttac ctcaaccatc aagtttattc ctgaccaacc aactcctgaa tctcaagact
    19861 gatttggtaa tttgccccag ggtaatttga tcttcatgtt atctttattg ctcacctctt
    19921 ggattttaat tttggtgttt gggggcccct tgaagattat ttctactttc caagactcaa
    19981 taatacatta aaaattaagc ttattcaaga cttaatatct gaccaccata ctgccagggc
    20041 tagaggacct ctaattctat ttgtcctgct atagccagga caatctttaa catcaagtgt
    20101 aatgatataa ctccctcccc tcatgcttaa aaatctttca aacattcctc tcagccttca
    20161 ggatggaatc caaatggcta atgcaggagg atcgcttgag gccaagagtt ggccagcctt
    20221 tgggaacaca gcgagacctt gtctgtaaaa acaaaacaaa acaaaacaaa agtggctatt
    20281 tactagtctt ttcttttttt ttctttcttt tttttttttt tttgagacgg agtctctctc
    20341 tgtcacccag gctggagtgc agtggcgcaa tctcggctga ctgtaagctc cacctgccgg
    20401 gttcacgcca ttctcctgcc tcagcctccg agtagctggg actacaggcg cccgccacca
    20461 cacccagcta aattttttta tatttttggt agagatgggg tttcaccgtg ttagctagga
    20521 tggtcttgat ctcctgacct cgtgacccgc ctgcctcggc ctcccaaagt gctgggatta
    20581 caggagtgag ccaccacgcc cggcccacta gtcttttctt tggcctcttt cccctattgg
    20641 cctttgcaca cactgctctg ttgaaaagaa ccaatattct tccttccaaa actcctcttt
    20701 gtagttaagc gttaagtaca ggggtcacat ttccacataa ccttctctaa cattgtttac
    20761 ttctccactc caccctctgg attaggttct gcacctgggt gtgcccatac atacttctct
    20821 cataacttct attatatcat attgcaaaca actatttctc ctgaccagac tacaaattcc
    20881 ttaaggagag aatatccctt atgccgcaaa aatatttttg aaagcataaa tttatgctta
    20941 aaagtaaatg ttaacctggc aaaagttcag cttgagccat aaaatattcc ttagacatct
    21001 ccaaatttcc tgagggtctg atctttgact caaggttggg attagttccc tcccctgcct
    21061 gcctcatggc tactccaagt catatttgtt tttttggttt tgtttttgtt tggacacagg
    21121 gtatcgctct gtcgcccagg ctggggtgca gtggcatgat ctcatctcac tgcaacctcc
    21181 acctccctgg ttcaagcaat cctctcacct cagcctccca agtagttggg actacaggcg
    21241 cgtgccacca cacccagcta atatttgtat ttttagtaga gatggggttt tgccatgttg
    21301 cccaggctgg tctcgaactc ctcagctcaa gtggtatgcc cacctcagcc tcctgaagta
    21361 ctaaaattac aggtgtgagc cactgtgcct ggccccaact catatttgtg actcctcttt
    21421 aaggaagggg taggcttttt ttttaggtta acacttgttt ttttttatag tttagctacc
    21481 taattaaagt taaactataa cttccttcag aacacgtaat tatgtaggca ttatttattc
    21541 tcgtaataaa tcttaacagg tgtgcaaaca ttttagccaa tattaaagca actggaagaa
    21601 agttttcttt tgttgtctca tttcctcctg accatgtaac ttctgtttca gtgacagtga
    21661 catcagattg tcaacaaaac tataaatgag aagcagtata aagaaatgat gaagatcagg
    21721 ataaagaaac tacatttttg aatgtcacac tttcctattg tagcatctca aaacatacaa
    21781 aactttccaa acgagttttt aactttttta ttaagaatat gcaataatag acaaaagtat
    21841 aaaaacatat atgtagagtt ttaagaaaat aattataaaa caaacacctg tataactaca
    21901 agaaagagac tggatttaga aaaagaaaat aactgagaaa agagagaaca tgatagtaaa
    21961 tgttcccctg tactgtgggt acatatgacc aataaagatt ataatattgt catctatagt
    22021 aatatttaga acagattgaa acaaatttag gtgcagaata gcctggaggt aagggaaaat
    22081 acaattattt atatttaaag gtcattttcc tcttagtgta tctatctaac ttagtgaatt
    22141 ctccagccat tcataataaa gattctatat taaatggcca atgctttggt atgtctgagc
    22201 aggaagacct agcacgtaca tactaaagtt caccatttga tcattaaatc tcattaccaa
    22261 aggctatcta aagaaaattt taggacctac ccaacacaga tggaattgca cattgtctca
    22321 tggctaatgg agaatctcag agaatttgca gaataagata ttcagatagt ggactttaat
    22381 atacatctgt aggccgggcg cggtggctca cacctgtaat ccctgcactt tgggaggtca
    22441 aggcgggtgg atcacctgag gttaggagtt cgagaccagc ctggccaaca tgatgaaacc
    22501 ctgtctctac taaaaataca aaaattagct gggtgtggtg gcacgtgcct gtaatcccag
    22561 ctactcagga ggctgaggca ggataatcac ttgaacctgg gaggtggagg ttgcagttag
    22621 ctgagatcac accactgcac tccatcctgg gcaacaagag caaaactccg tctcaaaaaa
    22681 aaaaactgca ggcccggcgt ggtcgctcag gactgtaatc ccagcatttt gggaggctga
    22741 cgtgggtgga tagcctgagg tcagcagttc aagaccagcc tggccaacat ggtgaaaccc
    22801 cgtttctact aaaaaatata aaaattggcc aggtgtggtg gtagacgcct gtaatcccag
    22861 ccactcagga agctgaggca gaagaattgc ttgaacccca gaggtggagg ttgcagtgag
    22921 ctgagatctc accactgcac tccagcctgg gcgacaagat tgaaactcca tctcaaaaaa
    22981 aattgtaaaa aagaaagaaa aatcccatta tagaaatgta tactttaggc tgggtgtggt
    23041 ggctcacgcc tgtaatccca gcactttggg aggctgaggc gagtggatca tgaggtcagg
    23101 agatcgagac catcctggct aacacggtga aaccccgtct ctactaaaaa tacaaaaaat
    23161 tagccgggcg tggtggcggg tgcctgtagt cccagctact tgggaggctg aggcaggaga
    23221 atggcatgaa cccaggaggc ggagcttgca gtgagccgag attgtgccac tgcactccag
    23281 cctgggctac agagcaagac tctgtctcaa aaaataaata aataaataaa aagaaatgta
    23341 gactttagaa agtgaaagtg ttctagaatc ttatcccctc tagaggatca ctaccaacag
    23401 tttaatgggt acttttctag atttttatat acatattcta tgtttacact tattaatatt
    23461 atttctagaa atggcattac actgagtggt aacttttttc attaacccat tattctgggt
    23521 atcattccac atcagtaaat atacttcttc atcccttgta atgactgcat agtattgcat
    23581 tttacatgca ttataatttt atctatccaa gccccttact gatgtgcatt taggttgctt
    23641 acagtatttt gttattatta aaaaatgttg gctgggtgtg gtggctcaag cctgtaatcc
    23701 cagcactttg ggaggccatg gtgggtggat cacctgaggt caggagtttg agaccagcct
    23761 ggccaacatg gtgaaaccca gtctccacta aaaatacaaa aaaattagcc gggcatggtg
    23821 gcccgtgcct gtaatcccag ctactcagga gggtgaggca ggagaatcct atgaatccag
    23881 gaggcagagg ctgcagggag gtgagttcgt gccactgcac tccagcctgg gcggcagagt
    23941 aagactccat ctcaagaaaa aaaaaaaaaa gtaaagacgt attcatttgt gtgtgtattt
    24001 ctgttgaata tctagaagaa aattctgggt caaatattac acatatttaa actttaaaga
    24061 ttgcacaatg ggaattcaat aaattcagac agtcaacact ttatgctgaa tttccactta
    24121 gtcaagtatt caaagtgccc ttccttattc aaacttcctt gaagacacac cttaaaacaa
    24181 ctctgaaggt taaagatctt aaaagttgcc tgcttggcaa aaaaagagga aagaagggcc
    24241 gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccaaggcg ggcggatcac
    24301 ctaaggtcag agtttgagac cagcctggcc aacatggtga aacactgtct ctactaaaaa
    24361 tacaaaatta gccgggtgtg gtgccgcatg cctgtaatcc cagctattcg ggaggctgag
    24421 acaaggagaa tcgcttgaac ctgggaggcg gaggttgcag tgagccgaga tcgcaccact
    24481 gcactccaac ctggacaacg agagcaaaac tccgtctcaa aaaaaaaaaa aaaaaaaaaa
    24541 gaggaaagaa aacaaaacaa cttaaagcat cattaaaaaa tatgctcata aggagttttt
    24601 aaaataacat tagaaaatac ttgtgataaa gttgaatgaa aaatagccag atatataatt
    24661 gagtatccag gattattaca atgatataaa ataatgtcta cattaaaaca tggcttttta
    24721 acctttccaa attatctaca gtgagaaaat atcacttttt aaaggaaaat tttcagaaaa
    24781 aaagaacccc tccatctccc caagtctgaa gatttaacat taaaattgct tcaccttcca
    24841 tgagagaaga aatgttatca tgttgctttg aaccctttta tatctgccca attccaaagc
    24901 ccaaggaaag tataatgatc ttcaggggtt atcaaaaatt ctacaagcaa aactaatttt
    24961 attatttttt tcctcctaaa aaattactct aagaaaattt agtgtactgt ttgtccttga
    25021 atcactctat taaaagagtg gtattcagga tttgatgtaa attcctcttt tacaaattaa
    25081 tgtgattcat cgaaactcca attcctattt ctacatccac acagctcata aagacatttt
    25141 atgtatgtat gtatgtattt gagacggagt ctcaatctat cgcccaggct ggagtgcagt
    25201 ggcacaatca tggctcactg caacctctgc ctcccaggtt caagcgattc tcctgcctca
    25261 gcctcccgag tagctgggat tacaggggcc tgccaccaca ttcagttaat ttttgtattt
    25321 ttagtagaga cgggacttca tcatgttggc caggctggtc tcgaactcct gacctcaggt
    25381 gatccaccca cctcagcctc ccaaagtgct gggattacag gtgtgagcca ctgcacctgg
    25441 ccaagagatt ttataaaata gatcaagcat taaaagccca ctctgtttaa tcctattgga
    25501 gaatgaagaa ctgtgacctg atgctgttct ggtttaacag ataatttcag cctgatcgtg
    25561 aggaagggat tataaatata ggcaaatgga caggatttct caatgaatga aatgaagatt
    25621 gaatcctttc aggtttattt taaaaggtgg gtagaaagat gcaaagctag ttacaatgat
    25681 cagtttatta gtttttgcag ggcttctgtt ttttttttcc ttatcaaaag tttagcaaat
    25741 aaaaacaaca tgctaatgcg ttgtcagtga aagggaaatg caaaataaag gaaagcaact
    25801 ggtcatcttt gaattcccat ataagcttta aatctcattc tggtcagtaa aaatatcatt
    25861 agttgtttta ccctaaaaat ttttattttt tttatttttt agacatagtc tccctctgtc
    25921 acctaggctg gagtgacctg gccggaaaga ggaattttaa ttgctacaca ttgtccaaag
    25981 aaaacggaag aaaagccagg cgcggtggct catgcctgta atcccagcac tttgggaagc
    26041 cgaggcgggc ggatcacaag gttaggagtt cgagaccagg gtggccaaca tagtgaaacc
    26101 ctgtctctac taaaaacaca aaaaattagc tgggcaggcg tggtggtggg cacctgtaat
    26161 cccagctact tgggaggctg aggcaggaga atcatgtgaa tcaggaggcg gaggttgcag
    26221 tgagctgaga tcgcaccact gcactccagc ccgggtgaca gtgcaagact ccgtctcaaa
    26281 aaaacaaaac aaaacaaaaa acaaaagaaa acagaagaaa agagaaagtt gagaaggggg
    26341 agcaaaaatt tgaaaacagg ccaggctcag tggctcacgc cagtaatcct agtactttgg
    26401 gaggccaagg tgggagtact gcctgagctc gggagttcag gaccagcctg ggcaacacag
    26461 tgaaaccccg tctctaccaa aagtacaaaa aaaaaactgg ccgggtgcgt ctgtgggtcc
    26521 cagctacttg ggaggctgag gcaggagaat tgcttgaacc cgggaggtgg aggttgcagt
    26581 gagccgagat tgtaccactg tactccaacc tgggtgacaa agtgagactc ccatctccaa
    26641 aaaaaaaaaa aaaagaaaaa aaatggaaaa catgtccagg ctctaagtct atctccccac
    26701 aactcacgtg atttttttaa acaataaaat atttatcaat ttattcatcc acttatattt
    26761 agcttgaatc tgagtgactt atggaaattt tagaaattaa aatccatccc caaaacaacc
    26821 aattatagta tcattacaat aagtatttag ccatcttaag ggacaacttc aaaagaggca
    26881 gtactctttt gaatatagaa gttttcatat gtttgttaat aagagcaaaa taaactatca
    26941 ttagagtgaa caggtaacct acagagtggg agaatttttt tttttttttt gagatggagt
    27001 ctttctctgt cacccaggct ggagtgcagt ggcatgatct tggctcattg caacctccgc
    27061 ctcctgggct caagcgattc tcctgcctcg gcctcccgag tagctgggat tacaggcata
    27121 tgtcactagg cctagctaat ttttttgtat tttgagtaca gacagggttt caccatgttg
    27181 gccagggtag tttcgaactc ctgacctcag gtgatctacc tgccttggcc tcccaaagtg
    27241 ctggtattac aggcgtgaac caccgcactc ggcctttttc ttttttttaa agagatgagg
    27301 tcttggcctg gtatggtggc tcctacctat aatcccagca ctttgccaag ctgaggcggg
    27361 aggatagctg aggcacgagg accacttaag tccaaagttc gtaaccagct taggcaacat
    27421 actgagactc catctatata aaaaatttta aaaattagcc aggtgtggtg gcacatgcct
    27481 gtagtcccac ttactcagga ggctgaggtg gaaggatccc tcaagcccag gaggtcaagg
    27541 gggcagtgag ccatgatcat gccactgcac tctagcctgg gcaagagagt gagaccctgt
    27601 ctcaaaggaa aaaaaagaga gaaacagggt ttcactctgt catacaggct agagtacagt
    27661 ggcacaatca tggctcactg cagcctcaaa ctcctgggct tgagttatcc tcccttgtgg
    27721 ttaggactac aggtgtgtac caccacgtct ggctaattaa aaaaaaaaat ttgtagcaat
    27781 agggtctggc tatgttgccc aggctggtgt tgaagacctg gcctcgagtg atcctcccac
    27841 ttcagcctcc caagtagcta ggattacaga tgtgagccac ctggcctggc ttccaatgat
    27901 ttaaaacgga cttagaaaca caagcaatta tgttacgttc aaaccttaga agagcaaaac
    27961 taggtaagtg tagatttttt ttttttgtgg tcaaatagct tttctctcct tgattactag
    28021 catatacaca ttgtagaaaa tttgaaaaca aaaaaattca taaaaaataa aaatcaccca
    28081 taaattccat taccctatga tcatatgggt gtatgtacaa ccaaactttt ctgtatacat
    28141 aaaatacatg tatcttctac ccatcaacct attttcccat ctccttctct cccaacccca
    28201 attctggggc gtcccttcat agcaaaggtt tgtgtgaatt atggaacaac agtatgatat
    28261 atatgttata aacctcaaca aaatattacc agcctgggca gcatgttgaa accccgtctc
    28321 tactaaaaat acaaaaaatt agataggtgt ggtggcatgt gcctgtagtc ccagctactt
    28381 gggaggctga ggcaggagaa ctacttgaat ctgggaggtg gaggttgcag tgagccaagc
    28441 tcgcgccact acaccccagc ctgggcgaca gagtaagaga gactctgtct caaaaaaaaa
    28501 aaaaaaaaaa aaaaaaaaag tattgaggag tagagataga agaaactgcg aagagtactg
    28561 atttcatcat ctttcactac agggagtcag taggatattg tctaaaactg agtaatgtgg
    28621 taagaaaaca caaccccaat ttttttttaa catttttgat tgaactcctc cctccctccc
    28681 tcccttcctt tttttttttg tgtctccctt tgtcactcaa ccaggctgca gtagaggtgc
    28741 agtggtgtga acatggctca ctgtagcccc aacctcctag actcaaacaa tcgtctggcc
    28801 tcagcctcct aagtagctgg aactaccagc gcatgccacc atgcctggct aatttttgta
    28861 ttttttgcag agacaaggtt ttccgatgtt gcccaagcta gtctccaact cctgagctca
    28921 ggtgatgctc ctgccttggc ctctcaaagt gctgggatta caggctgagc caccatgccc
    28981 agccaacagt ttcaaacata cacaaagtag agagattatg tagctctatg tagccatcac
    29041 ctagtttcaa cagttatcaa tataatcttt catctttacc acccacccca cccaaaccca
    29101 gattatttct gtacaaattc cagctactat ataatttcat ccaaaaaata ctgcatattc
    29161 tccccaagag attaggattt tttaaaaaaa gaactgtaat accaatatca tacctaaaac
    29221 aacccaataa gaactccagt aagccttcat attccatcaa ttgtttagta aatttttttt
    29281 tacagttgct ttgtttgaat ctggagccaa agtccatact tagtcttctg tatttcatac
    29341 attctatttt tttttttttt tttttgagat ggagtttcac tctgtcaccc aggctagcag
    29401 gctggagtgc aatggctcga tcttggctca ctgcaacctc cgcctcccgg gttcaagcaa
    29461 ttctcctgcc tcagactccc gagtggttgg gattacagcc acccgccacc aagcctggct
    29521 aatttttgta tttttagtag agactgggtt tcaccatgtt ggccagggtg gtcttgaact
    29581 cctgacctca ggtgatccac ctgcctcagc ttccaaaaaa tgctgggatt acaggcgtga
    29641 gccaccgtgc ccagcctaca tgtgtaattt taaaagtaaa atgaaatata atttcatttc
    29701 atttggggcc tcagactccc aaagtgctgg gattacaggt gtgagccacc gtgccaggct
    29761 acatgcgtaa ttttatttta ttatttattt atttattttt gagatggagt ctcactctgt
    29821 cacccaagct ggagtgcagt ggcgcgatct tggctcactg caacctctgc ctcccgggtt
    29881 caagcaattc tcctgcttca gcctcctgag tagctgggat tacaggcatg agccaccgtg
    29941 cctggcctat ttatttattt atttttgaga cagagtctcg ctccattgcc caggctggag
    30001 tgcagtggtg tgatcttggc ttacggcaac ttctgcctcc tgggtccaag tgattctctt
    30061 gcctcaacct cctgagtagc tgggattaca ggcgtatgtc accgtgcctg gctaagtttt
    30121 tgtattttta gtagagatgg ggtttcatca tgttggccag gctggtcttg aactcctgac
    30181 ctcaggtgat ccacctgcct cagcctccaa aaaatgctgg gattacaggc gtgagccacc
    30241 atgcccagcc tacatgtgta attttaaaag taaaatgaaa tataatttta cttaacagaa
    30301 aaaaaatgaa gtgaaacaat tactaaactt caaacatttc tttttttttt ttttttaatt
    30361 tgagacaggg tctcactctg tcacccacgc tggagtgcat aatcgtagct cactgcaccc
    30421 tctacttctc gagctcaagc aatcctacca cctcagcctc ctaagtggat ggaactacgg
    30481 gcacatgcca ccatgcccat ttaatctttt aattctttta gagatgaggg tcttgctatg
    30541 ttgcccaagc tggtctcaaa ctcatagact caagtgatcc tcccaccttg gcctcccaaa
    30601 gtgctaggat tacaggcatg agccaccaca cctggcctca aacagttgtt tcttataaga
    30661 gatatgtcat gatgttggct atgggttatc aagtaaaaag gaaaaaaaaa tttttttaaa
    30721 gagaaatgtg gctgggcacg gtggctcatg cctgtaatcc cagcactttg ggaggctgat
    30781 gtgggcggat cacctgaagt caggagttcg agaccagcct taccaacatg gagaaacccc
    30841 gtctctacta aaaatacaaa attagctggg cgtggtggca catacctgta atcccagcta
    30901 ctcgggaggc tgaggcagga gaatcacttg aacccgggag gcggaggttt cagtgagccg
    30961 agatcgcctt gggcaacaag atcgaaactc cgtctcaaaa aaaaaaaaaa aaaaaaaaag
    31021 gacaaatgtc ctataaattc ctttctaaga tttaaaaaag agattattaa gaggttgggg
    31081 tcatttttaa aaactagatg tttcaatttt agataacgtc taagtgactc cttattatga
    31141 aagatacaag ttaatatcct tcccatattt tattacctta cttcccaatt ttaaaattat
    31201 aacttttatt ctgtgtagac tcataatatg tatattctat tctataacta taattcacca
    31261 gatctgctct actttatctg aatgtatgca gagagagctg atgctggtta catgagagtt
    31321 tattatattg ttttttacac atttttgtat gcctgaatta tttgtaattt tgaaaaaaag
    31381 gaatgtttta aactaaaaaa tgatgtttgg cggggtgcgg tggctcatgc caccgaggca
    31441 agcagatcat aaggtcaaga gatcaagacc atcctggcca acatggtgaa accccatctc
    31501 tagtaaaaac acaaaaatta gctggtgtgg tggcacacgc ctgtagtccc agctactcgg
    31561 gaggctgagg cagaactgct tgaacctggg aggcagaggt tgcagtgagc cgagagccaa
    31621 gatccagcct ggcgagggag caaggctccg tctcaaaaaa acaaaaagtt taagaatatt
    31681 tacagtattt ctgggaagta agatttcttc tttttttaat gttgttaatt atacttccct
    31741 ctattgctta aattctttat caagagcatt ttttcctaaa acaaacagga aaaacaagtt
    31801 tgtgtgaacc ttcacaattt taagactggt attattccct taaatctttc ttatgagtga
    31861 aaatgttaaa agggtgatag tgctgagaaa gatttgtcac ataagcataa atagtacaat
    31921 tagatgcata catagaatgg gcagtggtat tatgtctgca gtattctact tgaccaaccc
    31981 aaagcactac cccaaatagc gaagtacaga agctcagaaa tcacagcaca gttttctcac
    32041 tattctacta ggcaaaaatt ctagattaaa gttgaaaatg aaatctagaa tttagtcttc
    32101 tggcagacat ttaagttcct cacctattac caaaatgaaa gaccccaaaa cttaattgca
    32161 tgcagtcaca aatcttatct caaaagttaa ttagccaaaa tttaaagctg gttagaaata
    32221 ataaaaaggg ctgggcgcaa tggctcgcac ctgcaatcct agcactttgg gagaccgaag
    32281 caggactgct tgaggtcagg agttagagac cagcctgggc aacaaagtga aactccctgt
    32341 ctctacaaga aaataaaaaa aattagccag gcatggcggc atacgcctgt agtcccagct
    32401 atttcggagg ccaaagtgag aatatccctt gagcccaaga ggtggagcct gcagtgaggc
    32461 atgttcgtgc cactgcaccc cagcctggac gacagagtgg agactgttac caaaaataat
    32521 aataataata aaaaggccaa gtgtggtggc tcatgtctgt aatctccaca ctttgggaga
    32581 ctgaagcagg cagatcactt gaggctggga gttcgagacc cgcctggcta acacagtgaa
    32641 accctgtctc tattttaaaa aaagaataat aataaaaaaa agaataaaat gattattgct
    32701 aaatttttat cgactcctaa ttcagatgaa tttaagagct agactccaca gaaatgattc
    32761 cttatttagc agagggaaca aaatccagag aagtagggag tttcaacagg acgcttctgc
    32821 aacagtttag attggaggta ataagataaa gaagcaaaaa tgggaatgga aaggaggtat
    32881 caatggcaaa gagtgcagat ctaagactgc aagaattctt tcatgtattt ccaactctgg
    32941 aactatcacc agtacccaga gagaggtata tgaaaacaaa ataaactgtt atccttcctc
    33001 tacacctcaa taattcatct aaaatcccta gaagccacac ataactaatt cagatatcaa
    33061 gtgggtttcc tatcccccag attatatccc caagtaaatt aagagttttt tttaaaggat
    33121 tatacattat ccaagtaaga aaatagtgaa caatcgtatt tacttttttt tcatcctatg
    33181 tgtgaggccg ctgaaacaaa agtcaaagat tactaagaat actaggttag gctctcttct
    33241 aggaaacatt tttcagtgcc ctcttgttcc aataactaag gtcaggccat tggttgctca
    33301 gtttgtaaga gctgattcaa ctgatcattt tcaagtcaaa aaataagaaa ctactgcttt
    33361 gaaagtttaa ggctatccta cagatattta ctttaaagca tgtaagtggc tatttatgcc
    33421 cagcattcaa ctaaggtctc tctggccata agaaaatgac catttaatta agagccaaga
    33481 ctaatggttt aaacaacaac aaaaacataa taataatggt tattatttat tgagcattta
    33541 catgtgtcat gctctgagtt aaaattttta tataatcagg ctgggcgcag tggctcatgc
    33601 ctgtaatccc agcactttgg gaagccgagg cacctgaggt caggagttca agaccagact
    33661 gggcaacatg gtgaaaccct gtctctactg aaaatacaaa aactagccag gcatggtggt
    33721 ggacacctag aatcccagct actcgggagg ctgaggcagg agaaccactt gaacctggaa
    33781 ggtgaatgta gtgagctgag atcgcaccac tgcactccag tctgggtgac aagagcaaaa
    33841 ctccgcctca aaaaaataag tttttaaaaa tataattagt taatctaatc ctacaatagc
    33901 actattaagt caatattatc tctattaaat aagaatgaag ggaaaaaatg tgaaaatgga
    33961 gcccaagtca tatagctgat aagtggtaga accgggattt gaatgtagat gtgtctgatt
    34021 ccaaagcctg cgcttttatc cactgctgat aaaaaaatta gaaaataaaa gatggttact
    34081 aagcacactg agaggtatga taaatgcaag agaattcaga gaatgtaaag ttcagtgctg
    34141 gatttgtgat gggctttaaa acaaaatatg ggaaggaaac atggcaggaa tatataaaat
    34201 agacaaatgg gactggagtg gtaagtagac taagcagcga gaaataaggt tagtcagatg
    34261 aagtggggct tagaaagcca ggaaaacaca gtcctgctct cagggaggtt gcagactaac
    34321 cttaaagtta ataataatat acatttattc aatggaatcc tatgaaacca ttaaaaataa
    34381 tggggtatgt ttatatttat tggcataaaa tatgtccaca ttataatact tggaaaaaga
    34441 aggctacaaa atgatatgtc tgattataat caattttttg gtaattatgc atctgtggct
    34501 gggcatggtg gctcacacct gtaatcctag cactttggga ggctgaggca ggtggatcac
    34561 ctgaggccat gaattcaaga ccagcctggc caacgtggca aagccctgtc tctactaaaa
    34621 atacaaaaat tagctgggta tgtggcaggc gcctgtaatc ccatctactc tggtaactga
    34681 ggcatgagaa ttgcttgaac ttgggaggtg gagctttcag tgagctgcag gtaccacggc
    34741 actccagcct gagtgacaga ggagacaatg tctcaaagaa aaaaaaaaat gatgcatctg
    34801 tgtatacata tatatgcact gaaaacagtc tataaggata tatattacaa ttttaacagt
    34861 agttatcttt gggaagcaag attataggtg gtttttattt tcttctttac acctttaaaa
    34921 aaaggtataa tttacataga gtgaaatgca tagatgttaa gttcagtgag ttttgacaaa
    34981 agcatatatc catgttatct cacaccccaa gcaagatctg caacacttct atcatcccag
    35041 taagtgttct catggagggc caggagcggt ggctcacacc tgtaatccca gcactttggg
    35101 aggctgaggc aggtagatca cttgagtcta ggagtttgag accagcctgg gcaacatggc
    35161 aaaatcccag ctctataaaa attacaaaga attagctggg cgtggtggcc tgcacctgta
    35221 gttccagcta cttgggaagc tgaggcaggg ggatcacctg ctcctgggga aggtgaggct
    35281 tcagtgagcc ctgatagtgc cactgcactc cagtcttggt gacagactct ttctcaaaaa
    35341 aaaaaaaaaa aaaaaaaaaa aaaagtgttc tcatgtagat ctctgtctga atttcttagg
    35401 gtaaatgtat ttcactttta aagtatcaaa aagtaataaa attactttta tatcttagac
    35461 aaattttaaa aactaactaa aaaagaaaac aaataaaaaa gagaatacag taacctcaaa
    35521 ccctttgatt cagtattttt ttctatttgc cattcctacc ttaaacttct tggacaactg
    35581 tgaataggta aggtcaaaca gagaaaataa agaataactg gcacggactt tcttcctttt
    35641 ccatgttcct atagcatttt tttggtacct ttctcctagt acttacatac tgtgtctagc
    35701 tgtatggtta ctgcataaat atcttttctc ctctatcaaa tccctatgct tctgtaggaa
    35761 gataagcgca tcttacttac ttttatattt tccttacttt tctcagtgcc ttgcatcaag
    35821 gaggtactca aggccgggtg cagtggctca tgcctatagt cccagcactt tgggaggccg
    35881 aggcaggcag atcacatgag gtcaggagtt catgacctgc ccaggcaaca tggcgaaacc
    35941 ccatcgctat taaaaataca aaaattggcc tggagcggtg gctcacgcct gtaatcccag
    36001 cactttggaa ggccgaggcg ggcggatcac gaggtcagga gatcgagacc atcctggcca
    36061 acatggtgaa accccgtctc tactaaaaat acaaaaaatt agctgggcgt ggtggtgggc
    36121 gcctgtagtc gcagctaccc gggaggctga ggcaggagaa tggcgtgaac ccgggaggcg
    36181 gagcttgcag tgagccgaga tcgcgccact gcactccagc ctgggcgaca gagactctgt
    36241 ctcaaaaaaa aaaaaaaaaa aaaaattagc tgggcttggt ggtgggagcc tgtaatccca
    36301 gctactcggg aggctgaggt accagaagta cttgaaccca ggaggcggaa gcagcgagcc
    36361 aagatcacgc tgctgcactc cagcttgggt gacagagcga gactctctct caaaaaaaaa
    36421 aaaaaaaaaa aaaaaaaaaa aaaaagagag atactcaata aatgttggct cacttgagct
    36481 ggagaaaaaa aatcacatca ttagacaatt taatgatact tcctgagagt gagctgtcct
    36541 tcaggcggat gtacataaat aaatatgcaa ttatcttcat aacctatttt ttctaaaacc
    36601 tttctaaatc attgttatga gctgaattgt gtcccctcaa atttcatatg ttgaagttat
    36661 aacccccatt acctcagaat acatctgtct ttgaagatag ggcttttaga agaggtgatt
    36721 aagttaaaat gagggtgtta tagtgggccc taaaccaatc tgactggtgt tcttttgaga
    36781 ggaggaaatt tggacaccta gagacgccag gaaggcattt gcatagagga cacagcagga
    36841 agtctgcagg gcaaggtgag aagtctcaga agaaaccaaa cccatcaatc ttgatcttgg
    36901 acttctaatc cctagaactg tgagaaaatt aatttctgtc atttaagcca ccaagtctgt
    36961 ggtattttag cagggcatct ttagcaaact aatataatcc tactctcaca gaaaaatcta
    37021 gaagaggctt gtctagtcat ttagtcttaa gcttttttaa tgttttattt atttttagag
    37081 acagggtctc actatgttgc ctaggctcgt ttcaaactcc tgggctcaag caatcctcct
    37141 gcctcagcct cccaaagtgc tgagattaaa ggtgtgagac atgatgccca gccttggttt
    37201 taagctttaa gatgcaagaa aaaataacta tttgcttact catatttatc tctttttagt
    37261 ataatagtgt ctccctcctt ccataaggaa catgtgtgac atgggttcta atctatagct
    37321 gaagagtgaa ctgacaagaa aaataatgac tatttgtaca ctagaccttc tgatgtgatt
    37381 attttaattt tttttttgga gagagtttca ctcttgctgc ccagactgta gtgcaatggt
    37441 gcgatctcag ctcactgcaa actccgcctc ctgggtttaa gcaattctcc tgcctcagcc
    37501 tcccaagtag ctgggattac gggcatgcac taccacgccc agctaatttt tgtattatta
    37561 gtagagatgg ggtttcacca tgttggccag gctggtctca aattcctgac ctcaggtgat
    37621 ccacccgcct cggcctccca aagtgctggg attacaggtg tgagccacca tgcccagcga
    37681 taatatgata attataatta aatattattt acttattcgt taaacatatc ttccttttgc
    37741 cagttgggtt attcaaactt cttacttgga cttgttgccc tttatgacta gaatatcttg
    37801 tttcaaaaag gagaagagaa aaataatgta tagaattttc ctatttcctc cttaatattt
    37861 gaggcaatgg aatggaaaat ttgaagtaaa aataaaacta ataaaatcat tcagaattcc
    37921 ttacagtagt attttccctt tttctgacat tattgatgcc tagttaatgt tttttatagg
    37981 atcccaaaat ctttttccag gagtgagctt tgctggggta ttaaacattt gaacaagaaa
    38041 gatatggata agacctgaaa aataactcat atgccaaata ttattgtgaa acataataat
    38101 cttttatctt aataggaggc ataaattaaa atccatatga atctggccag atgcagtggc
    38161 ttatgcctgt aatcccagca ctttgggagg ctgaagcagg tggatcactt gaggtcagga
    38221 gtttgagact agcctggcca atatggtgaa accctgtctc tactaaaaat aaaaaattag
    38281 ccaggcgtgg tggcatgcac cttatagtcc cagctactca ggaggctgag gcaagagaat
    38341 tgcttgaacc cacgaggtgg aggttgcagt gggttgagat cgcaccgctg cactctagcc
    38401 tgggcaacag aatgagagtc catctcaaaa aataaaaata aaaatataaa aaataaaatc
    38461 cataagaatc tattcaagga ggtgaaatta tttgggtcac actaaacagt taacgatcct
    38521 gaacaattct ctttatttca ggaagaaata cagagaaaca tgtgaaactt gctttgtaag
    38581 tatgtctttt tttttttttt ttttgagaca gagtcttgct ctgttgccca ggctagagtg
    38641 cagtggcgca accttggctc actgcaactt ctgcctccct ggttcaagca attctcctgc
    38701 ctcagcctcc cgagtagctg agattacagg tgcgtgccac cacacccggc taatttttgt
    38761 atttttagta aagatggggt tttaccatat tggccaggct ggtcttgaat tcctaacctc
    38821 atgatctgcc tgccttggcc tcccaaagtg ctaggattac aggcgtgagc taccgcaccc
    38881 agccagcatg tcctttttaa actgcaaagt cagatatttg tttttgtttt gtttttcctg
    38941 ttttgagaca gagtcgtgct ctgtggccca ggctggagca cagtggtgtg atcatagctc
    39001 aatgcagctt cgaactcctg ggctcaagtg atcctctcac ctcaacctcc tgagtagctg
    39061 ggactatagg ttcacgcagg gctgttttca tttttttgta gagattgggt ctcattttgt
    39121 tgccctggct tgtcttgaac tcctggcctg aagtaatcct cccattttgg cctcccaaac
    39181 tgctgggaat acaggcagtg aaacaccacg cccagttgca gagtcagttt gaatgtaaat
    39241 tccacaattt attgcccaaa tgatcattta actttgactc ctataaagag aatgatacct
    39301 gccttgcagg attaaacaat agaaactgca aaatgttttg aaaataaatt atacaaacaa
    39361 aatatggagg caaaagtgat gaaatggctg acagcaagtt tataggcaca gtggttacag
    39421 ccaaaatccc aaattcaagt gggcagcttt gtctcaatat caatactaga tattttatat
    39481 tttgggagag tactgtgtgc atttggggat tggaagaatt agggagaaga tcttaatgaa
    39541 atagcttaag ctatattttc ctactattaa aacaggtctt tggtcttcca gtttatttcc
    39601 ttttatacca atagtaaata gttatttggg gttaatatgc ctggatataa taaaaaacaa
    39661 aagttaattt ttttaacagc ttatgtttaa agaaaaaaac ctgacaaaca agcccttctg
    39721 atacagttgc cctattcaca atcgcagtag ctactgggag acttattgtt cattttgcgc
    39781 gattaggaaa ctaaaaggaa agaccaggta gtaacttaaa ttagactttt aatttttttt
    39841 tttaagacaa ggtcttgttc tttgctcggc ctgctattaa tattttaatt ataataaaat
    39901 gctttaggat tgagtcactt tttaaggcgg ccaattttct tcattcatag ccaataatca
    39961 acaagttgga cactgactga ctgtgactga ttttaaaaac ctttttgggc tgggcgtggt
    40021 ggctcacatc tgtaatccca gcactttggg aggtcgaggc aggcggatca cctgaggtca
    40081 ggagttcgac accagcctgg ctaacgtggt aaaaccccgt ttctactaaa aatacaaaaa
    40141 attagccggg cgtggtggtg catgcctgta atcccagcta cttgggaggc tgacacagga
    40201 gaattgaacg cgggaggcag aggttgcggt gagccaagat cgtgccacta cactctagct
    40261 taggtaacaa gagcgaaact ctgtctcaaa aaaaaaacaa aaaaaaactt tttgattagt
    40321 gaattttttt taccagcttt aaagaagctt ttacttctaa tctcctattg ctaataaaga
    40381 tgaactatac ttgatggagg atcatcagca gcttgaaaaa gcgtagcatt aacatacttt
    40441 ataattaaaa atacatatga atttcatacg tataattccc atacgtatag tggataacag
    40501 tagattgttt gaaaaaagat aactgacaag taccaggaga attttagcag cactgttcat
    40561 aatgaactct tgaatgtgca ttaacccata gaatgaatag tggtatattc atgtaatgaa
    40621 atattaatca ggatgaactg gattaaacta tatgcaacaa catggacaaa tctcacaaac
    40681 attatattga tcaaaagaat acattaaaat attaacccaa taaaatagag tttaaaaaca
    40741 ggcaaaacta aaatatattg tttagggata tacatacatg tcaaaactac agggaagtgg
    40801 gcagcatgtg attaggggca ggcataagag aagccttcca ggatgctgat attgttctat
    40861 tggtgattac atgactgttg gctttcaaac tacttgttaa actgtatata tgtgctttat
    40921 acaattttct ggatacactg tatttcacaa ttttagaaaa cttaaacaat aaagcatgta
    40981 actggatgtt cagctctcat acttcagttt taaaacactg gagttcatta aaaggagatg
    41041 ggatgctttc agacatctgt ggttattcca gttgcacatc actgaaatta ttaggaacat
    41101 ccagttgatg ttcttttttc ctagaacctg tctctttaca gactaactca ctcttattcc
    41161 agttaagagt tattacctcc tggcggggcg cagtggctca cacctgtaat cccagcactt
    41221 tgggaggcca aggcagttgg atcacctgag gtcaggaatt caagaccagc ctggccaaca
    41281 tggtgaaacc ccgtctctac aaaaatacaa aaattagccg gggatggtgg caggtgcatg
    41341 taataccagc tacttgggag gctgaggaag cagaatcgct tgaatccggg aggtggaggt
    41401 tgcagtgagc caagatcacg ccactgcttt gcagcctggg tgacagaatg agattctatc
    41461 tcaaacaaac aaacaacaaa aaaaagagtt atcacctcct gtaagtcttt cccaacatct
    41521 cttcttccct ctcctactgc tccttcccac cacccatctg atttacgtgc ttttcctctg
    41581 tgtttccttc ctactcagaa cacacttata cacatatcac attgttctga aatggtctgc
    41641 tgactttctg gcttctactc ccagcatctc caaccatccc tatgccacta aaccataaat
    41701 tgacctcaaa agcagggaca attgtttttg ttctctcttt ctatgacata tgagtcattc
    41761 aataaaattt ttggtgatta aacaaataag tgaaaatgta ttgccatatt tccaatggat
    41821 aatacttctt gccaattttt accacagtga catgagaaat atgatgaaaa gcttttcttt
    41881 cacagtaagt agtaagtatt caataaatgg cattactagg agaaagcttt ccaagaaaga
    41941 gaggatggtt gaaaaacatt atccacaaat taaatagatg agaaatgaaa agaccactag
    42001 gagtcaaaag cagtaattgg cttactcccc ttacaggtaa cttaagaaac gattctaata
    42061 aatgaaaagt tttatacatt agcaagtcaa ttaaaacact gttcatgagt taataatcta
    42121 tatcaaaact cttaccaaag gcaataataa caaataagca gattcctgat ttcaacagga
    42181 gataaaccta ttttattttg gtcaggcttg tgctcaatgt ttactttctc agttgaggtc
    42241 agtgttattg aactgagata gtatctaatt ttaactgtgc ttgaaaggtt atactcttca
    42301 aattatattt aaatttcttg attgttttca taagacaatg atacttaata tttaatttaa
    42361 ttagtttata ttttttgagg cagtcttact ctgtcaccca ggctagagtg cagtagtgtg
    42421 accttggctc attgcagcct ctgcctcttg ggttcaagag attctcccat cagcatagct
    42481 aggattacag gtgcatgcca ccacacccag ctaatttttt ttttccttta gtagagatgg
    42541 ggtttcacca tgttggccag gctggtctcg aactcctggc ctcaagtgat ttgcccgcct
    42601 caaacctccc aaagtgctgg gattatagat gtgagccact gcgtctggcc agatgcttaa
    42661 tattttaaaa ccagtctgtg gtagttaccc ctcagaagaa tttttttttt tcttttaaga
    42721 cagggtctca ctctgtcacc caggttggag tacagtggca caaacacagc tcactccagc
    42781 cttgaactcc tgggctcaag caatcctctc gcctcaatct cccaagtagc tgaaactgca
    42841 ggtatgtgac accatgccag gctaattttt atattttttg tagagatgag gtctcactat
    42901 gttgcccagg ttggtctaga attcctggcc tcaggcaatc cttctgtctc ccaaagtgct
    42961 gggattacag acaggagcca ccgtgcctgc ttgaaagaca catagtttct aaaaaaatga
    43021 taatactggg ttctaaacag aggatcttag ggaactctcc agtaaaagtt tgggagtttg
    43081 ggacacaaaa tttaacattt aggatttttt ccttttttga cttgtcacaa atcccaaacc
    43141 atgtaaaaac aagcaattat atacacatta aaacatactc acaaagaaca taaaaatgtt
    43201 attaagccct taatttggac acaaaaatga aatgggcttt gctgttttca ttaggctgat
    43261 tctcttttcc ctaaaggaaa gagtctaaaa gtctctgcct ggtaacaaaa tctataatct
    43321 ggtccaaatc gaatttactg attttgaatc ccattgttct ccacctactt cagtcatact
    43381 ggcctactca cagttatctc ccaaactcat caagctcatg atacctttgc tcactcaggt
    43441 tccctcatcc agaacgtgtt tccttttctc cattcatgtt aatcctccca tttttaaagg
    43501 catggcttgt tttactgcct ttttctattc ttccccggat tactctatct catgatggtt
    43561 tctcctctct ctaactccct tggctcattc taacatgcaa aaacttgctt tttattgttg
    43621 tttactttaa aaaaaatatt ttgcattgtg aaatataatg tacacacaga agaataaaca
    43681 aaacaaagta aaaagcttaa ttattccaaa gtgaagatct atggcccagg tgcagtggct
    43741 cacgcttgta atcccagcac tttgggaggc caaggcaggt ggatcacttg agatcaggag
    43801 ttcaagacta gcctggctaa catgctgaaa acccttccct actaagaata caaaaattag
    43861 ccaggaggga cagagcaaga ctccgtatca aaaataaata aataaataaa aaaggccggg
    43921 agcagtagtt tacgcctgta atcccagcac tttgggaggc cacacctata atcccagcac
    43981 tttgggaggc caagttgggt gggtcgcctg agaggtcagg agttcaagac cagcctgacc
    44041 aatatggtga aaccccatct ctactaaaaa tacaaaaatt agccaggcat ggtggcgggt
    44101 gcctgtaatc ccagctactc aggagattga gacaggagaa ttgcttgaac tcgggaggcg
    44161 caggttgcag tgagccaaga tggtgccact atactccaac ctgggtgaca gagcaagact
    44221 ctgtcttaaa aacaaaaaca aaaacaaaaa caaaactatg aaacttcctc taggtccaga
    44281 aacagaaaat tgttactatc tagaaatcct ctccattttc cttcctaatt actaacaccc
    44341 ttttccttcc cctatccctt acaaaaagat aatcactatt ctaaaaacta tagtaattcc
    44401 tggcttgatt ttgtttataa ttgtaccacc taaatatgca acctcaaata caatagtttt
    44461 ttgagtttca tataaatgga atcatacagt atgtatattt ttgtgtttgg ctttttttat
    44521 tgaacataat gtgagaaata tacatgttgt ataacaatgt tcattttcat tctgtataga
    44581 aaacatattt atactcattt aaacaaatgg atgactcttt tttttttttg atgattcatt
    44641 tttatccatt ctgctgttga tggaccttcg ggttacttaa cttttaaaaa gtatgtatct
    44701 ttttgtctct ccagctagaa tgcaagtttc ttgaaggcag ggaccatata ctatactatt
    44761 ttatatttac cttagtatta atatttagca taatcctaag cacataatgg gtgtcctcaa
    44821 tcacattaaa tattgattaa atgaatctat tgatagtttg agagataaga aaaaaataag
    44881 tacagttggg aatcaattta atatttttag agatagttat gtaactggcc ttttgtaact
    44941 actactattt aaataaatca caaagtttga cctcttctat tcatttacac ctaaggattt
    45001 attgtctttt attcttcctg agccaaatga aagctaaata tttgttaact atcctcaaaa
    45061 ataaaattag agcaattcaa aaggtggtta attgttatgt gctattattt attatttttt
    45121 gaggccttgc tctgttgttc aggctggagt gcagtgacac aatcctggtt cactgaagcc
    45181 tcgacctccg aggcttaagc catcctcttg actcagcctc ctgagtagct aggactacag
    45241 gcatgtgcca ccatgtccag ctaatttttt tgttttaaat tttttgtaga gatggtgtct
    45301 caccatgttg cccaggctgg tcttgaactc ctgggcccaa gcaatcctct taccttggcc
    45361 tcccaagtgt taggattaca ggaattagcc attgcatcca gccatctgct tattttgtac
    45421 agtatctcac caatatgtta agtactcacc atggccatag ctcttaatga gctgctttat
    45481 tttttattct atttcctaaa atttatatta accattataa acacatgaag acaagcccaa
    45541 aagcaaaaat tccagtgcat aaggtcagca ggaaattaat cattcctatt caaagctaaa
    45601 acaaaccaaa tagcttttag ttctatatcc tctttttata cagtcagtga ctgtttccta
    45661 ggtaatgtgt aaacgtctca aaagagtatt tatatcacct aatcttcagt cctattctga
    45721 cattctcttg gagagctggt tttgcctttt cgtttatttg ccttttataa aactgaaggc
    45781 tcatcaactt ctcagcaaac aggaaaatat atttccgggc aagaatttct gatataccta
    45841 catcttggcc aggtacacat aagcatgtct ctaatattag ttttaacaat acacagacta
    45901 taactggtct gtaaattttg ttcaagttgg taatttaaca agaaaaaaag taaaaaggaa
    45961 aatagtaaat aaatgaaagg caaaatctaa acaacaatgg tatttccaag tcgaagaatt
    46021 atgtagaaaa gaagagctgt tactacttgt atgcagccaa ttcacttcta gctaaggata
    46081 attatcaaag ccaaatttag tacttttaaa atctaaaata acattctaat tttgtatgtt
    46141 ttcaaaactg aggccagtaa taaatatacc tctttctttt cttgcaggcg gtgggcgagg
    46201 gggggcggtt atttaaggct ttcctggaaa gaaagcagat tgtttaaaat gttaccttcg
    46261 taacttcttc atgagggaat agtgtgatga gaaaacacag caaaaagtta caaatgaagg
    46321 atgatgatgt atactagtag agaagccagg gaaatttctg gttatatcac aagactcagg
    46381 accttaagac taactgtact ataaaataaa tgttccatta aataaggaga catttaggga
    46441 aaaaagagaa aagctaaaat tatttttaaa aattttcaaa aattaaaata atcctcaatt
    46501 gtttatgtat gtgtgtgtgt gtgtgtgtgt gtatatatat atatatatat atatatatat
    46561 tttttttttt tttttttttt tttttttttg aggcagggtc tcactctgtc atccaggctg
    46621 gagtgcagtg gtgcaatctt ggctcattgc aacctccgcc tcccaggttc aagtgattct
    46681 cctgcctcag cctcccgagt acctgggatt acaggcaccc gccgccacac atggctaatt
    46741 ttttattttt agtagagacg gggtttcacc atgttggcca ggctagtctt gaactcctgg
    46801 cctcaagtga ttcactcacc tcggctcccc aaagtgctgg gattacaggc gtgagccacc
    46861 gcgtaatcca actgttgata tttgccttat aaagatgcac agaggaaaat aaaaaattat
    46921 ttctgctata agaaacctac caattgttta tttctgtaag gcaactaatc acacatgttc
    46981 cttgtctttt catgtaaaag actatttctc ctcaaagctg aagaagactt cagtgtgtgg
    47041 gcccttgttc ttactttaat aaacggtatt atttcgcctt taatctaact tgggaaacta
    47101 caagtacaaa actttatact atatcctcct ctcctgtatc ttctaaaatt ttggagtaat
    47161 tttgtaatta aatacacttc acgtaattct acttctggat ttttaccctc caagtatcgg
    47221 tcactgtccc aactcctcca tcttccagat ggttagagca caattctaag accaatttat
    47281 tattaaatat attccattag gttacttcct aaaacattcc aatatttaat agtctttgag
    47341 atagcctagc aaacctaact gcattacctc tttatctttt agtcccagaa gcaatacttc
    47401 ttccataaga gtaaggcgga tatccttaga gtctccagaa tcttcattgt ctggactttt
    47461 ctcccaatta ctgtcttcct cactttccat cttcttttca gagttcttgc ttatttcagt
    47521 gcgacgggcc cggtgagtta aagtggtcat tctcacctgt ttctggaggg agtggtgaaa
    47581 aaaaaaatcc atatgtatgc ttatgtaaat aaatcatata ttcatacatg catactaaac
    47641 atccatacaa ataggaaata tttacagatt taaatataga aagaaaggaa ggagtaatga
    47701 ccagaaaaaa atgggtgggt aatacagtat cttttccaaa gtacaatagc ttggacttca
    47761 cagatggcac tgacaaaaga tcaaatgagt aattctaaag tccagtagaa atgctcatct
    47821 ttcaagcctg ttgacttctt tttttctttt tcttcttctt ttcttgagag ggagtcttgc
    47881 tctgttgccc aggctgggag tgcagtggtg cgatgttggc tcactgcaac ctccgcttcc
    47941 caggttcaag tgattcttgt gcctcagcct cctgagtagt tgggattaca ggtgtgcgcc
    48001 accatgtctg gctaattttt gtatttttag tagagatggg gtttcaccat gttggccagg
    48061 ctggtcttga actcctgacc tcaagtgatc tgcctgcctg ggcctcccaa agtgctagga
    48121 ttacaggcat gagccaccag gcccggccaa acctgctgat tttttctgca taatctataa
    48181 agaaggagtg aatatccacc attgtcactc ctctttcaca aaatggatcc caagttaagt
    48241 aagatggaac aacaaaaaat tgtgagcatc cagccccaaa atgcatataa taatttgtat
    48301 ctaatttcta taaacgttcc atggtactac attataaaaa aacacaaaaa atagaaagta
    48361 aaaaagaata cgaaaaacta aatagaaata catgttctat atttccttcc cataccctaa
    48421 tgactcattt catgtactca ctttggagat cacaagtcta gacgataaca ttcagaaaat
    48481 catgtgtagt aattccacag tcagtaaact tagcagctct agtcctagct agccatatct
    48541 tgcaaataca caaggagctg caaacaaaaa ttactggcaa gagagtgttg aaagattaac
    48601 aaattttctc aatttaaaaa aaaaagtcaa agaacatgtc ttttattcta tgtactacgg
    48661 tcaaaagaaa ctgcactaaa tcatttgtga taaagtttat agttagcata ctttatttta
    48721 tataaatcta gatatcagtt tttaagataa tatataacaa atattttaaa tcagcaatta
    48781 tataaaggag tatctttttc aagatcaaaa taatggtatt taagagttgc tatgactgga
    48841 ctcgtttcta ctcctctagc aatatttccc attgctccta ataagtagtg ttgccagact
    48901 ttttctcaaa aatgtctcat gctttctttc tgcttctgct cctctgtact tgctggggtc
    48961 ttcacccttt tctcttcctc tttacattct acccattctt taaaggcctt gtgaaggcct
    49021 ttaagtgacc tcctcttttc tattacctat gtgtctcatg gccactccct cagctgtttc
    49081 acagagcttt atttatactc acttccttta aagtatttat tatagtcttt ctcttggtat
    49141 agctgtctct caactgtaaa cttcttaaga acagggatat tgtcttcttg ggtttatata
    49201 ttctccatag caactagcac tgtctttaac attgttattg ctcgataaat ttatattgat
    49261 taaaatgaat actcagaagt gcatgtgaac atttaaaata gaaatacaaa tgaagtgagc
    49321 tacaaaagtt tttttgtaca agggtatatt tcaaatataa tttgttatgg gaggtgggaa
    49381 gaaaagtgaa ggaggaagag aaccatgaaa taaattctta agcgcactaa gcctatctgg
    49441 cactgtgaat agggactatt tcctaaacca atccaagaaa ctttaaacct tcgcggggca
    49501 gggggggagg gggagagggg gcggtttctt tgagaaatta gtttcatctg cacatttact
    49561 agttaaaaat cctacttgaa agaaaacaaa tggcccaaat gaatacaaac ctttccctcc
    49621 cataaaataa taattttaaa aaatagaggt tttttttggt ggggggaagg gacgttagga
    49681 aatgtaattt tctcgttgct tacctgatgt tactcctggg ttatggctgg acgcaggctg
    49741 tagcttgtgc tgtattttag aaggacacct gttctcccca ccccaccccc gtcctagtag
    49801 gtgaaaaact tagccacgca gagagcacaa acttcctgat ccttcgatgc agagaaggaa
    49861 taacacatca gcttcctcag gatcactagg caacctacct tggtgtttct tcattgcgcc
    49921 aatcagaaat caccaaggtt gagtcccgcc aatcggagct cacaggggga ggctctcccc
    49981 cagctattct gttgcgatcc agacaaacac caccaaaagg aggcaggttt gcgttcaacc
    50041 tgacagctgg tcaatccaat gaaggaacct tggtggaaga gaggaaagag ggctgcgctc
    50101 cccaccccca ttaaaccagc caatcaaagc gctgagtgag ctaacatggg ctgccaatct
    50161 cgaaaggttt ctgtgggtta atttcagcgt agcaggtgtg ttgctgtagt agcgtatact
    50221 gggacaatta agtgagatgt gttgggatcc aagcaataca gctaaaaata aataaatatt
    50281 tttacattgg ttgtttctga cagtcagttt atccttgact tgactgttcc agaaaaacat
    50341 gatctgggtc agtttgtgta cgtttatatg gtttacctta aacaggatga atacatttct
    50401 accatcgcat atgtaccagc attccctata gcgcttatta ctgacaaacc catggatgtg
    50461 tatgtttttg gaatatggct gctgaaacta caagcggcaa acagaccaag ttttaagtct
    50521 tcctttttct aaagcaaaga aagcaggcaa ggctggaact agctcccagg tttctcatta
    50581 aaagcacagc atttttcata ctaccactca aaaattctag aaagtgtgct caactcttct
    50641 tgcaaagtaa tagctaccga aatcatctaa aaattagtat tccctatccc cccatcctcc
    50701 taccagcctt tcaaatgaat gaagccatat ttcctgctac ttaaataaaa attagaagcg
    50761 ataaaagggc ctacgtactg aacaatattt tatttatgct gcaaaaaatg ccatacttta
    50821 aaaatcagtc tttttgtcct gtaaaaaaaa gcatagtaaa ggtaaagcac caattcttaa
    50881 attgtacatt atatattaaa atgttaatac attatgtcaa aatattgaag aacattgttt
    50941 taataacagc acaatgacaa aagagccagt taaatggtat aattttaaca taagtaaaaa
    51001 gtgaatccat accaaatttt aataccaaag taaacattac tgtttagaaa aatggcatta
    51061 gagggcctta acagttagta tatatttaaa ggaaatatta agtaggtaat gaacaaaatc
    51121 aattttgaaa ttttacttat taccgaatca attatgacat ttgtactttt gcttttattc
    51181 aaaagttcta ttggatttat ctcaagatta aggaccacaa tatgacagtc agccaaaaac
    51241 ttagttttag tgtacaaact gctttaaact acatatacat cttcagagtt agggaaatat
    51301 aatatagggt ccttcagttt aaatggtgag aagaactctg caagcctgca gaacaaaaat
    51361 aatttttata tatgttcctt tggttcacta aactttctcc tttttggcac tgactcttga
    51421 ctagaagaat caaagtgatg gaggacctgt caaaagaaaa caactactaa gaatagatac
    51481 tttcctgttt aaatgtcttt aattgtagat tttttcatgt atctgagatt tctcttcttt
    51541 ttaaaaaaat ttatgcctag catacctagc ataactatat ctgggtttct taaaagtaga
    51601 gtaaaattta ttattaacat cttccaatgg ctgctgaaag tatgtacttg caaaatgtat
    51661 aatcttcagt agggctgctc tagaagtgag aagatagatt taacattatg tcctcaaaat
    51721 agccaaatat ttcagagcaa taagaaggga aaaaagattt taaaatgaac tacaaataga
    51781 aaaacattct accttggaat catgtacgat ttaaatcatg tcttgctttt gtactcttga
    51841 gacgttaaca tttgagtttt aaaaatagtc ttaggttttc agttgtttaa ttctaaccta
    51901 ctaataatac taaaatatct agttggaaaa atatgaataa ttatctcaga aatggcaaat
    51961 acattatctg atttggtaaa ataaggttct aaatatgaaa caactatagg gtttaagtca
    52021 ggaaaatatt tttttctcca gaggtactag gaggagaccc tatattttta gggaacacat
    52081 gtatttagta aattatcctc cttctacttc agttttgatc tctagaaagc tcagatcaaa
    52141 tttgtggccc aatttcagtt ctatataaac aaaatggcct tagtattctt gtactgtcac
    52201 tttctcattc tatattctct tatatttttc ctggcactga tttttttttt ttttaaagac
    52261 ggagtcttgg tctgtcgcca ggctgtagta cagtggcgcg atcttggctc actgcaacct
    52321 ccacctccca ggttcaagcg attctcctgc ctcaccctcc cgagtagctg ggactacagg
    52381 cgtgcgccac cacgcccagc tatttttttt tttttttgta tttttggtag agacagggtt
    52441 tcaccatgtt ggccaggagg gtctcgatct cttgacctcg tgatccacct gcctcagtct
    52501 cccaaagtgc tgggattaca ggcatgagcc actgcaccca gctgattttt aatttatata
    52561 ttataaattc ctccaattac tttaaatcat tcatggaatg agacaggaga tgtttacatg
    52621 gaaaaaacta ataaaccaac atagggcatc ttatttcaaa atcacctcga tttttagaac
    52681 aaaatatatt taaacaaaat ttgaatatca gaactctcag gtacatcgaa agatatcaaa
    52741 ttcatcattt taaatttaat ctgtagtaat gatattctga tccttcccat tgtattcaaa
    52801 ctatacatta tcataagact tactattctc ccagattcat gttgacttct cttccgattt
    52861 tctttggaag attttactgg ttgatttcca tttgcctcca caaagataga ataattacta
    52921 attggttatt tctcaaagaa cagccaacac ttgatttttc aatcttatgc aacgatatgc
    52981 aatatttatt tatttattta ctttttaggg acagggtctc cctctgtcac ccaggctgga
    53041 atgcagtggc accatcatgg ctcactgcag cctgaaactt ctggcctcaa gggatcctcc
    53101 tgcctcagcc tgccaaagtg ctgggattac aggaatgagc tacagtgccc agctggatat
    53161 acaatttttt taaaaaacag ctttattgag atataattta gatactatat gattcaccca
    53221 tttataagtg tacaattcaa tgattttcag taccttcaca gagttgtaca accatcacta
    53281 caatcaattt tacatcactt tcatcacccc acaaacaaac cccatatacc ttagctatca
    53341 ctctcttatt ctgccatcct cccaccagcc ttaagactgt ctcaataact ttcccagttc
    53401 tgactttcaa atgaaaggaa taatacaata tgtggacttt tgtgagtaac ttcctttaat
    53461 tagcatgttt tcaaggttca tctaagttgt agcatgtatt ggcacttcat tttttttatg
    53521 gctgatattc aattgtatga atataccaca ttttttaatc cattcatctg ttgatgtata
    53581 tttaggtttt tcccatcttt cggctattgt gaataatgct gtatatatag ttattaaaat
    53641 gaatattgga aaatgactaa tgattaggtg tcatacaact attctctaaa aggatatact
    53701 tcttttctga tgcttgaaat gagctttcca atgataactt ccttagtata tataataatg
    53761 aatattttca tttcttgccc ttttctaacc acaattcagt aataattctc aaaatcagga
    53821 gtacacttgc tcagagttca tatccaaaaa catgtgcaat ttcagaaata aatttatgtc
    53881 cctaaatcat cttaatgtta cattacaagg agatcttgct taataatttt taaaatgctg
    53941 gtctctatcc ctatatccta gtaaaggtta ttcatcctac atataaactg gctaggatga
    54001 acattttgct tctagaagta gtaaatcttg acattccttt atctcttgct ttcatgagat
    54061 actcatttga atgctgcctt ctgccagttc taacttccca ttgagaatag gggtggggga
    54121 agacacttaa aaatggcact ataatgagta gtgttccaat aaatgtttaa taaccagctc
    54181 taggagagaa acaaaccttg aggcatttgc tgatttctgt gctgtgaata ctaccaacat
    54241 gtccaattta agctaccaac atgatatctc tgagtgggga gttgggaata aatgttctag
    54301 cacaccacta acagagtgag ctatttattt cctaaacttt agatatttta tatattttac
    54361 agaatgcttt ttcttcaccc caatgaacct ttgagagtaa cttaaaggtc taaaatcaaa
    54421 tatgaatttt gatttagtat aattcatacc aaaagctaaa ataaagcagt gatttttttc
    54481 ccttaagaat atgagattca ttttatacaa gatatataca aaagattatt ttgtaatgtc
    54541 ttattaacaa gatactcaat ttataagact gagcatggtg ctcacttcag cagcacatat
    54601 gctaaaatta gaacgatata gagaagatta gcatgtccct tgcacaagga tgacatgcaa
    54661 attcatgaag cattccatat ttttcagagt gaacagagaa cctacagaat gggagaaaat
    54721 ttctgcaatc tatccatctg acaaaggtct aatatccaga atctataaga aacttaaaca
    54781 aatttacaag aaaaaataaa caaccccatt aaaaagtggg taaaggacat gaacagacac
    54841 ttctcaaaag aagacattca tgtggccaac aaacatatga aaaacactca acatcactga
    54901 tcattagaga aatgcaaatc aaaaccacaa tgagatacca tctcatgcca gtcagaatgg
    54961 tgattattaa aagttaagaa acaacagatg ctgacgaggc tatggagaaa taggaacact
    55021 tttacactgt tggttggaat gtaaattagt tcaaccattg tggaagacag tgtggtgatt
    55081 cttcaaagat ctagaaccag aaataccatt tgtcccagta atcccattac taggtatata
    55141 cccagaggaa tataaatcat tctattacaa agatacatgc acctgtatgt tcactgcacc
    55201 actattcaca atagcaaaga cagaatcaac ccacatgccc atcagtgata gactggataa
    55261 agaaaataag cagccataaa aagaaatgaa atcatgtcct ttgcagggac atggatggag
    55321 ctggaagcca ttatcctcag caaactaaca caggaacaga aaaccaaaca ccacatgttc
    55381 tcacttataa gtggtagata aacaatgaga acacatggac acaaagaggg gaacaataca
    55441 cactggggcc tgtcaggggg ttggagggag ggagagcatt aggataaata gctaatgcat
    55501 gtggggctta atacctaggt aatagattga taggtgcagc aaatcactat ggcacacgtt
    55561 tacctatgta acaaacctgc acgtcctgca catgtatctt ggaacttaaa ttaaatgaaa
    55621 tttaaaaaga aaaaaagatc aagcatggga acattctggc attctaggaa atttctctgg
    55681 gagtctggat gatagtatca cttcttccca aaacctttcc cggatccctc tctttaatat
    55741 accttcagta aaattaacca ttcctttctt tgtgcctatg tagcctattc agacttctat
    55801 aatggctctg tatcattgct tgtttatttc cagttactaa acttcagcta agaccatgaa
    55861 gtatgtgtta ttcaggttat atcctaagca cctagtaaag tacctgacac ataataggta
    55921 cttagtaaat actggtttaa taaacgaaat attaaaataa actcacatat gggggagaat
    55981 tacaaggtta caatggaaaa caaacctaaa gagatattta ccattttatt ctgaaagact
    56041 tttccacttc ttgttttgct ttcagacata tcaagttcag atgttttatt gactaactgc
    56101 tcaacctaaa aaagaaaata attacaaaaa atataactta gtataataaa accttttcat
    56161 aacacaataa atggggtcca aaaatttaac cacataaaat ttagggtctt gtcaagttaa
    56221 aaccaataat tttgagttgg ttcatactgt tcaaaaagca caaaagtaaa gatagtgaaa
    56281 ctgacacttt acataattgg ccaaccactg aactgagaaa gtaccaaatg atcaccaggc
    56341 accacccttc cagaagacaa gaaatttagt ctacttattt gagaaatggg ctgggcagag
    56401 gaagacagac aatcaatcat caatcatatt ttttccaagt tcatattacc taaatctctt
    56461 atttcaggat tttactgtat ttgaaaagca gaataaaaca tgtatgctcc aaggcaacac
    56521 aaatgtatta tatattccac ttataaatac aaagcagggt ggaataaata actgcataag
    56581 attctatcta gctctgaaat tctcagattc taccaatagt atttaggttt attactaaac
    56641 aaatccacca gtaatatact caaaattcat caacatataa tttccagatt tagttaataa
    56701 caagcattct aaatattgaa ttttattcta aaagacctgg aataagaaaa tctgaaaaat
    56761 aaaaaattag cgcataagca actgtcctgt gaacaaaaca aaagtgagat gaaactatca
    56821 agtttatacc tgagaatgag aaatagaaag atctggactt tctttagacc tcataatttc
    56881 atcttcctca caaactaaac ttggttctgt aaaaaaaaaa aaaaaaagtt acccgggtat
    56941 aaaattgagt tggaagtgag ttaatgtaca ttttcattaa cttcttacct tcaagttcag
    57001 aagatgcagg gtttttttcc tgttcttcca ttttagtttc aatgtccaaa tcatcctaca
    57061 taattatgtg aagaaaaaaa ttgacacatt tcaaaaagga actaaaacaa ctgagcagaa
    57121 agcatttatg aacaaatttg tttcatgtga aggctaaata caatataaaa ctactggccg
    57181 ggcatggtgg ctcatgcttg tattactagc actttggaag gccatggagg gaggattgct
    57241 tgagactagt agttccagat tagcctgggc aatatgatga gacctcgtct ctatgaaaaa
    57301 attttaaaaa ttagccaggt gtggtggcat gtttctgtca cctgaggctg aggtgagagg
    57361 atcgcttgag cccagaagtt tgaggctgca gtagccgtga tcatcccact gcgctccagc
    57421 ctgggagaca ggacgagacc ctgactcaaa cacaaacaaa caaaaaaacc caagccaggc
    57481 gcaatggctt acgcctgtgt tcccagcact ttgggaggct gaggtgggca gatcacttgg
    57541 ggtcaggagt tggagaccag cctggctaac atggtgaaat ctcgtctcta ctaaaaatac
    57601 aaaaattagc ccggcgtggt ggcacatgct tgtaatccca gctactcagg aggctgaggc
    57661 aggagaatac cttgaaccca ggaggctaga ggttgcagtg aaccaagatc tgccactgca
    57721 cttcagcctc ggcaaaagag caagactctg tctcaaaaaa caaaacaaaa caaaacaaaa
    57781 caaacaaacc agaaaacccc aaaaacctac aactacctca tggttgctag aaataaaagt
    57841 gttcctagaa tttctatcaa aaggcaattt attaatccaa ataaagtata aaatactatt
    57901 tttctttatt ttgcagggtt gctttaagga caaatgataa aaatatacaa ctttcggcca
    57961 ggcgtggtgg ctcatgcctg taatcccagc actttgggag gcacaggtgg gcagatcacc
    58021 tgaggtcaag agtccgagac tagcctgacc aacatggaga aaccctgtct ctactaaaaa
    58081 tacaaaatta gccaggcatg gtggcacatg cctgtaatcc cagctacttg ggaagctgag
    58141 gcaggagaat cacttgaacc tgggaggagg aggttgctgt gagctgagat cgtgccattg
    58201 cactccagcc tgggcaacaa gagtgaaact ctgtctcaaa aaaaaaaaaa agtaataaac
    58261 aaaaactctt cagagagtta ttgaaggata agctcagtca tgttatggta gaaattgcaa
    58321 caggtttaat gtctacatta caattattgt tccctttgtg atacatgtga aagagattta
    58381 aatattttaa agaatgctac aaacacctca gccatatttg ttataaagca agactgggat
    58441 ttaggtctta gaaaaaagtt tttttcttta actgaattgg ttgaaaattg attgaatttt
    58501 ctatgtgtaa agatgtggtt agacagccta atcagaatag tgaatctctg tacataagta
    58561 ctgtagtcat aatttagaat aaattgagac acagaataca agagaggtga gattaaagga
    58621 cataatgtta atgcttgaaa gattaaagca ataatgactt ccatatttgc tggaagtgag
    58681 gcaacaaaaa aactctttta tacttctatt catccattca ctcgactaat acttaggacg
    58741 cactaactag aaatccaagt gctaggcaca ggatgattaa aaagttgaat gagacacttt
    58801 tcctgacttc caggaacttc caggctgctt cagaacagtg actttccagg ccctagagaa
    58861 attcaagcac aggctacctg aacacttagt agggatgttg tagaaaagat tccatcatta
    58921 gccgggagtt agactagtgc aatgcttttc aaattatgcc ctagagtcct aggggtttcc
    58981 tcagagtcca ctaagaaggt aagtgacata gggagtggca gagcctccag gaacattccc
    59041 tacttcaacc agaatagttg tgcttttaag ttctagtaca atgttgtttg ggctaagtaa
    59101 aaagttttaa aaccataggc ctagacttta aggtcccttt caaccttgag tttatgatta
    59161 taatacagaa tttacacaaa tgttaatggg agctatgaaa aatcttcaga attgagtcaa
    59221 acatttgtta ttattgttct ttataactct tgaaatacac ttacacttgt ataatgctcc
    59281 tgttcatctt ctacatcttt gtccctcagg attttttgaa atggtgtttt tatttgtttt
    59341 ggtgatagta tagttgagtc aatattttcc attcgttctc tctcagtggt cacttttact
    59401 ttgaagatgt gaaaaggtgt tgagacttct cccacattta aatacatagg ttccccttca
    59461 aatataactc cttcacaatc accatcctta aaaccgggag gctggtaatc tgggggtgta
    59521 actgcaaaaa agtaagtgta aactgtgctg ttcatgaaat taaagaaaat ataattatta
    59581 ttataaaaac acacgcagat tgcaggtatt tcaaatagta tataagggca taaggtaaaa
    59641 catacgaatt actctcctgc tatgtccaag tccccatttt attccctaaa ggtaagcttt
    59701 gttttcttca cttgactgtc tgtgaccttg agtaaattat tcatcatccc tgatttgatt
    59761 ttctcatctg tcaaatggtg aaaataatag tacctacttc atagggtcat tgtgggaata
    59821 aaataaagtg tcgtaaaatg ttgagaagag aactggcaca tagtaaatac tcaataaatg
    59881 ttagttgtta tttttgttgc tttatgttct tctagaagac atgcatataa taaaagatat
    59941 ttcatattta cataggtttt aggaaataca tataaaattt cttaaaaaac acaaatgagc
    60001 tcataccatt ccatacctag ctggttttaa cttaataata tatcttggat aacgtaattt
    60061 atcagcacct tctttttaac agcttgtgta gtattctatt acgcagatgg accataattt
    60121 atttactcag tccccaatga aggacactga agttgtttcc agtttgttcc tattttagcc
    60181 aatgctataa tcaaaatcct tatataactt tgcgtatttg cacaattata tcttcaatat
    60241 atttttctca ctatagaatt tctggattaa agatagtgtg catttaaaat tttgatactg
    60301 ccaaattgtc ctctaaaaat gttgcacctg tttacagtcc ctcgcgtgta caaaagtgcc
    60361 tgctcagccg ggtgctgtgg ctcacgcctg ttattccagt actttgggag gccaaagcgg
    60421 gtgaatcact tgagatcagg agttcaagac tagcctggcc aacacggtga aaccccatct
    60481 gtactaaaaa tacaaaaatt agccaggcat ggtggcgagt gcctgtagtc ccagttactc
    60541 aggaggctga gacaggagaa ttgcttgaac ccaggaggtg gaggctgcag tgagctgaga
    60601 ttatgccact gcactccagc ctgggagaca gagcaagact ctgtctcaaa caaaaaaaag
    60661 ttcctgtttc cccacatcct tatcaataat ggattaacac gttttaccaa tctgataagt
    60721 gaaaactgtt aactcattgt taaatttgag tgaggttata cttcacatgg atatgttcct
    60781 atcactagag aagcatctat tttaaagctt aattttgtaa atctttcttt cttgaatagt
    60841 gatattaagt gtaaatttta taatttcaaa aaatttaggt gctaaaaata tttggaatgg
    60901 aaaccaattg tttcaataat gataaaacat atgtaccaac tgaaacagga tgtattgcaa
    60961 atagtacctt catcatagta aaaaagtttc atggtcaaac aaacatcatt aggtaaaggc
    61021 cccagatttt gcattaggat ataaatcttg cgaatgagga gaatgcttgc tttcttggtg
    61081 tcagtagaca acatgctaga ttcgttgctt tggtttttac tagaagagaa tcacatatta
    61141 atttatttta taaaacctgt ggcttctaat atcataactt tacattttaa ctatttgaga
    61201 ttcttatttc tattatagca ttaagctgtt aacttgaaag ttaccatgtg ttttaatgtc
    61261 actattataa aattttaata ttaatatctg ctagaaataa taacttaatt ttacttaagt
    61321 gcattttata tgaagtgttt aaagaaagca cattatgttt taaaattata tttctatatt
    61381 tctaatcacc taagttctct ttccacaatt aaaattgctt attattagcc acagaatcct
    61441 ggcttgaagc tactgcaagc caccactcca ttttgcatag ccacatcttg atggttaatt
    61501 ggggatgtcc ccgcagtttc agaatgaatc ctgaggttta gctatatagc attaatatga
    61561 ataagtggta atctgtaata caaacagaaa actatagaac aataccttat gaagtccatg
    61621 agtggtccat tattggtgta tttgaatttg aattggtaac attctgaaat tgtcttaaat
    61681 agaaaatatc gcagttatgc ttctgatatt cagtactagc tgctaaatag ctatattctg
    61741 ttttgtatga ctcaacattt agtcaatact tttttctgcc atattaatcc ctattgatcc
    61801 attgaaactt ttgattcctc atagtcagtt ttcatggttt gggtttgggg cttcagtctt
    61861 ggagagacat ttatttccca actcattgta atgtatgtta ttgtagggaa aagagagatc
    61921 agactgttac tgtgtctatg tagaaaggga agacataaga gactccattt tgaaaaagac
    61981 ttgtacttta aataattgct ttgctgagat gttgttaatt tgtagctttg ccccagccac
    62041 tttgacccaa ccactttgat ccaatctgga gctcacagaa acatatgttg tatgaaatca
    62101 aggtttaagg gatctagggc tgtgcaggac gtgccttgtt aacaaaatgt ttacaagcag
    62161 tatacttagt aaaagtcatc gccattctct agtctcaata aaccaggggc acaatgcatt
    62221 gcggaaagcc gcagggacct ctgcccttga aagcggggta ttgtccaagg tttctcccca
    62281 tgtgatagtc tgaaatatgg ccttgtggga tgagaaagac ctgaccgtcc cccagcccga
    62341 cacccataaa aggtctgtgc tgaggtggat tagtaaaaga ggaaagcctc ttgcagttga
    62401 gatagaggaa ggccactgtc tcctgcctgc ccctgggaac tgaatgtctc ggtataaaac
    62461 ccgattgtac atttgttcaa ttctgagata ggacaaaaac cgccctatgg tgggaggcga
    62521 gacatgtttg cagcaatgct gccttgttat tctttactcc actgagatgt ttgggtggtg
    62581 agaaacaaat ctggcttacg tgtacgtcca gtcatagtac cttcccttga acttaattat
    62641 gacatagatt ctattgctca catgtttgtt gctgaccttc tccttattat caccctgccc
    62701 tcctactaca ttccttttta ctaaaataat gaagataata atcaataaaa actgagggaa
    62761 ctcagagacc ggcgccggtg caggtccttg gtatgctgag cgccggtccc ctgggcctac
    62821 tattgtttct ctatactttg tctctgtgtc ttatttcttt tctcagtctc tcgtcccacc
    62881 cgactagaaa tacccacagg tgtggagggg caggccaccc cttcagttat ttataccaca
    62941 tggaaggaac tccttagcag gcagataagg gaggcaacat acctcagcag agtggctaac
    63001 acagggcata aatctttgct atactgtaac cacaggcaag ccacttaagc cttagtttaa
    63061 gtattagttt ccccagctaa aaaagagatt aaagacagta catatctcat tgatagttgt
    63121 gaataaatat tcacaaatgg gataatttat acagggtact ttataaagta cctgacatat
    63181 agtaaacatt caataaatat tattattgta tcaacaaggt tgctaattta ctgatagaaa
    63241 aaatgcacaa ctatataaca tatgaaaata aatgaacatt ttctgaacaa tcaattagtt
    63301 ttctgaacac aatctattaa ttcacagcct ttacccaaat gatctgcttt gtcagcatgg
    63361 atacatatta agcattaatc gcaggtgtga gtatcaaaga aaagacacat tctcagccct
    63421 cctgaagcct gaatacttct attgctttct ttatatttca ttgcttacat agtgatgttt
    63481 ttaaacgtct atttttaaca ccagtttaaa tctacctctc cttgactgaa tgaaacgtta
    63541 tattgtaaca gctatagcgt aatgggtaaa cacccaaatt gtgaactggc cttcctaggt
    63601 tcaaaaacct ggttctgcca cttcagccac tatgacctta ggcaagaaac cccttgtact
    63661 tagatttcct catctgtaaa gcggagatgc taatagtggt tacctcatga aggttgggat
    63721 gaataaatga gataatgcac ataaagcccc aaagaacaga gcctaatata aagtaagcac
    63781 tcagtagatc tttccccctt gtttcttatc ttctggtttt ccctttcacg ttctcccttc
    63841 tctcctttaa tggtaagata aataggtagc tatgaatgga ttgggatgaa tattcgaagt
    63901 ttaaggatga ctttcttact ttcctaaaac tcaagccctt tcataattct aaaatgtagt
    63961 ataaaaaata ctataaagaa ttagacgtca ggagatttaa gttaagtttc taacagtttg
    64021 taagctttgt ggcctaagat cagtcacata atctctgtca tgctaatctg tttaaaggta
    64081 acaatccaag ccctgcctta cctcagaaga gttttatgaa gctcaaatgt gaaaatatat
    64141 atgaaagtat attgattaaa gcataattaa aagcataaga tattcttata ttatgtttcg
    64201 tgttattttt agatattaga ggcattatgt ttctttagta actcaagcac acttacctga
    64261 ggatcttctg ggtttgtgta tacctattta aaaacaattt acaattgagt tatctaaggc
    64321 taagtattca agaactaaat tagatgttta gactaaactt catgtgaact cagcaaaaca
    64381 aacaaataaa aatgaccacc caaaatatta ctttttcatt ccattgaata gatcagttat
    64441 ttgaataatg tgacataatc aattgatagt tttagtgtca tttaattaag caaggccaaa
    64501 tttggcagta ttaaaaaaag aacacacaat ttaaaaatac ttacagctag aacaaccatc
    64561 cttagctgaa atacaacaaa gaacaaaaat cttacttgta gtctataaaa taatttttca
    64621 ttacgaatca taagaattct ttgtcctatt tcaacatgta ttcctaataa caaataatca
    64681 ttgcagatct tgactgcata agatcaatgc ctataatgca gtaccttaga tatgcttccc
    64741 tatcccctga aatatctatt gctactaaaa ttactgatga ttgaaattca tgtcagcact
    64801 ttatttcatt gacttaagca aaaaggacat tactgccagt tacaaggaaa aaatatatat
    64861 attgtgaaat aacacaatat atgtgttata ctgaaaacaa tgagaactga ttttgaaaaa
    64921 agaaaaagca gagttcaggg ccctatgaat tcaacagagg gaggaagttt tggagaccat
    64981 aggaaaagaa aatcatagaa aataatttaa tcaagtgatc tgccaaaaat ctaagagtca
    65041 tccttgattc tgatcttttc cttgtattac acaaccagtc catcaacaaa tccaatcact
    65101 tctactgcca aatctatcct gaatctcctc tctctgctat cctggtctaa gcaaccatta
    65161 tttctcacca gtactacagt aatggcctag gtgctattct ccttactccc aatcttaatc
    65221 tccttctaca gcatattctc taagttgtag gcaaagtgat ttaaaacaat tattttttaa
    65281 gacagagtat cactctgtca ctcaggctgg agtacagtgg catgatctca gcttactgca
    65341 atctccacct cccaggttca agcgattctc ctgcctcagc ctcctgagta actgggatta
    65401 caggcgcgcc aacacgccta gctcattttt gtatttttag tagagacaag gtttcaccat
    65461 gttggccagg ctggtctcaa actcctaacc ttaggtgatc cacgtgtctt ggcctcccaa
    65521 ggtgctagga ttacaggcat gagccaccac gcctggccta aaacaatttt taaatgtaaa
    65581 tcaaatatta catttctatg tataagcttt gaatggcttt tctttgcacc taaaataaaa
    65641 gccactgagt gctctggccc tgcttatctg cccaggctca tctcatgtta ttctccccca
    65701 ctggacaatt gccctcagtc atattggcct tctttttgtt cctggaactc tctaaacttc
    65761 caattctttg catttgctgc tctgccagga gtattctctc ctaaaatcaa tgcctggctg
    65821 gtttcttctc attcaggtct cttcaatgtc acttcctcag agaagctcct atgaccatcc
    65881 catctaaagt tacttcacct ttccatttac cacatacttt tgctaaaaat tttaatggca
    65941 atttaacatt attgaaaagt atatgatttg ttttattaac tacttgtttt cctactttac
    66001 aagggacctt ttttgttttg ttgatcactg ttagcttacc cagttcctac ctagaagaga
    66061 tgcctaacac ataacgggcg ctcaatattt actacgcctt aatggggaag attaggaacc
    66121 atgaataaaa attaatcttc ttgaggctga ggcaggagaa tggcgtgaac ctgggaggcg
    66181 gagcttgcag taagccgaga tggtgccact gcattccagc ctgagcgaca gagcaagact
    66241 ccatctcaaa aaaaaaaaaa tttcatcttc ttcacaatta gcctagtgct ggacttcatt
    66301 atgcaagttg gtactaaatt atactaagta tgcaaaaagg attacaatac atcttaaaac
    66361 cttcttaatt ttgctaattc agaatttgtg acaatttaac ctagattatg tggaaattta
    66421 tttttgaacc ctaataagga aataatttat taaattatta gtataaacaa gcttgtagaa
    66481 ggcatgtgaa actaatgaat aaactatttg aaaagactct ttaattcatt aacttttttg
    66541 taagttaaca cgtgttatta atacaaataa gtgttaaaaa aactagcttg gaaaaatttg
    66601 tttagtgtta attccattat ctttaaatga gttgatgatt agtctttaaa atattcaaat
    66661 gtttaggctt ttcagtaatt cagactgacc tacactcaag ttagttcaaa tgaatgtttt
    66721 actacatcat aactccagtt atatttgtag atcataaact gtagaaaaaa aaggataaag
    66781 agattgcaag acttacatat tttttctgta aagcatcata acatcctagc atcctaaaaa
    66841 aaaaatcaag gattcatttt tagactgaat gcatttccag aaaatagtta ttttcataaa
    66901 tttccatatt atctcattac taaataatac aaaaatgtct tattaaagct cccttcttat
    66961 taactagctt aaaacaagac aaaacataaa aaacaatgtg gcagaaatgt aaaatgatga
    67021 tagagatgga agggaaaata caaaatatct gacagctatt tgtggggaac aataataaaa
    67081 taaataaagc ctgcttgcat ggggcacaaa ataaagaaac atttattttt aaaatttatt
    67141 tttccaaata cccagacttc aaggagagga aaacttcggt ggaaaattta atgacctaca
    67201 aaaatgctat ctagcattaa agggtgattc actaaaattt ggttaaacac caaaggtatc
    67261 caaggtacaa aaaaaagtaa aatgagaaaa aaataatttt ctctctttag gtattctttt
    67321 acttgccatt tcactaactg tgtagatcct gggcaatttt tatcttctct cagtattttg
    67381 acacaaagat ctaaacacaa aaatgatgaa atatagagtt acttaagaaa agtaaacaac
    67441 tagaaatttt aaatgaagtt tattttcttc tgctgtgtat tacaggtaaa ttattattat
    67501 aattattatt attttgagac agggtctcac tctatcaccc aggatggagt gcggtgacat
    67561 gatctcggct cactggaact tctgccttcc gagttcaagc cattctccca cctcagcctc
    67621 cccagtagct gggacttcag gtgcgccacc atgaccggct aatttttgta ttttttggta
    67681 gagacagggt ttcctcatgt tgggcagact agtctcgaac tcctgacctc aggtgacccg
    67741 cctgcctcaa cctcccaaag tgctgggatt acaggcgtga gccaccttgc ccagcacagg
    67801 taaattatta atagcaataa ttaatattat taatgtaatt aaaggaagaa aggttatatg
    67861 aaagtagtat tttaatattt attataagat gccatcataa taattaggat gcaagctgat
    67921 gggtacaaaa atatttcata aaaaacttat agaatccatg aaattagata tgttgctttc
    67981 atgccattgt tatacaagaa tcttagaatc ttatcccagt aaaatacttg taaaggccaa
    68041 gaacttaacg ggctatcaaa aacaaagagg cctgcaaaga agcaacatgt aacacagtgg
    68101 aaaagactga gagtctctca tatcctcagg taccttaatt gaaaaacagg tatttgtaag
    68161 actgctacat gaattaacaa gatatcgtat ataacatagt gcttaacata ttgcccagca
    68221 cataactggt gtacaatatt atgcatgtat aatgtactga gaaataagtt attattgtag
    68281 tataaactgg ctgtattatt attattattt taatttaaaa aatagagatg gagtctcctt
    68341 atgttgacca ggctggtctt gaattcctgg cctcaagtga tcctcctgcc tcagcctccc
    68401 aaagtgctgg gattacagac gtgagcttcc acgcctggcc ctggctgtat tatttttact
    68461 tccttaaaat agcacaagaa catacactac tactgtatgt tttctattga tttctttcat
    68521 cattataagc ataatttcag gtaaaattat acttgagaaa ctgaggaaaa attttatgat
    68581 taagtgcttc taaaggtatt gaatatatat atatatgtat atattttaaa accatttagt
    68641 tatccaaaca aaaacaaaaa cttgtacatc agtgcggcat tattcataat agccaaaaag
    68701 agaacccaaa tgttcatcaa ctaataaagg gataaataaa atgtggtaca gccatgcaat
    68761 agaacattat tcagcaacaa aaagggatga agcactgatc catgctacaa tacggaggaa
    68821 ccttgaaaac atgctaagtg aaagccagtc acaaaggacc actttttttt tttttttttt
    68881 tttttttttt tttgagactg agtcttgctc tgttgccagg ctggagtgca gtggcacaat
    68941 ctcggctcac tgcaacctcc gcctcccggg ttcaagcgat tctcctgctc agcctcccaa
    69001 gtagctggga ctacaggcat gcaccaccat gcccggctaa tttttgtatt tttagtagag
    69061 acggggtttc accatgttgg ccaggatggt ctccatctct tgactttgtg atccgcccgc
    69121 ctcggcctcc caaagtgctg ggattacagg cgtgagccac tgcacccggc caggaccaca
    69181 ttttatatta tttcatttat atgaaatgtc cagagtaagt aaatctgtac agacagaaag
    69241 tagattggtg attgcctagg actgggaagg ttgaggggaa gtgcacagtg ccttaagggg
    69301 tatggagttt cttcttggga tgaagaaatg ttctaaaatt gactgtggtg atggttgtac
    69361 aaatctgtta atttattaaa aatcattgaa atgggccggg cgcggtggct cacgcctgta
    69421 atcacaccac tttgggaggc cgaggcgggc agatcacaag gtgaggaaat cgagaccatc
    69481 ctggctaaca cggtgaaacc ccgtctctac taaaaataca aaaaattagc cgggcgtggt
    69541 ggcgggcgcc tgtagtccca gctactcgtg aggctgaagc aggagaatgg cgtgaaccca
    69601 ggaggtggag cttgcagtga gcggagatgg caccactgca ctccagcccg ggcgagaccc
    69661 cgtctcaaaa aaaaaaaaaa aaaaaatcat tgaaatgtac actttaaaca agtgaactgt
    69721 caggtacgtg aattgtatta ataaagcttc tattaaaata aaatttctaa gtcaaaaatt
    69781 atagttttgg agcaagcaaa aataaaaata ccattttaaa agaaagcttt aaaataaata
    69841 ttgtattacc atctagatat cttgttccat aagcgcattc tgggaatatt cccctcaaat
    69901 acgtgataca ggatactgaa actgctagaa gcctcttcac taacaccaaa gactggtgtt
    69961 cagttgatat cttattggga aataccagtg cactctaaaa ttcaagggaa agtagaacac
    70021 tttatttaaa tttattaaaa ataaagtttc tcttgactta acatatatcc aattttgtat
    70081 aatcatagta ctcaccccaa gaattcctcc ctatattggt tagtaatccc ccaaaagtca
    70141 gagcggcatg taaaatgtga gggtatttct agggtaatag cctaaaggac atatgcaaaa
    70201 agtatctaaa aatcttttaa tttttttttt ttatttttaa aaaaattgag acaggatctc
    70261 actgtattgc ccaggctgga gtgcagtggc acaatcacag ctgcctacaa cctccattgg
    70321 atgcaagttg atgggcacaa agtatttcat taaaaaagta taggccgggt gcagtggctc
    70381 acgcctgtaa tcccagcact ttgggaggct gaggcgggtg gatcacaagg tcaggagttc
    70441 aagaccagcc tggccaaatg gtgaaaccct gtctctacta aaaatacaaa aattagctgg
    70501 gcgtggtggc aggcacctgt aaacccagct actcttgtaa tcccagcact tgggaggctg
    70561 aggcagggaa ttgcttgagc ccaggaggca gaggttgctg tgagccaaga tcgcaccact
    70621 gcactccagc ctgagtgaca aagcgagact ccatctcaaa aaaaaagtat ataatcatta
    70681 tataattctt cagattatat tcaaatatat caaatgcttg tattttgagg atgaataaaa
    70741 caaatgttaa tagtggctgg tattttaaaa acaccttttt gggacaccca tataaagtgt
    70801 ttttcagttt aatggttttt agtttattca cagaggtgtg tgtgcaacaa tcaccaccaa
    70861 tttcataaca ttttcttcat cccaaaaagg aaacccatac caattagcaa tcactccctg
    70921 tcccttatcc catcccacgt cctacccatt agcctatctg tccctataaa tttgcctatt
    70981 ctagacattt catatatcct cgcacaacct ctaaagtagc tgagactaca ggcgttgcac
    71041 caccatgcct ggctaatttt tttttttttt ttttttgtag agatgagatc tcactatgtt
    71101 acccaggctg acctcaaact cttgcgctca agcaatcctc tcacctcagc ctcagctcaa
    71161 aaagtaccta aaaatcttaa gaatcccctt gcagattaac tgtctgtaaa tgcataacac
    71221 atggatagat tacattgatt tttctctttg gtcactgagt aggtataaca gaggttaagt
    71281 gtaatgcaaa atcctataca aatatagcaa atgcatattt gtgtctcggt tacaaacatt
    71341 cattttcttt ctttttgaga cggagtctca ctgttaccca ggctggagtg cagtggtgtg
    71401 atctcgactc actgcaacct cccccttctg agttcaagcg attctcctgc ctcagccacc
    71461 ttcatagctg agactacagg cgtgagccac cgtgcccggc ctcattttct tatttattta
    71521 attttattct gcttatatat tgaacattca tgttctagag gtggactgct atgaacaata
    71581 atttcatcaa agcaacaaag cagtattcat taatacctaa tgtaaattaa atgtgaatga
    71641 gcaatttaat gcaatgcagc aaacatccaa tagaattctc aattaatgcc gggcgcggtg
    71701 gctcacgcct gtaatcccag cactttggga ggccgaggcg ggcggatcac gaggtcagga
    71761 gatcgagacc atcccggcta aaacggtgaa accccgtctc tactaaaaat acaaaaaatt
    71821 agccgggcgt agtggcgggc gcctgtagtc ccagctactt gggaggctga ggcaggagaa
    71881 tggcgtgaac ccgggaggcg gagcttgcag tgagccgaga tcccgccact gcactccagc
    71941 ctgggcgaca gagcgagact ctgtctcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
    72001 aaaaaagaat tctcaattaa ttcaaaatac ttcagcaaat atcttcattt cttaaaacat
    72061 gcagtttcct ggtatcataa aaccctcaaa tagttgagcc ctaacttcca acagagtatt
    72121 tgagacttca agaaacaata aaagcagcta ttattaagat ataccaaaat acaagaaatc
    72181 ataccatggg agtcctctgc aactgggcag tggccatctt cttctgataa agtattttct
    72241 ttaattcaac cttgtgacac aaatacaagc tttaacttag agctcatttt tttcctctaa
    72301 catttcagtt taccacatat aaaattcaaa acaattttat gtgaataaat ttgaatttct
    72361 ttaattaaaa tagagtcttc aaaaacgtag aggtcgatat tgcagtagaa ccattttgag
    72421 tagggtcctg atttgaagcc atttgactag caagtctcaa acaaatgggt tattaagaaa
    72481 ctgtttaaga gaacttttta aaaaggcaca atcttcctat ttttattccc tatacaaatt
    72541 caaacaatga caacattgta caggtaggta ttcagtaatg aaaaattgcc tggcaaaccc
    72601 tctaaacttt ttcagtttta tttgttttta tttttcattt ttttgagacg gagtttcgct
    72661 ctgtcgccag gctggagtgc agtggcgcga tctcagctca ctgcaagctc cgcctcccgg
    72721 gttcaagcga ttctcctgcc tcagcctccc gagtagctgg gaccacaggt gcccaccata
    72781 tatatatata caattttttt tttcgagacg gtgtctctct ctgttgccca ggctggagtg
    72841 taatggcgcg atctcggctc accgcaacct ccccctctgg gttcaagtga ttctcctgcc
    72901 tgagcctcct gagtagctgg gactccaggc gtgccaccac gcccagctaa tttttgtatt
    72961 tctagtagac acggggtttc accatgttgg ccaggaaact cctgacctcg tgatccgccc
    73021 gcctcggccc ccaaggtgtt gggattatag gcgtgaagca ccgtgcccgg ccgctaattt
    73081 ttttttctat ttttagtgga gacggggttt cgccatgttg gtcaggctgg tctcaaatga
    73141 tcctccggcc tcagccgccc aacttgctgg gattacagga agttgtgtgt ccggaaccac
    73201 tgtgtccggc cccagtttca cttttcttaa tcagacaatt cagctggaag agtttactaa
    73261 agaggcaaac ttcatagaag aacctattta cgttaaacgt ttaaatttat cctgacaaat
    73321 cctgaattcg ccgtccgtac tcaccccatc tcaagaacct ctgttgacaa gtaagtgggt
    73381 atacattttt accgcaaatg ttttaccact aagaccttaa gaaacttaaa agtaggggga
    73441 agctgacaga caaagagatt aaaaagtaaa atatacgcac acacacacag atatatacac
    73501 actaatatat attacattac gtctgagggg cgggcgccgg agaccagaag agctgcacga
    73561 ggctgcacgc gctgtgcccg acgcgcatgc gctttccttc aacggtcacc cgctggggcg
    73621 cgacagtggc tttttgaccc ctggtcgagg tcacctttcc cgccaagcgc agcgcaaggc
    73681 gcactgggag cctcagacac gggaagcctc agcgccacgt ctcaggcagt ggaaaagcca
    73741 gacagaaggg ggaaggttaa gtaactgcat gtgcagttac tgaatttctg attgtttttc
    73801 tgacttttga ttttctttct tttttttttt tttttttttt tgagacggag ttttgctctt
    73861 gtcgtccagg ctgaagtgca gtggcatgat cttggctcac tgcaacctct gcctccaggg
    73921 ttcaagcaat tctcctgcct cagcctccca agtagctggg attacaggcg tccgccacca
    73981 tgcccagcta atttttgtat ttttagtaga gacggggttt tgccatgttg gccaagctgg
    74041 tctcaaactc ccgacctcag gtgacctgcc tccctcggcc tcccaaagtg ctgggattac
    74101 aggcgtgagc caccacaccc ggctgacttt tgattttctc attggaacta ccacaaattg
    74161 cattcccacc ttccaatctc ctgtatcttc aggaaaccat ttgctttagc agtcttctac
    74221 ttagccactt ccactgtgtt atgtcattgg ggacaatgga tagctttgaa tgtaatttgt
    74281 ctcatttaga aacaaaataa aatcagagtg ggtaggtatt aatagttgga gattattcag
    74341 tagtacaagc ttcttagttt acagattatt atatatccaa atacatttct taccagtgta
    74401 aacattcaat tacaggcata ccttggagat attgcaagtt cagttccaga ccctccaata
    74461 aagcaaacaa cataataaag caaattacac aattttttag ttttccatgt gtataaaagt
    74521 tatgttggct agatgtggtg gctcatgcca ataatcccag cactttggga ggctgaggcg
    74581 gaacaattgc ttgaggccag aaattgggga tcagcttggg caacaaagtg agaccctgtc
    74641 tctacaaaaa ataaaaaaaa ttaggcagag gttgtggcac acacctgtag tctcagctac
    74701 tcaggaggct taggcaggag gatcccttga gcccaggagt ttgcagtggc agtgagccct
    74761 gtttgagcca ctgcactcct gcctaggcgg cagtgagact tgtttcaaaa aaaaaaaatt
    74821 atggttacac tattaatatc ttaatatata tatataaatt tttttttttg agacggagtc
    74881 ttgctctgtc tgtcgcccag gctggagtgc agtggtgcga tctcggctca ctgcaagctc
    74941 cacctcctgg gttcacacca ttctcctgcc tcagcctccc aagtagctgg gactacaggc
    75001 gcccgccacc acacccggct aatttttttg tatttttagt agagacaggg tttcaccgtg
    75061 ttagccagga tggtcttgat ctcctgacct cgtgatctgc ccgccttggc ctcccaaagt
    75121 gctgggatta caggcatgag ccactgcgcc cagcccttaa tatatttatt gtacatatct
    75181 tatgctttct gtagtctgtt aagtgtgcag gagcataatg tctagaaaca caatgcacat
    75241 actttaattt taaaatactt tattcctaaa aaatgctaat gatcatctga acaatcatct
    75301 ttttgccggt ggagggcctt gccttgatgt taatggctgc tgactgatca gggaggtgtt
    75361 tgctgcatgt tggggtgtct gtggcaattt cttaaattag gacaacaatg acattcacca
    75421 catcgattga ctcttccttt tataaaagat ttctctgtag catgtgatgc tgttttacag
    75481 cattttaccc acagtagaaa ttccttcaga attggagtca gtcctttcaa accctgctgc
    75541 tactttgtca actaagttta tgtaacattc taaatccttt gttaccattt caatcttgtt
    75601 ctcagcatct tcaccaggag tagattccat cgattccatc tcaataaatc actttcctca
    75661 atcatccata agaagcaatt ccagcacttt gggaggctga ggcgggtgga tcacgaagtc
    75721 aggagtttga gactagcctg gccaatatgg tgaaaccctg tctctactaa aaatacaaaa
    75781 attagccagg catgatagca cgcctgtaat cccagctact agggaggctg aggtgggaga
    75841 atcgcttgaa cccaggaggc tgaggttaca gtgagctgaa atcacgccac tgtactccag
    75901 cctggacaac agagcaagac actaacttaa aaaaaaaaaa aaagaagcaa ttcctcagca
    75961 gttcaaattt tgtcatgaga ttgtagcaat tcagtcccat cttctcatct tcaggttcta
    76021 cttccaattc tggtactctt gctatttcca catctgcagt tgcttcctcc atggcagtct
    76081 cgaacttctc aaagtcatcc atgaggattg gaataacttc ttctaaactg ctgctaatgt
    76141 tgatattttg acctcctcct atgaatcaca aatattctta atggcatcga gaatagtgac
    76201 gcctttccag aaggttttca attttctttg cctacatcca tcagaggaat cactatggca
    76261 gctacagccc ttacaaaatg tgtttcttaa cttataaggc ttgaaagtca aaattacttc
    76321 ttgatccatg ggctgcagaa tggtcttttt gttagcagac acgaaaacaa cattaatctc
    76381 catgttcacc tccattagag ctcttgagtg accaagtgca ttgacaatga gcagtaatat
    76441 gttgaaagga atcttttttt ctaagcagca gtaggtctca accatgtgct taaaatatcc
    76501 agtgaactat gttgtaagca gatgtggtgt catccaggct ttgttgttcc atttatagat
    76561 cacaggcaga gtagatttag tataattctt aacggcccta gaatttgcag aatgttaagg
    76621 gagcactggc ttcaaatgta agtcaccagc tgcattatcc tcgtatgaga aattcatcct
    76681 gtcctttaaa gctttgaagg cattaatttc tcttctctag ctatgaaagt tttcagtgga
    76741 atcttcttcc acaaagctgt ttcacctacg tggaaaatgt gctgtttagt atagccacct
    76801 gcatcagtta tcttagctag atcttctgga taacttgctg cagcttccac atcagccctt
    76861 gatgctccac cttgcacttt tttttttttt tttttttttt tttgagacgg agtctcactc
    76921 tgtcgcccag gctggagtgc agtggcacga tctcagcact tttgtgttat ggagaatggc
    76981 ttcatccttg aaacctcatg aaccaacatc tgctagcttc agacttttct tctgcagctt
    77041 ctgcagcttc tgcatctctg tcagcctttc tagaattgag gagttagggc cttgctctgg
    77101 attagacttc ggtttaagag catgttgtgg ctggctcaat cttctatcca gaccactcaa
    77161 actttctgca tatcaccaat aaagctgctt cactttctta tcattcatgt gttcactgga
    77221 gtagcacttg taatttcctc caagaacttt ccctttacat tcacaacttg gtgaactatt
    77281 tggcacaaga ggcctttcag cctatcttgg cttttggcat gccttcctca ctaagcttaa
    77341 tcattttcag cgtttaagtg agaggcatgc aactctttct ttcacttgaa cacatagaca
    77401 acattgtagg gttatgaatt gacctaattt aaatattgct gtgtcttaga aaatagggag
    77461 gcccaaggag agggagagag atggggacac agccagtaaa tggagcagtt agaacacaca
    77521 catttattga ttatatttgc cattttacat gggtgcggct tttgacatcc tatgacaatt
    77581 atcatagtaa catcaaagat cactgatcac agatcaccat agtagataat aataataatg
    77641 aaaaagttta aaatattgca agaattacca aaatgtgaca caaagacatg aagtaagcat
    77701 atgctgttgg gaaacggcac tgacagactt acttgattct gggttgctac aaacctgcaa
    77761 tttgtaaaaa aaatgcaata tctgcaaagc acaataaagc aaagcacaat aaaacaagat
    77821 aagcctgtac tcataaactt tgacatatct tcattacaaa ggagaaaaca gccactagat
    77881 atgaaaagat ttgcccaaag tcattcaact aggagagaaa tgaagactag aaccttgtta
    77941 ttagggttaa aaatagtgaa aatcttggtc taataattat tttgaattat aattaagtca
    78001 ctgttttgtg tgtttttttt tgttgttgtt gttgttgttg ttttttgaga cggagtctca
    78061 ctcttttgcc caagctggag tgcagtggca taatcttggc tcactgcaac ctccacctcc
    78121 tgggttcaag ggattctcct gcctcagcct ccccagtagc tgggatcaca ggcacgcacc
    78181 accacatcca actaattttt gtatttttag tagagatggg gtttcaccat gttggccaga
    78241 ctggtcttga actcctgacc tcaggtgatt tgcccacctt ggactcccaa aatgctggga
    78301 ttacaggcat gagccactgt gcccagccac ttttttttct ttagaggcaa ggtctctttc
    78361 agtctcccag gctggagtgc agtgatgcaa tcatacctca ctgcagcctc aaactcctgg
    78421 ctcaagcaat cctccctcct cagcctctgt aacaactagg accgcaggtg tgtgccacaa
    78481 ctctcggcta ctttttaatt tttttgtaga gtcagggtct tgctatgttg tgtagactga
    78541 tcttgaacta ctggcctcaa gtaatactcc tgccttagcc tcccaaagtg tttcaggcat
    78601 gaactactat gccaggccta ttttcatttt ttttcatttt ttttgagaca gagtcttgct
    78661 ctgtcaccca gactggaatg cagtggcgtg atctcgactc actgcaacct ccccttcctg
    78721 ggttcaagca attctcctgc ctcagcctct gaagtaactg gggttacagg cgtccaccac
    78781 cacgccaggc taatttttgt atttttagta aaggtggcat tttgccatgt tggccaggct
    78841 agcctcgaac tcctgacctc aggtgatcgg cttgccttgg cctcccaaag tgctgggatt
    78901 acaggcatga gccaccacgc ccggccccct ttattacatg ttctgattta tagtctttct
    78961 ggtaacttag agtatgcata cttgaagcat catcaagatt ctcatctaga agagaagagt
    79021 caaaaaataa taatttgatt caaagaaata gtaagtctta tgcctgtaat cccagtactt
    79081 tgggagacca aagtgggagg atcacttgag gccaagagtt ggagaccagc ctgggcaaca
    79141 taatgagact cccatctcta cacacacaca cacacacaca cacacacaca cacacacaca
    79201 cacacacaaa taagccagaa atggtggctt gcacctctag tcccagctac tggaggggtg
    79261 gggaaggtgt gtgaggtgag aaaatagctt aagttcagga gttcaaggct gcagtgagca
    79321 gtgatggcat cactgcactc cagcctgagt gacagagtga gaccctgtct caaaaaaaaa
    79381 aaaaaaaaga taagaaagga aaaaagaaaa gagaaaggaa atactaagta tctgtcacat
    79441 ttatgttata tgttttaatg gcattgtctc atggatgcct agttctggaa tatttaaatc
    79501 ttcttgtttc tttttttaat ataatttttc aaattttttg gctaatcttc tctgtatcat
    79561 tcaaatttga gtatatgtgc tgccaaagca agcaccgttt ctttactttt attatacagt
    79621 ttaaaattct gtcaactttt actctgtgat ggctcccaga tttgctttat tttttatttt
    79681 tttgctgttt tcagaaatga tgtatttagt ttagggcttt atcaccacac atatagacta
    79741 ctagaatatc ttttagattg tctctttttt tttcttgaga cagagtttcg ttcttgttgc
    79801 ccaggctgga gtgcaatggc atgatctcgg ctcactgcaa cctccacctc ctgagttcaa
    79861 gcaattctcc tgtctcagcc tctcaagtag ctgggattac aggcacccac caccatgccc
    79921 agctaatttt tgtattttta gtagaggcgg ggtttcactg tgttggccag gctggtctcc
    79981 aactcctgac ctcaagtgat ctgcctgcct cggcctccca aagtgctggg attacaggcg
    80041 taagccacca cgcccggcct aggttgtctc attttaaaat tttatttatt ttattatttg
    80101 ccaatttatt tattttgaaa caaggtcacc caggctggag tgcaatggcg ctatcatggc
    80161 tcactgcagc ctcaatctcc tgggctcaag tgatcctccc acctcagcct ttcccctggt
    80221 agctgggact ataggcacat accaccacac ctggctaact tttttatttt ttgtagagat
    80281 ggagatcttt ttatgtgctc ataacaagag gcaggaggat cacttgcgcc caggagtttg
    80341 agaccagcct cttgaacttc tggctcaagc gatcctccca cctcagcctc ccacagtgct
    80401 gggattacag gtgtgagcca ctgcacctgg cttttaaaaa aattttcttc aagtcatctt
    80461 gcatactact gtctagaaat tgttccagaa taaattctaa atgtttttgt tttaaattta
    80521 tatttttaga taatttttta tctctataag tttgaaatta cttattgtta ataatttccc
    80581 tttatattgg gcatacagtg aagtttttaa aaagtatttc attattctac ttgcttagtt
    80641 caggcattct tttttttttt ttttttaatt tattttttta ttgataattc ttgggtgttt
    80701 ctcacagagg gggatttggc agggtcatgg gacaatagtg gagggaaggt cagcagataa
    80761 acaagtgaac aaaggactct ggttttccta ggcagaggac cctgcggcct tccgcagtgt
    80821 ttgtgtccct gattacttga gattagggat tggtgatgac tcttaacgag catgctgcct
    80881 tcaagcatct gtttaacaaa gcacatcttg caccgccctt aatccattta accctgagtg
    80941 gacacagcac atgtttcaga gagcacaggg ttgggggtaa ggtcacagat caacaggatc
    81001 ccaaggcaga ggaatttttc ttagtgcaga acaaaatgaa aagtctccca tgtctacttc
    81061 tttctacaca gacacggcaa ctatccgatt tctcaatctt ttccccacct ttcccgcctt
    81121 tctattccac aaagccgcca ttgtcatcct ggcccgttct caatgagctg ttgggaacac
    81181 ctcccagacg gggtggtggc cgggcagagg ggctcctcac ttcccagtag gggcggccgg
    81241 gcagaggcgc ccctcacctc ccggacgggg tggctggctg ggcagggggg ctgacccccc
    81301 ccacctccct cccggacggg gcggctggcc gggcgggggg ccgacccccc cacctccctc
    81361 ccggacgggg cggctggccg ggcagagggg ctcctcactt cccagtaggg gcggccgggc
    81421 agaggcgccc ctcacctccc agacggggcg gctggccggg cggagggctc acccccccac
    81481 ctccctcccg gacggggcgg ctggccaggc ggggggctga cccccccacc tccctcccgg
    81541 atggggcggc tggccgggtg ggggggctga cccccacatc tccctcccgg acggggtggc
    81601 tggccgggct gaggggctcc tcacttccca gtaggggcgg ccgggcagag gcgcccctca
    81661 cctcccggac ggggcggctg gccgggcggg gggctgaccc ccccacctgc ctcccggaca
    81721 gggcggctgg ccgggcgggg ggctgacccc ccccacctcc ctcccggatg gggcggctgg
    81781 ccgggcgggg ggctgacacc cccccccacc tccctcccgg acggggtggc tgccgggcgg
    81841 agacgctcct cacttcccag atcgggtggc tgccgggcgg agaggctcct cacttctcag
    81901 acggggcagc tgccgggcgg aggggctcct cacttctcag acggggtggt tgccaggcag
    81961 agggtctcct cacttctcag acggggcggc cgggcagaga cgctcctcac ctcccagaca
    82021 gggtctcggc cgggcagagg cgctcctcac atcccagatg gggcggcggg gcagaggcgc
    82081 tccccacatc tcagacgatg ggcggccggg cagagacgct cctcacttcc tagatgtgat
    82141 ggcggctggg aagaggcctc ctcacttcct agatgggatg gcggccgggc ggagacgctc
    82201 ctcactttcc agactgggca gccaggcaga ggggctcctc acatcccaga cgatgggcgg
    82261 ccaggcagag acactcctca cttcccagac agggtggcag ccgggcagag gctgcaatct
    82321 cagcactttg ggaggccaag gcaggcggct gggaggtgta ggttgtagtg agccgagatc
    82381 acgccactgc actccagcct gggcaccatt gagcactgag tgaacgagac tccgtctgca
    82441 atccccgcac cttgggaggc cgaggttggc ggatcactcg tggttagggg ctggagaccc
    82501 gcccggccaa cacagcgaaa ccccgtctcc accaaaacca gtcaggcgtg gtggcgcgag
    82561 gctgcaatcg caggcactgg tcaggctgag gcaggagaat caggcaggga ggttgcagtg
    82621 agccgagatg gcagcagtac agtccagctt cggctccgca tgagagggag accgtgggga
    82681 gagggggagg gggaggggga gggggaggga gagcagttca ggcattcttt acttgagagc
    82741 cacaactctc aaagtctcct tggatagaat tcagggaatt tatgaatttg tatgggtaaa
    82801 aaattccatt tttattttta ctaatgtagg ggagaacata tgtttccttc tatccatcct
    82861 aggtttaggg ctgagatctc tataacaaaa gagagattag caagagaaaa gcatgcacat
    82921 ttatttaata taagttttac acgacacgag gggctccata aggaaataaa ggcccaaaga
    82981 aacagttgaa cttgaatgtt ttttatagta ggtttgatga agagcagcca gtgatgtaga
    83041 aatgtcatag ggcaaagtgt gaacaagcta aatgtaataa actgggggaa actgagaggc
    83101 ctgtttgttt agattcctct ttgtgtccct gtgcttttgg aggtaaggat gctccttttc
    83161 tctgggtgcc aggagagcac ctcttacatg acagtcttat gatctacttc aggggaaggt
    83221 cagaaaatcc ttcccaggtt ttatggctgc ttctggggag aacagcaggc agaaggtcag
    83281 agtcaccttc ctgcttctgc aatttcctca aatgccttca gcttaaaata ttcaatatgt
    83341 caaagtgctg tattttgtgt agggtgtcct gaaccccgtc actaacctct aactgaaagt
    83401 taggattttc ttcaattctg aacatagttc ataaaccacg gtaatattag caatacttca
    83461 tcatcaaaag aaaaacccag atattttcat atcacattag ttttttgcat ctattttcat
    83521 tatgcttatc actatttcaa atttaggata gctattaact cttatttaat gcatttataa
    83581 agaagcatgc atattaatat ataacaaact tatcttttaa cattttaata actgtatttc
    83641 gatataatta gattcctttg taatcctata tattttatgc atttaaaaac atgattttaa
    83701 gaaggcattc ataggcttta acagatagct aaagactaag aacccatgtt tttattctac
    83761 tttcaatact tgaaaagttg tggataaaac aaaaattatg ttaaaaactg aaaagtaggc
    83821 tgggctcagt ggctcacgcc tgtaatctta gcactttggg aggctggggt gggtggatca
    83881 cctgaggtca ggagtttgag accagcctgg ctaatgtggc aaaaccctgt ctttactaaa
    83941 aatacaaaaa ttagctgggc gtggtggtgc acgcctataa ttccagctac tcgggaggct
    84001 gaggcaggag aatcgcttga actcgggagg cagaagttgc agtgaactga gatggtgcca
    84061 ctgcactcca gcctgggcaa cagagtgaga ctccgtctca aaaaacaaaa caaaactgaa
    84121 aagtaatcaa ctaaatactt ttaaactata atattctata cacttttgat gatagccatg
    84181 tattatggta ttatttatgt ttttatattt agcaattctt gttccttaaa atgtgcttta
    84241 tggagaagga gccagagtta gtgaaatacg tttaggttaa tttaggatat attaatgaca
    84301 ttatcccaca cattgttcat gtcatagaca tgaacttaaa aaatttaaag ttcctgctgc
    84361 ctgcttagat gcatctacta atctagagca gttgtaagga agatccatct gaattagata
    84421 aaaatatgag cagtggagta ttggttattt tattctggtt ggctcagtta tttgccaaat
    84481 aaaggccctg gtcaaattgt tgatgactac gtaaaaataa tttggactaa tacattaaat
    84541 tgttgatcat taaaataatt cttccaaagt gcttgtcttt tatgcatttc tttactttaa
    84601 cataatataa aaatatgtgt tgaatcctct ttttttaaga cagggtcttg ctctgtcacc
    84661 caggctggag tgcagtggtg caattacaag ctcactgcat tctcaacctc ctgggctctt
    84721 gtgatcctcc tgtcttagac atctgcatag cagggaccac aggcacacac caccactcct
    84781 ggatagtttc ttttttattt ttgtggagac aaggtctcac tatgttgtcc aggctggtct
    84841 tgaactcctg ggctcaagac gatcctccta cttcagcctc ccaaagtgtt gggattacag
    84901 gcatgagcca ctacacgtgg cgttttgaat cctcttctgt acaaagatac aatctgagtc
    84961 cttaggacat ttatagccta atattttctt ccatcaaagt tgctaaaccc catcaccaga
    85021 ataccacaat taatattctc aggatagatt attaatagtt gcctaggaat aagaatggta
    85081 tgtggcaatg atagaaattt gaaatggcag atttattaag tatttttaag ccatcaagtt
    85141 acaaaaggct attaggggtc ttaaaaagac aaagagtata caataaaaca ataagcaatt
    85201 acagattgtt ttaggttgaa caatttttgt acagttatat ctatagtaca ttactaattt
    85261 acttagcaaa ctttatcctg agatttgcaa atttaaaaaa atgaagaatc aaactatatt
    85321 ttcttttctg tttttttgaa acagagtctc cactctgtca tccaggctgg agtacagtgg
    85381 tgtgatctca gctcaacata gcctcaacct cttgggttca aggaatctcg tgcctcagcc
    85441 tcccaagtag ctgggattac aggcatgcac caccgtgctc ggctaatttt ttgtattttt
    85501 agtagagatg gggtttcacc atgttagcca ggctgctctt taactcctgg cctcaagttg
    85561 atatgcctgc cttggcctcc caagtactgg gattacaggc gtgagccacc gtgcccggcc
    85621 tcaaactata ttttctaatt agtacagtaa atacacatta atcatgacac aattatttct
    85681 tctggataca gcaggaaaaa ttaagttaag agaaagtgct tcatatttct tgatttgtta
    85741 taaaaaggag agatcctcta gatttctggg taagagggaa agctagcaat cccacaatga
    85801 tttcctttat ttcttgccat ccgaatatat ccttcttcac caaagttgtg gccccagctt
    85861 tagaaaaaga aataatacaa aattacaaat gcgtacaatc aaaccatgtt atcaactttt
    85921 tatctcattg tttttctcct ataagtgatt acatatgaaa ggccgtaaaa caaaattaga
    85981 atgaaataag gtaataattt ctccagggat ttcaggtaac tcaaaataat ggtttggaaa
    86041 gctttagtgc tatattgatg aatagactta atacctaagg aaaactatag taagatccat
    86101 cccagtcctt taatcctcct actttgcaaa tatgccttct ctaaagaaag agtattagta
    86161 tcctttataa gcctttatct ctaaattttt tgataacagt atcatgtgta atgattttct
    86221 atcaaactta aattcctcta gagaatgtca attaagaaat atttttgagt gactactatg
    86281 aaaaaggtat agggcaggga attctaaagc aaacaacaaa aatgcagctc tgaaagagct
    86341 ttttgatcta gaagtggagc tgactttctc ataattaaag ctattgaaaa atgagccagg
    86401 catggtggct cacacttgta atccaagtgc tttgagaggc tgaagtggga ggattgtttg
    86461 aggccagagt ttgagagcag cctgggcaaa atagcaagtc cctgattcta caaaaaaaat
    86521 ttttttttaa ttagccaggc atggtggcat gcacctgtat tccttaccta ctctgcagca
    86581 gaggctactt gttagagatg ttgcagagaa aaatcatggg tcggattagg gttcgagtag
    86641 atcagtgttt cactttttga tttaatggac ttagaggagt tcaaaaagtt tttacagata
    86701 gatatacagt tatcaatttt aaaattagca tgtatggaca tctttttttt ttttttggag
    86761 gtgaagtttc gctcttgttg cccagactgg agtgcaatgg tgcaatctca gctcaccgca
    86821 gcctccatct cctgggttca agtgattctc ctgcctcagc ctcccaagta gctaggactg
    86881 caggcatgtg ccaccacgac ggctaatttt gtatttttag tagagatggg gtttctccat
    86941 gttggtcagg ctggtctcca actcctgacc tcaagtgatc tacccgcctc aggctcccaa
    87001 agagttggga ttacaggtgt gagccaccac gcccggctgt atggtcatct tttttaaaaa
    87061 gcatactatt ggtcgggcac ggtggctcaa gcctgtaatc ccagcacttt gggaggcaga
    87121 gacgggcaga tcatttgagg tcaggagttc gagaccagcc tgatcaacat ggtgaaaccc
    87181 cgtctctact aaaaatacaa aaactagctg ggcatggtag catgagcctg tagtcccagc
    87241 tactcgagag gctgaggtag gagaatcgct tgaacctggg aggcaaggca gaggttgcag
    87301 taagccgaga ttgcgcctgt gcactccagc ctgggcaaca gagtgagact gtctcaaaaa
    87361 aagaaaataa ataaaaacca tactacttac tgtcaccatt atgttgtaaa agaaaataca
    87421 ttttaacacc aaaagcaaaa aatccaaaga cataatctct attttaagga tagtcttttg
    87481 agtaaataaa tttatattaa tgaaaaagtc accaaaatat tcttattttt cttactttac
    87541 caagactgat gaaaaaactt agaaaatcct tgtaagagga atgatttctg aagctcttta
    87601 tcattctgag attttatagt ggtaggtata taaatatcat acaaggcaaa attcattaag
    87661 tgctctagga tacaaataaa agtgtgaaat tgagaaaatg cctagtgtgt ctggagagca
    87721 gtaagaaatt tagttgaaag gaaaactgga tatctttagt agaatagtgg aaaaagagga
    87781 tagaaagctg gtttggggcc atgttgtgaa cagcttagaa cagcaggtgg aggaacatgt
    87841 acttcaatta caaaggcaaa gggaagctgt tagaaatttt ggctcaggga ctcaaagttt
    87901 gtgtttttgc atgcagtgga gaacctaaga aatgctggtt tattgttgga agacaaatga
    87961 caccatttaa aaggactgtg ctttaagagg ccttaaatat tctatgtatc agccctcatt
    88021 ggacattttg caggtcctat ttttttctct ctgtaccatt taacacactt gtctactccc
    88081 accttactgg tctcttggct tcaagacact actatgttct gattttctta tctctctctg
    88141 gtcagtcctt ctcaaacatt ttcatgagct cctttcttta cttcctttca cttaattgcc
    88201 actccactgc ttagaaattg tcaatggctc tccatgcctt tacaataaaa ttcaaactcc
    88261 atcgtcacct ctaataaacc actggctgta gtgggttttt ttgtttgttt gtttgttttt
    88321 gacagagtct caccgtgttg ccctggttgg agtgtggtgg tgcaatcaca gctcactgca
    88381 gcttcgacct cctgtgctca agcaatcctc ctgcctcagc cttacaagta gctaggacca
    88441 catgtgcata ccaccatgct tggccaattt tttttttttt ttttttgtga tggagtcttg
    88501 ctctgtggcc caggctggag tgtagtggca caatctcggc tcactgcaag ctccgcctcc
    88561 cgggttcacg ccattctcct gcctcagcct cctcagtagc tggaattaca ggcgcccacc
    88621 accacgcctg gctaattttt ttgtattttt tagtagagat ggggttttac cgtgttagcc
    88681 aggatggtct cgatctcctg accttgtgat ctgcccgcct cagcctccca aagtgctggg
    88741 attacaggcg tgagccactg cgcctggccc aattttttat tttagtaaag aagaggtctc
    88801 actatgttgc ccaggctgta gcaatgctct tttaatatcc agtttgataa gcaaaagcaa
    88861 taacaagaat ccggccaggc tattattaat attaatagac accacttatt tagcacttac
    88921 tctgccaaac agattaacat aaattatcta atttaatcat catcctgtga ggtgaatatt
    88981 atcttctttt actaaagaaa aaaacagaag ggcagagtgg taaagtagct tgccaaaagt
    89041 cacatagata gtaaattgtg aagccagagg tcaaatctag gattatttta ttccaaagcc
    89101 cactctaact tttatctatg cctatcccta cagttatctc tgtgagtgaa gtcatatggg
    89161 gaatgaaatt cagaccagta catctaggat ctcatgtaat ttatatccta ctatggtcac
    89221 tgcattaaat aataggtaaa ataaatcatt ttaggtgaac tgaatttgga cctgagttct
    89281 agtatgaaaa ttgaattgag ggtctagtta ctagcagtat gaattcagtc agtgagacca
    89341 aggaatccca gaccccattc aacatctaaa caatttactt ttctcagttt taccgaccaa
    89401 gatcaattgc ccccagtttt gcaaaagatc ccagacctct ttttgactga gtcctgctaa
    89461 cttcccttaa acaatgaaac tgataaaaaa aaaaaatctt aagctaaata aagtatctga
    89521 ggacaaagtc ttgccttcag aattatacaa gatctgctcc aaatgttcat ttcggtactt
    89581 accaagttgt atatttttat taattctata gctctttcac tcaaactgtt caactttagg
    89641 tgagtttgtt aactaaatgc ctatctcata agagtacttt aaaaattaaa tgagagaatg
    89701 tatttaaagt actcacacaa tatttggtat atagccaatg cttactaagt gaaggtagct
    89761 atttttatta ttatccaggt atgagtacat cagattggac tcaagtagag tccatctact
    89821 tttttttttt tgagacggag tttcactctt gttgcccagg ctggagtgca atggcacaac
    89881 ctcgcctcac tgcaacctcc acctcccagg ttcaagtgat tctcctgcct cagcctcctg
    89941 agtagctggg attacaggcg tgcgccatca tgcccggata attttgtatt tttagtagag
    90001 atggagtgtc tccatgttgg ccaggctggt ctcgaactcc caacctcagg tgatcctccc
    90061 acctcggcct cccaaagtgc tgggattaca ggcgtgagcc actgagccgg gctgagtccg
    90121 tctactttta aaaaaatgta tagcgttctt ctagggtgat caccagtccc agtttgcctg
    90181 ggactgtcct gtttttaaaa agaaagtcct atgtcccaga aagcctcgta gcccttagca
    90241 aagtgggaaa gttgatcacc ctaccttctc cttgcatatt ataaggctac taacttatta
    90301 ttccttctaa aatcaatgcc agtttccttt gtaatagact gaagtcctgg gcttcttagg
    90361 ctgtgccaga cactcttcta ggcattagga ttacagagac aaagtctttc taaagcctag
    90421 gttctagagg agaaacatat gataaacgaa gaaatacttg atgatatata tgattagtac
    90481 aataaagaaa aataaagcag gataagggat acagagtgtt aagagacagg gctcaatttt
    90541 aggtggaatg gtcgggaaaa gcttcgctga gtggatgttt gcgcaaagat gcaagtgagg
    90601 caaagacctg gataaagggc tagaaataag catgatgtgt caagaaatag ccaaaaggct
    90661 aatgtgctgg gagctgagtg agagagggga gagtagttgg agaggtagtt ggagaggcag
    90721 gctaaggcca gataatgtca tgctttacag gtcatgggtt ttattctgag tgggaaggaa
    90781 agccattaga gggtttgggg tagaagggta acataatcta ctttatgctt taaaaagata
    90841 attctggtta ctgtgtagag gattgattcc aagcaaacaa gagggaagca gctaccagtt
    90901 atgagattat tgcaatagac caggagatgg atggtggctt gaactctaat ggtaacagca
    90961 gaggtattga gaagtgcaag caaatcttgt tttattgtgc tttgctttat tgtgcattgc
    91021 aaattttgca tattttacaa attgaaggtt tgtggcaacc ctgcatggag caagtctatc
    91081 ggcacaatgt ttccaacagc ctgtgctcac tttgtgtccc tgtgtcacat tttggcattt
    91141 ctcataatat ttcaactgtt ttcattatta gcaaatctgt tatggtgatc tgtgatcagt
    91201 gatttttttt ttttttttga gataggatct cactctgtca tccaggttgt tgtgcagtgg
    91261 cacaatcgca gctcactgca gcctcagcct ccaggtctca cgctcctgcc tcagcctccc
    91321 aagtagctgg gaccacaaat atgcaccagc atgccagggt aattaaaaaa aaaaactatc
    91381 tgtagagaca gagtcttgct atgttgccca agctgggatc agtgatcttt aatattacta
    91441 ttgtaattgt cttggggtgc catgagccac gtgcatataa gatggtgaac ttaactgata
    91501 agtggtgtgt gttctgactg ctccactaac tgacacttcc cccatctctc tctctctcct
    91561 ctggcctccc ttttccctga gacacaatga tattgaaatt aggccaatta ataaccttac
    91621 aatgggccag gtgcggtggc tcacgcttgt aatcctagca ctttgggagg ccaaggcagg
    91681 tggatcacct gaggacaaga gttcgagacc agcctggcca acatggcgaa accccgtctg
    91741 tactaaaaat acaaaattag cagggtgtgg tggtgcgtgc ctgtaatccc agttactccg
    91801 gaggctgagg caggagaatt gcttagaacc tgggaggcag aggtttcagt gagccaagat
    91861 tgcaccactg tactccagcc tgggcaacag agggagactg tctcaaaaaa caaaaacaaa
    91921 aaacaacctt acagtggctt ctaagtgttc aagtgaaagg aagagttata catctctcat
    91981 tttacatcaa agctagaaat gattatgctt tgtgaaagcc aaggtaggct gaaagctaga
    92041 cttcttgaac cagttagcca tgttgtgaat gcaaagggaa agctcttgga ggaaattaaa
    92101 atgctactcc agtgaacaca tgaatgaaag caaaacaaca ggccaggtgc agtggctcat
    92161 gcctgtaatt ccagcacttt gggaagctca ggcgggtaga tgacttgagc tcaggagttc
    92221 aagaccagcc tgggcaacaa gctaaaaata cagaaattac ccgtgcctgg tggcacatgc
    92281 ctgtagtctc agctacttgg agggaggcgg aggttgcagt gagctgagat tgtgccactg
    92341 cactccagcc tgggtgacaa agtgagacct gtctcaacaa caacaacaac aacaacaaca
    92401 acaacaaaaa gaaagaaaga agaaaaaagt aaagaaaaaa agtgaaacag ccttattgct
    92461 gatatgattt gagtggccta gataaaagag caaactagcc acaacatgcc cttaagcgaa
    92521 aacctaatcc agagcaaggc cctaactctc ttccattgta agaaggctga tgaaactaaa
    92581 attaaagaag ttaaataacc tgccgaagtt cttctatgcc tccagtgcaa cctatgccac
    92641 ctcgatttta gcatttatca cattgagttg caatggacta cacatgcacc tgcctgtttc
    92701 cttcactcca ttagactatg agtaccttga gggacggtag aagagctgca tttttcattt
    92761 ccaatcgcta gttcctagct tggcacatgg taggtactca atacatgttt gttgaatcag
    92821 tgtccaaatg aagatgcaaa aatgaacaag agactatcct accttgaaga agtttacatt
    92881 ttagtgaatg gggcatggtt tgcacaaata gctataacac aagggatatg tgtaaaatac
    92941 tataatagaa ttcataaagc attgccaaag cttaggaatt agagaaacaa ttctaagtga
    93001 atggggtatg aaggagatag tatttgaggc aggatttaaa aaatgaatag gatttgtaat
    93061 gacttttttt tcattttaaa attaatacct ttcttttttt tccttccttt tttgagacag
    93121 agtctcgctc tgtcaccccg gctggagtgt ggagcgcagt ggtgtgatct tggctcactg
    93181 caagctccgc ctcccaggtt catgccatcc tcctgcctca gcctcccgag tagctgggac
    93241 tacaggcatg cgccaccacg cctggctaat tttttttgta tttttagtag agacagggtt
    93301 tcaccgtgtt agccaggatg gtcttgatct cctgacctag tgatccacct gccttggcct
    93361 cccaaagtgc tgggattaca ggcatgagcc actgtgcccg gcctccttcc tttttttttc
    93421 gagatagggt ctcattctgt cactgaagct agagttgagt gatgtgatca tggctcactc
    93481 acagcggcct caacttcctg ggctcaagtg atattcccac ctcagtctcc caagtagctg
    93541 ggaccacaag tgtgtgtcac catgcctggc ttatttttgt atagagatgg ggtctcacta
    93601 tgttccccat gctggtcttg aacccctggg ctcaagcgat cctcccagtt cagcctccca
    93661 aagtgctggg attacaagca tgagccacta agcctggttg gacatactat gtttaactta
    93721 aatttttttt tgacataggg tcctgctctg tcacctaggc cagagggcag tggtgtgatc
    93781 atggctcact gtagcctcaa actcctcagc tcaaaaaatc ctcctacctc agcctcttga
    93841 gtagttgata ttacaggcac atgccaccat gcccagctaa ttttttaaat ttgtttatag
    93901 agactgggtc tccctatgtt tctggtctta aactcctggc ctcaagtgat cctcctgcct
    93961 tggcctccca aagctctaga gttataagca tgagcttacc acattcagct aatatttata
    94021 ctacatataa attttgtatt tatgtgtatc ttttttcatt taaatatata tcccaagtat
    94081 tttattaggt tattaataat agtctgaata aatctaatgc atggagttga ttatgtcata
    94141 gtttatttaa tcatatacct attgttaggc atttttggct acttactagt ttttactatt
    94201 aaataataat gctacaattc acataagttg tgtatgaaag ttttcccata tttaggatta
    94261 ttttcttcat atagattcca agaaataaaa gtaatgggac actggctgtg cgcagtggct
    94321 tgcgcctgta atcctggcac tttgggaggc caagacaggc agatcaccca agatcaggag
    94381 ttggtagacc agcctggcca acatggtgaa accccatctc tactaaaaat acaaaaatta
    94441 gccaggtgtt ttggtgggca cctgtaatct cagctacttg ggagactgag gcaggagaat
    94501 cacttgaacc tgggagacgg aggttgcagt gagccgagat tgtgccactg cactccagct
    94561 tgggagacag agcgagactc tgtctcaaaa aaaaaagggg gggggaagaa tatgaacatt
    94621 taaaaagttc tttatcatat tgtcaaattg cttccaagaa agagaatagg cagggtaggg
    94681 tggctcatgc ctgtaatccc agcacttggg gagaccgagg tgggcagatc acgaggtcaa
    94741 gagatcgaga tcatcctggc caacatggtg aaactccatc caacatggtg aaactccatc
    94801 tctactaaaa atacaaaaat tggctgggca tggtggcaca tgcctgtagt cccatctact
    94861 cgggaggctg aggcagaaga attgcttgaa ccttaaaggc agaggttgcg gtgagctgag
    94921 atcatgccac tgcactctag cctggtgaca gagtgagact ccatctcaaa aaaaaaaaga
    94981 ataaatacag cttactttcc catcaaactg tataagaatg cccatttcac caagacaggt
    95041 gggatttcta tagacagcat tggacttcac aggcatttca gaaagataca gcatatataa
    95101 agaaacagag gtagggatgt atctgtaaac ttagggaatg atgaataagt agttcactgt
    95161 ggccagagta tgactgtggg tatatgaaaa tatagtggaa gaaatgtcta aggttatgtt
    95221 aaaagctaga taatggagta cctggaatgc ctggccagga gtttggactt catttttaga
    95281 caatggtgaa ctattcagaa tttttgagca ggagttgaca tgatctatat tttaaaaaga
    95341 taccgctata gctgggtaga atggaatctg aattacagca gggccattgg gaatggcaaa
    95401 gagaaaatac atataaaata tgattcacag gccaaatgga tagggctggg tgggttgagg
    95461 ggggaaaagg aaaaggccaa gatgatgctt aggctttaat aatctgaaga tggtaatata
    95521 ttacccaaaa taaggaatag ttttcccttg agcttgtttg aaaatccttt tcagggatga
    95581 gggaagaaag gaagagaaga acattgtaaa cttattgagt atgagagtgt gtatatgttg
    95641 gtagtagggt gatgtaggtg gggagtttgg atttttcaag gagagctgcc aaattggaag
    95701 ttggacaagg aagtgtgtca cttaggaggt cattttcaga gattatgaac tggggatcat
    95761 ctacataatg gcaatagttg aaactgaagg agcaggtgag tttgctttga tatagatgag
    95821 tttgctttga tatagcaaac tcatgtaaca ggcaggtgga taaaataacg taagatatga
    95881 ggctaaagtt gatgtagcaa aatgaatcaa gatagtgaga aatgtgccat tgcatatgtt
    95941 ttttgggtca ttgagagcaa ctttccaacc tctaacacac tttcctcctc caatacatga
    96001 acaattccat acatctggaa acttgattca aatagtgtct tctctgtgga ggtttaggct
    96061 gttaggcatt tcctcttcaa cagtcccata gccatttaga catgtatcat tataccacga
    96121 tgtgtttgtt gtatcgttta aaaatacatc taggctgggt gcggtggtgg ctcacgcctg
    96181 taatcccaac actttgggag gctgggagat atatatatat atatatatat atttataaat
    96241 atatttatat ttatatatag agagatttat atgtatgtat acataatata ttatatatgt
    96301 atattatgta tacataatat attatatatg tatattatgt atacataata tattatatat
    96361 gtatattatg tatacataat atattatata ttatatgtat attatgtata cataatatat
    96421 tatatattat atgtatatta tgtatacata atatattata tattatatgt atattatgta
    96481 tacataatat attatatatt atatgtatat tatgtataca taatatatta tatattatat
    96541 gtatattatg tatacataat atattatata ttatatgtat attatgtata cataatatat
    96601 tatatattat atgtatatta tgtatacata atatattata tattatatgt atattatgta
    96661 tacataatat attatatatt atatgtatat tatgtataca taatatatta tatattatat
    96721 gtatattatg tatacataat atattatata ttatatgtat attatgtata cataatattt
    96781 atatattata tgtatattat gtatacataa tatattatat attatatgta tattatgtat
    96841 acataatatg tacacataat atttatatat tatatgtata ttatgtatac ataatattta
    96901 tatattatat gtatattatg tatacataat atttatatat tatatgtata ttatgtatac
    96961 ataatattta tatattatat gtatattatg tatacataat atttatatat tatatgtata
    97021 ttatgtatac ataatatatt atatattata tgtatattat gtatacataa tatattatat
    97081 attatatgta tattatgtat acataatata ttatatatta tatgtatatt atgtatacat
    97141 aatatttata tattatatgt atattatgta tacataatat attatatatt atatgtatat
    97201 tatgtataca taatatatta tatattatat gtatattatg tatacataat atattatata
    97261 ttatatatgt atattatgta tacataatat attatatatt atatatgtat attatgtatt
    97321 atattatata ttatgtatat tatagattat gtatgcatac ataatatgta ttgtatatta
    97381 tgtatgcata cataatatgt attgtatatt atgtatgtat atataatata tattatatat
    97441 attttttcat gttagattgt gagtgccttt aggacaggat ggtttcttac tattttttgt
    97501 atctccaggg cctagataat gccagtcaca ttgtagatgc ctagtaagtg cttgttgaat
    97561 gaatgagtga agagaagcag atttgaattt caacactaaa agatatagat gatcttcaag
    97621 agcaacttca gtagagaagg gaatttagaa cacctgccgt agggagttcc ttagagagtc
    97681 aataagatat gagggaaata gtttcatttt aaaaagctat cactggtcta cacccaagct
    97741 gatcagtgct gtgaataggt tatcagtgaa ccaagatatt gatcccctca gctactttcc
    97801 ccatgggcca tacgaatatc atcatcttct atgtgtgcca cagtgggaaa agggtaggaa
    97861 agcacctttt gtttgatttg tagactatgt acttcaagta aagatttaag gggtgttatg
    97921 ggttgcattg tgtccccaca aaattcatgt tggaatccta acctccagta cctcagaatg
    97981 tgaacttatt tagagatagg gttgatgtag acataattaa ttaaattaag atgaggtcat
    98041 tagggtgggt cctaatccaa catgaatgat gctcatgaaa agggaaaatt ggccaggcgc
    98101 agtggctcac acctgtaatc ccagtgctct gggagaccaa agcaggagga tcacttgatc
    98161 ccaggagttt gagaccagcc tgggaaacat agagagatcc tgtctctaca aatagaaatt
    98221 aaaattggtc tggtgtggtg gtgcatgcca gtagtgccag ctactttggt ggctgatgtg
    98281 ggaggattgc ttgagcccag gagtttgagg ttgcggtgaa cattgatcac cccactgcac
    98341 tccagcctgg gtgacagagt gagaccctgt ctcaaaaagg ggaagaaagt aaaattatga
    98401 atagagacag aaatgcatac agggagaatg tcatgtgaaa atgaaggcag aggtcagggt
    98461 gattgtccta aaggctgagg aaagtcaaag atcaccagca aactttcaga agctagggga
    98521 gaggcatcaa actttttttt ttaattaaaa gaacatattt aattgacaga ttgtataaac
    98581 tcaaggtata cagtgtgatg acttgatgta tgtgcacatt gtgcaatgat tgacataatc
    98641 aaatgaaata acacatccat ctccacccgt gctgtacctt tgattcccag aatctgttca
    98701 tcttatacct gaaagtttac actctttgat cagcatctcc tcattaccca gcctccagtc
    98761 cctggcagct atcattctac tgtctgtttc tgtaagtttg acttttttag atcccacata
    98821 taagtaagat catacagttt ttttttcctg tgtccacgaa tattttcttc ctcacagcgt
    98881 gagaaggaag taacaccttg atcttagact tccagcctcc agaactggga gacataaatt
    98941 tctgtcattt aagctgctca gttcgtggca ctttgtcatg gtagccatag caaactaaca
    99001 caaagagaaa ctccaggtat gccccaaagt ctcaactctt catatttcct ccattttttc
    99061 cctctagaag ttatgttcct tttgtttatg agtgatatag atgtcagttc aatacatata
    99121 ttgaggggcc tttaggtgcc acccacttag tctgtgctgc ggatatgaag accagacata
    99181 atctctcccc aaaggcacgt acagaacact ggagtgacag acaggtaaat atacaattta
    99241 aatataatgc ctaaaatcct gtgatccagt atgcatagga cattttggca gcatagaaaa
    99301 aagacatgtt cctcagccca aggattcagg aaagactttt ctggaataga ttgtgcctga
    99361 gctacttctg aagaaaagga gaagtaaacc aaaagagaag acaggatgtt ctaggcagag
    99421 ggaacacaaa ccaagatatg aaggaaggta ttacagataa agtggaatgt atgggaacta
    99481 caactgtgga tagtgcagta taaccagagt gaaaatagga ggcaacaaat gccaggaggt
    99541 gaggctggca aggcagggag cacagcctat aggatttgtc ttctgtcagg gaacattaat
    99601 tttacaccga atgaaactca gtagaagagt tataagccca gactcgttca tattttagac
    99661 aaataagttt ggcagcagta tgaaagatgg atttgagggt aaaggggaag aaaatagaga
    99721 gtaaaggaga ccagttagaa gggtacagca atagtccaca ctagagatgc ctgaactaga
    99781 gccgtaatag aagaaacaga gaaggaacag aatcaaggac atagataagg taaaattggt
    99841 ttgactaatt ggttgcttgg ttggacttgg ggaggtctca gatgaatcca tcatatctta
    99901 ttgggtgatt gtgttgtgtg gccctttact gatatgggaa attctgaagg aggaacagta
    99961 tgggggagag agtactgaat tatttggatg tattgagttt ggaaagacta agatatatgt
   100021 aagtgagtgt gtccagataa gtctgaagtt ctgggaagat gtcagaacta gagctagaga
   100081 ttcaaaagct attggcatat aaaggttata gctagagtcc taagagtgga attatagcac
   100141 ccaagaaaaa ataaaataag aagaggaatg ggcagatgat agagccctgg ggaaaactaa
   100201 catttttggg ctaaatagag gagaggaata ggttcatgag agaggctgag aaggaataat
   100261 cacagaagta gtacagaaga taaaccagga aaacaaagtg atgaaggcag aatggtaagt
   100321 gaaaagatgg gggcactgag taatttagaa tgtcactttg taagaagcta taaagttcat
   100381 cattgtcaat atcgtgatca taatgatcat catcaaaatc atgagagtat ttgctgaaac
   100441 tccctcaaat tcctgattcc aattaaatat aaagtgtaaa tatattacct gtttttcaca
   100501 agccagtatt ctttcccatt aagatcacca tagccaacca caagtacacc atgattcaca
   100561 ttctgagtac aggatggttc atagtagaca cctgagaatc aaatatgggt gggaaaaaag
   100621 agtgaatact ataggataat aaagggcatt ttaaaacggt attacttttt attagcctgc
   100681 tttggataca tagcaaggtt aatttaaaaa gttaaggaca tgtccaggca tggtggctca
   100741 tgcctgtaat cccagaagtt tgggaggcca aggcgggcgg gtcacaaggt caagagatcg
   100801 agaccatcct ggccaacatg gtgacacccc atctctacta aaaatacaaa aattagctgg
   100861 gcatggtggc acgcacctgt agtcccagct actcaggagg ctgaggcagg agaattgctt
   100921 gaacccagga ggcagaggtt gcagtgagct gagatcgtgc cacctcactc cagcctggca
   100981 acagagcaag actccatctt aaaaaaaaaa aaaaaagtca aggacatata ctgtcatgca
   101041 taaactaggg ctcataatta tttgccacca atgctgtatt tgtcagtgtt cttgccaagc
   101101 ttagcatcac tccagtcata gttcccccag aaaccttctt tgataagcac accacctttt
   101161 tctaagtttt tttctttttt gtttttttct aagtattttt aaacttgcat acaatcagtt
   101221 attttaattc tctgtgtata caggcatatc tcattttatt gtgctttgct ttattgtgct
   101281 ttgcagatat tagctttttt ttttttttaa ggaacagtgt tttgctctgt tgtctaggct
   101341 gtaatacagg tggtatgatc atagctaatt gtaacctcaa tctcctaggc tccagtcatc
   101401 ctcctgattt ggcctcccaa agcgctggaa ttataggcat gagccactgc acccagctag
   101461 atattgcatt ttttacaaat tgaaggtttg tggcaactct acattcagca agtctattgg
   101521 taccattttc ccaaaagcat gtgctcacat tgtgtccttg taccacattt tggtaattct
   101581 cacaatattt caaactttct catcattatt atatctgtta tggtcatctg tgatcaatga
   101641 tctttgatgt tacttcataa ttttgagggt gagctacaaa ccacacctat ataagatagc
   101701 agacttaacc cacaaatgtt gtgtgtgttc tgactgatcc acggactggc cacaccccca
   101761 tctctctcct caggcctccc tgttccctga gacacaatga tattgaaatt atgccacatg
   101821 ataaccctat gatagcttct aagtgttcaa gtaaaaagaa aagttgcata tctctcactt
   101881 taagtcaaaa gctagatgtg attaagctta gcgggaaagg catgttgaaa gccaagacag
   101941 gccagaacct agcctaggcc acttgtaccg aacagtcagg catgctgtga atgtaaagga
   102001 aaagtgcttg aaggaaatta aaaatgctgc tctgctgaac acacaaatga caagaaagtg
   102061 aaacagcctt attgctgata cagagaaaat gtgagtggtc tagatacaag agtaagccag
   102121 cccagcatgc ctttacgcca aaacctaatc cagaacaaga ccctaactct cttctattgt
   102181 aagaaggctg agagaggcga ggaagctaca gaagaaaagt ttgaagccag cagatgttgg
   102241 ttcatgaggt ttaaaattgc ctcgcctcac cccgccccgc ctcaccccgc cccgccccgc
   102301 cccgccccac cttgccctgc cctgccctgc cctgcccctc tttcttcctt ctttctcttt
   102361 ctttcttcct tgttttttag agacaggttt tcactctgtt gcccaggttg gagtgcagtg
   102421 gtatgatcat agctcactgt aacctcaaat tcctgggttc aaggagtcct cctgcctcac
   102481 cctcccaagt agctgagact acaagtgagt gtcaccatat ccggctaatt tttattttta
   102541 tttttcgtag agataagggt ctcgcgatgt tgctcaggct ggtgtcaaac tcccaggctc
   102601 aagcaatcct cccacctcag cctccaaagc tagaattaca ggcatgaacc accgtgctca
   102661 gccaatatcc tttctttctt tctttaaatt ctaattattg tttattttat tttttaccac
   102721 ttctgtagag gaagaaagaa ggatgacgcg catctacacc aacagacact gggcctttat
   102781 tggccacagc ttctttcagg acatcttctc tgccataagg aagttcagtg tactttgaac
   102841 atgtggcagc acgatatttt gagtcatatt gacatttctg atcctgcaaa aagaagtata
   102901 aatatgatga agtatacttc aactcctgta taaatcacct gtatcctaag cctggatttt
   102961 aaatgtctcc cttcctttct ttaacttttt catgtgtcta aggctccttc ctttgcaaat
   103021 cttccaaatc ctaacagtcc cgggtttcag caataatgga ggaaatacag aactattcta
   103081 ttctttcaaa ttgtgaacat cctcatacat gcaaaacttg ccaggacaat ggcccctcag
   103141 actttcaggt ggactgtagg gaaatattac atagagataa taaactctag atgagagctt
   103201 taaaatcttt ttgatgcctg caagatgatc tgcagctaaa atgatggtat ataaagacca
   103261 tataatatgg gaaacacaag tatttcaagt gggaaaagga aagttacaaa ttcatgtttg
   103321 ttgtatatac aactattttg tatttattta tattatttat ttttttgaga taggctctca
   103381 ctccattgtc caggctggag tgcagtagca taatcatggc tcactgcagc ctccaactcc
   103441 tgggccaagt gattctccca cctcagcttc ctgagtagct ggaactgcta ctcaggtgtg
   103501 caccaccatg cctggctaat ttttttgatt tttagtagag gtgaggtctc actatgttgc
   103561 acaggctggt ctcaaactct tgagctcaag caatcctccc acctcgacct cccaaagtgc
   103621 tgacattaca ggtgtgagcc accacacttg gccaactatg ttttttaagt gcttaacaag
   103681 aaaataactt gaaagaaata aaccaaaaca atagtggtta aggtagaagg tttttgagtg
   103741 atatttttat atttttaaaa tcttctgtaa catggacaca atccttaggt tatcgttata
   103801 tcaaaatgta taatgaattt ctgaggaaga ggagagtgtt atttagcttc tcagtgggtg
   103861 gaagaatgat gtggttaatg atctgagaat aaggttgagc atgggaggct gcagaagaaa
   103921 aagtttaaat acagagaaat aattttaagg atagtttgct taaaatctgt ttaattttgg
   103981 gatggagtct tgctccgttg ccgaggctgg agtacagtgg cgtgatctcg gctcactgca
   104041 cctccacctc ctgagttcaa gcgattctcc tgcttcagcc tcccgagtag ctgggaccac
   104101 aggcatgcac caccatgcct ggctaatttt tgtatttttg gtagagacag ggtttcacca
   104161 cgttggccag gctggtctca aactcctgac ctcaggtgat ccacccacct cggccttcca
   104221 aagtgctgag attataggta tgagccacta cgcccagtca ttaaaatctg tttttaagca
   104281 tgaatgctac ttccttattt gtactactga tatattaaga agagtacttc atcagaaaca
   104341 gaactcttga gaatttaaga attcctcttc agggttctct ctcacacaca atactgcttc
   104401 tcaagcaatt ggcatggtgg caagcccagc acagtcagtc agtggataac atgagatcat
   104461 ggggcagcac aaaaagtttg cttaagacgc accatggctt tgtagggata ggaagcgtct
   104521 gagtcgatgc ccttgttatc aatgatgtac tggaaagccg ttgtcatgaa gccaccattg
   104581 cagcctttgt ttccatattt ttcagttgag caatccacca ggttctgggc actgagagac
   104641 accagctttc ctgttttcag cttcagctgt gcttccaggg cccccacagc actgaaagcc
   104701 cagcaagcac cacaagaacc ctaaaacaga tacaaggtca caaacgcaaa tcacagaaaa
   104761 tcattcggaa tgacttccat aacccaaact ggacaacgct cagctacatc agttctgtcc
   104821 ctagttccca gaaagagctc ccccactaca gtttccttct gtctttaaca atgggaatga
   104881 agactaaatt taactcaatt gttatctagg caaggtccaa ccttgataga gatttgcaat
   104941 cagccaagct cagaacataa tggccatatt caaatatact tagaaacaat actcactgag
   105001 catgtaccaa attgcaccta ttttcaccca taagtaaggg agatagttgc ctacagtaat
   105061 gagattttga gagagctagg atgatttcct tgacctaaaa taccctttct cctgtcatat
   105121 gatgcctgaa acataagaag tttaatccta ggttggcatt tttcaaacct ttgttatctt
   105181 cagaattttt ttgattaaaa aatattttac aaggaggcat aaacaaacaa aaaaaggcaa
   105241 agctgcttta gttttaaaag gggcagagtg tccagagccc catcataatt gttctctttc
   105301 tctctccttc cctcctccca ccttgctctc caccaaaagg aagcaaactg acattgcttt
   105361 tgattgaagc aaaattgttc attaaaatgc aagcacccag agaaaggagc tcattaactt
   105421 actcagagtt aacatctaaa tatgtggttt attgggtctt ctaaaagctg ttattttaga
   105481 gaacctcggg taagaaatcc aaaactaatt gaaagaaaaa tatgtaggtg gaaaagaata
   105541 agagattagc agtaccattt acttcttttt gtttgtttgt tttgagacag ggtttccttc
   105601 tgtcactcag actgaagtgc agtggtgtga tcatggctca ctgtagcctc aacctcccag
   105661 gctcaagcga tcctcccatc tcagcccctc aagtagctgg gactacagga gcgcaccatg
   105721 cctggctaat ttttattttt attttttgta gagtcagggc ctcactatat tgcctaggtt
   105781 ggtttctaac tcctgggctc aagcgatcct tctgcctctg cctcccaaag tgctggaatc
   105841 acagatgtga gccactgtgc ctggccacca tttacttcct aataaatttt tactttcttc
   105901 attgatcaca acctggggta gctgctcaga atgctttgat tcctctcact taccctgacc
   105961 tctatggtga tacatgtgtt ttacctctaa ctaatttagg atttttgcaa tttttttaaa
   106021 acaagcaaac taaaggaaaa tacaacagca taggaacaat aattttacaa ggcacatatt
   106081 tcatattcta acgatgcaag actagccatg aattaacttg agtggctcaa tttatagaac
   106141 actctatctc tagttctttc ttctccttca actttgaaac tcacaagtga aatggagaaa
   106201 cttcctgatc accagggctc tctcacacaa cactgcttct caagcaattg gcttattgga
   106261 gagtccagtg cagtcaatgc atagcgtgag caataaaagt ctgtgttaaa cacctatttg
   106321 cttagttcag tgtttgtcta agtaggtggc tctatgatca cttagaaaag cactgtgggg
   106381 gctgaacacg agggctcatg cctataatcc caggactttg ggaggtggag gtaggaggat
   106441 cacttgagct caggagcttg agatcagctt gggcaacatg gcaaaaccca tctctgctaa
   106501 aaatacaaag aaaaaaaaag tggtggtgtg tgcctgtagt caccgtgtag gctgagatgg
   106561 gaggatcact tgagcccaag aggcagaggt tgcagtgagc ccagatcgca ccactgcagt
   106621 ccagcctggg caacagaggg agaccctgtc tcaaacaaaa caaaacaaaa caaaacaaaa
   106681 accactgtgg agaatccagt tgtccctcag tatccacaag ggattggttc caggacctct
   106741 gagcatacca aagtcaagaa ttttcaagtt ccttatataa attactgagt attcgcatat
   106801 aacctacaca tcctcctata cacttttttt tttttttttt tttaagacag ggtctcactc
   106861 tatcgcccag gctggagtgc agtggcaaga tctcagctca ctgcaacctc tgcctcccag
   106921 gttcaagtga ttctcctgcc tcaccctccc gagtagctgg gattacaggc gcccaccacc
   106981 acgccaggct aatttttgta ttttcagtag agacgggact tcaccatgtt ggccaagctg
   107041 gtctcgaact cctgacctca agtgatctgc ctcccttgac ctggaaaagt gctgggatta
   107101 caggcatgag ccactacgcc cggccctccc ttatacttta cctttttttt tttttttgag
   107161 atggagtctc gctctttagc tcaggctgga gtgccatggt gtgatctcag ctcactgcaa
   107221 cctctgcctc ccaggttcaa gtgattctcc tgcctcactc tcctgagtaa ctgggattat
   107281 aggcatgtgc caccaggccc agttaatttt tgtattttta gtagagacgg ggtttcatca
   107341 tgttggccag gctgatctgg aactcctggc ctcaggtgat gcgcctgcct cagcctccca
   107401 aagtaatggt attataggcg tgagtgcctg gcctattttt tattttcatt tttttctgga
   107461 tatttttgat ccacggttgg tggaatctgc agatgtggaa gctgtggata tggagggcca
   107521 actgtataaa taaaatataa tatgtgatcc ttgcccttaa ggaacttata atttagttgg
   107581 gaagacaaga ataacaaaaa taactagaaa gtaacacgtg ggactgtaag attcactgtc
   107641 aaattctgta gctagactaa gcatttaaag agctctacct agggttcaga ggcttgggat
   107701 ggtaatactc acttgatatt tcacttcagt aacacaccct ttctctctcc agtccacaga
   107761 atcaggcaat atccgattag ggtttgactt atatgtgata tttctctgcc actggctggg
   107821 aactctcagg gaactcatca aagacatcac ttcttcactg gtctacaaag caaatataca
   107881 gtcaggcatt gtttaatgac tggaatatgt tctgagaaat gcatcgttag gtgattttgt
   107941 cattgtgaaa acaccatagt gtgtacttac acaaacctag agcgtattgg ctcctacata
   108001 cctaggctgt atggtagggc ccattgctcc taaactacaa acctgcacgg catggtactg
   108061 tactgaaggt tgtaggcaat tgtagaacaa tatgaagtac ttgtgtatct aaatatatct
   108121 aaacatagaa aaggtagagt gaaaatacag tataaaaagt aaaaatggga gctgggcatg
   108181 gtggctcacg cctgcaatcc cagcactttg ggaggccgag ccgggcaaat cacctgaggt
   108241 caggagttca agaccagcct ggccaacatg gtgaaacccc atctctacta aaaatacaaa
   108301 aattagctgg gcatggtggt gcatgcctgt aatcccagct atttgggagg ctgaggcagg
   108361 agaatcactt gagcctggga ggcggaggtt gcagtgagct gagatcgcgc cagtgcactc
   108421 cagcctgggc gacagagtga gactccatct caaagaaaaa aaaaaaaagt aaaaatggta
   108481 cttctatata ggtcatttac tatgaaaggc ctggagttgc tctgggtgag tcagtgagtg
   108541 agtggagagt gaatgtgaag gcccaggaca ttactgtaga ctttataaac acggtgcact
   108601 ttggctacac taaatttgtt aaaacatttt tctttcttca ataataaatt aaccttagtt
   108661 taatgcaact tttttactta ataaactttt aattttttaa atgttttgtc ttctatggta
   108721 accccttaaa atacaaacac attgtacagc tgtacgaaaa tattttcttc tttatcacct
   108781 tcttctataa gctttttcta tttttaattt tttaaaagtt tttacacttt tttgttaaaa
   108841 actaaaatac aaacacacac acacattagc ctaggcctat acaaggtcag gatcatcagt
   108901 atcactgcat ttcgtctcca cctcttgtcc cactggaagg tcttcagggg caacaacacg
   108961 tatggagctg tcatctcctc tgataacaat gttttctact ggatgcctcc tgaaggacct
   109021 gcctgaggct gttttacagt taacttttta aaagtataag taggagtaca ctctaaaata
   109081 acaattaaaa gtatagtata gtaaatacat aaaccagtaa catagtcatt tattattatt
   109141 aagcattttg tactgtgcaa aattttatgt gctatacttt cataccactg gcagagcagt
   109201 aggtttgttt acagcagcat caccacagac aagtgagtag tgtgttgtgc tacattgtta
   109261 tggtggctac aacatcacta gctaggcaag aggaactttt cagctccatc ataatcttaa
   109321 aggaccacca tggtatatgt ggtccatcag tggccaaaat gcatatgaca catgactata
   109381 tatctctttc acttacatcc ttctttcttt tctttctttc tttttttttt ttttttttct
   109441 cttttgagac ggagtttcac tctttttgcc caggctggag tgcaatggca tgacctcggc
   109501 tcactgcaat ctctgcctcc cgggttcaag caattctctt gcctcagcct cccaaatagc
   109561 tgagattaca ggcatgtgcc accacgccca gctaattttg tatttttagt agagacagga
   109621 tttctctatg ttggtcaggc tagtctcaaa ctcccgacct caggtgatct gcccaccttg
   109681 gcctcccaaa gtgctgggat tacaggtgtg agccactgtg cccagcctat cctactttct
   109741 aatatcctaa taactgtaac tgtgaatata tttaaaaact gccatggaat aaaaaaactg
   109801 agaaaaaaca gtaatagttg tgaccaacac aggttccatt aggatctggt ggggcgaatt
   109861 atctgggttt gtttagtttc acacttggca ctagcatccc agctaggcac cctcatgggg
   109921 aaggagaaaa agagaatggt aaaaactaga agttacaggg aggcagattt ttgtttcaaa
   109981 gcaaggcaga attttataaa agggctgtct aaaaagtgga ctaaactggt ttcccatgtg
   110041 agaaataatt acaaaggaga ttcctgcatt taagaaggaa ttcagtctac ctagggcgtt
   110101 cccttcttca ctttggcaac agtggtaaag ttttgcagga cagaacagaa tctgaaaacc
   110161 aataggggat tatttgtgat attaaatacc cacattctcc cattaagaaa gcagatgggt
   110221 catactaaat tgtcaatcta gacttaaata agtatggcaa cttgctataa aaacctgtca
   110281 atagtcttta ggtaatactt tctagaattt attaaaaaaa aaacaatggt agtctctgag
   110341 tttaaaattt cctctaaaga agtgatttgt taaagaaaga aagcagtcaa aatagaacta
   110401 agtaagaaat ttctgtacac gttcttacgt ccttcaattc tactgttttc caactttgta
   110461 ctatacattc ctttgttatt gttgtttacc ttcaaattgc ttttggggag acagaaagga
   110521 aatgatattt acccagacgt gaaagtggga tttcttgtaa tgtacctacc atgtctccca
   110581 ggtggttcat gcccagatcg tatgagtgca ttcccattga atgctccagg ttgtgaagca
   110641 tcacaaactt tagattcttt tcccagatga gacgtcgtac tgcttcttca ttctaaaaca
   110701 taatgaagaa gaacatagtc atactgcatc ctggacaaat aagtgtttga tgatgaaaat
   110761 ggcaattttt tttttttttg agatggagtc tcactctgtc acccaagctg gagtccaatg
   110821 gcacgatctc agcttacttc aacctccgat tctcaggttc aagtgattct cctgcttcaa
   110881 ccttccgagt atctgggatt acaggcatgt gccaccacgc ctagctaatt tttgtatttt
   110941 tagtagagac ggggtttcgt catgttggcc aggctggtct caaactgttg acctcagggg
   111001 atctgcctgc ctcagcctcc caaagtactg ggattacagg tgtgagccac tgcacccgcc
   111061 caaaaatagc atttttgaag cattctacca aatgtggatc tgcccgagag catgcaaata
   111121 cataaatata tccaatataa ctgctaactt cacatattaa tcttaattct tgctcttaac
   111181 tgctttgatg gtttaatatt attttatgat aaacatgaat ttccttctct ttttaacatt
   111241 tattattatt attattatta ttttgacaga gtcttgctct gtcactcggc tggatggagt
   111301 gcagtgaagt ggcctgatta ctgctcacca tagccttgac ctcctgggac ccagcaatct
   111361 tcacacctca gcctcctgca tagctgggat gactggtgtg tgccactgca gccagttttt
   111421 ctgtttgttt gtttgtttgt tttggtagag actgggtctc cctatgttgc aggctcatat
   111481 tgaattcctg ggctcaagca atcctcccct atcagcctcc caaagtcctg cgattattgg
   111541 tgtaagccac cgcgcccagc attttttttt ttttttttta gacagggtct tgctcttttg
   111601 cccaggctgg agtgcagtgg tatgatcata gctcactaca gcctcaaact cttgggctca
   111661 aactatcctc ccatctcagc tgcccgagta gctaggactg caagcctgca ccaccacaac
   111721 cagcttattt ttatttttat tttattatta tttttttttt ttgtagagac agggtctccc
   111781 tatgttgcct aggctggtct caaattcctg gccttaaagg atccttcccc cttggcctcc
   111841 caaagtgctg ggattacagg catgaaccat acacccagcc atgatttgct ctttcttttt
   111901 gatgttttac aacttcgtta catgtatctg ggtgtaggtt tctttattta ccttcggctg
   111961 tgtgcttttg attggaggac tcaggacctt ttctcatatc tggaaaattc tcaatcagtg
   112021 tctccacata tattgcttct ccaccattcc acctaattct cttcttcaga aactcctatc
   112081 cgaagtattt tggagcctca aactctcttt catctctctt acttgcgttt tcatatattt
   112141 ttctttttat ttctaggtgc tgcattctgg gtgaattcct cagtactatc tttcagatta
   112201 ctgattctat ccagctcttt ccagtctgga aatccttttg atttttatcg caatgacaat
   112261 attttttatt ttcaagattt ctaatttatt tcttgtccat tagtgtgcat attattcctt
   112321 tatatatagt cttttgaaaa ttcatttttg attctattat tttaatttta ttcagagtga
   112381 atttacatgc tgaccgacga ttttcttgtc cttcaggaga ttcatttaca tagaatacaa
   112441 tgatgtagaa ggcaaggtgg atggtgcctc ctacagtagc tgaagccaga gtttccaata
   112501 agctggaacc ccagggagca gtcagcaaca actgtgttct gcctcatttc attgagcggg
   112561 tgagcccagt cccagaggca ctgggtagca agtctggatg caggtttctt tcccttggag
   112621 caaggtttta ctgttacccc agtaatggac ataatgggca tgcatgatgg gcttacttcc
   112681 aaaccacaag atgagaagca atggtataaa ggcaaacaaa tctaactaag tgcaagaaag
   112741 gaaaaaaaaa aaaagaagtg cactttgaga tcaattataa aggtaggata cgaatcaaaa
   112801 gtcaggttca ttcatcctga gggactggaa ttattaaaca gagggatagc attcatttaa
   112861 ccagatagtg gtgctaattg cacatctaag tccagtcaat cttgaaatga ggaacctgtt
   112921 cataaagggt gtgggtgagg gtgaaattca tcaccaccaa cgaaaccgta tgatcattta
   112981 aatagatgca ggaaaagcat ttgacagaat tcaacatcct ttcataacaa acattctcaa
   113041 caaattacat atagaaggaa tgttcctcaa cacaaatagg ccatatatgg caagactaca
   113101 gctaacacca tactcaattg ggtgaaaatt taaaggccat tctgctaaga tcagaaaaag
   113161 acaagggtac ccactctcac catttctagt caatatagtg ctgaaaatcc tagccagagc
   113221 aattaggcaa gaaaaataaa tacaagatct ccaaataaga gaaggaagaa gtgaaattgc
   113281 gtttgtttgt tgataacatg atcatataga aagtcttggc caggtgtggt ggctcatgcc
   113341 tgtactttgg gaggttgagg tgggaggatt gtgtgaatct aggagttcaa caccagcctg
   113401 gacaacatgg caagacccct tctctacaaa aaacaaaaat agaaattagc caggtgtggt
   113461 ggcatgtgcc tacagtccca gttacttgga aggctgaggc aagagaatca cttgagctgg
   113521 agtttgaggc tgcagtaagc tatgatcaca ccacttcact ccaacctggg tgacagagca
   113581 agatctcatc tcaaaaaaaa aaaaaaagaa ggttgggcaa ggcggcttat gcttgtaatc
   113641 ccagcactct gggaggccaa gatgggttga tcatgaggtc agaagttcga gaccagcctg
   113701 accaacatgg tgaaacccca tctctactaa aaacacaaaa attagctagg tgtggtggcg
   113761 tgtgccggta atcccagcta ctcaggaggc tgaggcagga gaattgcttg aacctgggag
   113821 gcggaggttg cagtgagccg agattgcgcc actgcactcc agcctgggcg acagagcatg
   113881 aatccgcctc aaaaaaaaaa aaaaaaaaag agagaagaaa gactccatca aaaaactatt
   113941 agaactgata catgaattca gtaaagttgt aagacacaaa gtcaacatac aaaagtcagt
   114001 agtattttta tacactaaca actatttgaa aaagaaatta ggaaaacaat tctatttaca
   114061 acagcattta aaacatattt aggagttaat ttaaccaagg agttgaaaag ttatatgctg
   114121 aaaactgtaa aacattgata aaagaaactg aagacaaaat aggccgggcg tggtggctca
   114181 cacctgtaat cccagcacta tgggaggccg aagcaggcag atcacctgag gtcgggagtt
   114241 tgagaccagc ctgaccaaca tggaaaaacc ccgactctac taaaaataca aaattacccg
   114301 ggcgtggtgg cacatgcctg taatcccagc tactcaggag gctgaggcag gagaattgct
   114361 tgaaccccag aagcagaggt tgtggtgagc agaggtcacg ccactgcacc ccagcctggg
   114421 caacaagagc aaaattccat ctcaaaaaaa caagaaaaaa agaaaaaaga aagaaagaaa
   114481 tttaagataa aataaataaa tgggaagata ttccatgctt atggattgga agaattaata
   114541 ttgctaaaat gttcatacta tccaaagcaa actagaaatt cagtgcaatt tctatcaaaa
   114601 ttctaatgtt attcttcaca gaaatagaaa aaacaatcct caaattcatg cggaactaca
   114661 aaagtcctta aatagtcaaa gtaatcttta gcaagaggaa caaagctgga ggcattacaa
   114721 aatgcctggt ttcaaaatat attacaaaac tatagtaatt taaacagcat agtactggca
   114781 tagaaacaga cacatttacc aatggaatag gatatagagc ccagaaataa cctgcaactt
   114841 tatggtcaat caattttcag caaaggtgcc aagaacacac aatggggaaa ggatagtctg
   114901 tttaataaat agtattggga aaactggcta tccaaatgca gaaaaatgaa attgaaccct
   114961 tatgtcaccc catatacaat aatcaactca aaatgaatta aagcacaaac ataaggcctg
   115021 aaactatata caaccactat aagaaaatat aggggaaaac tccacaacat tggtccgggc
   115081 aacggttttt tgaatatgac ccctaaagaa caggcaacaa tagcaaaaat agacgaacgg
   115141 gactccatca aactaaaaag ctttcgtaca ataaaggaaa caattaacag agtgaaaaga
   115201 taactcatag attgggagaa aatatttgca aatcatacat ttgataaggg gctaatatcc
   115261 caaatataga aggaactcaa agtaactatc aagaaaacaa ccctgtttaa aaatgggcag
   115321 aagacctgaa tagacatttc tcaaaacaag acacatacaa atggccaaca gatatatgaa
   115381 aaaatgctta acatcaccag tcatcaggga aatgtaaagt aaaaccacaa tgagatatca
   115441 cctcacatct gttagaatgg ctattatcaa aacgatgaaa gataagtgtt ggtgaggatg
   115501 tggagaaaag ggaaccctca tacactttta gtaggaatgt aaattagtac agccatttgg
   115561 gaaaacagta tggaggttcc tcaaaaaact aaaaatataa ttaccatatc atccagcaat
   115621 ctcacttctg ggtatatgtc ccaaggaagc gcaatcagta tgtcaatgag atagctgcac
   115681 ttccatgttc attgtagcat tattcacaat agccaagata cggaatcaac cttataaagt
   115741 gtccattaaa agacgaatgg ataaaaaagt gtggtacata ttgtattcca tacataatgg
   115801 aatactattt agccttaaaa aaaaaaaaag gaaattctgt cattttcaac aacatggatg
   115861 ggtctagaga acgttatgct aagtgaaata agccaggcac agaaagacaa gtactgcata
   115921 atctcattta catgtggaat ctaaaacagt tgaactcata aaagtagaga gtagaatggt
   115981 ggttagcaga ggctgggcgg agggtagaca gggaaagggg agatgttagg caaagagtac
   116041 agagtttcat ttgacaagaa gactacattc tagcaatcta ttgcatagca tggtgactac
   116101 agataataat aataatgtat tttatatgtc aaaattactg aaagggtaga ttttaaatgt
   116161 tcttatcaca atgaaaggat aagtatgtga ggtgctgaat aggttaatta gcctaattta
   116221 atcattctgc aatgtatatt tgtatcataa catcacattg tactccataa atagacacaa
   116281 ttattattta attataaata aataaaaata aaacttagag tgtatgacct aagagtgtca
   116341 tttttataca aaataatatg tcaaataaat aaatatatga tgctagctta caaacagaaa
   116401 tattacatag ttacaggact tagttattta aagcagtagg gttaaggtct taacatcagc
   116461 aagtgtgctt tgaaataaaa acaaaagtac aaaatgggaa gtgatagtca agaaacaata
   116521 tctggccaca ctatgtgggt attaacgcct tttcgctcct cagccatccc aaatgtggcc
   116581 cttcctcagg agcttgtgct gatttgccct tggcctgaga ccttcactct attatgtatt
   116641 ttgcttgctc cctggcttcc atcaattctc tgctcaaatg gcacccttat ccgtgaggcc
   116701 ttccctaacc ctcctgtcat atagatagac ttcttaccac caccaccacc cagggccctt
   116761 cctactctcc atgccttctg ttatttttat ctacaacact tatcaccatc tagcatacta
   116821 catatctgct tgtttatttt cttatttcct gtctctctca ctataatata agctccataa
   116881 caggaatttt gttttgattc ctgctgtatg gtaccaagaa cagtatctgg caagaaacaa
   116941 gtacataata tttatttatt tatttttgag atggagtctc actctgtcac ccaggctgga
   117001 atgcaatggc acaatcttgg ctcactgcaa cctccgcctt cccggctcaa gtgattctct
   117061 tgcctcagcc tcccgagtag ctgggattac aggcgcccac caccacatcc agctaatttt
   117121 tgtattttta gtagagatgg ggtttcgcca tgttggctag gctggtctcg aactcctgac
   117181 ctcaggtgat ccgctcacct tggccttcca aagggctagg attacaggcg cgagccacca
   117241 cacccagctt aaatatttac tgaaaacact gcctgtaaga acacaatttt aagaggtaga
   117301 aaacagtact cctttaaaag catatcgctc gagtggcaat ccaatctacc ttttccttgt
   117361 attgtttgcc ataggttttc ttccagagat gccagtggtg atccagggta ggatctttat
   117421 gcaactgtgc cactgcagag gagcacacca agagcacaca aaccagccgt ttcattctgt
   117481 gtaaggaaag gtaacatagg aagtgttgct attagctgca tatgttgtga ccacatgttt
   117541 tcataagaga aaataacaat gcaaagtcta tcaaagggac agactattat aaacagaatt
   117601 atacaggaaa attcctggga tatatagtca cagggaacag ctgcagattt atgtcagaaa
   117661 tactacttcc aatgcttagt agcactttaa aaagtgattt ccaggaattt cccaatatct
   117721 ggtaaaccaa acacatctag agggttagaa aacagggact ttccaggatt tgccaacaaa
   117781 tctaatcaaa ggaaatctct ttctctctct ctcaaaaaaa aaaaaaaatc atccttaaga
   117841 gagacaaacc tctgaaaaat cagtttatct cctgaatttt tgaatacatg ggctcttctt
   117901 aactgacctg tttaaatatc cacagtcttt acagatagtt aaaagacaat tctcatctat
   117961 tgctttcaca tctttaggta ggtaattttc ctttttacgt tttttttttt ttaatgatca
   118021 gacatctgtt tcaaatgagg caattaacta ctctttaaaa tgccacttct gtatttaatt
   118081 caagagacat caacagaaat tcagcgacta gctccctcaa actggattct actttctttt
   118141 ccatttcttt ttaattgaaa acatctggaa ttttaaacat gtaagccttt aaattatatt
   118201 ccaaagaaag ccatgtttca gtctcaactt agatgcttag atgctcaagc agacattaac
   118261 tttttaaaaa tagagaagag gcaaattgaa cgacacgatt ttttagttta cttcctgaaa
   118321 ccaaagattt aaaacctagc aggcagaaca agttacaaac acagagtgtt tgaagaccaa
   118381 atgggagaaa aagaacaaag agtacatacg tgatagaacc agcagttgct cccacagtaa
   118441 gagtccttga attagtgggc tctcttctaa gatttcaaag caacaaggag atttcagttc
   118501 aattgacttg aaaagaaatt ggaacttgtc acatgaggta ctcaaaatga ggaagtgaac
   118561 tttcatttca gtttcattta aaatctagct agtacagtca cctctagtca tttccctatg
   118621 gtcttgggga caaagtggat acagtaagcc atcaagaaga gcccttgtct gagcccagtg
   118681 gctcatgcct ataatcccag cactttggga ggctgaggcg ggaggatcac ttgaggccag
   118741 gagttcaaga ccagcctagg caatatagtg agaccctgtc tctgcaaaaa aatttttttt
   118801 aattagctgg gcatggtggt gcatgcttgt agtcccagct gagggggagg gtgaggtggg
   118861 agaattgctt gagtctagga ggacaaggct gcagtgagcc atgattatgc cactgtgttt
   118921 cagcctggat gacatatcaa gaccctatct caaaaaaaaa attaaattga attttaaaaa
   118981 caccaacttt tacatggaaa acatttagtt ataaagagat ttagataaag ctccccaaat
   119041 cccagctcct tgtatgattt cctaggaaaa cctgtcttac tgcttttaac acaactcatt
   119101 catgatctga tattgtctgg ttctcactca gccattcctt cgttcatcaa actcaaaatg
   119161 tctttattaa tatatactat atgtatagtt tagtaatatg cgtgtgtgtg tgtgtgtgtg
   119221 tgtgtatgtg ttagtatatt catggaaaaa aggatacact caaatatatc caattgagaa
   119281 tgattacctg ggtggggtgg gagcaagacg aactatcatc ttttaaattt gcattttcct
   119341 aattactaat gaggttgagg atcttttcac acatttattg accgtttgaa ttttctgtga
   119401 atttcctgtt catctttttt gctcattttt ctattgtatt gttgcatttt cttagtgatt
   119461 tgtgggaggt gtgtgtgtgt gtgtgtgttg tgtgttttcc gagacggagt cgtgctttgt
   119521 ctcccatgct ggagtgcatt ggtgcgatct tggctcactg caacatccac ctcccaggtt
   119581 caagcgattc tctgcctcag actcccaagt agctgggatt acaggcacct gccagcatgt
   119641 ccagctaatt tttgtatttt tagtagagac agggtttcac catcttggcc aggctggtct
   119701 tgaactcttc acctcgtgat ccacccgcct cagcctccca aagtgctggg attacaggtg
   119761 tgagccaccg tgcccggcca aggtgtatgt gttttaaaaa tatattctgg cttctaatct
   119821 ttgacagtta tataccctta aaaacatcct cccttagtct tccacttgtc ttttatttta
   119881 ttttttcttc ctttgcataa gcctatgaaa ttccacttgt cctttagtct tgtttatagt
   119941 gagtttttct atatgaaatt tttttatgtt gaggtagaac attttattca gcttttccgt
   120001 ataatcaatc tatactgttt ttgtataaga aaattttcct catccaaatg taataaagat
   120061 acactcatct tatttttata ctttcagaga gtgttttatt tatcttttct atggctactg
   120121 cttttaggtg tttaattcac ctggaattta gttttgtgta tggtatgacg tagggagctt
   120181 attttctttg ttttccaatt ttgattctac ctctattgta caccaaattc tcatacatag
   120241 ataggtcagt ttctgagctc atgaagctgt tctttttatc ttttgctact tcagtatcac
   120301 acagttttaa ttaaaacagc tttgtaatat gtctccacgt atggtaaggc aagtcatcta
   120361 ggccacccta tctttgttct tctttttaaa cattgtcttt gtattttgta cttgttaact
   120421 cttctttata tgaatttttt gaatcagttt atcaagttcc ctgaaaaact ctgttggagt
   120481 tttttgcttt tttcttttgg agacagggtt tcattctgtc acccaggctg gagtgcagtg
   120541 atgcaatcat ggctcactgc agccttgacc tcctgggctc aagtgatcct cccgcctcag
   120601 cctctggagt ggctgggact acaggtgcac accaccatgc ctggctgatt ttgttttttg
   120661 ttttttgtag ataaggggtt tcaccatgtt gcccaggctg gcctcaaact cctgggctca
   120721 agcaatctgc ccaccttggc ctcctgaagt gctgggatta caggcatgag cccccacatc
   120781 tggctattag agtttttatt tgaatatatg cataaattta gggagaagta acatcattat
   120841 aatgttgatt catctcagct gtagtagaca tgtttgtgtt tgtagccata cagttttttt
   120901 taaaaagata ctgcttatgg aaaatttaaa acacatgtaa aagtaaaaat agtagtaaaa
   120961 tgagctgccg tgaaaccatt gtcttgtttc aacaatgatc agtacatggc caatcttatt
   121021 ttatttatat tcccactctc actacctgaa ttatattaaa taaattctag acattctgtc
   121081 atttcatacg taaatgtccc tgtgtgtgtc tttaaaagac agagactatt aaacaaatat
   121141 aataccacaa taccatcatc acatctaaaa taaatgaaca ataattttct tttttttttt
   121201 tttctgagac agagtctcac tcacttgccc aggctggagt gcagtggtgt gatcccagct
   121261 cactgcaatc tctgcctgcc gagttcaagc gattctcctg cctcagcctc ccaagtggct
   121321 gggattacag gcatgtgcca tcacacccag ctaatttttg tatttttagt agagatgggg
   121381 tttcaccaca ttggccaggc ttgtctcgaa ctcctgacca taggtaatct gcccacctca
   121441 gcctccctaa gtactgggtt tacaggtgtg agccacttta ttgtcataag aaattaagac
   121501 aataatttct taatgtcatc agacatccta tcaatgttca cattttctgg actgtcttat
   121561 acatttttta atagtttgtt tgaattgaga tttaaataag ctccatatat tgtgaaacca
   121621 ttgagttatg tttatatcat tatgaattca tggatttaaa catatttgat gtgtttcaat
   121681 ccattccagt tttatcctta ttgatactca tattgtctca tttttggtct gtaggaattt
   121741 attcaaactg gctctttggt gcagacagag tttgtatatt tggcataata ttagtactct
   121801 ttaaaagctt cctttctttc taatatgaca ggctattgca ggtttgtctt gaacaattcc
   121861 tgccccagac ctggaatcag ccatttctcc aaggagctct ggttcctttt agtggaaaca
   121921 atgtttaaat accataatct aggaactagc agtgtttggg atagcagatt gaccattgtt
   121981 gctaagcctt ttcagtagct agagctaaga aatatctttt ttaaaaaaat taaataaatc
   122041 acaagctgac actgatactt caaattcaaa ttcaagacta tagggtttac tcaccccatc
   122101 aatcttatat ctgcacttct tttcaaacat gcttaaaaaa tctgggttct ctaaaatatc
   122161 aactcaagta cttgtttaat ttatcctgta aaattgcaga ataacaacaa caagcctacg
   122221 ccaataagat tgttaatgaa aacaaattag aggttttaaa gtggaggggg acaggttttt
   122281 ctccattaag ccaaatgtcg ctaacgatgt aagtcaatta ctgtgttttg aagtcacctg
   122341 gaatagttcc ctgtatggtt atgctccacc tccatacaca ggtttatttg tttcatttca
   122401 cttcttcctt ttagggcttg ctttttacac ttatgtaatt tcattttata attatgtaaa
   122461 acatttatgt ggatccaaag tgaaatcatc acaatgaggt atattcaaag aactctagcc
   122521 tctacttttg tctgttttac acgatttcct tttttcctta attggtaatc atttaaaaaa
   122581 tttatggttt atcttcctat ttttagaaat atgcaaatat gcctatatat taatattctt
   122641 tcatcttaag tgaacactaa catattacac atatttttct ctatcttgtt tttttcactt
   122701 agcatttttt ttgttttagt caatcactta acattttatt aagaaaaatt attcaggcga
   122761 aagacattag caggtgtcct ttcagtgatg tgctgggtat tatttagtgg acctttacag
   122821 gaattgtttg attcttttta gagttaagtt gtaaaaaact gataactatg caaatgatcc
   122881 ctatacatgg caatgcaaat aaattttgta ttgtagattt cactgaatag agaaatagag
   122941 aatgtagatc tctgtcagtc gaattctttt tttttttttt ttgagacacg gccttacttc
   123001 atcgcccacg ctggagtgca gtggtgcgat ctcagctcac tctaacctcc gcctcccgga
   123061 tttaagcgat tctcctgcct cagcctcccc agtagctgag attacaaatg cacaccacca
   123121 cacccagcta atttttgcat ttttagtaga tacagggttt caccgtgttg gccaggccgg
   123181 tctcgaactc ctgacctcag gtgatccacc cacctcggct tcccagagtg ctgggattac
   123241 aggcatgagc caccatgccc ggcctcaaat tcttatttga actgacttta atcttactga
   123301 tcccttatta attaatgagt taaacttcga tgcgttcgtt gtatatttgc tctcagcttt
   123361 ttagaaacta tactttagca attaaattct gataactcca cagtagtata tacagtgttc
   123421 atttattttt atggccacct agcatctttt gaacaatatt cccatgtttg gagaatttcc
   123481 cacgttataa attttgcttc tccaagttga agccaaagct ccattttcaa ggatccagtg
   123541 cagttaaggc acagacatga gacctagcct ttgccagtca ggcacacctg tatagaacat
   123601 cattttgcaa gtgggtaagt gaggaaccag aagtatgtaa aatcctcctt ttggtgagaa
   123661 tggcagtaga gatgtctccc agcagcagca gcaccagcag gagcaggagc agcagcagat
   123721 attctaatat ctggtcccca gtgtcacctg ttcaaactgt agtgtttatg tcaaatattg
   123781 tgtctgctca gtggcatcat aggttgagca gtcctctctg tggttttggt gatgtttgtg
   123841 gttacacaat aaacaaacct caccgacttt ctggcctttc ataggttctt tgagtttccg
   123901 gcactttttt tttttttttt tgagatggag tttcactctt gttgtccagg atggagtgca
   123961 atggcatgat ctcggctcac tgcaacctcc gcctcccggg ttcaagcgat tctcctgcct
   124021 cagcctccca agtagctggg attacaggca tgcgccacca cgccaggcta attttgtatt
   124081 tttagtagag acagggtttc gccacgttgg ccaggctggt ttcaaacccc tgacctcaga
   124141 tgatccgccc gccttggcct cccaaagtgc tgagattaca ggcgcaagcc actgcgcctg
   124201 gctgcttcct gcacatttta aataaactgt agtacccgga tagatgaacc tagtattgac
   124261 atccacagcc gtgttgagtt tggaaaaaca agggtggagt gtcagaagct agaatgggga
   124321 aggtgctttt ctcaggctgc gaatacagga aactcaagaa gaaagctatt gtggagctct
   124381 caataatttc aggttgcctt ctctatcctt ctttacacat ctgatctcat acctgatgtt
   124441 ttttgctttt agccttgata cctctatatg tatttctatc catagaaata gacagttcaa
   124501 tttccttaat gagaaattag cataggaaca aatttctttc catcaagatt aaatggtaga
   124561 caactgcttt tacaagggac catgtatttt tccagtcatt gtttctctgt ctgacgaggt
   124621 ctataaatct acactcagta cagagcataa acgtggtgat taggcaacag cacaccttct
   124681 gtctccattc ctgggtattg cctacatgga attgtgtatg aaattatgtc tctacctgtt
   124741 tccggtaatc ctctaccaag tgggaaaaaa ttttggggtt aattgtgttt tctagtggta
   124801 ctctaaaccg gattcaggac ggcagcggcc atctaaatgt cttcctttgt ttatttctaa
   124861 atccataaat taattccaga tttaccaaga aactaaattg agttgatccc aaaatgaaac
   124921 agggctctca gggacatact cacttcttcc cagcctgcct ttggactcag taagtaagca
   124981 gagactgtgt ttatttgtct gatatacatg acccctggtt aatatgtagg tcgtagaaga
   125041 aaccaaaatt ttacttatgg aaggcaggga gaatagagat gaattacctt ttgtgagaaa
   125101 aagaacaagt gacatattaa ctgctattgt attttctgtt gtcaggacgt gggctgagta
   125161 aaaagcttaa aagcttttgc atatgtcttg gttctgccat ctattggtta taaggcctgg
   125221 aacaagtgat ttaacatgcc catggctcac tttccctatg cataagtacg gaaatgctga
   125281 cttagtggca actaaacagt ggaagacacc agcaatagaa tgtaattata atatcgctcc
   125341 atgagatggt aagattttac agttccaact aattattcct gagtccgttc cttttgcaga
   125401 acaagtttct aataaagtac tcaggattaa gatagatggt tatgagcaag tgctaccatt
   125461 taggatattc atttgaaata ctaatgctcc tggcattttt tttttttttt tttttttttg
   125521 agacagagtc tcgctctgtc acccaggctg gagtgcagtg gcgtgatctc agatcactgc
   125581 aagctctgcc tcctgggttc atgccattct cctgcctcag cctcccaagt agctgggact
   125641 acaggcgcct gccacacgcc tggctaattt tttgtatttt tagtagagac agggtttcac
   125701 catgttagcc aggatggtct cgatctcctg acctcgtgat ccgcccgcct cggcctccca
   125761 aagtgctggg attacaggcg tgagccaccg cgcccagccg cgctcctggc attttttatt
   125821 tttatttatt tatttttttt ttttgagaca gagccttgct ctgtcgccta ggctggagtg
   125881 cagtggcatg atcttggctc acagcaaact cttcctcctg ggttcaagtg attctcagct
   125941 tcctgagtag ctgggattac aggcatgcca ccatgcctgg ctagtttttg tatttttagt
   126001 agagacgggg tttcgccatg ttggccaggc tggtctcaaa ctcctgacct cgggtgatcc
   126061 acccacctca gcctcctaaa gtgctgggat tacaggcatg agccaccatg cctggccaag
   126121 aaaaacatct ttaatacaaa aacacagaaa ggttgacaat aataggaagc taaaagctag
   126181 atcacacaaa cgcaagcaac agcaaaatgt gggtataatt attcaaatat cagtcagagt
   126241 agactttaaa gcaagaagca ttatgaggct aaagagggaa ctttcataat gataaagaaa
   126301 taaatgataa agggacccgc caggaagaga ttataatctt aaatctgtat gctcctcaaa
   126361 acatggcctc agagtcccat aaagcaaaga ttgccagagc taaaaggaaa aatagataaa
   126421 atctgcaatc ataatgggag gctttgagat acctctttca atagcttgta gaataagcaa
   126481 atggaataaa ttaatagcga tatagaagat tcgaataatg cgattaatga acttgtccta
   126541 attaagatat ctagagcctg tcctcaacaa cagaatcctt tttttttttt tttctttttt
   126601 tttcacccag gctggagagc agaggcagga tcacggctca cggcagcttt gaactgctgg
   126661 gcccaagtga tcctcccagc tcagcctccc atgtagctgg gaccccagcc ccaagccact
   126721 gcactggcct cagaatattt attttgaagg gcacatagaa tatttaacaa aattgactct
   126781 gtatgtgctg ggccataaaa tattcaagtc atttagagta tgttctctga ccacagtgag
   126841 attaagctag gcccaataac aaaaatgtaa ttagaaaacc tcccatttaa gtgatatact
   126901 tcttaataac catggttatt aagaagaaga aatcataatg gcaattagaa aatattttga
   126961 actgagcaat aatttttaaa aatgacatat aaaaatgtgt gagatccagc taaaccagtg
   127021 cttagaaaga aatttataac tttttaatgc atatattaga aaagaagagg ccggcgcggt
   127081 ggctcacgcc tgtaatccta gcactttggg aggctgaggt gagcggatca cgaggtcagg
   127141 agattaagac catcctggct aacatggtga aaccccgtct ctactaaaaa tacaaaaaaa
   127201 ttagccgggc atggtagcgg gtgcctgtag tcccagctac tcgggaggct gaggcaggag
   127261 aatggcgtga gcctgggagg cggagcttgc agtgagccgg gatcgcacca ctgcactcca
   127321 gcctgggcgc gagcgagact ccgtctcaaa aaaaaaaaag aaaaagaaaa gaagaatggc
   127381 taagaatcag tggtctaagc tgccacctca agaagcaaag ggaagaacat cagagatgtt
   127441 ttgatgttct ctatttcccg cccttttttt ttgtttgaga tggagtctgg ctctgtcgcc
   127501 caggctggag tgcagtggtg caatcttggc tcactgcaac ctacatctcc tgggttcaag
   127561 tgattcttct gccttagcct cccaagtagc tgagactaca ggcgttcatc accacaccca
   127621 gttaattttg tattttaagt agagatgggg gggggagttc tccatgttca tcaggctggt
   127681 ctcgaactcc tgacctcagt taatccactc gccttggcct cccaaagtgc tgggattaca
   127741 ggcgtgagcc accgcgcttg gccatctttt ggactaaata aagactgttg agctactctg
   127801 gcagtactct ttttagaaaa gcacttgcag caaaattgtg ctttcaataa atgatattaa
   127861 attgtgcatt ttatctttga actctcaaac tctgttattc tttttcaaaa taggaagttt
   127921 ctagcaaata tggaaaagag gaaataccca aatcgatatg ttttaatgaa ttaaatacag
   127981 atattatgca tcttctgtgc caggcattgt acagggcact gaagatcttt cctgcccatt
   128041 attttatttg atcctcccaa caacatatga ggtggtagtg gtagcaatag tatccctatt
   128101 ctatagttta ggaaactggg acttagaaag ccttgtctaa aaccattgtg gaaataagtg
   128161 gtggaactga aacttgaagt ccacaccaac atccttgatg cctgggctag atattgtatt
   128221 tcataccagg ctggagcagc ggtgtctcag agctggcttg tattgacact cgttcacgaa
   128281 agctgcttat acccacttca tcctcactcc atctttagta gcattgcatt ggtagcttaa
   128341 aataagcaaa ctctacaaat caggtctccc ttattttatt ttattttatt ttattttatt
   128401 tatttttttg agacagagtc ttagtctatc acccaggctg aagtgcagtg gcgtgatctt
   128461 agctcactgc agcctccacc ttccaagttc aagtgattct cctgcttcag ccgcccacca
   128521 ccatgcctgg ctaatttttt tttttttttt tttttttttt tttttgagac agagtcttac
   128581 tctgtcaccc aggctggagt gcagtggcgc gatctcagct cactgcaacc tccatcccca
   128641 gggttcaatc aattctcctg cctcagcctc ctgagtagct gggattacag gcatgcgaca
   128701 ccacacctgg ctaatttttt gtatttttag tagagacagg gtttcaccat aatggccagg
   128761 ctggtctcaa actcctgacc tcaggtaatc cgccacctca gcctaccaaa gtgctgggat
   128821 tacaagcgtg agccactgcg cccggcccgc gcccggctaa tttttgtatt tttttgtaga
   128881 gatgggcttt caccacattt gcccggctgg tctccacctt cctgagttca agtgatctgc
   128941 ctgcctcggc ctcccaaagt gctaggttta cagatctgag ccactgcgcc cagcctattt
   129001 attgttttaa tctggagagc cagttgttaa acacttacta gcatgcaact gggctggact
   129061 cagaaggcac ggcatctgct taagttcctt ttctttgaac tttcttattt tgttcaagac
   129121 ttggcattgt aagaagtcgc aatactggct tgaattacta tagtttctca gtttcagatt
   129181 ttacagaatg gtaaaacttg aagaaaatat tagagggcca acttaagggt ttttttttag
   129241 tttatcaaat aagggtattt aaaatgtatt atcttcactc atcatttcaa aattgaagga
   129301 gcattttaac accaaaaaag gataaaaagg aaataatctc agaatttaga agttgattat
   129361 atttagtttt gttaaaaact cactccagct gggtgtggtg gctcatacct gtaatcccag
   129421 cactttggga ggtcgaggca ggcagatcac ttcaggccag gagttcaaga ccagccttgg
   129481 caagatggcg aaacctcgtc tctaataaaa atacaaaaaa ttagctgggc atggtagtgt
   129541 gtgcctgtaa tcccagctac tcgggaggct gaggtgggag gatcacctga gcactggagg
   129601 tcaaggctgc agtgaaccat gatggcacca ctgcactcca ctctgggtga cagtgagaga
   129661 gagaccctgt ctcaaaccaa caaaacagaa caaaaaagtc tcactccatt gtctatattt
   129721 taatcattta actgaagact aatgaaaata ttgtcaagga caaatatccc acaaaacagg
   129781 gatttttgtt tgttcagatg tatctcaagc acctagaaca gtatctagca ctcagtaaat
   129841 attcgatgaa agaatgaagg atttgcagga atctggtgac caaagtttgg caaccagagg
   129901 tgtaagctac ccaacttggc atcttggcta gtcagtcagc taacatattt gaagtattta
   129961 agaagatatg aaagaaaaaa atgcacaaat cctatcttag gttgagaaaa tctagttggg
   130021 agttatagaa gatggatgcc taaacatggc ttaatctcaa ggacaactaa tatcaaatta
   130081 gttgctctac ttttaagaga actatgtagc gcctttattc aagatattag aaatgccttt
   130141 gctagaaacg ggagagtctt ttcacggttc actaattttc cctaggaatc tgcagaacca
   130201 atgagaatga aagcttttga acgtacgatg atgaggcagg attctgaagc ttctatttaa
   130261 ctgtattgtc ggggacattg aaaacatcaa ttgtaactgt accaatgaga tggtgttggt
   130321 agcagcggtg tccctcttct atagatgagg aaactggaac ttagggaggc ttaagcaaga
   130381 tcattgtgga aggaagtggt ggaagtgtgg gcatagtgcc acaaacatta aaaacaggaa
   130441 aggtgagcac agaacaactt tcctttaagt catttctcct tcttacatct acctggtaag
   130501 tcagcccagc cctttggtag aggcacctgt taccaatggc agcaataaca gaagaggcag
   130561 aggaagtgaa gatacctaag gtggtgctaa ggctaaagat gaatgacagt tgcttcattt
   130621 tatttgtaaa gtgcaagtgc aggccagtaa tagccttctt taaccagtgt attttaagtc
   130681 ctatacatac aagcagtttc caacttgcaa aagtgttgtg ttttaaaaga catgtaaaga
   130741 ggctgtttaa aaccaaaaat agaggccggg agcagtgcct catgcctgta atgccagcac
   130801 tttgggaggc tgaggtgggt ggatcacctg aggtcaggag ttcaagacca gtctggccaa
   130861 catggtgaaa ctaaaaatac aaaaatcagc cgggtgtggt ggtgcatgcc tgtaatccca
   130921 gctactcggg aggctgaggc aagagaatca cttgaacctg ggaggcagag gttgcagtga
   130981 gccgagatgg cgccactgca ctccggcctg ggcgacagag attctgtctc aaaaacaaac
   131041 aaaaaaaaaa acaaaaatat actctcctta agaagtatta taaagtggtt atttgtttct
   131101 taaacatgtc aataaaaatt catatgattt agtccataat gtagctgaaa aattatgtgt
   131161 acacaatgga aaacagtact agaaatactg ctgtaaacat aatagacttg acctcaatta
   131221 gacttgttcc agcagctcct gagactgggc ttctattgaa tgagcctgaa tttcattccc
   131281 actgattcaa cagtggtagc caactctcat gtagcagaca ttctgactct ggccaggcac
   131341 tgctctaagc actttacata tattgactca tttcatcacc acaaaaatcc tgcgaggtag
   131401 gtattattct tttctctatt ttacaaaagc cctaacccta gcccaagatc accttgctag
   131461 taaatggaag aggcagaatc aggatccagg ctgcctggca ccagagttca tgctcttaac
   131521 cactatgtta tgttgccttt cactaaaata taattggctt ctttcatatc aacctttaat
   131581 tcaatagaga ggaaagggaa agtaattcct gagggtgttg gtatggcagg caatcatgca
   131641 attctataga gtagatactc agaatttttt tttattgaat gcaagtgata aattattatt
   131701 gctttgagtg atgggggtct tacaaattaa tttttgtaca tttatgatag ctagagtaag
   131761 aaggtattca atagaaagac acatggtatc cagaagcagg tatgaaaaca ataaaataaa
   131821 ataaagatgc agatggaatt ttctactgag gctaggaaga tgctagcctt tctctggtta
   131881 cttaaatatt ccatcattac acttcaaacc gcaatgccat gtaaggtttc aatctgtccc
   131941 agtaagtcct tgctactttg ttaagtctct gagggcttca agtagatact tttgatgtat
   132001 tttccagctt ttctatttgc cctcagtggg aaggctggac tgaattatgt agttcactat
   132061 gctggaaggg caagtccctg agtccctgat tcatgatttt accatcctcc acaacctctg
   132121 aaacagcgcc gtccaacaga aatgcagtgt aaggcacaca gtaaatctaa gttctctggt
   132181 ggccatattt aaaaagtaaa acacgggcca ggcacggtgg ctcacgcctg taataccagc
   132241 actttgagag gctgaggcgg gcggatcacc tgaggtcagg agtttgagac cggcctggac
   132301 aaaatggtga aaccctgcct ttactaaaaa tacaaaaatt agccgggcat ggtggtgggc
   132361 acctgtaatc ccagctactc aggaggtgag gcatgagaat catttgaacc caggaggcag
   132421 aggttgcagt gagccgagat cacgccactg cactccagcc tgggcaacag agcaagactc
   132481 catctgaaaa aaaaaaaagg aaaacaaaac aaaaaagcga gataacatta attttaagcc
   132541 acattttagg ccaggcgcag tggctcacgc ctgtaatccc agcactttgg gaggccaagg
   132601 caggcagatc acggatcacg aggtcaggag ttcgagacca gcctggccaa tatggtgaaa
   132661 ccccgtctct attaaatata tatataaatt agctgggcac ggtggtgcgc acctgttgtc
   132721 ccagctactt gggaggctga ggcagaagaa tcgcttgaat ccaggaggcg gaggttgcag
   132781 taagccaaga tcgcaccact gcactccagc ctgggggata gagcaagact ctgtctcaaa
   132841 aggaaaaaaa aaagaagaaa aaagaaaaac acattttatt taactcaatg ataatatcca
   132901 taacattatc ctttctattt ataatcaata aattattaat gagatatttt tggtatcatc
   132961 ttcaaaattc agttttggag agcaccgttg tcagttggtc caagtgtcca ggtctgaggc
   133021 atccgcccaa accctcccac ttctggcccc tcaaactgga agaacattca tcatgcattt
   133081 tgtgtccaag gctcaaaagt ttgttaccca agatgatgag tgctgacatg gatgcagttg
   133141 atgctgaaaa tcaggtggaa ctagaggaaa acaacacgac ttattaatca agtgtggaaa
   133201 ttccaactca tacttgaaga tctctctgca agagcaaaca caggtaagga agaaaatctg
   133261 aagctaaaat cagaaaacca agttgttgga caacatatag aaaatctcat gtcagcttct
   133321 ggtgtttttc aaacaactga caccgaaagc aaaagaaagt aaggaattga cacacttctg
   133381 ttttacagaa ttgctgctga taattttttc ttttaaactt ggacagattc caaaaagtta
   133441 cagcatcttt gtggcttcat tgaatattta tgaagaaaat gtcaggtgag gcaaaattaa
   133501 cagcattaac aggagacttc cctaagtttg tatattatat tagtctatga aaacatgcag
   133561 tctctccctc tccctccccc tccccctccc tctccccacg gtctccctct ctttccacgg
   133621 tctccctctc atgcggagcc gaagctggac tgtactgctg ccatctcggc tcactgcaac
   133681 ctccctgcct gattctcctg cctcagcctg ccgagtgcct gcgattgcag gcacgcgccg
   133741 ccacgcctga ctggttttgg tggagacggg gtttcgctgt gttggccggg ccagtctcca
   133801 gcccctaacc gcaagtgatc cgccagcctc ggcctcccga ggtgccggga ttgcagacgg
   133861 agtctcgttc actcagtgct caatggtgcc caggctgaag tgcagtggtg tgatctcggc
   133921 tcgctacaac ctccacctcc cagccgcctg ccttggcctc ccaaagtgcc gagattgcag
   133981 cctctgcccg gccgccaccc tgtctgggaa gtgaggagtg tctctgcctg gccgcccatc
   134041 gtctgggatg tgaggagccc ctctgcctgg ctgcccagtc tggaaagtga ggagcgtctc
   134101 cgcccggccg ccatcccatc taggaagtga ggagcgcctc ttcccggccg ccatcccatc
   134161 taggaagtga ggagcgcctc ttcccggccg ccatcccatc taggaagtga ggagcgcctc
   134221 ttcccggccg ccatcccatc taggaagtga ggagcgcctc ttcccggccg ccatcacatc
   134281 taggaagtga ggagcgtctc tgcccggccg cccatcgtct gagatgtggg gagcgcctct
   134341 gccccgccgc cccatctggg atgtgaggag cgcctctgcc cggccgagac cccctctggg
   134401 aggtgaggag cgtctctgcc cggccgccac gtctgagaag tgaggagccc ctccgcccgg
   134461 cagccgcccc gtctgagaag tgaggagcct ctccgcccgg cagccacccc atctgggaag
   134521 tgaggagcgt ctccgcccgg cagccacccc gtccgggagg gaggtgggag gtcagccccc
   134581 tgcccggcca gccgccccgt ccgggaggga ggtggggggg tcagcccccc gcccggccag
   134641 ccgccccgtc cgggaggtga ggggcgcctc tgcccggccg cccctactgg gaagtgagga
   134701 gcccctctgc ccggccacca ccctgtctgg gaggtgtgcc caaccgctca ttgagaacag
   134761 gccaggatga caatggcggc tttgtggaat agaaaggtgg gaaaggtggg gaaaagattg
   134821 agaaatcgga tggttgccgt gtctgtgtag aaagaagtag acatgggaga cttttcattt
   134881 tgttctgtac taagaaaaat tcttctgcct tgggatcctg ttgatctgtg accttacccc
   134941 caaccctgtg ctctctgaaa catgtgctgt gtccactcag ggttaaatgg attaagggcg
   135001 gtgcaagatg tgctttgtta aacagatgct tgaaggcagc atgctcgtta agagtcatca
   135061 ccactcccta atctcaagta cccagggaca caaacactgc ggaaggccgc agggtcctct
   135121 gcctaggaaa accagagacc tttgttcact tgtttatctg ctgaccttcc ctccactatt
   135181 gtcccatgac cctgccaaat ccccctctgt gagaaacacc caagaattat caataaaaaa
   135241 ataaattaaa aaaaaaaaaa aggaaaacat gcaaatgaat tgtagaaact ttatcattac
   135301 agttgcacat attggccagg tgcagtggca catgcctgta atcccagaac tttgggaggc
   135361 ggaggcatgt ggattgcctg aggtcaggag ttcgagacca gcctggcaac atggtgaaac
   135421 cccattccta ctaaaaatac aaaataatta gccaggcgtg gtggcttatg cctgtaatcc
   135481 cagctattcg gcaggccgag gcaggggaat tgcttgaacc tgggaggtgg aggctgcagt
   135541 gagctgaggt tgctccgttg cactccagcc ttggtgacag agcgagactg catttaaaaa
   135601 aaaagaattg catatattta tgaaacttaa agatgaatgt tttatcaaat tttccttgat
   135661 ttgtagattt agcactgtct tttattagag gcttactaag atatacaaga aaaataacca
   135721 cacgttgtga aaaagtgacc ggaatcatac actgaatgcg tagcctcatg taccctgtcc
   135781 gtcatcttat gcctcttctc cacttgcctc ttcctcttta ccttcctcga aggaaagaat
   135841 tggtttcaca tttgtaaaag tcattttaat agttaatcat ctcagagagt aacctgcact
   135901 ttaattgttg aaacttaacc aaaataagat acagaaagta tctgtatctg agaaacagct
   135961 agggcttgtc attttttata tttagtatta agacaagaat gctggtttct ctttaatcca
   136021 tttaaaacag agggaagcta taaaaataaa gatttttctt tgaggctgaa ttgtcaactt
   136081 aggaagattt gttgttaaaa atttgttttt gcacaaagta actcacttca tcttatctgg
   136141 aaagataagt tggtcaagtg tatgtttaaa atacaaaatt tagaggaaaa atagaaatag
   136201 ggtgaaaaag tacttggtaa acagtagtga cgaactgtga atattttcac tccagatttt
   136261 gttatccctg gcacagagta gatcttttgg gaaatatata cggaagtgga ttaagtttga
   136321 ctaactttat gtaagccaca tctagaagag aacagttaca aagagtttgg tctctagatt
   136381 tatttgtacc cagcagatca acttttgcaa aattccttcg cagtgcagta gtattagaat
   136441 tgtgaatgaa ataggagtgt ttcgcatata tgcttattga caatcttctc agtattttat
   136501 cttacttgtt ctctcagaat tttctgtcaa ctcaactact tgatttgcag tcatcctttg
   136561 ttattatcct ttaacaagtt ttatcttatt ttatttattt tttgagacag ggtcttcctc
   136621 tgtcatccag gctggagtgt agtggcacaa tctcggctca ctgcaacctc tgcctcccaa
   136681 gatcaagcaa ttctcccacc tcagcttccc aagtagctgg gactaaaggc acgtgccatc
   136741 acacctggtt aattttttta ttttctgtag cgacggggtt tcatcatgtg gccgaggctg
   136801 gtctcaaact cctagactca agcgatctgc ccaccatggc ctcccaaagt gctgggatta
   136861 caggcatgag ccaccacgcc cagcctggcc tcaaatttta aaaatttata ggttccattt
   136921 ggtaaagaaa tcagtatcag aagtaatgct aaaatcttat aataggaaaa gagatccact
   136981 aatatagcct atggttatta gatttgggct acttttaatc atggaataat cttatgtatt
   137041 ggtgtaagag ttgatgaatg actttacctg tatgaattag aatattcaaa ctgcaaacat
   137101 tttgcatccc ttttgtgacc taatttacag acatttaaat tgtgctgcaa ttctgctttg
   137161 ccatttaata aaaagctgtt tcagaaaaaa aaatccagtt ttgtatttta tacttacagc
   137221 acatctcaat ttgaactttt cagtgtcaca tctcaagtgc tcaatagctt cttgcagtta
   137281 atggctactg cattcgacca cacaggtcta agccctcatc gcttttccaa aagcagagtt
   137341 ctgatcaaaa gatttcctgt tttgaccagg tgcagtggct cgcgcctgta atcccagcac
   137401 tttgggaggc tgaggcggga ggattgcttg agcccaggag ttcatgacca gcctgggtaa
   137461 catggcaagg tcccatctct acaaaaaata caaaatttag ctgggtgtgg tggcatgcgc
   137521 ctgtggtccc agctactctg gaggctgagg taggaggatt gtttgagccc aggaagtcaa
   137581 agctgcagtg agtcgtgatc atgtcactgc actccagcct gggtgataga atgagactcc
   137641 atctcaaaag atatagatag agatagatag atagagatat agataaatat agatatagat
   137701 atggaataat atttttaaaa tatattatta tataattata gatctatttc ctgtttcagc
   137761 actgtggctc acacctttaa tcccagcatt ttgggaagcc aagggaggga ggatcccttg
   137821 aagccaggag ttctcaagac cagcctgggc aacatagtga gaccctgtct ctgcaaaaaa
   137881 taaaagccag acacagtgta cacccctgta gtcccagcta cttgagaagc tgaggcagga
   137941 ggatcccttg agcgcaggag gttgccactg cactccagcc tgggtgacag actaagacct
   138001 tccttctaaa aaaataaaaa caaacaaaaa tattttctgt gtcaaaactc ttcaatactt
   138061 ccatattgcc tactaatttg aaccaaagtc cttattccag tatttcagat gctgtacaaa
   138121 acaaaagcat tgctctatct ttgcagtttc atctttgcta tcctcacaca agaaccttat
   138181 actctaagca actagttttc ctcatatttc cagatcatct atgcttgtca ccctctgcac
   138241 tgttgacttg cttaatgatc ctaatagtcc tagctaacat taattggatg ccaattatat
   138301 gccaatgagt gtgctagact ctttgtgcat tattatttaa tcttcacaac tctatgaggt
   138361 agatattatt atcctcattt tacagaggag aagttaaact tctgagaggc taacttgttc
   138421 aagaggacaa aactgtccat tgacctagaa gtgaaccaac cttgttttta aagtagtgct
   138481 ttcccaaaat tatcgtgttg cctctctctt tctggaattt ccttttctca tttctgctgc
   138541 ttgaaattct gctcctcctt aatggcccat ctctgaaaat agctcttaca aaaagccttt
   138601 ctcggccagg tgcagtggct cacgcctgta atccgagcac tttgggaggc cgaggcgggt
   138661 ggatcacctg aggtcaggag ttcaagacca gcctgaccaa cacggcgaaa ccctgtctct
   138721 actaaaaata caaaattagc cgggtgtggt ggcgcatgcc tgtagtccca cctactcagg
   138781 aggctgaggt gggagaattg cttgaacccg ggagtcggag gttgcagtga gccgagatcg
   138841 cacattgcac tccagcctgg gcaacaagag tgaaactccg tctcaaaaaa aaaaaaaagc
   138901 ctttctttat ccccaactgg atgtcacttt ttgagcttct accaagcttt gaggctttga
   138961 ggctccttgt ggcagacaca tagactgagc gcagaaaaac catcctctta attttctgtt
   139021 taatatgata attatttgta tttttgcctt atgtaactac ctgttagccc ataaagtaca
   139081 ggggccctgt agctcctact cttcttcgtg tcctccacag tatatgatgc gatggtcagc
   139141 agacactaaa taaatatttt ctgatgattt aaattatttc agcaccttca ttccagtctg
   139201 ttgtggaatc tcatcctgca catcactgcc aaactgtgct cctaaaacac atttttatca
   139261 tgctgttttc tagaaatgtt ttctgttgcc tgctgcatga agtcctgcgc ccctgccttc
   139321 caaggctctt tccccatgct ctggatccct ttcctgagca ttttactcaa tttgtttttg
   139381 tttttgtttt aaagaggggg tctcgctctg ttgcccaggc tggagtgcag tggcgcgatc
   139441 tcggctcact gcaaactctg cctccctggt tcaagcaatt ctctgcctca gcctcccgag
   139501 tagctgggat tacaggcaca cgcctccatg ccaatttttt gtagtttagt ggagatgggg
   139561 tttcaccgtg ttgcccaggc tggtgttgaa ctcctgagct caggcaatct gatggctcat
   139621 gcctgtaatc ccagcacttt gggaggccga ggctggtgga tcacttgagg ccaggagttt
   139681 gagaccagtc tggccaacat ggtgaaacca cgtttctact aaaaatataa aaattagtgc
   139741 ctcatctgta gtcccagcta cttgggaggc tgaggcacga gaatcacttg aatccagcag
   139801 gcggagattg cagtgagctg agatcgcacc attgcactac agcctgggtg acagaacaag
   139861 actagtctca aacaacaaca acaacaacaa aaacaaacaa acaaaacccc agattttgaa
   139921 aactcagcct ctgatgtagc catgctggtc ccatggacct ccccagttgc ccaatcattc
   139981 ttgcctcctt tctagaatgg ctctctccat tccttggcct ctttgttctg caagatccac
   140041 ttcaagtact gcctcctcag agccttctct aactacccca actcttcttc cctctagcta
   140101 cacttgcacg atacaaattg taccacgaaa ttctgattct gtatttgtag cacgtaagat
   140161 agttttatct cactagctaa cttgaaaata caggatcttg tcctactcat ctttgtattg
   140221 cactgtggct taacacatat ttagataata atcaataaac attcattgaa ttaacttttc
   140281 ctaattataa tattctacat ggtttgtctt catttctggt tcatcatgac ttggcagtgg
   140341 cttctgaggc tttaagcctt aggtttctct ttgaatctcc tcctggtatg caaaagtctg
   140401 taaacacata cagtgtttga atacattagg ggaacatgtc attagataat ctatccttgt
   140461 atcacttaag gtattttata aacttacaca tcatcctgtg acattccaaa gctgtaaact
   140521 ggatttttgc aaagctgtaa attgccctga aatggggcac actttttgca aatttatttc
   140581 aatagcgatt gaaaacaaaa aaacaaggca aatgttttgc acaacagttg cacaacaact
   140641 aagtcttatc cagaatgctg cacaacacta agtcttatcg ggaatgcaga ctccctttca
   140701 tcacaaacga ggcaacatct cctgaataac gtggaaaagg agaaatgaaa gcaaaataag
   140761 gaagctttgt ccctgcttgt tacagttgta ccttttcaag cttttgaagt aaccatctct
   140821 tagcttcatg ccccctgaaa atcagctgct ttaagtttct ctgtcttatt tagtctgtgt
   140881 gtgtgtgtgt gtgtgagaga gagagagaga gagacagtct tgctctgtcg cccaggctgg
   140941 agtgcagtgg tgcaatctcg gctcactgca acctccacct ccacgttcaa gcaattctcc
   141001 tgcctcagcc tcctgagtag ctgggattac aggcgcgcca tcacgcctgg ctaatttttg
   141061 ttttttgttt ttttttgtaa tagagacagg gttttgctat attggccagg ctggtcttga
   141121 actcctgacc tcaggcgatc tacctgcctc agcctcccaa agtgctggga ttacaggcgt
   141181 gagccactgc gccctaccct tatttagtct taaagaaaag gattttgaat tttaggatga
   141241 ttcattccaa tgctagttct agtctctgtt aatttaccca agtgtgctag gaatttaata
   141301 acatacatta ttttcataat aaaatagttt acttcttact gacctcactg tccaagaaat
   141361 gcacaaaggt gaggtgagga atacccttca tggactatag gagaatatgg gactagaaaa
   141421 gacgatttta ccataatgtt gagtttcccc tcaattaact ttatgttaat attatgcaat
   141481 aactttctga agaaagaaaa cagaattctt gagtcattag ggaaaatagg aatgttctga
   141541 attgaaaata tatattcttt ctgatattct gtgtagctta tctgcttggt gcattatttg
   141601 atcctagtag acatccccca ttatcctaca agtactcctt gtcacctagc aatgtgtact
   141661 caaagaagga gcttaatttt accaagagaa caaaataaga cataattagg gagtactcta
   141721 gaagccagtg ataagacccc tcccctaaca aatgaataac aattggtgtg acacatttga
   141781 tgtgttacaa ttaattacaa agtaagtcac aagaaatgta ttactaataa aaaagtacca
   141841 acagtttgaa agtgacatgt ttaacaatta gagccaattc agcagattat ccctataata
   141901 atattatgtt gatagatgtg cattccagta tgctttctgg gagagagaag gggagaatta
   141961 tgggtagcta ccaacgactg atattatttt gtgagcgtgt ccgtggaaat gtgtcgaaaa
   142021 tgtgaacaac atctgtcttc tctatagtaa tatgggaaag ttgagaatag ctaagctaaa
   142081 aggtgttgga agtatacttt ttttcaccag cttcccttcc tccttctcgt ggtaacagca
   142141 ccctgatttt aaaatgggga aaccctgaca ctcctgggct taagatagaa ccaatcttgc
   142201 ccctagcttc aagagtgggc ataggtccca aggccagcct gtcaaagttt tgtatccctc
   142261 cagctacaat gactggttgg ggaatgagca tgtgacacaa tcacagcctg caagacctgt
   142321 atcacgaggt tctttgttgc aaagaagagt agtcgatttt ggctgactca gaatttggta
   142381 gaagttacaa actgtaaaat gtggaactgg ctgaatggag ggcgtgggga tagggcaatg
   142441 aggactctag tccaaactgg aaattatgaa ttcatgaaat atctgttgcc tattgttttg
   142501 gggtaagacc acaggcccac tggagtgtgg tatgagaaga aatggtagtg ctgggcgcag
   142561 tggctctccc ctgtagtccc agcactttgg gagatagaca ggtggattgc ttgagcctag
   142621 gatttagaga ccagcctgag caacatggca aaacccttct ctacaaaata catgctgggt
   142681 gtggtgacac acacctgtag tcccagctac ccgggaggct gagatgggag gatcacttga
   142741 gtgcaggagg tcgaggctgc agtgagccat gattgtgcca ctgcactcaa gcctgggcaa
   142801 cagagtgagg ccctgtctca aaaaaaaaag aagagaaaga aatgatagga aaaattcagc
   142861 atgttggcat atgttggctc cttttgtagg cttcagaaag tggctcacgt ctgtaatccc
   142921 agcactttgg gaggtcgagg tgtgtggatc acgtgaggtc aggtgttcga gaccagcctg
   142981 gacaagacgg tgaaaccccg tctctactaa aaatacaaaa attagctggg tgtggtggcg
   143041 ggcgcctata atcccagcta ctcaggaggc tgaggcagga gaattgcttg aacccgggag
   143101 gcagaggttg cagtgtgcca agatcacacc actgtactcc agtctgggcg acagagcaag
   143161 acttgtctca aaaaaaagca aaaagcaaaa caaaacaaaa aacaagaaag aatacaagaa
   143221 agttactcag taacttaatc tgaaagtgtt ctatctaaaa gcgtagagag aataatatat
   143281 cgctttgcta agtgaagttc tttctgccta tagcctgcaa tataaaatcc catagtttag
   143341 agccctatag ggtagaaaaa gtaccaaact ggtcgggcga ggtggcttac acctgtaatc
   143401 tcagcacttt gggaggctga ggtgggcgga tcacttgaga tcaggaattt gagaccagcc
   143461 tcgccaatat ggcaaaaccc catctctact aaaaatacaa aagttagccg ggtgtggtgg
   143521 tgggagtctg taattccagc tactctgagg caggagaatc gcttgaaccc aggaggcaga
   143581 ggctgcagtg agccaagatt gtaccactgc actccagcct gggcaacaga gcaagactcc
   143641 atctcaaaaa aaataaataa aaagcaccaa actaaactta atgcagaggc aaggaggcaa
   143701 aagatttagc aagaaataag accagagttc caggcttctg tcatgcacct ccagctaagc
   143761 attttcccag aaaagagggc aaagctcaga ttatgggtat acatctgctc ttatggcttt
   143821 tattttgagc tacttggaca agtaccatta aatggaggtc aggggatggc acagcactga
   143881 ggttgattcc taggagtttg aaattctcag gtttaaacat tttatctaga aaaacatttt
   143941 ttgggtgtga caactcagac caaaagctac tgagctttta tgggaacgta ttgccaaaga
   144001 aaccccaggc ccagcaatct attgtatttt ctaggtttcc aatttgtacc acagctaaag
   144061 ttgagtaggg catgcccaga aagggcattt ccatgtgccc atctctggca tagccatgga
   144121 gaataagaga caagtaagat cctctagcag gaagaagcag gagccatgct agaccataaa
   144181 gaagatcccc atagagagcg agcctcgagt ttcccagtta cctcgccaag gaactcattc
   144241 tactgccagg caaggagtct tccatattcc tgcctagcag gacataaatt atgctacaga
   144301 tcaacaataa ctgtgtgcta tttttcatcc attctcctct ttgcagaatt agagttttca
   144361 ttgtaattac tctgttccta ttttttttaa agatatattg gctaagatag tttaagtttt
   144421 acccagtata agcgaatctg gccacttctc tctccctcta ccatgaccac agtttaagtt
   144481 cccaccattt ccccactaga cttacgtaac agccctctaa cttccctact tccacgctac
   144541 actccgccct acagtctatt cttccctgta ccccagttca catcactccc ctgtacaaaa
   144601 ctttcccatt taacaaatat taagcatctg cttgccaggc ctcaaggagc catatccaaa
   144661 tttgagaagg aaatcagaaa gaaatcatca gccagaaatc ctggtcttaa gagttgaaca
   144721 ctgtaaccaa ataaggccct gggttgtctc tcaccgggga agaggagttg ataagcactc
   144781 tgttatacag gcatgagcat ttcttgaata actccttatt tctaacagca ttctaatttt
   144841 gtttagataa ccattgtccg taacaccaca gactgaaatg aagtcagaca ggtggcctgg
   144901 gccgggcgca gtggcttaca ctggttaatc ctagcacttt gggaggctga agcaggtgga
   144961 ttactggagt ctaggagttc gagaccagcc tgggcaacat ggcaaaacct catctttaca
   145021 aaaaatacaa aaattagtgg ggcatggtga catgtgcctg tagtcccagt tactcgggag
   145081 gctgaggtgg gaggatcgct tgagctcggg aggtcgatgc tgcagtgaga tgtcattgtg
   145141 ccactgcact ccaatccggg tgacagagtg agaccctgtc tcaaaaaaaa ataaattaat
   145201 taataaatta aggccgggcg cggtggctca cgcctgtaat cccagcactt tgggaggccg
   145261 aggcgggtgg atcatgaggt caggagatcg agaccatcct ggctaacaag gtgaaacccc
   145321 gtctctacta aaaatacaaa aaattagccg ggcacggtgg cgggcgcctg tagtcccagc
   145381 tactcgggag gctgaggcag gagaatggcg tgaacccggg aagcggagct tgcagtgagc
   145441 caagattgcg ccactgcagt ccgcagtccg gcctgggcga cagagcgaga ctccgtctca
   145501 aaaaaaaaaa aaaattaata aataaataaa ttaattaatt aattaaaaaa aaaaaaaagg
   145561 ccgggcctgg tggctcacgc ctgtaatccc agcactttgg gaggccaaga cgggtgatca
   145621 cttgaggtca ggagttggag accagcctgg ccaacatggt gaaaccctgt ctctactaaa
   145681 aataccaaaa attagccggg caaggtggca ggcacctgta atcccagcta ctcaggaggc
   145741 tgagacagga gaatcgcttg aacatgggag gcacacgttg cagtgagctg agatcatgcc
   145801 attgcattcc agcctggaca acaagagcaa aactccatct caaaaaacaa acaaacaaaa
   145861 aaaacagcct gggaccagat catgctgctt gtaggccact taatacttct gtattatatt
   145921 ctaagttcac taaaatataa acccaatgag gtcttgaatt tttgtcactg ttgaattacc
   145981 agaacagtgc ctgactcaca gtaaaggctt aatatttgtt aaataaatga aagagtaatg
   146041 ggaaagtttt gtacagggaa aaacagactg tagggtggag tgtagtgcag aagtagggaa
   146101 gttagaggac tatcacataa gactaggggg aaacaaaggg aacttaaact gtgtggtcat
   146161 gctagaggga gagagaaacg gccagattca tattatattt tgaagaggca gggacagaat
   146221 ttgctaatat ggaccaaaag agaagtagag aagaataaca acaaaatgat tgagcaatat
   146281 ttaatttgga atcacatggc taaataatat atatatatat attttttttt tgagatggag
   146341 tctcactctg tctcccaggc tggagtgcag tggcgcgatc gcagctcact gcaacctctg
   146401 cctccagggt tcaagcgatt cttgtgcctc agcctcccaa gtagccggga ttacaggcat
   146461 gagccactgc gcccagctaa tttttgtatt tttagtagag acagggtttc accttgctgg
   146521 ccaggctggt ctcgaactcc tggcctcaag tgatccaccc acctcagcct cccaaaatgc
   146581 tgggattaca ggcttgaggc actgtgcccg gcctctttct ttcttctctc tttctctttt
   146641 acctgagatg gggtctcact atgttgccca ggctggattc aaacttctgg gctcaagcaa
   146701 tcctcctacc tcagcctccc tagtagctgg gactatgcag gcatgccacc gtcccaacca
   146761 gaattttatt tctttttatg gctgaataat attccattgt gtagaatatt accacattag
   146821 gctatccatt catctgttaa tggacactgg gttgtttcta cctttgggct tttgtgaata
   146881 atgctgctaa gaacattggt gtatgagtat ctgaatccct gcattcaatt attttggatc
   146941 taaatccagg agtggaattg ctggattata tggtaattct atgtttaagt ttttgtggaa
   147001 ccaacacact gttttccact gcagctgcac ccttttacat tcccactaac aacgtacgag
   147061 atttcagttt cttcacatcc tcatcaacac ttgttatatt ccaggtttgt tgttgttgtt
   147121 gttgttgtaa tagccattct aatgggtatg aagtgtatct cactgtggtt ttggtttaca
   147181 tttccttata ggctagtgat gttgagcata ttttcatgtg cttattcacc atttgtatat
   147241 ctgttttggt gaaatatgta ttcaagtcct ttgcccattt ttaatttttc ttattgtaga
   147301 gttgtgggag ttctttgaat attttggata ttaattcttt catcagatat ataacttgtg
   147361 gatatcttct cccattctgt gttgtcttta ccctctctta atagtgtcct ttggtggaca
   147421 aaaaaattct aattttgaca gaagtccaat ttatctaact tttttctttt ttttttcttg
   147481 agacagagtc tcactctgtc atccaggctg gagtgcagtg gcatgatctc agctccctgc
   147541 aacctccgcc tcacagggtc aagtgattct cctgcctcag cctcccaagt agctgggatt
   147601 acaggggccc gccaccacgc ccggctaatt tttgtatttt agtaaagaca gggtttcacc
   147661 atgttggcca ggctggtctc aaactcctga cctcaggcaa tccgcctgcc tcagcctctc
   147721 agagtgttgg gattacaggc gtgagctaag gcgcccggcc ctaatttttt gttgttcttt
   147781 cttgtgctgt ggtgtcataa ccactgccta attcaaggtc aggaagattt acccctgtgt
   147841 tttctaagag ttttagctct tacctttagg tttttagcca ttttgaatta attttttata
   147901 tgaggtggag tagtgttcca acttcattct tctccatgtt gttattcagt tgtccagcat
   147961 gactttttga aaagactatt ctttcccacc ttgaattgtc ttggcacccc atgtcaattg
   148021 aatgtggttt atttctacac ccttaattct attccattga tctgtatgcc ttgactacta
   148081 ttggtttgaa gttaagtttt gaaattgaga agtgcaagtc ctccaacttt gttctttttc
   148141 aaggttgttt tggctattct gaagcccttg aatgtccata tgaatcttag agccagcttg
   148201 ccaatttctg caagaaagcc agctgagatt ttaatacaaa ttgtgttaat tctgtagatc
   148261 aatttggaga ttattgccat attaacaaca gtgtcatgct gggtgcagtg gcccacacct
   148321 ataattccag cactttggga ggccgaggcg gccaacatag ggagacaagc tggtctcaaa
   148381 ctgctgggct caggcgatcc tctcacctcg gcctcccaca aaataatttt taaaaaatca
   148441 gctaagcatg gtggtgttca ccgtagtcca agctacttag gagactgagg tgggaagaac
   148501 tcttgagccc aggagatcaa ggctgcagtg agctatgagt gccactgact ccagcctggg
   148561 caacagaggg agaccctgac gaaagagaga gaaagacaga gcgagagaga gagaaaagaa
   148621 aaaataacct taaaatgttt ttcaagaccc aagtttccat gtcacttaat ttagtctccc
   148681 ctcaggcaga ttaaacactg agatctcctt atatccactt aatcccagaa atgtagtaca
   148741 gttcttggca cagagtaaga actcaatata tattggctga aggaatgaac aaataaatga
   148801 attgggatga aatgtttatg tgacagtctc tacaaccaga ctgtgagcgc atcaagtaca
   148861 tagaagatgt tctattcatc ttttttcctc cagcacctaa cacagtatcc aacacaaaat
   148921 gaatgtgtac tagatgctat ttattaaact gtagacagga gataaatact ttatttcaaa
   148981 agtgcaaaga agggaagaca aatcacattt ctttttatca tgtaaacttg taccaaagac
   149041 ttatgaatag atttatgaat tggagtagaa aacatagaca tttctacctt gaggatatag
   149101 aagggaactt aggaagtgag aagtcagatt accctagtcc tctagggtgc ttaaagctaa
   149161 ctagtcccgt gaatgagaat ctaacctatg tgaaaatctc cagcctgtac ctgtacagca
   149221 tcagccctgg gacaacacag tcaggggcag gctgtagtca catctcttag cctaagagta
   149281 cacagctgtc agggaaagtc ctgatggcca cagtgaaaaa ggtcatgggt ggagagaagc
   149341 aaagtaggaa ggatcatttg aagcacaaac aaatggggaa actgaacaga caatctcagt
   149401 atcaccacat ctgcttcaaa aatagcacac caactccctt ccaaagtgca tcgttacact
   149461 gcaccatcgt ggaagaaatg gaagagcagg atggatttgg ctggctggag tcacatcttg
   149521 gggaagctgg ccaggttggc aatgccacag gcgttgttct tatttcgagc catgaggata
   149581 tatcctttgt ttccccagtt ttctccccag ctgtaagacc aatcaagaaa aatacttagt
   149641 actctcagtt tatttattca ttaaaatatt tactgagtat tgtgcgaggt actgatggta
   149701 cacaaaaatg aatatgatgt tttttaactt taaagagttt aaagtcaaga aatagggaga
   149761 aggcataata ctgtaggtat caaaggaggg aaagattcct tctgatcaag agaatcttag
   149821 aggcttgatg taagaggtgg catttgagct gagccaaaag gctaagtaga atttcgacct
   149881 gtgggaaggg gatgccagtg gaagggtgtg aaatgaacaa cagaggcagg aaagtgtggg
   149941 gcactgttag gatcatgtag ctgaaaagca gtgtgcctaa cagggtgtag tgggagagat
   150001 gtctggaaag gcagattgag atcaggttga attaccttta ctgcaaggtt aaaaagcttc
   150061 acctttactc tgtgacccat catcgtgagg agccagtgag ggattttgag tgggagagtg
   150121 atatgatcag agctgtattt ggggaatatt aatcttgaaa atgtgagttt agagcataag
   150181 aaagaggctg aagctggtga tcttgatttg ggagtaaacc ccataggaga gatggatgaa
   150241 actatgagag tggatgagat tactaaggaa atgagtagcg agactagaag acaagagaga
   150301 cgacgtgaga aagaggagag aggagagatt gtctgtttat cttcctgaat tgttttcagt
   150361 cctctttccc caaccagcaa cttctctaat tacccaacac attttattta aaacctgtac
   150421 cctcccccac tctccccttc tccaggggcc cccagtgttc ctgatcctgc ccgggtgtgt
   150481 cctttcttcc tcccagctct cttggaagct atctccacac atggtacctg atcatgaaag
   150541 aacagggtaa gtgcttggac ctgggagtca agagaaggga caacacagga cagaagctgt
   150601 gggagacttg ggaagatgct caggttcacc ttgagttgga actgtgatat aaaaagttgc
   150661 ttcaggtttg aacaacacca ctgcgtgagt ctgtcagcga cccagagtct agggctcctg
   150721 ctccttggag ggcactaagg acaggagatt tagccagtgc tgctgccagg ttaatggtga
   150781 tcagccagga aaggaaaact ataattgtta tcaatctgcc cctccccaaa agtcacaagg
   150841 tgctaaagag aaatgggaag caaggtaccc tggagttttc tttgttcccc aacataggaa
   150901 agtaaacact aacttttttt gtttatatct tatttctaaa aaatgtttct cttcacaatc
   150961 tttggcaagt tcttatatac gtgtatctta agtgtatttt gaaggaaaaa aactgttcaa
   151021 aggaatatta aacctttacg ctgttcccct actgttaaag acagattgct aatggaaacc
   151081 atctaagaaa ctgatcacat tagagacaaa agaaagtaaa aatacgtaat tccaaagctc
   151141 ctgctggttc tttgtactgc tttatctcat ggttagttag gtctgtctta ttgggctctg
   151201 agctcactga aggtgtcatc tgctcgttgg tttatgtatg tcactaagca ccttgcacac
   151261 agggagcatc tgcccagcac ataattgagc actattgtct ggcggatatg gtttggatgt
   151321 gtgtcccctc ccagtctcat gctgaaatgt gatccccaat gtgggaggtg gggcatagtg
   151381 ggaggtgttt gggttatgag agtggattcc tcatgaatgg cttggtgcca tcgccttggt
   151441 gatgagtgag ttctcactct attagttcat gcaagagctg gttctttaaa gagtcggcac
   151501 ctccctctct ctcttgctcc ctctctcact atgtgacatg cctgctccct cttttgcctt
   151561 ctgccatgag taaaagctct gaggcctcat cagaggctga gcagatacta gtgccacgct
   151621 ttgtatagcc aaacaaacct cgtttcttta taaattaccc agtctcgggt attcttttgt
   151681 ggcaatgcaa aatggactaa cacactggcc attaagtgtc agactgagaa ttaagagcct
   151741 ctgactgaga agatagaagt gagaactctg agataatttt ttcttgattt gggacagaga
   151801 aaggaatatc gggaagctgg aggtgaggtt gagtgttaaa agggtgactg aataacaaaa
   151861 gtagtgttcc catcattacc tgtttttaat tatccagtgc ttgtttccct tctggattcc
   151921 atatcccact gccaaaactg catggttcag attatcgcta ttgcagcttt catcataata
   151981 cacacctaga atacaaacta ccagcgtgag gctctatgca atccaagggt caccctgtgg
   152041 gatctcccag tctaggagac tgtttgaaga attccatccg tgtcaggagg gtttaccaca
   152101 gagatgctgc tccatactgc agcagttgtt aatttctctc tggccacctc catgtgaata
   152161 caaaaactaa gtactgaaaa tacccaaggt ccttcgagaa accatcaagt ttgtatcata
   152221 aaagacagtg ctgtatagga tcagcagctt cttacctttg ctgtaaaact ggaaggaggt
   152281 caggcttgca tcaatggcca cagagacagg tcccactcgg gccactgccc tcttcagggc
   152341 tttctcattc ccctcgggga tctctctgta ccctctgcat ttagctgcct tgcctgttgg
   152401 gttgtacata caactctctt cctggaagaa acaagattgt aataggacta ggacaaagca
   152461 ataggcaatg agtcagtgag taagaaactc aaccaatatt ttgagtgatg ccagtgaact
   152521 aacagaggca gcagtctagg gtttctagag agggtctgtt cttcctcaga aaaacgaata
   152581 tgaaaagagg attcccgcct tccaccacac ataaaaacca actacaggcc aggtgcggtg
   152641 gctcatgcct gtaatcccag cactttggga ggctgaggag ggtggatcac gaggtcagga
   152701 gatcgagacc atcctggcta acacagtgaa accctgtctc tactaaaaat gcaaaaaaat
   152761 tagccaggcg tggtagcggg cgcctgtagt cccagctact ggggaggctg aggcaggaga
   152821 atggcgtgaa cctgggaggc agaccttgta gtgagccaag atcgcgccat tgcactccag
   152881 cctgggcgag agcgagactc catctcaaaa aaaaaaaaaa tcaactacag gtgttttaag
   152941 acataaatgt gggaggaaaa cagtagggaa agatttctga aataagattt ttttaaaaag
   153001 cataaaccat cttctggcca catagtatgt attagcctca gggggcagga aggcaccagc
   153061 tgctgtgtgg gtaggaggcc aaggccaagg cttagctaga gggaagacct gggccctctg
   153121 cacctatatc aggggaactg gtgtgatcct tgtgctctca tcaaagacaa aatataaaag
   153181 tacataacaa gacactaaaa tagattaaag aacttagcaa tttagcctat gacttccaac
   153241 actatatttt acaggtagtt agcaatgttt cattggcaag ccacaattat ggaatcgaat
   153301 aatagcccat gttaaggtgg tgtattcttt tcagcaattc attttcctac agactacccc
   153361 ttcagaaccc taaggttgca tccacaagaa cttattatct actactcagc aggctgaggc
   153421 aggaggattg tttgtgccca ggagtttgag cctttagtgt actgtgatca tgctgtgaat
   153481 agccactcta ctccagcctg tgcaagatag tgagataccc tctaaaaaaa tacattatct
   153541 aacattatct aaatattaac agcaatagca gcatttgata tgccatttta agattacagt
   153601 ggtctactgc cttgatattt ttcaagatgc ctttgtttct gccatgtgct ccaaatccct
   153661 gatgcctcct ggtgccaaaa attacacaga attacaagtg aaacagaata tcttggaaat
   153721 tgactcaaaa agatattatg tgatgatact ttaaagtcag aataacctgc attatgctgg
   153781 aacaaactta taataccatc cctttaaaat gttatctggc cgccctatca gatctcttct
   153841 tttttatctt tttatttttt gagacagagt ctctgttacc caggctggag tgcaatggca
   153901 tgatcttggc tcactgcaac ctctgcctcc tgagttcgag caattctcct gcctcagcct
   153961 gctgggatta caagcgtgcg caatcacacc tggctaattt ttgtattttt agtagagatg
   154021 gggtttcacc atgttatccc agctggtctc aaactcctga cctcaagt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerSNPSNPTaxonomyTaxonomyHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC002642. Homo sapiens, cat...[gi:12803614] Links  


LOCUS       BC002642                1752 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, cathepsin S, clone MGC:3886 IMAGE:3610589, mRNA,
            complete cds.
ACCESSION   BC002642
VERSION     BC002642.1  GI:12803614
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1752)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site:       http://www.nisc.nih.gov/
            Contact:        nisc_mgc@nhgri.nih.gov
            Shevchenko,Y., Wetherby,K.D., Beckstrom-Sternberg,S.M.,
            Benjamin,B., Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Guan,X., Gupta,J., Ho,S.-L., Karlins,E., Legaspi,R.,
            Lim,M., Maduro,Q.L., Masiello,C., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Snyder,B., Stantripop,S., Thomas,P.J.,
            Tiongson,E.E., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 12 Row: c Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 806607.
FEATURES             Location/Qualifiers
     source          1..1752
                     /organism="Homo sapiens"
                     /db_xref="LocusID:1520"
                     /db_xref="taxon:9606"
                     /clone="MGC:3886 IMAGE:3610589"
                     /tissue_type="Pancreas, adenocarcinoma"
                     /clone_lib="NIH_MGC_39"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             72..1067
                     /codon_start=1
                     /product="cathepsin S"
                     /protein_id="AAH02642.1"
                     /db_xref="GI:12803615"
                     /translation="MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNE
                     EAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQ
                     RNITYKSNPNWILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLV
                     SLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSK
                     YRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNV
                     NHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI"
BASE COUNT      545 a    339 c    395 g    473 t
ORIGIN      
        1 ggcacgaggt tagaagagag cccactaatt caaggactct taccgtggga gcaactgctg
       61 gttctatcac aatgaaacgg ctggtttgtg tgctcttggt gtgctcctct gcagtggcac
      121 agttgcataa agatcctacc ctggatcacc actggcatct ctggaagaaa acctatggca
      181 aacaatacaa ggaaaagaat gaagaagcag tacgacgtct catctgggaa aagaatctaa
      241 agtttgtgat gcttcacaac ctggagcatt caatgggaat gcactcatac gatctgggca
      301 tgaaccacct gggagacatg accagtgaag aagtgatgtc tttgatgagt tccctgagag
      361 ttcccagcca gtggcagaga aatatcacat ataagtcaaa ccctaattgg atattgcctg
      421 attctgtgga ctggagagag aaagggtgtg ttactgaagt gaaatatcaa ggttcttgtg
      481 gtgcttgctg ggctttcagt gctgtggggg ccctggaagc acagctgaag ctgaaaacag
      541 gaaagctggt gtctctcagt gcccagaacc tggtggattg ctcaactgaa aaatatggaa
      601 acaaaggctg caatggtggc ttcatgacaa cggctttcca gtacatcatt gataacaagg
      661 gcatcgactc agacgcttcc tatccctaca aagccatgga tcagaaatgt caatatgact
      721 caaaatatcg tgctgccaca tgttcaaagt acactgaact tccttatggc agagaagatg
      781 tcctgaaaga agctgtggcc aataaaggcc cagtgtctgt tggtgtagat gcgcgtcatc
      841 cttctttctt cctctacaga agtggtgtct actatgaacc atcctgtact cagaatgtga
      901 atcatggtgt acttgtggtt ggctatggtg atcttaatgg gaaagaatac tggcttgtga
      961 aaaacagctg gggccacaac tttggtgaag aaggatatat tcggatggca agaaataaag
     1021 gaaatcattg tgggattgct agctttccct cttacccaga aatctagagg atctctcctt
     1081 tttataacaa atcaagaaat atgaagcact ttctcttaac ttaatttttc ctgctgtatc
     1141 cagaagaaat aattgtgtca tgattaatgt gtatttactg tactaattag aaaatatagt
     1201 ttgaggccgg gcacggtggc tcacgcctgt aatcccagta cttgggaggc caaggcaggc
     1261 atatcaactt gaggccagga gttaaagagc agcctggcta acatggtgaa accccatctc
     1321 tactaaaaat acaaaaaatt agccgagcac ggtggtgcat gcctgtaatc ccagctactt
     1381 gggaggctga ggcacgagat tccttgaacc caagaggttg aggctatgtt gagctgagat
     1441 cacaccactg tactccagcc tggatgacag agtggagact ctgtttcaaa aaaacagaaa
     1501 agaaaatata gtttgattct tcattttttt aaatttgcaa atctcaggat aaagtttgct
     1561 aagtaaatta gtaatgtact atagatataa ctgtacaaaa attgttcaac ctaaaacaat
     1621 ctgtaattgc ttattgtttt attgtatact ctttgtcttt taagacccct aatagccttt
     1681 tgtaacttga tggcttaaaa atacttaata aatctgccat ttcaaatttc aaaaaaaaaa
     1741 aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: BQ006623. UI-H-EI1-aza-p-08...[gi:19731530] Links  


IDENTIFIERS

dbEST Id:       11847065
EST name:       UI-H-EI1-aza-p-08-0-UI.s1
GenBank Acc:    BQ006623
GenBank gi:     19731530

CLONE INFO
Clone Id:       IMAGE:5846263 (3')
Source:         National Cancer Institute
Id as DNA:      UI-H-EI1-aza-p-08-0-UI.s1
Id in host:     UI-H-EI1-aza-p-08-0-UI
DNA type:       cDNA

PRIMERS
Sequencing:     M13 FORWARD
PolyA Tail:     yes

SEQUENCE
                TTTTTTTTTTTTTTTTATGGGTAAAAAATTCCATTTTTATTTTTACTAATGTAGGGGAGA
                ACATATGTTTCCTTCTATCCATCCTAGGTTTAGGGCTGAGATCTCTATAACAAAAGAGAG
                ATTAGCAAGAGAAAAGCATGCACATTTATTTAATATAAGTTTTACACGACACGAGGGGCT
                CCATAAGGAAATAAAGGCCCAAAGAAACAGTTGAACTTGAATGTTTTTTATAGTAGGTTT
                GATGAAGAGCAGCCAGTGATGTAGAAATGTCATAGGGCAAAGTGTGAACAAGCTAAATGT
                AATAAACTGGGGGAAACTGAGAGGCCTGTTTGTTTAGATTCCTCTTTGTGTCCCTGTGCT
                TTTGGAGGTAAGGATGCTCCTTTTCTCTGGGTGCCAGGAGAGCACCTCTTACATGACAGT
                CTTATGATCTACTTCAGGGGAAGGTCAGAAAATCCTTCCCAGGTTTTATGGCTGCTTCTG
                GGGAGAACAGCAGGCAGAAGGTCAGAGTCACCTTCCTGCTTCTGCAATTTCCTCAAATGC
                CTTCAGCTTAAAATATTCAATATGTCAAAGTGCTGTATTTTGTGTAGGGTGTCCTGAACC
                CGTCACTAACCTCTAACTGAAAGTTAGGATTTTCTTCAATTCTGAACATAGTTCATANAC
                CACGGTAATATTAGCAATACTTCATCATCAAAAGAAAACCCAGATATTTTCATATCACAT
                TAGTTTTTTGCATCTATTTTCATTATGCTT

Entry Created:  Mar 26 2002
Last Updated:   Mar 26 2002

COMMENTS
                Tissue Procurement: Dr. Jose Mercuende
                cDNA Library preparation: Dr. M. Bento Soares, University of
                Iowa
                cDNA Library Arrayed by: Dr. M. Bento Soares, University of
                Iowa
                DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
                Clone Distribution: Clone distribution information can be
                found through the I.M.A.G.E. Consortium/LLNL at:
                http://image.llnl.gov
                The following repetitive elements were found in this cDNA
                sequence: 43-453, >MER39#Unknown/MER21_group 51-608,
                >MER34#Unknown/MER21_group 603-739, >MER80#DNA/MER1_type
                (matched compliment)

LIBRARY
Lib Name:       NCI_CGAP_EI1
Organism:       Homo sapiens
Tag Lib:        UI-H-EI1
Tag Tissue:     chondrosarcoma
Tag Seq:        ACACTTGCAC
Organ:          Left Pelvis
Tissue type:    Chondrosarcoma
Develop. stage: Adult
Lab host:       DH10B (Life Technologies)
Vector:         pT7T3-Pac (Pharmacia) with a modified polylinker
R. Site 1:      EcoR I
R. Site 2:      Not I
Description:    NCI_CGAP_EI1 is a normalized cDNA library containing the
                following tissue(s): Chondrosarcoma. The library was
                constructed according to Bonaldo, Lennon and Soares, Genome
                Research, 6:791-806, 1996. First strand cDNA synthesis was
                primed with an oligo-dT primer containing a Not I site.
                Double stranded cDNA was ligated to an EcoR I adaptor,
                digested with Not I, and cloned directionally into pT7T3-Pac
                vector. The oligonucleotide used to prime the synthesis of
                first-strand cDNA contains a library tag sequence that is
                located between the Not I site and the (dT)18 tail. The
                sequence tag for this library is ACACTTGCAC.

SUBMITTER
Name:           Robert Strausberg, Ph.D.
E-mail:         cgapbs-r@mail.nih.gov

CITATIONS
Title:          National Cancer Institute, Cancer Genome Anatomy Project
                (CGAP), Tumor Gene Index
Authors:        NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap
Year:           1997
Status:         Unpublished


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M90696. Homo sapiens cath...[gi:806607] Links  


LOCUS       HUMCATS                 1763 bp    mRNA    linear   PRI 19-OCT-2000
DEFINITION  Homo sapiens cathepsin S (CTSS) mRNA, complete cds.
ACCESSION   M90696 S39127
VERSION     M90696.1  GI:806607
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Ritonja,A., Colic,A., Dolenc,I., Ogrinc,T., Podobnik,M. and Turk,V.
  TITLE     The complete amino acid sequence of bovine cathepsin S and a
            partial sequence of bovine cathepsin L
  JOURNAL   FEBS Lett. 283 (2), 329-331 (1991)
  MEDLINE   91257334
   PUBMED   2044774
REFERENCE   2  (bases 1 to 1763)
  AUTHORS   Wiederanders,B., Bromme,D., Kirschke,H., von Figura,K., Schmidt,B.
            and Peters,C.
  TITLE     Phylogenetic conservation of cysteine proteinases. Cloning and
            expression of a cDNA coding for human cathepsin S
  JOURNAL   J. Biol. Chem. 267 (19), 13708-13713 (1992)
  MEDLINE   92317106
   PUBMED   1377692
COMMENT     On or before Oct 19, 2000 this sequence version replaced gi:250802,
            gi:179956.
FEATURES             Location/Qualifiers
     source          1..1763
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="testis"
                     /tissue_lib="lambda gt11 (Clontech #HL10106)"
     gene            1..1763
                     /gene="CTSS"
     CDS             137..1132
                     /gene="CTSS"
                     /note="cysteine proteinase; lysosomal enzyme"
                     /codon_start=1
                     /product="cathepsin S"
                     /protein_id="AAC37592.1"
                     /db_xref="GI:179957"
                     /translation="MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNE
                     EAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQ
                     RNITYKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLV
                     SLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDLKCQYDSK
                     YRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNV
                     NHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI"
     sig_peptide     137..478
                     /gene="CTSS"
     mat_peptide     479..1129
                     /gene="CTSS"
                     /product="cathepsin S"
BASE COUNT      525 a    345 c    400 g    493 t
ORIGIN      
        1 ggggtacctc atgtgacaag ttccaatttc ttttcaagtc aattgaactg aaatctcctt
       61 gttgctttga aatcttagaa gagagcccac taattcaagg actcttactg taggagcaac
      121 tgctggttct atcacaatga aacggctggt ttgtgtgctc ttggtgtgct cctctgcagt
      181 ggcacagttg cataaagatc ctaccctgga tcaccactgg catctctgga agaaaaccta
      241 tggcaaacaa tacaaggaaa agaatgaaga agcagtacga cgtctcatct gggaaaagaa
      301 tctaaagttt gtgatgcttc acaacctgga gcattcaatg ggaatgcact catacgatct
      361 gggcatgaac cacctgggag acatgaccag tgaagaagtg atgtctttga tgagttccct
      421 gagagttccc agccagtggc agagaaatat cacatataag tcaaacccta atcggatatt
      481 gcctgattct gtggactgga gagagaaagg gtgtgttact gaagtgaaat atcaaggttc
      541 ttgtggtgct tgctgggctt tcagtgctgt gggggccctg gaagcacagc tgaagctgaa
      601 aacaggaaag ctggtgtctc tcagtgccca gaacctggtg gattgctcaa ctgaaaaata
      661 tggaaacaaa ggctgcaatg gtggcttcat gacaacggct ttccagtaca tcattgataa
      721 caagggcatc gactcagacg cttcctatcc ctacaaagcc atggatctga aatgtcaata
      781 tgactcaaaa tatcgtgctg ccacatgttc aaagtacact gaacttcctt atggcagaga
      841 agatgtcctg aaagaagctg tggccaataa aggcccagtg tctgttggtg tagatgcgcg
      901 tcatccttct ttcttcctct acagaagtgg tgtctactat gaaccatcct gtactcagaa
      961 tgtgaatcat ggtgtacttg tggttggcta tggtgatctt aatgggaaag aatactggct
     1021 tgtgaaaaac agctggggcc acaactttgg tgaagaagga tatattcgga tggcaagaaa
     1081 taaaggaaat cattgtggga ttgctagctt tccctcttac ccagaaatct agaggatctc
     1141 tcctttttat aacaaatcaa tgaaatatga agcactttct cttaacttaa tttttcctgc
     1201 tgtatccaga agaaataatt gtgtcatgat taatgtgtat ttactgtact aattagaaaa
     1261 tatagtttga ggccgggcac gtggctcacg cgtaatcccg ttacttggga ggccaaggca
     1321 ggcattatca atcttgaggc caggagttaa agagcagcct ggctaacatg gtgaaacccc
     1381 atctctacta aaaatacaaa aaattagccg agcacggtgg tgcatgcctg taatcccagc
     1441 tacttgggag gctgaggcac gagattcctt gaacccaaga ggttgaggct atgttgagct
     1501 gagatcacac cactgtactc cagcctggat gacagagtgg agactctgtt tcaaaaaaac
     1561 agaaaagaaa atatagtttg attcttcatt tttttaaatt tgcaaatctc aggataaagt
     1621 ttgctaagta aattagtaat gtactataga tataactgta caaaaattgt tcaacctaaa
     1681 acaatctgta attgcttatt gttttattgt cccgaattca gttggtttaa tatattgtcc
     1741 tctgtaattt cgatccttct taa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U41766. Human metalloprot...[gi:1235671] Links  


LOCUS       HSU41766                3865 bp    mRNA    linear   PRI 21-MAR-1996
DEFINITION  Human metalloprotease/disintegrin/cysteine-rich protein precursor
            (MDC9) mRNA, complete cds.
ACCESSION   U41766
VERSION     U41766.1  GI:1235671
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3865)
  AUTHORS   Weskamp,G., Kratzschmar,J., Reid,M.S. and Blobel,C.P.
  TITLE     MDC9, a widely expressed cellular disintegrin containing
            cytoplasmic SH3 ligand domains
  JOURNAL   J. Cell Biol. 132 (4), 717-726 (1996)
  MEDLINE   96178079
   PUBMED   8647900
REFERENCE   2  (bases 1 to 3865)
  AUTHORS   Blobel,C.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-DEC-1995) Carl P. Blobel, Cellular Biochemistry and
            Biophysics Program, Memorial Sloan-Kettering Cancer Center, 1275
            York Avenue, New York, NY 10021, USA
FEATURES             Location/Qualifiers
     source          1..3865
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="MDA-MB-468 mammary epithelial carcinoma"
     gene            1..3865
                     /gene="MDC9"
     CDS             79..2538
                     /gene="MDC9"
                     /note="Method: conceptual translation supplied by author"
                     /codon_start=1
                     /product="metalloprotease/disintegrin/cysteine-rich
                     protein precursor"
                     /protein_id="AAC50403.1"
                     /db_xref="GI:1235672"
                     /translation="MGSGARFPSGTLRVRWLLLLGLVGPVLGAARPGFQQTSHLSSYE
                     IITPWRLTRERREAPRPYSKQVSYVIQAEGKEHIIHLERNKDLLPEDFVVYTYNKEGT
                     LITDHPNIQNHCHYRGYVEGVHNSSIALSDCFGLRGLLHLENASYGIEPLQNSSHFEH
                     IIYRMDDVYKEPLKCGVSNKDIEKETAKDEEEEPPSMTQLLRRRRAVLPQTRYVELFI
                     VVDKERYDMMGRNQTAVREEMILLANYLDSMYIMLNIRIVLVGLEIWTNGNLINIVGG
                     AGDVLGNFVQWREKFLITRRRHDSAQLVLKKGFGGTAGMAFVGTVCSRSHAGGINVFG
                     QITVETFASIVAHELGHNLGMNHDDGRDCSCGAKSCIMNSGASGSRNFSSCSAEDFEK
                     LTLNKGGNCLLNIPKPDEAYSAPSCGNKLVDAGEECDCGTPKECELDPCCEGSTCKLK
                     SFAECAYGDCCKDCRFLPGGTLCRGKTSECDVPEYCNGSSQFCQPDVFIQNGYPCQNN
                     KAYCYNGMCQYYDAQCQVIFGSKAKAAPKDCFIEVNSKGDRFGNCGFSGNEYKKCATG
                     NALCGKLQCENVQEIPVFGIVPAIIQTPSRGTKCWGVDFQLGSDVPDPGMVNEGTKCG
                     AGKICRNFQCVDASVLNYDCDVQKKCHGHGVCNSNKNCHCENGWAPPNCETKGYGGSV
                     DSGPTYNEMNTALRDGLLVFFFLIVPLIVCAIFIFIKRDQLWRSYFRKKRSQTYESDG
                     KNQANPSRQPGSVPRHVSPVTPPREVPIYANRFAVPTYAAKQPQQFPSRPPPPQPKVS
                     SQGNLIPARPAPAPPLYSSLT"
     sig_peptide     79..168
                     /gene="MDC9"
     mat_peptide     166..2535
                     /gene="MDC9"
                     /product="metalloprotease/disintegrin/cysteine-rich
                     protein"
     misc_feature    167..694
                     /gene="MDC9"
                     /note="encodes the pro-domain"
     misc_feature    695..1315
                     /gene="MDC9"
                     /note="encodes the metalloproteinase domain"
     misc_feature    1316..1588
                     /gene="MDC9"
                     /note="encodes the disintegrin domain"
     misc_feature    1589..2008
                     /gene="MDC9"
                     /note="encodes the cysteine-rich domain"
     misc_feature    2009..2098
                     /gene="MDC9"
                     /note="encodes the EGF-like domain"
     misc_feature    2174..2233
                     /gene="MDC9"
                     /note="encodes the transmembrane domain"
     misc_feature    2234..2535
                     /gene="MDC9"
                     /note="encodes the cytoplasmic domain"
BASE COUNT     1201 a    670 c    842 g   1152 t
ORIGIN      
        1 cggcagggtt ggaaaatgat ggaagaggcg gaggtggagg cgaccgagtg ctgagaggaa
       61 cctgcggaat cggccgagat ggggtctggc gcgcgctttc cctcggggac ccttcgtgtc
      121 cggtggttgc tgttgcttgg cctggtgggc ccagtcctcg gtgcggcgcg gccaggcttt
      181 caacagacct cacatctttc ttcttatgaa attataactc cttggagatt aactagagaa
      241 agaagagaag cccctaggcc ctattcaaaa caagtatctt atgttattca ggctgaagga
      301 aaagagcata ttattcactt ggaaaggaac aaagaccttt tgcctgaaga ttttgtggtt
      361 tatacttaca acaaggaagg gactttaatc actgaccatc ccaatataca gaatcattgt
      421 cattatcggg gctatgtgga gggagttcat aattcatcca ttgctcttag cgactgtttt
      481 ggactcagag gattgctgca tttagagaat gcgagttatg ggattgaacc cctgcagaac
      541 agctctcatt ttgagcacat catttatcga atggatgatg tctacaaaga gcctctgaaa
      601 tgtggagttt ccaacaagga tatagagaaa gaaactgcaa aggatgaaga ggaagagcct
      661 cccagcatga ctcagctact tcgaagaaga agagctgtct tgccacagac ccggtatgtg
      721 gagctgttca ttgtcgtaga caaggaaagg tatgacatga tgggaagaaa tcagactgct
      781 gtgagagaag agatgattct cctggcaaac tacttggata gtatgtatat tatgttaaat
      841 attcgaattg tgctagttgg actggagatt tggaccaatg gaaacctgat caacatagtt
      901 gggggtgctg gtgatgtgct ggggaacttc gtgcagtggc gggaaaagtt tcttatcaca
      961 cgtcggagac atgacagtgc acagctagtt ctaaagaaag gttttggtgg aactgcagga
     1021 atggcatttg tgggaacagt gtgttcaagg agccacgcag gcgggattaa tgtgtttgga
     1081 caaatcactg tggagacatt tgcttccatt gttgctcatg aattgggtca taatcttgga
     1141 atgaatcacg atgatgggag agattgttcc tgtggagcaa agagctgcat catgaattca
     1201 ggagcatcgg gttccagaaa ctttagcagt tgcagtgcag aggactttga gaagttaact
     1261 ttaaataaag gaggaaactg ccttcttaat attccaaagc ctgatgaagc ctatagtgct
     1321 ccctcctgtg gtaataagtt ggtggacgct ggggaagagt gtgactgtgg tactccaaag
     1381 gaatgtgaat tggacccttg ctgcgaagga agtacctgta agcttaaatc atttgctgag
     1441 tgtgcatatg gtgactgttg taaagactgt cggttccttc caggaggtac tttatgccga
     1501 ggaaaaacca gtgagtgtga tgttccagag tactgcaatg gttcttctca gttctgtcag
     1561 ccagatgttt ttattcagaa tggatatcct tgccagaata acaaagccta ttgctacaac
     1621 ggcatgtgcc agtattatga tgctcaatgt caagtcatct ttggctcaaa agccaaggct
     1681 gcccccaaag attgtttcat tgaagtgaat tctaaaggtg acagatttgg caattgtggt
     1741 ttctctggca atgaatacaa gaagtgtgcc actgggaatg ctttgtgtgg aaagcttcag
     1801 tgtgagaatg tacaagagat acctgtattt ggaattgtgc ctgctattat tcaaacgcct
     1861 agtcgaggca ccaaatgttg gggtgtggat ttccagctag gatcagatgt tccagatcct
     1921 gggatggtta acgaaggcac aaaatgtggt gctggaaaga tctgtagaaa cttccagtgt
     1981 gtagatgctt ctgttctgaa ttatgactgt gatgttcaga aaaagtgtca tggacatggg
     2041 gtatgtaata gcaataagaa ttgtcactgt gaaaatggct gggctccccc aaattgtgag
     2101 actaaaggat acggaggaag tgtggacagt ggacctacat acaatgaaat gaatactgca
     2161 ttgagggacg gacttctggt cttcttcttc ctaattgttc cccttattgt ctgtgctatt
     2221 tttatcttca tcaagaggga tcaactgtgg agaagctact tcagaaagaa gagatcacaa
     2281 acatatgagt cagatggcaa aaatcaagca aacccttcta gacagccggg gagtgttcct
     2341 cgacatgttt ctccagtgac acctcccaga gaagttccta tatatgcaaa cagatttgca
     2401 gtaccaacct atgcagccaa gcaacctcag cagttcccat caaggccacc tccaccacaa
     2461 ccgaaagtat catctcaggg aaacttaatt cctgcccgtc ctgctcctgc acctccttta
     2521 tatagttccc tcacttgatt tttttaacct tctttttgca aatgtcttca gggaactgag
     2581 ctaatacttt ttttttttct tgatgttttc ttgaaaagcc tttctgttgc aactatgaat
     2641 gaaaacaaaa caccacaaaa cagacttcac taacacagaa aaacagaaac tgagtgtgag
     2701 agttgtgaaa tacaaggaaa tgcagtaaag ccagggaatt tacaataaca tttccgtttc
     2761 catcattgaa taagtcttat tcagtcatcg gtgaggttaa tgcactaatc atggattttt
     2821 tgaacatgtt attgcagtga ttctcaaatt aactgtattg gtgtaagatt tttgtcatta
     2881 agtgtttaag tgttattctg aattttctac cttagttatc attaatgtag ttcctcattg
     2941 aacatgtgat aatctaatac ctgtgaaaac tgactaatca gctgccaata atatctaata
     3001 tttttcatca tgcacgaatt aataatcatc atactctaga atcttgtctg tcactcacta
     3061 catgaataag caaatattgt cttcaaaaga atgcacaaga accacaatta agatgtcata
     3121 ttattttgaa agtacaaaat atactaaaag agtgtgtgtg tattcacgca gttactcgct
     3181 tccattttta tgacctttca actataggta ataactctta gagaaattaa tttaatatta
     3241 gaatttctat tatgaatcat gtgaaagcat gacattcgtt cacaatagca ctattttaaa
     3301 taaattataa gctttaaggt acgaagtatt taatagatct aatcaaatat gttgattcat
     3361 ggctataata aagcaggagc aattataaaa tcttcaatca attgaacttt tacaaaacca
     3421 cttgagaatt tcatgagcac tttaaaatct gaactttcaa agcttgctat taaatcattt
     3481 agaatgttta catttactaa ggtgtgctgg gtcatgtaaa atattagaca ctaatatttt
     3541 catagaaatt aggctggaga aagaaggaag aaatggtttt cttaaatacc tacaaaaaag
     3601 ttactgtggt atctatgagt tatcatctta gctgtgttaa aaatgaattt ttactatggc
     3661 agatatggta tggatcgtaa aattttaagc actaaaaatt ttttcataac ctttcataat
     3721 aaagtttaat aataggttta ttaactgaat ttcattagtt ttttaaaagt gtttttggtt
     3781 tgtgtatata tacatataca aatacaacat ttacaataaa taaaatactt gaaattctca
     3841 aaaaaaaaaa aaaaaaaaaa aaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: A12027. Macrophage migrat...[gi:490102] Links  


LOCUS       A12027                  4205 bp    DNA     linear   PAT 30-NOV-1994
DEFINITION  Macrophage migration inhibition factor (MRP-14)cDNA from Human
            placenta (formula v).
ACCESSION   A12027
VERSION     A12027.1  GI:490102
KEYWORDS    .
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4205)
  AUTHORS   Odink,K.G., Clerc,R., Cerletti,N., Brueggen,J., Tarcsay,L., Sorg,C.
            and Wiesendanger,W.
  TITLE     Novel lymphokine related peptides
  JOURNAL   Patent: EP 0263072-A 12 06-APR-1988;
            CIBA-GEIGY AG
FEATURES             Location/Qualifiers
     source          1..4205
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            1511..1543
                     /number=1
     intron          1544..2027
                     /number=1
     exon            2028..2191
                     /number=2
     gene            join(2051..2191,2342..2482)
                     /gene="MRP-14"
     CDS             join(2051..2191,2342..2482)
                     /gene="MRP-14"
                     /codon_start=1
                     /protein_id="CAA01001.1"
                     /db_xref="GI:818950"
                     /db_xref="SWISS-PROT:P05109"
                     /translation="MLTELEKALNSIIDVYHKYSLIKGNFHAVYRDDLKKLLETECPQ
                     YIRKKGADVWFKELDINTDGAVNFQEFLILVIKMGVAAHKKSHEESHKE"
     intron          2192..2341
                     /gene="MRP-14"
                     /number=2
     exon            2342..2552
                     /number=3
BASE COUNT     1037 a    952 c   1028 g   1188 t
ORIGIN      
        1 cttgggttgc ttccaccttt tggctcttgt aaataatgct gctatgaaca tgaatgtaca
       61 aacatctgtt tgaatccctg cattcaattc ttttgcatat atacccagga gcagaatgat
      121 ggatcatatg gtaattctgt gtttatttat ttgaggaaca aacttgccgt tttccataac
      181 agctgcacta ttttacattc ccactaacag tgcattaggc ttccaattct ctatgccctc
      241 accaacactt gttttctggg ttttaaaaga agtagtagtc atccttgtag gtgtcaggtg
      301 gtatctcatt gtcgttttgc ttcatgtttt cctaaagatt agtaattttc atatgcttat
      361 tgaccatttg tatatcttct tcggagaagt gtctatttga gtctttcccc aattttgatt
      421 ggtttgtttg ttttttgttg ttgagttgta gggattcttt tatattctgg atattaatcc
      481 cttatcagat atttgtttta caaatatttt ctttgtaaca acagaaacac accacagtct
      541 tcaaggttgg aagccagtta atctgagtag cattttgtta gtggtgggga gaggatttgt
      601 tcctcctgaa atcctgggga attggccacc tcctcttctc ctcttaggca tgaagcgcgt
      661 ctggcttctc caaagaactc ttcccctcca ctacctcaga gttagcttcc tctcttcagc
      721 cagtgatcct ggggtcccag acacaataat taaccaagag agggtgaaag gctccctgct
      781 gtgtttatgc aatggctcag gcccttgtga agtgccgagg gaccccaagc agcctccatc
      841 tcccagggca tggtccatcc ccagctttca cagaacagga aagctgtgga ggagtgtggg
      901 cagcagggta ggaatggata tagcccttgg caacaacaca tttccccaca aagcacccac
      961 ccaaaagaac aacaacgata gttttagttt ttagtaatga gaacaatagt tctcatgact
     1021 aaaagccatc agccaggaca ctgttctcaa cccttttgcg gtctttggac cctttgaaac
     1081 tctgacagaa gccatggagg aatgttctca ctgagtgcat gcactcaaaa tgatgcattc
     1141 aacttcaatt cagtttcagg gatgtatggc ctgaccacca atgcagggga ttagcaatcg
     1201 caatagtgga gagggcatgg gagtgggaat ctggctggat caagcaagtg gatgccagca
     1261 gcccagaaaa agagcccccc tacctgcttt ttccttcctg ggcactattg cccagcaaat
     1321 gccttcctct ttccgcttct cctacctccc cacccaaaat tttcattctg cacagtgatt
     1381 gccacattca ctggttgaga aacagagact gtagcaactc tggcagggag aagctgtctc
     1441 tgatggcctg aagctgtggg cagctggcca agcctaaccg ctataaaaag gagctgcctc
     1501 tcagccctgc atgtctcttg tcagctgtct ttcagaagac ctggtaagtg ggactgtctg
     1561 ggttggcccc gcactttggg cttctcttgg ggagggtcag ggaagtggag cagccttcct
     1621 gagagaggag agagaaagct cagggaggtc tggagcaaag atactcctgg aggtggggag
     1681 tgaggcaggg ataaggaagg agagtatcct ccagcacctt ccagtgggta agggcacatt
     1741 gtctcctagg ctggactttt cttgagcaga gggtggggtg gtaaggaaag tctacgggcc
     1801 cccgtgtgtg tgcacatgtc tctgtgtgaa tggacccttc cccttcccac acgtgtatcc
     1861 ctatcatccc acccttccca ccagaggcca tagccatctg ctggtttggt tatttgagag
     1921 tgcaggccag gacaaggcca tcgcttgggg catgaatcct ctgcgtactg ccctggccag
     1981 atgcaaattc cctgccatgg gattccccag aaggttctgt ttttcaggtg gggcaagttc
     2041 cgtgggcatc atgttgaccg agctggagaa agccttgaac tctatcatcg acgtctacca
     2101 caagtactcc ctgataaagg ggaatttcca tgccgtctac agggatgacc tgaagaaatt
     2161 gctagagacc gagtgtcctc agtatatcag ggtgaggagg ggctgggtgt ggcgggggct
     2221 ctctgcctgg tcctggggct gccctgggcc agcggtcctc cctgccaccc ttcatagatg
     2281 ctatgcctcg gctctctctg agatctttaa actctggctt cttcctcctc aatcttgaca
     2341 gaaaaagggt gcagacgtct ggttcaaaga gttggatatc aacactgatg gtgcagttaa
     2401 cttccaggag ttcctcattc tggtgataaa gatgggcgtg gcagcccaca aaaaaagcca
     2461 tgaagaaagc cacaaagagt agctgagtta ctgggcccag aggctgggcc cctggacatg
     2521 tacctgcaga ataataaagt catcaatacc tcatgcctct ctcttatgct tttgtggaat
     2581 gaggttcctc ggtgtggagg gagggttgga aaacccaaag gaagaaaaag aaatctatgt
     2641 tatcccaccc tacctctcac aagcctttcc tgctttaccc ctcacctggc ctctgcccca
     2701 cattccttca gcccctcatt tcgagcattg gatttgaggc ttaaggattc aaaaagtcgt
     2761 catgaatata gctgatgatt ttatagtggt tctgaaatgg gtcggggatt tgggaacagg
     2821 gtggtagtat aagaacaact gatactgttc tctaagctaa atcttagctt ccagctacct
     2881 gtcttagatg tggctcttgg gaaccttaga gtgatagcta catagaagtg tgtgggtgtg
     2941 tgtgtgtgtg tctgtgtgtg tgtgtgtgag agagagacag acagaaagag agcaagagag
     3001 ggaagggggg agaggctgat tgtgtgtgtg gtgtgatgta ggtggacaat gttcagagtc
     3061 ctccattaac aggataatcc tcacacctgt ccacatacct gtagtttgtc cttggggatt
     3121 ttgaaaattt ttcctccctc tccactccca aactcccaac tcaattaaat gataaaggaa
     3181 taggcaaata ggaaaataaa ttagtaaaac ttaagtcaaa gaataggtta ttcatacgct
     3241 gcctatggga ttctatgctt tgtgatcaga aaattatcta aaaaatactt cccaagggct
     3301 ggtacaaggg aggccagaag acgagtggtt cttctctgag gtggacatta aaaaaagaag
     3361 aaaatgaagg ggaacctttt gacaagaatg tcaccccaaa ctggattttc atgctgtggt
     3421 gtggggaatt ttctgttgtc ctcacttagg tgctggggca gtggtgttag tgatgggtaa
     3481 aaaggtagga agctgtcaca gaatcactaa accagggttc ttaacttgtc tgtctataca
     3541 tctctgaaat tgggttgaag ttgtgtgcat cattttgagt gacgcactga gaacattcct
     3601 ccacggcttc catcgagagt ctcgaaaagg cccaacacct caaaaaggtt aagaacactt
     3661 gtcctgctta ctggttttta gtaacaaatg gcagagtatt tctctctgtc tctctctctt
     3721 tttttttttt tttttttgag acacagggtc ttgtctgtca cgtggactag agtacaatgg
     3781 gcatgatcat gggctcactg tagcctcgaa cacctgggct caagtaatcc tcccacctca
     3841 gcctctttag tagctgggac tacagcatga gccactgccc ttggctaatt tttaaattat
     3901 ttttttgtag agatggaaac ttgctatgtt gcccaggcta gtctcaaact cctggactca
     3961 agcgatcctc ctaccttggc ctcccaaagt gctgagatta cagtgtgatc cacaccacac
     4021 ctggccaaag attggagtat ttttattgct attgttgtgc tgggtgggtg ggtgggtgta
     4081 tgctttgtgg ggacgtgtgt tgttgccaag ggctaaatca gttcctaccc tgctgcccac
     4141 agtcctccac agctttcctg ctctgtgaag ctaaggatac accccgatga taagctgtca
     4201 acata
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMProteinProteinTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Y00278. Human mRNA for cy...[gi:29887] Links  


LOCUS       HSCFANT                  420 bp    mRNA    linear   PRI 12-SEP-1993
DEFINITION  Human mRNA for cystic fibrosis antigen (CFAg).
ACCESSION   Y00278
VERSION     Y00278.1  GI:29887
KEYWORDS    calcium binding protein; cystic fibrosis antigen.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Dorin,J.R., Novak,M., Hill,R.E., Brock,D.J., Secher,D.S. and van
            Heyningen,V.
  TITLE     A clue to the basic defect in cystic fibrosis from cloning the CF
            antigen gene
  JOURNAL   Nature 326 (6113), 614-617 (1987)
  MEDLINE   87173041
REFERENCE   2  (bases 1 to 420)
  AUTHORS   Dorin,S.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-APR-1987) S. R. Dorin, Clinical and Population
            Cytogenetics Unit, MRC, Western General Hospital, Crewe Road,
            Edinburgh, Scotland, Great Britain
FEATURES             Location/Qualifiers
     source          1..420
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1q12-1q22"
                     /clone="CFA8-9"
                     /clone_lib="lambda gt10 chronic myeloid leukaemia cDNA"
     CDS             52..336
                     /note="CFAg (AA 1-94)"
                     /codon_start=1
                     /protein_id="CAA68390.1"
                     /db_xref="GI:29888"
                     /db_xref="SWISS-PROT:P05109"
                     /translation="MLTELEKALNSIIDVYHKYSLIKGNFHAVYRDDLKKLLETECPQ
                     YIRKKGADVWFKELDINTDGAVNFQEFLILVIKMGWQPTKKAMKKATKSS"
     misc_feature    85..175
                     /note="non-EF-hand calcium binding site"
     misc_feature    199..289
                     /note="EF-hand calcium binding site"
BASE COUNT      138 a     90 c    101 g     90 t      1 others
ORIGIN      
        1 ctcttgtcag ctgtctttca gaagacctgg tggggnaagt ccgtgggcat catgttgacc
       61 gagctggaga aagccttgaa ctctatcatc gacgtctacc acaagtactc cctgataaag
      121 gggaatttcc atgccgtcta cagggatgac ctgaagaaat tgctagagac cgagtgtcct
      181 cagtatatca ggaaaaaggg tgcagacgtc tggttcaaag agttggatat caacactgat
      241 ggtgcagtta acttccagga gttcctcatt ctggtgataa agatgggctg gcagcccaca
      301 aaaaaagcca tgaagaaagc cacaaagagt agctgagtta ctgcccagag gctgggcccc
      361 tgacatgtac ctgcagaata ataaagtcat caatacctca aaaaaaaaaa aaaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L18960. Human protein syn...[gi:306724] Links  


LOCUS       HUMEIF4C                1202 bp    mRNA    linear   PRI 18-JUL-1994
DEFINITION  Human protein synthesis factor (eIF-4C) mRNA, complete cds.
ACCESSION   L18960
VERSION     L18960.1  GI:306724
KEYWORDS    protein synthesis factor.
SOURCE      Homo sapiens cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1202)
  AUTHORS   Dever,T.E., Wei,C.L., Benkowski,L.A., Browning,K., Merrick,W.C. and
            Hershey,J.W.
  TITLE     Determination of the amino acid sequence of rabbit, human, and
            wheat germ protein synthesis factor eIF-4C by cloning and chemical
            sequencing
  JOURNAL   J. Biol. Chem. 269 (5), 3212-3218 (1994)
  MEDLINE   94148809
   PUBMED   8106356
FEATURES             Location/Qualifiers
     source          1..1202
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /note="subject has leukemia"
     5'UTR           1..207
     CDS             208..642
                     /codon_start=1
                     /product="protein synthesis factor"
                     /protein_id="AAA19812.1"
                     /db_xref="GI:306725"
                     /translation="MPKNKGKGGKNRRRGKNENESEKRELVFKEDGQEYAQVIKMLGN
                     GRLEAMCFDGVKRLCHIRGKLRKKVWINTSDIILVGLRDYQDNKADVILKYNADEARS
                     LKAYGELPEHAKINETDTFGPGDDDEIQFDDIGDDDEDIDDI"
     3'UTR           643..1202
BASE COUNT      366 a    221 c    278 g    337 t
ORIGIN      
        1 ggcacgaggc gccatttgct gccgccgagc gtggacgcag gcggatctct gaagagctgg
       61 gtcgccagcc tctcccgcgc acgttgcctg gcctccagca cctacttggt cccgcgcgct
      121 ccctcgtgtc gcccctcgga gcagcagccg ccgcggtcgc cgctacccgg aaagaagtca
      181 gagacgccgc gagtcgccgc caccgccatg cccaagaata aaggtaaagg aggtaaaaac
      241 agacgcaggg gtaagaatga gaatgaatct gaaaaaagag aactggtatt caaagaggat
      301 gggcaggagt atgctcaggt aatcaaaatg ttgggaaatg gacggctaga agcaatgtgt
      361 ttcgatggtg taaagaggtt atgtcacatc agaggaaaat tgagaaaaaa ggtttggata
      421 aatacctcgg acattatttt ggttggtctc cgagactacc aggataacaa agctgatgta
      481 attttaaaat acaatgcaga cgaagctaga agtctgaagg catacggcga gcttccagag
      541 catgctaaaa tcaatgaaac tgatacattt ggtcctggag atgatgatga aattcagttt
      601 gatgacattg gagatgatga tgaagatatt gatgacatct aaattgaact caacatttta
      661 cattccatct tttctgaaga ttgtcctaca atttggattt tgatcatgac aaagaagatt
      721 aaaatttcat tagcatgaat gcaatttgtt aaagcagact gatttgtttc taagatattt
      781 ttggtttttt taaaactgat aataatgctg aattatctta agtgagatgt taagcccact
      841 ttgttctttt aatgtaatgg agcttatggg tagaagacca tgtctactaa ttacaaaaaa
      901 aaaaaaaaac catgattgct gcttttccta ccacttccag taagaaaatg ggtgttttga
      961 agaaatcatt tgccttgtct cacggaatct gattaagccc tggcctcttg atgtatagag
     1021 tcatggatat tccagttacc tagatattcc cttgagattt tgatacaatt tgagggaggc
     1081 agaagtctgc agttgaagaa aaaaaataag tctgtttgtc atatttaagt agcctgtgcg
     1141 tatttttata ctgattttga tatcatgttc ttttcatagt cgtattttgc caccgtaaac
     1201 at
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M93651. Human set gene, c...[gi:338038] Links  


LOCUS       HUMSET                  2577 bp    mRNA    linear   PRI 09-JAN-1995
DEFINITION  Human set gene, complete cds.
ACCESSION   M93651
VERSION     M93651.1  GI:338038
KEYWORDS    oncogene.
SOURCE      Homo sapiens (individual_isolate Acute undifferentiated leukemia
            patient, strain Caucasian) (tissue library: lambda EMBL3, lambda
            gt11) male bone marrow; testis cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2577)
  AUTHORS   von Lindern,M., van Baal,S., Wiegant,J., Raap,A., Hagemeijer,A. and
            Grosveld,G.
  TITLE     Can, a putative oncogene associated with myeloid leukemogenesis,
            may be activated by fusion of its 3' half to different genes:
            characterization of the set gene
  JOURNAL   Mol. Cell. Biol. 12 (8), 3346-3355 (1992)
  MEDLINE   92334332
   PUBMED   1630450
FEATURES             Location/Qualifiers
     source          1..2577
                     /organism="Homo sapiens"
                     /strain="Caucasian"
                     /isolate="Acute undifferentiated leukemia patient"
                     /db_xref="taxon:9606"
                     /map="9q34"
                     /sex="male"
                     /tissue_type="bone marrow; testis"
                     /germline
                     /tissue_lib="lambda EMBL3, lambda gt11"
     gene            1..2577
                     /gene="set"
     CDS             4..837
                     /gene="set"
                     /codon_start=1
                     /protein_id="AAA60318.1"
                     /db_xref="GI:338039"
                     /translation="MSAQAAKVSKKELNSNHDGADETSEKEQQEAIEHIDEVQNEIDR
                     LNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEE
                     ALHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHLNESGDPSSKSTEIKW
                     KSGKDLTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDAGADELGEVIKDDIWPNPLQ
                     YYLVPDMDDEEGEGEEDDDDDEEEEGLEDIDEEGDEDEGEEDEDDDEGEEGEEDEGED
                     D"
BASE COUNT      795 a    449 c    565 g    768 t
ORIGIN      
        1 cacatgtcgg cgcaggcggc caaagtcagt aaaaaggagc tcaactccaa ccacgacggg
       61 gccgacgaga cctcagaaaa agaacagcaa gaagcgattg aacacattga tgaagtacaa
      121 aatgaaatag acagacttaa tgaacaagcc agtgaggaga ttttgaaagt agaacagaaa
      181 tataacaaac tccgccaacc attttttcag aagaggtcag aattgatcgc caaaatccca
      241 aatttttggg taacaacatt tgtcaaccat ccacaagtgt ctgcactgct tggggaggaa
      301 gatgaagagg cactgcatta tttgaccaga gttgaagtga cagaatttga agatattaaa
      361 tcaggttaca gaatagattt ttattttgat gaaaatcctt actttgaaaa taaagttctc
      421 tccaaagaat ttcatctgaa tgagagtggt gatccatctt cgaagtccac cgaaatcaaa
      481 tggaaatctg gaaaggattt gacgaaacgt tcgagtcaaa cgcagaataa agccagcagg
      541 aagaggcagc atgaggaacc agagagcttc tttacctggt ttactgacca ttctgatgca
      601 ggtgctgatg agttaggaga ggtcatcaaa gatgatattt ggccaaaccc attacagtac
      661 tacttggttc ccgatatgga tgatgaagaa ggagaaggag aagaagatga tgatgatgat
      721 gaagaggagg aaggattaga agatattgac gaagaagggg atgaggatga aggtgaagaa
      781 gatgaagatg atgatgaagg ggaggaagga gaggaggatg aaggagaaga tgactaaata
      841 gaacactgat ggattccaac cttccttttt ttaaattttc tccagtccct gggagcaagt
      901 tgcagtcttt tttttttttt tttttttttt ccctcttgtg ctcagtcgcc ctgttcttga
      961 ggtctctttt ctctactcca tggttctcaa tttatttggg gggaaatacc ttgagcagaa
     1021 tacaatggga aaagagtctc tacccctttc tgttcgaagt tcatttttat cccttcctgt
     1081 ctgaacaaaa actgtatgga atcaacacca ccgagctctg tgggaaaaaa gaaaaacctg
     1141 ctccctttgc tctgctggaa gctggagggt gctaggcccc tgtgtagtag tgtatagaat
     1201 tctagctttt ttcctccttt ctctgtatat tgggctcaga gagtacactg tgtctctatg
     1261 tgaatatgga cagttagcat ttaccaacat gtatctgtct actttctctt gtttaaaaaa
     1321 agaaaaaaaa acttaaaaaa atggggttat agaaggtcag caaaggggtg gggtttgaga
     1381 tgtttgggtg ggttagtggg cattttgaca acatggcttc tcctttggca tgtttaattg
     1441 tgatatttga cagacatcct tgcagtttaa gatgacactt ttaaaataaa ttctctccta
     1501 atgatgactt gagccctgcc actcaatggg agaatcagca gaacctgtag gatcttattt
     1561 ggaattgaca ttctctattg taattttgtt cctgtttatt tttgggtttc tttttgtttc
     1621 actggaaagg aaagatgatg ctcagtttta aacgttaaaa gtgtacaagt tgctttgtta
     1681 caataaaact aaatgtgtac acaaaggatt tgatgctttt ctctcagcat aggtatgctt
     1741 actatgacct tccaagtttg acttgtataa catcactgtc aaactttgtc accctaactt
     1801 cgtatttttt gatacgcact tttgcaggat gacctcaggg ctatgtggat tgagtaatgg
     1861 gatttgaatc aatgtattaa tatctccata gctgggaaac gtgggttcaa tttgccattg
     1921 gtttctgaaa agtattcaca tcatttggga taccagatag ctcaatactc tctgagtaca
     1981 ttgtgccctt gatttttatc tccaagtggc agtttttaaa attggccttt tacctggata
     2041 taaattaatt gtgcctgcca ccaccatcca acagacctgg tgctctaatg ccaagttata
     2101 cacgggacag ttgctggcat gtcttcattg gctctctaaa atgtggccaa gaagataggc
     2161 tctcagtaag aagtctgatg gtgagcagta actgtccctg ctttctggta taaagctctc
     2221 aaatgtgacc atgtgaatct gggtgggata atggactcag ctctgtctgc tcaatgccat
     2281 tgtgcagaga agcaccctaa tgcataagct ttttaatgct gtaaaatata gtcgctgaaa
     2341 ttaaatgcca ctttttcaga ggtgaattaa tggacagtct ggtgaacttc aaaagctttt
     2401 tgatgtataa aacttgataa atggaactat tccatcaata ggcaaaagtg taacaaccta
     2461 tctagatgga tagtatgtaa tttctgcaca ggtctctgtt tagtaaatac atcactgtat
     2521 accgatcagg aatcttgctc caataaagga acataaagat ttaaaaaaaa aaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF165281. Homo sapiens ATP ...[gi:5734100] Links  


LOCUS       AF165281                9497 bp    mRNA    linear   PRI 17-AUG-1999
DEFINITION  Homo sapiens ATP cassette binding transporter 1 (ABC1) mRNA,
            complete cds.
ACCESSION   AF165281
VERSION     AF165281.1  GI:5734100
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 9497)
  AUTHORS   Rust,S., Rosier,M., Funke,H., Real,J., Amoura,Z., Piette,J.C.,
            Deleuze,J.F., Brewer,H.B., Duverger,N., Denefle,P. and Assmann,G.
  TITLE     Tangier disease is caused by mutations in the gene encoding
            ATP-binding cassette transporter 1
  JOURNAL   Nat. Genet. 22 (4), 352-355 (1999)
  MEDLINE   99364413
   PUBMED   10431238
REFERENCE   2  (bases 1 to 9497)
  AUTHORS   Rust,S., Rosier,M., Funke,H., Real,J., Amoura,Z., Piette,J.C.,
            Deleuze,J.F., Brewer,H.B., Duverger,N., Denefle,P. and Assmann,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-JUL-1999) Genomics, Rhone-Poulenc Rorer, 2 rue Gaston
            Cr#mieux, Evry 91006, France
FEATURES             Location/Qualifiers
     source          1..9497
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="9"
                     /map="9q31"
     gene            1..9497
                     /gene="ABC1"
     CDS             121..6726
                     /gene="ABC1"
                     /note="ABC transporter; ABC1 protein"
                     /codon_start=1
                     /product="ATP cassette binding transporter 1"
                     /protein_id="AAD49849.1"
                     /db_xref="GI:5734101"
                     /translation="MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGVVGNFNKSIVAR
                     LFSDARRLLLYSQKDTSMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNL
                     SLPKSTVDKMLRADVILHKVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPRE
                     KLAAAERVLRSNMDILKPILRTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSW
                     SDMRQEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGG
                     NGTEEDAETFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPATRQ
                     VMAEVNKTFQELAVFHDLEGMWEELSPKIWTFMENSQEMDLVRMLLDSRDNDHFWEQQ
                     LDGLDWTAQDIVAFLAKHPEDVQSSNGSVYTWREAFNETNQAIRTISRFMECVNLNKL
                     EPIATEVWLINKSMELLDERKFWAGIVFTGITPGSIELPHHVKYKIRMDIDNVERTNK
                     IKDGYWDPGPRADPFEDMRYVWGGFAYLQDVVEQAIIRVLTGTEKKTGVYMQQMPYPC
                     YVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIVYEKEARLKETMRIMGLDNSILWFS
                     WFISSLIPLLVSAGLLVVILKLGNLLPYSDPSVVFVFLSVFAVVTILQCFLISTLFSR
                     ANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFASLLSPVAFGFGCEYFALFEEQ
                     GIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPWYF
                     PCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVSIQNLVKVYRDGMKVAV
                     DGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSTIRQN
                     LGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPSSKLKSKTSQ
                     LSGGMQRKLSVALAFVGGSKVVILDEPTAGVDPYSRRGIWELLLKYRQGRTIILSTHH
                     MDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSSST
                     VSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHELTY
                     VLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAETSDG
                     TLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVKGW
                     KLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ
                     PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTT
                     APVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADI
                     LQDLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNTQALPPSQEVNDA
                     TKQMKKHLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAISSFLNVINNA
                     ILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMSFVPA
                     SFVVFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYVVPATLVIIIFICFQQKS
                     YVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYVVLTSVNLFIGINGSVATF
                     VLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPL
                     SWDLVGRNLFAMAVEGVVFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRI
                     LDGGGQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLT
                     GDTTVTRGDAFLNRNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPE
                     KEVGKVGEWAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPVVFLDEPTTGMD
                     PKARRFLWNCALSVVKEGRSVVLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNR
                     FGDGYTIVVRIAGSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFS
                     ILSQSKKRLHIEDYSVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTVVDVAVLTSF
                     LQDEKVKESYV"
BASE COUNT     2600 a   2115 c   2217 g   2564 t      1 others
ORIGIN      
        1 caaacatgtc agctgttact ggaagtggcc tggcctctat ttatcttcct gatcctgatc
       61 tctgttcggc tgagctaccc accctatgaa caacatgaat gccattttcc aaataaagcc
      121 atgccctctg caggaacact tccttgggtt caggggatta tctgtaatgc caacaacccc
      181 tgtttccgtt acccgactcc tggggaggct cccggagttg ttggaaactt taacaaatcc
      241 attgtggctc gcctgttctc agatgctcgg aggcttcttt tatacagcca gaaagacacc
      301 agcatgaagg acatgcgcaa agttctgaga acattacagc agatcaagaa atccagctca
      361 aacttgaagc ttcaagattt cctggtggac aatgaaacct tctctgggtt cctgtatcac
      421 aacctctctc tcccaaagtc tactgtggac aagatgctga gggctgatgt cattctccac
      481 aaggtatttt tgcaaggcta ccagttacat ttgacaagtc tgtgcaatgg atcaaaatca
      541 gaagagatga ttcaacttgg tgaccaagaa gtttctgagc tttgtggcct accaagggag
      601 aaactggctg cagcagagcg agtacttcgt tccaacatgg acatcctgaa gccaatcctg
      661 agaacactaa actctacatc tcccttcccg agcaaggagc tggccgaagc cacaaaaaca
      721 ttgctgcata gtcttgggac tctggcccag gagctgttca gcatgagaag ctggagtgac
      781 atgcgacagg aggtgatgtt tctgaccaat gtgaacagct ccagctcctc cacccaaatc
      841 taccaggctg tgtctcgtat tgtctgcggg catcccgagg gaggggggct gaagatcaag
      901 tctctcaact ggtatgagga caacaactac aaagccctct ttggaggcaa tggcactgag
      961 gaagatgctg aaaccttcta tgacaactct acaactcctt actgcaatga tttgatgaag
     1021 aatttggagt ctagtcctct ttcccgcatt atctggaaag ctctgaagcc gctgctcgtt
     1081 gggaagatcc tgtatacacc tgacactcca gccacaaggc aggtcatggc tgaggtgaac
     1141 aagaccttcc aggaactggc tgtgttccat gatctggaag gcatgtggga ggaactcagc
     1201 cccaagatct ggaccttcat ggagaacagc caagaaatgg accttgtccg gatgctgttg
     1261 gacagcaggg acaatgacca cttttgggaa cagcagttgg atggcttaga ttggacagcc
     1321 caagacatcg tggcgttttt ggccaagcac ccagaggatg tccagtccag taatggttct
     1381 gtgtacacct ggagagaagc tttcaacgag actaaccagg caatccggac catatctcgc
     1441 ttcatggagt gtgtcaacct gaacaagcta gaacccatag caacagaagt ctggctcatc
     1501 aacaagtcca tggagctgct ggatgagagg aagttctggg ctggtattgt gttcactgga
     1561 attactccag gcagcattga gctgccccat catgtcaagt acaagatccg aatggacatt
     1621 gacaatgtgg agaggacaaa taaaatcaag gatgggtact gggaccctgg tcctcgagct
     1681 gacccctttg aggacatgcg gtacgtctgg gggggcttcg cctacttgca ggatgtggtg
     1741 gagcaggcaa tcatcagggt gctgacgggc accgagaaga aaactggtgt ctatatgcaa
     1801 cagatgccct atccctgtta cgttgatgac atctttctgc gggtgatgag ccggtcaatg
     1861 cccctcttca tgacgctggc ctggatttac tcagtggctg tgatcatcaa gggcatcgtg
     1921 tatgagaagg aggcacggct gaaagagacc atgcggatca tgggcctgga caacagcatc
     1981 ctctggttta gctggttcat tagtagcctc attcctcttc ttgtgagcgc tggcctgcta
     2041 gtggtcatcc tgaagttagg aaacctgctg ccctacagtg atcccagcgt ggtgtttgtc
     2101 ttcctgtccg tgtttgctgt ggtgacaatc ctgcagtgct tcctgattag cacactcttc
     2161 tccagagcca acctggcagc agcctgtggg ggcatcatct acttcacgct gtacctgccc
     2221 tacgtcctgt gtgtggcatg gcaggactac gtgggcttca cactcaagat cttcgctagc
     2281 ctgctgtctc ctgtggcttt tgggtttggc tgtgagtact ttgccctttt tgaggagcag
     2341 ggcattggag tgcagtggga caacctgttt gagagtcctg tggaggaaga tggcttcaat
     2401 ctcaccactt cggtctccat gatgctgttt gacaccttcc tctatggggt gatgacctgg
     2461 tacattgagg ctgtctttcc aggccagtac ggaattccca ggccctggta ttttccttgc
     2521 accaagtcct actggtttgg cgaggaaagt gatgagaaga gccaccctgg ttccaaccag
     2581 aagagaatat cagaaatctg catggaggag gaacccaccc acttgaagct gggcgtgtcc
     2641 attcagaacc tggtaaaagt ctaccgagat gggatgaagg tggctgtcga tggcctggca
     2701 ctgaattttt atgagggcca gatcacctcc ttcctgggcc acaatggagc ggggaagacg
     2761 accaccatgt caatcctgac cgggttgttc cccccgacct cgggcaccgc ctacatcctg
     2821 ggaaaagaca ttcgctctga gatgagcacc atccggcaga acctgggggt ctgtccccag
     2881 cataacgtgc tgtttgacat gctgactgtc gaagaacaca tctggttcta tgcccgcttg
     2941 aaagggctct ctgagaagca cgtgaaggcg gagatggagc agatggccct ggatgttggt
     3001 ttgccatcaa gcaagctgaa aagcaaaaca agccagctgt caggtggaat gcagagaaag
     3061 ctatctgtgg ccttggcctt tgtcggggga tctaaggttg tcattctgga tgaacccaca
     3121 gctggtgtgg acccttactc ccgcagggga atatgggagc tgctgctgaa ataccgacaa
     3181 ggccgcacca ttattctctc tacacaccac atggatgaag cggacgtcct gggggacagg
     3241 attgccatca tctcccatgg gaagctgtgc tgtgtgggct cctccctgtt tctgaagaac
     3301 cagctgggaa caggctacta cctgaccttg gtcaagaaag atgtggaatc ctccctcagt
     3361 tcctgcagaa acagtagtag cactgtgtca tacctgaaaa aggaggacag tgtttctcag
     3421 agcagttctg atgctggcct gggcagcgac catgagagtg acacgctgac catcgatgtc
     3481 tctgctatct ccaacctcat caggaagcat gtgtctgaag cccggctggt ggaagacata
     3541 gggcatgagc tgacctatgt gctgccatat gaagctgcta aggagggagc ctttgtggaa
     3601 ctctttcatg agattgatga ccggctctca gacctgggca tttctagtta tggcatctca
     3661 gagacgaccc tggaagaaat attcctcaag gtggccgaag agagtggggt ggatgctgag
     3721 acctcagatg gtaccttgcc agcaagacga aacaggcggg ccttcgggga caagcagagc
     3781 tgtcttcgcc cgttcactga agatgatgct gctgatccaa atgattctga catagaccca
     3841 gaatccagag agacagactt gctcagtggg atggatggca aagggtccta ccaggtgaaa
     3901 ggctggaaac ttacacagca acagtttgtg gcccttttgt ggaagagact gctaattgcc
     3961 agacggagtc ggaaaggatt ttttgctcag attgtcttgc cagctgtgtt tgtctgcatt
     4021 gcccttgtgt tcagcctgat cgtgccaccc tttggcaagt accccagcct ggaacttcag
     4081 ccctggatgt acaacgaaca gtacacattt gtcagcaatg atgctcctga ggacacggga
     4141 accctggaac tcttaaacgc cctcaccaaa gaccctggct tcgggacccg ctgtatggaa
     4201 ggaaacccaa tcccagacac gccctgccag gcaggggagg aagagtggac cactgcccca
     4261 gttccccaga ccatcatgga cctcttccag aatgggaact ggacaatgca gaacccttca
     4321 cctgcatgcc agtgtagcag cgacaaaatc aagaagatgc tgcctgtgtg tcccccaggg
     4381 gcaggggggc tgcctcctcc acaaagaaaa caaaacactg cagatatcct tcaggacctg
     4441 acaggaagaa acatttcgga ttatctggtg aagacgtatg tgcagatcat agccaaaagc
     4501 ttaaagaaca agatctgggt gaatgagttt aggtatggcg gcttttccct gggtgtcagt
     4561 aatactcaag cacttcctcc gagtcaagaa gttaatgatg ccaccaaaca aatgaagaaa
     4621 cacctaaagc tggccaagga cagttctgca gatcgatttc tcaacagctt gggaagattt
     4681 atgacaggac tggacaccag aaataatgtc aaggtgtggt tcaataacaa gggctggcat
     4741 gcaatcagct ctttcctgaa tgtcatcaac aatgccattc tccgggccaa cctgcaaaag
     4801 ggagagaacc ctagccatta tggaattact gctttcaatc atcccctgaa tctcaccaag
     4861 cagcagctct cagaggtggc tccgatgacc acatcagtgg atgtccttgt gtccatctgt
     4921 gtcatctttg caatgtcctt cgtcccagcc agctttgtcg tattcctgat ccaggagcgg
     4981 gtcagcaaag caaaacacct gcagttcatc agtggagtga agcctgtcat ctactggctc
     5041 tctaattttg tctgggatat gtgcaattac gttgtccctg ccacactggt cattatcatc
     5101 ttcatctgct tccagcagaa gtcctatgtg tcctccacca atctgcctgt gctagccctt
     5161 ctacttttgc tgtatgggtg gtcaatcaca cctctcatgt acccagcctc ctttgtgttc
     5221 aagatcccca gcacagccta tgtggtgctc accagcgtga acctcttcat tggcattaat
     5281 ggcagcgtgg ccacctttgt gctggagctg ttcaccgaca ataagctgaa taatatcaat
     5341 gatatcctga agtccgtgtt cttgatcttc ccacattttt gcctgggacg agggctcatc
     5401 gacatggtga aaaaccaggc aatggctgat gccctggaaa ggtttgggga gaatcgcttt
     5461 gtgtcaccat tatcttggga cttggtggga cgaaacctct tcgccatggc cgtggaaggg
     5521 gtggtgttct tcctcattac tgttctgatc cagtacagat tcttcatcag gcccagacct
     5581 gtaaatgcaa agctatctcc tctgaatgat gaagatgaag atgtgaggcg ggaaagacag
     5641 agaattcttg atggtggagg ccagaatgac atcttagaaa tcaaggagtt gacgaagata
     5701 tatagaagga agcggaagcc tgctgttgac aggatttgcg tgggcattcc tcctggtgag
     5761 tgctttgggc tcctgggagt taatggggct ggaaaatcat caactttcaa gatgttaaca
     5821 ggagatacca ctgttaccag aggagatgct ttccttaaca gaaatagtat cttatcaaac
     5881 atccatgaag tacatcagaa catgggctac tgccctcagt ttgatgccat cacagagctg
     5941 ttgactggga gagaacacgt ggagttcttt gcccttttga gaggagtccc agagaaagaa
     6001 gttggcaagg ttggtgagtg ggcgattcgg aaactgggcc tcgtgaagta tggagaaaaa
     6061 tatgctggta actatagtgg aggcaacaaa cgcaagctct ctacagccat ggctttgatc
     6121 ggcgggcctc ctgtggtgtt tctggatgaa cccaccacag gcatggatcc caaagcccgg
     6181 cggttcttgt ggaattgtgc cctaagtgtt gtcaaggagg ggagatcagt agtgcttaca
     6241 tctcatagta tggaagaatg tgaagctctt tgcactagga tggcaatcat ggtcaatgga
     6301 aggttcaggt gccttggcag tgtccagcat ctaaaaaata ggtttggaga tggttataca
     6361 atagttgtac gaatagcagg gtccaacccg gacctgaagc ctgtccagga tttctttgga
     6421 cttgcatttc ctggaagtgt tccaaaagag aaacaccgga acatgctaca ataccagctt
     6481 ccatcttcat tatcttctct ggccaggata ttcagcatcc tctcccagag caaaaagcga
     6541 ctccacatag aagactactc tgtttctcag acaacacttg accaagtatt tgtgaacttt
     6601 gccaaggacc aaagtgatga tgaccactta aaagacctct cattacacaa aaaccagaca
     6661 gtagtggacg ttgcagttct cacatctttt ctacaggatg agaaagtgaa agaaagctat
     6721 gtatgaagaa tcctgttcat acggggtggc tgaaagtaaa gaggnactag actttccttt
     6781 gcaccatgtg aagtgttgtg gagaaaagag ccagaagttg atgtgggaag aagtaaactg
     6841 gatactgtac tgatactatt caatgcaatg caattcaatg caatgaaaac aaaattccat
     6901 tacaggggca gtgcctttgt agcctatgtc ttgtatggct ctcaagtgaa agacttgaat
     6961 ttagtttttt acctatacct atgtgaaact ctattatgga acccaatgga catatgggtt
     7021 tgaactcaca cttttttttt ttttttgttc ctgtgtattc tcattggggt tgcaacaata
     7081 attcatcaag taatcatggc cagcgattat tgatcaaaat caaaaggtaa tgcacatcct
     7141 cattcactaa gccatgccat gcccaggaga ctggtttccc ggtgacacat ccattgctgg
     7201 caatgagtgt gccagagtta ttagtgccaa gtttttcaga aagtttgaag caccatggtg
     7261 tgtcatgctc acttttgtga aagctgctct gctcagagtc tatcaacatt gaatatcagt
     7321 tgacagaatg gtgccatgcg tggctaacat cctgctttga ttccctctga taagctgttc
     7381 tggtggcagt aacatgcaac aaaaatgtgg gtgtctctag gcacgggaaa cttggttcca
     7441 ttgttatatt gtcctatgct tcgagccatg ggtctacagg gtcatcctta tgagactctt
     7501 aaatatactt agatcctggt aagaggcaaa gaatcaacag ccaaactgct ggggctgcaa
     7561 gctgctgaag ccagggcatg ggattaaaga gattgtgcgt tcaaacctag ggaagcctgt
     7621 gcccatttgt cctgactgtc tgctaacatg gtacactgca tctcaagatg tttatctgac
     7681 acaagtgtat tatttctggc tttttgaatt aatctagaaa atgaaaagat ggagttgtat
     7741 tttgacaaaa atgtttgtac tttttaatgt tatttggaat tttaagttct atcagtgact
     7801 tctgaatcct tagaatggcc tctttgtaga accctgtggt atagaggagt atggccactg
     7861 ccccactatt tttattttct tatgtaagtt tgcatatcag tcatgactag tgcctagaaa
     7921 gcaatgtgat ggtcaggatc tcatgacatt atatttgagt ttctttcaga tcatttagga
     7981 tactcttaat ctcacttcat caatcaaata ttttttgagt gtatgctgta gctgaaagag
     8041 tatgtacgta cgtataagac tagagagata ttaagtctca gtacacttcc tgtgccatgt
     8101 tattcagctc actggtttac aaatataggt tgtcttgtgg ttgtaggagc ccactgtaac
     8161 aatactgggc agcctttttt ttttttttta attgcaacaa tgcaaaagcc aagaaagtat
     8221 aagggtcaca agtctaaaca atgaattctt caacagggaa aacagctagc ttgaaaactt
     8281 gctgaaaaac acaacttgtg tttatggcat ttagtacctt caaataattg gctttgcaga
     8341 tattggatac cccattaaat ctgacagtct caaatttttc atctcttcaa tcactagtca
     8401 agaaaaatat aaaaacaaca aatacttcca tatggagcat ttttcagagt tttctaaccc
     8461 agtcttattt ttctagtcag taaacatttg taaaaatact gtttcactaa tacttactgt
     8521 taactgtctt gagagaaaag aaaaatatga gagaactatt gtttggggaa gttcaagtga
     8581 tctttcaata tcattactaa cttcttccac tttttccaaa atttgaatat taacgctaaa
     8641 ggtgtaagac ttcagatttc aaattaatct ttctatattt tttaaattta cagaatatta
     8701 tataacccac tgctgaaaaa gaaaaaaatg attgttttag aagttaaagt caatattgat
     8761 tttaaatata agtaatgaag gcatatttcc aataactagt gatatggcat cgttgcattt
     8821 tacagtatct tcaaaaatac agaatttata gaataatttc tcctcattta atatttttca
     8881 aaatcaaagt tatggtttcc tcattttact aaaatcgtat tctaattctt cattatagta
     8941 aatctatgag caactcctta cttcggttcc tctgatttca aggccatatt ttaaaaaatc
     9001 aaaaggcact gtgaactatt ttgaagaaaa cacaacattt taatacagat tgaaaggacc
     9061 tcttctgaag ctagaaacaa tctatagtta tacatcttca ttaatactgt gttacctttt
     9121 aaaatagtaa ttttttacat tttcctgtgt aaacctaatt gtggtagaaa tttttaccaa
     9181 ctctatactc aatcaagcaa aatttctgta tattccctgt ggaatgtacc tatgtgagtt
     9241 tcagaaattc tcaaaatacg tgttcaaaaa tttctgcttt tgcatctttg ggacacctca
     9301 gaaaacttat taacaactgt gaatatgaga aatacagaag aaaataataa gccctctata
     9361 cataaatgcc cagcacaatt cattgttaaa aaacaaccaa acctcacact actgtatttc
     9421 attatctgta ctgaaagcaa atgctttgtg actattaaat gttgcacatc attcattcaa
     9481 aaaaaaaaaa aaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF275948. Homo sapiens ABCA...[gi:9247085] Links  


LOCUS       AF275948              149034 bp    DNA     linear   PRI 17-JUL-2000
DEFINITION  Homo sapiens ABCA1 (ABCA1) gene, complete cds.
ACCESSION   AF275948
VERSION     AF275948.1  GI:9247085
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 149034)
  AUTHORS   Santamarina-Fojo,S., Peterson,K., Knapper,C., Qiu,Y., Freeman,L.,
            Cheng,J.F., Osorio,J., Remaley,A., Yang,X.P., Haudenschild,C.,
            Prades,C., Chimini,G., Blackmon,E., Francois,T., Duverger,N.,
            Rubin,E.M., Rosier,M., Denefle,P., Fredrickson,D.S. and Brewer,H.B.
            Jr.
  TITLE     Complete genomic sequence of the human ABCA1 gene: analysis of the
            human and mouse ATP-binding cassette A promoter
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 97 (14), 7987-7992 (2000)
  MEDLINE   20345099
   PUBMED   10884428
REFERENCE   2  (bases 1 to 149034)
  AUTHORS   Santamarina-Fojo,S., Peterson,K.M., Knapper,C.L., Freeman,L.A.,
            Remaley,A.T., Yang,X.-P., Haudenschild,C.C., Blackmon,E.E.,
            Francois,T.L. and Brewer,H.B. Jr.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-JUN-2000) Molecular Disease Branch, National
            Institutes of Heath, National Heart, Lung and Blood Institute,
            Bethesda, MD 20892, USA
FEATURES             Location/Qualifiers
     source          1..149034
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1454..148034
                     /gene="ABCA1"
     mRNA            join(1454..1674,25831..25989,40385..40478,45012..45153,
                     46423..46541,67729..67850,70831..71007,83960..84052,
                     89010..89250,91962..92101,92433..92549,96758..96955,
                     97702..97907,98428..98604,100391..100613,102360..102581,
                     103642..103846,104951..105064,106862..107033,
                     108023..108154,109460..109602,109806..109943,
                     110646..110866,112124..112196,113183..113385,
                     115053..115101,115297..115410,116806..116954,
                     118604..118728,119964..120062,123094..123283,
                     124804..124898,126208..126240,127363..127468,
                     128943..129017,129539..129708,130939..131116,
                     133114..133229,133341..133485,134526..134649,
                     135736..135865,136130..136250,137038..137100,
                     138008..138114,140469..140610,140982..141116,
                     142060..142163,142646..142738,143397..143640,
                     144581..148034)
                     /gene="ABCA1"
                     /product="ABCA1"
     5'UTR           join(1454..1674,25831..25923)
                     /gene="ABCA1"
     repeat_region   2406..2678
                     /rpt_family="Alu"
     repeat_region   4240..4509
                     /rpt_family="Alu"
     repeat_region   4998..5270
                     /rpt_family="Alu"
     repeat_region   5563..5839
                     /rpt_family="Alu"
     repeat_region   6615..6877
                     /rpt_family="Alu"
     repeat_region   8800..9046
                     /rpt_family="Alu"
     repeat_region   9998..10279
                     /rpt_family="Alu"
     repeat_region   11859..12133
                     /rpt_family="Alu"
     repeat_region   12810..12902
                     /rpt_family="Alu"
     repeat_region   15220..15399
                     /rpt_family="Alu"
     repeat_region   15508..15787
                     /rpt_family="Alu"
     repeat_region   18600..18860
                     /rpt_family="Alu"
     repeat_region   20950..21206
                     /rpt_family="Alu"
     repeat_region   25038..25314
                     /rpt_family="Alu"
     CDS             join(25924..25989,40385..40478,45012..45153,46423..46541,
                     67729..67850,70831..71007,83960..84052,89010..89250,
                     91962..92101,92433..92549,96758..96955,97702..97907,
                     98428..98604,100391..100613,102360..102581,103642..103846,
                     104951..105064,106862..107033,108023..108154,
                     109460..109602,109806..109943,110646..110866,
                     112124..112196,113183..113385,115053..115101,
                     115297..115410,116806..116954,118604..118728,
                     119964..120062,123094..123283,124804..124898,
                     126208..126240,127363..127468,128943..129017,
                     129539..129708,130939..131116,133114..133229,
                     133341..133485,134526..134649,135736..135865,
                     136130..136250,137038..137100,138008..138114,
                     140469..140610,140982..141116,142060..142163,
                     142646..142738,143397..143640,144581..144721)
                     /gene="ABCA1"
                     /codon_start=1
                     /product="ABCA1"
                     /protein_id="AAF86276.1"
                     /db_xref="GI:9247086"
                     /translation="MACWPQLRLLLWKNLTFRRRQTCQLLLEVAWPLFIFLILISVRL
                     SYPPYEQHECHFPNKAMPSAGTLPWVQGIICNANNPCFRYPTPGEAPGVVGNFNKSIV
                     ARLFSDARRLLLYSQKDTSMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYH
                     NLSLPKSTVDKMLRADVILHKVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLP
                     REKLAAAERVLRSNMDILKPILRTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMR
                     SWSDMRQEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALF
                     GGNGTEEDAETFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPAT
                     RQVMAEVNKTFQELAVFHDLEGMWEELSPKIWTFMENSQEMDLVRMLLDSRDNDHFWE
                     QQLDGLDWTAQDIVAFLAKHPEDVQSSNGSVYTWREAFNETNQAIRTISRFMECVNLN
                     KLEPIATEVWLINKSMELLDERKFWAGIVFTGITPGSIELPHHVKYKIRMDIDNVERT
                     NKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDVVEQAIIRVLTGTEKKTGVYMQQMPY
                     PCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIVYEKEARLKETMRIMGLDNSILW
                     FSWFISSLIPLLVSAGLLVVILKLGNLLPYSDPSVVFVFLSVFAVVTILQCFLISTLF
                     SRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFAXLLSPVAFGFGCEYFALFE
                     EQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPW
                     YFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVSIQNLVKVYRDGMKV
                     AVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSTIR
                     QNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPSSKLKSKT
                     SQLSGGMQRKLSVALAFVGGSKVVILDEPTAGVDPYSRRGIWELLLKYRQGRTIILST
                     HHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSS
                     STVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHEL
                     TYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAETS
                     DGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVK
                     GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLE
                     LQPWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEW
                     TTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTA
                     DILQDLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNTQALPPSQEVN
                     DAXKQMKKHLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAISSFLNVIN
                     NAILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAXMTTSVDVLVSICVIFAMSFV
                     PASFVVFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYVVPATLVIIIFICFQQ
                     KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYVVLTSVNLFIGINGSVA
                     TFVLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVS
                     PLSWDLVGRNLFAMAVEGVVFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQ
                     RILDGGGQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKM
                     LTGDTTVTRGDAFLNXNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGV
                     PEKEVGKVGEWAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPVVFLDEPTTG
                     MDPKARRFLWNCALSVVKEGRSVVLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLK
                     NRFGDGYTIVVRIAGSNPDLKPVQDFFGLAFPGSVXKEKHRNMLQYQLPSSLSSLARI
                     FSILSQSKKRLHIEDYSVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTVVDVAVLT
                     SFLQDEKVKESYV"
     repeat_region   29916..30180
                     /rpt_family="Alu"
     repeat_region   31050..31314
                     /rpt_family="Alu"
     repeat_region   31692..31973
                     /rpt_family="Alu"
     repeat_region   36167..36440
                     /rpt_family="Alu"
     repeat_region   37617..37892
                     /rpt_family="Alu"
     repeat_region   37999..38166
                     /rpt_family="Alu"
     repeat_region   38172..38269
                     /rpt_family="Alu"
     repeat_region   38995..39271
                     /rpt_family="Alu"
     STS             41547..41565
                     /gene="ABCA1"
                     /note="A009X28"
     repeat_region   41593..41835
                     /rpt_family="Alu"
     STS             41657..41679
                     /gene="ABCA1"
                     /note="A009X28"
     repeat_region   42320..42584
                     /rpt_family="Alu"
     repeat_region   44349..44626
                     /rpt_family="Alu"
     repeat_region   48473..48669
                     /rpt_family="Alu"
     repeat_region   50547..50614
                     /rpt_family="Alu"
     repeat_region   54285..54418
                     /rpt_family="Alu"
     repeat_region   54849..54988
                     /rpt_family="Alu"
     repeat_region   55029..55226
                     /rpt_family="Alu"
     repeat_region   55297..55344
                     /rpt_family="Alu"
     repeat_region   58501..58935
                     /note="LTR"
                     /rpt_family="HERV"
     repeat_region   61246..61489
                     /rpt_family="Alu"
     repeat_region   62812..63077
                     /rpt_family="Alu"
     repeat_region   64082..64306
                     /rpt_family="Alu"
     repeat_region   67123..67382
                     /rpt_family="Alu"
     repeat_region   68499..68731
                     /rpt_family="Alu"
     repeat_region   69481..69760
                     /rpt_family="Alu"
     repeat_region   71998..72233
                     /rpt_family="Alu"
     repeat_region   72244..72501
                     /rpt_family="Alu"
     repeat_region   74051..74288
                     /rpt_family="Alu"
     repeat_region   79108..79388
                     /rpt_family="Alu"
     repeat_region   80915..81196
                     /rpt_family="Alu"
     repeat_region   88098..88375
                     /rpt_family="Alu"
     repeat_region   90913..91197
                     /rpt_family="Alu"
     repeat_region   91226..91293
                     /rpt_family="Alu"
     repeat_region   91338..91485
                     /rpt_family="Alu"
     repeat_region   94304..94562
                     /rpt_family="Alu"
     repeat_region   96058..96308
                     /rpt_family="Alu"
     repeat_region   98156..98294
                     /rpt_family="Alu"
     repeat_region   98840..99121
                     /rpt_family="Alu"
     repeat_region   108310..108555
                     /rpt_family="Alu"
     repeat_region   114538..114808
                     /rpt_family="Alu"
     repeat_region   115816..116088
                     /rpt_family="Alu"
     repeat_region   121712..121989
                     /rpt_family="Alu"
     repeat_region   123420..123701
                     /rpt_family="Alu"
     repeat_region   124156..124414
                     /rpt_family="Alu"
     repeat_region   125618..125891
                     /rpt_family="Alu"
     repeat_region   127679..127963
                     /rpt_family="Alu"
     repeat_region   128003..128284
                     /rpt_family="Alu"
     STS             130046..130067
                     /gene="ABCA1"
                     /note="D9S53"
     STS             130162..130181
                     /gene="ABCA1"
                     /note="D9S53"
     repeat_region   137312..137532
                     /rpt_family="Alu"
     repeat_region   138940..139224
                     /rpt_family="Alu"
     repeat_region   144093..144352
                     /rpt_family="Alu"
     3'UTR           144716..148034
                     /gene="ABCA1"
BASE COUNT    39077 a  31132 c  33597 g  45221 t      7 others
ORIGIN      
        1 cctggagatc ctgttgactg tagcatggag ggggcttgtg cagctgaatg tctgcatgca
       61 ggtggtggga gttctggaat atgatggagc tggaggtggg aagagaagta ggcttggggc
      121 agctctctca tgccacctca ttctggccaa aactcaggtc aaactgtgaa gagtctaaat
      181 gtgaatctgc ccttcaaggt ggctacaaag gtatctttgt caaggtagga gaccttgtgg
      241 cctccacgtg cacttccagg gcctgcttgg gcctcttcta cgggtctgtc ctgagtcttc
      301 tatgaatctg tccttcaggg cagattcata tttagactct tcacagtttg acctgagttt
      361 tggccagaat aaggtgacat ttagtttgtt ggcttgatgg atgacttaaa tatttagaca
      421 tggtgtgtag gcctgcattc ctactcttgc cttttttttt gcccctccag tgttttgggt
      481 agttttgctc ccctacagcc aaaggcaaac agagaagttg gaggtctgga gtggctacat
      541 aattttacac gactgcaatt ctctggctgc acttcacaaa tgtatacaaa ctaaatacaa
      601 gtcctgtgtt tttatcacag ggaggctgat caatataatg aaattaaaag ggggctggtc
      661 catattgttc tgtgtttttg tttgtttgtt ttgtttgttt ctttttttgt ttttgtggcc
      721 tccttcctct caatttatga agagaagcag taagatgttc ctctcgggtc ctctgaggga
      781 cctggggagc tcaggctggg aatctccaag gcagtaggtc gcctatcaaa aatcaaagtc
      841 caggtttgtg gggggaaaac aaaagcagcc cattacccag aggactgtcc gccttcccct
      901 caccccagcc taggcctttg aaaggaaaca aaagacaaga caaaatgatt ggcgtcctga
      961 gggagattca gcctagagct ctctctcccc caatccctcc ctccggctga ggaaactaac
     1021 aaaggaaaaa aaaattgcgg aaagcaggat ttagaggaag caaattccac tggtgccctt
     1081 ggctgccggg aacgtggact agagagtctg cggcgcagcc ccgagcccag cgcttcccgc
     1141 gcgtcttagg ccggcgggcc cgggcggggg aaggggacgc agaccgcgga ccctaagaca
     1201 cctgctgtac cctccacccc caccccaccc cacccacctc cccccaactc cctagatgtg
     1261 tcgtgggcgg ctgaacgtcg cccgtttaag gggcgggccc cggctccacg tgctttctgc
     1321 tgagtgactg aactacataa acagaggccg ggaagggggc ggggaggagg gagagcacag
     1381 gctttgaccg atagtaacct ctgcgctcgg tgcagccgaa tctataaaag gaactagtcc
     1441 cggcaaaaac cccgtaattg cgagcgagag tgagtggggc cgggacccgc agagccgagc
     1501 cgacccttct ctcccgggct gcggcagggc agggcgggga gctccgcgca ccaacagagc
     1561 cggttctcag ggcgctttgc tccttgtttt ttccccggtt ctgttttctc cccttctccg
     1621 gaaggcttgt caaggggtag gagaaagaga cgcaaacaca aaagtggaaa acaggtaaga
     1681 ggctctccag tgacttactt gggcgttatt gttttgtttc gaggccaagg aggcttcggg
     1741 aagtgctcgg tttcggggac tttgatccgg agccccacat ccccaccact tgcaactcag
     1801 atgggaccgg aggcggtgtt aaatggggag acgatgtcct agtacgagct ctggtgaccc
     1861 caggactctg cgctgctgcg cttggggctt gcccgacggt ggagaccggg gagcatctct
     1921 gggcgtggag acccgggcgc agtaccccgg gctcagaggg gtcgggggtt cccgggcgtg
     1981 ctgagggcgc tgctgccggg tggggagagc tgcaggtccg gcaccgagcg ctgctttgtt
     2041 cggagggccc tgagctggct agaaaccctt ctggttgcag gtcggccagt acctacggag
     2101 acaaatgcca gcactgagtc ttcactcggt tcttaagaag ctggtctgtt ctgacctggg
     2161 aattggctat atgctccccg ggactggagc ggcacagtcc cggactgtga atccgggaac
     2221 tcgagttgga ggtgtcccaa acggtccgtg gtgctattgc tcactagagg ccttgggtct
     2281 ttgtttgacc tgaggggtag ggaggtcctg cctacagtct ccgtgcgctc agctgagctg
     2341 gtgtccctgg cgcagagcgc ggacgagttt tgtttccttt ctttttcttt tttttctttc
     2401 ctttaagtct cggtctgtcg cccaggctgg agtgcaatgg aacgatctcc gctcactgca
     2461 acctccgcct cccgggttca agcgattctc ctgcctcagc ctcctgagta gctgggatta
     2521 caggcgcgtc accacatcca gctaattttt gtatttttag tagagacggg gtttcaactt
     2581 gttggccagg ctggtctcga accctcgacc tcaggtgatc caccggcctc ggcctcccca
     2641 agtgctggga ttataggcgt gagccaccgc gcgcggccga gttttgtttc ttttaaaaac
     2701 aagacttagg agagcctgcg gagacccgga ggtggggtgc ccaatcctcc ctctcccacg
     2761 ttccctgcag ccccatcttc cagaccgttg ctgctggtct ctcggggcag cttctgcctg
     2821 ggcgcagatg gggaagctgg gccgaggtgg tgccgtggaa tgaccgggag taaccccggc
     2881 gggcggcgca gaactcggag ctccgccgcg gggctgggct gggctctgcc gtgagggtgg
     2941 gggtgctggg cgcgcgggct gcggtggccc ccggagactg gcccgcagcg cctcctggcc
     3001 ggaggaccta ggaatcggcc ggctctacta ggtgtctttg ctcgcggttc cgactgtgaa
     3061 tccggtgaag accggtggtt gcagacgggg aggaactatg aggttgaggc gaaagcccgt
     3121 tttgtttttt tttttttgtt tttttggttt tttttttgtt agcgtgtttg ccaactccca
     3181 ggccattggt aaagcaggaa ggttcttggg gcggcggacg gtgccagggt tatgtgtagg
     3241 tgcctcttta ggtatatctt ttatcaaaaa gaagcaaaga aataagatta aaaataaaca
     3301 aagaaaaaag ttgtctggca ctggcagtaa ttggcctgcc tttgcagcac tgataccatt
     3361 agcttttaaa atccgacttt tcattgacac ttcaagaaga gaatgggtag tatatacaca
     3421 ttcatctcat agtggacaaa tttcatattt aaaaaaacct tctgggtact gaaatcagca
     3481 agtcacttgc cctccatggc cgaatccctg cttccacgaa gagaacctca caaaaatttc
     3541 ccccaagtta aagagtggaa ttttcttgat ttttttgttc tttttttttt aacggccgta
     3601 gtttagaacc ccagacttaa attatgatct tcttttcaaa caaaacttaa agtccttaag
     3661 ttttcatctc cccttttatt tcaacctatt cttctcatac ctaccacaaa aataatggag
     3721 gctttctgtt gagaaacttt ccgtttctgt tgagagtatc attctcttga gaaactttct
     3781 cctaaatcag agaaagtatg gaagcatgga aagtattcct gagtagaacc tctacagata
     3841 ttacaatatt tttcaaatac aaagtttcca ttgtcagcct gtttcccaag tgcttccaca
     3901 aaccattaaa taattccaca aaccattaaa ataattaatg ctagggaatt ttaggaaaac
     3961 attggtttac aatcagaagg accggggaag tgggtcttca gccttcacga tgactacaag
     4021 ccatttaagg gactagaatt gctactgttg tcagagcaat ttaggagtct gtatttgagc
     4081 acccgcatag tgttccagaa tgacatatct gactgtaacc tggacacgtg tgatatgttg
     4141 tctcccctgc agatgagcat ttgaaatctc aaccctcgta tttctacgag tgcaggccta
     4201 taatggaccc tgggcacatt tttttttttt ttgagatgca gtctcgctct gtttcccagg
     4261 ctggagtgca gtggcacgat atggctcact gcaacctcca cctcctgggt tcaagagatt
     4321 ctcctgcctc agcctcctga gtagctggga ctataggcgc acgccaccat gcctggctag
     4381 tttttgcatt ttcagtagag acagagtttc accatgttgg ccaggatggt ctcgatttcc
     4441 tgacctcttg atccacccgc ctcggcctcc caaagtgctg ggattacagg cgtgaggcac
     4501 cgcgcccgac ccctggacac attttgactt agaacatatt ttcggtttgt gtgagacagt
     4561 gcattagtgc aggattggaa aagagtgatc aggaattgat tgttttcaag gattggttcc
     4621 ttctgctcaa ggaagtccca ttgtaaacat aaaaaaatga atgaaactga agaagttcag
     4681 tgacttagct ttttattatc ttctgtagta cttacctttt tggagaggag ttggttggga
     4741 tatttttcca tttaaatttt ttttttaaag ggatcttctc tcccgtaagc cgggatactt
     4801 aagctatata tgtagtggct acaaattaag gtcttcactg ttttcatttt tagctgctag
     4861 aataagtgaa cattacctta gatagactct tctaattatg aagatatcta gatgtctaga
     4921 aaatatcaaa atgcatgtgg tttttgcatt tctaaaatac ttttaaaacc aaatactttt
     4981 tctttttttt ttttttctga gatggagtct tgctcttttg cctaggctgg agcgcagtag
     5041 aatgatcttg gctcactgca acttccgcct ctcaggttca ggtgattctc ctgcctcaac
     5101 ctcctgagta gctgggatta caggtgcgtg ccaccacacc cggctagttt ttgtgttttt
     5161 agtagggaca gggtttcacc atgttggcca ggctggtctc aagctcctga cctcaagtga
     5221 tctgccagcc tcagcctccc aaagtgctgg gattacaggc atgagccacc acacctggcc
     5281 tcaaatacat ttttttaagt atccagatat taaataaata ataccattat agtagttgtt
     5341 atggtcattt actctagcat caaatgttaa aagatcattc tgaacacttg ttttgtttat
     5401 gctgagagaa ggcctactcc aaaaaatgca accatcttcg tatctgcatg tggatacaac
     5461 catgaatggc caaagttatt gcagtgttga atagacactt atatagcact gtgtggcaag
     5521 tactgtttga aatgtttttc acgtgtttat tcattgtatt tattttgaga tagggtcttg
     5581 ctctgttgca cagggtggag tgcagctgca cagacaaggc tcactgcaac ctcagcctcc
     5641 tgcgctcacg tgatcctcat acatcagcct cttaaggagc tggggccaca ggcacgcacc
     5701 actgctcctg gctaaatttt tacaattttt tgtaaagaca aggtctcact atcttgctca
     5761 gactggtctt gaactcctgg gctcaagtga tcctcccaca tcagcctccc aaagtgctgg
     5821 gattacaggc ataagccact gtgcaggtca tcaaatgcta attgaatttt cacaacaaaa
     5881 ccatttattg tccctagttt acaagattaa gtaaatgaga agctaatttt tctctggcta
     5941 tataccttgc aagaggcaga gctaagactt gaacccagcc agagttcttt aactccagca
     6001 ctaacatttc agctgctgca accagggagc ttttcaagga tgatcaccac attctctaca
     6061 ttcatctgct ataatcctta tcagaatcta cagcctgtat catattttcc ttgttgctgt
     6121 gagtgggtca gccaaattct ctttaacttg aaaccttggt tgcgtaggga ttgcaacatc
     6181 ctggaaagaa tagaataaaa tttactcaac tcaatttttt acttggttca taatgaaaac
     6241 tatactattg cttcagtcag atgtttgcga atagctgtgt gatctcaaaa tgttttccta
     6301 tgtgatctat agaaaatgga atgatagagt attaggctgt aagggcctaa gaaacaaagg
     6361 aaaaagagaa gtgaactgtt agtttagttg taaaacctaa ctttggtgaa ttgtaaaaat
     6421 ttgttataat acaatatgat tcttgcttgt cctgtccttg atgaagttgt ggaccttttg
     6481 aaataagcta tttctctgtt actgctgtta ctgtttagaa atcaaattta gttttttctt
     6541 aagatatacg tatttttgga agataaacac agtttcaaag tctgccttgt tggctgggtg
     6601 cactggctca ttgttgtatt cccagcactt tgggaggcca aagcaggagg atcacttgag
     6661 gtcaggagtt caagaccagc ctggcaaata tggtgaaacc ccgtctctac taacaataca
     6721 aaaattagct gggcgtggtg gtgggtgctt gtaatcccag ctactgggat tgggaggctg
     6781 aagtagaaga attgcttgaa cctgggaggc ggaggttgca ctgagtcgag atcgtgccac
     6841 tttactccaa cctgggcgac agagtgagac tccgtcttga aaaaaaatgt ctgccttgta
     6901 aaagtgaaat aggatgagaa agtgcctttc ttattaattg gtgtaatgaa ttagaaataa
     6961 actctttgaa gacacctctt ggtaaaaata gttacattta ctgttgattt atggtatgtt
     7021 ggatatgttt ttaagttttc cgtgtaataa ctcagttcat tctcatgagt gaaataggtg
     7081 cttttattgt ctttatagat gggaaactga ggtataggca ggctaggtac attattatgg
     7141 agttcgtaag tagtggagct gaagtcgcat cccagacagt ttggcttccg tgagtttacc
     7201 aatctcatgg taaagacttt gtcagactat caaagttttg acaaatgaaa tattagcaaa
     7261 aggccaaaag ggattctcta ttttcatttg agtatcttca cctgaaaata gttgcctgaa
     7321 taagtagcct gcatagaaag gtacatttta gaaatacttg aggccagaga atgaaaagct
     7381 tacataaaat tgatttccgg tggggccttc agttactctc cattctacga agaccacaaa
     7441 tagcattcag gcaaagagca tttattccaa caatggagga gcactggatt tggttcctaa
     7501 aacaaaataa agtttgaaat cctgtctttc ccatgttgaa aacaaagttg gtacaaaacc
     7561 ctttagcttt tgcaaacctc ctttaagacc cgatttaaat gcctccctcc tcatgaagct
     7621 cttctggatc cactctttcc catcactaag ttgaaagtaa gatccccttc tctttacttc
     7681 cattagactt ggattacagc actctttgta tcatgtattt aattctgttt tttaattaca
     7741 gttaacattt atttgtcttc ctcttgagtg tatgcttctc tagaggaagg tctttgattc
     7801 attctcccct ggccttaatt catcccactt aatatggaaa aaatttaata aatgctgact
     7861 tgaataagtc caacaaggag aatgggaagc tcatgtttgc ttcctgtctt ctaaaagact
     7921 acttaagata acagggtaat cacagaaaag cattagaaat agagttatat gagaaacaac
     7981 tgtagttaag gctaggttta tgttagactg agaaatttta gtgcatactt aagttattta
     8041 ggccaggtta ctttttgtag aacaaacatt tcagtttcgc tcagtttcat ttccgtttct
     8101 gaggcagctg tgatttaaga aaatgctcta gtctgtggca ttccatattc aagtactttg
     8161 agttgtatat taatttattt ttgttaataa gagtgagatg actcactaag taatttagag
     8221 atttaaacac ttttttaaaa aacagtaact tcatatgcat tggatctatt cttctataaa
     8281 gtcttttctt ggggggtgtt tgtttaaaat tccccggtgt tttctctgcc aaatccaact
     8341 tccaagaagc atttggaagt caaaacattt tatctggtta gtcttaaagt ccagatattt
     8401 tgtgatagct ggtatttagt ttatgatatt tcccaggaag aactttttag tagttgaacc
     8461 atttatgaaa gacttccttg aagctacctt agagagttga tttagttctt cctaaataag
     8521 taaatagaat attagtatta ggacatcttg gagtatagat gcaaatattg gtgaaaaaga
     8581 acatggatat cagagtcaaa ttaatgtaga ttggaattct ggttattact aatggatatc
     8641 tgacattagg caagttgctg atcactcttt gcctcagttt catcatctgt aaaataggta
     8701 tttgtgtttg tgtataatgt gaaccgtata atataatgct tggcctatag tgaaatttat
     8761 tcatacgagt gttttcagtg attttaaaag cttgctttag gccgggcgcg atggcttctg
     8821 cctataattc cagcactttg ggaggccaag gtgggcggat catgaggtca ggagttcgag
     8881 accagcctga ccagcatggt gaaaccccgt ctctactaaa aatacaaaaa ttagctgggc
     8941 gtggtggtgc aagcctgtaa tcccagctac tcaggaggct taggcagaag aatcgcttga
     9001 acccaggagg cagagattgc agtgagccga gatggtgcca ctgcacagag cgagactcca
     9061 tctcaaaaaa acaaaacaaa acaaaacaaa aatcttgctt tatagtttac ttccacatca
     9121 aattgtcttt atcccatgtt acttgcattg atatcccaga catgaaaaga aaaaaagatg
     9181 ataacaatga cagttattaa attaggttcc actcttattc tagatcacca attcatatta
     9241 ctattcagac ttggaacatt aaattttagt taaacttttt ttcaaatatg catataattg
     9301 tcagtggtta ctatattttg gggaagagat tgttgacttc tttgaagaaa gatacggatt
     9361 ttctcttcag aagaaataca catgggctca tataatccaa attttatgtg taattacagg
     9421 gtgttcatga atgcccacaa atccattaag ccatgtgacc ttggacaagt cattttactt
     9481 ttctgttttt taggttgttg gtctgtaaaa tgatactact tgacttttaa agagcccttc
     9541 aagctcttat gtcctctaac cccaggtctg tattcagaag aaggggtggt cctttaatta
     9601 gagccatcta gagatctgag gaacatgctg ggcattagtg taacatacca tgtggatttt
     9661 gagaggtaaa gaaaaaataa ccagggaatg cctcagagca ttcctgatca gatcgatgac
     9721 agaagaaagg aatgagaggg aggagaggaa gctgttgaaa tttcctattt acctgctttg
     9781 agtgaatgaa gatttgaatc atagaaccag aaggggttct catctgaaat gcaaaggaag
     9841 gaggagttgg tttaattcaa taagtttcag ttgagtaaac atgatttagt gagatactgt
     9901 tcttgcttct gactcaccat ttggaaaatc tctctaaaat aaaattggac tctccatctc
     9961 ggacatcatt ttgggtgtag gttttgcttt tttttttgag atggagtctc gttatgttgc
    10021 ccaggctgga gtgcagtggc gcaatctcgg ctcactgcaa cttccgcctc ccagattcaa
    10081 gcagttctcc tgcctcaacc tcctgagtag ctgggactac aggcgtatgc caccatgtcc
    10141 ggctaatttt tgtatttttt taatagagac ggggttttac tatggtggct aggctggtct
    10201 ttaactcctg accttgtgat ctgcccacct tggcttccca gagtgctggg attacagatg
    10261 tgagccacag tgcccggcct aagttttact tcttataatg gactcctgtt aagccaatag
    10321 gtgatgaaag gaaaccataa ccaactcttc aggctcattc atccttcaag aatagcatgc
    10381 tagtacccat cctaggagga gaattggact atacctcatg aggatagttt gaagtatctc
    10441 agaagaccct cactgggggt aggtcggtaa gacacaaagc tttctaaagc actgtaccaa
    10501 atttgttgtt tgagagatca taacaaatta gaagtggaaa gaagaaggag taaaaggaag
    10561 aagaggtttc tggccaggag cagggagggg gaaggagctg ctaggaagat gtttggttgt
    10621 catatccctg ttcacccttg ctttgcaaaa ttcttgtagg atgccaggtt gggagtaatt
    10681 gtttttcaca agagtcaaac cacgcttgtt ttcttgaaga agcaagtctt tggagggtgg
    10741 tggcttgaaa tctgttggat ctggtattta ggtgatacac tttgacataa aggcaacact
    10801 gatgcaagca gcagctttcc ttggaaaggc agggagaaag tgaaggccca gactgatgag
    10861 cttacactga cctgcagacc cttctccatt cccaggcatg tttggtgcag agtttacctt
    10921 agtggggtta ggctgtgtct ggtactgtga gagagaagga agaagaagat atgatattaa
    10981 caacaacaat acatatttat atttgaaaat taaatgcact aatacaccta tagtgctatt
    11041 aaacaaattt atttgttcag caaatgtttg ttaaacacca tgtactgtgg aaagtactag
    11101 gtgctggaca gaggtcaaaa gacttttaag aatctgccac cattaatgat ctctttctgc
    11161 ttggcattca aggctctttg aaataagact gtgacccact ttgatagttt tgtcctggat
    11221 tataagacac atgctcgaag gaactatagc tggttttctc accagactga ttaacatata
    11281 gtatggtttg gtacctgtta aatgagtctc tctctacagg ttttcatctt ctactttaag
    11341 aaccccttcc tggcattgtt tggctcctca ttgttctgga atctcatgtc catctcatgt
    11401 atttgccttg gttcacactt gatcttctgc tcatattcct cacctactga aattttaccc
    11461 atcaaccaga ccgtgggtga tggaacacag catgggctag ttcttcttat atatgatctc
    11521 atttaatttt cactggaact ctgagatagg tagcatttca gcccactcaa gttgactcaa
    11581 atttttgtaa tcatcaatat attttaaaat aactttatat ttgacctgta attggaaaac
    11641 caatatgagt tatcataaat gaaaggtaat tttaaaaata attaggatga agacaaatta
    11701 tttttctcac agtctatgta taagataaac tattggttcc caaaggcccc agctgacaat
    11761 gagacttctc ttactttgtt gaaaagggaa ttagcaagca ttaaagaggt gtcaaaaaga
    11821 agactaaaca aaagcttact cctttttttt tttgagacga gtcttgctct gttgcccagg
    11881 ctggagtgca gtggcaccac ctcggctcac tgcaacctct tcctcctggg ttcaagcgat
    11941 tctcctgcct cagcctccct agtagctggg attacaggtg catgccacca cacccggcta
    12001 atttttgtat ttttagcaga gatggggttc gtcatgttgg ccaggctggt ctccaactcc
    12061 tgacctcagg tgatccaccc acctcggcct cccaaagtgc tagaatcaca ggtgtgagcc
    12121 gccgcacccg gccgcttact ccttaatgta ctaagaatgt tatatatagg ctgaagaagt
    12181 gctgaaaaga accatatttt ctcatgatgt ggttcaatgt ttaatactgt gcttgttcat
    12241 ctcctaaaat cctctgaata tcacttaaat tcatcctgtg taactctccc acatttgggg
    12301 aaatactgag cttgcctatt attatattag ccccatattt cagatgatgc acctgagccg
    12361 aggagaagtt aaataacttg ttcatggttc catatttggc taatggcaga gccagggatt
    12421 caaactcttg tctctctgac tcccaggttt gtgcttttcc cacttggctg aatttctcat
    12481 gctacctcct ccataacacc tttcctagaa cttttaagga atgcttcccc tgttctctca
    12541 tagcatttta atgtaagttg ccaaagtgtc ccagtttgca cccctaccag caatgtggga
    12601 gaaaatagca acatattttt gatgttgggg tctagtgtta cggtttcttc tgctctttgg
    12661 gatgtatatt ccatggctac tgaataccca agtcccaaac agttttctta gactcagaag
    12721 gtcatccact tcagcttctt catcaaaaag acatattttc tggctgggca cggtggctca
    12781 cacctgtaat cacagcactt tgggaggtgg aggcaggaga attgcttgaa cccaggaggc
    12841 tgaagttgca gtgagcccag atcgtgccac tgccctccag cctgggtgac agagcgaaac
    12901 tctgtgtcaa gaaacaaaaa aaaggacttg ttttctgttc cattacccac agtggtagaa
    12961 tggcgtgcta aatttattct ccagctgcca ttaactgcaa attaaaatct tagtctcttg
    13021 cctctttaat ccaggcttct tcatactata ccagaattta ggataactat tacagtgccc
    13081 tttataggag agaaagaaga aattgtgtct gtagatgtct gttcctttca gcttaaaatg
    13141 gacactgaaa tgttaaatat tggactggcc tcatttattt ctcctgtctg ttggtccaat
    13201 ttgaatctta aggcgtcttt caactggaat tttttgtttc tctcaactaa aaattgttct
    13261 ttgtaagttt gaatcagaac aaaatcctga atgttgaggg tttcctaaag gctgtttctt
    13321 tatgcaaaag cctgaaaccc gatgttgatg ttggctgctt aaaattaact gtgaatcaag
    13381 gcagggtttt tatttttatt ttttttttac tttaatgatt gtgttaatta tagtgaaaac
    13441 cttgagttca cgagaaagaa agcctttggt caagtattgt ttattaagtt gtcagtcttg
    13501 ttgcaggatt tgcaaattta gtggaattag tgccattttt cagtttacaa ttccagtcac
    13561 atttcacatg atcagagcat ggctttcttc tctgtggagc aaatagaggg ctgtctgaca
    13621 cttggttcca gtggcttcca ttaagcagag tggatatgtc cctggagtct gcagagaagg
    13681 gcatggcact ctgaccccag atggcactcc gttttgggac attgtccaat tctagttcat
    13741 agcatatgtg accaacacca gctctcacct gatgtaaaca cttagcgcgt tgttgcttgg
    13801 gggattggat tgtgtgaatt tttcaaaact acagttgaca gaaggaggct accaaaaatg
    13861 aaacccaata attccatttt tgggaattat tcccactttt gttccatttt tcccactttg
    13921 ttctttggca cacagaatgt ttgatttgtg aaaatcttaa taacagtagt tttttctata
    13981 aggaacactc agaatcttga taatattgga ataatacaga tccttttgta ggatcctctc
    14041 agacctcata taatagagtt catgtagtca atatttaaag aaaaacaccc ttaagttttt
    14101 gtttttcaga atcacaagta agtggattta aacttgtgat cttattcccc tttcttctct
    14161 taatttagtg aggcagccag cgagagggtt tgttttggtt attctaaaga aggagtttgc
    14221 ttgtaagttt tggagggcaa gacttagact ctgtgtctct gtgcttgccc tggaactttg
    14281 attaaattgt cactaaccga gttagctggc cctcgccggg ctgcagaaat agaagtgtct
    14341 tgcacacatg acatatgact gtctcaagag ctggctggtg aaaggacgtt ctggagaagg
    14401 ctgccgatac tgtatgaact agaactggac aagagcctgg agattggata actcagtttg
    14461 gcgcaagtaa agggaataaa agtgttaagg tggcaaaatt gtatccaggt gtttataggc
    14521 tccctgagtt cctgacttga gcctatctat gggtttagag ttcaaggctc tttaccagtg
    14581 ctgacaatct tatactctag gttgaacctc cggggaaggt gcccttgctt gatggcatgt
    14641 ttaccagggg ttctagagcc tcaatcacag attctctcta gctcacatga agttaatgaa
    14701 aatgaatgtg cttccctaca aattagagag gctttgagga aaaatcagat taaatgcact
    14761 cctgcttgaa cttatgtttc ttagaacaca gctggaaatt ttgtcacaca aacctttact
    14821 ttcagtgaca tttcttgact ggtttgttac tgtagtgaat ctgctttaac tatcttttct
    14881 tatcgctgag gttttacttc cattctacat gtgattgtgg agcgctgcgt cattgtgggt
    14941 tcagtgtagt ggagagtagg aagatggtga gacacagtag cttgttgcac attgcttaat
    15001 ttatcaggga tcactgatga gttagtacac tagagaagat tgtaggtaga gctgaaaaga
    15061 tggaggaatt ataaggctca gatttctctc tttttttttt tttttaagat ggagtttcac
    15121 tcttgttgtc caggctggaa ttcaatggca tgatctcggc tcactgcaac ctctgcctcc
    15181 cgggttcaag tgagacttga tggtctcact tgatggtttc ctggctcagc ctcctgagta
    15241 gctgagatta caggcaccca ctaccatgcc cagctacttt tttgtatttt tagtggagat
    15301 ggggttttat catgttgacc aggctggtct cgaacttctg acctcaggtg atcacctgcc
    15361 tcagcctccc aaagtgcagg gattacaggc atgagccact gcgtctggcc aaggctcaga
    15421 tttctaatag agatttctaa tggacataga ggctggagga aatgggatgg acaggaaaac
    15481 tgagtcaggt gccaaaaact tgtagggggc cgggtgcggt ggctcacgcc tgtaatccca
    15541 gcactttggg aggctgaggc gggtggatca cgaggtcagg agatcgagac catcctggct
    15601 aacacggcga aaccccgtct ctactaaaaa tacaaaaaat tagccgggcg tgttggcagg
    15661 cggctgtcgg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaagccggg
    15721 aggcggagct tgcagtgagc cgagatcgcg ccactgcact ccagcctggg cgacagagcc
    15781 agactccatc taaataaata aatagataaa taaataaaat aaaataaaaa cttgtaggga
    15841 ggtggcagtg tgctatggag gataggtgca acctctgtga gaatgtagag aaaatagtat
    15901 aagtgagtgg tgaggacccc caagaggggg ttttatagta aaacaatggt cagaagtggc
    15961 aacaggatac cgtataatgc tttcacctct accaatgcac tgggtactgg agagcgctcc
    16021 agtttgctct ggaaaggccc tttctgtgga caaagaatac agaaaagaga ttcctttaat
    16081 aaaccaccca ctctgtgtcc cacccttgat gaatacttca ctgtgaaatt gccagaatta
    16141 atcatggtaa tagctactgt acacttactt tgttccagga actggataaa tgttttacat
    16201 acattatcag ttctattttt tggagaagat acaggggctc agagtcccta gggttccaga
    16261 gctggtgagt agcagagtca ggattcaaac ccagctttct ctgactctaa aacctccttt
    16321 cttcctgctg aaacaattaa ctcaagacaa caaaggagtt aaggatttgg ggagttttct
    16381 gcatggtaga atagacccaa aggaaaagaa agaaagacag tgactaagat ttgggtttgt
    16441 ctgccccacc aaatgctttg agcactttca taaatataaa tccttcaggt tgggagaagg
    16501 ttgaacatct gaagacactg attcttcaga gatgtaatcc aaacaaagtg atctttggtg
    16561 atatggtcac taaaccattt atcccaaaat tctcttggaa aaccgtccta taacagcagg
    16621 gaacattatc cagccaagtt tttctgcaaa taaagggttg ctgatagagg cttgcctgct
    16681 tgtgtttctg tagctcaggg tgtttatgaa ttcactaatc ccttcccttc agatcccttt
    16741 tattctggtg ttatgattgt gactgaaaaa aattgatttt ttttctatga catagaatgt
    16801 tgaaaggttg atttcttttc tagaggaaag attctttttt tctatgtgct acataccccc
    16861 cgaccaggga aaaggcaaat agtggtattg tttgctgaag tcttcctttg aaggttgctt
    16921 ggtgtttgct tagtggaaat cagcagggga agagaggcta tctctaacat tttgttagag
    16981 tttcttcgta gttctatagt gatgaaacaa ggacttgggg tacaggacag atctgctttc
    17041 agaaatcctg gctcttgtga ggtttagaag ccctgagacc atttagctgg tggcaacggg
    17101 aatgttgagg gtgataaata ggatctttgg ttgtccaagt atcagtgaca tgatatagat
    17161 ggagttaaac ctttaggatc tccttattta tttgtttgtt tatttttgag acagggtctt
    17221 gcactgtagc ccaggttgga gtatggtggc atgatcataa ctcactgcag cctcaaactc
    17281 ctgggctcaa gcgatcctgc tgcctcagcc tcccaaggtg ttaggattac aggcatgagc
    17341 cgccacaccc ggtcaggatc tcctgtaaaa ttatattgtt gacaacatga agaattatgc
    17401 ttctcaaaag ctagttatag atttgtacaa tattcataga tttcttgttt cagtttttac
    17461 aaattcatag cccttatttt gaaaattagc tattagcaat aattttgtct aggaaattgg
    17521 atgtgtattc aagtgaaaga aggaagtaca gttacctatt atcttattgt aactaacaat
    17581 caagtaagtg tgatgcattt ggtactttaa aaactgcacc caagttacag attattggaa
    17641 ttaataaaat tcactggatc tatatatttt taaacggaca gtgtgatagc agaacctctt
    17701 atagaatgat agaattcctc tggaatgatt ggataacttc atttcatcct tgacttttac
    17761 cttggaggat ttcttacccc ttttggcttc tcaaatttga ctattaaaat gttgccttta
    17821 aaaataggaa cacagtttca ggggggagta ccagcccatg acccttctgc aaggccccct
    17881 aactcaaggt agtttccctg gaactgtggt ttatggaatg tttcaggagt gtgaggaggt
    17941 ataatttaag gctgtcctag caaggatacc cttaaggata gagggcccag tagcatctgg
    18001 aggccagaaa agttaaactg aggcagtcag attagcttca ggctcaatta agctgatggg
    18061 tcagcctggg agaaattgca ggatgactct caatatcccc tcccaccccc acagcagcca
    18121 cgatctgtct gtctttaatc atgggtgcag tgaacctgtt ctttccaggt gtcttggcct
    18181 tcagtaacct tgttaggctt gtccctgaac gtggctaccg atccaaagac acatgatcag
    18241 agaggcaatt agagaacaga ccttttccaa agcaagcatg ttctgttggg cttagaagtt
    18301 tcatgtccta atattatagg accctgtgca tctctctgga gatgaggcac atgagtcata
    18361 tctgtgattc ttgcttttgt gtcaacatct catgaatagg caatcagagc tttggcacca
    18421 atgtattttc agttcatatc tgatgtagtt aaatccacct cctgctttgt agtttactgg
    18481 caagctgttt ttgatataag acatctagaa cactgtaaat atataacatt tttatttgtc
    18541 tattatacct caattacgaa aaagacatct agaagcaacc tcatcaagag agatactgag
    18601 gccgggcatg gtagctcaca cttgcaatcc cattactttg ggaggctgag gcaggtagat
    18661 cacttgaggt caagagtttg aaaccagcct ggccaacatg ttgaaaccct gtctctatta
    18721 aaaatacaaa aaagttagct gggcttggtg gtgggcacct gtaatcccag ctactccgga
    18781 ggctgaggca ggagaatcac ttgaacctgg gaggcagagg ttgcagtgag ctgagatcac
    18841 accactgcac tccaacctgg gcaccagagt gagattacat ctaaaaaata aaataaagta
    18901 ataaaaaaga gagatattga tagctgttgt tggaaatttc aacttccatc tcacttctgg
    18961 taactttttg gaagtttgtt gaacaaagtg gaatacacgc acatacacac acacacatac
    19021 tctcttgttt gtttaaggtt taatgaaata gctgtcatat aatcactgtt tttgaaagag
    19081 gagaattagt tgctatctgt acattttggg tatgtgaact atttggatag aactctgaga
    19141 aatgcattca gaacaacaaa caaaatcata ggagaaatag ctaagtggga aggggcatat
    19201 aagagttgtt gaaaaagtta tttcttgaga aaccagctct aatgctaggc aagtcacttg
    19261 ctttggggga ggcctcagct tctctgtcta taagattgca gcaggggtgt agtgggaatg
    19321 agtcttcaac attccaagag attttatcta ctaatacgac agtcaaatgg agcatgactt
    19381 tgtggaagcc tctcctcttc cacccagagg ggccaatttc tctgtcccag tgagatgttg
    19441 acacttgtat gatccctgct tggagacttc cctcttctgg aacctgccct ggctcaggca
    19501 tgagggctga ctgtcaccct ttgataggag cccagcacta aagctcatgt gttggcagtg
    19561 ttcttgcggg aaggaaaaag accagccagc ccatttgtta ctgcacaagc aaacagcttc
    19621 tggtagctgt acagatacat gcactttctt tcctcactgt gtttccatag acagatttag
    19681 tgctgtagaa gagtagaggg cagtcacggg aaggagttcc tgtttttctt ttggctatgc
    19741 caaatgggga aaaatcctcc tatcttgtct ttttagtgtc atcctctctc cccttttctt
    19801 cttctttata attctcatct ctcatctctc ctggaaatgt gcatgtcaag ttcaaaaggg
    19861 cacaatgttt tggtgaggaa gaggtgggag aacacgtgcc aggtgctaac tagggtcatc
    19921 atttccccct tcacagccag cttcctgtga atgtgtgtgt gtgtgtgtgt gtgtgtgtgt
    19981 gtgtgtgtgt gtgtatttct tttgccagca tcactgaatc tgtctgctgt ctggtattcc
    20041 aggttttggt ttagggaaaa gtaaaagtaa ttttataatc ccagctgtca tttaagccac
    20101 ccctttgtgg gtagcatatg gtccactctc tcagttcatt gtcctaaaga tgcttcatca
    20161 gaaaggaata acttccaccc cgttactctc tgtcccctta ctctgcttta tttttcttcg
    20221 tcaatcctac caccaccacc cactgtttga acaacccact attatttgtc tgtttcccat
    20281 ccctggtaga ataggagccc catgaatgaa ggaactttgc ttctgttgtt caccactgaa
    20341 tctctaaggt atggaacaca cctggcatgt gataggcact cgataaatat ttgttgtggc
    20401 tcatgggcac cttgcagagt taaggctgca gttgtttgtg gaatttataa gtggtaatga
    20461 atatttatct actattcctc ttccaaggcg atcacacaat aatcaggctt tacactatcc
    20521 agttcttagg tcttccaagt tatgacttgt gaggtatgtt aattatgata atagaaggca
    20581 gtttatttgg ttcagattta ttgatgtgta atttaccaca gtaagacttc ccctttacaa
    20641 aagtatgatg agttttgaca aatggataca catgtgtatc taccactgcc atgctccttt
    20701 tcagtctgtc gtcccctcca cccatgacca ctggtcacca ctgcagtgat ttctgtcccc
    20761 ttcatttcac cttttccaga atgtcatata aatggaatca tgcagtatgt agttttttgt
    20821 gtctggctta tttttcttag cattaggctt ttgggattca tccaggttgt cgcatgtaac
    20881 agtagcttat tcctttttat ggctgagtaa gtgtcccagt tttatttata tatttattta
    20941 tgaggaggtg tctcactctg tcacccaggc tggagtgcgg tagcgcgatc tcagctcact
    21001 gcaacctccg cctcccaggt tcaagcaatt ctcctgcctc ctgagtagct gggattacag
    21061 gcacccaccg ccacgcccaa ctaattttta tatttttagt agagatgggg tttcaccatg
    21121 ttggccaggc tgatctcaaa ctcttgacct caggtgatcc gcccacctct ggctcccaaa
    21181 gtgctaggat tacaggcatg agccactgtg cccagcccca gttttattta ttcaccagtt
    21241 gatggtcttt tcgacaacta attgtttcca gtttttggct attctgtata aggcttctat
    21301 aaatattcac aaatacctag gatgggatga ctgggtcata taatagtact gtataacctt
    21361 agcagaaact gtcaaactat tttccaaagt ggctcttcca ttttacaatt ccacagtgta
    21421 ttgagtccca gtgtctccat acacatgcta gcacttttaa tatttaattt agtgggtatg
    21481 taatgatatc tcattgtggt tttaatttgc atttctctgc agctaatgat gagtgtttct
    21541 gcttatttgg gaaggtttta atttagcagt ctgttgtatt ctgtagatat taataacttc
    21601 aaaatatcag tggcatttgc agttaaaatt tccttaaaaa attggccaaa ggtttccagc
    21661 agtcacttct gccatgccca aactgtatga aacaaggctg aggtgtggag attgtcacat
    21721 tttggcaagg agtgatccac ttgggtgact gatgagaccc agagagcgta cgcctcgggc
    21781 ttgagggtga ggacgggcgg gaagtcgact gcatggccct gctggccttg ggaggctgcc
    21841 cagtccttag ctaaagctgg cagttatggg aaacagactt agattctatt acgtttttca
    21901 ggatgtccca ggagtcacct gggaagctca gcagtccttt gtgactttca agcatatggt
    21961 agaagctgct gaacacagag ctccctcttt ggggataatt tgcccaaatc atttaatcag
    22021 gcttgagaaa tgagttacca caggtccagg agtgctgcca cccttgaatt ctgacaccct
    22081 atttctccta tccgtctctt aattaattaa gcagacatcc ccaagtgctt acgacaagcc
    22141 aggacccttt tgcatactaa ggaaaacagg gatgaaggaa acagaaatgg tctctgctct
    22201 gactcagaag gtagaaatcc tctttcccag ccaagtcttc ctagggagca cgtaggaagg
    22261 gctctgaacc cacgtgtcag ttgcagggga ggatatcagg aaaggacatt gaagaagtgg
    22321 agacctaagt ttgagaccta ggcattagcc aggctagcag tgcttgaaaa agtgtcttag
    22381 gacaagagaa ctcaccagtg aagtcccagt ggtaggagag cgtgcagcat attctgagcc
    22441 tgtatacaca tctccagggc attgcttagc aggtggggag tggcaagaga gtaggctgga
    22501 gtcacagaag ggaggccagg tagaccttgg tgagcactgg actctatgtt caggtgctga
    22561 ggagctggca aaaggtttta agtcggggag aggcatgttc agatatttgg tctagctgag
    22621 taactttggg tgctctgtga caaatggttg ggagaccagt gaggtggcag ttgcggtcat
    22681 ctaggagcag gatcagagtg gcctattgac tgggatgact gtgaagtggg atcctttcca
    22741 gccagtaact ggaaatgtgt atgagggcag aagtgagtgt actgcatttg aaacattgag
    22801 aaatctagta catagtactg tctcttttat atcttttttt tttttttttt tgattttggt
    22861 ttgtttgttc actaacttgg aaaactgatg tggaaatgtc cctttggctt cagttacctg
    22921 agcagaaggg gccgggcatt gccaaactct cctcttagga cagaattgct cccagtattg
    22981 atcattgtgt tctgagatgg gggagcaaat tgtgcaggag gccaggtcag tgccaaggtg
    23041 ggtgggagga attggaggag gaagcttgcc taagtgtgcc cagcaaagcc acggtagaac
    23101 tttctactgt ggctctatgc tacttcttag caaccttctc catgtgcttc ctggagagtc
    23161 cttggagtca gaaccttttt cttgaaaccc agacacttta cttccaagaa aatgctgtcc
    23221 aagaaaactc atccttccct tcttctcatg aacgttgtgt agaggtgtgt cttctcttcc
    23281 tttgagcttt tccactcagg gtttagggga ggtgatattc tatatttggg tttggctctg
    23341 ggtactgcaa cactaggcta ttaagatttc atccttactg ctttgcccct cctatctttc
    23401 cagaaaccca caatggattt gctagaaata atggaacgtc ctgtttggac aggatataac
    23461 catttctcag ctagaggata ttgttggaat gaagaaagat aaatggggag aagggaactc
    23521 acattgcttt ggcacttaaa ttaagccatg tactgtgttg ggaaattatt tatattatct
    23581 cgttgaatcc acagtagaac acagttgaac accatacaag gtaagtattg tcatccttat
    23641 tttaccatga ggaaattgat gcttagagag cataaagcct tggccagggg cacatagttg
    23701 ggaagccggg gctaattcat gcctgggctc tttctgatag ttttcctttt ttaattgtcc
    23761 cctcctcatt gttaccttgg ggatttcaag agattcatgt agcttctaaa tcaacgaact
    23821 gattcctgga gagcagcttc tgtatgagaa aaatctagct aattatttat ttcagtgtct
    23881 ctggaatgca agctctgtcc tgagccactt agaaaacaat ttgggatgac aagcatgtgt
    23941 ctcacaatgc tgctctggtt gccagtgctg tgctgccagt tgtcatcttt gaacaaactg
    24001 atgcagtgct ggtttaactc ttcctctttt tggagtaaga aactttggag gcctgtgtcc
    24061 ttctagaagt ttgctgagca aatggtaagg aaaagaaata ggtcctaagg cttgactatt
    24121 tcagagaatt tcttgattta ttggactgtc aatgaatgaa ttggaataca tagtggtagg
    24181 ctgtcttttc ttctcagaca ctgcaatttc ctccaatctc ttgacttttc tagaagtttt
    24241 aatccaagtc cttgttgggt ggtaagataa aagggtattg ttctactaga gactgacctt
    24301 ggcatggaga tctcatttgg actcacagat ttctagtcta gcgcttggtt ttgtatccat
    24361 acctcgctac tgcattctta gttccttctg ctccttgttc ctcatgccca gtgtcccacc
    24421 ctacccttgc ccctactcct ctagaggcca cagtgattca ctgagccatt tcataagcac
    24481 agctaggaga gttcatggct accaagtgcc agcagggccg aattttcacc tgtgtgtcct
    24541 cccttccatt tttcatcttc tgccccctcc ccagctttaa ctttaatata actacttggg
    24601 actattccag cattaaataa gggtaactgc tggatgggtg gctgggatac acagaatgta
    24661 gtatcccttg ttcacgagaa gaccttcttg ccctagcatg gcaaacagtc ctccaaggag
    24721 gcacctgtga cacccagcgg agtagggggg cggtgtgttc aggtgcaggt ggaacaaggc
    24781 cagaagtgtg catatgtgct gaccgtggga gcttgtttgt cggtttcaca gttgatgccc
    24841 tgagcctgcc atagcagact tgtttctcca tgggatgctg ttttctttcc agagacacag
    24901 cgctagggtt gtcctcatta cctgagagcc aggtgtcggt agcattttct tggtgtttac
    24961 tcacactcat ctaaggcacg ttgtggtttt ccagattagg aaactgcttt attgatggtg
    25021 cttttttttt tttttttgag acagagtctc gctctgtcgc catgctggag tgtagtggca
    25081 caatcttggc tcactgcacc tccgcctgcc gggttcagcg attctcctgc ctcagcctcc
    25141 caagtagctg ggactacagg tgcctgccac catgcccagc taatttttgt atttttagta
    25201 gagacggggt ttcaccgtat tggctaggat ggtctcgatt tcttgacctc gtgatccgcc
    25261 tgcctcggcc tcccaaagtg ctgggattat aggcttgagc caccacgcct ggccgatggt
    25321 gctttttatc atttgaagga ctcagttgta taacccactg aaaattagtc tgtaaggaag
    25381 ttcagggaat agtataagtc actccaggct tgaggcaaaa tttacaaatg ctgctgactt
    25441 tgtatgtaag gggaggcatt ttcttagaga agagaggtag gtctctggga ttccagtatg
    25501 ccatttccat cctcagtgtt tttggccacc tgagagaggt ctattttcag aaatgcattc
    25561 ttcattccca gatgataaca tctatagaac taaaatgatt aggaccataa cacgtagctc
    25621 ctagcctgct gtcggaacac ctcccgagtc cctctttgtg ggtgaaccca gaggctggga
    25681 gctggtgact catgatccat tgagaagcag tcatgatgca gagctgtgtg ttggaggtct
    25741 cagctgagag ggctggatta gcagtcctca ttggtgtatg gctttgcagc aataactgat
    25801 ggctgtttcc cctcctgctt tatctttcag ttaatgacca gccacgggcg tccctgctgt
    25861 gagctctggc cgctgccttc cagggctccc gagccacacg ctgggcgtgc tggctgaggg
    25921 aacatggctt gttggcctca gctgaggttg ctgctgtgga agaacctcac tttcagaaga
    25981 agacaaacag taagcttggg tttttcagca gcggggggtt ctctcatttt ttctttgtgg
    26041 ttttgagttg gggattggag gagggaggga gggaaggaag ctgtgttggt tttcacacag
    26101 ggattgatgg aatctggctc ttatggacac aggactgtgt ggtctggata tggcatgtgg
    26161 cttatcatag agggcagatt tgcagccagg tagaaatagt agctttggtt tgtgctactg
    26221 cccaggcatg agttctgatc cctaggacct ggctccgaat cgcccctgag caccccactt
    26281 tttccttttg ctgcagccct gggagccacc tggctctcca aaagccccta atgggcccct
    26341 gtatttctgg aagctgtggg tgaagtgagt taatggcccc actcttagag atcaatactg
    26401 ggtatcttgg tgtcaatctg gattctttcc ttcaggcctg gaggaatata ataactgaga
    26461 cttgttttat ttctgcagag ggttctaagc cattcacttc ccagatgggc caataatgct
    26521 ttgagtaatc tggagatcat ctttaatgcg caggtgaatg gaactcttcc acagagggat
    26581 gtgagggctg tagagcagag tgaactccct gaaactcaga cgtcagctct ttgtctctct
    26641 atctctgaac acccttcctt agagatccca tctctaggat gcatttctct gtagttagtt
    26701 tctaagtctc ttgttcctgt tctgccttta tttttttttc ctggattcta agccagtatc
    26761 cccacttggc tgtcttaatg tagcttaaca tgtctgtaat caaaatgatc atctttctga
    26821 gattcaaagg gctataaggg actttggaga gaatttcatt cagttttcct caaactagaa
    26881 taatgcttgc actgtctgta aaagaacaaa agtgtcaaag cacccttttg ttcactaaat
    26941 ttcctttttt attatagtgt tacttaaata ttaggaagta aaagtaggta taaacttctt
    27001 tataggctgt tattatacaa ctatatgacc catacatatt tacaaattaa gtgcagccaa
    27061 aattgcaaag tcataccatt caaattaata ccttaaatgt ggtgaggcag ctgttgttca
    27121 actgaaacca aattataagt tgcatggcag taaatgctat catgctgatc attttgagtt
    27181 tggccagtct atattatcat gtgctaatga ttgaattctc cacccatttt tctacttgta
    27241 tgaccttaat ttgatggcac ctgttccatc ctcatgagtt tgctacaatt atactggtgc
    27301 caacgcaatc ataaacacaa atataaactt gggctttgaa atcttgtgcc agaacttggc
    27361 tttaaagtaa gcatttaaaa aatccatatg tgtttattag actttgttta gatgactgtt
    27421 gaaatgaaaa caaagtgttt aaaatcctct tagagaactt aaatataatc cctcagcaat
    27481 atgtatacag atcttccttt gagaaaaact gattgtgttc agcctctcat gttacaaatg
    27541 gggaacctga attctgaggt ctctagtgag agaacaggga ctagaatctg tggatcctat
    27601 ctgttttaat aataattgta aagtataata gataatatta tattaataaa ataaaagcaa
    27661 acacttagaa tgagcttcca tgtgtgaggc actaactgat taggcattat taactagatt
    27721 tattcctttt aaggccccgc gatgtactgt tatttccaca tgttgtagct ggggaacgtg
    27781 ctactcagag aggttaagta acttgtctga ggtccacacc actaacaagg agcacaggta
    27841 gggttcaaat ccagataatc tgactttgga gctggcactc taactcaatg tgcctaatcg
    27901 cttttcagtg gtgtcattat tttgcctatt ctccatctga gaatattgaa gtttctgact
    27961 ccttccttgc ctttctccct gcctcccgtg gttatcccca ggtcttggtg ttccagtcct
    28021 ctatgtccgt ccttactctt attcctttgc tacagtgtga tccagggctc ctgccccttc
    28081 ttatcctggt agagggggcc cacttgctgg gaaattgtct ccgccatggt ttatccatgt
    28141 tgtgtgtcca ttagtgagta gtgggaagaa tcatatcatg ttggcaatga aaggggggct
    28201 atggctctgg ggtagtctag tctgaacctc ttattttacg gatgagaaag ctgaggtaca
    28261 aagcagggaa gggatttctt gaggtcaccc agccagcaac tgagctgcaa ccagaagctg
    28321 agatccccag gactagggcc gagcctcatt ctgtcccatc acagtgactt ttcttccctc
    28381 ctccaaacta tttttatttt ttattttttt gcagctgctt agcagcttga agttagaaga
    28441 aagggcaggg aaaaggtttt ccgtgcttag ccagggaagg aatcctgcaa caggatgtgg
    28501 ggttgggtca ttcaaattgg gccagactcc actggtcttg ttgcttcttg cttggtattg
    28561 cagatgggtt taaaagtgtt aggattagag agataggcag gtttagccaa aggcagtttg
    28621 tagccttgtg gcagagttct ttttaaagaa ggaagtggga tgcaacaccc tgacacaaag
    28681 gggcttaagt tgttatacca ctgcctgcta acctgttttc cttaactctc ttcctgattt
    28741 ctaaaggaag tatattttgc tgaatcagaa agaaaagtga tttatttcaa gttgctgatg
    28801 cttagattgt tagagttgca aagatctggc ttgcatcttg tacaactgac agaactgggg
    28861 ctcagggggg cacaggtgcc cagagttggt cagtcaggaa agtagcacca gaaccagtct
    28921 cctggtggcc ctacagttgc agaccctttt ttgctttgct ctctgtgtat actaaagctt
    28981 ctatgtctct gaatctcaag ttctgactgg tagctacttt ccaatccacc tggcttagat
    29041 ttctagatta tattgtttag acgtcagaac ctcttaaggg ttttggggcc acttgttagc
    29101 tcacatagtg agaaccagcc ctgcccatta ggtaggggaa gaagttagca gtccatgata
    29161 gctgttgcct gcagcatacg gacgttcatt gcgcagttcc tgtctcctga gatcctggag
    29221 tgtatacgct tggcctcaga gcccagcaca gagcctggcc cttgggacat gcttagtaag
    29281 tatttactga atgagtggga aatgtcttaa ggcccattag tttgcaggtc ttgaggaggc
    29341 tcccttgcac taggaagaat agaaagcata cataaagcct gtgtgctgct gccaggaaga
    29401 ctagaaacgc tatgttcagc ctggagctga atggtatacc ccagagcaac cctgttgaaa
    29461 ggcagtgctt gccttttcat tctgtgtcct ggtttgctgg taactcctgg gtcccctgcc
    29521 tctcctgtac ccccattgtg cagactgagg ggggaccatc agccagggtt agttttccgc
    29581 tgtttctgtt aggcaaagaa taaattgaat tgagttgtga aagttgggtg caaagctcag
    29641 tttgggtcca aagtaacagt taacttgtgt gggtggcagg tattcagtac aaacagggct
    29701 ggggacagga aggggaagag aacttcagag ctttcacgat cctcatctgg ttttaggctg
    29761 atccagaggc caaggtcccc atggaacaaa ctggacaaag tgagggtggc cacatggcct
    29821 cttttctttt gcctttatta ttaattttct caaatagatc tgactagtca tgtggctggg
    29881 aaaatagtta attgtgattt tttttttttt aaactgagtc tcactctatt gcccaggctg
    29941 gagtgcagtg gtatgatctc agctcgccgc aacctctgcc tcccgggatc aagcaattgt
    30001 catgcctcag cctcccgggt agctgggatt atgggcacac agcaccacgc ctggctaatt
    30061 tttgtatttt tagtagagac atggttttag catgttggcc aggctggtct tgaactcctg
    30121 acctcaagtg atccacccac cgcagcctcc caatctgctg ggattacagg catgagccac
    30181 tgcacccagc cagagtacca ctatttgggc attctttaat gaaaaagaat gaactatcca
    30241 aaaattaaaa ctcctcattt atgagctttt agagaatttt acagagtaga tggaaactct
    30301 ctgcatcctt tccccacttc tactttcacc tgacacattt cttccctgtc cttactcctg
    30361 ggccggcagc agtggtcatg attccaatcc cagcttggcc accatctgcc tcagtggcct
    30421 aggaaaactc ctttctccag agctttagtt ttctcttcta cggaatgaag aaagttaaaa
    30481 caaatagaca tttattgttt catttggata aatatctatt aagcatctat tacttgtggt
    30541 atggttagct gggtatatag tggtgaagca gctgggcatg agtactgctt tcgtagagct
    30601 tacagttcag tgaggccagc agatgtgaaa catatcatca cacaaataaa aatataacta
    30661 tcaactgtga tgaggattat gaaggaaaaa atccggcaaa ctatggtact ggtgttagat
    30721 actagcaggt gtgggtaggg atttcattta gatttacagg tcgtcacatt aaagctgaga
    30781 gccctgaagt tcaagcaatg gttagccagg caaagatcag aggcctagag atagggaaat
    30841 ccattccagg cagagagact gggggtgcct gtcccctagg tcagggaaca gaagaaagcc
    30901 agtggcactg gtggagtgaa taagactggc gggggatgag ttggtagtag acatgaccag
    30961 atcatttagg gccaattctc ctggggaagg agaatttaat ttaatattta tttatttatt
    31021 tatttattta tttatttatt tatttttcaa gacggagtct agttctgtcg cccaggctgg
    31081 agtgcagtgg acaaactcgg ctcactgcaa cttttgcctc ctggtttcaa gcgattctcc
    31141 tgcctcagcc tcctagtagc tgggattaca gacgcccacc accatgccca gctaattttt
    31201 gtatttttat agagatgcgg tttcaccata ttggccaggc tggtctgaaa ctcctgacct
    31261 tgtgatcctc ctacctcggc ttcccaaagt gctgggatta caggcgtgac ccacagtgcc
    31321 ccctgagaat ttaattttat tttatgtgca agaggattcc ctgaggtagt caggccacat
    31381 tgtctggtga ctcttgggat agagggaact tgaatgacaa aggcccaaga aagcaattgt
    31441 aatcattaca tatacatgga ccattttatg ctgttttctt ctttcattta acattattta
    31501 gtgtgcgtgt tcacatattt ctaaatcatc ttctgattta gaataatgat ttctgatgtg
    31561 taggctgtgt tttatagttt tgaaagtaat actttgatat ccattacttt cttgattctc
    31621 acagcaattc tgaggtgtat gcgttgcaat ttctgtttca cagatgaaga gagtattgtt
    31681 aataagttaa tggccgggca tggtggctca cacctataat tccagcactt tgggagacca
    31741 aggtgggcgg atcacttgag gccaggaatt tgagaccagc ctggtcaatg tggtgacacc
    31801 catctctact aaaaatacaa aaattagcca ggcgtggtag cacttgcctg taatcccagc
    31861 tatttgggag ggtgaggcag gagaatttgc ttgaacctgg caggtggaag ttgcagtgag
    31921 ccaagattgc accactgtac tcctgcctgg gtgacagagc gagactctgt ctcaaaaaat
    31981 aaaaagttgc taagaggagg gctgggatct tttggctcca aatctactgt gggatgatgc
    32041 ctttgacatt cctgatagct gtgcagtaat ccattaacac agtttttata agttcaaacc
    32101 ctgttgccaa catttagatt gttccatgtg tgctgttaca aataaattac tataaagatt
    32161 ctatacattt aatcttttat tatttttgta ttatttctgt aggccaaaat ctgaggaaca
    32221 ggattactag gttgaaggga aatggccctt gaagtgtctg atcagatgtc tttccagagg
    32281 atccaaccaa tttaaatagc caccatcaat gcatgagact ttgtagttca gggaaggcag
    32341 gcctggtttt aaaaatcatt tcccctctct agcatttttc tgatgtgatc cttaagattt
    32401 cactttagtt ttcccaggtc tcattggcat gtatgctgtt agggatgggt ctaaaattaa
    32461 tttttcttca cattcatatc atgtcatccc agtgattatt taataaataa tcacttgatt
    32521 aaatagtgat tccttttcta gttatttttg ggacatttat taaaacctgg atatggtggc
    32581 tcatgcctgt attcccagca ctttgggagg ctgaggtggg gggattgctt gagactagga
    32641 gttcaacacc agcctgggca gcatagcaag actccatctc tataaaaata aggaaattag
    32701 tcaggcatgg tgggtacttg cctggagtcc cagctacttg gaaggctgag gtaggagaat
    32761 tgcttgagtc taggtggtca gactgcagtg agctatgacc atactactgg actccagcct
    32821 gggcaacaga gtgaaactct gtctgaaaaa aaaaaaaaaa aaaaaaaaag atgtgtaggg
    32881 agcaattttg gagttattca tttggtcatt tgatatgtag ttttagtttt ggtgctgata
    32941 gagcccagaa tgtaccctga atttgatgaa cattctgata tatgggggag ctcattgtcc
    33001 cccacttacc tttttgcctc tcagaatatc ttttgatatt tttatctgtt ttttccccat
    33061 tgaatgttat taccttatca agctcaaaaa agtaccctat cgctatttta agttcagttg
    33121 tgttaaatct ataaattagc ttgggaaatt tggatattaa atgaactcat gaagaagcag
    33181 agtttagctc tccttaattc tcatcttcct ttatttatcc tactacagtt ctgtggtttt
    33241 cttttatgta agaagcacat gtttggctaa gttaatgcct aggttttttt gtttatgtgt
    33301 ccattctcac tgtggatagt attctctttt ccccacatta tattaattta actggttttc
    33361 agagactaat agcaatgcta ttatttagga gaatttacct tggttctgat taacttaccc
    33421 atacttgcaa atcatttgca gctttttagt taactttgtg agttctctta gatttatgac
    33481 catgccagaa acagaaagga tattttcatc tcttcctttc tgatgtttat tcttcttgtt
    33541 tccttttttt atcccccatt atattctcaa gaatctctca atactaagaa atagcgactt
    33601 catttttcag cgcagagtgc attattttgg ctaccatgat tcagaagcct cttgcctaag
    33661 gcccaatttt attctgctag ttttctctgt tctttgtaca tggcccttgc gctgccctaa
    33721 ccttgaatta acgtggctaa atctcaagaa tttaagagca ccgtgactgt gtcctcaggc
    33781 tagggaggga aatgggttca cagagtgact ggattgtggt ctatgaactt cggcagccag
    33841 cagcaaaagt caggcatgaa taatcaagtg gacagtgaac atctgtagtg tgggagatgt
    33901 tggcataact atgaatgatg attcaagagt ggtttgatgc atattgaata acatgatgat
    33961 aagtactaga ctctgtgcta agccttctat gtgaaataca tttaattctc ataataactc
    34021 tagagcagtg gttctcaacc ggggccggtt atccccctac cccaccccac cctcaccctt
    34081 ccaccaggga cataacatct ggagatattt ttggttgtca caatcctggg aatgtatgtg
    34141 ctgatattta gaggttgagg tcagggatgc tgctgaactt cgtagaattc ataggagagt
    34201 ctctcacaac acctatctgg ccccaaatgt cagtagggtc actatcaaga aaatctgctc
    34261 tagcagtgcc tgctcatatt atccccatgt tgaaatagca agatgggaag tgcaaagtgg
    34321 tgcttcggta ctcttggagc agctttgact ttggtgagaa acgcctttta aaaacaatgt
    34381 ttcttcccat cttcccaccc catggggagg tgtggggttg ggtgggtagg caccaaagca
    34441 agatttagaa gagttttctg taggaattta taatggtaaa ggatcaactt catttccaag
    34501 ctatttatga gggtttatgt ttaggaaaag tgctaagctt agagaaggag gagaaatctg
    34561 attttattaa tgagtgtagc cataatggca tatcctggca gaagtcaact ttggtttcta
    34621 gagggaggct attatgaaaa gaaatacctg gaacattccc ctgggtttgg aaggtgagtt
    34681 ctaggttcaa tgatgggaag aattttagag gtccaagata aaagggcaaa gattaaattt
    34741 tgtctctcat gagttctctg gctcaggtgg tgtgaacttt gcagacagtc tctttaattc
    34801 actcatacat gctagtctcc cagctcagca agggctttga gagagcaggt gtctgtatgc
    34861 tctggtaagt gaaggcaaag tgcataagga ggttggggtc cataatggcg aagagaagga
    34921 gcccttcagt cagagtggct ttgaatcttg gctctgccat ttgccaatct tggaccattg
    34981 ggcagtgtat taactctttg aatctcagct tcctcttctg taaaatgtgt ataacaagag
    35041 tactaattgg attgtttgat gattaaatga gttaatgtgt ataaagcact cacaaccctg
    35101 gtacatagta agacctttca ttattattat catcatcaat tttttttaac ctcttttcct
    35161 gatctgctta cactcaccag cttcagctgc tccaaatggc ttgtaagatt ttttgtttgc
    35221 cctttgctgt cagttgccat ggggaagatc cattcatttt tttcagtcaa ccaacatatt
    35281 ttgagcatct gctgccctac aggatcctag atatgggggc tgcagagata tccaggaaca
    35341 taagccttga ttaattgggt cagatcagtg ctcagcaggg ctggcaagtg ctaggtttct
    35401 tttaagtggc atatcttaaa aggtatatgt cctaaacata gctttgtgat ggcagcatga
    35461 tgggtacaaa agcacacact taagtgtcag tagatctggg ttcaaacatt ggtgcagttt
    35521 cttatggctc gtaacttgtt caaacctcag tttcttcact tctaaaacgg taatgataca
    35581 acctacctca cagggttatt atgaattaaa tactggagat gagatacaca aaacgtcttg
    35641 agtacacagt agctgcccaa tattggctgt aagtattata aatctacaag ctgtgaatta
    35701 attttacctc tgtggatcct gttgatattt ctagaccatt ccacctagtg gggccatttc
    35761 ctacctgagt cacccgtggt gtcaaataga atgtcatgtg gcctcctgag ttgggtagaa
    35821 ttggctgctc atctcaaccc cgctactgac tatctctgtg atttaccctt cctccagcct
    35881 tagccttgct acatataaaa tcaagacaat aatgtttcct atctcacagg gttgtcctga
    35941 ggattaaatt aagtaattaa tataaaatgt gccttgtaca tattgggccc taaataaaca
    36001 gtagctacta tttatcctta aagtacaaat ggtagtttca gagcttcaag gctgatggct
    36061 atttatctta ctcatactct ttgtttagct tcattttttt cccctaattt cattagtatt
    36121 ttcttttctt tttttttttt tttttttttt tttttttttg aggtgaagtc tcactctgtt
    36181 gcccaggctg gagtgcaatg gagcgatctt ggctcacccc aacctctgtc tcctgggttc
    36241 aaacagttct cctgcctcag cctcccgagt agctgggatt acaggctccc gccaccatgc
    36301 ccagctattt ttttgtattt tcagtagaga tggggtttca cccttttgac caggctggtc
    36361 ttgaactcct gacctcatga tcaacccacc tcagcctccc aaagtgctgg gattacaggt
    36421 gtgagccacc acgcccggcc tcataagtat tttctaaatt tatttacagt catgccattt
    36481 aaaaggaaag ttgtattcct gtctttgtta atatttataa gtgattttat tcagctacaa
    36541 gcttggaatg gcatataatt ttgtattctg cttttttcac ttaatattac atggctaatg
    36601 atttctgtgt ttcataaaca ttattctgat gatggcatga tatattgttg agtacatgta
    36661 ccataattga atcatttccc tattgctatg caattaagtt gtttccaata ttttgcaatt
    36721 ataatgtttc aatgaatgaa taactttatg catatagctt tttgatatct taagttcagt
    36781 ttcctaggat gaatttccag gaatagtaat tgggcaaatg ggataaacat gactcttgaa
    36841 tacgtattgt taacattgct ttcccaaagg gctcaactga tttatatttc cgtgttcatt
    36901 atcttttaaa ccagctcatt tactcaccaa acatttttaa agccattatc atgtggtagg
    36961 cttagtaaga agaaagtgac cctaagggag aagcttatat ataaataggg tccctggtgt
    37021 accaagtgct gatacagaca caaagtacct ggggaaattg agatgaggga gtcctggctc
    37081 agctgggaga aaagttcatt ttcatagagt catggttttg ttctttggca gaaagaaaat
    37141 tgctttcttc cccaccccca cccccagctt tattgaggta taattgacaa ataaaaattg
    37201 tatatcttta agatatgcaa tgtgatatat atgtatatct caacttaaaa aataagctac
    37261 agaataaaaa ggtgtttgct attaaaaaaa aagaaaaggc tgaatgtcat tcccaagctt
    37321 ggaaatttga gtatgttgcc tctttgggat tatttacaga aatattagca agaccagccc
    37381 catctttggt cttgagtact ccactgtcag catgctttct tccagagagg gatccatttg
    37441 cctttatttt tcattctgtt gtgccgtcta tgcaaactat tcttgatagt tttatggtaa
    37501 cagtgttttt ttgttccatg agataattta tacatgctca ttgtggaaaa tttagaaaag
    37561 acaggaaagt attaaaaaca tcactttttt tttttttttt tttttttttt ttttaagaga
    37621 cagagtcttg ctctgtcgcc caggccggag tgcagtggcg tgatctcagc tcacagcaac
    37681 ctccgcttcc caggtttaag tgattctcct gcctcagcct cccaagtagc tgggagtaca
    37741 ggcatgcacc accacgcccg gctaattttg tatttttagt agagatgggg tttcaccatg
    37801 ttggccaggc tggtctcaaa ctcctgacct caggtgatcc gcctgccttg gcctcgcaaa
    37861 gttctgggat tataggcagg agccactgcg ccagccacac ctacgttctt atcatcctag
    37921 tacatccact gtcattatct tgctgtattt ccttctgccc agtctcactc tgatcatgca
    37981 gtggcgtgat catgcagtga tctcggctca ctgcaaccta ggccttctgg gttcgagtga
    38041 ttctcctgcc ttagcctcct gggttcaagt gattctcttg ccttggcctc ccaagtagct
    38101 gggattacag gcatacaccc ccatgcccat ctaatttttg tatttttagt agacacagcg
    38161 tttcactaaa attttgtatt tttagtagag atggggtttc accatgttgg ccaggctggt
    38221 ctccaactcc tgacctcagg tgatccgcct gccttggcct cacaaagtga ttacaggcat
    38281 gagccactgc atccatcgcc aaaaagattt tttaaaagag tttaatgtag aaccatatca
    38341 aaggtctttg gaaataaaaa acagtttttt aaaaatatca gaaataaaac aacaaataaa
    38401 taaataaata aaaaacccaa aacaatctga agcacgagca cctagcagaa aggttcaatt
    38461 atgatctatt catagagtgg aatatcaagt agacattaca ggacatgttt taagattata
    38521 ttttatgtca tgggaaatgc tctcccagta tgatgttaaa tgaaaaaaca gaatacaaaa
    38581 gtatatatgc tgcatagtct caatattgta gagaaaaaat attatttatg tatgcatgaa
    38641 aaaagacaaa agatgttaac agagatccat tgttacttca gtttactagg gattgtctct
    38701 gggaggtagg attaaggtga tttatattta cctttttaaa cttttctgta tttttttatt
    38761 ttcaaatttt ccataaaaat ataaggactt gaagatcaag aaaaaatttc tgctttggct
    38821 cagtgcagtg gctcacgcct gtaatcccag cagtttggga gccctagggg agaggatcac
    38881 ttgaacccaa gagtttgacg ttccagtgag ctatgatctc cggatcgtac cgcctggacg
    38941 atggagcaag accctgtctc aaaaaaaaaa atctttgctt tttttttttg tttgtttttg
    39001 agacggagtc tctctctgtt gccccagctg gagtacagtg gcacaatctc agctcaccgc
    39061 aacctctgcc tcctgggttc aagcgattct cttgcctcag cctcccaagt acctgggatt
    39121 ccatgcaccc accactatgc ccagctactt ttttgtattt tcagtagaga cagggtttca
    39181 ccatgttggc caggctggtc tcgaattcct gacctcagct gatccaccgg ccttggcctc
    39241 ccaaagtgct gggattacag gcatgagcca ctgtgcccag cccaatcttt tgcttttttt
    39301 aaaaaaagaa gacaaaaagg gattttatac cagtattatc ttggctgtgt gactctgaag
    39361 ccacagttgt aagttataat tactctgaaa cacaaggccc tgtgactctt ttgggctctt
    39421 tggtgtttat cttgattaca acgttggaat atagaaatga aaggaatggg agaggtgata
    39481 gacttcaggc agtgtaacta gttgtctgaa cactactggc tcaattatat tgtgtctagt
    39541 gatttccatc ttgtccgtcg gctaatttat cgcctggtaa ctcactgagg cagggttttc
    39601 ctttggagaa acctcattgt tttaaccagt gtatcatgct tgtttagaag ttcaatgatc
    39661 tttttaactc atcggagaag atgatgacca gacctggaca gatggggaag gactttgcac
    39721 tctctcttta cagtcctgag tgcacacagg tcaatatgga actatgtgtg aattttcatt
    39781 gtctttgaga gccctcttct ctgccccata gggagcagct ttgtgtgcaa ttagaggagc
    39841 aagggttgtg tgtatttagc acagcaggtt ggcctggtcc tctcctctca acatagtcac
    39901 cacatacctg gcactatgct aaggctggga atgcagacag atgggtgcct gctttcagag
    39961 tgctcaatgt gctgaggaag ccagcaacag aaacagatga tttcaggagc tccaggaaaa
    40021 tgctacagga ggagtgtgcc tgggttactg gagtagcaca ggaggagggc ttctagctca
    40081 ggctgagatt ttagtaaagg aaattatgcc acgatgaatc ctgaagaatg aatagaagtg
    40141 aaccagataa agcacgatag gaagcatctt cccttaccta agggaagaca cagaggtata
    40201 tggaatggta tgttaaaagg ttgggactcc aaacagttct gttaaagctt agagagtggt
    40261 gggagagact ggagaagttg attaattagt aaatgaagtt gtctgtggat ttcccagatc
    40321 ccagtggcat tggatatcca tattattttt aaatttacag tgttctatct tatttcccac
    40381 tcagtgtcag ctgctgctgg aagtggcctg gcctctattt atcttcctga tcctgatctc
    40441 tgttcggctg agctacccac cctatgaaca acatgaatgt aagtaactgt ggatgttgcc
    40501 tgagactcac caatggcagg gaaaatccag gcaattaacg tgggctaaat tggacttttc
    40561 caaagatgct gtctttggga aacatcacac atgctttgga tcagaaaacc taggcttcta
    40621 atttgttgat aaggcatgaa ctcaggagac tgttttcagt cctagtgaat ggtgataatt
    40681 gtaattataa cagtagacaa catctctttt acacatttta aatcatgaaa atagaataac
    40741 cttactgata attttagaaa gtggtgatta aaagcacatt taagataatg ccttaacacc
    40801 tagtcttttc catatgcatg atgtcttaat cacacattgc aaatcatgga acacagaatt
    40861 ttaagcagca tttgtgtaga acttctcagt tttactaata ttattttatt ttattctcat
    40921 aacaaccttg aatagaactc agatcatctg tcaatcatgt attttgataa cagcctttac
    40981 agtgagcata gaaaatacag tagtggctaa caacacaggc tccagatgtc aggttatctg
    41041 ggtatgaatt ctggtgtcag cattcactaa gcatatgacc ttggacaagt gatttaagtt
    41101 tcttttaaac agagaatagt aatacctacc tcatattatt attgtcagtg tatcatctta
    41161 caatcacagt ctttctctta gggctgggct cagtgggtgg attgacactg cagaaatggc
    41221 cagatctaaa ggatcaacat ttacgtagct gggaaatgta gctgggactt cagtttcact
    41281 gcgctagtga tttttcctac cactaagcag ctcagtccat acccctacga gacccacaag
    41341 cttatgagat actgttcttc caggaaagca gtggggccag ggccaccttt taattgtgtt
    41401 tcttggcctg gtcccatctt tctcacaata tatagcaaca gttatttact tgctgatttt
    41461 ctaatgcaca tcacacatag tcatattaaa cacacacaca cacacacaca cacacacaca
    41521 cccctcaaga aacattttct gagacgtgat ttcctgattt catcaaaaaa gaaaagagcg
    41581 ggccaggcac agtgggaagt caaggtgggt ggatcacttg aggtcaggag tttgaaacca
    41641 gcctggccaa cacggtggaa cctcgtctct actaaaaata caaaaattag ccaggcgtgg
    41701 tggcgcacac ctgtaatccc agctactggg gaggctgagg caggagaatt gcttcaacct
    41761 gcgaggctga ggttgcagtg agccgagatt gcgccattgc actccagcct gggcaacaga
    41821 gtgagactct gtctcaaaaa aaaaaaaaaa aaaaaaagca taaactgaaa tttatatgca
    41881 atttatatgc ctgtgagata attctgtttt ctcttttgga accccaaaga gatttttttg
    41941 attgatgagc aaatacattt tagattttat ttaagcatta tgccaagcac cactgaagta
    42001 taagtttcaa gggcaaactc agttttttca tctactagac gaatgatttt ctggaatgat
    42061 tacaagcagg caagatggtg tagtggaaat agcaaatgtc ttcggcatca gacaagttgg
    42121 ggtttgtttg tatcctgcct ctgcccttca ccgaggttgt gatcttgggc agattgttga
    42181 gttttaacct agattcctct gactccagat cataaatttt cagaaaagtt ctgaaattct
    42241 tgtatatact gatggtaaat gagacttttc cttacatcta tgcacttctt tgtttgtttg
    42301 ttttgagatg gtcttgctct gttgcccaga ctggagtgca gtagtgcaat ctccgctcac
    42361 tacaatgtct gcctcccagg ttccagtgag cctcctgcct cagcctccca aatagctgag
    42421 actacaggca tgtgccacca cgtccggcta atttttgtat ttttagtaga gacagggttt
    42481 tgccatgttg accacactgg tctcgaactc ctggcctcag gtgattcgcc cgcctcagcc
    42541 tcccaaagtg ctgggattac aggcatgagc caccatgccc ggccatatcc atgcacttct
    42601 tgcaacctta ccttcttttc tcatcaccct ccagggacct agttggaaga gcagagttaa
    42661 aagttaaggt gaaacttgga gaggtgtctt gtccctagga acaaaggact ggtttgaaat
    42721 tctctgtaaa tcttccccag ttcaaaccag agttatcaag gtcttaaaaa cttccctggg
    42781 tcctgagagc ccattatatt atttacttgt cttcctgtac acccactgcc tagtcctgat
    42841 cctacttttg tttgcaaata ggatggggca caacgtacaa ggaagggcct ttgccacccc
    42901 tgctaaggga taacctgaaa taccttcacc atcactgccc tgtgctgctt ttcacctatg
    42961 ccagtctgtc tacagtgcca gtgtctcctg gcattgaaag gggagaatct tttggtcctt
    43021 tgagtatttg gttgggttac ataaatctcc ctgaatgaag agcagctgac ttaggcaagg
    43081 ggccttgttt ggttttcctt gaactattaa caggaagata gggagattaa ctgtgtaaat
    43141 gttcaatagg ccagagtccc tgcagagggt ggccacagtg atcagatctt atcacatcct
    43201 tgctttgggt gttgcctctc tggttggagt atggatagaa aagaaagaaa gaccctatat
    43261 tgaaatgcaa agtgcagcaa gtcctgactt tggattaact tctcagccca tttgcatgaa
    43321 aataaaaaga tgaataaaac aaggttccca ctttggaggg aggtggtagc tgtgagatgg
    43381 aaggagtgtt cctgctgggc aacagcagag taagtgctgg ggtagattca ctcccacagt
    43441 gcctggaaaa tcctcatagg ctcatttgtt gagtctttgt cctacaccag gcactctgca
    43501 aaaacgcttt gcctgcaagg tctcatgcga tgctcaccac agctctgtga agttaattgt
    43561 acttttatca ccattttaca gatgagaaaa ctgagggtat ggggtcaatg acttggctaa
    43621 agtcactgct tagcaagctg cagggactgg atgtgaattc caattggttt gactccaaag
    43681 cctgtgaagc tacttgttct tcaccaccta gagctgtggt tcttgataac tgtgaactct
    43741 tttggggtca caaatagccc tgagaatatg atagaagcag gagctctggc ctttctgtcc
    43801 atacctgaac aggtccttgg gttaagagcc cctcgtccag ggcctattaa tcttgatcct
    43861 cataagcagc atccatgtat aaaggccgca aaccaaactg tgccagaccg aatcctagga
    43921 ccaagcccaa atatgtccca tcatcctttt ggtaagaagc tcattgtaag aaagaaagag
    43981 gagagcaaga ggatgaccta gtgcatgggg cctcattgtt ttaattagtg acaaaacaac
    44041 aataataaca acaaaacccc cgaagcttca cagatgacat cagaccccaa gcctgtgtgt
    44101 ttttcaggtg cccttgagga gctttgtagc tggcagagga ggtgaaactg acaaatgttt
    44161 ggcagatgga ggagagtacc agaggggttt gagatgagct aaattccaat ctaaccgcag
    44221 gtgtgaggaa gaggcttgga ttgggaccat ggagatgggg gttctactcc cagtcacgcc
    44281 agctgacttt gcgagtgttc tttgtcagtc actttatctt attttattta tttttatttt
    44341 tttgaaatgg agtttcgctc ttgtcgccca ggctggagtg aaatggcgcg atcttggctc
    44401 actgcaacct ccccctcctg agttcaagcg attctcctgc ctcagcctcc agagtacctg
    44461 ggattacagg cgcctgccac caagcccatc gaatttttgt atgcttagta gagacagggt
    44521 ttcgccatgt tggccagggt ggtcttgaac tcctgacctc aggtgatccg cccaccttgg
    44581 cctcccaaag tgctgggatt acaggcgcga gccactgtgc ccagcccact tcatcttacc
    44641 gtagttacct ccttagagta tgaaaaaata ggcttagggc atccccaagt cccctctatg
    44701 tctgagagct gaggctggct gtcaaagagg aactaaggat gccagggact ttctgcttag
    44761 gacccctctc atcacttctc caacgctggt atcatgaacc ccattctaca gatgatgtcc
    44821 actagattaa gaatggcatg tgaggccaag tttccacctg agagtcagtt ttattcagaa
    44881 gagacaggtc tctgggatgt ggggaatggg acggacagac ttggcatgaa gcattgtata
    44941 aatggagcct caaaatcgct tcagggaatt aatgtttctc cctgtgtttt tctactcctc
    45001 gatttcaaca ggccattttc caaataaagc catgccctct gcaggaacac ttccttgggt
    45061 tcaggggatt atctgtaatg ccaacaaccc ctgtttccgt tacccgactc ctggggaggc
    45121 tcccggagtt gttggaaact ttaacaaatc catgtaagta tcagatcagg ttttctttcc
    45181 aaacttgtca gttaatcctt ttccttcctt tcttgtcctc tggagaattt tgaatggctg
    45241 gatttaagtg aagttgtttt tgtaaatgct tgtgtgatag agtctgcaga atgagggaag
    45301 ggagaatttt ggagaatttg gggtatttgg ggtatccatc acctcgagta tttatcattt
    45361 ctgtatgttg tgaacatttc aagtcctgtc tgctagctat tttggaatat actatatgtt
    45421 gttaatgata tcatgcagca gacgtgcatc tgaatgggct ggctctagga gctagagggt
    45481 aggggctggc acaaagatgc atgctggaag ggtccttgcc cataagaagc tgacagccaa
    45541 ggctagggga gttctgtctt ctctgcatca ggtcacctct ctcacctctg tcactgcccc
    45601 atcagactac aatgtctgca ggtctttctc ccctgagtgt gagctccctg agcaaagcag
    45661 gatgctgccc cttccctttg tattcctggc tcctggcttc agtgcctgga cataagtatg
    45721 ggcataataa gtgtccccca aatgagacat tgaggattct tcaaatgcac aggaccgtga
    45781 tgtgagttag gacggagtaa ggacgatggg atgtggctca ggacaatcct gaggaagctg
    45841 cagctgcggc acgcagggcc acactgtcat gttcatggac cctagactgg ctttgtagcc
    45901 tccatgggcc ccttccatac acaaatatta aaaattatat ttcatgactg cattggtata
    45961 aagatgaata taatccagac cagattcatg attattcata catttttagt gtattaactt
    46021 ttaattctgc ttttaaaata aattaaaaca ttctaatatg cccttaagag tatcccaggc
    46081 ccaggccact gagcctactg tggttcatgg ataagttggc cctgggggca tgtgtgtgca
    46141 tgcatgtgtg tgcacatgca tgatgagccg ggccttgaag ggtggtaaga tttgggtgtg
    46201 tagaccaatg gagaaaggca tttggggcag tgatgatggg tgggggaggg aacatggtga
    46261 tgaatggagc tgggtgtggg gagccatggg agtgggttag ggccagcctg tggaggacct
    46321 gggagccagg ctgagttcta tgcacttggc agtcacttct gtaaagcagc agaggcagtt
    46381 ggcctagcta aagcctttcg ccttttcttg caccctttac agtgtggctc gcctgttctc
    46441 agatgctcgg aggcttcttt tatacagcca gaaagacacc agcatgaagg acatgcgcaa
    46501 agttctgaga acattacagc agatcaagaa atccagctca agtaagtaaa aaccttctct
    46561 gcatccgttt ataattggaa attgacctgc accagggaaa gagagtagcc caggtgtctg
    46621 gggcttgttc ccattagatc ttccccaagg ggtttttctc cttggtggct ggcctgtggg
    46681 gcccctctcc aggaggcatt ggtgaagaaa ctaggggagc tggttgccac agacagtgat
    46741 gtactaatct tctctgggaa gacagaagaa aagtccccag ggaagaatac tacagacttg
    46801 gccttaggga cagctagggg tgcagattgc tgccaactgc attttttctg aagttggcca
    46861 tatggttgca gtgaatggat ttatagacag agtatttctg tgcatataag agcaattaca
    46921 gttgtaagtt gatatggata agtgaaagtt aagcacttct ttctaaaaag agaatgcaat
    46981 tcattttccc ctaatcattt caattagtct gatgggcatt tgaacttgtt gtctttaaaa
    47041 agtgaaatct ttacctctga tctggtaagt atccaggcaa tttcttgtgt gccacccagg
    47101 aggtatctgg ggagtgggca ttttctgact gaggcattgg ctgccatagc atcagagcag
    47161 ccttccaggc agtggcctgg caaggggaca gaggctggtg ggagcagctg gctgagtgca
    47221 gccagtaatg gcatgtgcat ggtctgtaga gaatgtagaa gcaataatga agccgataaa
    47281 agctggtctg cattttatta ttatcatgcg ccggtggttc taaacaatgt cagtgataaa
    47341 ttactcctcc ccatcatgga ccaatggctg ccactgctcc agggaagtgc tttttattcc
    47401 gtttggtgtt tagggaggga tggagttggc tggcctttgc tgaaaggcct accagtttgt
    47461 tttctatttg gcaaaagaag aaatgataaa gttctagagt ttaaaccaga ctcagatttg
    47521 agtttttttt tttttttttt tttttttttt tttttttttt ttttttttaa aggctacaaa
    47581 actgtgcttt ccttgggcag taaaagaggc aatgggcaat gggggacctg atgacaaagg
    47641 gaagcaaagc tgtcttaggg gtggcatgga ggaggtgctg cttcacagca gagagaggta
    47701 tggctgtgct tggagtgtcc acttagacaa ctcctggctg tgcagccagg ccatcgagat
    47761 gctgtttcct tgacctgcag gtcctggtct tgcacatgga tgtttcttct ggtgcaggag
    47821 acagaaaggt agcaacaacc cctgatcaaa gcctcagtcc ttccttattt actggagagc
    47881 ccctgctgat tgaccagagg cacagctggg gatatttcct ttacctctgt agcaagagac
    47941 agcgtggtgc agaggaaagt gctagcatac attaccctgt ggctgcatga ctttgtgaat
    48001 aggttagtta gcaccctttc agccacttct tcttacctgt taatgagata aaacatgtaa
    48061 ttgcttaaaa acagtatttg gcacatagga agcacttagt gaatatgaat tatgattttt
    48121 tttggagtgg tgacatctca accaagccat ttaacccctc agccttactt tcctcaacta
    48181 taaaatagca gctaacttga aatgtaaact ataaaaccta atgtagtatc tggcacatag
    48241 tagattccca ataaatgaga gccagtattc tttctaagac agtgatgcat ttctgagcac
    48301 ctggccttgt tcttctgcct tgcaatttat gcagcagttg aaatagactg gctgatgggg
    48361 gtaagttgtc aagcagactt tctgatctta gtggaggaga ctgccttaaa acaacactaa
    48421 tttccttttt cttttctttt cttttctttt aagacagagc ctcgctctgt caccaggctg
    48481 gagtgcagtg gcgcagtctt ggctcactgc agcctctgcc tcctgggttc aagcgattct
    48541 cctgcctcag cctcctgagt agctgggact ataggcatgc tccaccatgc ccaactaatt
    48601 tttgtatttt tagtagagat gagatttcac catgttggcc aggatggtct cgatctccta
    48661 acctcgtgat ctgcccgcct aggccttcca aagtgctggg attacaggca tgagccacca
    48721 tgcctggcct tctttgagaa gctggagaca tgagttaagt ggtgaagaag ccaaatctgt
    48781 atctaaaaac cctacagtag tgtgcagagc tctgaggaga gaaggtccct tagattttga
    48841 gtgtattatt atgtcagtgc ttgttttaca tctctctgtt cacgcagtat gtcccctttt
    48901 ctgccttgca gctgtttctt aaattctttc tttctttgct tgtcttgcag cacaaaacag
    48961 gcttcagtat agggggaaat gcacagaaac actgcctttt cctacaggaa atcagtaact
    49021 ttttactgat tttgttttta tttacttatt ttatttgttt aaatttattt ttagtttttt
    49081 ttttttttag agacagggtc tctttctgtt acccaggcta gagtgcagtg gtgcccttag
    49141 agctcactga gctcactgca gcctcgaatt cctgggctca agtgatcctc ctgccttagc
    49201 ctcccgaagg gctgggatga caggcatgag ccactgcacc tggccaactt tttgctgatt
    49261 gcgaatagca ctcttgtcaa tttcggagag aagctgagac tggcatatgt cagtatggat
    49321 ccccacttag agacctgtgt ttatctgcac tgacacccca tcacagcatg atgagcttgg
    49381 ccctcctgtg ctgtctctcc cagggctggg aggatccttg aagctgatct ggtttggagc
    49441 tttgtcctca ttcacctcct ttaccacaca cccaccttcc cagggcgggg atctaccact
    49501 cactaagtag cccattctgg gtgttgacag ctctaattgt tagaaaatat tcaccaccct
    49561 gttatgcttt ctagagaaca agtctaattc tgttttcctt gaaatagtcg aagacagctc
    49621 tcatgttttt ctttccctgt tttcccaaag tccatgattt tttaggcaaa atggcctcct
    49681 ttcctcttat ggaatgtttt tcctccccat ttctgcctct cctctggttg tgtttcagta
    49741 tgtctgtgtg cttcttgaag tttactggaa attatgaaag tattctggca cagaggagga
    49801 agggattttt gcctcccttg ttctgagtgc tacatttccg ttaatgcagt ctgagattgt
    49861 attaggcatt ttggcattca cgtcaccttg ttgactcata ttccatgtgc actcaaacaa
    49921 aattgtgatt attttaaata ggcagaattg caagttacgt gttctccatt tctttgttgt
    49981 attgttggct ttttgaacta aagggaaaaa tgtctttttt ctgttttaca tgtttagatt
    50041 ccctatgcta tcctatcctc ccaaaaccat tttagattct gattttgcca tgtattatat
    50101 ctgatactcc tttctcgtca tctagagatg tgataaacaa ctctctttgg cctcattcca
    50161 gtcatcgata actgtgggac aaagacttga agcttggatc agtccagtgg agactaacca
    50221 cccctgtaga cccttttttc ctcaactata aaatagcagc taacttgaaa tgtaaactat
    50281 aaaacctaat gtagtatctg gcacatagta gattcccaat aaatgagagc cagtattctt
    50341 ctaagacagt gatgcatttc tgagcacctg gccttgttct tctgccttgc aatttatgca
    50401 gcagttgaaa tagactggct gatgggggta agttgtcaag cagactttct gatcttagtg
    50461 gaggagactg ccttaaaaca ccactaattt cctttttctt ttcttttctt tctttttttt
    50521 ttttaagaca gagcctcgct ctgtcaccag gctggagtgc agtggcgcaa tcttggctca
    50581 ctgcagcctc tgcctcctgg gttcaagtga ttcttgattc tgtagacact accactcagg
    50641 cctatattgt aatcagtgct gggccactgg gctcctgctt ctgtgatcca gttgggaagt
    50701 ttatcttgtt cttcccttca gcttgcatct gctaaattcg ctggactata cacaggtgat
    50761 ttgtagatat ggggatctct actcaaatac tctcatgatt tccttggcta gagcatcatt
    50821 ttatttccac ttattggaag agaccttaga gaccagttag ttcatttata gataaattag
    50881 ttgattctgt cattcaaccc ttatatattg agcgtctcct atatgcgaat cactgttcta
    50941 agtgccgaga cacagaggtg tccaaaacaa atatggcccc tctccatatg gaatttctat
    51001 tctagagaag aatctgaccc agaaggggga agtgactgtc ccaagtctac acaaccacag
    51061 aagggatatt ctgggaataa atcacggcta aacccccctg ctgctccagg cagttctcct
    51121 tctacagtgc tctattgtgc tgttttaata atctttcaac tgggaagaac tcccatttca
    51181 ggaattaagc cgtggacaaa tcttttaatt atccttgaaa tcatcctaat aagaaatcca
    51241 aggaggaagt cttacagggt gcctcaccca cttttctcat cactggaact ttttagacat
    51301 tttattattt tcttcctaaa ccagagtaca ggcacacaag ttgagtggtg tggtggctaa
    51361 attaattaat gtttgcaagg cagtgtgaga agcattcatt catcttaaat acctatggtg
    51421 actgcaactc agatgtaaaa attggataaa tcctcagaaa ccctagggaa agtgacatgt
    51481 ctgtattttg tctctgtgag atacagactg gcagagataa gtgtttctct gggtgagttt
    51541 tgtgtggtat ctgggatgat tttaggcagt acgtggatga gaacttttaa ttttaaccca
    51601 catccaattg caatttcatg gaaattattg cttaggagga tgttcaacag gaaaaatata
    51661 attaaagtta attcaaaaga aacattttct gtgaatatgg taaaacttgt gagagtagtt
    51721 tgtaaatgat tgaagattgg aaaacattgg tataagagtg agtgtggggt tttgtattaa
    51781 gattcatttt gggaagaaat ccatgctgca tccctcatga agtgtgaact ttgggcatgt
    51841 gttgattctt tctggcccag agtttacctg aagattagct gccctgaggg tcactgagca
    51901 ttaaattaga tgatgtctgt ggatgactga tagtgaagct catagcccac agttgacaca
    51961 taataaattc gagttgcttt gcttcccctt ctgttcctgg ctgactgttt ggcctttgcc
    52021 acttgttcgg cctctctggg ccttaagttt ctttgccttg aaattggaac tttctttggt
    52081 gaaacaacca gaaaatgctt cagcccagaa acttggtcag tacttggatg ggggatcacc
    52141 tgggactacc caaggatgtg ggctgtctgc tagactatag ccccttgagg gcaaggtggg
    52201 cgccttgctc atggttccat cctaagcccc agcacagtag tgggtgcatg gtgagccatt
    52261 agtgaatctt tgtggaatga aggtgggaga aaataaaata cctgtacttc acagggtatt
    52321 gtgagggtca agtaaaagtg ctttaaaaaa ttgtattata tagtttattc ccttgtgtta
    52381 gcccaggtca acagagccta cgaataataa tgatgacaga agttcttcaa aaagtcttgg
    52441 ccttctttct ttcacaaaat tgccccccag agctttctgg aagggcagcc atgaacccag
    52501 aggcctaaag tagatttact gggaagctaa aaatatttac tttatttttc atagctcctt
    52561 tcaaggtcct ctctgggggt cttagcaata tgtttacaca gtggtatgtt tttgtaaggt
    52621 ttgcaaaagt aagatttttt aaaaatacta tcttgtttta aaaagagagc cccctaccaa
    52681 cttgtgtcag cctcaggccc ccacctgcat ctgctcctgc cagggcatgg tggggcaaga
    52741 agcactgctc cccttccaaa gcttccttcc ttgccctgga gtcatcctca ctccccactc
    52801 caagccacct gccatcgctg tgccccctct ctggtgaatc tggcattctt aggtgggctg
    52861 agaagcagac tggcccaagc taaggccttt ctgatggggt tgttgctgct gagaatcatg
    52921 actgggtggg agaaggaggt gaccctttgc tgtcttattt ttactgtgta tttccttttc
    52981 agctacttaa aatgtattgc ttagtgatac ctaatgggtt cattagcctg cttcctactg
    53041 aacatttccg ctcaggcatc cacttggtcc caaggcctgc tcctctccca tattctgaaa
    53101 tctggactac agactctcat tcaactccag gttgcactgt ggacacagtc ccctcttgag
    53161 caggtacctc cttgcagtgg ttgggacgtc ctacttggct catagttggg aagtgcatat
    53221 gctggagctg aagcctcttg ccttcccggg atagggcgtc ctcacatccc ctctgagaag
    53281 ttccccagct tccctctgtt ccccgtttcc acacttagcg aggctcttgt ccactgctac
    53341 atcccccata gccagtccct cagccttgcc attgcttatg ctggtctgga aacaattcct
    53401 agacttgtgg ggcatctggg gaagttctcc atcttttttt agctggcatg acccaagtgg
    53461 tgtgggcagg gctgtggatt ctatggtgtg gctggaagcc aggtagcctc tctctactgt
    53521 acatggaact cagcaacttc tgagtcaagc aagatcttag ctctgcaggt gtcttgccct
    53581 gtccaaagtt atggccacac cagtaccttt taactctaga agcccagtaa gtgttttgtg
    53641 ggaccgcaaa gatcattttc tagacctgct gaatatgcct agaacgggta gggatggctt
    53701 tcacgctgtt cctagggctg acaagtcaca cgtttctggg ggtacataca caccgcggtc
    53761 cctgtgaatg gcactctcca tgagaactgt gatgatttga gttgaatagt gcacagccta
    53821 catggttctc tgccatggcc tggagttcct tatcttgcct tctccagtga ggactagggc
    53881 tgcaactggc ctactttggc tcctgacttg ggggattctg aaataccttt tttttttaag
    53941 gttgtggagc tctctgaagc ttataaggat tttgccagga aaagataaga aatatctttg
    54001 ggcattttgt cactgtgctg gagatgaacc ctttggagga catatcacct tgttgaggtc
    54061 aaggggcgga aagggacagg actggcagag agatccgggg cagcagcctg ccatcccgac
    54121 tgagtatgga gtttctctct cccttcagct gcacttttgt gtggagtcag tggctcagct
    54181 gccacttccc ttatgttcat ggcatgaatc tggcttgtta ggcctttctt tttttttttt
    54241 ttttttgaga tggagtctca ctctgttgcc caggatggag tgcatggagt gtagtggtgt
    54301 gatctcggct cactgcgacc accgcctcct ggattcaagc gattctcctg cctcagcctc
    54361 ctgagaagct gggattacag gcgcatgcca caacaccctg ctaatttttt aatttttatt
    54421 tatttatttt ttttcaaggc agaagaattt ttcttagtac agaacaaaat ggaatctcct
    54481 atgtctactt ctttctacac agacacagca acaatctgat ttctctatct tttccccaca
    54541 tttccccctt ttctattcga caaaactgcc attgtcatca tggcccattc tcaatgagct
    54601 gctgggtaca cctcccagat ggggcggcgg ccgggtagag gggctcctca cttcccagaa
    54661 ggggcggccg ggcagaggcg ccccccacct cccggacggg gcggctggcc gggcgggggc
    54721 tgccccccac ctccctcccg gacggggtgg ctggccgggc gggggctggc ccccacctcc
    54781 ctcccggacg gggcggctgg ccgggcgggg gctgcccccc acctccctcc cggacggggc
    54841 ggctggctgc cctgctaatt tttgtatttt cagtagagat ggggttttac catgttggtc
    54901 aggctggtct cgaactcctg acctgttgat ccacctgcct cagcctccca aagtgttgag
    54961 attacaggca tgagccactg cgcccggctc tgtttttttg tttgtttgtt tgtttttgtt
    55021 tttgttttga gacggagtct tactctgttg cccaggctgg agtgagtgga gcggcacgat
    55081 cttggttcac tgcaacctct gcctcccggg ttcaaacaat tcttcctgcc tcagcctccc
    55141 gaatagctgg gattacaggc acttaccacc aggcctggct aatttttgta ttttttagta
    55201 gagacggggt tttgccatgt tggccatttg aactcctgac catctttgaa ctcctgacca
    55261 tgtttgccat gttgaactct tgacttcagg tgcgttggcc tcccaaagtg ctgggattac
    55321 aggtgtgagc caccatgccc agcctgtttg gcctttctga tatggctctg actaatcttt
    55381 ttggaaatta gtccccaggg ttatactgga ttttacttag ggaaaagggt catgcctctc
    55441 tggctgtcag atttactgat agtactaagg actcagtggg gtggaccttt gattctggtt
    55501 tgatttttga aaatcaaaaa gacgtgagct ccagggagca gggtggcttt ggtgacatgg
    55561 caagatagtt ggctgtggca gggagttgag ggaagtgggt agaaaattaa catcttgtaa
    55621 atattccctg ggaaatatac cttcgtgtta agagaacaga cttggcagcc agatggccta
    55681 ggttcaaatc ttggcttgat gcttatcagc tgtgtaacct tggataattc catacatctc
    55741 tgtgcctcag tttcctcaaa tggaataaca atagtacctc cctcaggact attgtggcaa
    55801 attaatggac gaataagggg aagcacttag tacagtgcct ggcccagcat aggtaccagg
    55861 cttgttctta agctcactgc atttttacaa tcatcataaa atgcagggga tacacacatg
    55921 aaggagccga agttcagaga ggccaagtaa cttgcctaag aaagcacagc tggcaagggg
    55981 cagtaatagg accagaattc cagtttctcg tgctctgttg ttgttatatc ctaaagagag
    56041 cagctctgag tagccagaag cttccctaaa gtcacaggac atggggcatg ggctggctgg
    56101 gatgagaaag gagacaagag ggcttctgaa agaaatgcca gattcactcc acttcctggc
    56161 ttcaggcacc gatggaatgt ttcccaaggc ccatctagaa agaacatcct gtgactcaca
    56221 gccacttctc attttctgtc tgaaccccct cacccattca ggcagctgct aaagttgagg
    56281 ttacagcctc agactatatt ttctgtcctt gtggaacccc agtgtgtcat cttgttggga
    56341 gatctggtga tacatgtgtc aacattatgt catcaaaatg gaaattcttt gaaatcttta
    56401 ggtgattgca attcacgttc tgtatgtatg cacttgtcaa aagttttgat ttgaggcctt
    56461 agaattttat atttggaaac ctttccacta ccatgagttt tcccagacct gtcaaagcca
    56521 ggctgcatct cagaaaccag gtctttgatt tccatccagg gcaagggcct gggcccagct
    56581 gggctgtaag caggtggggg tggggagcaa cgctgcactg caatgttgaa atattacttg
    56641 aactaaatca aatcaaagat cagctttact cagacaagaa tagaaaacac aattgcattc
    56701 gattacagaa tagtgtgtat ccccacaaat atcagactgc ctttaaaaag ttttgaattg
    56761 ttaacatcaa gaacagtgtt gcgtgtctcc tgcttttcca gcataaggtt tatttattct
    56821 gtgggtggca aagagcaatt tgggagtcca gtttgtttct cattgaaagc tttcccattt
    56881 ctggtcctct tgtcactgtt gcattgaggc accaaaaggc aatctcaatg cgacactatt
    56941 caacagacta agttgcaccg gataatgata ccattttaca tttttcatat attattacat
    57001 tgaaggcttc aaacagcact agcaggggca aattggtatt attatctcca tttcattgat
    57061 gaggaaaccg aggctcgaaa gggtaatgtc tgttgtccaa gattaaagag taaagttgta
    57121 cgttgaatcg gggtctgact cctaggctta gcattttctc cccacactat gctgccatgt
    57181 tgcttattcc aacattagga agcataggtg ccatccccag cttttgaggc caatatcacg
    57241 atgaagcatt tttaaaacat ctcattaaat tgctgatata gtggaaagaa ccaaagcttt
    57301 gcagtcaaag ctgtttgggt tcaaatttac cacttgtttg ttctatgacc ctgagcaaga
    57361 tattctccag atctgtttcc tcatttgaaa aatgggaata ataatacgtt tctttatagg
    57421 ctgttcttaa agattctgga aaataatgct aatagtgtgc ctaatgcttg gtaaatatga
    57481 gtcacttttc tgtgcccaca aagcactact atgtccctta ataaattttg ttaattttaa
    57541 aagttagaaa aaaattaaac tatttataca ttgtgtatgt taattcttcc ctagaccagc
    57601 cttaggaaga atctcatccc caacttgtaa actcatcttt ttccgttctt tttgtgcctg
    57661 gacttctcag ggccctgcag gctgattcta gtcccatgtt gtgtggtgtt tgaagtgtct
    57721 ggtccctttt ttcagtgaga gaccagctca tccttgggaa ctgaatgcct caaactctct
    57781 tttctttttc tctcttcccc ttctgtttga tgtagtcttt cctgtttctg gactctgttt
    57841 cttcatactt cctatctctt accttctttt cactcctttt gtcttccagc tgtcctctct
    57901 catttttctg cctctctggt cttcaggtag agttttcatc tcagctatct tcttgctttt
    57961 tctgatgttg gttctttgtt tcttcttctc attctgttca ggtccaaaat tcatttgggt
    58021 caatgttatg tcttagtgtt atcttccatt tccttctgag cttcaaagcc aggctgactg
    58081 tgcccttcca cgccctggcc agtgtgacca ggacatcctt tccctctggg gctgcactgg
    58141 ctctttgggg gaattgttac cattcagggt cttcaaccct cattctaggg acttccagta
    58201 acttctccca ccttccctct tctcagtaag acatgggtat tgtcttattc tgtttctgct
    58261 gctagaagaa aattctcgag actgagtaat ttatacacaa tagaaattta cttctcacag
    58321 ttctggaggc tgggaaatcc aaaatcaagg tgttggcagg tttggtgtct ggtgagggct
    58381 gctctctgct tctaagatgg tgccttgttg ctgcatcctt aggaggggac aaacgccatt
    58441 tcctcaactg gcagaaggga ctgaagcctc tccctcaagc ccttttatag gggtcctcat
    58501 tgtcaggcct ctaagcccaa gccaagccat cgcatcccct gtgacttgca catatacgcc
    58561 cagatggcct gaagtaactg aagaatcaca aaagaagtga aaaggccctg cctcgcctta
    58621 actgatgacg ttccaccatt gtgatttgtt cctgccccac cttaactgag tgattaaccc
    58681 tgtgaatttc cttctcctgg ctcagaagct cccccactga gcaccttgtg accccctgcc
    58741 cctgcccacc agagaacaac cccctttgac tgtaattttc cattaccttc ccaaatccta
    58801 taaaacggcc ccacccctat ctccctttgc tgactctctt ttcggactca gcccacctgc
    58861 agccaggtga aaaaaacagc tttattgctc acacaaagcc tgtttggtgg tctcttcaca
    58921 cagacgcaca tgaaactcat ggtatctgtg aggggcccta atgctattgg tgagggcact
    58981 gctcacataa cttaatcacc ccttaaaggc cctgcctctt aatactgtca cattggtgat
    59041 gaagtttcaa tgtatgaact ttgagggggg acacgttcaa atcatagcag gtgtgctatt
    59101 gccttactag caaagtaatc tgggggaaga tgagtaatgt ctcgttccac ccctgattcc
    59161 agtgctgact gattctggag tggcagagag ggttggaggc tcaccgctct gctcacccga
    59221 ccctctggcc atctcctctt agaatgcaag ggcagggatt ttgttacaca gcgcctcttt
    59281 tgttataggc tagctcctgc cttcaaggag cactgagaaa aatattatcc ctttgaacca
    59341 caacactagt attttgggta ctgtagccat tcagaagttt gttgaactaa tacaagttta
    59401 tattatttgt aaaatactag aaggaatgtg tggttcctaa agttatggta ttcccattgg
    59461 ttgaaagaga agactctggc ttcctaatgc cttggaaggt tcagggagca taagcctaaa
    59521 tatcctggtt tctcttggga attcttcaca gcttgtcatc tatgacttta tccacattta
    59581 ttttgaacat gtttaatttt cagcttgtat tttctcaagg agtatgtgtg agtcagcatt
    59641 cagagtactt aacacactta gtgtcattag tgataactta acccacacgt ataggaagct
    59701 agacctccat ttacagaaaa atggaggctc agagagccta actgactttc ccaaggccac
    59761 ataactggaa aattctggaa ctgagttttt tcaggtatgt cttggtatcc ctcggtgagc
    59821 cttctctgta gagaaggaag tgtctcctgg gttggaaagc ccaggccggg atgaacagca
    59881 tgggtgtcct taggctgtgt gtaccacaca cccctggcct actctcccct gctctaagga
    59941 gtcacctgaa ggagcagctg catcaccccg cctcacccct tctcctcagc acccaacaac
    60001 caggtcttcc cagagagtgc tcagctattg gtgaagcagg catcgtctat tcttcaagaa
    60061 gacagcagag catcaggaaa acaaaacacc aaaaaaatac cataaagaca ctggagtgta
    60121 tttgttacca gcccttgtat aacagagaag tatttcaagg tcttatttgt aaggctggac
    60181 aagctcaggc tgattaaaac cacgagttag aatttgtctc ccgctcccag gccctgctca
    60241 gttgacagct cctagcaggc tgggaggccc caccccaccc ccaccccagg ggtccattga
    60301 agaagaccat ctgggcctgg gatgctgatg tgattttccc tacttctcct tttttttcat
    60361 ttgcctcctc cacccacttc taaagcagct gattgctctg gttccttttc atgttgagca
    60421 aaagagaaga cagatcttga aacctgagct ggggactgga gggcactgac cgagagtaac
    60481 cccgtctggc tcagctcagc tcagcaaagg gctttcctcc tccgccttcc tgagctgctc
    60541 tccctcctct cctttcctcc ttctttccca gcctccccca agctccttca cccctctctt
    60601 ccctctcctc cctcctccca ccttccttgt cctctatgct cagagaactg aaggaacact
    60661 gaatggcctg gaaagcagga cctcgccccc accccgcgtc atacccctcc gcctgcacgt
    60721 gcaaatgtgt tcctcagctt ggtcccagag gcctggacct gggtcccaga ggtcccagct
    60781 ggttccccgg cttggtccca gaggcctgga tgtgtggcaa gaagctgagg gttctgcttc
    60841 gtttcttcca gcccacaaac acacatgtca cctgtttttt gttctttttc ctttcctttg
    60901 ccccacttta cagaagctct tggctaattg gtaggtcact gtttgctcct ccaggcaaaa
    60961 ccaaggaggc tgagctcagc aggtcaagta ttttggtctg tattgaagag actgtggtgg
    61021 gagagccctc ttgaccttct gctggatagt gactttctct tctgctggca gcagtagccc
    61081 agctccgcat cctgtgacag aggtcatacc ttccagttac ccgagcgaac acttggagca
    61141 agtcagcttt ttcccctgaa acagcacgtt cacgttcttg atctttggtt tgcacaacaa
    61201 cctgtgaatg ttattagccc cgtttttttt gtttgtttgt tttttgagac agagtgtcgc
    61261 tcttgttgcc caggctggat tgcaatggcg tgatctcggc tcactgcaac ctctgcctcc
    61321 aagcaattct cctgcctcag cttcctgagt agctgggatt acaggcatgc gccaccacac
    61381 ccggctaatt ttgtattttt agtagagacg gggtttcttc atgttggtca ggctggtctc
    61441 gaacttccga cctcagttga tccacccgtc ttggcctccc aaagtgctgt attagcccca
    61501 ttttatatcc aaggaaactg aggcttaggg agatttaaat aggggggaaa gtatagcttc
    61561 atgattacac agtcaggaaa tggcagagct gactcttgaa ttcagaccct gtgatgctgt
    61621 atccagtgct gattttgttt tgccctgtgt tcctttcagt gtcaggtaag aagacctcag
    61681 agatcatctt ctccaaccca cacattttac tgattataaa atccaggtcc gcattaactc
    61741 attggccaaa gttaagtggc taaatgactc tcctgttcag gaagtattcc ctgtcttgtg
    61801 tggaatgatt ggtctttcgt ccaggtcatc ttccacctat atccatcctt gcagtgactg
    61861 gtggatatga tcattcctgg tgattccctg taaaatacta gatttattta ttctttcata
    61921 aagttatgag tagacaaatc agtcagcttc aacctttcct ttaccttata gatagtatca
    61981 gcagaaagaa ctcaataatt ctgtcttagg aactcagcca ggctaatatg tattataata
    62041 cacacataca tatgcacaca catacacaca tgtgcatacc ttagatcaca aattcaacat
    62101 ctcaacttag ccgccaaagc cacttctgtg tcttgtctat tctgaatatg ctttaaggtt
    62161 aaattagcca tgaatctgtt cagatcagtc attcctgaaa atgatgatga tattagccat
    62221 cctttaccgg gcaccttact cagtgcccgg cattatacta ggtctgcagg tacatttgct
    62281 catttaacac ctagctcagt ccaatgaggt ggatgccatt gtcatctcca ttttatcatt
    62341 taatcaactc agccttcctc tctcctgttc cccactctgc ttctagctgc cactcacctg
    62401 actctcctct tgatctgggc tgacgtggct gagtgggttt agcagaactc ttgtgtcctc
    62461 tttacttggt ttgtttcaag ttcatgtcct gcttcatctt ttccctaact cactgcaagg
    62521 tgaattttct tctctgcaat cggcaattcc ccaaatcaca tcaggtcatt aatttattca
    62581 ctcatttgta ttcattcatt cattagttca tccatccatc tattcattca ttcatccaat
    62641 ttactgagtg cctcctctgt gccaggcact gctggctttg gaattaaaaa ccctggccct
    62701 caaggaattt ctattcttct gggagcagga tatttacatg ggtgactgca gcttgatctg
    62761 tgtggtacta gtggaaaaca taggatgttg ggcacaaaag agtcacaaaa cgccgggcgc
    62821 ggtggctcat gcctctaatc ctagcacttt gggaggccaa ggcgagtgga tcacctgagg
    62881 tcaggagttc aagaccagcc tggccaacat gatgaaaccc caactctact aaaaatacaa
    62941 aaaggaataa tagctgggcg tgatggcggg tgcctgtaat cccaacaact cgggaggctg
    63001 aggcaggaga attgcttgaa cctggggctt tggaggttgc aatgagccaa gagcgggcca
    63061 cttcactcca gcctgggtga aagggtgaaa ctctgtatta aaaaaaaaaa aagtcacaaa
    63121 aaagggctgg ccacctaacc cagcttgagg atgggaaggc cagggcaggc ttcctggagg
    63181 aggtgactca tgggccatgt catgaggatg gggtaagagg agaaggtatg tttcagacat
    63241 ctggctttct taatattctt ggctttccac ccattttcct gccagcaaca taggggagaa
    63301 gactgagacc agcagataca aagccactgt actctgtcct caccccctcc ttttttctcc
    63361 ccttcctact acccagatct gaggttttga gaaatctctt ctctaattat ctgcttgttc
    63421 attcatttgt tgagaatcta ctatgtgcat agcattatga ccacaaaatg tcctggtcgt
    63481 tgtcatcatg agaggtctta ccccttcctc tccagccact tctagggatt tttgactctg
    63541 tccttttcca gaacttggct ccagtctggt tgctcgccat gaagcactta cagataaacc
    63601 tcatcttggg ccagtgcttc catttactgt ctccttttgg cttgcttatc cttccttctg
    63661 ccttcttgaa ttgatttgcc tctttatctc tcctttcagc cttgaaagtt cctcgaaggc
    63721 agggactgtg tccccatctt ttctataaat ggcattgttg tacattggta aagttgagtg
    63781 aaaaaatctt tttgataatc acaaattatt accatatatt gagcacctac catgatgagt
    63841 atctctgatc ttcatatctt tgaaatgtgg atactattac cttcacttaa ttggtaagaa
    63901 aatttgtgct cagggaggta aagtcacttt ctcacagtca cacagctatt tcacaggaga
    63961 gtgcagtatc aaatttaggt ttttctggtc ccaaaccctg atgtttcccc cacatcattg
    64021 tttctcagac ttggctgcaa tcacctgaaa aattttatta aattcttttt tttttttttt
    64081 tgagacagag tctcgctctg tcacccaggc tggagtgcaa tggtgccatc tctgctcagt
    64141 gcaaccactg cctcccacgt tcaagtgatc ctcctgcttc agcctcccaa gtagctggga
    64201 ttacaggcac ccaccaccac actcagctaa ttttgatatg tttagtagag actgggtttc
    64261 accaggctga tctcaaactc ctgacctcaa gtgatctgcc cacctcaccc agccaggttt
    64321 tattaaattc tgatgcctgg gtcttaccct agcagttctc attagttgct ccatggtgtg
    64381 gcctgggctt tgggattttt gtgaagcccc cagctgagtc taacgggcag tcatgtttga
    64441 gaaccagtat actacaccat gttgccttcc tgttttcaca caggttggtg ctagtgtgtg
    64501 agtctaagcc tacatgggtg gatatccacg tgatcaggct gcaagttctt tgtagtggag
    64561 gatctttggc cctccctgct gcttcccaac ccagcgaacc taccacacca tgcacgtgca
    64621 cagccaaggg ttgttgactg tttaatcagt ctctgggttc ttatgtctca tccgggggat
    64681 tcactgatag gaggctggag cttgatcaca tccagatgtt ctcatcttca gagcctgtta
    64741 acataacaga agtcttataa acatgctgag cacactcact ggtctggaga gtgttgcagg
    64801 atgacctagg cccattgcag gggtggcctg ggtcgccctc ccacacttct gccaccctgc
    64861 ttgggaagga gggagctgtt ttgaagttcc ctggtgcaat atgatgtatt gctctggctc
    64921 tgctcaggag gaagctatgg cccacctagt cagtgagggt tagctaatac gtgtattctg
    64981 tttctgttga ggtttcacag aggccttttt tgacccttca ttttatagat aaggcagtgg
    65041 aggcacaact acatgaaatg actcgacaaa taaatggatt tagaacccag gtttctgact
    65101 cccagggtgg tgctttttcc atggttgtac agtgattaat gtctaccttt tcacaccagt
    65161 cctcaactga agacaccagc ttacaccttc cttcctgttt ctcccaagaa cagaaagtga
    65221 ccccggatgt cgctttctgt tcctgggaag gcagttccag tggttagaag tcctgttcac
    65281 tcctggggtg tggcctgggg atggtcctga catccctggg ctcttcctgg acctggccag
    65341 ctaaaaggaa atctcctatg atggtactca gatacttttg gaaccttgtc agccctaatc
    65401 ccatctccta gtgtttagta ttcagctacc cttcactggg cagtaattct gtgccaggcc
    65461 tgatctgggc attggtggta ctaagaacgc atattcccta tcctataggc atagtctggt
    65521 agagacaaca tgcaagtaaa acaatgttcc ttaaactgtg gttccacacc tccctccccc
    65581 aacattaaaa gtgtaaggga tgcttattca aatgtagatt tgtaggctct gcactctaga
    65641 cccactattt cagaatctct ggggactggg cccaagaaac tgcattttcg catgctccct
    65701 aaatgaagct taggtgctct gaggtttgac aactgcagta gagagcctaa tgctaacagt
    65761 gtagagtcac atgtgatggg aagcatcagg taggtagcag tttgcaggag cactgattct
    65821 gagggacact aactgggcct aagaacagct actggctgtc atgaggaata actaggagct
    65881 agccatagag gggtagcagt gaatcatttc tctagcgatg taaatcttgc tcaatttatt
    65941 ctgtctatat aactcaatat tactgaagtt tgcctaaagc agaatacacc tggatcatac
    66001 agcatttatg agagactggc tgggctgtca ggccctcctg ttactttatc tctgcatgtg
    66061 accctcttag ctccgcggat taactcctgt cctcattaag cctcacactg tagccccatt
    66121 ttcagatcaa acctgtttct cttctggtaa atgatttcag tttgcaaagt ttgccctcta
    66181 gaggttgctt agtgcctggc catgtgggct cagttcatgt ggtcctgatg agctggtttt
    66241 atctttatta caaagaagtt aggctgttag gagagtgggt tggaaggaga agaggtagac
    66301 agccaaatga gatgagtcag ggaaaactat actgtttcgg agtcataggg ctcctaccaa
    66361 gcatctggtc agaaacctct cattttggag atcaagaaat tgaggttcag aaagatgaca
    66421 tgaggtacgc agggacgcca ccagacacag cctccaactc tagaaactaa aattctggat
    66481 tcttagtgct gctttttctg ttttgttgca ctggattgaa gccttttcta actgtactca
    66541 gagggcctat tatttaggga gattccgtat gaaatccttt agcaatcaaa tcatttaata
    66601 gggctgatgg tttaaatata tgttaatgtg ttttcctaaa gcctggcaga cctgggtttt
    66661 ggagctttgt atcacatgtt ttatgtttgg aatgaaaatg agaccatgtc tgtgaaggca
    66721 ctttgatatg cgtaatgcac tctgccagtg tttgtcaaaa catggtcccc aggtcagcag
    66781 catcagcatc acctgtaagt gattctccag tcgcatcccg gcccaggaga gggccaaact
    66841 ctaggctttg tggaaggcga gggggtgccg gcccggcccg gcccggcccg gcccggcagt
    66901 ctgcatttta acaagctctc caggtgattc tgatgcatac ttaagtttga gaaccattgc
    66961 ttgttttgca ttaaacagga gattagtctc tgcagcttgt gggaataaag ctttaaatct
    67021 ctccaatttt agctctgtga aaaggcagtg gggagacagg aatgaacgga ctagtgccac
    67081 aaagctcagg tggggtgggt gagatcattt agaagagaaa gaccgggcat ggtggctcac
    67141 gcctgtactg tcagcacttt gggaggccaa ggcaggttgg atcacaaggt caggagtttg
    67201 agaccagcct gcctatcatg gtgaaaccct gtctgtacta aagataaaaa aaaaaaaatt
    67261 tgccagtcat ggtgatgcat acctgtaatc ccagctactc gggaggctga ggcaggagaa
    67321 tctcttgaac ccgggaggcg ggggttgcag tgagctgaga ttccaccatt gcactccaac
    67381 ctaggtgaca gggtgagact ccgtctcaaa ataaaaaaaa aaaaagaaaa ggaaaggctg
    67441 tgtgtgtgtg tatgtgtgtg tgtgtgtgtg tgtgtgtgta acagcaccat cacactgttt
    67501 gagttgagga gcacatgctg agtgtggctc aacatgttac cagaaagcaa tattttcatg
    67561 cctctcctga tatggcgatg ctcccctatc tcattcctgt gtgtgtttag ccaggcaact
    67621 gttgatcatc aatattatga taacgtttct ccactgtccc attgtgccca cttttttttt
    67681 ttttttgagt tacttactaa ataaaaataa aacactattt ctcaatagac ttgaagcttc
    67741 aagatttcct ggtggacaat gaaaccttct ctgggttcct gtatcacaac ctctctctcc
    67801 caaagtctac tgtggacaag atgctgaggg ctgatgtcat tctccacaag gtaagctgat
    67861 gcctccagct tcctcagtag ggctgatggc aattacgttg tgcagctact ggaaagaaat
    67921 gaataaaccc ttgtccttgt aatggtggtg aaggggaggg aggtagtttg aatacaactt
    67981 cacttaattt tacttcccta ttcaggcagg aattgccaaa ccatccagga gtggaatatg
    68041 caacctggcg tcatgggcca gctggttaaa ataaaattga tttctggctt atcacttggc
    68101 atttgtgatg atttcctcct acaagggata cattttaagt tgagttaaac ttaaaaaata
    68161 ttcacagttc tgaggcaata accgtggtta agggttattg atctggagga gctctgtcta
    68221 aaaaattgag gacaggagac tttagacaag ggtgtatttg gagactttta agaattttat
    68281 aaaataaggg ctggacgcag tggcactgag ttgagaactg ttgcttgctt tgcattaaat
    68341 aggagatcag tccctgcagc ttgtgggaat aaggctttaa atctctccaa ttttagctct
    68401 gtgagatggc actggggaaa cagaaatgaa cggactagtg tcacaaagct caggtgggat
    68461 ggacgagatc acttcaaagg tctgtaatcc cacgtctata atcccagcac tttgggaggc
    68521 caaggcggga aaatcacttg aggtcaggag ttcgagacca tcctggccaa catggcaaag
    68581 cctgtctcta ctaaaaatat gaaaattagc tcagcgtggt ggcatgctcc tgtagtccca
    68641 gctactcgtg aggctgagac aggagaatcg tttgaacctg ggaggcggag gttgcagtga
    68701 gccaatatca cgccattgca ctccagcctg gctgacagag tgagactcca tctcaaaaaa
    68761 aaaaaaaaaa aaagaatttt ataaaatcag gaaataatat tagtgtttat gttgaatttt
    68821 aactttagaa tcatagaaaa cttcctctgg catcattatt agacagctct tgtgcagtgg
    68881 gtagcaccag acccagcttg catggttatt gatttttcag agacactttt tgagcttatt
    68941 ctctggcaga aaggggaact gcttcctccc ctatctcgtg tctgcatact agcttgtctt
    69001 tacaagaagc agaagtagtg gaaatgttta ttcttgaaaa taagcttttt gcttcacatg
    69061 atctagaatt tttaaaatta gaaaaatgtg cttactgcgt gcccttctga aactaggaaa
    69121 atatgccttg tgtttaccaa ttgtgtggtt aggagatggg ccaaaggcat caggcttttg
    69181 aaagtagttg catttagcat aatttccatt gccccctgcc aatttcatat ctgtcacatc
    69241 taatcagttt aaaataaggg gcatcctaag catggagatg gtccttggat ggtccttgga
    69301 gtttctgtat tttcagtatt cttttttttg agcatacaag acatttattg aaaaattctt
    69361 gggatcaata cttgtgtaag gaaaggaaag ggaacaaagc atgattgggc agaggcagaa
    69421 gatgacatca acaaaggccc tccgttgtca gtattctttt tttttttttt tttttgagtt
    69481 ggagtctcgc tctgtcaccc aggctggagt gcagtggcgt gatctcagct cactacaacc
    69541 tctgcctccc aggttcaagt gattctctgc ttcagcctcc tgagtagctg ggattacagg
    69601 catgcaccac cacacctggc taattgttgt atttttagta gagacggggt ttcaccatgt
    69661 tggttggcca gactggtctt gaactcctga cctcaggtga tccgtctgcc ttggcctccc
    69721 aaagtgttgg aattacagac atgacccact gcacccggcc tgttgtcagt attcttaaac
    69781 atagacacta actgtaggct gacagcctag cagcaaggac cagttaaaga aatgagtaga
    69841 actgaagtgt gcttgagtat ctctggcagt cagcaaaaac ttaatgggaa tcatggtagg
    69901 ccaaatgttc tgcagtattt caaaagctgc atgggttttg agaggctttt ggtccatcac
    69961 tcaccttagg ttgttctaca agcacatcag cctgccccaa tttaaacagg gcagttagta
    70021 atggtgtaat aaagagtctg cattgttcat tccttcaaca aatactggct ataatgtttc
    70081 agcactgtgg atgcaaagtg agcaggataa acaggctctt ctttcaaagc ttgtggtcca
    70141 ctggaccacg tatgaagtag aatagtttag gtccagaaag gcaattaagt aaaatatgac
    70201 caagaagagg ctctctagtg ggtttggtat aaagaaaaga taagaatgat ttagaattgg
    70261 cctatcaatg agataagagg cctggctttc tggcactctg ctctagggca agtaaaatgg
    70321 agaattccaa attctgaaat tgttagaaca tagttctgtg tcttagttaa atatctacac
    70381 ttacagataa atagcataaa tgctttctcc ccatatttca gcccagtcct acttaaagac
    70441 aacataaatt gcaaaatagt gaggatgttg ttcatctaat aaaagtggtt ccaggaattc
    70501 agactctgga ttcctgtttg ccaaatcatg tgtcccactc ttaagaaaac gagttggact
    70561 ctggattttt ctttgcaaga gggacaagag tgtgggagat actgagttaa tgcaacttgc
    70621 aggttttaag tgtcctgtca ttgtgccttg tgctttgata cattctgagt ttcagtaaag
    70681 agacctgatg cattggactg ttgcaatgga acctgtttta agatcttcaa agctgtattg
    70741 atatgaagtt ctccaaaaga cttcaaggac ccagcttcca atcttcataa tcctcttgtg
    70801 cttgtctctc tttgcatgaa atgcttccag gtatttttgc aaggctacca gttacatttg
    70861 acaagtctgt gcaatggatc aaaatcagaa gagatgattc aacttggtga ccaagaagtt
    70921 tctgagcttt gtggcctacc aagggagaaa ctggctgcag cagagcgagt acttcgttcc
    70981 aacatggaca tcctgaagcc aatcctggtg agtagacttg ctcactggag aaacttcaag
    71041 cactaatgct ttcggaatgt gaggcttttc cttggacagc atgactttgt tttgtagaaa
    71101 agtacggctg gctgggagtt tgtgatataa tttagttcag tggtattcta agtgttctta
    71161 gtgttctttc agacttttgg gccatctccc aaagggtgaa tgggaagaat aagctgggtg
    71221 tggctgagtt taagccaaaa gttttttgtg cttgtttcaa tcagagaaga cctgcttttt
    71281 catgttttta ctattataat actaagcaag agctcatttg aaaacagagt tcttcatatt
    71341 taaaaaaaaa aagtcttgaa accattgatg ggaagatgga tatctattta tgtttaaaaa
    71401 cccatcataa agatgacatt gtgggctgtc acagttggaa ggccctggaa ttagatgaga
    71461 ccacactatt tagcttactt agtaataaca ttgcaaagaa aaattccgac gaagtttttt
    71521 cagcctagga atcaatagtt cagagaagca ctctatgaga atacccattc attcttaacc
    71581 aaaaaatact ggtgagcctg agcagtttgg tcatcagagt gttttatata gttccagaac
    71641 aaatatgtct ctaggtgttc tgagagctct ggtgaaattc ctctcgctac cccaaacatc
    71701 atcatttaat atccaggatt ctggttttct actcaccaga tagattctct taaaaccagg
    71761 gaaagattcc tggaggaagg atgtatctgg aaagagatgt tccttattat aataaaatga
    71821 aattgtaata ctcttggatt ttgtgcagca cgaattcttt atagagagtt ggtcctccca
    71881 gagaattaag aatactcagt ttctggaccc tgttcccaga tcatacccta gaatgtgacc
    71941 ttagaaacac acttcaggat tcataccttt gattgaccat caaaaagttt ttgtatcggc
    72001 caggtgtggt ggctcacgcc tgtaatccca ccactttcgg atgccacggc gggcagatca
    72061 cgtgaggtca gcagtttgag accagcctgg ccaacatggt gaaaccctgt ctctacaaaa
    72121 atacaaaaaa tagccgggcg tgatggtggg cacctgtaat tccagctact cgggaggctg
    72181 aggcaggagg atcgcttgag cctaggaggt ggaggctgca gtgagctgag atctgtctca
    72241 ctctgttgcc caggctggag tgcagtggcg cgatctcgac tcactgcaac ctccgcctct
    72301 caggcttaag tgattctcat gcctcagcct ccgagtacct gggactacag gcacctgcca
    72361 ccacgcccag ctattttttg tatttttagt agacataggg tttcactatg ttgcccagct
    72421 ggtctcgaac tcctgagctc aagtgatctg cccacctcgg cctcccaaag tgttgggatt
    72481 aaaggcatga gtcaccgtgc ctggtcccat gttataattt taaagtaagg tatatttctc
    72541 tacagggatc tttgcaaccc taagtaactg gcctaaaaag ttagagaagc tgacttgtgc
    72601 agacatttgc agcctgttgg tcttttttgt gctgtgaatc atagagggtg aaaggttatt
    72661 atgaatggta caaaactttg ttacaaaacc attttcttgg actgttttgg gctgcttcac
    72721 tgcatgacaa atgctcaccc tttcagctgg aatgattgaa attttggaaa agatgggtgt
    72781 ttttagaaga cattgtaatt tgttccggtg ctgtgcccat tcattccatt tcacttctgt
    72841 ttactcatta aacacctatt gtgtacacaa cccggtaaaa tccctccact cacacaatgc
    72901 ctgaattata ctcatagtag aatgactgtt tagccctcat catctgataa ttaacagctc
    72961 aggtttcaac ctgacagtat ctctctggga ggattagcag cgtgacagag tgcagggaaa
    73021 tgcaccttca gaaccgtcag ctacactgtg tcccatcctg ctgtgttgtg gttgtgcctt
    73081 gtggatgcgt tggtttatga ccaggtattg attaaggtgg ctactaccag gtgctttctg
    73141 catatctcgg gtttgtggag cactcaggtt ctgcttctgc ccctctgctg ttaccaagag
    73201 acctctcttc aaaatggggc tcttgagtta gagtagaatg agtgatcagg attgttttgt
    73261 gtaaagatga tttctgagga aggctttagg atgaaatgac ttccaaacat tttgaaatgt
    73321 gactcttact tattgaatta agcagggcct taattggaat gctgggactg atacttgatt
    73381 tgcattaagc agcctttttc tattgctgct tggttgaaat ttcaacattt gtgatggtag
    73441 atggatgtga catgtgatga cattgcacat gggcagttaa ctgtgccaag aagtgcagca
    73501 gtagcagcaa ccggagatgc aaagcccaac atgatgggga gagaaactct tctttcaata
    73561 tgtgcttctg taccaaaagt ggaatttcac gagagacata ttttgaaaca tttctccttt
    73621 tgtgtgtgcg tgagtgtttc cctgtttcca gccaagggta ttgtgagttt ctcctgggcc
    73681 tccttcagaa tctgggtgct ctggaaagca gtgttttggc aacatgggga aagtatggca
    73741 gtgtgggagg gtcagctggg tctgggtttg aatattgcat ttgaatattt taccagcatt
    73801 gatgtcggat aaattattta gtccctgtaa gcctcagttt tctcttcttc tacatacaca
    73861 taatatattt gactctttgt tgtgattatt ggttacacat atgaagagcc tggtgtgggg
    73921 cctggcacac aataggtgct caataaatag aagttgataa tttaattgac atgagtagta
    73981 gaaattatgt ccttgaaaac aattgcgtca agatagaagt tttcagccag gcacagtggc
    74041 tcacatctgt tgtaatccca gcatattgtg ggggccgagg cggatgaatc acttgaggcc
    74101 aggagttcaa gaccagcctg gccaacgtgg tgaaatcccc tctctactaa aaatacacat
    74161 atttgccagg caggcgtggt ggcgcacacc tgtaatccca gctactgaag aggctgaggc
    74221 acaagaatcg cttgaaccca ggaggtggag gttgcagtga gctgagatca ctccactgca
    74281 ttccagccag cgtgacagag tgagactctg tctcagaaaa agaaaaaaaa gatagaagtt
    74341 ttcttctgta gatcagtgtt agaactcata ccaagcgaag tggtcctggt gagtatttca
    74401 gtgaaaaact gcattcttgc tcagatattg tcaagacttt tcacccaaag attcttattt
    74461 atgtctcagt ccgtaccttg tgtgaaaatt aatactggat gtcagaacgc tgttgtgttt
    74521 ttaaagttcc ctggggttaa gagcagtttc cattaggtgt tctctgcttt ttacttaaaa
    74581 atcttactca tgcattgagc aatatttatt cagttcttat tatgtgtcag gtattttcta
    74641 ggagctggac tcaactcaaa agatatcctt ttgatgagaa caaaggtggg tggatatatg
    74701 aaatattatc tgtgggataa atgcacttag tcatgaggga gacttgttat ggagtgcgct
    74761 cattgtattt gtactgttga gttaacaact tctaggagga gctcagggcc acctggcagg
    74821 ggcttctttt gtcttgctgc tcagcaaggt gtattttgct gtagagtgtg ctgggcaggt
    74881 gaacttttct taactttctc ttgggtcctt cctaaagcag catgtacctt tcccagagcg
    74941 aggagagggc caccttcctg tctcacagaa acctccaatc tgttttggac tgcaggaagg
    75001 agccatagta gtggaccagc aaattttggc ccgagagatt gatttgcttc cgattgtact
    75061 tttttttttt attgctacta taacaagtca ccaaatactt agcagtttat aaccacatac
    75121 atttattatt tcataaggcc agaagccagg atacagtaga gctcacctag gtctacttct
    75181 tagaggctca cagggctgat atcaaggtgt ttgtggggct gtgttttttt gggggggctc
    75241 tgaagatgaa tctgctttgg agctcatcca ggatatcaga tgaattcagt cccgtgcagt
    75301 tgtaggactg aagtccattt ccttgctggc tgtcagcagg ggctggtttt tgctcctaaa
    75361 ggttgccatc attccttctt atgcttttca tgtgactcct ctcccacaag tgggttgagt
    75421 atctctcaca ctttaaacct ctgacgcctc ctgtcacatc tgtctccagt cagagaaagt
    75481 tctttacttt taagggctca ggtgtttagt ttgggcccat tcacataatc caggataatc
    75541 tccttatttt gaggttcata actttaatta catctacaca gtgcatttgc tatgaaatgt
    75601 gcatattcac agactccagg aatagggtgt ggacatcttt gcaggggata ttcagtctgc
    75661 tgtagctttc ttgattggaa ggaatagttt atcatatatt tgaagtgttc cctcagtgcc
    75721 tttggctttt gttgacccct ctgcagctct ctgctttttc ttgcccatat ttggaaagtg
    75781 actctaaagc ataataagca tcacctatta gggtttttaa tgtacaaacc aaacagaggt
    75841 gactttggga ggagaacatc tctgaactag gtatgagaca ttcatcgaaa aaaatccatc
    75901 aagtgtttat tgtatgtctg ctttatacca gcactgttct aggcactgaa gttcgaccac
    75961 caacaaggca gttgtgatgc acttggagct ttcattctga tgggcattga gatccaggct
    76021 gaaggctgag tctgggaatt tgaggaattc tttgtaggtc ctgggtctac agagtgagag
    76081 ctgtcctcgt tccagtttca ctgatgacct ctcgaccagc tccctcacag cagtctttgc
    76141 caacacagac actgtgggct gtagtgggag gaatgtgggg ttgaatgagc taggtttggg
    76201 ctctgtcctt gactcaccat tgcctcagtg atgtgaaaat ggtgtctgat ccttaaggtt
    76261 ggaagtccag gcatgcagat ttatctccat ctcaataacg tggggaaaaa aaagaagtgg
    76321 tttttgtgga accagtggtt ccactgtcca cgtgtcttcc agtgtgccta gcacattccc
    76381 accaagtgga tcttggttca tgagggaaag aagggaaagt gagggtgccc cgaggctccc
    76441 agaagacaag taatgtcaca gctgagctgt gtacagatcg aagaagcaga tggataagga
    76501 gtgaacaaag tcatccttgt cttgaggggt ctttatttag cttcttcttg actcttagac
    76561 aaggacccag aatacagatg gggcttgttg ttaccttcag cctcatggct ttttagggct
    76621 agatacttca gcttgttaca tgcagtcctt aaagcgtctg tgtgggtttt tgcaggagag
    76681 aacacttgct cgtgttctcc tgtctggaac ccggacattg ttggaaagta tcagattttg
    76741 tttggctttg tgtgatttga ctgcccacct gtttacttgc tttctcccag caagcagccg
    76801 caatccccat gggttggtaa tgggtggaat gacacactgt gtagatttac tcttcagact
    76861 ctatgttcac ctcattctta tgggaaaaga agagcactag ctggtagata tagcagtgtg
    76921 ttaatatgac tgctcaactc catttacaca gacattttca ccttagttat actatttctt
    76981 cattaaatat tgttgccaga tctaagatac aggtttaatt ttttcctctg aattatgtgg
    77041 tagtagatgt atttaactat gtttagaaac agcaaaaaat gaagcgtttg aatgcgttaa
    77101 acacatctaa tttgaaagtt aatatttagg ttgttgattt attttttaaa agattagaat
    77161 attccttaga aatgtagtct tataatttcg tatttcaata aaaaatatta aaatgtttcc
    77221 cagaggaaat ctttactgtc atagaaatca ccagaaagag atagcaatta ctcctgggtg
    77281 gtgatagtct ttgatttgtg gtttacttgt tttcagtttg aactaaaaca tactaatccc
    77341 agcctgagtt ggattttgca tatggtcaaa gtgaagtaga gatttttgtc tactttatga
    77401 agatatttaa aggacatttg aaatgtttca atgaacacat tgtaacatgc attcccagga
    77461 ggaaaacaaa attgtgtatt gtgttgaaaa tactattcca atatatgtat acccacgtct
    77521 cattttgtca tagaattcct gaaaaattta gatgtagatg agatttttat aagttgaaaa
    77581 tattttcagt tgcattttaa ttgcacagat gtgttttcta cttctatttg gctgtgactc
    77641 ctaatgcatt gtaatattat cttccagtgt tcatttcgtg tagtgataat ggattaacaa
    77701 atgcattcat tcatttagta aacatttact gagccctaat cagtgccatg ctctggccca
    77761 gggaggtagt catgtgcaag gtccagccct ctctgtttca ttcctggttg gggaatgggt
    77821 gggacagacc aggaaaggag cagctgccgt atagagcagg agacgctctg ctgggtgtga
    77881 tacagtccct gatggggcct cctactgcag tctttccagt caggaaagaa acatcctcta
    77941 agtggggacc taaagaatgg atgtcagcca ggtaaagaac agggagtaga atgttccagg
    78001 tagaggacat agtatatgca ttggtccaga gaggagaggg aagagaaggt ggcatttagg
    78061 ggaactgtac ctgattgaga tagttcagca tggagcagat atagctgtgg ggagtttgat
    78121 gggagaagga acgatgagac gtgatgcctg cagaggtggg caaaggtcgg ctgaggcagg
    78181 tctttcttca cgagccatgt gaggatatag ggccttatcc tcagagtaaa gggagccatt
    78241 aaaggctcca caggctgagt gacttgatga ggtgaatacg aaacagagca tttctgtgaa
    78301 tgtgaatggc ctttggagca gaactaagtt catgaggatg gaaagaatat aatgaagcca
    78361 tctcttagac ccagaaagaa tggggcacaa cagcctttat ctttctgggg ccaatgtcac
    78421 agaatgccat gcttttagga aacatggtat gttgtgatta acacattttg cagaagtggg
    78481 tgggagcttt tgaataataa cagtaagcat ttgtgcattc ttcctgtaat gacattacag
    78541 ttatgatctg aaaatattga gtcatacatg aattcctgtt atcttaactc agaaaatata
    78601 gtccctcact aaaggtttta ttttccttct ttttcccatt tcctttactt cgtataagaa
    78661 agtcacttgt ctcctgggtg caatggagac ctatgtgagt tcatagccaa gagaatgttt
    78721 ttggttagaa aaataatagt aggaattcca agctgtgaat tttttactga agctctttgg
    78781 aaataggatt tggcaagttt tgtctgcctt cgtcaagtaa gcatgagcag gagagcacag
    78841 ttaatagcag gtgcagacac atgattctca gaccgtattt tgtgttctag tttcaaggca
    78901 tgaattcttt cctggggtta attttattca aggaagttat ctgtctgtta gatctgatat
    78961 gtgctcaggc caacatagat tctttaccct tcctttcttc ctgctcacct gtccttcctc
    79021 ttttatcttt tcattgaatt aaaaagaaaa ttatgaaata gtttcaacat gaaaaaaggt
    79081 acagagaata acataaagaa cactcctggc tgggtgtggt ggctcacgcc tgcaatccca
    79141 gcactttggg agtctgaggc agccagatca ctggaggtca ggagttggag accagcttgg
    79201 ccaacatggt gaaacactgc ctctactgaa aatacaagaa ttagccaggc atggtggcgt
    79261 gcacttgtaa tcccagctac gtgagagact gaggcaggag aattgcttga acccaggagg
    79321 gggaggttgc agtgagctga gatcacacca ctgcactcta gcctgggtga cagagtgaga
    79381 ctccgtcttc aaacaaacaa acaaacaaaa agaacactcc tgtaccatca tccatcattt
    79441 tgccgtgctg actccaggtt ctatttaaga aataaaacat tacaggtaca gctgatgcca
    79501 cctctgtttc cctagctcat tcttcagaga taactcttgt cttgcagttg gatgttttaa
    79561 tcctctatat catgtatact tacattctat gtataacaat atttggtact ggcctaaatg
    79621 tgttcacatt gtataagtgt gcatattggc ctgccacttc atttggaatt atgttcttga
    79681 gatttatcaa tgttgataca tgtggaatct ggttaatttt tgccatagta ttctattttt
    79741 atactaaact tttaaaaatc catgcttcta gtctttggct tattttttca ggttatggta
    79801 tgttttggat gcacagaaaa gtaaaattaa gtcatgagca aaatatctgg ataatccaag
    79861 ctttaaactt gatgtagaat ttgaatcatg tgtgttttgt taaccctgtg atgtcaatcc
    79921 atgcctgatt gtgtaactcc aaccaatatt cctttgaaaa tggaaatttg tttatattga
    79981 ctacagattg ccaatattat tagtaaatgc tgagcactta atctcgaata aagaactagt
    80041 ttaaaaatga ttctaacaat ggcattgact gttctacctt attactcatg ggtgggttca
    80101 gccaatgttt ctgttggaga ccaaaaccaa aaacagtcaa attaaacaag cagtcaaacc
    80161 caacatacag actactgata agaaggtcat atcataagat atggcattga atttgtgtct
    80221 gctaatgtaa aaatctgatg cccacagcaa acttaataag gacctatgtt tacatccatg
    80281 ctcaattaca ttcctgggtt aaacagtcat gctttaggcc ctgctgtgtg cctggagttt
    80341 tgctgaagtg tggggctttt aagagaagga gaataagctt gctccagagt taagaaattt
    80401 aaactaaaag tcctaaagat gttggaaaaa ctattgccct tgaagatgta aattcattaa
    80461 gttggagaag accttttata caaacaacag acccattcac tgatttgtac ccttcaggag
    80521 acagatgacc ggtaatggtg acaatgggtg aatgttgggt ttggggtttt tagaaacatc
    80581 tgcacttggt gactactgta tctaattggt gtgacaaacc tggcacccca tgtgtttggc
    80641 accatcttgg gtcctactca gggccaggtg aaccgagtgg cctcttcact gcttcagagt
    80701 ctgaagcaga ttgtagtatg cggaccagac acagaatata ccaccaagca cgttggcgca
    80761 aagcatactg ggaagggagg ctttgtgaac atggtgctgg ttctcaaacc tcagtgctca
    80821 aaagagtctc ctgaggattc ctggatcaca ctcttaaact tctcattcag taggtgttag
    80881 ctggttgaga atctgcattt tttttttttt ttctgagacc gagtctcact ctgatgccca
    80941 ggctggagtg tattggcgcc atcttggctc actgcaacct ctgcctccca ggttcaagca
    81001 attttcctgc ctcagcctct ctagtagctg ggattacaag cacatgccac catgcctggc
    81061 taatttttgt acttttagta gagagggggt ttcgccgtgt tggccaggct cgtcttgaac
    81121 tcctgacctc aggtgatcca cccactttgg cttcccaaag tgttgagatt acaggtgtga
    81181 gccaccatgc ccagcctgaa tctgcatatt taacaagcac cacaggtgat tctgatacag
    81241 tagctcccca aacctcacag tgttagtgaa tcccagtcat ttacaattct gccatgattt
    81301 tggtcatatt caagtgcagc tggtagcatt tttagttaat atatttttta aattaagtca
    81361 cttcttttgg ataattaaat ttaattacaa gggaagctat accactgctg taaaaacatc
    81421 acctgcttta aagagaaggt acataatgaa tatacattaa gataaagatg tatatgtgtg
    81481 tgtgtgtgca catataagta tacacatacc taccataggg attgagtttt ccttcaggtt
    81541 tttcaactga aatgtcaact ttgaggccag ttaatatgtg taagatatat gtgtgtatgt
    81601 atgtctatac atatagacat atacgtaaaa acatacatgg atgcatatag tatatctata
    81661 cacaacctat tatgcatatc atgtatattt catccactta gtattatctt ttattttgcc
    81721 gtttggcaaa tgctcagtaa aagaaaaggg ttagaagggg agaaaggcat tttatcccaa
    81781 gccttcagga atcaggatga ggatgtcttc accttgtggt ggggagtaat tatacaatta
    81841 gagacagcac attggagtgt ggctgatatg ctgtgtgatg atagctctag ctctctgcct
    81901 agcagaggaa ggacatttca atagaagaaa aagtttaaga ccttgccgag aaacagagaa
    81961 aggatgtttg tctttttaag aagttgaaaa ccctgtttgc agacaaaagc cctccagttt
    82021 tggcagtaaa ctttcatgca agggaagaaa aaggcagggg atgacattgt tgacaattgt
    82081 gaggaattac catgtgccag gcactgtgcg aggggctttg tacatatcct ctagttttag
    82141 tgcttataaa aactctgtga tatgtgcaca gcattttaaa ctttgctgca tagtcgagaa
    82201 aatggaagga tggggaattt gagtcatttg cccagggttc tatagctacc ccaggttccc
    82261 atgactggag aattggggca cagggtggcg ggggagagtg agtgacaaga atcctaacaa
    82321 tcttatttcc attgagtcct tataaaagaa gtggattaac taccacgttt ttaagttttt
    82381 cttaaattta ggttatgtgg atctggcgtt tcttgttttg tcctgggttt gttttgtttt
    82441 tgctatgctg tcttgaacat ctgtcatctt gtaggcctaa cggtaaacac aaaaacactt
    82501 tacctcctat agctttcaat taagatctct cagtttgtgt ttgtaatagt tttccaggca
    82561 agttctccct aggttcggct tctagtgtgt taacctttag ttataaagtg aacccaaaga
    82621 gagaaagtag aaacaaaaca cctcacctgt ttttgctcat gaattactct ctatggaagg
    82681 aacaatcatg aacacctctg cgtatcacag aggcctatct gagtctgacg tttaagggag
    82741 accgcgtagg tccctttgag gactgtgaat gtgggagtcc tgggactctg gtgaagaacc
    82801 cgttccagaa gagatgaatg agctggacaa gttctttcat agaaccttta ggcaggtttt
    82861 cttagaaatg cacattgagg attatgcttg gatattgtga tgatcagaat gatactcaat
    82921 cccttctgca tttggaattc tctttgaaag aaaacatccc aggcagctat ttctcagaga
    82981 tagtgagtcc cagccacttc tagacatttt cttgtgtagt ctacattata atttcacagc
    83041 agtctctgat atgacaaatg tcaaaatagc ccaaccttct ctaaacttca gagatgtctg
    83101 atatgatatt gaataaaaca atgctcatag aaacatcaag aaaggtggat tttccctgga
    83161 tacttttttc ctgcttgaca aataacagtg aagaaactga tctcacgtct ttttctcttt
    83221 ggaagcctga acactcagaa cccaacttga ggctcctcag ctatagcaat tctgacttca
    83281 cagtctgtaa attattgttc ttttttttct ttagcttatg ctttctgccc taatttatct
    83341 tttccctgtt ctaatgaatt attgtcctat atctgctgtg cagttaggtg acatataaca
    83401 gcaattaaat atatgaattg gtacatataa agatttgact aaaactcgat gtaaaaataa
    83461 gtgttctaca ttcaatttcc agtgttagaa acagtgctga cttgaacaga gtgacagaat
    83521 tccatctttc cctatttttg acagctttaa actttatatt ttcttccttt cttgtgagcc
    83581 gtcattaact tgtttctcaa agccattccc gtattaccca tcttgcagac gcagacagat
    83641 ttgggaattt gcggtcagag ttgtattgga cacatccccc cagcccacat gagatccttt
    83701 taatctattg catattaact agttttaagt acaatattcc tacttcattt aaaaccatta
    83761 atcaaagaat gagtttgaaa atgaacaaaa tgcaaactta cagttagaaa taattgtagt
    83821 gtctttagtt ttggttagga gtcggtttct tgtttgttaa actcaagatt gtgaacagtt
    83881 ttaattcact tgtttatttc caatagagat ttcaggttta catttgaatt cagaaacaaa
    83941 gttttctttc tcattacaga gaacactaaa ctctacatct cccttcccga gcaaggagct
    84001 ggcygaagcc acaaaaacat tgctgcatag tcttgggact ctggcccagg aggtaagttg
    84061 tgtctttcca gtaccaggaa gcggatcatc cactgtatca gtattttcat tcctgagtct
    84121 ggcaagaggt ccttttgagt tgaatatcac atgggatgta atatcaattt tcaaagtata
    84181 agtgatgtaa acaataatgt tttgatttcc ttattttaga aatgaagaaa cctaaaactc
    84241 atagatgtct cagagctaat tggttagtgg ctaacagctg gatatctagt ttagaacctt
    84301 ctccattttt tctttttgcc cctaggtaat catacatttg taaagaggag aattatctct
    84361 gccactgccc atgcactgct tttgtctgac cagcaatttc tccatattgc ttcttcagta
    84421 gcaaggccaa tcattttacc aacacacatg cttgctaact aacaggaata acgtggtacc
    84481 cctaattcag ccctttccct tgaaagcatc tggcttctga ggttcaacta tgggaatatg
    84541 gtctcttaat gaacattaag ttgagtttgc cttttaggtc cacatgttga caaatgtatc
    84601 agagtaatct ctgtcctagg atcagagggc ctgtaggcac ttgcaaaagc agttagctct
    84661 gactcccagc cagtgcacac tccacctttc tgactcccag ccttgtctca aattaggctt
    84721 ggaagcgagg aactgtctgg tgtcccccag cataggaagc tgagccaggg ggcagtgctc
    84781 acaaacaata cagactttaa cgtgtaggat attggaaaat aataatttgt ggggaaattg
    84841 tctcagactt ggtccaccct tatttttagc tgcttctcta atccgttttt ctttttttgg
    84901 tgcttgtatc taacctaccc attttttggt gcttgcatca ttttttcaaa tatcaaaaac
    84961 gaactttatg ttttctaaca atgaaagtat tgcatgttca ttgtggaaaa tgctgaagac
    85021 ttggaaaata caaaaatgct gagatcaaac actattgata cgttagtgta tttcttcctg
    85081 tcctgttcta ctttctttct ttgaattctg ctcacgtgtt tctgactgat gaggtctgac
    85141 ttttgggttc cttttccaga ggagaagcct tctttcagct tgccatttgt taccctggtt
    85201 atgaaggctg gtaacctttt ttactaggta gagaagctgg accaactggg gttcttccag
    85261 ggggagaatg agaaagagaa actgttttgc aagtccgtag ctatttctct agggccctgt
    85321 tagctgacat tgacatgcct tgcattgctc tgcagatccc ctcgcagccc tctgtccctt
    85381 gttcatttct ggccttagag aaagcaaagc agggtctgta acaggggagg ctgcctctaa
    85441 actcagggtt tggttacagc tgttttcact tacatcactg gccctggttt tttttttttt
    85501 tctggcatta aaaaaaaaaa ttggaagcag gtgatgttcc cattgctgat gtggtggaaa
    85561 ctctccaagt gaacaatata cgtttttctt ggcagctgtt tcttgtgccc tgcttgctcc
    85621 tggtccagga caagcaagga ccatctgcct ctttcaatag aacacctcca gatccctttg
    85681 atcaaaagtt actcattgtc tgacttgcta tttctgtgag ataaatggga gaagatcaat
    85741 aaatgcactt gtttgtccag tcagcgtgtg gaaagttgat aattttgacc aaagcacaac
    85801 ccttgaaagg aaaagaaaaa gggagtgaat gtcttctgag aagctgccta ggttcagaca
    85861 gtgtcaccca tttccctgta tgctccacat gacaaacctg agtgggtctc atcatgtcca
    85921 ttttgcagat ggcaccaagg ctcagaaagg ttaggcaact tttccagtca cccaatgagt
    85981 taattgacaa aactgggatt caaacccaga actgttggat tccaaagcct gtgttgttgc
    86041 ctgcttcgtg aaaaactcca gtagcgactg gaatagaaag gagaaccttc caagaaagaa
    86101 aatacgcact agcagaacct ggaaattggg aggaaatgag gacttgagga ataagatgaa
    86161 tgaaagctga cctgagtttc acatctgggt gatgggaagg gaggacaggg aggcagcatc
    86221 tcagatgtcc acccagcacc gaccagctgc ctggcattgc taggtgttga ggactcagca
    86281 gtgaacacgc taacttctct gctttcttgg ggcacgtata gggtgagaga cagaaacaaa
    86341 caggtcagtg tacaatgcca caggagggat atatgcagtg aagaaaaagc agggtaaggg
    86401 gcatagagca tgagaaggtg ctttttttaa agggggtgat taggaaagct ctctctaagg
    86461 tgacagttgg acctgaagga gatgatagca tgtctgtggt gagggaagga aactccgaac
    86521 aggaagaatg gcagatacaa agacattgat gctagagcat gcctaaggaa tgtgtttaag
    86581 gaccagggaa agtgagcaag tggtgggggg aggagaggag ctcagagcag gaggaggtga
    86641 gtgccataca ggcctggcaa gactttggat tcctgctggg tgagatgaga atccagcgga
    86701 gggcttgagg gaggggacat gatgtgatct agagtttaga ctgtttacac tctggttgtt
    86761 gggttgagaa gagactggga tgggggaaag ggaggacaaa ggacattgtg ctggattgag
    86821 aaagcagtaa gtcagtttca ttcattcact caaccgatga tgttcaaata ccaccatcat
    86881 ccgtgggcta aaggatgaag agccatccct ccctgagagt caggaagcac ttcccagata
    86941 aagtttggag tgtgagctga ggtgtaggag aaagagtaag agtttacccc tgaaacgggt
    87001 gctgggaaga gtcaatagtt tggaataact caataattta tggtgcttct ttagaaagat
    87061 ttgctggctt tatgtgggaa gaaatttttt tttttgattg gggagtggtg ggttggtggt
    87121 gaggctgcct gtggaaagag aagtgagtgt tttgactcac tgttatttaa aaatctctag
    87181 ggctgttcca ataagcaaca aaaggcaaaa tggcctggtt ctctgtcccc tttctgtctg
    87241 tatgcctcgt acaggttatg aaaagaaaaa gttgggaaaa gctgtccacc tcacctaatt
    87301 gtgttcttgt ggagtgtgct agatgccccc tctctggaga aaaaaaatcc ttgtggcctc
    87361 tgacccacct ctggagagcc tagttccctt ctggaggcag aaggcaaagc ttaggaccta
    87421 gagagtgctg gaccacgcca ctcacaggaa ccagcaggct gtgaggttga aagctaggca
    87481 tatggagctt tccaggctgg gtgcagggcc tcgtggccct tcccctcccc tctgtgctct
    87541 atagctcagt cttcccaggc ggtgtgaaca cgcagtgaca tttccaggaa tacagggatt
    87601 tattaatgat ttcttgtgaa atgtttggaa atacaaagta ctctataaat atttcataat
    87661 agcattgggg ctgagaactc cacaaagtgc cggaatacat ttgcatgtaa gacagaacgc
    87721 tgcctgggtc attgatgcct gttgagtggc agtcacagac actgcctagg gtttctgact
    87781 cacgctgttg ggactgttct atgcagggca ccctcttgtg tggcatagga tttgtgcctc
    87841 accacacact gttgtagctt tgctgtcttg atgatgagta gagggcagtg tccaggccat
    87901 ggtataagca tctactgccc cccagggtta ccaaaaccaa gccaagttgt gtctcagcga
    87961 gctccgtgaa gcatggagaa gttgagtact cagagacatg acgtgacttt tcaaaggctg
    88021 taagctgacg agggacatag ctagggttca gacttgagtt tttctttttc tttttctttt
    88081 tctttttttt ttaagactga gtcttgcttt tgtcgcccag gctggattgc agtggtgctt
    88141 ggctcactgc aacctctgcc tcccgggttc aagcaattct cctgcctcag cctccccagt
    88201 agctgggatt acaggcacct gccaccatgc ctggccaaca tttttgtatt tttttagtag
    88261 agatggggtt tcaccatgtt ggccaggctg gtcttgaact cctgacctca ggtgatccac
    88321 ccgcctcgac ctcccaaagt actgggatta caggtgtgag ccactgcacc cggcccagac
    88381 tcgagttttt catcttaatg ctttttcatt gcctgacact ttactgagac caagataggg
    88441 aacttcacat acagtacctt ttctcccaag gcggaagagg gctgttcaat ttctacacta
    88501 gagttcgggg agttttagaa atgagtcagt tatcgaggat gagagcagtt cctgataggc
    88561 tcaaccacaa tgagatgtag ctgttcagag aaagcattct tttatctata aactggaaga
    88621 taatcccggt gaaacgaagc ccagccccag gggcttcact aactccaggc tgtgcttctc
    88681 aaactttagt gagcatagga atcacctggg catcttgtga agctgtagat ttgaattctg
    88741 caggtcggca gaggggtctc agaatccgca tttccaacaa tgtctccagt aatgctgatg
    88801 ctgctcgtcc ctggaccaca gattgggtag ccaggttctg gcaagctcat cccaaggctt
    88861 tgagatgaca tcagacaaaa tatgttctgg gacatggctt ttgagaggtc aagaaaataa
    88921 gatgtttctt tctcttctca tccccaaccc ttgcactgcc cttttctccc ttcccctacc
    88981 ctcctttctg tccccatccc tgacgccagc tgttcagcat gagaagctgg agtgacatgc
    89041 gacaggaggt gatgtttctg accaatgtga acagctccag ctcctccacc caaatctacc
    89101 aggctgtgtc tcgtattgtc tgcgggcatc ccgagggagg ggggctgaag atcaagtctc
    89161 tcaactggta tgaggacaac aactacaaag ccctctttgg aggcaatggc actgaggaag
    89221 atgctgaaac cttctatgac aactctacaa gtgagtgtcc atgcagaccc cagccctgtc
    89281 cccaacccca tccctccctt agttctggcc ttggcctgtg tcatctcctc cctctgtagc
    89341 agcgttagat gtctacatgc ccatttgccc accagactga gctcttccta gaggagagag
    89401 gcttctcttg aatagctacc tgtccccagt tctctgaatg cagcctggca catctcaggt
    89461 gcacagtagt gtttatcaat ggaatgaatg attgacagcc aaccttctgg ttttctgggg
    89521 gatgtggaag ggtggcttcc agggtgatca agaatgagat aatggcagaa ggacaaatcc
    89581 tgcaagatct cacttatata tggaatatat gtaaggtaga aagtgtcagt ttcacatgat
    89641 gaataagttc ctgggatctt gatgtacatc gtgatgacta tagttagtaa cactgtatag
    89701 tatacttgaa atttgctaag agagtagatc cgaagtgttc acactacaca aaaaaggcaa
    89761 ctatgaggtg atggatttat taacagcttg attgtggtga tccttttaca aagtatacat
    89821 atattaaaac atcacattgt ataccttaaa tatatacaat ttttatttgt cagttgtaac
    89881 tcaaaaaagc tagaaaagca tttttaaaaa ggatgatgta ctggtcttaa tattaccatt
    89941 gagataagct ttataataac ataaaaagaa ataacagtaa tgataatagc aacaacaaca
    90001 acaacaaaga actaacattt aagtagaatt tcttgtgcac tgtgcattct gtttaagtta
    90061 tctcatttta ccctcatgat aaccctgcag ggaagattct ttaaccccac atttcatagg
    90121 ctcagagagg ttaagtgcct tggttagagc cacatcagag ttaatccaca agagccagga
    90181 ttcaagccca aatctgcctg gatctgtgct ctctaagata actgttagtg gtggcgtgtg
    90241 tgttctcaca ctcagacatt tgatctgccc tttgtttccc attcttagct gcaaggcagt
    90301 gttaaagaac cctgtgtctc catatccact ccccacactt aagcactttt gtgggcccgt
    90361 gtgccgtatg cctcgtggca gcagggatcc aatgtcacag ttttaggcag tggcatcctt
    90421 ttccttgaaa acttgatgca ggggaacctt tctccatttc caaccacagg tgtgtctttc
    90481 agacactgag tgaggcaggt tttgtacttt attgtaacac aagaaccttt tcttctctgg
    90541 agtaaagcac tccagacatt cgcaagttgc tttacaagcc ttaaaaggat ggtattgtag
    90601 gcaactttaa ttaaatccca tctcctcctc tcccccagct tgcaagttga cccaaggaag
    90661 ccttcatttc catgacagac ttaattgtga gggcatcctc attaaaaaaa aaaaaattct
    90721 attatctttc cagcatatag aagatacttg gtatctaaaa atccctgaaa aacttagaat
    90781 gaatttttaa aaatcaggga tcctgctgga taaccaaacc catttgtctg ttacaacttt
    90841 tgtatttggg tttttgttaa gtgtacatat actagtttgt gttaattaaa gagaattttt
    90901 tttttttttt ttgagaggga gtctcgctcc gttgcccagg ctggagtgca gtggcgccat
    90961 ctcggctcac tgcaagctcc gcctcccggg ttcacgccat tctcctgcct cagcctcctg
    91021 agtagctgga actgcaggtg cccgccacca tgcccagcta attttttttt tttttgtatt
    91081 ttgagtagag acggggtttc actgtgttag ccaggatggt ctcgatctcc tgacctcgtg
    91141 atccgcctgt ctcggcctcc caaagtgctg ggattacagg cgtgaaccac tgcgccctgt
    91201 tgagaatttt tttttttttt tttgggagac agagtttcgc tcttgttgcc cgggctagag
    91261 tgcagtgaca caatctcggc tcactgcaac ctctgcctcc tgggttcaag cgattctcct
    91321 gcctcagcct catgcgtcac cacgcccagc taattttgta tttttagtag agacagggtt
    91381 tctccatgtt ggtcaggctg gtctcgaact cccaacctca ggtggttcgc ccgccttggc
    91441 ctcccaaagt gctgggattg caggcatgag ccactgcgcc cagccccaaa ttttggtttt
    91501 tgcttgaaaa ctgaggtctg aattcagcct tctggttgcc cctcaagagt cagtttaaat
    91561 gttggtcatg ttagttgtca gtgaaaacaa tggtgaggct ggcatgagag tgtgaatctg
    91621 gatgggaggg cttgtgcttc atgaaaacat ttttccagat cagctcagtc gtgagttatc
    91681 cgtcattgac gttataataa gctctgatta tttatcaagc atcattcttt atagatatct
    91741 cagtttaatc tgagataatc ttctccacat ctctccacat agatgttatg aattttactt
    91801 ttacagagga gccaactgag gctcagataa gttacttatt atatgactag tagtggtaga
    91861 gctggggttt caactaagaa ctctctggct ccaaagccct tgtaagtttc tatcagtata
    91921 tgaccatgca tatgagcatt tgtctctcct cttcttcata gctccttact gcaatgattt
    91981 gatgaagaat ttggagtcta gtcctctttc ccgcattatc tggaaagctc tgaagccgct
    92041 gctcgttggg aagatcctgt atacacctga cactccagcc acaaggcagg tcatggctga
    92101 ggtaagctgc ccccagccca agactccctc cccagaatct ccccagaact gggggcaaaa
    92161 aactcaaggt agcttcagag gtgtgcgcta agtatactca cggctcttct ggaattccca
    92221 gagtgaaaac ctcaagtctg atgcagacca gagctgggcc agctccccag tcgtgggtat
    92281 agaatcatag ttacaagcag gcatttcttg gggatgggga ggactggcac agggctgctg
    92341 tgatggggta tcttttcagg gaggagccaa acgctcattg tctgtgcttc tcctcctttt
    92401 tctgcggtcc ctggctcccc acctgactcc aggtgaacaa gaccttccag gaactggctg
    92461 tgttccatga tctggaaggc atgtgggagg aactcagccc caagatctgg accttcatgg
    92521 agaacagcca agaaatggac cttgtccggg tgagtgtccc tcccattatt accatgtgcc
    92581 tgcttgatac tggagaggtg agtttctggt cactttccca ggtgtgagtg aggtgagaat
    92641 tctttcagtt tatctagctg ggggaatgta gtgagcatag ctaaagtcac agggcaccac
    92701 ctctccagaa gtacaggcca tggtgcagag ataacgctgt gcatatcagc atccatgcca
    92761 ctcacggtca aatagcagtt ttctgcaaaa cttagtgagg gctggtgttt ggaagtggag
    92821 ttgagtaatt gcagtaccct attttccttt ttggctgcag cctctcagcc agccacagca
    92881 tctccctgtg tcttggtagg ttttggaaag aagtgtggga gcaaaagcat gatgttacat
    92941 gtagactggc ctgagatact cattctcagg gcactgtgtg aatgatgagc tgctgttact
    93001 gtgtggaggg gaaatgcact tagtgcttca gagccacttg aaagggataa gtgctctaga
    93061 gacaattggg ttcaaatgtg gagcaggctg agcaagaaca gaatgtctcc tttgcctgag
    93121 cctgagtgct gttaatcaca tcttcctgcc ttgggctgag ttagagaatc attagactat
    93181 ttcctgtttc catggtgagg gaggcctctt ccttttgtct ctgctcccct taagaagcag
    93241 gtgaggattt tgccaggttt cttgttttga accttattga ctttaagggc ggctgggttt
    93301 tagagactgt acctacctag ggggaacact tccgaagttt aggactattc cctgatccgc
    93361 tgggaggcag gttactgagg aagtcccttt aaaaacaaag gagtttatac tgagaaaagc
    93421 ataaacagtg atttgtatgg attcacactg actaatatag ctcatgccat taaagtgggg
    93481 tctcttctct aaaggagggt tatatgatct agccccgtag acctaagtgt ggtttcagac
    93541 ctgttcttcc tggtcctctc cttggaatcc atatttctac tagttggact ttttctgttt
    93601 gtctggctct cagaggatta taggaggccc tgtgaagtga ctcagtgaat tttgatttgt
    93661 gggcaagtag atggttccct agtctgaaat tgactttgcc ttaggtgctt caattcttca
    93721 taagctccca gttcttaaag gacaagatcc ttgtaaacat ggcaatggca ttcattagga
    93781 atctagctgg gaaaatccag tgtgtatgct tggaaatgag ggatctgggg ctggagagaa
    93841 aggcatgggc atgccttgga gggacttgtg tgtcaagctg aggaccttta ctttaagctc
    93901 taggggacca ggcaagggga gatgtagata cgttactctg atggggtgga tgaattgaag
    93961 aaggatgagg caagaatgaa ggcagagacc agggaggagg ctctccaagt ggccaaggca
    94021 taaagcaaga aatgaggcct ggtgactgct tagtggcaga gcagtgaaag agagggaggc
    94081 atcaaagtga gtctcgattt ctagctgggt gggtggtagc gatgtccagt aggccagtgg
    94141 ctactgaggt ctgcagtgga ggagggtggt tgggctggag acagatgatg agggagtcat
    94201 cagcctgtgg gtggaagaaa agggaacctc ttccaactgt tttctttgct tcttccctct
    94261 ctttctcttt tttttttttt ttggacagag tcttgctctg tcacccaggc tgaaatgcag
    94321 tggcatgatc ttggctcacc acagcctccg cctcctgggt tcaagcaatt ctcctgtctc
    94381 agcctccaga gtagctggga ttacaggcac atatcactgt gcccggctaa tttttgtatt
    94441 ttcagtggag atgggatttc accatgttgg tcgggctgga atgaactcct gacctcaagt
    94501 gatccacctg cctcagcctc ccaaagtgtt gggattacag gcatgagcca ccgcgcccgg
    94561 cctttcttcc ctctcttaaa gagtgtttat ttaattccac aaacatgagc ttgtcacccc
    94621 ctgtagcctg gcatctccta cacgaggtga tggctgaggc ttctgcttct gctggggtag
    94681 ctctgatctt tctgctttct ctggcactgt ctacccatgt tgcctcaccc cacaggtccc
    94741 agggcacctc tctcgggcaa gtcttggaac cctctgacac tgatttgctc tcttttctga
    94801 gctgctttta gccacccatc ctcgggacct gttttctctc tgcctccacc cctgcgggca
    94861 gtcttaggtc tcctgcccct cacgagcacc ccagagaggc cacgtgctca gtgatctcag
    94921 tgggcgcatc tttctagtct tgctattctt tttggccatg ttgttcagaa accatactgg
    94981 gcagggccga cttcacccta aaggctgcgt ctcttcactc tgcttttgtt tgttccaaat
    95041 aaagtggctt cagaattgct aaccctagcc tctgtgaact tgtgaggtac aattttgtgt
    95101 ctgttatgtt aacaaaaata catacatacc ttcctggtga tggtataaat tgctattctc
    95161 tattggaaag caatttggaa tgaaaattta aagaaccatt ttaaaatatg ctatcctgcg
    95221 tacctccatt ccacccaccc ccagggatgt agcctactga aataatttta aagaagtcac
    95281 catatgagag aaaatgttat tgctatattg ttattgtgag aaattggaaa tagactaaat
    95341 gttcagcact ataggaataa ttaatgaaat tacatatact ctatacaatc attatgctgc
    95401 cattgaaata ataaatacaa aggcgcaagg ggggaaaagc ttataatgtt agtgaaacta
    95461 agactgattt ttttataaag cagcagtttt cagacccttg gagactccaa ttcggtagaa
    95521 ccagagcttc atcttctctg tcgaagctgt gacaggagtt gcaaatgcct ctcctttttg
    95581 ctgagtttgc agctgctgtt tttccggcag cacatctgtg caggcctctg cctcggcccc
    95641 tctggatctg ctgattgagc agcggattga tctgtccttc tctttcgtgt tgacccatgt
    95701 gaggaaccaa ctggcaaggg aacaagaaat ggaaataggc ctcctttgca tcatgacctg
    95761 tacatcctgc aattggaaaa gattgtactt tagttggttt aaccagcagc attatttttc
    95821 taaactaagc agtaagaagg aattaggttt tatgtgggat caacagactg ggtctcaaaa
    95881 gaggaaggtg atagaacaca gtggggaggg ggaggtgcac tagaaacaga gggcctatgc
    95941 tttcattctg gctttgctac ttaatagctg tgtgacccaa tcttagagac ttaacctctc
    96001 tgaacttcca ttttctcatg tataaaatgg gaaatattaa aggatactca ctgggctggt
    96061 ggcttgtgcc tgtaatccca gcacttgggg aggttgaggt gggaggatca cttgagccca
    96121 ggtgttcaag accagcccag gcaacatggc aagactctgt ctctatgaaa aaattaaaaa
    96181 ttagccaggt gtggtggtgt gcacctgtag tcttagctac ttggtaggct gagatgggag
    96241 gatcacttgg gcttgggagg tcaaggctgc ggtgagctgt gattccatca ctgcactcca
    96301 gcccgggcgg cagagcgaga cactgaatcc aaacgacaac aacaacaaaa ggcaaaaaaa
    96361 taaaagtgcc ctctttatgg agttgtgtaa ggtgaagcat atacactatt caacatagta
    96421 actatataaa ggaagtattg ttgttgttac tgtagttaat accattaagt gagatgtttc
    96481 gtatagtgga aagcacatgg actctgaatt cagactggtc tgactttgag tctcagctcc
    96541 acatctagta atactatgac caagccctgg ttaaaatcat gttttttttt cttcagcctc
    96601 agtcttctca catataaaat agggacactg tcatttacct cagttttctg tgaggataaa
    96661 acaacgacag tgtatatgca agtattttgt aaattttgta gtgctcctca agatttagtt
    96721 ggtgtttact acttgtactt tctcactgga atggcagatg ctgttggaca gcagggacaa
    96781 tgaccacttt tgggaacagc agttggatgg cttagattgg acagcccaag acatcgtggc
    96841 gtttttggcc aagcacccag aggatgtcca gtccagtaat ggttctgtgt acacctggag
    96901 agaagctttc aacgagacta accaggcaat ccggaccata tctcgcttca tggaggtgaa
    96961 tctgttgctg ggatcattta gaaaagactt aacggcttct ttctctgaga cgttacaata
    97021 aggttcaggc aggaggcaag tttagaaata atgtatagtc tcatttacaa aactatccct
    97081 caagcctaac acaggatttg ataacaaaag gcacttaata aatgttagtt gagtggttga
    97141 atgagtaaat aaactctagc tttagtaaat taactctagc ttattctata taggctcaag
    97201 agaatatttc tacccatttt cttctaggtt ttcctatctc agtgactaat ggtagcaaag
    97261 cattccctta aaaaggcatt atttgtgaaa cttatctaaa atcgaattcg ggtccaatta
    97321 aatttttgaa attttatatt aaaaattata ttagtaggga tgggtaagag gtgttttggt
    97381 ctggttggtt ggttagttgc tatgactcag aattgctaag aaaacagaaa agtaagataa
    97441 gatcattgtt ttaacctctt ttcctccaca aaatcaataa ataacatatc cctaaattac
    97501 tcttagaatt tctcttaaat tgcagtgaaa aaccaaaatc cttcattctt ggttgaaggt
    97561 tggaaaacta cgttagagag gattagagag agaggatgag caatcgtgta gtcagccctt
    97621 gcctcctagt gtaggatttg tctcagccac tgcttgttgt cctggctgcc aacgttctca
    97681 tgaaggctgt tcttctatca gtgtgtcaac ctgaacaagc tagaacccat agcaacagaa
    97741 gtctggctca tcaacaagtc catggagctg ctggatgaga ggaagttctg ggctggtatt
    97801 gtgttcactg gaattactcc aggcagcatt gagctgcccc atcatgtcaa gtacaagatc
    97861 cgaatggaca ttgacaatgt ggagaggaca aataaaatca aggatgggta agtggaatcc
    97921 catcacacca gcctggtctt ggggaggtcc agagcaccta ttatattagg acaagaggta
    97981 ctttatttta actaaaaatt tggtagaaat ttcaacaaca acaaaaaaac tcaacttggt
    98041 gtcatgattt tggtgaaatt ggtacatgac ttgctggaag gtttttcata ggtcataaaa
    98101 taacagtatc ttttgattta gcatttctac tcaagggaat taattccagg aattttggtg
    98161 gcaggcacct gtaatcccag ctactcggga ggctgaggca ggagaattgc ttgaacccag
    98221 gaggcagagg ttgcagtgag ctaagatcgc atcattgcac tcccgcctgg gcaataagag
    98281 tgaaactcca tctcaaaaaa aaaaaaagat acaaaaatag aaaaaggggc ttggtaaggg
    98341 tagtagggtt ttgggcaatt tttttttttt ttttttttta ttgtatggtt ctaaaggaat
    98401 ggttgattac ctgtggtttg gttttaggta ctgggaccct ggtcctcgag ctgacccctt
    98461 tgaggacatg cggtacgtct gggggggctt cgcctacttg caggatgtgg tggagcaggc
    98521 aatcatcagg gtgctgacgg gcaccgagaa gaaaactggt gtctatatgc aacagatgcc
    98581 ctatccctgt tacgttgatg acatgtaagt tacctgcaag ccactgtttt taaccagttt
    98641 atactgtgcc agatgggggt gtatatatgt gtgtgcatgt gcatgcatgt gtgaatgatc
    98701 tggaaataag atgccagatg taagttgtca acagttgcag ccacatgaca gacatagata
    98761 tatgtgcaca cactagtaaa cctctttcct tctcatccat ggttgccact tttatctttt
    98821 tatttttatt tttttttttg agatggagtc tcgctctgac gcccaggctg gagtgcagtg
    98881 gctcgatctc ggctcactgc aacctttgcc tcccgggttc aagctattct cctgcctcag
    98941 cctccacagt agctgggact acaggctcat gctgccacgc ccggctgact ttttgtattt
    99001 tagtagagac gaggtttcac catgttaccc aggctagact tcaactcctg agctcaggca
    99061 atccaccctc cttggcctcc caaagtgctg ggattacagg tgtgagccac tgcacccagc
    99121 ccaccacttt aattttttac actctaccct tttggtcaaa atttgctcaa tctgcaagct
    99181 taaaatgtgt catgacaaac acatgcaagc acatactcac acatagatgc agaaacagcg
    99241 tctaaactta taaaagcaca gtttatgtaa atgtgtgcac ttcttctccc taggtggtaa
    99301 accacatttc aaaacaaccc aaataaaact gaacaaagct tcttcctctt agacttttta
    99361 gaaaatcttt cagtgctgag tcactaagct gccaagttct cattgtggga actatgcctt
    99421 tggatgtaat gatttcttct aagacaatgg gcggaggtgt agttattgca gacatctgaa
    99481 atatgtaatg tttcttccag attctggaaa ttctcttatt ctctgtggtt ggtggtggtg
    99541 gtgggatgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtaggga tcaggatgcg
    99601 ggaggagctg ggttctgctt gtattggttc tctgttttgc attgaatagt gtgtttcctt
    99661 gtatggctat ctatagcttt tcaaggtcac cagaaattat cctgtttttc accttctaaa
    99721 caattagctg gaatttttca aaggaagact tttacaaaga cccctaagct aaggtttact
    99781 ctagaaagga tgtcttaaga cagggcacag gagttcagag gcattaagag ctggtgcctg
    99841 ttgtcatgta gtgagtatgt gcctacatgg taaagctttg acgtgaacct caagttcagg
    99901 gtccaaaatc tgtgtgcctt tttactttgc acatctgcat tttctattct agcttggaat
    99961 ctgaaacatt gacaagagct gcctgaaatg tatgtctgtg gtgtgattag agttacgata
   100021 agcaagtcaa tagtgagatg accttggaga tgttgaactt ttgtgagaga atgagttgtt
   100081 tttttgtttt ggtttttagt actttaacat aatctacctt tagtttaagt atcgctcaca
   100141 gttacctagt tactgaagca agcccccaaa gaaatttggt ttggcaacac tttgttagcc
   100201 tcgtttttct ctctacattg cattgctcgt gaagcattgg atcatacgta catttcagag
   100261 tctagagggc ctgtccttct gtggcccaga tgtggtgctc cctctagcat gcaggctcag
   100321 aggccttggc ccatcaccct ggctcacgtg tgtctttctt tctccccttg tccttccttg
   100381 gggcctccag ctttctgcgg gtgatgagcc ggtcaatgcc cctcttcatg acgctggcct
   100441 ggatttactc agtggctgtg atcatcaagg gcatcgtgta tgagaaggag gcacggctga
   100501 aagagaccat gcggatcatg ggcctggaca acagcatcct ctggtttagc tggttcatta
   100561 gtagcctcat tcctcttctt gtgagcgctg gcctgctagt ggtcatcctg aaggtaaggc
   100621 agcctcactc gctcttccct gccaggaaac tccgaaatag ctcaacacgg gctaagggag
   100681 gagaagaaga aaaaaaatcc aagcctctgg tagagaaggg gtcatacctg tcatttcctg
   100741 caatttcatc catttatagt tggggaaagt gaggcccaga gaggggcagt gacttgccca
   100801 aggtcaaccc agccgggtag cagctaagta ggatgagagt gcagggttca tgctttccag
   100861 ataaccacat gctcaactgt gccatgctgt ctcattggta gtggttcatg gcagcatctg
   100921 aaagctattt attttcttag atatattggg tggcgattct tcctaagttt ctaagaacaa
   100981 taatcagaag gatatatatt gttgcaggtt agactgtctg gaagcagagg ctgaaataga
   101041 gtttgatgta tgggtattta tgagggctca atacctatgg aagagatatg gaagatgcag
   101101 gattgggcag agggaggagt tgaactgtga tatagggcca accccgtggg gcactctaga
   101161 gaatatgcag cttgttggag ttgttcttca tcgagctgaa acatccagcc ctttgtgctc
   101221 ccccaaggcc tccctcctga caccacctac ctcagccctc tcaatcaatc actggatgtg
   101281 ggctgccctg ggaaggtcgt gccccagggc ctacatggct ctctgctgct gtgacaaacc
   101341 cagagttgct gatgcctgag gccgtctact gacagctggg caacaaggct tccctgaatg
   101401 gggactctgg gcagtgcagt tttgtgtctg aaccatacat taatatattt atatccgaat
   101461 tttctttctc tgcaagcatt tcatataaag acacatcagg taaaaataaa tgtttttgaa
   101521 gcaaaaggag tacaaagaga taagaactaa ctaatttaat actagttacc atctgttaca
   101581 aatagttcct actgattgcc aaggactgtt taaacacatc acatgggctt cttcttctat
   101641 cctcactaac ccttttaaca gacaaggaaa tgaggctcag gaaggtcaag gactttattg
   101701 aggttccaca gtaggataca gttcttgcta aaagcaaccc ctccctcatg ctctgttatc
   101761 taactgcaag gggaaggtca gtggcagagg tagtggtccc atggttggtg cataagagct
   101821 gctctgagac aactgcatgc tggtgggtcc tgcagacatg tacccatcag ccggagatag
   101881 gctcaaaata tccacaagag tttggatgat tgtgggaatg cagaatccat ggtgatcaag
   101941 agggaaagtc aagttgcctg gccattttcc ttggctttta gacagaaaag ttacgtggga
   102001 tattatctcc cacagctctt ctgtggtgcc accagtcata gtccttatat aaggagaaac
   102061 cagttgaaat tacctattga agaaacaaag agcaaactcg cccactgaaa tgcgtagaaa
   102121 gccctggact ctgttgtatt cataactctg ccattatttt tctgcgtagt tttgggtaag
   102181 tcacttatct tctttaggat ggtaatgatc agttgcctca tcagaaagat gaacagcatt
   102241 acgcctctgc attgtctcta acatgagtag gaataaaccc tgtctttttt ctgtagatca
   102301 tacaagtgag tgcttgggat tgttgaggca gcacatttga tgtgtctctt ccttcccagt
   102361 taggaaacct gctgccctac agtgatccca gcgtggtgtt tgtcttcctg tccgtgtttg
   102421 ctgtggtgac aatcctgcag tgcttcctga ttagcacact cttctccaga gccaacctgg
   102481 cagcagcctg tgggggcatc atctacttca cgctgtacct gccctacgtc ctgtgtgtgg
   102541 catggcagga ctacgtgggc ttcacactca agatcttcgc tgtgagtacc tctggccttt
   102601 cttcagtggc tgtaggcatt tgaccttcct ttggagtccc tgaataaaag cagcaagttg
   102661 agaacagaag atgattgtct tttccaatgg gacatgaacc ttagctctag attctaagct
   102721 ctttaagggt aagggcaagc attgtgtttt attaaattgt ttacctttag tcttctcagt
   102781 gaatcctggt tgaattgaat tgaatggaat ttttccgaga gccagactgc atcttgaact
   102841 gggctgggga taaatggcat tgaggaatgg cttcaggcaa cagatgccat ctctgccctt
   102901 tatctcccag ctctgttggc tatgttaagc tcatgacaaa gccaaggcca caaatagaac
   102961 tgaaaactct tgatgtcaga gatgacctct cttgtcttcc ttgtgtccag tatggtgttt
   103021 tgcttgagta atgttttctg aactaagcac aactgaggag caggtgcctc atcccacaaa
   103081 ttcctgactt ggacacttcc ttccctcgta cagagcaggg ggatatcttg gagagtgtgt
   103141 gagcccctac aagtgcaagt tgtcagatgt ccccaggtca cttatcagga aagctaagag
   103201 tgactcatag gatgctcctg ttgcctcagt ctgggcttca taggcatcag cagccccaaa
   103261 caggcacctc tgatcctgag ccatccttgg ctgagcaggg agcctcagaa gactgtgggt
   103321 atgcgcatgt gtgtggggga acaggattgc tgagccttgg ggcatctttg gaaacataaa
   103381 gttttaaaag ttttatgctt cactgtatat gcatttctga aatgtttgta tataatgagt
   103441 ggttacaaat ggaatcattt tatatgttac ttggtagccc accactcccc taaagggact
   103501 ctataggtaa atactacttc tgcaccttat gattgatcca ttttgcaaat tcaaatttct
   103561 ccaggtataa tttacactag aagagataga aaaatgagac tgaccaggaa atggataggt
   103621 gactttgcct gtttctcaca garcctgctg tctcctgtgg cttttgggtt tggctgtgag
   103681 tactttgccc tttttgagga gcagggcatt ggagtgcagt gggacaacct gtttgagagt
   103741 cctgtggagg aagatggctt caatctcacc acttcggtct ccatgatgct gtttgacacc
   103801 ttcctctatg gggtgatgac ctggtacatt gaggctgtct ttccaggtac actgctttgg
   103861 gcatctgttt ggaaaatatg acttctagct gatgtccttt ctttgtgcta gaatctctgc
   103921 agtgcatggg cttccctggg aagtggtttg ggctatagat ctatagtaaa cagatagtcc
   103981 aaggacaggc agctgatgct gaaagtacaa ttgtcactac ttgtacagca cttgtttctt
   104041 gaaaactgtg tgccaggcag catgcaaaat gttttataca cattgcttca tttaattctc
   104101 acaaggctac tctgaagtag ttactataat aaccagcaat tttcaaatga gagaactgtg
   104161 actcaaagac gttaagtaac cagctttggt cacacaactg ttaaatgttg gtacgtggag
   104221 gtgaatccac ttcggttaca ctgggtcaat aagcccaggc gaatcctccc aatgctcacc
   104281 caattctgta tttctgtgtc ctcagagggg gtacaactag gagaggttct gtttcctgag
   104341 tacaggttgt taataattaa atatactagc tctaaggcct gcctgtgatt taattagcat
   104401 tcaataaaaa ttcatgttga atttttcttt agtacttctt tcttaatata atacatcttc
   104461 ttgaccaagt ccaagaggaa cctgcgttgg acagttttca tatgagatca aattctgaga
   104521 gagcaagatt taaccctttt tggttcacct tctgatcctc ccctaaggag gtatacatga
   104581 aatatttatt actcctgcct gaacttcttt cattgaatat gcaattttgc agcatgcaga
   104641 ttctggattt aaattctgag tcttaactta ctggctgagg gaccttggat aggctcctta
   104701 tccctcagtt tcctcatctc taaaatgggg atggcacctg ccccgtgggt tgttggaagg
   104761 acttacagag gtgcagaatg tacgttgtac atagcaggtt tcagcaaatg ttagctccct
   104821 ctttccccac atccattcaa atctgttcct tctccaaagg atgtgtcaag gaggaaatgg
   104881 acctggctgg gaaaccctca gaatactggg atgatgctga gcttggctca tacctgtgct
   104941 ttgctttcag gccagtacgg aattcccagg ccctggtatt ttccttgcac caagtcctac
   105001 tggtttggcg aggaaagtga tgagaagagc caccctggtt ccaaccagaa gagaatatca
   105061 gaaagtaagt gctgttgacc tcctgctctt tctttaacct agtgctgctg cctctgctaa
   105121 ctgttggggg caagcgatgt ctcctgcctt tctaaaagac tgtgaaacca ctccaggggc
   105181 agagaaatca catgcagtgt ccctttccaa atcctcccat gccatttatg tccaatgctg
   105241 ttgacctatt gggagttcac ggtctcgatc cctgagggac attttctttg ttgtcttggc
   105301 ttctagaaga gtatctttta cttgccccct cccaaacaca catttcatgg tctcctaaca
   105361 agctagaaga aagaggtaaa gacaagcgtg attgtggaac catagcctcg ctgcctgcct
   105421 gtgacatggt gacctgtgta tcagcctgtg tgggctgaga ccaagtggct accacagagc
   105481 tcagcctatg cttcataatg taatcattac ccagatccct aatcctctct tggctcttaa
   105541 ctgcagacag agatgtccac agctcatcaa aggctctgcc ttctgggttc tttgtgctta
   105601 gagtggcttc ctaaatattt aataggtccc ttttctgcca gtctcttctg tgcccatccc
   105661 ctgattgccc ttggtaaaag tatgatgccc cttagtgtag cacgcttgcc tgctgttcct
   105721 aatcatcttc tcctacctcc tctttacacc tagctcctgt ttcagtcacc tagaaatgct
   105781 cacagtcgct ggaatatgtc atgttcttcc acacctccat gcctttgtag gtactgtttg
   105841 ctctcacagg agaactttct ctctaacttg cctatcttct caactcctcc tttctctcca
   105901 agatctagtt ccggatcccc tcccctgagc atccctcctt ggttctcagg tagtcagtca
   105961 ctctctgccc tgaacttcca tggcacgtga aagaaaatct ttttatttta aaacaattac
   106021 agactcacaa gaagtaatac aaattacatg agggggttcc cttaaacctt tcatccagtt
   106081 tccccaatgg tagcagcatg tgtaactgta gaatagtatc aaaaccatga aattgacata
   106141 ggtacaattc acaaaccttc ttcagatttc actagcttta tgtgcgctca tttgtgtgtg
   106201 tgtgtgcgta tttagttcta tgcaatttta tcatgtgtga attcatgtaa ttactagctc
   106261 agtcaagctg cagaaatatc tcattgtcac aaagctcctt catgctaccc cttaatggcc
   106321 acagccacct cccttcttcc tcagttcctg acacctgtca accactaatg cgttcctcgt
   106381 ttttacagtt ttattatttc tagaatgtta cataaatgga accatacagt aggtatcctt
   106441 ttgatactgg cttttttttt ttttcactca gcagtattcc cttagatcta tccaagttgt
   106501 gtgtgtcaac agttcattcc tcttcactgc tgagtagtgt tccctgggag gggtgtatca
   106561 cagttccatg gcatttttag atgtattttt taaacagctt tcagcatcct ctattttaat
   106621 tgttcatcaa gtcctttttc ccaatagact ctgaatgctc ctttatcatc gtattcccat
   106681 caccaacatc agtacccaaa taggccctaa ataaacattt atagcctcct gcctgcctga
   106741 gaaaccaggg tggacatgga gagaaggcac ttctgaaagt tcaagcgcag tgccctgtgt
   106801 ccttacactc cactcctcag tgctttctgt gggttcattt ctgtcttctc tcctgtcaca
   106861 gtctgcatgg aggaggaacc cacccacttg aagctgggcg tgtccattca gaacctggta
   106921 aaagtctacc gagatgggat gaaggtggct gtcgatggcc tggcactgaa tttttatgag
   106981 ggccagatca cctccttcct gggccacaat ggagcgggga agacgaccac catgtaagaa
   107041 gagggtgtgg ttcccgcaga atcagccaca ggagggttct gcagtagagt tagaaattta
   107101 taccttagga aaccatgctg atccctgggc caagggaagg agcacatgag gagttgccga
   107161 atgtgaacat gttatctaat catgagtgtc tttccacgtg ctagtttgct agatgttatt
   107221 tcttcagcct aaaacaagct ggggcctcag atgacctttc ccatgtagtt cacagaattc
   107281 tgcagtggtc ttggaacctg cagccacgaa aagatagatt acatatgttg gagggagttg
   107341 gtaattccca ggaactctgt ctctaagcag atgtgagaag cacctgtgag acgcaatcaa
   107401 gctgggcagc tggcttgatt gccttccctg cgacctcaag gaccttacag tgggtagtat
   107461 caggaggggt caggggctgt aaagcaccag cgttagcctc agtggcttcc agcacgattc
   107521 ctcaaccatt ctaaccattc caaagggtat atctttgggg ggtgacattc ttttcctgtt
   107581 ttctttttaa tcttttttta aaacatagaa ttaatatatt atgagctttt cagaagattt
   107641 ttaaaaggca gtcagaaatc ctactaccta acacaaaaat tgtttttatc tttgaataat
   107701 atgttcttgt ttgtccattt tccatgcatg cgatgttagg catacaaaat acatttttta
   107761 aagaatactt tcattgcaaa ttggaaactt cgtttaaaaa atgctcatac taaaattggc
   107821 atttctaacc cataggccca cttgtagtta tttaccgaag caaaaggaca gctttgcttt
   107881 gtgtgggtct ggtagggttc attagaaagg aatgggggcg gtgggagggt tggtgttctg
   107941 ttctctctgc agactgaatg gagcatctag agttaagggt aggtcaaccc tgacttctgt
   108001 acttctaaat ttttgtcctc aggtcaatcc tgaccgggtt gttccccccg acctcgggca
   108061 ccgcctacat cctgggaaaa gacattcgct ctgagatgag caccatccgg cagaacctgg
   108121 gggtctgtcc ccagcataac gtgctgtttg acatgtgagt accagcagca cgttaagaat
   108181 aggccttttc tggatgtgtg tgtgtcatgc catcatggga ggagtgggac ttaagcattt
   108241 tactttgctg tgtttttgtt ttttcttttt ttctttttta tttttttgag atggagtctc
   108301 gctctgtagc caggctggac tgtagtggcg cgatctcggc tcactgcaac cttggcctcc
   108361 caggttcaag cgattctcct gcctcagcct cccgagtagc tgggactcta ggcacacacc
   108421 accatgccca gctaattttt gtgtttttag tagagacggg gtttcaccat gttggccagg
   108481 atggtctcaa tgtcttgacc tcgtgatccg cccacctcgg tctcccaaag tgctgggaac
   108541 acaggcatga gccactgtgt ctggccacat tttactttct ttgaatatgg caggctcacc
   108601 tccgtgaaca ccttgagacc tagttgttct ttgattttag gagaagtggg aggtgaatgg
   108661 ttgagctgta gaggtgacat cagcccagcc agtggatggg ggcttgggaa acattgcttc
   108721 ccattattgt catgctggag ggccctttag cccatcctct cccccgccac cctccttatt
   108781 gaggcctgga gcagacttcc cagacctggt agtgcttcag ggccctggta tgatggacct
   108841 atatttgctg cttaagacat ttgctcccac tcaggttgtc ccatcagcca taaggccccc
   108901 agggagcccg tgtgatggag cagagagaga cctgagctct gcaatcttgg gcaaggcttt
   108961 tcccttatgt ttcttcttat ctaaagtgaa cagctggggc tcatgtgctc cctcctcatc
   109021 taaagtgaac acatggggct catgtgcagg gtcctccccg ctttcagagc ctgaggtccc
   109081 ctgaggctca ggaaggctgc tccaggtgag tgccgagctg acttcttggt ggacgtgctg
   109141 tggggacagc ccattaaaga ccacatcttg gggccctgaa attgaaagtt gtaactgcct
   109201 ggtgcatggt ggccaggcct gctggaaaca ggttggaagc gatctgtcac ctttcacttt
   109261 gatttcctga gcagctcatg tggttgctca ctgttgttct accttgaatc ttgaagatta
   109321 tttttcagaa attgataaag ttattttaaa aagcacgggg agagaaaaat atgcccattc
   109381 tcatctgttc tgggccaggg gacactgtat tctggggtat ccagtagggc ccagagctga
   109441 cctgcctccc tgtccccagg ctgactgtcg aagaacacat ctggttctat gcccgcttga
   109501 aagggctctc tgagaagcac gtgaaggcgg agatggagca gatggccctg gatgttggtt
   109561 tgccatcaag caagctgaaa agcaaaacaa gccagctgtc aggtgcggcc cagagctacc
   109621 ttccctatcc ctctcccctc ctcctccggc tacacacatg cggaggaaaa tcagcactgc
   109681 cccagggtcc caggctgggt gcggttggta acagaaactt gtccctggct gtgcccctag
   109741 gtcctctgcc ttcactcact gtctggggct ggtcctggag tttgtcttgc tctgtttttt
   109801 tgtaggtgga atgcagagaa agctatctgt ggccttggcc tttgtcgggg gatctaaggt
   109861 tgtcattctg gatgaaccca cagctggtgt ggacccttac tcccgcaggg gaatatggga
   109921 gctgctgctg aaataccgac aaggtgcctg atgtgtattt attctgagta aatggactga
   109981 gagagagcgg ggggcttttg agaagtgtgg ctgtatctca tggctaggct tctgtgaagc
   110041 catgggatac tcttctgtta tcacagaaga gataaagggc attgagactg agattcctga
   110101 gaggagatgc tgtgtcttta ttcatctttt tgtccccaac atggtgcact aaatttatgg
   110161 ttagttgaaa gggtggatgc ttaaatgaat ggaagcggag aggggcagga agacgattgg
   110221 gctctctggt tagagatctg atgtggtaca gtatgaggag cacaggcagg cttggagcca
   110281 actctggctg gccctgagac attgggaaag tcacaacttg cctcaccttc tttgccgata
   110341 ataatagtgg tgcttacctc atagaggatt aaattaaatg agaatgcaca caaaccacct
   110401 agcacaatgc ctggcatata gcaagttccc aaataaaatg ctactgttct tacctctgtg
   110461 aggatgtggt acctatatat acaaagcttt gccattctag gggtcatagc catacagggt
   110521 gaaaggtggc ttccaggtct cttccagtgc ttacccctgc taatatctct ctagtccctg
   110581 tcactgtgac aaatcagaac tgagaggcct cacctgtccc acatccttgt gtttgtgcct
   110641 ggcaggccgc accattattc tctctacaca ccacatggat gaagcggacg tcctggggga
   110701 caggattgcc atcatctccc atgggaagct gtgctgtgtg ggctcctccc tgtttctgaa
   110761 gaaccagctg ggaacaggct actacctgac cttggtcaag aaagatgtgg aatcctccct
   110821 cagttcctgc agaaacagta gtagcactgt gtcatacctg aaaaaggtga gctgcagtct
   110881 tggtgctggg ctggtgttgg gtctgggcag ccaggacttg ctggctgtga atgatttctc
   110941 catctccacc ccttttgcca tgttgaaacc accatctccc tgctctgttg cccctttgaa
   111001 atcatatcat acttaaggca tggaaagcta aggggccctc tgctcccatt gtgctagttc
   111061 tgttgaatcc cgttttcctt ttcctatgag gcacagagag tgatggagaa ggtccttaga
   111121 ggacattatt atgtcaaaga aaagagactt gtcaagaggt aagagccttg gctacaaatg
   111181 acctggtgtt cctgctcatt acttttcaat ctcattgacc ttaactttta aactataaaa
   111241 cagccaatat ttattaggca ctgatttcat gccagagaca ctctgggcat gaaagaaagt
   111301 aatgataata gttaatttta tatagcgttg ttaccattta caaccttttt ttttttttaa
   111361 cctctatcat ctcaattaaa gtgcagagag accctgggaa gaaggtaact atatttatta
   111421 tcccagatga gggaagtgag gcttgtaggg aattggtagc tgattcaagg tcacccagca
   111481 ggtaaataac agtggtggga ccagacccaa ttaccaggta tgttttcctc tgtaccgcag
   111541 tacatgcctg agatttattt gtgtgttgaa gccagtggta cctaatgtat ttacatccca
   111601 acctgaaact cctatccact tatttacctt ttaatgagcc tcttaactca agtgcagtct
   111661 gaggaccagc agcatcagga tcacttggga acttgttaga aattcagcaa cctgggccca
   111721 gctcagacct accgaatcag aatctgtgca ttttaacaag gttcttgagt ggttgaacac
   111781 acattaaagc atgagaagca ttgaactaga catgtagcca ggtaaaggcc ttgcctgaga
   111841 tggttggcaa aggcctcatt gcagcattca ttggcaggcc acagttcttt tggcagctct
   111901 gcttcctgac ctttcaccct caggaagcga ggctgttcac acggcacaca catgccagac
   111961 agggtcctct gaagccacgg ctgccagtgc atgtgtccca gggaaagctt tttcctttag
   112021 ttctcacaca acagagcttc ttggaagccc tccccggcga aggtgctggt ggctctgcct
   112081 tgctccgtcc ctgacccgtt ctcacctcct tctttgccat caggaggaca gtgtttctca
   112141 gagcagttct gatgctggcc tgggcagcga ccatgagagt gacacgctga ccatcggtaa
   112201 ggactctggg gtttcttatt caggtggtgc ctgagcttcc cccagctggg cagagtggag
   112261 gcagaggagg agaggtgcag aggctggtgg cgctgactca aggtttgctg ctgggctggg
   112321 gctgggtggc tgcgggtgtg ggagcagctt ggtggcgggt tggcctaatg cttgctgggg
   112381 tgcctggggc tcggtttggg agctagcagg gcagtgtccc agagagctga gatgattggg
   112441 gtttggggaa tcccttaggg gagtggacac tgaataccag ggatgaggag ctgagggcca
   112501 agccaggagg gtgggatttg agcttagtac ataagaagag tgagagccca ggagatgagg
   112561 aacagccttc cagatttttc ttgggtagcg tgtgtaggag gccagtgtca ccagtagcat
   112621 atgtggaaca gaagtcttga cccttgctat ctctgcctag tcctaatggc tggcttttcc
   112681 caggaaggct tctgcttcca tggactgtta gattaaccct ttatttaggt aaatgaggga
   112741 acctacttta taagcatagg aaagggtgaa gaatctttta agattccttt actcaagttt
   112801 tcttttgaag aatcccagag cttaggcaat agacaccaga ctttgagcct cagttatcca
   112861 ttcacccatc cacccaccca cccacccatc cttccatcct cccatcctcc cattcaccca
   112921 tccacccatc cagctgtcca cccattctac actgagtacc tataatgtgc ctggctttgg
   112981 tgatacaaag gtgaataaga catagtcctt tcctttgccc ccaaccctca gaccagagat
   113041 gaacatgtgg aatgacctaa acacctggaa caggtgtggt gtatgagcgg caggcctctg
   113101 atgagagggt gggggatggc cagccctcac tccgaagccc ctctgagttg attgagccat
   113161 ctttgcattc tggtccctgc agatgtctct gctatctcca acctcatcag gaagcatgtg
   113221 tctgaagccc ggctggtgga agacataggg catgagctga cctatgtgct gccatatgaa
   113281 gctgctaagg agggagcctt tgtggaactc tttcatgaga ttgatgaccg gctctcagac
   113341 ctgggcattt ctagttatgg catctcagag acgaccctgg aagaagtaag ttaagtggct
   113401 gactgtcgga atatatagca aggccaaatg tcctaaggcc agaccagtag cctgcattgg
   113461 gagcaggatt atcatggagt tagtcattga gtttttaggt catcgacatc tgattaatgt
   113521 tggccccagt gagccattta agatggtagt gggagatagc aggaaagaag tgttttcctc
   113581 tgtaccacag tacatgcctg agatttgtgt gttgaaacca gtggtaccta acacatttac
   113641 atcccaacct taaactccta tgcacttatt taccctttaa tgagcctctt tacttaagta
   113701 cagtgtgagg aacagcggca tcaggatcac ttgggaactt gttagaaatt cagcaacttg
   113761 ggcccagctc agacctactg aatcagaatc aggagcaatt ctctggtgtg actgtgtcac
   113821 agccaggtat caactggatt ctcatacata ggaaatgaca aacgtttatg gatggatagt
   113881 ctacttgtgc caggtgctga gatttgtttt ttgttttttg attttttttt aatcactgtg
   113941 acctcattta attctcaaaa aaagatgaaa aaatgaacac tcaggaatgc tgacatgaga
   114001 ttcagaatca ggggtttggg gcttcaaagt ccatcctctc tttatccatg taatgcctcc
   114061 ccttagagat acaacatcac agaccttgaa ggctgaaggg gatataaaag ctgtctggcc
   114121 aagtggtctc caagcttgac agtgcagcag aatcacctgg ggatattatt aaaaataaac
   114181 atactaaggt ttggcttcag ggcctgtgaa tcagaatttc tggaggtgag gccttgaagt
   114241 ctgtatttct attgcatact ttggacacag tggtctatag actagagttt ggaaatgatt
   114301 gcgctcattc agattctctt ctgatgtttg aattgctgcc atcatatttc tagtgctcta
   114361 tttcctcctg ctcattctgt cttggataac ttatcatagt actagcctac tcaaagattt
   114421 agagccacag tcctgaaaga agccacttga ctcattccct gtaggttcag aataaatttc
   114481 ttctgcgcag tgtctgtcat agcttttttt aaattttttt ttatttttga tgagactgga
   114541 gttttgctct tattgcccaa gctggagtgc agtggtgcga ttttggctca ctgcaacctc
   114601 cacctcccag gttcaagcga ttctcctgcc tcagcctccc aagtagctga gattacaagc
   114661 atgtgctacc acgcccagct aattttgtat ttttagtaga gatgggtttt atccatgttg
   114721 gtcaggctgg tctcgagctc cagacctcag gtgatctgcc cgcctcggcc tcccaaagtg
   114781 ctgggattat aggcctgagc cacagcgctc agccataact ttaatttgaa aatgattgtc
   114841 tagcttgata gctctcacca ctgaggaaat gttctctggc aaaaacggct tctctcccag
   114901 gtaactctga gaaagtgtta ttaagaaatg tggcttctac tttctctgtc ttacggggct
   114961 aacatgccac tcagtaatat aataatcgtg gcagtggtga ctactctcgt aatgttggtg
   115021 cttataatgt tctcatctct ctcattttcc agatattcct caaggtggcc gaagagagtg
   115081 gggtggatgc tgagacctca ggtaactgcc ttgagggaga atggcacact taagatagtg
   115141 ccttctgctg gctttctcag tgcacgagta ttgttccttt ccctttgaat tgttctattg
   115201 cattctcatt tgtagagtgt aggtttgttg cagatgggga aggtttgttt tgttgtaaat
   115261 aaaataaagt atgggattct ttccttgtgc cttcagatgg taccttgcca gcaagacgaa
   115321 acaggcgggc cttcggggac aagcagagct gtcttcgccc gttcactgaa gatgatgctg
   115381 ctgatccaaa tgattctgac atagacccag gtctgttagg gcaagatcaa acagtgtcct
   115441 actgtttgaa tgtgaaattc tctctcatgc tctcacctgt tttctttgga tggcctttag
   115501 ccaaggtgat agatccctac agagtccaaa gagaagtgag gaaatggtaa aagccacttg
   115561 ttctttgcag catcgtgcat gtgatcaaac ctgaaagagc ctatccatat cacttccttt
   115621 aaagacataa agatggtgcc tcaatcctct gaacccatgt atttattatc ttttctgcgg
   115681 ggtcctagtt tcttgtatac attaggtgtt taattgttga acaaatattc attcgagtag
   115741 atgagtgatt ttgaaagagt cagaaagggg aatttgctgt tagagttaat tgtaccctaa
   115801 gacttagata tttgaggctg ggcatggtgg ctcatgccag taatcccagc gctttgagag
   115861 gctgaggtgg gtagatcacc tgaggtcagg agtttgagac cagtctgacc aacaaggtga
   115921 aaccccgtct ctactaaata caaaaaatta gccgagtgtg gtggcacatg cctgtcatcc
   115981 cagctacttg ggaggctgag gcaggagaat cgcttgaacc caggaggcag aggttgcagt
   116041 cagccacggt tgcgccattg cactccagac tgggcaacaa gagtgaaaac tccatctcaa
   116101 aaaagaaaaa aaaagaatta gatattttgg atgagtgtgt ctttgtgtgt ttaactgaga
   116161 tggagaggag agctaagaca tcaaacaaat attgttaaga tgtaaaagca catcagttag
   116221 gtatcattag tttaggacaa ggatttctag aaaattttta ggaacagaaa actttccagt
   116281 tctctcaccc ctgctcaaag agtgtatggc tcttacatta tatataactg cctgacttca
   116341 tacagtatca gtacttagat catttgaaat gtgtccacgt tttaccaaaa tataataggg
   116401 tgagaagctg agatgctaat tgccattgtg tattctcaaa tatgtcaagc tacgtacatg
   116461 gcctgtttca tagagtagtc tataagaaat tgatgacttg attcatccga atggctggct
   116521 gtaacacctg gttacgcatg aacacctctt ttcagttgtc tcaagacacc tttcttttct
   116581 gtacttatca gacaaggact gaaaggcaga gactgctact gttagacatt ttgagtcaag
   116641 cttttccttg gacatagctt tgtcatgaaa gccctttact tctgagaaac ttctagcttc
   116701 agacacatgc cttcaagata gttgttgaag acaccagaag aaggagcatg gcaatgccga
   116761 aaacacctaa gataataggt gaccttcagt gttggcttct tgcagaatcc agagagacag
   116821 acttgctcag tgggatggat ggcaaagggt cctaccaggt gaaaggctgg aaacttacac
   116881 agcaacagtt tgtggccctt ttgtggaaga gactgctaat tgccagacgg agtcggaaag
   116941 gattttttgc tcaggtgaga cgtgctgttt tcgccagaga ctctggcttc atgggtgggc
   117001 tgcaggctct gtgaccagtg aaggcaggat agcatcctgg tcaagatatg gatgccggag
   117061 ccagatttat ctgtatttca atcccagttc tattccttgc cagttgtgta tccgctggca
   117121 agttacttct ctatgcctca atctcctcat ctgtaaaatg gggataataa tattacctgc
   117181 aatacagggt tgttacgaaa ataaaaatga ataggtgctt agaatggggc ctgacattag
   117241 taagtgctta gttttgtgtg tgtatatgtt atttttattt tggaggagaa cataaaaagg
   117301 acaaagtgta gaaaaactgg ttgggtgtat tcagctgtca taacatgaga gttgttatgc
   117361 ccagatgcac ttgacatgtg aatttattag aaacatgatt tttctctgag ttgatgttta
   117421 actcaaactg atagaaaaga taggtcagaa tatagttggc caacagagaa gacttgttag
   117481 actattgtct gcatgtcagt gtttgcatgc taacttgctt agttagaaag gttaaatttt
   117541 ttcactctat aaaatcaaga aatatagaga aaaggtctgc agagagtctt tcatttgatg
   117601 atgtggatat tgttaagagc gggagtttgg agcatacaga gctcaagttg aatcctgact
   117661 ttgctactta ttggctatat gaccttgggc aagctgctta gtctctctga tcctcagtta
   117721 cctttgtttg ttgatgatga ccattgataa cacaaccata aataatgaca acatagagat
   117781 agttctcatt atagtagttg ttatacagaa ttattcactc aatgttaatt ttctgcattg
   117841 aaatcccaga acattagaat tgggggcatt atttgaatct ttaaggttat aaggaataca
   117901 tttctcagca ataaatggaa ggagttttgg gttaacttat aaagtatacc caagtcattt
   117961 ttttttcaga gaagatatgg tagaaagtct taggaggttg aagaaggaat tggatattta
   118021 ttctttctga gactatcatg ggagataatg actatggttg tccatgattg gagccgttgc
   118081 tgtagagttg gttttattat agtgtaggat ttgaatgggc catgtgttct cagacctcag
   118141 attaaaatga gaaaactgag gccagtgggg agcgtgactt cacatgggta cacttgtgct
   118201 agagacagaa ccaggattca ggacttctgg ctcctggtcc tgggttcatg gcccaatgta
   118261 gtctttctca gtcttcagga ggaggaaggg caggacccag tgttctgagt caccctgaat
   118321 gtgagcacta tttacttcgt gaacttcttg gcttagtgcc tctgccaggt ggccataacc
   118381 tctggccttg tgttgccaga gaaaaggttt agttttcagg ctccattgct tcccagctgc
   118441 caagaatgcc ttggtgcagc acagtcatag gccctgcatt cctcattgcc gtgctggttg
   118501 gtcggggagg tgggctggac tcgtagggat ttgccccttg gccttgtttc taacacttgc
   118561 cgtttcctgc tgtccccctg ccccctccac tgcctgggta aagattgtct tgccagctgt
   118621 gtttgtctgc attgcccttg tgttcagcct gatcgtgcca ccctttggca agtaccccag
   118681 cctggaactt cagccctgga tgtacaacga acagtacaca tttgtcaggt atgtttgtct
   118741 tctacatccc aggagggggt aagattcgag cagaccaaag atgtttacga gggccaaggg
   118801 aatggacttc agaattacac ggtggaatga attttactgc tgcggctcag gtccctgtat
   118861 aagctaatac tgcatgcata gaacagcagc gaactaaccc tgaataatag gccagtcttc
   118921 tgttgagcct ttcagcctct ctcctcttca tcctactgtt gtcaggaaca gccacatgtg
   118981 ttttaggtga aataatccac ccttgcaaaa atccatgatt aagttataaa atatttggat
   119041 ttgtggagct gtgttttaat tctgtaactg agtcacaggg cacactgtca aagcatagaa
   119101 cctccagaga cttgttttct gcaaagtata attcatgtaa ttattatcta ttctgttata
   119161 tttgggatgt taggtagtgt ttgttcttta gataaaaata tcccccactc tgtaacaata
   119221 cattaaatca aagaaaagga caaaggattt ttctgggtct tgttagcagg agctttcttc
   119281 agtcctgaaa gatttgtaga cctgtagatg ggggaactgt gtcagtgata caaaagggaa
   119341 gcatttaaaa aaaaaaaagt atatatatat atatatatat atatatgtaa tgtgaattgg
   119401 cctctttttc tctaagccca cattttcttc ttacatagtt caggtttact ttattttttc
   119461 ctttccggct gctgaccctg tattgcccgt agttgtggaa catagcatgt gtttgtgacc
   119521 tgtgcctgtt atttttgtgc tttctagttg tgcatgcaaa gagtacaaag ttttcttgcc
   119581 ctttcttgga aaatcctgct tgtctgtgcc aaagggataa ttgtgaaagc acttttgaaa
   119641 tacttaatga gttgattttc ttcaaattaa aaaaaatata taaatgtata tgtgtatgta
   119701 catgtgtgta cacatacaca cctttataca tacagcccat ttaaaacaag ctccactttg
   119761 gagtgctcta cgtcaccctg atgccgaata cagggccaga gtctgagatc cttctgggtg
   119821 gtttctgtgt tttgttcatt tctgttttaa gagcctgtca cagagaaatg cttcctaaaa
   119881 tgtttaattt ataaaaacat ttttatctct cgattactgg ttttaatgaa ttactaagct
   119941 ggctgcctct catgtaccca cagcaatgat gctcctgagg acacgggaac cctggaactc
   120001 ttaaacgccc tcaccaaaga ccctggcttc gggacccgct gtatggaagg aaacccaatc
   120061 ccgtgagtgc cactttagcc ataagcaggc ttcttgtgct tgttgcctgg tttgatttct
   120121 aatatgctgc atttatcaac tgcatgccac attgtgaccg ccagcatttg ccctttgaat
   120181 tattattatg ttttatttac aaaaagcgaa ggtagtaacc gaactaaatt atctaggaac
   120241 aaacgtttgg agagtcttct aacaccgtgc aaagcacgtc attacagaca tttgtttact
   120301 gatttagaac cttaatattt aatttaaata gcactttaca cttactgatg aaatgctttt
   120361 cctttctttc tctcccagcc cctgtactta agtgcttcaa taggctctca ttatatatga
   120421 tttttaggtt ttgtttatca gcttcttcgc ttttataatc tgaaaagatg gcatatgaat
   120481 ttttataaaa agggacactt tcttcttctc aaattgtata tttttattgt actttccttc
   120541 aaaaccccct tttaaaaagt aagcagtaga taaataaatt cagtgaagca tccatatgac
   120601 tcttaagtga gtgtagggga agggaggtca ccagatcact gtgagtgaag atggtggaga
   120661 ggtgaggatc ttatgaggcc gtgctcaagg ctggtagagg tgggttagtg tttccaggtt
   120721 taggcagaat ctcagctgag gtcatgaaac aacagtgatc tctgaaaaat tatggcaagg
   120781 tgggaaggtg ctggagaatt ggagaggggg caaacttgac tttcaagttt caatgggaag
   120841 ataggtgact ctgcacacca cagaacagtg agcatgataa cctgtttata caaggttcta
   120901 gagcagattt ctaaatggat agctactgtg tgcttgtttg ttcttaatta gtattggata
   120961 gttactaaat acttgttagt acttagtaca taatgggtgg taaatcctag cagctaatat
   121021 tggttcccaa ataaccagat gacaaggata gagaaggaca cagacacggc ctatctggat
   121081 ttcatggtgc ctttgatttt ccacatgaag gttgtgtagg gaagatagaa gcatgagatg
   121141 agatgataat atagttatct ggattcatca ctggccagct gaaccatatg aactcatgga
   121201 ttgatgctag cttaggaagg ctctgtagga gccagaactg ggctgagagc cagcccatag
   121261 agacaaaaga ggcccggccc tgacatcaga gggttcaaac atgatgtctg agccccacct
   121321 acagtctgcc ggaggtggtt ggaaggaaga gcctttatcc ttacaattct tactgaaatt
   121381 caaattttta ggttttgcaa aaaaatggtg gacctgaagg aaatttgaca ggagcatgtc
   121441 tcagctgtat ttaaatttgt ctcagccaat ccccttttga atgttcagag tgtaagcttc
   121501 aggagggcag cgcgtcttag tgtgactttt ctggtcagtt caggtgcttt aaggagacaa
   121561 ttagagatca atctggaaaa cttcatttga atttttaata cataagaaaa caataagaaa
   121621 tagttaaaaa tatatattta tataatatat atgtgtgtgt gtgtgtgtgt gtgtatatat
   121681 atatatatat tttatttatt tatttttttt tgagatggag tctcgctctg ttgcccaggc
   121741 tggagtgcag tggctcaatc ttggctcact gccacctctg cctcccaggt tcaagtgatt
   121801 ctcctacctc agcctcctga gtagctggga ttacaagcat gtgccaccac actggctaat
   121861 ttttctaatt ttagtagaga tggagtttca ccatgttgga caggatggtc ttgaactcct
   121921 gacttagtga tccacccgcc ttcgcctccc aaagttctgg gattacaggc atgagccatc
   121981 gtgcctggca attatattta atatttaata ataaggaaat aattgctgta actttacttt
   122041 aaattgtgga attctgaaac tggaagggaa ctggaaatga cttgttgaat caaatcattt
   122101 taaactttta ttttgccagt ggaaaaaata agcccccaaa agagcagggg acctgctgat
   122161 gtcccacagt aattcagagc tggagatgag gttgaaggct ttgtgtctta tctccaggga
   122221 aaatttgtag acagcgtagc tctttatgtg acgagcattc tcaccccagt catcccccaa
   122281 ttctctactc atttgagaac ataaattgga tcttgccagt ctctactcat ttttcagcac
   122341 atcgagcata agatccagac tctttcccag gcctctctca tctggctcct ctcctcctcc
   122401 tttatcatta ctcttcttcg tagcttatcc tactccagcc atgctgtctt cctattattc
   122461 ctaaaaagta gaaatgcatt tcttcctagg gcctttgtac ctgcacttgc catcgctttt
   122521 gctcagaatg ttctttttgc caagcttttg cccagcttgt tctccatcat tgttatgttt
   122581 tggctgaaat gtcttctctt agtaggttca ttctccccag tcactgtctt tttattttgc
   122641 tttattttgg gccatctaag gttatcttat tagtgtattt gttgttcgtc tcctccatgg
   122701 gcatacacct ccatgaaggc aggtattttc accttaggcc ctcgaatata ctggacagca
   122761 tctggcacgt agtagatgct caacgaatgt ttgttgtgtg agcaaatggt tggttgattg
   122821 gattgaactg agttcagtat gtaaatattt agggcctctt tgcattctat tttacttatg
   122881 tataaaatga tacataatga tgatataaat gatgtcacag tgtacaaggc tgttgtggga
   122941 tcaagcaatc aaatgagatc atgcttgtct tttccaaatg gtgagggaat agatgcatgt
   123001 ttgtggttgt tacggaatga tcctgtgctc ctgaggcaac agaaaggcca ggccatctct
   123061 ggtaatccta ctcttgctgt cttccctttg cagagacacg ccctgccagg caggggagga
   123121 agagtggacc actgccccag ttccccagac catcatggac ctcttccaga atgggaactg
   123181 gacaatgcag aacccttcac ctgcatgcca gtgtagcagc gacaaaatca agaagatgct
   123241 gcctgtgtgt cccccagggg caggggggct gcctcctcca caagtgagtc actttcaggg
   123301 ggtgattggg cagaaggggt gcaggatggg ctggtagctt ccgcttggaa gcaggaatga
   123361 gtgagatatc atgttgggag ggtctgtttc agtctttttt gttttttgtt tttttttctg
   123421 aggcggagtc ttgctctgtc gcccaggctg gagtgctgtg gcatgatctt gcctcactgc
   123481 aacctccacc tcccaggttc aagcgattct cctgcctcag cctcctgagt agctgggatt
   123541 acaggcacgc accaccatgt ctggctaatt tttgtgtttt tagtagagat agggtttcgc
   123601 cgtgttggct aggctggtct ggaattcctg acctcaggtg atccacccgc ctcggcctcc
   123661 caaagtgctg ggattacagg cgtgagccac tacgcccagc cctgtttcag tctttaactc
   123721 gcttcttgtc ataagaaaaa gcatgtgagt tttgagggga gaaggtttgg accacactgt
   123781 gcccatgcct gtcccacagc agtaaagtca caggacagac tgtggcaggc ctggcttcca
   123841 atcttggctc tgcaacaaat gagctggtag cctttgacag gcctgggcct gtttcttcac
   123901 ctctgaatta gggaggctgg accagaaaac tcctgtggat cttgtcaact ctggtattct
   123961 tagagactct gtttgggaag gagtcctgag ccattttttt tttcttgaga atttcaggaa
   124021 gaggagtgct tatgatagct ctctgctgct tttatcagca accaaattgc aggatgagga
   124081 caagcaattc taaatgagta caggaactaa aagaaggctt ggttaccact cttgaaaata
   124141 atagctagtc caggtgcggg gtggctcaca cctgtaatct cagtattttg ggatgccgag
   124201 gtggactgat cacctaaggt caggagttcg aaaccagctt ggccaatgtg gcgaaaccct
   124261 gtctctacta aaaattcaaa aattagccag gcatggtggc acatgcctgt aatcccagtt
   124321 acttgggagg ctgaagcagg agaattgctt gaacctggga ggtggaggtc gcagggagcc
   124381 aaaattgcgc cactgtactc cagcctgagc aacacagcaa aactccatat caaaaaataa
   124441 aatgaataaa ataacagcta atctagtcat cagtataact ccagtgaaca gaagatttat
   124501 taggcatagt gaatgatggt gcttcctaaa aatctcttga ctacaaagaa tctcatttca
   124561 atgtttattg tttagatgtt cagaataaat tcttgggaaa gaccttggct tggtgtaagt
   124621 gaattaccag tgccgagggc agggtgaacc aagtctcagt gctggttgac tgagggcagt
   124681 gtctgggacc tgtagtcagg tttccggtca cactgtggac atggtcactg ttgtccttga
   124741 tttgttttct gtttcaattc ttgtctataa agacccgtat gcttggtttt catgtgatga
   124801 cagagaaaac aaaacactgc agatatcctt caggacctga caggaagaaa catttcggat
   124861 tatctggtga agacgtatgt gcagatcata gccaaaaggt gactttttac taaacttggc
   124921 ccctgcctta ttattactaa ttagaggaat taaagaccta caaataacag actgaaacag
   124981 tgggggaaat gccagattat ggcctgattc tgtctattgg aagtttagga tattatccca
   125041 aactagaaaa gatgacgaga gggactgtga acattcagtt gtcagcttca aggctgaggc
   125101 agcctggtct agaatgaaaa tagaaatgga ttcaacgtca aattttgcca cttagtagca
   125161 acttgaccag gtaactggtt atccttttaa agccttagtt tatctaaatt gtgatattaa
   125221 tgttgctctt ataagtttgt catgaggact aaattaaatg gtgtacatag agtgccttgg
   125281 gtactctctg atgggggact ccatgataat ttgtggtctc atggagggag ctctgggaag
   125341 gtttaggagc ctgccttggc tctgcagcct tgggagagcc ttctagcttc ccaggacatg
   125401 gcagcctagt gttgaatgct tggctcagca aatgtttgtt ctcgtttcct tcccatcaac
   125461 ttggtcagtt ggggtctttc agttaggagt atctcagtga ctttaaatgg catgggcatg
   125521 ctggagtgat agtgaccatg agtttctaag aaagaagcat aatttctcca tatgtcatcc
   125581 acaattgaaa tattattgtt aattgaaaaa gcttctaggc caggcacggt ggctcatgcc
   125641 tgtaatccca gcactttagg aggccaaggc gggtggatca cttgaggtca ggagtttgag
   125701 accagcctgg ccaacatggg gaaaccctgt ctctactaaa aatacaaaat aagctgggcg
   125761 tggtggtgcg tgcctgtaat cccagctact tgggaggctg aggcaggaga attgcttgaa
   125821 tctgggaggc ggaggttgca gtgagctgag ttcatgccat tgcattccag cctgggcaac
   125881 aagagcgaaa ccatctccca aaagaaaaaa aaaagaaaga aaaagcttct agtttggtta
   125941 catcttggtc tataaggtgg tttgtaaatt ggtttaaccc aaggcctggt tctcatataa
   126001 gtaatagggt atttatgatg gagagaaggc tggaagaggc ctgaacacag gcttcttttc
   126061 tctagcacaa ccctacaagg ccagctgatt ctagggttat ttctgtccgt tccttatatc
   126121 ctcaggtgga tatttactcc ttttgcatca ttaggaatag gctcagtgct ttctttgaac
   126181 tgattttttg tttctttgtc tctgcagctt aaagaacaag atctgggtga atgagtttag
   126241 gtaagttgct gtctttctgg cacgtttagc tcagggggag gatggtgttg taggtgtctt
   126301 ggattgaaga aagccttggg gattgtttgt cactcacaca cttgtgggtg ccatctcact
   126361 gtgaggagga cagaagccct gtgaacatgt ggagcacaca ggggcacaga cagatttaga
   126421 ttaggcctgc tttatagagt ttctgcctag agcatcatgg ctcagtgccc agcagcccct
   126481 ccagaggcct ctgaaatatt tgatatactg atttccttga ggagaatcag aaatctcctg
   126541 caggtgtcta gggatttcaa gtaagtagtg ttgtgagggg aatacctact tgtactttcc
   126601 ccccaaacca gattcccgag gcttcttaag gactcaagga caatttctag gcatttagca
   126661 cgggactaaa aaggtcttag aggaaataag aagcgccaaa accatctctt tgcactgtat
   126721 ttcaacccat ttgtccttct gggttttgaa ggaacaggtg ggactgggga cagaagagtt
   126781 cttgaagcca gtttgtccat catggaaaat gagataggtg atgtggctac gtcagggggc
   126841 ccgaaggctc cttgttactg atttccgtct tttctctctg ccttttcccc aagggccagg
   126901 acccctggat ctctgggcag agcagacgca ggcccctata atagccctca tgctagaagg
   126961 gagccggagc ctgtgtataa ggccagcgca gcctactctg gacagtgcag ggttcccact
   127021 ctcccaactc cccatctgct tgcctccaga cccacattca cacccgagcc actgggttgg
   127081 aggagcatct gtgagatgaa acaccattct ttcctcaatg tctcagctat ctaactgtgt
   127141 gtgtaatcag gccaggtcct ccctgctggg cagaaaccat gggagttaag agattgccaa
   127201 catttattag aggaagctga cgtgtaactt ctgaggcaaa atttagccct cctttgaaca
   127261 ggaatttgac tcagtgaacc ttgtacacac tcgcactgag tctgctgctg atgatactgt
   127321 gcaccccact gtctgggttt taatgtcagg ctgttctttt aggtatggcg gcttttccct
   127381 gggtgtcagt aatactcaag cacttcctcc gagtcaagaa gttaatgatg ccaycaaaca
   127441 aatgaagaaa cacctaaagc tggccaaggt aaaatatcta tcgtaagatg tatcagaaaa
   127501 atgggcatgt agctgctggg atataggagt agttggcagg ttaaacggat cacctggcag
   127561 ctcattgttc tgaatatgtt ggcatacaga gccgtctttg gcatttagcg atttgagcca
   127621 gacaaaactg aattacttag ttgtacgttt aaaagtgtag gtcaaaaaca aatccagagg
   127681 ccaggagctg tggctcatgc ctgtaatcct agcactttgg gaggctgaag cgggtggatc
   127741 acttgaggtc aggagttcga gaccagcctg gcctacatga caaaaccccg tatctactaa
   127801 aaatacaaaa aaattagctg ggcttggtgg cacacacctg taatcccagc tacttgggag
   127861 gctgaggcag gagaattgct tgaaccctgt aggaagaggt tgtagtgagc caagatcgca
   127921 ccgttgcact ccagcctggg caacaagagc aaaactccat ctcaaaaaac aaattaaatc
   127981 cagagattta aaagctctca gaggctgggc gcggtggctt acacctgtta tcccagcatt
   128041 ttgggatgcc gaggcgggca aagcacaagg tcaggagttt gagaccagcc tggccaacat
   128101 agtgaaaccc tgtctttgct aaaaacatag aaaaattagc cgggcatggt ggcgtgcgcc
   128161 tgtaatccca gctactcggg aggctgaggt gagagaattg cttgaacccg ggaggcggag
   128221 gttgcagtga gcccagattg caccactgca ctccagcctg ggcgacagag caagactcca
   128281 tctcaaaaaa agctctcaga acaaccaggt ttacaaattt ggtcagttgg taaataaact
   128341 gggtttcaaa catactttgc tgaaataatc actgactaaa taggaaatga atcttttttt
   128401 tttttttttt aagctggcaa gctggtctgt aggacctgat aagtactcac ttcatttctc
   128461 tgtgtctcag gtttcccatt tttaggtgag aattaagggg ctctgataaa acagacccta
   128521 ggattgtgga cagcagtggt agtcctagag tccacaagtc tgcttttgag tgatgggccc
   128581 atgtatctgg cacatctgca ggcagagcgt ggttctggct cttcagatga tgccggtgga
   128641 gcactttgag gagtcctcac cccaccgtga taaccagaca ttaaaatctt ggggctttgc
   128701 atcccaggat ttctctgtga ttccttctag acttgtggca tcatggcagc atcactgctg
   128761 tagatttcta gtcacttggt tctcaggagc cgtttattta atggcttcac atttaatttc
   128821 agtgaacaag gtagtggcat tgctcttcac agggccgtcc tgttgtccac aggttccaga
   128881 ttgactgttg ccccttatct atgtgaacag tcacaactga ggcaggtttc tgttgtttac
   128941 aggacagttc tgcagatcga tttctcaaca gcttgggaag atttatgaca ggactggaca
   129001 ccagaaataa tgtcaaggta aaccgctgtc tttgttctag tagctttttg atgaacaata
   129061 atccttatgt ttcctggagt actttcaact catggtaaag ttggcagggg cattcacaac
   129121 agaaaagagc aaactattaa ctttaccagt gaggcagtac ggtgtagtgt agtgattcag
   129181 agaatttgct ttgccaccag acataccagg taaccttgac taagttactt aacctatcta
   129241 aacctcagtt tcctcatctg tgaaatggag acagtaatca tagctatttc caaactgttg
   129301 tgagaattca atgagttaaa ggtataaggt cctcaccaca gcgcctgccc acatagtcag
   129361 tgatcactat gtcctgaaca ctgtaattac ttcgccatat tctctgatca tagtgttttg
   129421 ccttggtatg tgactagaat ttctttctga ggtttatggg catggttggt gggtatgcac
   129481 ctgcctgcag gagcccggtt tgggggcatt accttgtacc tggtatgttt tctttcaggt
   129541 gtggttcaat aacaagggct ggcatgcaat cagctctttc ctgaatgtca tcaacaatgc
   129601 cattctccgg gccaacctgc aaaagggaga gaaccctagc cattatggaa ttactgcttt
   129661 caatcatccc ctgaatctca ccaagcagca gctctcagag gtggctcygt aagtgtggct
   129721 gtgtctgtat agatggagtg gggcaaggga gagggttatg gagaagggga gaaaaatgtg
   129781 aatctcattg taggggaaca gctgcagaga ccgttatatt atgataaatc tggattgatc
   129841 caggctctgg gcagaagtga taagtttacg aattggctgg ttgggcttct tgaactgcag
   129901 aagagaaaat gacactgata tgtaaaaatc gtaacattta gtgaattcat ataaagtgag
   129961 ttcaaaaatt gttaattaaa ttataattta attataagtg tttaatcagt ttgatttgtt
   130021 taaaaaccac tgttttaaat ttggtggaat atgtttttat tagcttgtat ctttaattcc
   130081 taaattaagc tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg
   130141 tgaagtttaa agccaggatg agctagttta aagtatgcag cctttggagt catacagatc
   130201 tgggtttgaa tctggtctct aaactttata gatgtatgat attaaatgag gcagttcatg
   130261 taaattgcca agcccagcac tcagcacaga gttgatattt cacacacatt agataccttt
   130321 cctgtatgtg gagcatggca gttcctgttt ctgctttact cctacaggat actaatatag
   130381 gacactagga tctttatacc aagaccccat gtaatgggct tatgagacca ttcttcttat
   130441 aaaaatctga cagaattttt gtatgtgtta gatcaatagg ctgcatactg ttattttcaa
   130501 gttgatttac agccagaaat attaatttat ttgagtagtt acagagtaat atttctgctc
   130561 tcatttagtt ttcaagcccc actagtcctt tgtgtgtgaa aatttacaac ttactgctct
   130621 tacaaggtca tgaacagtgg accaaagtga atgccattaa ccactctgac ttccttcatt
   130681 agttttattg tgacagtgga ctcttttgac ctcagtaata ccagtttggc atttacattg
   130741 tcatattttt agacttaaaa atgatcatct taaccctgaa taaaatgtgt ctggtgaaca
   130801 gatgtttttc cttgggctgt gcctcagata tctctgtgtg tgtgtacgtg tgtgtttgtc
   130861 tgtgtgtcca tgtcctcact gattgagccc taactgcatc aaagacccct cagattttca
   130921 cacgcttttt ctctccagga tgaccacatc agtggatgtc cttgtgtcca tctgtgtcat
   130981 ctttgcaatg tccttcgtcc cagccagctt tgtcgtattc ctgatccagg agcgggtcag
   131041 caaagcaaaa cacctgcagt tcatcagtgg agtgaagcct gtcatctact ggctctctaa
   131101 ttttgtctgg gatatggtaa ggacacaggc ctgctgtatc tttctgatgt ctgtcagggc
   131161 catggattga tatggataag aaagaaagag ctctggctat catcaggaaa tgttccagct
   131221 actctaaaga tgtatgaaaa agaaatagcc agaggcaggt gatcactttc atgacaccaa
   131281 acacagcatt gggtaccaga gttcatgtca caccagaggg aaaattctgt acacaatgat
   131341 gaaaattaat accactacca cttaagttcc tatgtgacaa ctttcccaag aatcagagag
   131401 atacaagtca aaactccaag tcaatgcctc taacttctct gatgggtttt aacctccaga
   131461 gtcagaatgt tctttgcctt actaggaaag ccatctgtca tttgaaaact ctgtacattt
   131521 tatcagcagc ttatccatcc attgcaaata tgtttttgtg ccagccacaa tatattgctt
   131581 ctatttggac caatatgggg gatttgaagg aattctgaag ttctaattat atttcaactc
   131641 tactttacaa tatctccctg aaatatatct ccctgtaact tctattaatt ataagctaca
   131701 cagagcaaat ctaattcttc tcccaccgaa caagtccctg gatatttaaa aataactctc
   131761 atactctcat ttaacctgag tattacccag ataagatgat atatgagaat acaccttgta
   131821 acctccgaag cactgtacaa atgtgagcaa tgatggtgga gatgatgatg agatctttgc
   131881 tgtttatacc aagcccctta gactgtgtca ctcttctgat ccggttgtcc ttgtatggcc
   131941 atgctgtata ttgtgaatgt cccgttttca aaagcaaagc caagaattaa ccttgtgttc
   132001 aggctgtggt ctgaatggtt atgggtccag agggagttga tctttagctc acacttctat
   132061 tactgcagca caaagatttt gcattttgga aggagcaccg tcttactggc aacttagtgg
   132121 taaaccaaaa cctccatttc acacaaatga ttgtgaaatt cgggtctcct tcattctata
   132181 caaattcatt tgattttttt gaaactaaac tttatattta tccatattaa attacatggg
   132241 ttttattttt gttttatctt gattcagtaa ttactccttt cagtaaacac agactgagtg
   132301 ctgtgtgtct gacttatgcc aggcataggt gattcagaga tgaaaggtca agtccctgaa
   132361 cccatctctt gtcttcctgg gtattatctg tccctccctg ctttagagct cctgaaattt
   132421 gctagaagca tgtcttcatc taagttgttg ataaacacat caagtaggat tggactgagg
   132481 cagagccctg tagtctgaag ctgcagttct tctagcggct gacaagcccc actatcactt
   132541 ccctgctggt gctttgctct gccagctgtg aattctcata attgtcctat cgtcaagtct
   132601 ttatttctgc attttactgc ttgatacact gtcaggacag actttaaaat tattctcagt
   132661 gcgatgaaac aattctgaca ttcatgttat gagcagttac ctcataaata gattacatgt
   132721 gagattgaac ttgggcagac tataatatag cattaatgat gaaacagaca cagtcatctt
   132781 cgggaagaag aatagaggct tatttgctgc ctgtgaaatt aaaattactc tgactgggaa
   132841 tccatcgttc agtaagttta ctgagtgtga caccttggct tgactgttgg aaagacagaa
   132901 agggcatgta gtttataaaa tcagccaagg ggaaaatgct tgtcaaaatg tattgtcggg
   132961 tattttgatt aatagtttat gtggcttcat taattcagag ttactctcca atatgtttat
   133021 ctgccctttc ttgtctgata atggtgaaaa cttgtgtgat gcattgtata tttgatttag
   133081 gggtgaactg gatgtctttg ttttcacttt tagtgcaatt acgttgtccc tgccacactg
   133141 gtcattatca tcttcatctg cttccagcag aagtcctatg tgtcctccac caatctgcct
   133201 gtgctagccc ttctactttt gctgtatggg taagtcacct ctgagtgagg gagctgcaca
   133261 gtggataagg catttggtgc ccagtgtcag aaggagggca gggactctca gtagacactt
   133321 atctttttgt gtctcaacag gtggtcaatc acacctctca tgtacccagc ctcctttgtg
   133381 ttcaagatcc ccagcacagc ctatgtggtg ctcaccagcg tgaacctctt cattggcatt
   133441 aatggcagcg tggccacctt tgtgctggag ctgttcaccg acaatgtgag tcatgcagag
   133501 agaacactcc tgctgggatg agcatctctg ggagccagag gacagtgttt aattgtgatc
   133561 ttattccact tgtcagtggt attgacactg ctgactgcct tgtcctgtct tcagagtctg
   133621 tcttccctga gaaggcaaag cacctttctt tcttgctgtg ccttacattt tgctggtcaa
   133681 gcctttcagt ttcttttgac agtttttttt acttctttct tttttcaatg ttgctcttac
   133741 caagagtagc tcctctgcct tccactttac acatgagagc tgggcgacgc cattcagtcc
   133801 taaggctttt accatcacct ctcttggtgt ttttattgtc atctctaaga tcaatgcctt
   133861 tagccttgat cataaccttg aactctaatc tcaaattctc acttgcctag tggattgctc
   133921 catttagata gtatatagat accccaacct ggacatgtcc tagttttctt tccccttgga
   133981 acttaatgct tttcttgcca tccctgtcac actcagtggc actaccatcc actcggttgc
   134041 ccaagctggc tcttagagtt atcctagatg cttgctttgc tgttgcagat ttcccacatt
   134101 caactggtta tgttgtcagt tcttccaggt atggacctct aaaataaggc ttcctctcca
   134161 ttccggttgt cattgccttt gtccaaacac agcacacaag gccttttaca gttgcacaac
   134221 tcttcctgtc catacccacc acaccctttc ccagctgtaa gctttattca ttatttctcc
   134281 atactcagag catttctcca gattttctaa agaatagaat tttattgcta catatcatca
   134341 gctatgcctg ctgctattta attggtatct gaattaaaag gtctggtttg tccctagaga
   134401 atcaaatttt ttcttcactc ccatatttca gaacttgata catttttagg ataaaccatg
   134461 aatgacaccc gtttcttctc cctcaccctc ccttccctcc catttttttt tttttttttt
   134521 tttagaagct gaataatatc aatgatatcc tgaagtccgt gttcttgatc ttcccacatt
   134581 tttgcctggg acgagggctc atcgacatgg tgaaaaacca ggcaatggct gatgccctgg
   134641 aaaggtttgg tgagtgaagc agtggctgta ggatgcttta atggagatgg cactctgcat
   134701 aggccttggt accctgaact ttgttttgga aagaagcagg tgactaagca caggatgttc
   134761 ccccaccccc atgcccagtg acagggctca tgccaacaca gctggttgtg gcatgggttt
   134821 tgtgacacaa ccatttgtct gtgtctctga tagcattgag aaaagtgaaa gggcagtttt
   134881 gaaggtaagg aaaatagtgt tatttgcttg gatccactgg ctcatgccac tgtctgggtt
   134941 ggttagaagc actggaaaag tcaaaccata actttgagaa ttaggtgatc agggaatcag
   135001 aaggaaagat gcaaactttg gctcttttag gcgaatcatg tgcctgcaga tgaggtcatt
   135061 tattatcttt tacacagtct ataaaattat aatgtattac atctttttct acctttagaa
   135121 tggttaaaaa tatttctccg gtagccatat gattattatt catccattag ataatatagt
   135181 caaatgggcc atgttattta ctgttcatag aagaggggct ttttgcaact tgggctacaa
   135241 aggagatatg taaggaattt aaggaatggt tacatggaac tagatttaat tgaatctagt
   135301 ggtttaattg attcactagg atatatgcta ctgaaagggg aatctgctta aagtgctttc
   135361 tgatatttat tattactaaa acttagaatt tattaaaaat actgactgtg aaaattactt
   135421 gggtcgtttg cctttttaaa aggatttttg gcatgtctca ttaaaaaaag aaatactaga
   135481 tatcttcagt gaagttacaa atcgaataca cattggctct gaaattctga ttgatactgg
   135541 gtcataaaaa gttttcccaa atcagacttg gaaagtgatc actctcttgt tactcttttt
   135601 tccttgtcat gggtgatagc catttgtgtt tattggaaga tcggtgaatt ttaaggaaca
   135661 taggcccaaa tttgaggaag ggccatggtt tttgatccct ccattctgac cggatctctg
   135721 cattgtgtct actaggggag aatcgctttg tgtcaccatt atcttgggac ttggtgggac
   135781 gaaacctctt cgccatggcc gtggaagggg tggtgttctt cctcattact gttctgatcc
   135841 agtacagatt cttcatcagg cccaggtgag ctttttctta gaacccgtgg agcacctggt
   135901 tgagggtcac agaggaggcg cacagggaaa cactcaccaa tgggggttgc attgaactga
   135961 actcaaaata tgtgataaaa ctgattttcc tgatgtgggc atcccgcagc cccctccctg
   136021 cccatcctgg agactgtggc aagtaggttt tataatacta cgttagagac tgaatctttg
   136081 tcctgaaaaa tagtttgaaa ggttcatttt tcttgttttt tcccccaaga cctgtaaatg
   136141 caaagctatc tcctctgaat gatgaagatg aagatgtgag gcgggaaaga cagagaattc
   136201 ttgatggtgg aggccagaat gacatcttag aaatcaagga gttgacgaag gtgagagagt
   136261 acaggttaca atagctcatc ttcagttttt ttcagcttta tgtgctgtaa cccagcagtt
   136321 tgctgacttg cttaataaaa gggcatgtgt tcccaaaatg tacatctata ccaaggttct
   136381 gtcaatttta ttttaaaaac accatggaga cttcttaaag aattcttact gagaattctt
   136441 ttgtgatatg aattcccatt ctcgaataca ttggttttat atgcttacat ttatgtgtta
   136501 gttattaaaa catactaata ttgtatatct agtcaaaact gaggtagaga gaataaatgg
   136561 ttgattttga gtttgagttt catagtccaa aaagctgata tattgcctgt gttcaagagg
   136621 gtctatatca gccctctaga tgccagcatc tccaaatttt acttttttgg aatctgtaca
   136681 gtatttgcaa tatttttatt acaaatttct actctgtgga atttaatttt taaaatacct
   136741 gcaatacata tatatgttga atagatgaaa aattatgtag atgataatga atgatacggt
   136801 tctaaaaaga caggttaaaa agtaagttca cttttatttt gagcttcaga atcattcaga
   136861 agccagtcgc cacaaacgca gaccaaggct cttggcacat caaatatgcc tatggcttag
   136921 ggttattgac aagtcttatg ttgcagtgta tgtggtttat agtcctgcct tccacagttg
   136981 cttgggagag ctgtgagtca ctgaggctta tgaatgttta cattttgttt gttgcagata
   137041 tatagaagga agcggaagcc tgctgttgac aggatttgcg tgggcattcc tcctggtgag
   137101 gtaaagacac tttgtctata ttgcgtttgt ccctattagt tcagactatc tctacccaat
   137161 caagcaacga tgctcgttaa gaggtaaaag tggattttaa aggcttctgt atttatgcca
   137221 ggatggagca attagtcatc gagaagagag ggaccctgta tgtcaagaga atgatttcag
   137281 agaatccaat acaatttaag aaaaagcatg gggctgggcg cagtgattca ctcctgtaat
   137341 cccagcactt tgggaggccg aggtgggcgg atcacgaggt caggagattg agaccatcct
   137401 ggccaacatg gtgaaacccc atctctacta taaatacaaa aattagctgg gcatagtagt
   137461 gcattcctgt agtcccagct actcgggagg ctgaggcagg agaattgctt gaacctagga
   137521 gggggaggtt gcccagattg cgctgctgca ctccagcctg gtgacagagt gagactcatg
   137581 tcaacaacaa aaacagaaaa agcacgcaca tctaaaacat gcttttgtga tccatttggg
   137641 atggtgatga cattcaaata gttttttaaa aatagatttt ctcctttctg gtttccgttt
   137701 gtgttctttt atgccctttt gccagagtag gtggtgcaat ttggctagct ggctttcatt
   137761 actgtttttc acacattaac tttggcctca acttgacaac tcaaataata tttataaata
   137821 cagccacact taaaatggtc ccattatgaa atacatattt aaatatctat acgatgtgtt
   137881 aaaaccaaga aaatatttga ttcttctctg atatttaaga attgaaggtt tgaggtagtt
   137941 acgtgttagg ggcatttata ttcatgtttt tagagtttgc ttatacaact taatctttcc
   138001 ttttcagtgc tttgggctcc tgggagttaa tggggctgga aaatcatcaa ctttcaagat
   138061 gttaacagga gataccactg ttaccagagg agatgctttc cttaacaraa ataggtgaga
   138121 aaagaagtgg cttgtatttt gctgcaaaga ctttgttttt aatttattta aagaaatagg
   138181 ttgttatttt tgattacagt ggtattttta gagttcataa aaatgttgaa atatagtaaa
   138241 gggtaaagaa gcacataaaa tcatccatga tttcaatatc tagagataat cacaatttac
   138301 atttcctttc agtctcattc tcttctttta acagctttat tcaggtataa tttacataca
   138361 atataatttg cttgtttttt aagagtataa tttagtgatt tttggtaaat tgagagtttt
   138421 gcaaccatca ccacaatcca gttttagaac ttttccatca ccccacatct gtcttatata
   138481 cacatataaa tgtgccatac aattgagatc atactgtatg tagaatttaa aattagtttt
   138541 tattgttaat gagtgtatta tgaatatttc ccagtgggtt acatttccta agatgtggaa
   138601 ttttacattg ctacataaaa tccccctatg tacatgtacc tataatttat ttaataaatt
   138661 ccttataaat gttggacaca ttagtttcca tttttcacta tgtaaatatg tccctgtata
   138721 catcttttat tatttcctca ggaacaattc ctacaaagta aattgccctc tctaaagagc
   138781 atacaaattg actgagccac cgttaggcca ttttctgaga ctgcacaggt cacaaagcaa
   138841 tctgatcttt gggaatacag ctacatttta taggcttctt agataatgtt actctaagta
   138901 ctttaaatat gtggggcttc tctgggcttt tttttttttg agacggagtt tcactcttac
   138961 tgcccaggct ggagagcaat ggcgcgacct tggctcactg caacctccgc ctcccaggtt
   139021 caagcgattc tcctgcctca gcctcctgag tagctgagat tacaggtgcc cgccacaatg
   139081 cctgcctaat ttttttgtat tttcagtaga gatggggttt caccatgttg gccagactgg
   139141 tctcgagctc ctgacctcag gtgatccacc tgcctcagcc tcccaaagtt ctgggattac
   139201 aggcatgagc cactgcgccc ggcttctctg gacttattat gtggagagat agtacaaggc
   139261 agtggctttc agagtttttt gaccatgacc gttgtgggaa atacatttta tatctcaacc
   139321 tagtatgtac acacagacat gtagacacat gtataaccta aagtttcata aagcagtacc
   139381 tactgttact aattgtagtg cactctgcta tttcttattc taccttatac tgcgtcatta
   139441 aaaaagtgct ggtcatgacc cactaaattt atttcccaaa ccactaatga acaatgactc
   139501 acaatttgaa cacactggac agggggatag ccaataaaat tgaaaagagc aaggaaatta
   139561 atgtattcat gatctcctct cctgtctctt acatttttgc agtagcaatg taaaggaatc
   139621 ctaagagaac agacattctg ggaatagcag gcctagcgct gcacaactgc tttcctaggc
   139681 ttgctcctag taccaagctc ctgacgcata tagcagtggc agtaataacc agcccatagt
   139741 aaggtttgtc acagggactg gttgtaagaa ctgatttggt tggtatagct gtgagggcct
   139801 ggcacggtgt ccacgtgtgc ctcaatccta attctgaaaa aggctgaccc tgggggtgct
   139861 aattagatac acagagagga atgaatgctg ccagaaggcc aagttcatgg caatgccgct
   139921 gtggctgagg tgcagtcatc agtctggaac gtgaacactg aacttctctc acatgtgatt
   139981 cttcacttga ctggcttcat agaaccccaa agccacccca ccaccacata aattgtgtct
   140041 ctaggttctg tgttgctcac actcaaaatt tctgggcctt ctcatttggt gcatgtgaat
   140101 ggtgcatatg agtgaagtct aggatggggc cttagcatta aagccctggg gtagtgtgac
   140161 tgagattgtt ggtaaagaat gtgcagtggt tggcatgacc tcagaaattc tgaaatggga
   140221 ctgcacctgc agactgaagt gttcagagag ccagggaggt gcaaggactg gggagggtag
   140281 aggcaggaac cctgcctgcc aggaagagct agcatcctgg gggcagaaag gctgtgcttt
   140341 caagtagcag cagatgtatt ggtatctttg taatggagaa gcatacttta caggaacatt
   140401 aggccagatt gtctaaccag agtatctcta cctgcttaaa atctaagtag ttttcttgtc
   140461 ctttgcagta tcttatcaaa catccatgaa gtacatcaga acatgggcta ctgccctcag
   140521 tttgatgcca tcacagagct gttgactggg agagaacacg tggagttctt tgcccttttg
   140581 agaggagtcc cagagaaaga agttggcaag gtactgtggg cacctgaaag ccagcctgtc
   140641 tcctttggca tcctgacaat atatacctta tggcttttcc acacgcattg acttcaggct
   140701 gtttttcctc atgaatgcag cagcacaaaa tgctggttct ttgtatctgc tttcagggtg
   140761 gaaacctgta acggtggtgg ggcagggctg ggtgggcaga gagggagtgc tgctcccacc
   140821 acacgagtcc cttctccctg ctttggctcc tcaccagttg tcaggttatg attatagaat
   140881 ctagtcctac tcagtgaaag aactttcata catgtatgtg taggacagca tgataaaatt
   140941 cccaagccag accaaagtca aggtgctttt tatcactgta ggttggtgag tgggcgattc
   141001 ggaaactggg cctcgtgaag tatggagaaa aatatgctgg taactatagt ggaggcaaca
   141061 aacgcaagct ctctacagcc atggctttga tcggcgggcc tcctgtggtg tttctggtga
   141121 gtataactgt ggatggaaaa ctgttgttct ggcctgagtg gaaaacatga ctgttcaaaa
   141181 gtcctatatg tccagggctg ttgtatgatt ggcttgtctt cccccaggga cagcagagca
   141241 accttggaaa agcagaggga agcttctccc ttggcacaca ctggggtggc tgtaccatgc
   141301 ctgcagatgc tcccaaatag aggcactcca agcactttgt ttcttagcgt gattgaggct
   141361 ggatatgtga tttgatcttt ctctggaaca ttctttctaa tcatctttgt gttcattccc
   141421 tgaaaatgaa gagtgtggac acagctttaa aatccccaag gtagcaacta ggtcatagtt
   141481 ccttacacac ggatagatga aaaacagatc agactgggaa gtggcccttg accttttttc
   141541 ttctgtagat aagagcattg atgttattac gggaagaagc ctttgaggct tttatgtatt
   141601 ccacctcggt ctggaatttg tttctgtaag gctaacagtt gcaatatact agggtaatct
   141661 gagtgagctg gaattaaaaa aaaaaaggaa tttcacccca atcttatact gacttcaata
   141721 gaggtttcag acaaaaagtt gttttgtata tacttatcag tcatgaaaag ataattacaa
   141781 ctaaatggcc tttttccttc cctatttatt tggagaaatt taattacata aaaaagtact
   141841 cagaatattt gagtttcctg catcaataag acatttataa taatgacctt gtttacaaat
   141901 gaatttgaaa gttactctaa ttctttgatt catcaagaaa taactagaat ggcaagttaa
   141961 aatttaagct gtttcaaaga tgcttctgca tttaaaaaca aatttatctt tgattttttt
   142021 tccccccagc aaataagact tattttattc taattacagg atgaacccac cacaggcatg
   142081 gatcccaaag cccggcggtt cttgtggaat tgtgccctaa gtgttgtcaa ggaggggaga
   142141 tcagtagtgc ttacatctca taggtccgta gtaaagtctt gggttcctca ctgtgggatg
   142201 ttttaacttt ccaagtagaa tatgcgatca ttttgtaaaa attagaaaat acagaaaagc
   142261 aaagagtaaa acaattatta cctgaaatta tatatgcata ttcttacaaa aatgcaagcc
   142321 cagtataaat actgctcttt ttcacttaat atattgtaaa cattattcca agtcagtgca
   142381 tttaggtgtc atttcttata gctggatagt attccattag gatatactct tatttaacta
   142441 ttcccccttt tgtagacatt tggattattt ccaacttgtt cacaattgta aacaccacta
   142501 cactgaacag catcatccct atatccacat gtacttgtaa cagaatacaa ttccctagga
   142561 agctggaatg ctggaagtca tggtgatgtt ctcatggtta cagagaatct ctctaaaact
   142621 aaaacctctt tctgttttac cgcagtatgg aagaatgtga agctctttgc actaggatgg
   142681 caatcatggt caatggaagg ttcaggtgcc ttggcagtgt ccagcatcta aaaaataggt
   142741 aataaagata atttctttgg gatagtgcct agtgagaagg cttgatattt attcttttgt
   142801 gagtatataa atggtgcctc taaaataaag ggaaataaaa ctgagcaaaa cagtatagtg
   142861 gaaagaatga gggctttgaa gtccgaactg cattcaaatt ctgtctttac catttactgg
   142921 ttctgtgact cttgggcaag ttacttaact actgtaagag ttagtttccc tggaagatct
   142981 acctcctagc tttgtgctat agatgaaatg aaaaaaattt acatgtgcca gtactggtga
   143041 gagcgcaagc tttggagtca aacacaaatg ggtttgcatc ctggccctac caattatgag
   143101 ctctgagcca tgggcaagtg actaactccc tgggcctcag tttctctgta acatctgtca
   143161 gacttcatgg gtccaggtga ggattaaagg agatcatgta tttacagcac atggcatggt
   143221 gcttcacata aaataagtat ttagtaaatg ataactggtt ccttctctca gaaacttatt
   143281 tctgggcctg ccaggggccg ccctttttca tggcacaagt tgggttccca gggttcagta
   143341 ttcttttaaa tagttttctg gagatcctcc atttgggtat tttttcctgc tttcaggttt
   143401 ggagatggtt atacaatagt tgtacgaata gcagggtcca acccggacct gaagcctgtc
   143461 caggatttct ttggacttgc atttcctgga agtgttcyaa aagagaaaca ccggaacatg
   143521 ctacaatacc agcttccatc ttcattatct tctctggcca ggatattcag catcctctcc
   143581 cagagcaaaa agcgactcca catagaagac tactctgttt ctcagacaac acttgaccaa
   143641 gtaagctttg agtgtcaaaa cagatttact tctcagggtg tggattcctg ccccgacact
   143701 cccgcccata ggtccaagag cagtttgtat cttgaattgg tgcttgaatt cctgatctac
   143761 tattcctagc tatgcttttt actaaacctc tctgaacctg aaaagggaga tgatgcctat
   143821 gtactctata ggattattgt gagaatttac tgtaataata accataaaaa ctaccattta
   143881 gtgagcacct accatgggcc aggcatttta cttggtgcct aatcctattt aaattagata
   143941 aaaaagtacc aaataggtcc tgacacttaa gaagtactca gtaaatattt tcttccctct
   144001 tccctttaat caagaccgta tgtgccaaag taaatggatg actgagcagt tggtgatgta
   144061 ggggtggggg gcgatataga aagtcagttt ttggccgggc gtggtggctc atgcctgtaa
   144121 tcccagcact ttgggaggct gaggagcagg cagatcatga ggtcaggaga tccagataat
   144181 cctggccaac agggtgaaac cccgtctcta ctaaaaatac aaaaattagc tgggcatggt
   144241 ggtgcgcact tgtagtccca gctacttgcg aggctgaggc aggagaattg ctcgaaccca
   144301 ggaggtggag gttacagtga gccaaggtct cgccactgca ctccagcctg gggacagagc
   144361 aagaccccat ttcaaggggg gaaaaaaagt ctatttttaa gttgttattg cttttttcaa
   144421 gtattcttcc ctccttcaca cacagttttc tagttaatcc atttatgtaa ttctgtatgc
   144481 tcctacttga cctaatttca acatctggaa aaatagaact agaataaaga atgagcaagt
   144541 tgagtggtat ttataaaggt ccatcttaat cttttaacag gtatttgtga actttgccaa
   144601 ggaccaaagt gatgatgacc acttaaaaga cctctcatta cacaaaaacc agacagtagt
   144661 ggacgttgca gttctcacat cttttctaca ggatgagaaa gtgaaagaaa gctatgtatg
   144721 aagaatcctg ttcatacggg gtggctgaaa gtaaagaggr actagacttt cctttgcacc
   144781 atgtgaagtg ttgtggagaa aagagccaga agttgatgtg ggaagaagta aactggatac
   144841 tgtactgata ctattcaatg caatgcaatt caatgcaatg aaaacaaaat tccattacag
   144901 gggcagtgcc tttgtagcct atgtcttgta tggctctcaa gtgaaagact tgaatttagt
   144961 tttttaccta tacctatgtg aaactctatt atggaaccca atggacatat gggtttgaac
   145021 tcacactttt tttttttttt ttgttcctgt gtattctcat tggggttgca acaataattc
   145081 atcaagtaat catggccagc gattattgat caaaatcaaa aggtaatgca catcctcatt
   145141 cactaagcca tgccatgccc aggagactgg tttcccggtg acacatccat tgctggcaat
   145201 gagtgtgcca gagttattag tgccaagttt ttcagaaagt ttgaagcacc atggtgtgtc
   145261 atgctcactt ttgtgaaagc tgctctgctc agagtctatc aacattgaat atcagttgac
   145321 agaatggtgc catgcgtggc taacatcctg ctttgattcc ctctgataag ctgttctggt
   145381 ggcagtaaca tgcaacaaaa atgtgggtgt ctccaggcac gggaaacttg gttccattgt
   145441 tatattgtcc tatgcttcga gccatgggtc tacagggtca tccttatgag actcttaaat
   145501 atacttagat cctggtaaga ggcaaagaat caacagccaa actgctgggg ctgcaagctg
   145561 ctgaagccag ggcatgggat taaagagatt gtgcgttcaa acctagggaa gcctgtgccc
   145621 atttgtcctg actgtctgct aacatggtac actgcatctc aagatgttta tctgacacaa
   145681 gtgtattatt tctggctttt tgaattaatc tagaaaatga aaagatggag ttgtattttg
   145741 acaaaaatgt ttgtactttt taatgttatt tggaatttta agttctatca gtgacttctg
   145801 aatccttaga atggcctctt tgtagaaccc tgtggtatag aggagtatgg ccactgcccc
   145861 actattttta ttttcttatg taagtttgca tatcagtcat gactagtgcc tagaaagcaa
   145921 tgtgatggtc aggatctcat gacattatat ttgagtttct ttcagatcat ttaggatact
   145981 cttaatctca cttcatcaat caaatatttt ttgagtgtat gctgtagctg aaagagtatg
   146041 tacgtacgta taagactaga gagatattaa gtctcagtac acttcctgtg ccatgttatt
   146101 cagctcactg gtttacaaat ataggttgtc ttgtggttgt aggagcccac tgtaacaata
   146161 ctgggcagcc tttttttttt tttttttaat tgcaacaatg caaaagccaa gaaagtataa
   146221 gggtcacaag tctaaacaat gaattcttca acagggaaaa cagctagctt gaaaacttgc
   146281 tgaaaaacac aacttgtgtt tatggcattt agtaccttca aataattggc tttgcagata
   146341 ttggataccc cattaaatct gacagtctca aatttttcat ctcttcaatc actagtcaag
   146401 aaaaatataa aaacaacaaa tacttccata tggagcattt ttcagagttt tctaacccag
   146461 tcttattttt ctagtcagta aacatttgta aaaatactgt ttcactaata cttactgtta
   146521 actgtcttga gagaaaagaa aaatatgaga gaactattgt ttggggaagt tcaagtgatc
   146581 tttcaatatc attactaact tcttccactt tttccagaat ttgaatatta acgctaaagg
   146641 tgtaagactt cagatttcaa attaatcttt ctatattttt taaatttaca gaatattata
   146701 taacccactg ctgaaaaaga aaaaaatgat tgttttagaa gttaaagtca atattgattt
   146761 taaatataag taatgaaggc atatttccaa taactagtga tatggcatcg ttgcatttta
   146821 cagtatcttc aaaaatacag aatttataga ataatttctc ctcatttaat atttttcaaa
   146881 atcaaagtta tggtttcctc attttactaa aatcgtattc taattcttca ttatagtaaa
   146941 tctatgagca actccttact tcggttcctc tgatttcaag gccatatttt aaaaaatcaa
   147001 aaggcactgt gaactatttt gaagaaaaca caacatttta atacagattg aaaggacctc
   147061 ttctgaagct agaaacaatc tatagttata catcttcatt aatactgtgt taccttttaa
   147121 aatagtaatt ttttacattt tcctgtgtaa acctaattgt ggtagaaatt tttaccaact
   147181 ctatactcaa tcaagcaaaa tttctgtata ttccctgtgg aatgtaccta tgtgagtttc
   147241 agaaattctc aaaatacgtg ttcaaaaatt tctgcttttg catctttggg acacctcaga
   147301 aaacttatta acaactgtga atatgagaaa tacagaagaa aataataagc cctctataca
   147361 taaatgccca gcacaattca ttgttaaaaa acaaccaaac ctcacactac tgtatttcat
   147421 tatctgtact gaaagcaaat gctttgtgac tattaaatgt tgcacatcat tcattcactg
   147481 tatagtaatc attgactaaa gccatttgtc tgtgttttct tcttgtggtt gtatatatca
   147541 ggtaaaatat tttccaaaga gccatgtgtc atgtaatact gaaccacttt gatattgaga
   147601 cattaatttg tacccttgtt attatctact agtaataatg taatactgta gaaatattgc
   147661 tctaattctt ttcaaaattg ttgcatcccc cttagaatgt ttctatttcc ataaggattt
   147721 aggtatgcta ttatcccttc ttatacccta agatgaagct gtttttgtgc tctttgttca
   147781 tcattggccc tcattccaag cactttacgc tgtctgtaat gggatctatt tttgcactgg
   147841 aatatctgag aattgcaaaa ctagacaaaa gtttcacaac agatttctaa gttaaatcat
   147901 tttcattaaa aggaaaaaag aaaaaaaatt ttgtatgtca ataactttat atgaagtatt
   147961 aaaatgcata tttctatgtt gtaatataat gagtcacaaa ataaagctgt gacagttctg
   148021 ttggtctaca gaaatttact tttgtgcatt tgtggcacca cctactgttg aagggttata
   148081 aagccattag aaaagtagag gggaagtgat ttggatcaaa aggaaaaact ttagaaaaga
   148141 ttcaaatgtt cccttaatca taaaagagaa ctgaggggac tacttgaaaa taaaaggttg
   148201 ttttgtattt tcatgttggt taagatactg agtaactggt attaagtgtt agaggttttt
   148261 agataaatat tctgcttaat gattatgaag ctgcactgag atttctgaaa atgctctgta
   148321 gctgagctta tttaataaat gttcacttgg tataggggaa gctacaaagg cagccttcag
   148381 tgtccttttg tttattcaac caaaaatata aggacacaat gtagcagtta tactgggaag
   148441 gtgctggggg tggtggcaat ggtgagcagg aaggcgaagt agatatggaa acagaaatga
   148501 tactaatatc ggtgattcct tccttttttc ctgtaataag tgctgtgcag acaacatatg
   148561 agcagtgctg ataaatgtaa atgtattttt catagctcat taagaatcag tttcagaaag
   148621 agatgtctgc ttattttgct acttgaagaa tccctgtcaa acagtccttt tgaggaagta
   148681 caagaggctg tctctatttg tgacctcagg aatggctgtg acagtgtcgt gagcagtcct
   148741 tttcctgtgg cacagatctg aactttgtgt gcagaaaaat cttggcttca agtgagccaa
   148801 gatgccccct gagcatcagc atcacaactt catcctccta tcttgaagtt catgttatag
   148861 tgactttaat gaaatcatag aacactgttt cttcgtgaac aatgacgagg gagaggaaaa
   148921 aactttattg aaaaataaaa aggcaggtaa tttagatgaa aatatgttac ccatgaggtt
   148981 ttgtttttgc tttttgtttt tgtttttgag aaacagaatc tcgctctgtc gtcc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF064244. Homo sapiens inte...[gi:3859854] Links  


LOCUS       AF064244                7247 bp    mRNA    linear   PRI 21-NOV-1998
DEFINITION  Homo sapiens intersectin long form mRNA, complete cds.
ACCESSION   AF064244
VERSION     AF064244.1  GI:3859854
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 7247)
  AUTHORS   Guipponi,M., Scott,H.S., Chen,H., Schebesta,A., Rossier,C. and
            Antonarakis,S.E.
  TITLE     Two isoforms of a human intersectin (ITSN) protein are produced by
            brain-specific alternative splicing in a stop codon
  JOURNAL   Genomics 53 (3), 369-376 (1998)
  MEDLINE   99017974
   PUBMED   9799604
REFERENCE   2  (bases 1 to 7247)
  AUTHORS   Guipponi,M., Scott,H.S., Chen,H., Schebesta,A., Rossier,C. and
            Antonarakis,S.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-MAY-1998) Genetics and Microbiology, CMU, 1 rue
            Michel-Servet, Geneva 4 CH-1211, Switzerland
FEATURES             Location/Qualifiers
     source          1..7247
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="21"
                     /map="21q22.1-q22.2"
                     /tissue_type="brain"
                     /dev_stage="fetus"
     CDS             107..5272
                     /codon_start=1
                     /product="intersectin long form"
                     /protein_id="AAC78611.1"
                     /db_xref="GI:3859855"
                     /translation="MAQFPTPFGGSLDIWAITVEERAKHDQQFHSLKPISGFITGDQA
                     RNFFFQSGLPQPVLAQIWALADMNNDGRMDQVEFSIAMKLIKLKLQGYQLPSALPPVM
                     KQQPVAISSAPPFGMGGIASMPPLTAVAPVPMGSIPVVGMSPTLVSSVPTAAVPPLAN
                     GAPPVIQPLPAFAHPAATLPKSSSFSRSGPGSQLNTKLQKAQSFDVASVPPVAEWAVP
                     QSSRLKYRQLFNSHDKTMSGHLTGPQARTILMQSSLPQAQLASIWNLSDIDQDGKLTA
                     EEFILAMHLIDVAMSGQPLPPVLPPEYIPPSFRRVRSGSGISVISSTSVDQRLPEEPV
                     LEDEQQQLEKKLPVTFEDKKRENFERGNLELEKRRQALLEQQRKEQERLAQLERAEQE
                     RKERERQEQERKRQLELEKQLEKQRELERQREEERRKEIERREAAKRELERQRQLEWE
                     RNRRQELLNQRNKEQEDIVVLKAKKKTLEFELEALNDKKHQLEGKLQDIRCRLTTQRQ
                     EIESTNKSRELRIAEITHLQQQLQESQQMLGRLIPEKQILNDQLKQVQQNSLHRDSLV
                     TLKRALEAKELARQHLRDQLDEVEKETRSKLQEIDIFNNQLKELREIHNKQQLQKQKS
                     MEAERLKQKEQERKIIELEKQKEEAQRRAQERDKQWLEHVQQEDEHQRPRKLHEEEKL
                     KREESVKKKDGEEKGKQEAQDKLGRLFHQHQEPAKPAVQAPWSTAEKGPLTISAQENV
                     KVVYYRALYPFESRSHDEITIQPGDIVMVKGEWVDESQTGEPGWLGGELKGKTGWFPA
                     NYAEKIPENEVPAPVKPVTDSTSAPAPKLALRETPAPLAVTSSEPSTTPNNWADFSST
                     WPTSTNEKPETDNWDAWAAQPSLTVPSAGQLRQRSAFTPATATGSSPSPVLGQGEKVE
                     GLQAQALYPWRAKKDNHLNFNKNDVITVLEQQDMWWFGEVQGQKGWFPKSYVKLISGP
                     IRKSTSMDSGSSESPASLKRVASPAAKPVVSGEEFIAMYTYESSEQGDLTFQQGDVIL
                     VTKKDGDWWTGTVGDKAGVFPSNYVRLKDSEGSGTAGKTGSLGKKPEIAQVIASYTAT
                     GPEQLTLAPGQLILIRKKNPGGWWEGELQARGKKRQIGWFPANYVKLLNPGTSKITPT
                     EPPKSTALAAVCQVIGMYDYTAQNDDELAFNKGQIINVLNKEDPDWWKGEVNGQVGLF
                     PSNYVKLTTDMDPSQQWCSDLHLLDMLTPTERKRQGYIHELIVTEENYVNDLQLVTEI
                     FQKPLMESELLTEKEVAMIFVNWKELIMCNIKLLKALRVRKKMSGEKMPVKMIGDILS
                     AQLPHMQPYIRFCSRQLNGAALIQQKTDEAPDFKEFVKRLEMDPRCKGMPLSSFILKP
                     MQRVTRYPLIIKNILENTPENHPDHSHLKHALEKAEELCSQVNEGVREKENSDRLEWI
                     QAHVQCEGLSEQLVFNSVTNCLGPRKFLHSGKLYKAKNNKELYGFLFNDFLLLTQITK
                     PLGSSGTDKVFSPKSNLQYKMYKTPIFLNEVLVKLPTDPSGDEPIFHISHIDRVYTLR
                     AESINERTAWVQKIKAASELYIETEKKKREKAYLVRSQRATGIGRLMVNVVEGIELKP
                     CRSHGKSNPYCEVTMGSQCHITKTIQDTLNPKWNSNCQFFIRDLEQEVLCITVFERDQ
                     FSPDDFLGRTEIRVADIKKDQGSKGPVTKCLLLHEVPTGEIVVRLDLQLFDEP"
     misc_feature    167..406
                     /note="encodes EH domain"
     misc_feature    767..1936
                     /note="encodes EH domain"
     misc_feature    2324..2524
                     /note="encodes SH3 domain"
     misc_feature    2843..3019
                     /note="encodes SH3 domain"
     misc_feature    3110..3286
                     /note="encodes SH3 domain"
     misc_feature    3326..3520
                     /note="encodes SH3 domain"
     misc_feature    3569..3748
                     /note="encodes SH3 domain"
     misc_feature    3836..4390
                     /note="encodes GDS domain"
     misc_feature    4649..4819
                     /note="encodes PH domain"
     misc_feature    4895..5143
                     /note="encodes C2 domain"
BASE COUNT     2156 a   1652 c   1747 g   1692 t
ORIGIN      
        1 gcgtccctcc cagcggcgcg tgagcggcac tgatttgtcc ctggggcggc agcgcggacc
       61 cgcccggaga tgaggcgtcg attagcaagg taaaagtaac agaaccatgg ctcagtttcc
      121 aacacctttt ggtggcagcc tggatatctg ggccataact gtagaggaaa gagcgaagca
      181 tgatcagcag ttccatagtt taaagccaat atctggattc attactggtg atcaagctag
      241 aaactttttt tttcaatctg ggttacctca acctgtttta gcacagatat gggcactagc
      301 tgacatgaat aatgatggaa gaatggatca agtggagttt tccatagcta tgaaacttat
      361 caaactgaag ctacaaggat atcagctacc ctctgcactt ccccctgtca tgaaacagca
      421 accagttgct atttctagcg caccaccatt tggtatggga ggtatcgcca gcatgccacc
      481 gcttacagct gttgctccag tgccaatggg atccattcca gttgttggaa tgtctccaac
      541 cctagtatct tctgttccca cagcagctgt gccccccctg gctaacgggg ctccccctgt
      601 tatacaacct ctgcctgcat ttgctcatcc tgcagccaca ttgccaaaga gttcttcctt
      661 tagtagatct ggtccagggt cacaactaaa cactaaatta caaaaggcac agtcatttga
      721 tgtggccagt gtcccaccag tggcagagtg ggctgttcct cagtcatcaa ggctgaaata
      781 caggcaatta ttcaatagtc atgacaaaac tatgagtgga cacttaacag gtccccaagc
      841 aagaactatt cttatgcagt caagtttacc acaggctcag ctggcttcaa tatggaatct
      901 ttctgacatt gatcaagatg gaaaacttac agcagaggaa tttatcctgg caatgcacct
      961 cattgatgta gctatgtctg gccaaccact gccacctgtc ctgcctccag aatacattcc
     1021 accttctttt agaagagttc gatctggcag tggtatatct gtcataagct caacatctgt
     1081 agatcagagg ctaccagagg aaccagtttt agaagatgaa caacaacaat tagaaaagaa
     1141 attacctgta acgtttgaag ataaaaagcg ggagaacttt gaacgtggca acctggaact
     1201 ggagaaacga aggcaagctc tcctggaaca gcagcgcaag gagcaggagc gcctggccca
     1261 gctggagcgg gcggagcagg agaggaagga gcgtgagcgc caggagcaag agcgcaaaag
     1321 acaactggaa ctggagaagc aactggaaaa gcagcgggag ctagaacggc agagagagga
     1381 ggagaggagg aaagaaattg agaggcgaga ggctgcaaaa cgggaacttg aaaggcaacg
     1441 acaacttgag tgggaacgga atcgaaggca agaactacta aatcaaagaa acaaagaaca
     1501 agaggacata gttgtactga aagcaaagaa aaagactttg gaatttgaat tagaagctct
     1561 aaatgataaa aagcatcaac tagaagggaa acttcaagat atcagatgtc gattgaccac
     1621 ccaaaggcaa gaaattgaga gcacaaacaa atctagagag ttgagaattg ccgaaatcac
     1681 ccatctacag caacaattac aggaatctca gcaaatgctt ggaagactta ttccagaaaa
     1741 acagatactc aatgaccaat taaaacaagt tcagcagaac agtttgcaca gagattcact
     1801 tgttacactt aaaagagcct tagaagcaaa agaactagct cggcagcacc tacgagacca
     1861 actggatgaa gtggagaaag aaactagatc aaaactacag gagattgata ttttcaataa
     1921 tcagctgaag gaactaagag aaatacacaa taagcaacaa ctccagaagc aaaagtccat
     1981 ggaggctgaa cgactgaaac agaaagaaca agaacgaaag atcatagaat tagaaaaaca
     2041 aaaagaagaa gcccaaagac gagctcagga aagggacaag cagtggctgg agcatgtgca
     2101 gcaggaggac gagcatcaga gaccaagaaa actccacgaa gaggaaaaac tgaaaaggga
     2161 ggagagtgtc aaaaagaagg atggcgagga aaaaggcaaa caggaagcac aagacaagct
     2221 gggtcggctt ttccatcaac accaagaacc agctaagcca gctgtccagg caccctggtc
     2281 cactgcagaa aaaggtccac ttaccatttc tgcacaggaa aatgtaaaag tggtgtatta
     2341 ccgggcactg tacccctttg aatccagaag ccatgatgaa atcactatcc agccaggaga
     2401 catagtcatg gttaaagggg aatgggtgga tgaaagccaa actggagaac ccggctggct
     2461 tggaggagaa ttaaaaggaa agacagggtg gttccctgca aactatgcag agaaaatccc
     2521 agaaaatgag gttcccgctc cagtgaaacc agtgactgat tcaacatctg cccctgcccc
     2581 caaactggcc ttgcgtgaga cccccgcccc tttggcagta acctcttcag agccctccac
     2641 gacccctaat aactgggccg acttcagctc cacgtggccc accagcacga atgagaaacc
     2701 agaaacggat aactgggatg catgggcagc ccagccctct ctcaccgttc caagtgccgg
     2761 ccagttaagg cagaggtccg cctttactcc agccacggcc actggctcct ccccgtctcc
     2821 tgtgctaggc cagggtgaaa aggtggaggg gctacaagct caagccctat atccttggag
     2881 agccaaaaaa gacaaccact taaattttaa caaaaatgat gtcatcaccg tcctggaaca
     2941 gcaagacatg tggtggtttg gagaagttca aggtcagaag ggttggttcc ccaagtctta
     3001 cgtgaaactc atttcagggc ccataaggaa gtctacaagc atggattctg gttcttcaga
     3061 gagtcctgct agtctaaagc gagtagcctc tccagcagcc aagccggtcg tttcgggaga
     3121 agaatttatt gccatgtaca cttacgagag ttctgagcaa ggagatttaa cctttcagca
     3181 aggggatgtg attttggtta ccaagaaaga tggtgactgg tggacaggaa cagtgggcga
     3241 caaggccgga gtcttccctt ctaactatgt gaggcttaaa gattcagagg gctctggaac
     3301 tgctgggaaa acagggagtt taggaaaaaa acctgaaatt gcccaggtta ttgcctcata
     3361 caccgccacc ggccccgagc agctcactct cgcccctggt cagctgattt tgatccgaaa
     3421 aaagaaccca ggtggatggt gggaaggaga gctgcaagca cgtgggaaaa agcgccagat
     3481 aggctggttc ccagctaatt atgtaaagct tctaaaccct gggacgagca aaatcactcc
     3541 aacagagcca cctaagtcaa cagcattagc ggcagtgtgc caggtgattg ggatgtacga
     3601 ctacaccgcg cagaatgacg atgagctggc cttcaacaag ggccagatca tcaacgtcct
     3661 caacaaggag gaccctgact ggtggaaagg agaagtcaat ggacaagtgg ggctcttccc
     3721 atccaattat gtgaagctga ccacagacat ggacccaagc cagcaatggt gttcagactt
     3781 acatctcttg gatatgttga ccccaactga aagaaagcga caaggataca tccacgagct
     3841 cattgtcacc gaggagaact atgtgaatga cctgcagctg gtcacagaga tttttcaaaa
     3901 acccctgatg gagtctgagc tgctgacaga aaaagaggtt gctatgattt ttgtgaactg
     3961 gaaggagctg attatgtgta atatcaaact actaaaagcg ctgagagtcc gcaagaagat
     4021 gtccggggag aagatgcctg tgaagatgat tggagacatc ctgagcgcac agctgccgca
     4081 catgcagccc tacatccgct tctgcagccg ccagctcaac ggggctgccc tgatccagca
     4141 gaagacggac gaggccccag acttcaagga gttcgtcaaa agattggaaa tggatcctcg
     4201 gtgtaaaggg atgccactct ctagttttat actgaagcct atgcaacggg taacaagata
     4261 cccactgatc attaaaaata tcctggaaaa cacccctgaa aaccacccgg accacagcca
     4321 cttgaagcac gccctggaga aggcggaaga gctctgttcc caggtgaacg aaggggtgcg
     4381 ggagaaggag aactctgacc ggctggagtg gatccaggcc cacgtgcagt gtgaaggcct
     4441 gtctgagcaa cttgtgttca attcagtgac caattgcttg gggccgcgca aatttctgca
     4501 cagtgggaag ctctacaagg ccaagaacaa caaggagctg tatggcttcc ttttcaacga
     4561 cttcctcctg ctgactcaga tcacgaagcc tttggggtct tctggcaccg acaaagtctt
     4621 cagccccaaa tcaaacctgc agtataaaat gtataaaaca cctattttcc taaatgaggt
     4681 tctagtaaaa ttacccaccg acccttctgg agacgagccc atcttccaca tctcccacat
     4741 tgaccgcgtc tatactctcc gagcagaaag cataaatgaa aggactgcct gggtgcagaa
     4801 aatcaaagct gcttctgaac tttacataga gactgagaaa aagaagcgcg agaaagcgta
     4861 cctggtccgt tcccaaaggg caacaggcat tggaaggttg atggtgaacg tggttgaagg
     4921 catcgagttg aaaccctgtc ggtcacatgg aaagagcaac ccgtactgtg aggtgaccat
     4981 gggttcccag tgccacatca ccaagacgat ccaggacact ctgaacccca agtggaattc
     5041 caactgccag ttcttcatcc gagacctgga gcaggaagtc ctctgcatca ctgtgttcga
     5101 gagggaccag ttctcaccag atgatttttt gggtcggacg gagatccgtg tggcggacat
     5161 caagaaagac cagggctcca aaggtccagt tacgaagtgt cttctgctgc acgaagtccc
     5221 cacgggagag attgtggtcc gcttggacct gcagttgttt gatgagccgt aggcagcggg
     5281 ctcagggtgt gctcagcagg gtcccagccc acggccacac atgctgtctg gaaattgtat
     5341 tccttttcta agaaaccacc atttggtatt cagtcacagg gatatgggat ggcaaagaca
     5401 ggcccctcaa agctcctagg aatcattctc gacaatcctc cctgccccga aacaatttcc
     5461 tgtttcatga aacaaagctg tgttttcctt tgtcctcact acaggtctca ttatggcttc
     5521 tagggtcgct gaaatcccat agccctcaac agggtgcagc tgggagtcta gccccttccc
     5581 gggcttgagg gatgggtctg gttactataa aatagattta taaatgcaat gtctatattt
     5641 ttggagaact catgtaaccc tcctgtttct tacatccacc agtccccaag tagacttctt
     5701 ggcctacaat gcccagtcct tggtgtgagt ttagaaacaa ttatgacggt cctgtcattg
     5761 cttcagaatc ccatctctcc tgcagggaaa tgctgcctag agctgatcac tcggtgagac
     5821 ggtctgatca ggccctggct tagctctttg aagagctggt ctatggaagt ttccagcatg
     5881 tgcaccgtta tagccgttcc ttccccctct aggccttgta ttaatatatg tcaatgaaaa
     5941 cacactggtg tattgttgcg tggattcagt tctgattccc agcatgctta gaatatggtc
     6001 acagaaagtc attatctaga aagtcacccc tctgctggat cagatcacta caggtcactg
     6061 gaaaggcaac tttacaatgt tgggtcactg ggtctcggtt ggcagccatg ttggaaaaat
     6121 ctcttttggc tcggaggcct gtgatatttc atagcagcag tcgttgctgg tgacctgttc
     6181 tgtgcttgaa tgtgctgaat cctgattgtt gtaggacatt tcaacagctc tttttggtac
     6241 gttccccaaa aagccatgtc ctagatcccc aaggcgtgaa aaggaaaaat atcaagctgg
     6301 aggttgggaa agaaaatgaa ggcagtccat tatgtggtgg gtgaaagacc ctaggaggat
     6361 gcaagccccg cacatcccgg ggcaaagacc taagacactt ttccaccctc caccacccca
     6421 acctcacata atatgcttgt tgcaagagtc aggactttat gactatgtgc caagctgttt
     6481 ggtttgagtt ctttaatttt tttttccctt aaatgccagg agatcatctg gttagttaga
     6541 tagtaacttg atttgctaat gaaaagtggg ggccgtgttt tgtttgcatg ttaatattct
     6601 cataatccta gtttgttgtg gtcatgaaat gccctttgca tgttctgttg gtactggagt
     6661 ctagctttcc tgtactagat ggtgttctct ttgattgtag gtccttagac tttaattagg
     6721 gttatcaaag tgctttctaa atgatagcat cagcgttgtg gcagagtacc tcctttgctg
     6781 ggaactgaat gtgtagggtt atcatttccc atgagagccc ggtcatactt caagcaattt
     6841 ttttaaaagt gtgtgttgga aaggacaaca aagtttacat ttcatacttt taagaaatac
     6901 tttattattt atttattgaa gatagtgtag aattttgtat caagaacaac agacataagt
     6961 attttttgaa acaagcaaat ataccctgta gttagaaact ttcaactgaa catgttagag
     7021 accaagttta acttcaggca tgcatttgtt taccatttcc cagcagaaaa catggttaaa
     7081 atactttaag tttatatttt ttgatgttgt taagaaactt ttaaattaaa tctataaata
     7141 gacatgcaac tcatgctttc ctatttctat aaccaacacc gtttgtttag tgtatttatg
     7201 aaagatatgc taccatggta gaaagaaaag tattcaatgt gtaaatt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X61498. H.sapiens mRNA fo...[gi:35039] Links  


LOCUS       HSNFKBS                 3001 bp    mRNA    linear   PRI 29-APR-1992
DEFINITION  H.sapiens mRNA for NF-kB subunit.
ACCESSION   X61498
VERSION     X61498.1  GI:35039
KEYWORDS    NF-kb subunit.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3001)
  AUTHORS   Schmid,R.M., Perkins,N.D., Duckett,C.S., Andrews,P.C. and
            Nabel,G.J.
  TITLE     Cloning of an NF-kappa B subunit which stimulates HIV transcription
            in synergy with p65
  JOURNAL   Nature 352 (6337), 733-736 (1991)
  MEDLINE   91343004
REFERENCE   2  (bases 1 to 3001)
  AUTHORS   Schmid,R.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-AUG-1991) R.M. Schmid, Howard Hughes Medical Inst,
            Dept of Internal Medicine and Biological Chemistry, 1150 West
            Medical Center Drive, Ann Harbor Michigan 48109, USA
FEATURES             Location/Qualifiers
     source          1..3001
                     /organism="Homo sapiens"
                     /strain="B-lymophoid leukaemia"
                     /db_xref="taxon:9606"
     CDS             164..2965
                     /codon_start=1
                     /product="NF-kB subunit"
                     /protein_id="CAA43715.1"
                     /db_xref="GI:35040"
                     /db_xref="SWISS-PROT:Q00653"
                     /translation="MESCYNPGLDGIIEYDDFKLNSSIVEPKEPAPETADGPYLVIVE
                     QPKQRGFRFRYGCEGPSHGGLPGASSEKGRKTYPTVKICNYEGPAKIEVDLVTHSDPP
                     RAHAHSLVGKQCSELGICAVSVGPKDMTAQFNNLGVLHVTKKNMMGTMIQKLQRQRLR
                     SRPQGLTEAEQRELEQEAKELKKVMDLSIVRLRFSAFLRASDGSFSLPLKPVTSQPIH
                     DSKSPGASNLKISRMDKTAGSVRGGDEVYLLCDKVQKDDIEVRFYEDDENGWQAFGDF
                     SPTDVHKQYAIVFRTPPYHKMKIERPVTVFLQLKRKRGGDVSDSKQFTYYPLVEDKEE
                     VQRKRRKALPTFSQPFGGGSHMGGGSGGAAGGYGGAGGGGSLGFFPSSLAYSPYQSGA
                     GPMRCYPGGGGGAQMAATVPSRDSGEEAAEPSAPSRTPQCEPQAPEMLQRAREYNARL
                     FGLAHAAPSPTRLLRHRGRRALLAGQRHLLTAQDENGDTPLHLAIIHGQTSVIEQIVY
                     VIHHAQDLGVVNLTNHLHQTPLHLAVITGQTSVVSFLLRVGADPALLDRHGDSAMHLA
                     LRAGAGAPELLRALLQSGAPAVPQLLHMPDFEGLYPVHLAVRARSPECLDLLVDSGAE
                     VEATERQGGRTALHLATEMEELGLVTHLVTKLRANVNARTFAGNTPLHLAAGLGYPTL
                     TRLLLKAGADIHAENEEPLCPLPSPPTSDSDSDSEGPEKDTRSSFRGHTPLDLTCSTL
                     VKTLLLNAAQNTMEPPLTPPSPAGPGLSLGDTALQNLEQLLDGPEAQGSWAELAERLG
                     LRSLVDTYRQTTSPSGSLLRSYELAGGDLAGLLEALSDMGLEEGVRLLRGPETRDKLP
                     STEVKEDSAYGSQSVEQEAEKLGPPPEPPGGLSHGHPQPQVTDLLPAPSPLPGPPVQR
                     PHLFQILFNTPHPPLSWDK"
BASE COUNT      643 a    924 c    912 g    522 t
ORIGIN      
        1 actttcctgc cccttccccg gccaagccca actccggatc tcgctctcca ccggatctca
       61 cccgccacac ccggacaggc ggctggagga ggcgggcgtc taaaattctg ggaagcagaa
      121 cctggccgga gccactagac agagccgggc ctagcccaga gacatggaga gttgctacaa
      181 cccaggtctg gatggtatta ttgaatatga tgatttcaaa ttgaactcct ccattgtgga
      241 acccaaggag ccagccccag aaacagctga tggcccctac ctggtgatcg tggaacagcc
      301 taagcagaga ggcttccgat ttcgatatgg ctgtgaaggc ccctcccatg gaggactgcc
      361 cggtgcctcc agtgagaagg gccgaaagac ctatcccact gtcaagatct gtaactacga
      421 gggaccagcc aagatcgagg tggacctggt aacacacagt gacccacctc gtgctcatgc
      481 ccacagtctg gtgggcaagc aatgctcgga gctggggatc tgcgccgttt ctgtggggcc
      541 caaggacatg actgcccaat ttaacaacct gggtgtcctg catgtgacta agaagaacat
      601 gatggggact atgatacaaa aacttcagag gcagcggctc cgctctaggc cccagggcct
      661 tacggaggcc gagcagcggg agctggagca agaggccaaa gaactgaaga aggtgatgga
      721 tctgagtata gtgcggctgc gcttctctgc cttccttaga gccagtgatg gctccttctc
      781 cctgcccctg aagccagtca cctcccagcc catccatgat agcaaatctc cgggggcatc
      841 aaacctgaag atttctcgaa tggacaagac agcaggctct gtgcggggtg gagatgaagt
      901 ttatctgctt tgtgacaagg tgcagaaaga tgacattgag gttcggttct atgaggatga
      961 tgagaatgga tggcaggcct ttggggactt ctctcccaca gatgtgcata aacagtatgc
     1021 cattgtgttc cggacacccc cctatcacaa gatgaagatt gagcggcctg taacagtgtt
     1081 tctgcaactg aaacgcaagc gaggagggga cgtgtctgat tccaaacagt tcacctatta
     1141 ccctctggtg gaagacaagg aagaggtgca gcggaagcgg aggaaggcct tgcccacctt
     1201 ctcccagccc ttcgggggtg gctcccacat gggtggaggc tctgggggtg cagccggggg
     1261 ctacggagga gctggaggag gtggcagcct cggtttcttc ccctcctccc tggcctacag
     1321 cccctaccag tccggcgcgg gccccatgcg gtgctacccg ggaggcgggg gcggggcgca
     1381 gatggccgcc acggtgccca gcagggactc cggggaggaa gccgcggagc cgagcgcccc
     1441 ctccaggacc ccccagtgcg agccgcaggc cccggagatg ctgcagcgag ctcgagagta
     1501 caacgcgcgc ctgttcggcc tggcgcacgc agccccgagc cctactcgac tactgcgtca
     1561 ccgcggacgc cgcgcgctgc tggcgggaca gcgccacctg ctgacggcgc aggacgagaa
     1621 cggagacaca ccactgcacc tagccatcat ccacgggcag accagtgtca ttgagcagat
     1681 agtctatgtc atccaccacg cccaggacct cggcgttgtc aacctcacca accacctgca
     1741 ccagacgccc ctgcacctgg cggtgatcac ggggcagacg agtgtggtga gctttctgct
     1801 gcgggtaggt gcagacccag ctctgctgga tcggcatgga gactcagcca tgcatctggc
     1861 gctgcgggca ggcgctggtg ctcctgagct gctgcgtgca ctgcttcaga gtggagctcc
     1921 tgctgtgccc cagctgttgc atatgcctga ctttgaggga ctgtatccag tacacctggc
     1981 ggtccgagcc cgaagccctg agtgcctgga tctgctggtg gacagtgggg ctgaagtgga
     2041 ggccacagag cggcaggggg gacgaacagc cttgcatcta gccacagaga tggaggagct
     2101 ggggttggtc acccatctgg tcaccaagct ccgggccaac gtgaacgctc gcacctttgc
     2161 gggaaacaca cccctgcacc tggcagctgg actggggtac ccgaccctca cccgcctcct
     2221 tctgaaggct ggtgctgaca tccatgctga aaacgaggag cccctgtgcc cactgccttc
     2281 accccctacc tctgatagcg actcggactc tgaagggcct gagaaggaca cccgaagcag
     2341 cttccggggc cacacgcctc ttgacctcac ttgcagcacc ttggtgaaga ccttgctgct
     2401 aaatgctgct cagaacacca tggagccacc cctgaccccg cccagcccag cagggccggg
     2461 actgtcactt ggtgatacag ctctgcagaa cctggagcag ctgctagacg ggccagaagc
     2521 ccagggcagc tgggcagagc tggcagagcg tctggggctg cgcagcctgg tagacacgta
     2581 ccgacagaca acctcaccca gtggcagcct cctgcgcagc tacgagctgg ctggcgggga
     2641 cctggcaggt ctactggagg ccctgtctga catgggccta gaggagggag tgaggctgct
     2701 gaggggtcca gaaacccgag acaagctgcc cagcacagag gtgaaggaag acagtgcgta
     2761 cgggagccag tcagtggagc aggaggcaga gaagctgggc ccaccccctg agccaccagg
     2821 agggctctcg cacgggcacc cccagcctca ggtgactgac ctgctgcctg cccccagccc
     2881 ccttcccgga ccccctgtac agcgtcccca cctatttcaa atcttattta acaccccaca
     2941 cccacccctc agttgggaca aataaaggat tctcatggga aggggaggac cccgaattcc
     3001 t
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X99133. H.sapiens NGAL ge...[gi:1657330] Links  


LOCUS       HSNGALGEN               5869 bp    DNA     linear   PRI 27-OCT-1997
DEFINITION  H.sapiens NGAL gene.
ACCESSION   X99133
VERSION     X99133.1  GI:1657330
KEYWORDS    NGAL gene.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Cowland,J.B. and Borregaard,N.
  TITLE     Molecular characterization and pattern of tissue expression of the
            gene for neutrophil gelatinase-associated lipocalin from humans
  JOURNAL   Genomics 45 (1), 17-23 (1997)
  MEDLINE   97480711
REFERENCE   2  (bases 1 to 5869)
  AUTHORS   Cowland,J.B.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-JUL-1996) J.B. Cowland, Granulocyte Research
            Laboratory, Dept of Hematology, National Univ. Hosp.,
            Rigshospitalet L-4041, 9 Blegdamsvej, DK-2100 Copenhagen, DENMARK
FEATURES             Location/Qualifiers
     source          1..5869
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_type="leukocyte"
     TATA_signal     1663..1669
     prim_transcript 1696..5691
     exon            1696..1902
                     /number=1
     misc_feature    1696
                     /note="CAP-site"
     gene            join(1765..1902,2476..2612,3877..3956,4145..4264,
                     4422..4523,5339..5358)
                     /gene="NGAL"
     CDS             join(1765..1902,2476..2612,3877..3956,4145..4264,
                     4422..4523,5339..5358)
                     /gene="NGAL"
                     /codon_start=1
                     /protein_id="CAA67574.1"
                     /db_xref="GI:1657331"
                     /db_xref="SWISS-PROT:P80188"
                     /translation="MPLGLLWLGLALLGALHAQAQDSTSDLIPAPPLSKVPLQQNFQD
                     NQFQGKWYVVGLAGNAILREDKDPQKMYATIYELKEDKSYNVTSVLFRKKKCDYWIRT
                     FVPGCQPGEFTLGNIKSYPGLTSYLVRVVSTNYNQHAMVFFKKVSQNREYFKITLYGR
                     TKELTSELKENFIRFSKYLGLPENHIVFPVPIDQCIDG"
     intron          1903..2475
                     /gene="NGAL"
                     /number=1
     exon            2476..2612
                     /gene="NGAL"
                     /number=2
     intron          2613..3876
                     /gene="NGAL"
                     /number=2
     exon            3877..3956
                     /gene="NGAL"
                     /number=3
     intron          3957..4144
                     /gene="NGAL"
                     /number=3
     exon            4145..4264
                     /gene="NGAL"
                     /number=4
     intron          4265..4421
                     /gene="NGAL"
                     /number=4
     exon            4422..4523
                     /gene="NGAL"
                     /number=5
     intron          4524..5338
                     /gene="NGAL"
                     /number=5
     exon            5339..5365
                     /number=6
     intron          5366..5549
                     /number=6
     exon            5550..5691
                     /number=7
     polyA_signal    5673..5678
BASE COUNT     1207 a   1782 c   1766 g   1114 t
ORIGIN      
        1 ctcgaggatc tcggctcact gcaacctccg cctcccaggt tcaagctgtt cttctgcctc
       61 agcctcccga gtagctggga ttacaggcgc ctgccaccat gccctgctaa tttttgtatt
      121 tttagtagag atggggtttc accgtgttgg ccagactggt ctcgaactcc tgacctcgtg
      181 atccacccgc ctcagcctcc caaatgctgg gattacagat gtgagccacc gcacccggcc
      241 tggcagagga tactttttaa ggtcaaagac agtagcagag gtggagttcc tgggaacagg
      301 gtcatgaggg gaagaggggg ttcggaggga gcgagtagcc actggctacc tctagaaagg
      361 gaaggctttg gtgcaacatc gttcccctgc agttttactc atctttgctt cctgcccttt
      421 catcatccaa tcgggcaggc aggacagggc ctgagggggc agggatccag tgggtgcctc
      481 tctagactaa ccccagctca ggactcccag agccccttcc ctgaggccct gctgccccca
      541 agcccagatt ggggatccca agcagcacgt aggcagagcc agtgaggtcc ccgttagtcc
      601 cattgaaagc tctaaaacca gcgaaccctc agtccagcct caggtcaggc atccaggacg
      661 gcctcagccc tcatgggtga gccatctctg cggacactgc acagggccta cgatccatcg
      721 ctgcctcccg aggatgccag ccaggccccc gttgagataa ctgcttccct gctggacaag
      781 gctgggacca gccatctcgg tgacagttcc agaacccctg gcctgggctg ctgggttcaa
      841 tggaaaaagg ctgtgactag agtcaggggg atggtctcag tgacctcaag gataaggcca
      901 gatccttgca ctgtcagtga cccaaagcaa caggtgtcca gagcagcagt gtggcgcctt
      961 cacgccccca cacatcagcc caactcaccc aggacaggga ctgtagcctc agcactcaac
     1021 ccatgtgccc tgtgtggggt ctcttcccac tgcactcaca ggagaggaag ggtccctcag
     1081 gggtccactg gggtcccctc ctgcaaatgg ggcaaggaga ggggcaaggg gctgtctcaa
     1141 ggcccctgga gcacatgcag gtcctggact ggggctcctg ggagggccat gattctgggc
     1201 tccatgagtt cagagcagac gccttgtttt tccttgtcca ctgtcagcca ccccaccctt
     1261 ccctgaccct taaaagaacc aggaaacagc acatgatctg ttggaaggag gcattcattc
     1321 tttcctttct gtgggtgtgg ggaggggacc acagggcaca taccccaccc tgggatccag
     1381 ctgagcaggg gggtcagaga tgacagctct tccggctcac aggccaccgg cccacataca
     1441 gggcaatcag aagaaagaaa cagcacaagg aaggcacaga gggagtcgtt gtccctgcca
     1501 gaggtgcagc actccgggaa tgtccctcac tctccccgtc cctctgtctt gcccaatcct
     1561 gaccaggtgc agaaatcttg ccaagtgttt ccgcaggagt tgctggcaat tgcctcacat
     1621 tcctggcctt ggcaaagaat gaatcaaccc accctagatc ccataaatag ggccacccag
     1681 gtgagcctct cactcgccac ctcctcttcc acccctgcca ggcccagcag ccaccacagc
     1741 gcctgcttcc tcggccctga aatcatgccc ctaggtctcc tgtggctggg cctagccctg
     1801 ttgggggctc tgcatgccca ggcccaggac tccacctcag acctgatccc agccccacct
     1861 ctgagcaagg tccctctgca gcagaacttc caggacaacc aagtaagggg ccaagagggg
     1921 cacctgcagg cagggcctgg ggaagagtgg gagcagaggg gaggagaggt gaagagactc
     1981 aggaagagcg ttgggcagga cttaggagtc cagggtccag gtttcagctc actctgtgcc
     2041 accagggtcc cctggtggaa accatgcccc ttccccccat ccccaccccc tctcagcctg
     2101 aacagactcc cccaggtcca catcccctct cccataaccc ccattgtcca aagaaggtgg
     2161 gagcactttt agtccccctg cacagatgag gaaactgagg ctcaggaagg cccaccagcc
     2221 acatgcctcc tccagtgagg aggtcaccct cctccctgcc agactcagaa ccgcctcttc
     2281 ccccaggact cccttctgga ctgatggcct cctgctcctg ccccttcacc agtgcaggcc
     2341 cagcctgggc cctgctgccc agctagaggg gctcatggtt ccaagctggg cggcccagag
     2401 gtgccacagg gacagagctg gaggggtggc tcctagggcc attcctgggt tgtgcctctt
     2461 atcagtccct tgcagttcca ggggaagtgg tatgtggtag gcctggcagg gaatgcaatt
     2521 ctcagagaag acaaagaccc gcaaaagatg tatgccacca tctatgagct gaaagaagac
     2581 aagagctaca atgtcacctc cgtcctgttt aggtgagggc cgacatctcc tgggggtgtg
     2641 agagtcagac tgacgtcaca ggcaagggat ggccaaagct gagggatcct gtcgttcacc
     2701 tcgctgttct gcccggaatt catctgtgtt catccttcct ctgttcctta gagcaacgtt
     2761 tatagcacat ttccatgcag acacacagac agtggtgggg atggacatgc acagtcgtta
     2821 gaaaacaaga cggagagagg aggggtgcct gggagcggga ggaggggaca ctggatccag
     2881 cctggaaccc cacccagtgc cttcatggaa ggcttccagg gaggtggcct taaaagagcc
     2941 aacctgcttc aaaaggaaat gtggggtgtt cccggcaggg gctggagtca gagagagccc
     3001 ccccttcagg aaggagcaag ccatcgcagg gtcaccctga gcagagctgc tgagcagcct
     3061 ggaggggcag gtggccacgc tagcacctag cacggtcctc aggccccgcc ccagcggatc
     3121 tgctgcggag tggcttagag cagggctctt gggccgcagg gtggggagac ttggtggggt
     3181 gcagcctagg gggtcgggag accagcgaaa gtgaagcggg gccgtcacag gtgtgagaga
     3241 acaggcgcag ggtgaagagg cagggagcca gggatcagcc gcccccagtg ggtttctgac
     3301 tctggcagct gagtggattg ggattggggc atttgtggag caaggagcag aatacagaca
     3361 ggttggggag ctcagccctg gggtgccagg ggatgggaag tgggaggact caaggatggg
     3421 gtcaggtttg acccgagagc taggggaacg gctggcatgg agcagactgg aagtaccgag
     3481 gtggatcccc gggagagggt ataggaaggg aagcagcaag ctgagtgcag gggagaaatg
     3541 cagggtttcc tgtgtgttgg gtggcggcgg gggtgaaagc cacccaggga ggcagccaaa
     3601 ggaaagaagg acatcgggtg ctggagggtc tgagtggggt ccaggggccc caggcaggcc
     3661 aggagggaca gcctggtgtc agctcaggga gaaggcccag gcccatctcg gctgggtggg
     3721 gtagggcccc tccaggtagt gggggatgag ctgtcacggg ttgggccgga ctgagagcaa
     3781 cagaaccctg ctgctgccct ggccccacct tgtccagcac aggaggccca agcctgggtt
     3841 gtctcccctc tcacccaccc atctctccct cccaaggaaa aagaagtgtg actactggat
     3901 caggactttt gttccaggtt gccagcccgg cgagttcacg ctgggcaaca ttaagagtga
     3961 gtcttgagtg aggtggggca ctgagttggg gctccgggga gctgggtagg ggcacagacc
     4021 ttcctgcccc tccacacaga tgtgttgtat ggggagaagc ccacgttgat gggctgggga
     4081 gggaggggac agctccctcc tcccatccag ggcagggctg acccctcacc gtccacgcct
     4141 gcaggttacc ctggattaac gagttacctc gtccgagtgg tgagcaccaa ctacaaccag
     4201 catgctatgg tgttcttcaa gaaagtttct caaaacaggg agtacttcaa gatcaccctc
     4261 tacggtgggt cctctcccat cccctcgggg actggctcct gatcacactt agtgggaggg
     4321 gaggccggtc ccccatgagg aagggatctg aggcctcatc tactcattca acgatattta
     4381 tgtggtgtct gccggccact cactggccat cttggtcaca gggagaacca aggagctgac
     4441 ttcggaacta aaggagaact tcatccgctt ctccaaatat ctgggcctcc ctgaaaacca
     4501 catcgtcttc cctgtcccaa tcggtaatgg ccagtctgga tgaggggacg gggacatggg
     4561 gactgttcag gcaggatgct tccctaccag ggatcaggga gaggagggac tccgtcctca
     4621 gcttcagtca ctggagcagt ggatggtcca ggagctcctt ggaagccact ctgggcccag
     4681 gaagactgtg ccccacccca gggtctatgg gactcccagg gacccaggcc gcaagtgctc
     4741 tttcctggca gtttagcccg ggtctgccca gacaaggatt tcaggcccag gcctgagtat
     4801 ccatttctca gtctcactgg cctgacacct ctggccaccc tcccaggccc ccttgttctg
     4861 ggccatctcc cccgaccctc ccaggcctcg tcaccctggg ttttgctgtc ctggctgtcc
     4921 tctctcccct ggggacttgc tcaccactga cttgggagct gtccttgact ccagggagcc
     4981 tggcttgggc aggaggctcc agccaggcca ttcagagagc cactggcctc ccccaggctg
     5041 agagactgcc tggactggta aacaggcagg agacctgggt gcccgaggag cctgggagct
     5101 gggcctcact cagggcagcc cctccccagg cctttctccc acatcccctg ccctgccatc
     5161 cacccctctg ttgccccatc tctgaaagga acccccatat cttctgcagc tgggccaggt
     5221 ggggcagggg ctgcccaggg gcagtgcaga ggacctggca gtcagggatc acacacacac
     5281 actcatacac gcacacacac acacagctgc ctgttctgac ggactttctc cctaacagac
     5341 cagtgtatcg acggctgagt gcacagtgag tgtggctggg cggctgcgag ggggcttgtg
     5401 ggaggccagg gtgcagtggg ctgggggtct tgggcctgcc tttgctcatc cccctgcccc
     5461 ccagcactgc tgctgtcttt attctgctgt ccccatctcg ggtgcctccc atttccccac
     5521 ccatcaccct catatccacc tctgtccagg gtgccgccag ctgccgcacc agcccgaaca
     5581 ccattgaggg agctgggaga ccctccccac agtgccaccc atgcagctgc tccccaggcc
     5641 accccgctga tggagcccca ccttgtctgc taaataaaca tgtgccctca ggcctctgag
     5701 tctacactgt ttgacccctg ggccttcgag gaaggggagg ggcgggaggc tcccactggc
     5761 atcactctca gggtctgcac ccccaggatg gagcctagcg aacccagcct gggtgttagg
     5821 gctgcagagt gaagacacaa gcccctggtc atcaccagca gctttgtgg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D10522. Homo sapiens mRNA...[gi:219893] Links  


LOCUS       HUMKCS                  2589 bp    mRNA    linear   PRI 02-FEB-1999
DEFINITION  Homo sapiens mRNA for 80K-L protein, complete cds.
ACCESSION   D10522 D90498
VERSION     D10522.1  GI:219893
KEYWORDS    80K-L protein; calmodulin binding protein; cytoplasm; plasma
            membrane; protein kinase C substrate.
SOURCE      Homo sapiens cell_line:A431 cDNA to mRNA, clone_lib:lamda gt10 A431
            cDNA library clone:lambda80L-[1,2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Sakai,K., Hirai,M., Kudoh,J., Minoshima,S. and Shimizu,N.
  TITLE     Molecular cloning and chromosomal mapping of a cDNA encoding human
            80K-L protein: major substrate for protein kinase C
  JOURNAL   Genomics 14 (1), 175-178 (1992)
  MEDLINE   93052291
REFERENCE   2  (bases 1 to 2589)
  AUTHORS   Sakai,K.
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 2589)
  AUTHORS   Shimizu,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-SEP-1991) Nobuyoshi Shimizu, Keio University School
            of Medicine, Department of Molecular Biology; 35 Shinanomachi,
            Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp,
            Tel:03-3353-1211(ex.2721), Fax:03-3351-2370)
FEATURES             Location/Qualifiers
     source          1..2589
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /clone="lambda80L-[1,2]"
                     /cell_line="A431"
                     /clone_lib="lamda gt10 A431 cDNA library"
     gene            1..2589
                     /gene="80K-L"
     CDS             370..1368
                     /gene="80K-L"
                     /note="tentative"
                     /codon_start=1
                     /product="80K-L protein"
                     /protein_id="BAA01392.1"
                     /db_xref="GI:219894"
                     /translation="MGAQFSKTAAKGEAAAERPGEAAVASSPSKANGQENGHVKVNGD
                     ASPAAAESGAKEELQANGSAPAADKEEPAAAGSGAASPSSAEKGEPAAAAAPEAGASP
                     VEKEAPAEGEAAEPGSATAAEGEAASAASSTSSPKAEDGATPSPSNETPKKKKKRFSF
                     KKSFKLSGFSFKKNKKEAGEGGEAEAPAAEGGKDEAAGGAAAAAAEAGAASGEQAAAP
                     GEEAAAGEEGAAGGDPQEAKPQEAAVAPEKPPASDETKAAEEPSKVEEKKAEEAGASA
                     AACEAPSAAGPGAPPEQEAAPAEEPAAAAASSACAAPSQEAQPECSPEAPPAEAAE"
     polyA_site      2589
                     /gene="80K-L"
BASE COUNT      608 a    659 c    682 g    640 t
ORIGIN      
        1 caaccaggga gatttctcca ttttcctctt gtctacagtg cggctacaaa tctgggattt
       61 ttttattact tctttttttt tcgaactaca cttgggctcc tttttttgtg ctcgactttt
      121 ccaccctttt tccctccctc ctgtgctgct gctttttgat ctcttcgact aaaatttttt
      181 tatccggagt gtatttaatc ggttctgttc tgtcctctcc accaccccca cccccctccc
      241 tccggtgtgt gtgccgctgc cgctgttgcc gccgccgctg ctgctgctgc tcgccccgtc
      301 gttacaccaa cccgaggctc tttgtttccc ctcttggatc tgttgagttt ctttgttgaa
      361 gaagccagca tgggtgccca gttctccaag accgcagcga agggagaagc cgccgcggag
      421 aggcctgggg aggcggctgt ggcctcgtcg ccttccaaag cgaacggaca ggagaatggc
      481 cacgtgaagg taaacggcga cgcttcgccc gcggccgccg agtcgggcgc caaggaggag
      541 ctgcaggcca acggcagcgc cccggccgcc gacaaggagg agcccgcggc cgccgggagc
      601 ggggcggcgt cgccctcctc ggccgagaaa ggtgagccgg ccgccgccgc tgcccccgag
      661 gccggggcca gcccggtaga gaaggaggcc cccgcggaag gcgaggctgc cgagcccggc
      721 tcggccacgg ccgcggaggg agaggccgcg tcggccgcct cctcgacttc ttcgcccaag
      781 gccgaggacg gggccacgcc ctcgcccagc aacgagaccc cgaaaaaaaa aaagaagcgc
      841 ttttccttca agaagtcttt caagctgagc ggcttctcct tcaagaagaa caagaaggag
      901 gctggagaag gcggtgaggc tgaggcgccc gctgccgaag gcggcaagga cgaggccgcc
      961 gggggcgcag ctgcggccgc cgccgaggcg ggcgcggcct ccggggagca ggcagcggcg
     1021 ccgggcgagg aggcggcagc gggcgaggag ggggcggcgg gtggcgaccc gcaggaggcc
     1081 aagccccagg aggccgctgt cgcgccagag aagccgcccg ccagcgacga gaccaaggcc
     1141 gccgaggagc ccagcaaggt ggaggagaaa aaggccgagg aggccggggc cagcgccgcc
     1201 gcctgcgagg ccccctccgc cgccgggccc ggcgcgcccc cggagcagga ggcagccccc
     1261 gcggaggagc ccgcggccgc cgcagcctcg tcagcctgcg cagccccctc acaggaggcc
     1321 cagcccgagt gcagtccaga agccccccca gcggaggcgg cagagtaaaa gagcaagctt
     1381 ttgtgagata atcgaagaac ttttctcccc cgtttgtttg ttggagtggt gccaggtact
     1441 gttttggaga acttgtctac aaccagggat tgattttaaa gatgtctttt tttattttac
     1501 ttttttttaa gcaccaaatt ttgttgtttt tttttttctc ccctccccac agatcccatc
     1561 tcaaatcatt ctgttaacca ccattccaac aggtcgagga gagcttaaac accttcttcc
     1621 tctgccttgt ttctctttta ttttttattt tttcgcatca gtattaatgt ttttgcatac
     1681 tttgcatctt tattcaaaag tgtaaacttt ctttgtcaat ctatggacat gcccatatat
     1741 gaaggagatg ggtgggtcaa aaagggatat caaatgaagt gataggggtc acaatgggga
     1801 aattgaagtg gtgcataaca ttgccaaaat agtgtgccac tagaaatggt gtaaaggctg
     1861 tctttttttt tttttttaaa gaaaagttat taccatgtat tttgtgaggc aggtttacaa
     1921 cactacaagt cttgagttaa gaaggaaaga ggaaaaaaga aaaaacacca atacccagat
     1981 ttaaaaaaaa aaaaacgatc atagtcttag gagttcattt aaaccatagg aacttttcac
     2041 ttatctcatg ttagctgtac cagtcagtga ttaagtagaa ctacaagttg tataggcttt
     2101 attgtttatt gctggtttat gaccttaata aagtgtaatt atgtattacc agcagggtgt
     2161 ttttaactgt gactattgta taaaaacaaa tcttgatatc cagaagcaca tgaagtttgc
     2221 aactttccac cctgcccatt tttgtaaaac tgcagtcatc ttggaccttt taaaacacaa
     2281 attttaaact caaccaagct gtgataagtg gaatggttac tgtttatact gtggtatgtt
     2341 tttgattaca gcagataatg ctttcttttc cagtcgtctt tgagaataaa ggaaaaaaaa
     2401 tcttcagatg caatggtttt gtgtagcatc ttgtctatca tgttttgtaa atactggaga
     2461 agctttgacc aatttgactt agagatggaa tgtaactttg cttacaaaaa ttgctattaa
     2521 actcctgctt aaggtgttct aattttctgt gagcacacta aaagcgaaaa ataaatgtga
     2581 ataaaatgt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC007459. Homo sapiens, clo...[gi:13938612] Links  


LOCUS       BC007459                1229 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, clone MGC:12230 IMAGE:4052054, mRNA, complete cds.
ACCESSION   BC007459
VERSION     BC007459.1  GI:13938612
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1229)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAY-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 16 Row: d Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 6912563.
FEATURES             Location/Qualifiers
     source          1..1229
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:12230 IMAGE:4052054"
                     /tissue_type="Kidney, hypernephroma"
                     /clone_lib="NIH_MGC_58"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             109..762
                     /codon_start=1
                     /product="Unknown (protein for MGC:12230)"
                     /protein_id="AAH07459.1"
                     /db_xref="GI:13938613"
                     /translation="MSKPPPKPVKPGEGGQVKVFRALYTFEPRTPDELYFEEGDIIYI
                     TDMSDTNWWKGTSKGRTGLIPSNYVAEQAESIDNPLHEAAKRGNLSWLRECLDNRVGV
                     NGLDKAGSTALYWACHGGHKDIVEMLFTQPNIELNQQNKLGDTALHAAAWKGYADIVQ
                     LFLAKGARTDLRNIEKKLAFDMATNAACASLLKKKQGTDAVRTLSNAEDYLDDEDSD"
BASE COUNT      400 a    213 c    283 g    333 t
ORIGIN      
        1 acggttgtaa gccagacaaa aagaactggg gtgcccggag tgccaggtgg cgggcaagcg
       61 gtgggctttt cggcggggtc tttaggattt gcagctccag gaagcgagat gtcgaagccg
      121 ccacccaaac cagtcaaacc aggtgaggga gggcaagtta aagtcttcag agccctgtat
      181 acgtttgaac ccagaactcc agatgaatta tactttgagg aaggtgatat tatctacatt
      241 actgacatga gcgataccaa ttggtggaaa ggcacctcca aaggcaggac tggactaatt
      301 ccaagcaact atgtggctga gcaggcagaa tccattgaca atccattgca tgaagcagca
      361 aaaagaggca acttgagctg gttgagagag tgtttggaca acagagtggg tgttaatggc
      421 ttagacaaag ctggaagcac tgccttatac tgggcttgcc acgggggcca caaagatata
      481 gtggaaatgc tatttactca accaaatatt gaactgaacc agcagaacaa gttgggagat
      541 acagctttgc atgctgctgc ctggaagggt tatgcagata tcgtccagtt gtttctggca
      601 aaaggtgcta gaacagactt aagaaacatt gagaagaagc tggccttcga catggctacc
      661 aatgctgcct gtgcatctct cctgaaaaag aaacagggaa cagatgcagt tcgaacatta
      721 agcaatgccg aggactatct cgatgatgaa gactcagatt aattcctttc tggagctttg
      781 agatctaaaa cttctgttgc ttttgccatt ccaaaacttt gtctttgcca gaaaagtgtt
      841 ggtaactata aagaaaatta tatatgaaca cggcagtgtt gcactgtgtt tgagtagaac
      901 gtgtaaatga attgttccca cctttggttt gccagtaagt gactggattc ttggcacatt
      961 tgtgttcacc aaagtagaac aagaagatat tatttctatt tatcaagcaa aaggaatttt
     1021 aagatttttt tttctttaaa aacaaattag gatttttttt tttttttttt ttttttagtt
     1081 aaaatgcttt acctcaatgg ttgagatatt ttgaatggat ttttcaaggg ggggaaatgc
     1141 ttattataat aataaaccaa aatacttaac agaaaattgt cagctattct gacaaaaaca
     1201 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC004143. Homo sapiens, B-f...[gi:13278731] Links  


LOCUS       BC004143                2512 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, B-factor, properdin, clone MGC:1795 IMAGE:2959705,
            mRNA, complete cds.
ACCESSION   BC004143
VERSION     BC004143.1  GI:13278731
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2512)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Rachel Dickhoff, Jessica Fahey, Stephanie Ford, Julia
            Greene, Mark Ketteman and Anuradha Madan
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 2 Row: k Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 452937.
FEATURES             Location/Qualifiers
     source          1..2512
                     /organism="Homo sapiens"
                     /db_xref="LocusID:629"
                     /db_xref="taxon:9606"
                     /clone="MGC:1795 IMAGE:2959705"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_15"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             145..2439
                     /codon_start=1
                     /product="B-factor, properdin"
                     /protein_id="AAH04143.1"
                     /db_xref="GI:13278732"
                     /translation="MGSNLSPQLCLMPFILGLLSGGVTTTPWSLAWPQGSCSLEGVEI
                     KGGSFRLLQEGQALEYVCPSGFYPYPVQTRTCRSTGSWSTLKTQDQKTVRKAECRAIH
                     CPRPHDFENGEYWPRSPYYNVSDEISFHCYDGYTLRGSANRTCQVNGRWSGQTAICDN
                     GAGYCSNPGIPIGTRKVGSQYRLEDSVTYHCSRGLTLRGSQRRTCQEGGSWSGTEPSC
                     QDSFMYDTPQEVAEAFLSSLTETIEGVDAEDGHGPGEQQKRKIVLDPSGSMNIYLVLD
                     GSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADW
                     VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWNRTRHVIILMTDGLH
                     NMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQ
                     HVFKVKDMENLEDVFYQMIDESQSLSLCGMVWEHRKGTDYHKQPWQAKISVIRPSKGH
                     ESCMGAVVSEYFVLTAAHCFTVDDKEHSIKVSVGGEKRDLEIEVVLFHPNYNINGKKE
                     AGIPEFYDYDVALIKLKNKLKYGQTIRPICLPCTEGTTRALRLPPTTTCQQQKEELLP
                     AQDIKALFVSEEEKKLTRKEVYIKNGDKKGSCERDAQYAPGYDKVKDISEVVTPRFLC
                     TGGVSPYADPNTCRGDSGGPLIVHKRSRFIQVGVISWGVVDVCKNQKRQKQVPAHARD
                     FHINLFQVLPWLKEKLQDEDLGFL"
BASE COUNT      681 a    623 c    687 g    521 t
ORIGIN      
        1 ggcacgaggg ggagcagggg aagggaatgt gaccaggtct aggtctggag tttcagcttg
       61 gacactgagc caagcagaca agcaaagcaa gccaggacac accatcctgc cccaggccca
      121 gcttctctcc tgccttccaa cgccatgggg agcaatctca gcccccaact ctgcctgatg
      181 ccctttatct tgggcctctt gtctggaggt gtgaccacca ctccatggtc tttggcctgg
      241 ccccagggat cctgctctct ggagggggta gagatcaaag gcggctcctt ccgacttctc
      301 caagagggcc aggcactgga gtacgtgtgt ccttctggct tctacccgta ccctgtgcag
      361 acacgtacct gcagatctac ggggtcctgg agcaccctga agactcaaga ccaaaagact
      421 gtcaggaagg cagagtgcag agcaatccac tgtccaagac cacacgactt cgagaacggg
      481 gaatactggc cccggtctcc ctactacaat gtgagtgatg agatctcttt ccactgctat
      541 gacggttaca ctctccgggg ctctgccaat cgcacctgcc aagtgaatgg ccggtggagt
      601 gggcagacag cgatctgtga caacggagcg gggtactgct ccaacccggg catccccatt
      661 ggcacaagga aggtgggcag ccagtaccgc cttgaagaca gcgtcaccta ccactgcagc
      721 cgggggctta ccctgcgtgg ctcccagcgg cgaacgtgtc aggaaggtgg ctcttggagc
      781 gggacggagc cttcctgcca agactccttc atgtacgaca cccctcaaga ggtggccgaa
      841 gctttcctgt cttccctgac agagaccata gaaggagtcg atgctgagga tgggcacggc
      901 ccaggggaac aacagaagcg gaagatcgtc ctggaccctt caggctccat gaacatctac
      961 ctggtgctag atggatcaga cagcattggg gccagcaact tcacaggagc caaaaagtgt
     1021 ctagtcaact taattgagaa ggtggcaagt tatggtgtga agccaagata tggtctagtg
     1081 acatatgcca cataccccaa aatttgggtc aaagtgtctg aagcagacag cagtaatgca
     1141 gactgggtca cgaagcagct caatgaaatc aattatgaag accacaagtt gaagtcaggg
     1201 actaacacca agaaggccct ccaggcagtg tacagcatga tgagctggcc agatgacgtc
     1261 cctcctgaag gctggaaccg cacccgccat gtcatcatcc tcatgactga tggattgcac
     1321 aacatgggcg gggacccaat tactgtcatt gatgagatcc gggacttgct atacattggc
     1381 aaggatcgca aaaacccaag ggaggattat ctggatgtct atgtgtttgg ggtcgggcct
     1441 ttggtgaacc aagtgaacat caatgctttg gcttccaaga aagacaatga gcaacatgtg
     1501 ttcaaagtca aggatatgga aaacctggaa gatgttttct accaaatgat cgatgaaagc
     1561 cagtctctga gtctctgtgg catggtttgg gaacacagga agggtaccga ttaccacaag
     1621 caaccatggc aggccaagat ctcagtcatt cgcccttcaa agggacacga gagctgtatg
     1681 ggggctgtgg tgtctgagta ctttgtgctg acagcagcac attgtttcac tgtggatgac
     1741 aaggaacact caatcaaggt cagcgtagga ggggagaagc gggacctgga gatagaagta
     1801 gtcctatttc accccaacta caacattaat gggaaaaaag aagcaggaat tcctgaattt
     1861 tatgactatg acgttgccct gatcaagctc aagaataagc tgaaatatgg ccagactatc
     1921 aggcccattt gtctcccctg caccgaggga acaactcgag ctttgaggct tcctccaact
     1981 accacttgcc agcaacaaaa ggaagagctg ctccctgcac aggatatcaa agctctgttt
     2041 gtgtctgagg aggagaaaaa gctgactcgg aaggaggtct acatcaagaa tggggataag
     2101 aaaggcagct gtgagagaga tgctcaatat gccccaggct atgacaaagt caaggacatc
     2161 tcagaggtgg tcacccctcg gttcctttgt actggaggag tgagtcccta tgctgacccc
     2221 aatacttgca gaggtgattc tggcggcccc ttgatagttc acaagagaag tcgtttcatt
     2281 caagttggtg taatcagctg gggagtagtg gatgtctgca aaaaccagaa gcggcaaaag
     2341 caggtacctg ctcacgcccg agactttcac atcaacctct ttcaagtgct gccctggctg
     2401 aaggagaaac tccaagatga ggatttgggt tttctataag gggtttcctg ctggacaggg
     2461 gcgtgggatt gaattaaaac agctgcgaca acaaaaaaaa aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L15702. Human complement ...[gi:291921] Links  


LOCUS       HUMCOMFACB              2388 bp    mRNA    linear   PRI 16-MAR-1994
DEFINITION  Human complement factor B mRNA, complete cds.
ACCESSION   L15702
VERSION     L15702.1  GI:291921
KEYWORDS    complement factor; complement factor B.
SOURCE      Homo sapiens (human).
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2388)
  AUTHORS   Horiuchi,T., Kim,S., Matsumoto,M., Watanabe,I., Fujita,S. and
            Volanakis,J.E.
  TITLE     Human complement factor B: cDNA cloning, nucleotide sequencing,
            phenotypic conversion by site-directed mutagenesis and expression
  JOURNAL   Mol. Immunol. 30 (17), 1587-1592 (1993)
  MEDLINE   94067177
   PUBMED   8247029
FEATURES             Location/Qualifiers
     source          1..2388
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     5'UTR           1..40
     CDS             41..2335
                     /codon_start=1
                     /product="complement factor B"
                     /protein_id="AAA16820.1"
                     /db_xref="GI:291922"
                     /translation="MGSNLSPQLCLMPFILGLLSGGVTTTPWSLAQPQGSCSLEGVEI
                     KGGSFRLLQEGQALEYVCPSGFYPYPVQTRTCRSTGSWSTLKTQDQKTVRKAECRAIH
                     CPRPHDFENGEYWPRSPYYNVSDEISFHCYDGYTLRGSANRTCQVNGRWSGQTAICDN
                     GAGYCSNPGIPIGTRKVGSQYRLEDSVTYHCSRGLTLRGSQRRTCQEGGSWSGTEPSC
                     QDSFMYDTPQEVAEAFLSSLTETIEGVDAEDGHGPGEQQKRKIVLDPSGSMNIYLVLD
                     GSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADW
                     VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWNRTRHVIILMTDGLH
                     NMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQ
                     HVFKVKDMENLEDVFYQMIDESQSLSLCGMVWEHRKGTDYHKQPWQAKISVIRPSKGH
                     ESCMGAVVSEYFVLTAAHCFTVDDKEHSIKVSVGGEKRDLEIEVVLFHPNYNINGKKE
                     AGIPEFYDYDVALIKLKNKLKYGQTIRPICLPCTEGTTRALRLPPTTTCQQQKEELLP
                     AQDIKALFVSEEEKKLTRKEVYIKNGDKKGSCERDAQYAPGYDKVKDISEVVTPRFLC
                     TGGVSPYADPNTCRGDSGGPLIVHKRSRFIQVGVISWGVVDVCKNQKRQKQVPAHARD
                     FHINLFQVLPWLKEKLQDEDLGFL"
     sig_peptide     41..115
     3'UTR           2336..2388
     polyA_signal    2369..2374
BASE COUNT      630 a    601 c    649 g    508 t
ORIGIN      
        1 tcctgcccca ggcccagctt ctctcctgcc ttccaacgcc atggggagca atctcagccc
       61 ccaactctgc ctgatgccct ttatcttggg cctcttgtct ggaggtgtga ccaccactcc
      121 atggtctttg gcccagcccc agggatcctg ctctctggag ggggtagaga tcaaaggcgg
      181 ctccttccga cttctccaag agggccaggc actggagtac gtgtgtcctt ctggcttcta
      241 cccgtaccct gtgcagacac gtacctgcag atctacgggg tcctggagca ccctgaagac
      301 tcaagaccaa aagactgtca ggaaggcaga gtgcagagca atccactgtc caagaccaca
      361 cgacttcgag aacggggaat actggccccg gtctccctac tacaatgtga gtgatgagat
      421 ctctttccac tgctatgacg gttacactct ccggggctct gccaatcgca cctgccaagt
      481 gaatggccgg tggagtgggc agacagcgat ctgtgacaac ggagcggggt actgctccaa
      541 cccgggcatc cccattggca caaggaaggt gggcagccag taccgccttg aagacagcgt
      601 cacctaccac tgcagccggg ggcttaccct gcgtggctcc cagcggcgaa cgtgtcagga
      661 aggtggctct tggagcggga cggagccttc ctgccaagac tccttcatgt acgacacccc
      721 tcaagaggtg gccgaagctt tcctgtcttc cctgacagag accatagaag gagtcgatgc
      781 tgaggatggg cacggcccag gggaacaaca gaagcggaag atcgtcctgg acccttcagg
      841 ctccatgaac atctacctgg tgctagatgg atcagacagc attggggcca gcaacttcac
      901 aggagccaaa aagtgtctag tcaacttaat tgagaaggtg gcaagttatg gtgtgaagcc
      961 aagatatggt ctagtgacat atgccacata ccccaaaatt tgggtcaaag tgtctgaagc
     1021 agacagcagt aatgcagact gggtcacgaa gcagctcaat gaaatcaatt atgaagacca
     1081 caagttgaag tcagggacta acaccaagaa ggccctccag gcagtgtaca gcatgatgag
     1141 ctggccagat gacgtccctc ctgaaggctg gaaccgcacc cgccatgtca tcatcctcat
     1201 gactgatgga ttgcacaaca tgggcgggga cccaattact gtcattgatg agatccggga
     1261 cttgctatac attggcaagg atcgcaaaaa cccaagggag gattatctgg atgtctatgt
     1321 gtttggggtc gggcctttgg tgaaccaagt gaacatcaat gctttggctt ccaagaaaga
     1381 caatgagcaa catgtgttca aagtcaagga tatggaaaac ctggaagatg ttttctacca
     1441 aatgatcgat gaaagccagt ctctgagtct ctgtggcatg gtttgggaac acaggaaggg
     1501 taccgattac cacaagcaac catggcaggc caagatctca gtcattcgcc cttcaaaggg
     1561 acacgagagc tgtatggggg ctgtggtgtc tgagtacttt gtgctgacag cagcacattg
     1621 tttcactgtg gatgacaagg aacactcaat caaggtcagc gtaggagggg agaagcggga
     1681 cctggagata gaagtagtcc tatttcaccc caactacaac attaatggga aaaaagaagc
     1741 aggaattcct gaattttatg actatgacgt tgccctgatc aagctcaaga ataagctgaa
     1801 atatggccag actatcaggc ccatttgtct cccctgcacc gagggaacaa ctcgagcttt
     1861 gaggcttcct ccaactacca cttgccagca acaaaaggaa gagctgctcc ctgcacagga
     1921 tatcaaagct ctgtttgtgt ctgaggagga gaaaaagctg actcggaagg aggtctacat
     1981 caagaatggg gataagaaag gcagctgtga gagagatgct caatatgccc caggctatga
     2041 caaagtcaag gacatctcag aggtggtcac ccctcggttc ctttgtactg gaggagtgag
     2101 tccctatgct gaccccaata cttgcagagg tgattctggc ggccccttga tagttcacaa
     2161 gagaagtcgt ttcattcaag ttggtgtaat cagctgggga gtagtggatg tctgcaaaaa
     2221 ccagaagcgg caaaagcagg tacctgctca cgcccgagac tttcacatca acctctttca
     2281 agtgctgccc tggctgaagg agaaactcca agatgaggat ttgggttttc tataaggggt
     2341 ttcctgctgg acaggggcgt gggattgaat taaaacagct gcgacaac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Z11793. H.sapiens mRNA fo...[gi:36425] Links  


LOCUS       HSSELPM                 2038 bp    mRNA    linear   PRI 30-NOV-1997
DEFINITION  H.sapiens mRNA for selenoprotein P.
ACCESSION   Z11793
VERSION     Z11793.1  GI:36425
KEYWORDS    selenoprotein P protein.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2038)
  AUTHORS   Hill,K.E., Lloyd,R. and Burk,R.F.
  TITLE     Conserved Nucleotide Sequences in the Open Reading Frame and
            3'-Untranslated Region of Selenoprotein P
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2038)
  AUTHORS   Hill,K.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-MAR-1992) K.E. Hill, Medicine/GI, Vanderbilt
            University School of Medicine, 1161 21st Avenue South, Nashville,
            TN, 37232, USA
FEATURES             Location/Qualifiers
     source          1..2038
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /sex="Male"
                     /tissue_type="Liver"
     5'UTR           1..36
     CDS             37..1182
                     /note="tga codes for selenocysteine"
                     /codon_start=1
                     /codon=(seq:"tga",aa:OTHER)
                     /product="selenoprotein P"
                     /protein_id="CAA77836.1"
                     /db_xref="GI:2654365"
                     /db_xref="SWISS-PROT:P49908"
                     /translation="MWRSLGLALALCLLPSGGTESQDQSSLCKQPPAWSIRDQDPMLN
                     SNGSVTVVALLQASXYLCIIEASKLEDLRVKLKKEGYSNISYIVVNHQGISSRLKYTH
                     LKNKVSEHIPVYQQEENQTDVWTLLNGSKDDFLIYDRCGRLVYHLGLPFSFLTFPYVE
                     EAIKIAYCEKKCGNCSLTTLKDEDFCKRVSLATVDKTVETPSPHYHHEHHHNHGHQHL
                     GSSELSENQQPGAPNAPTHPAPPGLHHHHKHKGQHRQGHPENRDMPASEDLQDLQKKL
                     CRKRCINQLLCKLPTDSELAPRSXCCHCRHLIFEKTGSAITXQCKENLPSLCSXQGLR
                     AEENITESCQXRLPPAAXQISQQLIPTEASASXRXKNQAKKXEXPSN"
     sig_peptide     37..93
     mat_peptide     94..1179
                     /product="selenoprotein P"
     3'UTR           1183..2016
     polyA_signal    1196..2001
     polyA_site      2016
BASE COUNT      673 a    399 c    360 g    606 t
ORIGIN      
        1 gcaggcccgt tggaagtggt tgtgacaacc ccagcaatgt ggagaagcct ggggcttgcc
       61 ctggctctct gtctcctccc atcgggagga acagagagcc aggaccaaag ctccttatgt
      121 aagcaacccc cagcctggag cataagagat caagatccaa tgctaaactc caatggttca
      181 gtgactgtgg ttgctcttct tcaagccagc tgatacctgt gcatcatcga ggcatctaaa
      241 ttagaagacc tgcgagtaaa actgaagaaa gaaggatatt ctaatatttc ttatattgtt
      301 gttaatcatc aaggaatctc ttctcgatta aaatacacac atcttaagaa taaggtttca
      361 gagcatattc ctgtttatca acaagaagaa aaccaaacag atgtctggac tcttttaaat
      421 ggaagcaaag atgacttcct catatatgat agatgtggcc gtcttgtata tcatcttggt
      481 ttgccttttt ccttcctaac tttcccatat gtagaagaag ccattaagat tgcttactgt
      541 gaaaagaaat gtggaaactg ctctctcacg actctcaaag atgaagactt ttgtaaacgt
      601 gtatctttgg ctactgtgga taaaacagtt gaaactccat cgcctcatta ccatcatgag
      661 catcatcaca atcatggaca tcagcacctt ggcagcagtg agctttcaga gaatcagcaa
      721 ccaggagcac caaatgctcc tactcatcct gctcctccag gccttcatca ccaccataag
      781 cacaagggtc agcataggca gggtcaccca gagaaccgag atatgccagc aagtgaagat
      841 ttacaagatt tacaaaagaa gctctgtcga aagagatgta taaatcaatt actctgtaaa
      901 ttgcccacag attcagagtt ggctcctagg agctgatgct gccattgtcg acatctgata
      961 tttgaaaaaa cagggtctgc aatcacctga cagtgtaaag aaaacctccc atctttatgt
     1021 agctgacagg gacttcgggc agaggagaac ataactgaat cttgtcagtg acgtttgcct
     1081 ccagctgcct gacaaataag tcagcagctt atacccacag aagccagtgc cagttgacgc
     1141 tgaaagaatc aggcaaaaaa gtgagaatga ccttcaaact aaatatttaa aataggacat
     1201 actccccaat ttagtctaga cacaatttca tttccagcat ttttataaac taccaaatta
     1261 gtgaaccaaa aatagaaatt agatttgtgc aaacatggag aaatctactg aattggcttc
     1321 cagattttaa attttatgtc atagaaatat tgactcaaac catatttttt atgatggagc
     1381 aactgaaagg tgattgcagc ttttggttaa tatgtctttt tttttctttt tccagtgttc
     1441 tatttgcttt aatgagaata gaaacgtaaa ctatgaccta ggggttttct gttggataat
     1501 tagcagttta gaatggagga agaacaacaa agacatgctt tccatttttt cctttactta
     1561 tctctcaaaa caatattact ttgtcttttc aatcttctac ttttaactaa taaaataagt
     1621 ggattttgta ttttaagatc cagaaatact taacacgtga atattttgct aaaaaagcat
     1681 atataactat tttaaatatc catttatctt ttgtatatct aagactcatc ctgattttta
     1741 ctatcacaca tgaataaagg cctttgtatc tttctttctc taatgttgta tcatactctt
     1801 ctaaaacttg agtggctgtc ttaaaagata taaggggaaa gataatattg tctgtctcta
     1861 tattgcttag taagtatttc catagtcaat gatggtttaa taggtaaacc aaaccctata
     1921 aacctgacct cctttatggt taatactatt aagcaagaat gcagtacaga attggataca
     1981 gtacggattt gtccaaataa attcaataaa aaccttaaaa aaaaaaaaaa aaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF018956. Homo sapiens neur...[gi:2407640] Links  


LOCUS       AF018956                2772 bp    mRNA    linear   PRI 18-SEP-1997
DEFINITION  Homo sapiens neuropilin mRNA, complete cds.
ACCESSION   AF018956
VERSION     AF018956.1  GI:2407640
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2772)
  AUTHORS   He,Z. and Tessier-Lavigne,M.
  TITLE     Neuropilin is a receptor for the axonal chemorepellent Semaphorin
            III
  JOURNAL   Cell 90 (4), 739-751 (1997)
  MEDLINE   97433084
   PUBMED   9288753
REFERENCE   2  (bases 1 to 2772)
  AUTHORS   He,Z. and Tessier-Lavigne,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-AUG-1997) Howard Hughes Medical Institute, University
            of California, San Francisco, 513 Parnassus Avenue, HSE-201, San
            Francisco, CA 94143, USA
FEATURES             Location/Qualifiers
     source          1..2772
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             1..2772
                     /codon_start=1
                     /product="neuropilin"
                     /protein_id="AAC51759.1"
                     /db_xref="GI:2407641"
                     /translation="MERGLPLLCAVLALVLAPAGAFRNDECGDTIKIESPGYLTSPGY
                     PHSYHPSEKCEWLIQAPDPYQRIMINFNPHFDLEDRDCKYDYVEVFDGENENGHFRGK
                     FCGKIAPPPVVSSGPFLFIKFVSDYETHGAGFSIRYEIFKRGPECSQNYTTPSGVIKS
                     PGFPEKYPNSLECTYIVFAPKMSEIILEFESFDLEPDSNPPGGMFCRYDRLEIWDGFP
                     DVGPHIGRYCGQKTPGRIRSSSGILSMVFYTDSAIAKEGFSANYSVLQSSVSEDFKCM
                     EALGMESGEIHSDQITASSQYSTNWSAERSRLNYPENGWTPGEDSYREWIQVDLGLLR
                     FVTAVGTQGAISKETKKKYYVKTYKIDVSSNGEDWITIKEGNKPVLFQGNTNPTDVVV
                     AVFPKPLITRFVRIKPATWETGISMRFEVYGCKITDYPCSGMLGMVSGLISDSQITSS
                     NQGDRNWMPENIRLVTSRSGWALPPAPHSYINEWLQIDLGEEKIVRGIIIQGGKHREN
                     KVFMRKFKIGYSNNGSDWKMIMDDSKRKAKSFEGNNNYDTPELRTFPALSTRFIRIYP
                     ERATHGGLGLRMELLGCEVEAPTAGPTTPNGNLVDECDDDQANCHSGTGDDFQLTGGT
                     TVLATEKPTVIDSTIQSEFPTYGFNCEFGWGSHKTFCHWEHDNHVQLKWSVLTSKTGP
                     IQDHTGDGNFIYSQADENQKGKVARLVSPVVYSQNSAHCMTFWYHMSGSHVGTLRVKL
                     RYQKPEEYDQLVWMAIGHQGDHWKEGRVLLHKSLKLYQVIFEGEIGKGNLGGIAVDDI
                     SINNHISQEDCAKPADLDKKNPEIKIDETGSTPGYEGEGEGDKNISRKPGNVLKTLEP
                     ILITIIAMSALGVLLGAVCGVVLYCACWHNGMSERNLSALENYNFELVDGVKLKKDKL
                     NTQSTYSEA"
BASE COUNT      772 a    664 c    702 g    634 t
ORIGIN      
        1 atggagaggg ggctgccgct cctctgcgcc gtgctcgccc tcgtcctcgc cccggccggc
       61 gcttttcgca acgatgaatg tggcgatact ataaaaattg aaagccccgg gtaccttaca
      121 tctcctggtt atcctcattc ttatcaccca agtgaaaaat gcgaatggct gattcaggct
      181 ccggacccat accagagaat tatgatcaac ttcaaccctc acttcgattt ggaggacaga
      241 gactgcaagt atgactacgt ggaagtcttc gatggagaaa atgaaaatgg acattttagg
      301 ggaaagttct gtggaaagat agcccctcct cctgttgtgt cttcagggcc atttcttttt
      361 atcaaatttg tctctgacta cgaaacacat ggtgcaggat tttccatacg ttatgaaatt
      421 ttcaagagag gtcctgaatg ttcccagaac tacacaacac ctagtggagt gataaagtcc
      481 cccggattcc ctgaaaaata tcccaacagc cttgaatgca cttatattgt ctttgcgcca
      541 aagatgtcag agattatcct ggaatttgaa agctttgacc tggagcctga ctcaaatcct
      601 ccagggggga tgttctgtcg ctacgaccgg ctagaaatct gggatggatt ccctgatgtt
      661 ggccctcaca ttgggcgtta ctgtggacag aaaacaccag gtcgaatccg atcctcatcg
      721 ggcattctct ccatggtttt ttacaccgac agcgcgatag caaaagaagg tttctcagca
      781 aactacagtg tcttgcagag cagtgtctca gaagatttca aatgtatgga agctctgggc
      841 atggaatcag gagaaattca ttctgaccag atcacagctt cttcccagta tagcaccaac
      901 tggtctgcag agcgctcccg cctgaactac cctgagaatg ggtggactcc cggagaggat
      961 tcctaccgag agtggataca ggtagacttg ggccttctgc gctttgtcac ggctgtcggg
     1021 acacagggcg ccatttcaaa agaaaccaag aagaaatatt atgtcaagac ttacaagatc
     1081 gacgttagct ccaacgggga agactggatc accataaaag aaggaaacaa acctgttctc
     1141 tttcagggaa acaccaaccc cacagatgtt gtggttgcag tattccccaa accactgata
     1201 actcgatttg tccgaatcaa gcctgcaact tgggaaactg gcatatctat gagatttgaa
     1261 gtatacggtt gcaagataac agattatcct tgctctggaa tgttgggtat ggtgtctgga
     1321 cttatttctg actcccagat cacatcatcc aaccaaggag acagaaactg gatgcctgaa
     1381 aacatccgcc tggtaaccag tcgctctggc tgggcacttc cacccgcacc tcattcctac
     1441 atcaatgagt ggctccaaat agacctgggg gaggagaaga tcgtgagggg catcatcatt
     1501 cagggtggga agcaccgaga gaacaaggtg ttcatgagga agttcaagat cgggtacagc
     1561 aacaacggct cggactggaa gatgatcatg gatgacagca aacgcaaggc gaagtctttt
     1621 gagggcaaca acaactatga tacacctgag ctgcggactt ttccagctct ctccacgcga
     1681 ttcatcagga tctaccccga gagagccact catggcggac tggggctcag aatggagctg
     1741 ctgggctgtg aagtggaagc ccctacagct ggaccgacca ctcccaacgg gaacttggtg
     1801 gatgaatgtg atgacgacca ggccaactgc cacagtggaa caggtgatga cttccagctc
     1861 acaggtggca ccactgtgct ggccacagaa aagcccacgg tcatagacag caccatacaa
     1921 tcagagtttc caacatatgg ttttaactgt gaatttggct ggggctctca caagaccttc
     1981 tgccactggg aacatgacaa tcacgtgcag ctcaagtgga gtgtgttgac cagcaagacg
     2041 ggacccattc aggatcacac aggagatggc aacttcatct attcccaagc tgacgaaaat
     2101 cagaagggca aagtggctcg cctggtgagc cctgtggttt attcccagaa ctctgcccac
     2161 tgcatgacct tctggtatca catgtctggg tcccacgtcg gcacactcag ggtcaaactg
     2221 cgctaccaga agccagagga gtacgatcag ctggtctgga tggccattgg acaccaaggt
     2281 gaccactgga aggaagggcg tgtcttgctc cacaagtctc tgaaacttta tcaggtgatt
     2341 ttcgagggcg aaatcggaaa aggaaacctt ggtgggattg ctgtggatga cattagtatt
     2401 aataaccaca tttcacaaga agattgtgca aaaccagcag acctggataa aaagaaccca
     2461 gaaattaaaa ttgatgaaac agggagcacg ccaggatacg aaggtgaagg agaaggtgac
     2521 aagaacatct ccaggaagcc aggcaatgtg ttgaagacct tagaacccat cctcatcacc
     2581 atcatagcca tgagcgccct gggggtcctc ctgggggctg tctgtggggt cgtgctgtac
     2641 tgtgcctgtt ggcataatgg gatgtcagaa agaaacttgt ctgccctgga gaactataac
     2701 tttgaacttg tggatggtgt gaagttgaaa aaagacaaac tgaatacaca gagtacttat
     2761 tcggaggcat ga
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X52851. Human cyclophilin...[gi:30167] Links  


LOCUS       HSCPH70                 6711 bp    DNA     linear   PRI 26-APR-1993
DEFINITION  Human cyclophilin gene for cyclophilin (EC 5.2.1.8).
ACCESSION   X52851
VERSION     X52851.1  GI:30167
KEYWORDS    Alu repeat; cyclophilin; cyclosporin A-binding protein;
            peptidylprolyl isomerase.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Haendler,B. and Hofer,E.
  TITLE     Characterization of the human cyclophilin gene and of related
            processed pseudogenes
  JOURNAL   Eur. J. Biochem. 190 (3), 477-482 (1990)
  MEDLINE   90322991
REFERENCE   2  (bases 1 to 6711)
  AUTHORS   Hofer,E.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-APR-1990) Hofer E., Sandoz Ltd, Preclinical Research,
            386/625, 4002 Basle, Switzerland
COMMENT     See also  for cDNA sequence and  for related
            processed pseudogenic sequences.
FEATURES             Location/Qualifiers
     source          1..6711
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="CPH-70"
                     /cell_type="leucocyte"
                     /clone_lib="Lambda EMBL3"
     misc_feature    71..360
                     /note="Alu repeat"
     misc_feature    361..562
                     /note="Alu repeat"
     protein_bind    781..790
                     /note="SP1 binding domain"
                     /bound_moiety="Sp1"
     protein_bind    1371..1380
                     /note="SP1 binding domain"
                     /bound_moiety="Sp1"
     TATA_signal     1585..1591
     mRNA            join(1616..1728,4173..4203,4318..4406,4628..4800,
                     6215..6561)
     exon            1616..1728
                     /number=1
     CDS             join(1660..1728,4173..4203,4318..4406,4628..4800,
                     6215..6350)
                     /EC_number="5.2.1.8"
                     /codon_start=1
                     /product="peptidylprolyl isomerase"
                     /protein_id="CAA37039.1"
                     /db_xref="GI:30168"
                     /db_xref="SWISS-PROT:P05092"
                     /translation="MVNPTVFFDIAVDGEPLGRVSFELFADKVPKTAENFRALSTGEK
                     GFGYKGSCFHRIIPGFMCQGGDFTRHNGTGGKSIYGEKFEDENFILKHTGPGILSMAN
                     AGPNTNGSQFFICTAKTEWLDGKHVVFGKVKEGMNIVEAMERFGSRNGKTSKKITIAD
                     CGQLE"
     intron          1729..4172
                     /number=1
     misc_feature    2054..2063
                     /note="general enhancer core"
     protein_bind    2522..2537
                     /note="NP1 binding domain"
                     /bound_moiety="NP1"
     misc_feature    3207..3445
                     /note="Alu repeat"
     protein_bind    4094..4100
                     /note="AP1 binding domain"
                     /bound_moiety="AP1"
     exon            4173..4203
                     /number=2
     intron          4204..4317
                     /number=2
     exon            4318..4406
                     /number=3
     intron          4407..4627
                     /number=3
     exon            4628..4800
                     /number=4
     intron          4801..6214
                     /number=4
     misc_feature    5026..5314
                     /note="Alu repeat"
     misc_feature    5340..5626
                     /note="Alu repeat"
     misc_feature    5755..6043
                     /note="Alu repeat"
     exon            6215..6561
                     /number=5
     polyA_signal    6538..6543
BASE COUNT     1552 a   1625 c   1717 g   1817 t
ORIGIN      
        1 gaattccctt gtaaggtttt cttaacaaaa caccagtcac ataagtgcat tttattttat
       61 atttttgttt atttatttga gacggagtct cttgtctctc aggctggagt gcagtggcgc
      121 catctctgct cgctgcaacc tccacctcct gggttccagc gattctcctg cctcagcctc
      181 ccgagggggt agctgggact acaggtgcgc accaccatgc ccagctaatt ttgtattttt
      241 cgtagagatg gggtttcacc atgttgtcca ggctggtctt gaactcctga cctcaggtga
      301 tcctcccgcc tcggcctccc aaagtgctgg aattacaggc gtgatccacc gcacccggcc
      361 tattttttga gagagggtca cactctgtcg tcccggctgg aatgcagtga tgcgatcacc
      421 gcccactaca gcctcgacct ccgggctcaa gcaatcctcc ccgcccagcc tcctgagtag
      481 cgagcgcctc gacgcccagc taatttttat ttttatttat ttttttgtag agacggcgtc
      541 tctctaagat gcccaggctg gtggccggtg tcgaactcct aagatgaagc gatcctcccc
      601 ggccttggcc tccgcgcctc ctaaagcgcc aggtatgagc caccgcgcct ggcctacaag
      661 tgcattttaa ttaaagtatt attaatgtct ttgcctgaag aaattcgctt ttaaattgtg
      721 acttatcttt cacccaaaaa tcaaagcaca attcagcccc gaggcggggg cggtaggagc
      781 tgggcggggc gggggcaggg aaagaccagg agcagagatt caaaaagagt aagagggcaa
      841 aatgtgcata atgcatcttc acaggtaaga gcctggccag gctcctgttt taatggcttc
      901 ctcctgaaga agattcaagc agagtgtaag atattttcgg aaagtagagc attttgaaag
      961 catttcataa tctctcaaaa ccggagactg ctcctgtccc acctcgttag agaaaacagc
     1021 gatgctcaaa ggcaacctcc ttcctgacat tgcctggtag gacgcgacgt ggtgtttgcc
     1081 cgcgcggaat gcggacgcaa ggctgctcct aggtctcggg gacgcgccat ccccatttcc
     1141 gctcgcggag gcgtagggtc cgggcgcggg accccagtcg accttgactg gcggcgcgac
     1201 cttgaggcct gcgttcgcct cagttgcccc ctctgtgcaa tggggagacg cgcctcatcg
     1261 cttgacaacg gccgaagagc cgccgcgctt ccgtctcccg cgtgcgcgcg ccatgctgcc
     1321 cacccccgtt ccgcactgac cctcccccgt gccccgcgtc ccgtactgcc gccccgcccc
     1381 gagtcccatg ccgcagccac cgcgacggag cccgcaggcg ggaacctgcc tccgcgcgtt
     1441 agcgcgcacg cgcgcctcat gtgtcgtccc catcagcgcc ggcttccgtc tataggccag
     1501 atgcactgtc actctggcga agtcgcagac ccgattggcc gggacggagg cgcgagaccg
     1561 ggttgcgggc ggggccgaac gtggtataaa acgggcggga ggccaggctc gtgccgtttt
     1621 gcagacgcca ccgccgagga aaaccgtgta ctattagcca tggtcaaccc caccgtgttc
     1681 ttcgacattg ccgtcgacgg cgagcccttg ggccgcgtct cctttgaggt cgggcgggcg
     1741 gcggcgtgcg ggaatggggc ccagaaagtg ggccggggtc ggggtgggtg gtagcgcccc
     1801 aaaggcccgg gcgcggggcg accctgcttg aggggcgagc gcgggcgggc tgcggcgcca
     1861 tttcctgacg aggggccatt ttgggaggtc cgcgagtcgc gggaggaggc cgggacgcgg
     1921 cggacaaagg caggcggggc ggctgcgagg ccgttggggg agggggcccg cgtccgcccg
     1981 cccgcctcat gtggccgcgc cctgtcctgt ccgacgcacg tgctcggcgg ccgcgctcag
     2041 gtccgcgcct tgagagtcgt tgtccgccct agcttggcct gggcgccgca gaccggagcc
     2101 agaagcacgc tcgcgggggc ttgcgaccgc cttcctggga agctgtcccc tggcaggcat
     2161 gggtgcttta catcctgagc tgggaagctg tttgcttgag ggtttttctc aaggatcgag
     2221 gcgcggtgtg agcccgtcca tgctcggtcc tgtagatccc gggaggccat gttataaaag
     2281 gagacttgct gggatgtgac gggttgccac ttgaaatatc ttccatttgg ataaagtagg
     2341 aatatttata catgtgcccc aaacgtccct ccgtgtcccc cacccccaag cggaaatgtg
     2401 aaaatgggcc ttgcctttgc tggtgcccaa ggaccgcctt ccactgcagt gacggcgctg
     2461 gcgggggagg cgctcttgag cccctcccga ttgtccctct gcctagcaag caagttgcga
     2521 ctggccacaa ggcaggcctc ttccgaccaa ggtggattac cagtgattac ctaattagtt
     2581 ttgagagcgt taaatgagtt cttaaagatc agttgtaatt atagcatagt atctaaactt
     2641 ggcgcgtgtc ttcaaagtta aatattgagt acgattccgt tccagttaac atggatagac
     2701 cttagggagt agcgaaatag gatgttagtg gttttattcc tttaaatcac atctcaaaag
     2761 gccaccaatg gctagtttgg atcttattcc gaaaatagat tgatcctcat gcagtcttcg
     2821 tgaggacaga gcgatttcct tgttgcctac cctgtccata gtgcctggca cataggcact
     2881 gaaacactgc atgttaatcc acaccccacc ccacctatga gtgtagtcaa agctggtaag
     2941 tgacaagggc tttcgtggaa acttggcctg acctaatgtt gggcatcagg ttacccaaag
     3001 agcttcaggg aaatgagaaa ggacttgcag gtcttgatga gaatggaggg gtaactgcca
     3061 atgagggctt tggctttagc gaaagtctga aagggaagcc ataggaactt aaacgtaccg
     3121 actataaagc tctgagaaaa gctgatgttt tagaaagacc atacattcta ggtacaaata
     3181 cctaaaaact aaaaaataag tacgttggcc aggcgggcgg atcacgaagt caggagattg
     3241 agaccatcct gggcccctgg tgaaacccca cctctattaa aaatacaaaa attagctggg
     3301 cgtggtggcg cttgcctgta atctcagcta ctctagaggc tgaggcagga gatcgcttga
     3361 accccggagg cggaggctgc agtgagccga gatcgtgcca ctgcactcca gcctggtgac
     3421 agcgagactc ttgtctcaaa aaaaaaaaag tacattgcta taagagaagt gcacacggat
     3481 actagtagtt aattcagtca catctgtgaa atagcttata aaatgctact tttaaacaag
     3541 ctgtttttat gaaagggctt gtaaatgttt atggtattta agctacctct ctagccataa
     3601 cgtattatac attcaagaaa ggttcaaaac cagatatact agaaaccaat ctttattttt
     3661 taccccacta ctaggtaagg gcctggatac caagaagtga ctgctcatct aatccataaa
     3721 gctatgttaa cagattggag gtagtagcat tttcattaca agtgactaaa agaacagctg
     3781 tttacccctg atcgtgcagc agtgcttgct gttccttaga attttgcctt gtaagttcta
     3841 gctcaagttg gggggtggtg atagacattt aagaagccat atatcttttc agaagtaggt
     3901 gtgatgtact aaaagtttga gacactttct agaagtctca ctatttaagt tatgactagt
     3961 attggatttt tggcatgtct ttgggtttca tgtttcttaa cccaactgcc tgcagggcct
     4021 tatggctgtc aggagcagtt cttgggaatt aaagtaatta ctgaagaagt attctagtga
     4081 gaaaatgaat ttatgactca gaagccccta aagacatggg tactaagcaa caaaataagc
     4141 agatgttaat taactgtaat tttctcttac agctgtttgc agacaaggtc ccaaagacag
     4201 caggttggtc cattttctaa gtttaacaaa gatgttccaa ttgtgacagt ttgtgtgtgt
     4261 gtgtgtatat atatattttt atgtatgtat atatgtgttt aatttttttt taaacagaaa
     4321 attttcgtgc tctgagcact ggagagaaag gatttggtta taagggttcc tgctttcaca
     4381 gaattattcc agggtttatg tgtcaggtac gaaatttact gaattttatt ttatttgggt
     4441 tgctcccttc atttgggatt gagccagaat atttcaggat acacatatct gaactgttac
     4501 tctaccattt cggttctatt taacccttct attcagtttg aacttgggtt taaagtttga
     4561 accttgcaga tttggcacac ttcatggtta tgttgtcaga agtgacattt ttcctatatg
     4621 ttgacagggt ggtgacttca cacgccataa tggcactggt ggcaagtcca tctatgggga
     4681 gaaatttgaa gatgagaact tcatcctaaa gcatacgggt cctggcatct tgtccatggc
     4741 aaatgctgga cccaacacaa atggttccca gtttttcatc tgcactgcca agactgagtg
     4801 gtaagggtac aacatggcac actaaccacc tgactaaatg aaaagttgcc ctggggggaa
     4861 cggaacaaac actacttttc ttcaaccttt gcttccacag actttttcat ccctaagata
     4921 ctagaagaag agcatacata aatgacaaat atagccaatg tgatacagaa tgtcagatac
     4981 tatgatagaa acttggccct tagctgggtg gttgaattag gtgctacttt tttgagatgg
     5041 agttttgctc tgttgccagg ttggagtgca gtggcacaat ctgggctcac tgcaacctct
     5101 gcctcctggg ttcaagcgat tctcctgcct tggcctcctg agtagctgag aatacagatg
     5161 tgtgccagca tgcctggcta attttttgta tttttgtgga gacggggttt catcatgttg
     5221 gccaagctgg tcttgaactc gtgacttaag gtgaaccacc tgccttggcc ccccaaagtg
     5281 ctgggatttc aggcatgagc cactgcgccc aaccaattaa gtgctttttt tttttttttt
     5341 cttttctcag actggatctc gctcttatct cccaggttgg agtgcagtgg tgccatctca
     5401 gctcactgca acctcctccc gggttcaagc aattcttctg cctcagcctc tcaagtagct
     5461 ggaactacag gcatgcacca ccactcccag ctaaattgtg tattattagt agagcgggat
     5521 ttaccatgtt gtccaggctg gtctcgaact cctgggctca agtgatctgc ctgccttgac
     5581 ccccccgaag tgctgggatt acaggcatga gccactgtgc ccacccaatt aagtgctgct
     5641 tttatgttac tattaataac atgcggttgg ttgggttttt tgtttctttg gggtttttgt
     5701 tttgttttgt ttgtttttgg gggagggggg cgcaattcat tctatatgtg taactctttt
     5761 ttgagatgga gtttcgctct gtcgcccagg ctggagtgca gtggcgcgat ctcggctcac
     5821 tgcaagctcc gcctcccagg ttcacgccat tctcctgcct cagcctcccg agtagctggg
     5881 actataggca catgccacca tgcccggcta attttttgta tttttagtag agacagggtt
     5941 tcaccgtgtt agccaggatg gtctcgatct cctgacctcg tgatccgccc gccttggcct
     6001 cccaaagtgc tgggattaca ggcgtgagcc accgcacccg gcctatatgt gtaactcttt
     6061 aatggtaatt ggagaatcat gtttaatgac atttagtaca aaaggcttca gttaaaaaaa
     6121 aaaaaaaaaa gctacctttc tcgtcttggt tcatgacaca tggaggctgc ttgtttgtgg
     6181 ttgccagtca taatgattgt tcttcctttt caaggttgga tggcaagcat gtggtgtttg
     6241 gcaaagtgaa agaaggcatg aatattgtgg aggccatgga gcgctttggg tccaggaatg
     6301 gcaagaccag caagaagatc accattgctg actgtggaca actcgaataa gtttgacttg
     6361 tgttttatct taaccaccag atcattcctt ctgtagctca ggagagcacc cctccacccc
     6421 atttgctcgc agtatcctag aatctttgtg ctctcgctgc agttcccttt gggttccatg
     6481 ttttccttgt tccctcccat gcctagctgg attgcagagt taagtttatg attatgaaat
     6541 aaaaactaaa taacaattgt cctcgtttga gttaagtgtt gatgtaggct ttattttaag
     6601 cagtaatggg ttacttctga aacatcactt gtttgcttaa ttctacacag tacttagatt
     6661 ttttttactt tccagtccca ggaagtgtca atgtttgttg agtggaatat t
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC018034. Homo sapiens, pho...[gi:17390057] Links  


LOCUS       BC018034                1779 bp    mRNA    linear   PRI 06-DEC-2001
DEFINITION  Homo sapiens, phosphatidylinositol-4-phosphate 5-kinase, type II,
            alpha, clone MGC:26205 IMAGE:4817357, mRNA, complete cds.
ACCESSION   BC018034
VERSION     BC018034.1  GI:17390057
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1779)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-DEC-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 32 Row: n Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 6857819.
FEATURES             Location/Qualifiers
     source          1..1779
                     /organism="Homo sapiens"
                     /db_xref="LocusID:5305"
                     /db_xref="taxon:9606"
                     /clone="MGC:26205 IMAGE:4817357"
                     /tissue_type="Brain, hippocampus"
                     /clone_lib="NIH_MGC_95"
                     /lab_host="DH10B"
                     /note="Vector: pBluescript"
     CDS             230..1450
                     /codon_start=1
                     /product="phosphatidylinositol-4-phosphate 5-kinase, type
                     II, alpha"
                     /protein_id="AAH18034.1"
                     /db_xref="GI:17390058"
                     /translation="MATPGNLGSSVLASKTKTKKKHFVAQKVKLFRASDPLLSVLMWG
                     VNHSINELSHVQIPVMLMPDDFKAYSKIKVDNHLFNKENMPSHFKFKEYCPMVFRNLR
                     ERFGIDDQDFQNSLTRSAPLPNDSQARSGARFHTSYDKRYIIKTITSEDVAEMHNILK
                     KYHQYIVECHGITLLPQFLGMYRLNVDGVEIYVIVTRNVFSHRLSVYRKYDLKGSTVA
                     REASDKEKAKELPTLKDNDFINEGQKIYIDDNNKKVFLEKLKKDVEFLAQLKLMDYSL
                     LVGIHDVERAEQEEVECEENDGEEEGESDGTHPVGTPPDSPGNTLNSSPPLAPGEFDP
                     NIDVYGIKCHENSPRKEVYFMAIIDILTHYDAKKKAAHAAKTVKHGAGAEISTVNPEQ
                     YSKRFLDFIGHILT"
BASE COUNT      491 a    416 c    502 g    370 t
ORIGIN      
        1 gcggcgggcg caggatacgg gccggggcgc gagccgagcg cagtctgccg ggcagagcgg
       61 gcggagcgag ccgagtgggg ctgagcgcgc cggcggcggc gggcggagcg gagcgcggcg
      121 cgccggggcc gccgccgggg ggatgcggct gcctccccgg gccggggtgt agagagggcg
      181 ggtccccggc ctcgggagca cggcggtgga ggggacatag gaggcggcca tggcgacccc
      241 cggcaaccta gggtcctctg tcctggcgag caagaccaag accaagaaga agcacttcgt
      301 agcgcagaaa gtgaagctgt ttcgggccag cgacccgctg ctcagcgtcc tcatgtgggg
      361 ggtaaaccac tcgatcaatg aactgagcca tgttcaaatc cctgttatgt tgatgccaga
      421 tgacttcaaa gcctattcaa aaataaaggt ggacaatcac ctttttaaca aagaaaacat
      481 gccgagccat ttcaagttta aggaatactg cccgatggtc ttccgtaacc tgcgggagag
      541 gtttggaatt gatgatcaag atttccagaa ttccctgacc aggagcgcac ccctccccaa
      601 cgactcccag gcccgcagtg gagctcgttt tcacacttcc tacgacaaaa gatacatcat
      661 caagactatt accagtgaag acgtggccga aatgcacaac atcctgaaga aataccacca
      721 gtacatagtg gaatgtcatg ggatcaccct tcttccccag ttcttgggca tgtaccggct
      781 taatgttgat ggagttgaaa tatatgtgat agttacaaga aatgtattca gccaccgttt
      841 gtctgtgtat aggaaatacg acttaaaggg ctctacagtg gctagagaag ctagtgacaa
      901 agaaaaggcc aaagaactgc caactctgaa agataatgat ttcattaatg agggccaaaa
      961 gatttatatt gatgacaaca acaagaaggt cttcctggaa aaactaaaaa aggatgttga
     1021 gtttctggcc cagctgaagc tcatggacta cagtctgctg gtgggaattc atgatgtgga
     1081 gagagccgaa caggaggaag tggagtgtga ggagaacgat ggggaggagg agggcgagag
     1141 cgatggcacc cacccggtgg gaaccccccc agatagcccc gggaatacac tgaacagctc
     1201 accacccctg gctcccgggg agttcgatcc gaacatcgac gtctatggaa ttaagtgcca
     1261 tgaaaactcg cctaggaagg aggtgtactt catggcaatt attgacatcc ttactcatta
     1321 tgatgcaaaa aagaaagctg cccatgctgc aaaaactgtt aaacatggcg ctggcgcgga
     1381 gatctccacc gtgaacccag aacagtattc aaagcgcttt ttggacttta ttggccacat
     1441 cttgacgtaa cctcctgcgc agcctcggac agacatgaac attggaggga cagaggtggc
     1501 ttcggtgtag gaaaaatgaa aaccaaactc agtgaagtac tcatcttgca ggaagcaaac
     1561 ctccttgttt acatcttcag gccaagatga ctgatttggg ggctactcgc tttacagcta
     1621 cctgattttc ccagcatcgt tctagctatt tctgactttg tgtatatgtg tgtgtgtgtg
     1681 ttgggggggg tgagtgtgtg cgcgcgtgtg cattttaaaa gtcataaatt aattaaaaca
     1741 gatccacttc ggtcaaaaaa aaaaagaaaa aaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U56637. Human capping pro...[gi:1336098] Links  


LOCUS       HSU56637                2385 bp    mRNA    linear   PRI 04-FEB-1998
DEFINITION  Human capping protein alpha subunit isoform 1 mRNA, complete cds.
ACCESSION   U56637
VERSION     U56637.1  GI:1336098
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2385)
  AUTHORS   Hart,M.C., Korshunova,Y.O. and Cooper,J.A.
  TITLE     Vertebrates have conserved capping protein alpha isoforms with
            specific expression patterns
  JOURNAL   Cell Motil. Cytoskeleton 38 (2), 120-132 (1997)
  MEDLINE   97470757
   PUBMED   9331217
REFERENCE   2  (bases 1 to 2385)
  AUTHORS   Hart,M.C., Korshunova,Y.O. and Cooper,J.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-APR-1996) Marilyn C. Hart, Cell Biology & Physiology,
            Washington University, Box 8228, 660 S. Euclid Ave., St. Louis, MO
            63110, USA
FEATURES             Location/Qualifiers
     source          1..2385
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     source          1..33
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="testis"
                     /note="derived by RACE from RNA"
     source          34..2385
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="G4021"
                     /tissue_type="heart"
                     /dev_stage="fetal"
                     /note="see also EST sequence, GenBank Accession Number
                     R58525"
     source          812..2385
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="146582"
                     /tissue_type="placenta"
     CDS             1..861
                     /function="binds barbed ends of actin filaments"
                     /codon_start=1
                     /product="capping protein alpha subunit isoform 1"
                     /protein_id="AAC00533.1"
                     /db_xref="GI:1336099"
                     /translation="MADFDDRVSDEEKVRIAAKFITHAPPGEFNEVFNDVRLLLNNDN
                     LLREGAAHAFAQYNMDQFTPVKIEGYEDQVLITEHGDLGNSRFLDPRNKISFKFDHLR
                     KEASDPQPEEADGGLKSWRESCDSALRAYVKDHYSNGFCTVYAKTIDGQQTIIACIES
                     HQFQPKNFWNGRWRSEWKFTITPPTAQVVGVLKIQVHYYEDGNVQLVSHKDVQDSLTV
                     SNEAQTAKEFIKIIENAENEYQTAISENYQTMSDTTFKALRRQLPVTRTKIDWNKILS
                     YKIGKEMQNA"
     polyA_signal    2268..2273
BASE COUNT      724 a    448 c    447 g    766 t
ORIGIN      
        1 atggccgact tcgatgatcg tgtgtcggat gaggagaagg tacgcatagc tgctaaattc
       61 atcactcatg cacccccagg ggaatttaat gaagtattca atgacgttcg gctactactt
      121 aataatgaca atctcctcag ggaaggggca gcacatgcat ttgcccagta taacatggat
      181 cagttcacgc ctgtgaagat agaaggatat gaagatcagg tcttaattac agagcacggt
      241 gacctgggta atagcagatt tttagatcca agaaacaaaa tttcctttaa atttgaccac
      301 ttacggaaag aagcaagtga cccccagcca gaagaagcag atggaggtct gaagtcttgg
      361 agagaatcct gtgacagtgc tttaagagcc tatgtgaaag accattattc caacggcttc
      421 tgtactgttt atgctaaaac tatcgatggg caacagacta ttattgcatg tattgaaagc
      481 caccagtttc agcctaaaaa cttctggaat ggtcgttgga gatcagagtg gaagttcacc
      541 atcacaccac ctacagccca ggtggttggc gtgcttaaga ttcaggttca ctattatgaa
      601 gatggcaatg ttcagttggt tagtcataaa gatgtacagg attcactaac tgtttcgaat
      661 gaagcccaaa ctgccaagga gtttattaaa atcatagaga atgcagaaaa tgagtatcag
      721 acagcaatta gtgaaaacta tcaaacaatg tcagatacca cattcaaggc cttgcgccgc
      781 cagcttccag ttacccgcac caaaatcgac tggaacaaga tactcagcta caagattggc
      841 aaagaaatgc agaatgctta aaggctgaat gtaggattct tcagtatgtg gaaagacaag
      901 gattcaacgt gtggtcatat gataaataag tgatttataa acaagagtga tattttgcta
      961 gggctttcaa agttaaccgg ttttctagcc tcatggaata ctgttgaacc tatagcgttg
     1021 tcttgattct tttgtgttct ctgccttgta attttctgtt actgctatat ctacgtgtaa
     1081 atcttttttt cttttttttt tttttttttt ttcttttttg gttaattctg ccacatttaa
     1141 tgttggtgag agagtgatct atcctaatga catttactgt ttaaaaaagt ttcctagcca
     1201 tgaagccctg ctactgattt agacaaggta ttatggtcat tactttgtac ccctatcctt
     1261 ccaagcactt ctggtacttc agtcgttttt actgatccac caacacctaa agaggctatg
     1321 ctacagtctc tagctaaatg gaagacacat tcatccttct ccctctgact gctttgatca
     1381 tcatttattg catcgtcata tcatatttat cgcatctcat aactaacttt ctaaagtttg
     1441 gattgggact tttcaggtcc tttttggagg gcaaaggaag ttccagcttc tctggggaac
     1501 ttgtttttaa atccaaagac ttgaaccaca ttccctgcac atgaacatgt ttgcttttat
     1561 cccttctctc attggctcct tcccatctta gtaccattgt agttatacat ctgcattttt
     1621 tagaagcatt ttacccattt atttttttaa acattcaaga actgctgacg tactgtggat
     1681 gtagagtata aaacttgaaa aatgcagatg ttgaaggaat aataggtatc ttgtgcttta
     1741 atactttatg gcaggattgt actataagca aatgaattaa acagctatgt aaatcataaa
     1801 gaaaaactaa aaatgaacca aagtgaaagg ataacttcca ggcagtatct ttctattgta
     1861 acctgttatt taaggaaata ctagtgattt cttctaaata ggatgtaaac ttctttcaaa
     1921 ttactcttcc tcagtctgcc tgccaagaac tcaagtgtaa ctgtgataaa ataacctttc
     1981 ccaggtatat tcggcaggta tgtgtgtaat ctcagaatac acaggtgaca tagatatgat
     2041 atgacaactg gtaatggtgg attcatttac attgtttaca cttctatgac caggccttaa
     2101 gggaaggtca gttttttaaa aaaccaagta gtgtcttcct acctatctcc agatacatgt
     2161 caaaaagaaa aggtgtttgt gctccgtttt gtttctgctc agtaatatag tcaagcaagt
     2221 ttgttccagg tgacccattg agctgtgtat gcatttttgt ttatttcaat aaaatatatt
     2281 tgtattattt gtccttcata ctatccatcc ataccacact atcttctgta tcaggtagtc
     2341 taatagaaat atacctgttt tgttctaaaa aaaaaaaaaa aaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X63613. H.sapiens mRNA fo...[gi:35796] Links  


LOCUS       HSPTX3R                 1837 bp    mRNA    linear   PRI 29-JUL-1993
DEFINITION  H.sapiens mRNA for pentaxin (PTX3).
ACCESSION   X63613 S47824
VERSION     X63613.1  GI:35796
KEYWORDS    pentaxin; PTX3 gene.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1837)
  AUTHORS   Breviario,F.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JAN-1992) F. Breviario, Istitute Ricerche Farmacol.
            Mario Negri, Via Eritrea, 62, 20157 Milano, ITALY
REFERENCE   2  (bases 1 to 1837)
  AUTHORS   Breviario,F., d'Aniello,E.M., Golay,J., Peri,G., Bottazzi,B.,
            Bairoch,A., Saccone,S., Marzella,R., Predazzi,V., Rocchi,M. et al.
  TITLE     Interleukin-1-inducible genes in endothelial cells. Cloning of a
            new gene related to C-reactive protein and serum amyloid P
            component
  JOURNAL   J. Biol. Chem. 267 (31), 22190-22197 (1992)
  MEDLINE   93054498
   PUBMED   1429570
FEATURES             Location/Qualifiers
     source          1..1837
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="PTX3/E"
                     /cell_line="HUVEC"
                     /clone_lib="ZAP-FB1"
     gene            68..1213
                     /gene="PTX3"
     CDS             68..1213
                     /gene="PTX3"
                     /codon_start=1
                     /product="pentaxin"
                     /protein_id="CAA45158.1"
                     /db_xref="GI:35797"
                     /db_xref="SWISS-PROT:P26022"
                     /translation="MHLLAILFCALWSAVLAENSDDYDLMYVNLDNEIDNGLHPTEDP
                     TPCDCGQEHSEWDKLFIMLENSQMRERMLLQATDDVLRGELQRLREELGRLAESLARP
                     CAPGAPAEARLTSALDELLQATRDAGRRLARMEGAEAQRPEEAGRALAAVLEELRQTR
                     ADLHAVQGWAARSWLPAGCETAILFPMRSKKIFGSVHPVRPMRLESFSACIWVKATDV
                     LNKTILFSYGTKRNPYEIQLYLSYQSIVFVVGGEENKLVAEAMVSLGRWTHLCGTWNS
                     EEGLTSLWVNGELAATTVEMATGHIVPEGGILQIGQEKNGCCVGGGFDETLAFSGRLT
                     GFNIWDSVLSNEEIRETGGAESCHIRGNIVGWGVTEIQPHGGAQYVS"
     polyA_signal    1802..1807
     polyA_site      1822
BASE COUNT      507 a    370 c    500 g    460 t
ORIGIN      
        1 ctcaaactca gctcacttga gagtctcctc ccgccagctg tggaaagaac tttgcgtctc
       61 tccagcaatg catctccttg cgattctgtt ttgtgctctc tggtctgcag tgttggccga
      121 gaactcggat gattatgatc tcatgtatgt gaatttggac aacgaaatag acaatggact
      181 ccatcccact gaggacccca cgccgtgcga ctgcggtcag gagcactcgg aatgggacaa
      241 gctcttcatc atgctggaga actcgcagat gagagagcgc atgctgctgc aagccacgga
      301 cgacgtcctg cggggcgagc tgcagaggct gcgggaggag ctgggccggc tcgcggaaag
      361 cctggcgagg ccgtgcgcgc cgggggctcc cgcagaggcc aggctgacca gtgctctgga
      421 cgagctgctg caggcgaccc gcgacgcggg ccgcaggctg gcgcgtatgg agggcgcgga
      481 ggcgcagcgc ccagaggagg cggggcgcgc cctggccgcg gtgctagagg agctgcggca
      541 gacgcgagcc gacctgcacg cggtgcaggg ctgggctgcc cggagctggc tgccggcagg
      601 ttgtgaaaca gctattttat tcccaatgcg ttccaagaag atttttggaa gcgtgcatcc
      661 agtgagacca atgaggcttg agtcttttag tgcctgcatt tgggtcaaag ccacagatgt
      721 attaaacaaa accatcctgt tttcctatgg cacaaagagg aatccatatg aaatccagct
      781 gtatctcagc taccaatcca tagtgtttgt ggtgggtgga gaggagaaca aactggttgc
      841 tgaagccatg gtttccctgg gaaggtggac ccacctgtgc ggcacctgga attcagagga
      901 agggctcaca tccttgtggg taaatggtga actggcggct accactgttg agatggccac
      961 aggtcacatt gttcctgagg gaggaatcct gcagattggc caagaaaaga atggctgctg
     1021 tgtgggtggt ggctttgatg aaacattagc cttctctggg agactcacag gcttcaatat
     1081 ctgggatagt gttcttagca atgaagagat aagagagacc ggaggagcag agtcttgtca
     1141 catccggggg aatattgttg ggtggggagt cacagagatc cagccacatg gaggagctca
     1201 gtatgtttca taaatgttgt gaaactccac ttgaagccaa agaaagaaac tcacacttaa
     1261 aacacatgcc agttgggaag gtctgaaaac tcagtgcata ataggaacac ttgagactaa
     1321 tgaaagagag agttgagacc aatctttatt tgtactggcc aaatactgaa taaacagttg
     1381 aaggaaagac attggaaaaa gcttttgagg ataatgttac tagactttat gccatggtgc
     1441 tttcagttta atgctgtgtc tctgtcagat aaactctcaa ataattaaaa aggactgtat
     1501 tgttgaacag agggacaatt gttttacttt tctttggtta attttgtttt ggccagagat
     1561 gaattttaca ttggaagaat aacaaaataa gatttgttgt ccattgttca ttgttattgg
     1621 tatgtacctt attacaaaaa aaatgatgaa aacatattta tactacaagg tgacttaaca
     1681 actataaatg tagtttatgt gttataatcg aatgtcacgt ttttgagaag atagtcatat
     1741 aagttatatt gcaaaaggga tttgtattaa tttaagacta tttttgtaaa gctctactgt
     1801 aaataaaata ttttataaaa ctaaaaaaaa aaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D21878. Human mRNA for BS...[gi:506334] Links  


LOCUS       D21878                  1411 bp    mRNA    linear   PRI 01-FEB-2000
DEFINITION  Human mRNA for BST-1, complete cds.
ACCESSION   D21878
VERSION     D21878.1  GI:506334
KEYWORDS    BST-1; pre-B cell growth; CD157.
SOURCE      Homo sapiens bone marrow stromal cell cDNA to mRNA, clone BST-1.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1411)
  AUTHORS   Kaisho,T., Ishikawa,J., Oritani,K., Inazawa,J., Tomizawa,H.,
            Muraoka,O., Ochi,T. and Hirano,T.
  TITLE     BST-1, a surface molecule of bone marrow stromal cell lines that
            facilitates pre-B-cell growth
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 91 (12), 5325-5329 (1994)
  MEDLINE   94261578
REFERENCE   2  (bases 1 to 1411)
  AUTHORS   Hirano,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-NOV-1993) Toshio Hirano, Osaka Univ. Med. Sch.,
            Division of Molecular Oncology; 2-2, Yamadaoka, Suita, Osaka 565,
            Japan (Tel:06-879-3880, Fax:06-879-3889)
FEATURES             Location/Qualifiers
     source          1..1411
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="4p15"
                     /cell_type="bone marrow stromal cell"
     CDS             128..1084
                     /codon_start=1
                     /product="BST-1 precursor"
                     /protein_id="BAA04885.1"
                     /db_xref="GI:999429"
                     /translation="MAAQGCAASRLLQLLLQLLLLLLLLAAGGARARWRAEGTSAHLR
                     DIFLGRCAEYRALLSPEQRNKNCTAIWEAFKVALDKDPCSVLPSDYDLFINLSRHSIP
                     RDKSLFWENSHLLVNSFADNTRRFMPLSDVLYGRVADFLSWCRQKNDSGLDYQSCPTS
                     EDCENNPVDSFWKRASIQYSKDSSGVIHVMLNGSEPTGAYPIKGFFADYEIPNLQKEK
                     ITRIEIWVMHEIGGPNVESCGEGSMKVLEKRLKDMGFQYSCINDYRPVKLLQCVDHST
                     HPDCALKSAAAATQRKAPSLYTEQRAGLIIPLFLVLASRTQL"
     sig_peptide     128..211
     mat_peptide     212..1081
                     /product="BST-1"
                     /function="facilitate pre-B-cell growth"
BASE COUNT      363 a    339 c    362 g    347 t
ORIGIN      
        1 cgggaaacgg caaacagcga gatatccgag cgagagtccc gccctgcatc agtttgcgga
       61 accgccttgg tagaaggaga gaaggggagt ggaggaagca cgggactgga gggaccaaag
      121 ttccccgatg gcggcccagg ggtgcgcggc atcgcggctg ctccagctgc tgctgcagct
      181 tctgcttcta ctgttgctgc tggcggcggg cggggcgcgc gcgcggtggc gcgcggaggg
      241 caccagcgca cacttgcggg acatcttcct gggccgctgc gccgagtacc gcgcactgct
      301 gagtcccgag cagcggaaca agaactgcac agccatctgg gaagccttta aagtggcgct
      361 ggacaaggat ccctgctccg tgctgccctc agactatgac ctttttatta acttgtccag
      421 gcactctatt cccagagata agtccctgtt ctgggaaaat agccacctcc ttgttaacag
      481 ctttgcagac aacacccgtc gttttatgcc cctgagcgat gttctgtatg gcagggttgc
      541 agatttcttg agctggtgtc gacagaaaaa tgactctgga ctcgattacc aatcctgccc
      601 tacatcagaa gactgtgaaa ataatcctgt ggattccttt tggaaaaggg catccatcca
      661 gtattccaag gatagttctg gggtgatcca cgtcatgctg aatggttcag agccaacagg
      721 agcctatccc atcaaaggtt tttttgcaga ttatgaaatt ccaaacctcc agaaggaaaa
      781 aattacacga atcgagatct gggttatgca tgaaattggg ggacccaatg tggaatcctg
      841 cggggaaggc agcatgaaag tcctggaaaa gaggctgaag gacatggggt tccagtacag
      901 ctgtattaat gattaccgac cagtgaagct cttacagtgc gtggaccaca gcacccatcc
      961 tgactgtgcc ttaaagtcgg cagcagccgc tactcaaaga aaagccccaa gtctttatac
     1021 agaacaaagg gcgggtctta tcattcccct ctttctggtg ctggcttccc ggactcaact
     1081 gtaactggaa actgtgttgc tctaaccctc ctccagccct gcagcctccc cttgcagtca
     1141 tcattcgtgt tctgtgtata ccaaatgatt ctgttatcta aagaagcttt ttgctgggaa
     1201 aacgatgtcc tgaaaatggt atttcaatga ggcatatgtt caggatttca gaaacaagaa
     1261 gttagttcta tttagcaggt taaaaaatgc tgcattagaa ttaaagcaag ttattttctt
     1321 atttgtataa tgacacaaag cattgggagt cagactgctt gtatattatc aaacatttta
     1381 agagaattct aataaagctg tattttacat c
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U54617. Human pyruvate de...[gi:1399196] Links  


LOCUS       HSU54617                1798 bp    mRNA    linear   PRI 06-SEP-1996
DEFINITION  Human pyruvate dehydrogenase kinase isoform 4 mRNA, complete cds.
ACCESSION   U54617
VERSION     U54617.1  GI:1399196
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1798)
  AUTHORS   Rowles,J., Scherer,S.W., Xi,T., Majer,M., Nickle,D.C.,
            Rommens,J.M., Popov,K.M., Harris,R.A., Riebow,N.L., Xia,J.,
            Tsui,L., Bogardus,C. and Prochazka,M.
  TITLE     Cloning and characterization of PDK4 on 7q21.3 encoding a fourth
            pyruvate dehydrogenase kinase isoenzyme in human
  JOURNAL   J. Biol. Chem. 271 (37), 22376-22382 (1996)
  MEDLINE   96394293
   PUBMED   8798399
REFERENCE   2  (bases 1 to 1798)
  AUTHORS   Prochazka,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-APR-1996) Michal Prochazka, CDNS/PECRB, NIDDK/NIH,
            4212 N. 16th Street, Phoenix, AZ 85016, USA
FEATURES             Location/Qualifiers
     source          1..1798
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="7"
                     /map="7q21.3"
                     /tissue_type="frontal cortex"
     misc_feature    1..50
                     /note="determined by 5' RACE"
     CDS             223..1458
                     /codon_start=1
                     /product="pyruvate dehydrogenase kinase isoform 4"
                     /protein_id="AAC50669.1"
                     /db_xref="GI:1399197"
                     /translation="MKAARFVLRSAGSLNGAGLVPREVEHFSRYSPSPLSMKQLLDFG
                     SENACERTSFAFLRQELPVRLANILKEIDILPTQLVNTSSVQLVKSWYIQSLMDLVEF
                     HEKSPDDQKALSDFVDTLIKVRNRHHNVVPTMAQGIIEYKDACTVDPVTNQNLQYFLD
                     RFYMNRISTRMLMNQHILIFSDSQTGNPSHIGSIDPNCDVVAVVQDAFECSRMLCDQY
                     YLSSPELKLTQVNGKFPDQPIHIVYVPSHLHHMLFELFKNAMRATVEHQENQPSLTPI
                     EVIVVLGKEDLTIKISDRGGGVPLRIIDRLFSYTYSTAPTPVMDNSRNAPLAGFGYGL
                     PISRLYAKYFQGDLNLYSLSGYGTDAIIYLKALSSESIEKLPVFNKSAFKHYQMSSEA
                     DDWCIPSREPKNLAKEVAM"
BASE COUNT      500 a    440 c    393 g    465 t
ORIGIN      
        1 agacttgaac ttgaatctcg aaccactgca tctccgactc tgcccagact cttcactccg
       61 cggcaccctc aaaccccagc ccaggccggg gcgcacaagc cagccagcgc acctgcagtc
      121 ctcgcccgga cgcgccgcgc cccctcggaa ccaggctctg ctccgagcag ccttcgcccc
      181 tcaagccagc cacagtcccc gccaggccgg gtgggcgtca agatgaaggc ggcccgcttc
      241 gtgctgcgca gcgctggctc gctcaacggc gccggcctgg tgccccgaga ggtggagcat
      301 ttctcgcgct acagcccgtc cccgctgtcc atgaagcagc tactggactt tggttcagaa
      361 aatgcatgtg aaagaacttc ttttgcattt ttgcgacaag aattgcctgt gagactcgcc
      421 aacattctga aggaaattga tatcctcccg acccaattag taaatacctc ttcagtgcaa
      481 ttggttaaaa gctggtatat acagagcctg atggatttgg tggaattcca tgagaaaagc
      541 ccagatgacc agaaagcatt atcagacttt gtagatacac tcatcaaagt tcgaaataga
      601 caccataatg tagtccctac aatggcacaa ggaatcatag agtataaaga tgcctgtaca
      661 gttgacccag tcaccaatca aaatcttcaa tatttcttgg atcgatttta catgaaccgt
      721 atttctactc ggatgctgat gaaccagcac attcttatat ttagtgactc acagacagga
      781 aacccaagcc acattggaag cattgatcct aactgtgatg tggtagcagt ggtccaagat
      841 gcctttgagt gttcaaggat gctctgtgat cagtattatt tatcatctcc agaattaaag
      901 cttacacaag tgaatggaaa atttccagac caaccaattc acatcgtgta tgttccttct
      961 cacctccatc atatgctctt tgaactattt aagaatgcaa tgcgggcaac agttgaacac
     1021 caggaaaatc agccttccct tacaccaata gaggttattg ttgtcttggg aaaagaagac
     1081 cttaccatta agatttcaga cagaggaggt ggtgttcccc tgagaattat tgaccgcctc
     1141 tttagttata catactccac tgcaccaacg cctgtgatgg ataattcccg gaatgctcct
     1201 ttggctggtt ttggttacgg cttgccaatt tctcgtctgt atgcaaagta ctttcaagga
     1261 gatctgaatc tctactcttt atcaggatat ggaacagatg ctatcatcta cttaaaggct
     1321 ttgtcttctg agtctataga aaaacttcca gtttttaaca agtcagcctt caaacattat
     1381 cagatgagct ctgaggctga tgactggtgt atcccaagca gggaaccaaa gaacctggca
     1441 aaagaagtgg ccatgtgaag agggacactc aggacacttt acgggatcaa agtgggtcta
     1501 caccagtgct gcttcctgaa tgtttgtgtg tgaacccttg tttcctccaa aacaaacgac
     1561 agcaacgaaa actccttaat cagaacactg atccaatgag gaatggagct tgtttctgtg
     1621 acccaggaga acttagtgca agactacagg agttaacaga tggccagctc cttatttttt
     1681 aatgtagaat aactcctgag tttatatcaa atcctgaaga aataagcctc agttttccat
     1741 ctgtttttga taagaataag aaagggagtg agtgtgaaga tggtggttag cagtttcg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC012064. Homo sapiens, pro...[gi:15082316] Links  


LOCUS       BC012064                3292 bp    mRNA    linear   PRI 06-AUG-2001
DEFINITION  Homo sapiens, proprotein convertase subtilisin/kexin type 5, clone
            MGC:19910 IMAGE:4562734, mRNA, complete cds.
ACCESSION   BC012064
VERSION     BC012064.1  GI:15082316
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3292)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
            Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
            Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
            Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
            Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
            George Yang, Scott Zuyderduyn, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 29 Row: f Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 9296928.
FEATURES             Location/Qualifiers
     source          1..3292
                     /organism="Homo sapiens"
                     /db_xref="LocusID:5125"
                     /db_xref="taxon:9606"
                     /clone="MGC:19910 IMAGE:4562734"
                     /tissue_type="Kidney, renal cell adenocarcinoma"
                     /clone_lib="NIH_MGC_14"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             478..3219
                     /codon_start=1
                     /product="proprotein convertase subtilisin/kexin type 5"
                     /protein_id="AAH12064.1"
                     /db_xref="GI:15082317"
                     /translation="MGWGSRCCCPGRLDLLCVLALLGGCLLPVCRTRVYTNHWAVKIA
                     GGFPEANRIASKYGFINIGQIGALKDYYHFYHSRTIKRSVISSRGTHSFISMEPKVEW
                     IQQQVVKKRTKRDYDFSRAQSTYFNDPKWPSMWYMHCSDNTHPCQSDMNIEGAWKRGY
                     TGKNIVVTILDDGIERTHPDLMQNYDALASCDVNGNDLDPMPRYDASNENKHGTRCAG
                     EVAAAANNSHCTVGIAFNAKIGGVRMLDGDVTDMVEAKSVSFNPQHVHIYSASWGPDD
                     DGKTVDGPAPLTRQAFENGVRMGRRGLGSVFVWASGNGGRSKDHCSCDGYTNSIYTIS
                     ISSTAESGKKPWYLEECSSTLATTYSSGESYDKKIITTDLRQRCTDNHTGTSASAPMA
                     AGIIALALEANPFLTWRDVQHVIVRTSRAGHLNANDWKTNAAGFKVSHLYGFGLMDAE
                     AMVMEAEKWTTVPRQHVCVESTDRQIKTIRPNSAVRSIYKASGCSDNPNRHVNYLEHV
                     VVRITITHPRRGDLAIYLTSPSGTRSQLLANRLFDHSMEGFKNWEFMTIHCWGERAAG
                     DWVLEVYDTPSQLRNFKTPGKLKEWSLVLYGTSVQPYSPTNEFPKVERFRYSRVEDPT
                     DDYGTEDYAGPCDPECSEVGCDGPGPDHCNDCLHYYYKLKNNTRICVSSCPPGHYHAD
                     KKRCRKCAPNCESCFGSHGDQCMSCKYGYFLNEETNSCVTHCPDGSYQDTKKNLCRKC
                     SENCKTCTEFHNCTECRDGLSLQGSRCSVSCEDGRYFNGQDCQPCHRFCATCAGAGAD
                     GCINCTEGYFMEDGRCVQSCSISYYFDHSSENGYKSCKKCDISCLTCNGPGFKNCTSC
                     PSGYLLDLGMCQMGAICKDATEESWAEGGFCMLVKKNNLCQRKVLQQLCCKTCTFQG"
BASE COUNT      826 a    870 c    937 g    659 t
ORIGIN      
        1 cggagggagc gctgggagcg agcaagcgag cgtttggagc ccgggccagc agagggggcg
       61 cccggtcgct gcctgtaccg ctcccgctgg tcatctccgc cgcgctcggg ggccccggga
      121 ggagcgagac cgagtcggag agtccgggag ccaagccggg cgaaacccaa ctgcggagga
      181 cgcccgcccc actcagcctc ctcctgcgtc cgagccgggg agcatcgccg agcgccccac
      241 gggccggaga gctgggagca caggtcccgg cagccccagg gatggtctag gagccggcgt
      301 aaggctcgct gctctgctcc ctgccggggc tagccgcctc ctgccgatcg cccggggctg
      361 cgagctgcgg cggcccgggg ctgctcgccg ggcggcgcag gccggagaag ttagttgtgc
      421 gcgcccttag tgcgcggaac cagccagcga gcgagggagc agcgaggcgc cgggaccatg
      481 ggctggggga gccgctgctg ctgcccggga cgtttggacc tgctgtgcgt gctggcgctg
      541 ctcgggggct gcctgctccc cgtgtgtcgg acgcgcgtct acaccaacca ctgggcagtc
      601 aaaatcgccg ggggcttccc ggaggccaac cgtatcgcca gcaagtacgg attcatcaac
      661 ataggacaga taggggccct gaaggactac taccacttct accatagcag gacgattaaa
      721 aggtcagtta tctcgagcag agggacccac agtttcattt caatggaacc aaaggtggaa
      781 tggatccaac agcaagtggt aaaaaagcgg acaaagaggg attatgactt cagtcgtgcc
      841 cagtctacct atttcaatga tcccaagtgg cccagcatgt ggtatatgca ctgcagtgac
      901 aatacacatc cctgccagtc tgacatgaat atcgaaggag cctggaagag aggctacacg
      961 ggaaagaaca ttgtggtcac tatcctggat gacggaattg agagaaccca tccagatctg
     1021 atgcaaaact acgatgctct ggcaagttgc gacgtgaatg ggaatgactt ggacccaatg
     1081 cctcgttatg atgcaagcaa cgagaacaag catgggactc gctgtgctgg agaagtggca
     1141 gccgctgcaa acaattcgca ctgcacagtc ggaattgctt tcaacgccaa gatcggagga
     1201 gtgcgaatgc tggacggaga tgtcacggac atggttgaag caaaatcagt tagcttcaac
     1261 ccccagcacg tgcacattta cagcgccagc tggggcccgg atgatgatgg caagactgtg
     1321 gacggaccag cccccctcac ccggcaagcc tttgaaaacg gcgttagaat ggggcggaga
     1381 ggcctcggct ctgtgtttgt ttgggcatct ggaaatggtg gaaggagcaa agaccactgc
     1441 tcctgtgatg gctacaccaa cagcatctac accatctcca tcagcagcac tgcagaaagc
     1501 ggaaagaaac cttggtacct ggaagagtgt tcatccacgc tggccacaac ctacagcagc
     1561 ggggagtcct acgataagaa aatcatcact acagatctga ggcagcgttg cacggacaac
     1621 cacactggga cgtcagcctc agcccccatg gctgcaggca tcattgcgct ggccctggaa
     1681 gccaatccgt ttctgacctg gagagacgta cagcatgtta ttgtcaggac ttcccgtgcg
     1741 ggacatttga acgctaatga ctggaaaacc aatgctgctg gttttaaggt gagccatctt
     1801 tatggatttg gactgatgga cgcagaagcc atggtgatgg aggcagagaa gtggaccacc
     1861 gttccccggc agcacgtgtg tgtggagagc acagaccgac aaatcaagac aatccgccct
     1921 aacagtgcag tgcgctccat ctacaaagct tcaggctgct cggataaccc caaccgccat
     1981 gtcaactacc tggagcacgt cgttgtgcgc atcaccatca cccaccccag gagaggagac
     2041 ctggccatct acctgacctc gccctctgga actaggtctc agcttttggc caacaggcta
     2101 tttgatcact ccatggaagg attcaaaaac tgggagttca tgaccattca ttgctgggga
     2161 gaaagagctg ctggtgactg ggtccttgaa gtttatgata ctccctctca gctaaggaac
     2221 tttaagactc caggtaaatt gaaagaatgg tctttggtcc tctacggcac ctccgtgcag
     2281 ccatattcac caaccaatga atttccgaaa gtggaacggt tccgctatag ccgagttgaa
     2341 gaccccacag acgactatgg cacagaggat tatgcaggtc cctgcgaccc tgagtgcagt
     2401 gaggttggct gtgacgggcc aggaccagac cactgcaatg actgtttgca ctactactac
     2461 aagctgaaaa acaataccag gatctgtgtc tccagctgcc cccctggcca ctaccacgcc
     2521 gacaagaagc gctgcaggaa gtgtgccccc aactgtgagt cctgctttgg gagccatggt
     2581 gaccaatgca tgtcctgcaa atatggatac tttctgaatg aagaaaccaa cagctgtgtt
     2641 actcactgcc ctgatgggtc atatcaggat accaagaaaa atctttgccg gaaatgcagt
     2701 gaaaactgca agacatgtac tgaattccat aactgtacag aatgtaggga tgggttaagc
     2761 ctgcagggat cccggtgctc tgtctcctgt gaagatggac ggtatttcaa cggccaggac
     2821 tgccagccct gccaccgctt ctgcgccact tgtgctgggg caggagctga tgggtgcatt
     2881 aactgcacag agggctactt catggaggat gggagatgcg tgcagagctg tagtatcagc
     2941 tattactttg accactcttc agagaatgga tacaaatcct gcaaaaaatg tgatatcagt
     3001 tgtttgacgt gcaatggccc aggattcaag aactgtacaa gctgccctag tgggtatctc
     3061 ttagacttag gaatgtgtca aatgggagcc atttgcaagg atgcaacgga agagtcctgg
     3121 gcggaaggag gcttctgtat gcttgtgaaa aagaacaatc tgtgccaacg gaaggttctt
     3181 caacaacttt gctgcaaaac atgtacattt caaggctgag cagccatctt agatttcttt
     3241 gttcctgtag acttatagat tattccatat tattaaaaaa aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X71087. H.sapiens NC28 mR...[gi:288396] Links  


LOCUS       HSMCP3                   810 bp    mRNA    linear   PRI 07-AUG-1993
DEFINITION  H.sapiens NC28 mRNA for monocyte chemoattractant protein (MCP-3).
ACCESSION   X71087
VERSION     X71087.1  GI:288396
KEYWORDS    cytokine; monocyte-chemoattractant protein.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Minty,A., Chalon,P., Guillemot,J.C., Kaghad,M., Liauzun,P.,
            Magazin,M., Miloux,B., Minty,C., Ramond,P., Vita,N., Lupker,J.,
            Shire,D., Ferrara,P. and Caput,D.
  TITLE     Molecular cloning of the MCP-3 chemokine gene and regulation of its
            expression
  JOURNAL   Eur. Cytokine Netw. 4 (2), 99-110 (1993)
  MEDLINE   93305913
REFERENCE   2  (bases 1 to 810)
  AUTHORS   Minty,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAR-1993) A. Minty, Sanofi Elf Bio Recherches, Labege
            BP 137, 31676 Labege Cedex, FRANCE
FEATURES             Location/Qualifiers
     source          1..810
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_type="peripheral blood monocytes"
     gene            41..370
                     /gene="NC28"
     CDS             41..370
                     /gene="NC28"
                     /note="alternative"
                     /codon_start=1
                     /product="monocyte chemoattractant protein (MCP-3)"
                     /protein_id="CAA50405.1"
                     /db_xref="GI:288397"
                     /db_xref="SWISS-PROT:P80098"
                     /translation="MWKPMPSPSNMKASAALLCLLLTAAAFSPQGLAQPVGINTSTTC
                     CYRFINKKIPKQRLESYRRTTSSHCPREAVIFKTKLDKEICADPTQKWVQDFMKHLDK
                     KTQTPKL"
     CDS             53..370
                     /gene="NC28"
                     /note="alternative"
                     /codon_start=1
                     /product="monocyte chemoattractant protein (MCP-3)"
                     /protein_id="CAA50406.1"
                     /db_xref="GI:288398"
                     /db_xref="SWISS-PROT:P80098"
                     /translation="MPSPSNMKASAALLCLLLTAAAFSPQGLAQPVGINTSTTCCYRF
                     INKKIPKQRLESYRRTTSSHCPREAVIFKTKLDKEICADPTQKWVQDFMKHLDKKTQT
                     PKL"
     CDS             71..370
                     /gene="NC28"
                     /codon_start=1
                     /product="monocyte chemoattractant protein (MCP-3)"
                     /protein_id="CAA50407.1"
                     /db_xref="GI:288399"
                     /db_xref="SWISS-PROT:P80098"
                     /translation="MKASAALLCLLLTAAAFSPQGLAQPVGINTSTTCCYRFINKKIP
                     KQRLESYRRTTSSHCPREAVIFKTKLDKEICADPTQKWVQDFMKHLDKKTQTPKL"
     mat_peptide     140..367
                     /gene="NC28"
                     /product="monocyte chemoattractant protein (MCP-3)"
     misc_feature    548..552
                     /note="ATTTA motif"
     polyA_signal    787..792
BASE COUNT      248 a    169 c    155 g    238 t
ORIGIN      
        1 agcagagggg ctgagaccaa accagaaacc tccaattctc atgtggaagc ccatgccctc
       61 accctccaac atgaaagcct ctgcagcact tctgtgtctg ctgctcacag cagctgcttt
      121 cagcccccag gggcttgctc agccagttgg gattaatact tcaactacct gctgctacag
      181 atttatcaat aagaaaatcc ctaagcagag gctggagagc tacagaagga ccaccagtag
      241 ccactgtccc cgggaagctg taatcttcaa gaccaaactg gacaaggaga tctgtgctga
      301 ccccacacag aagtgggtcc aggactttat gaagcacctg gacaagaaaa cccaaactcc
      361 aaagctttga acattcatga ctgaactaaa aacaagccat gacttgagaa acaaataatt
      421 tgtataccct gtcctttctc agagtggttc tgagattatt ttaatctaat tctaaggaat
      481 atgagcttta tgtaataatg tgaatcatgg tttttcttag tagattttaa aagttattaa
      541 tattttaatt taatcttcca tggattttgg tgggttttga acataaagcc ttggatgtat
      601 atgtcatctc agtgctgtaa aaactgtggg atgctcctcc cttctctacc tcatgggggt
      661 attgtataag tccttgcaag aatcagtgca aagatttgct ttaattgtta agatatgatg
      721 tccctatgga agcatattgt tattatataa ttacatattt gcatatgtat gactcccaaa
      781 ttttcacata aaatagattt ttgtaaaaaa 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: J04130. Human activation ...[gi:178017] Links  


LOCUS       HUMACT2A                 696 bp    mRNA    linear   PRI 30-OCT-1994
DEFINITION  Human activation (Act-2) mRNA, complete cds.
ACCESSION   J04130
VERSION     J04130.1  GI:178017
KEYWORDS    act2 gene; immune activation gene.
SOURCE      Human (Hut-102B2 library) activated T cells, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 696)
  AUTHORS   Lipes,M.A., Napolitano,M., Jeang,K.T., Chang,N.T. and Leonard,W.J.
  TITLE     Identification, cloning, and characterization of an immune
            activation gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85 (24), 9704-9708 (1988)
  MEDLINE   89071764
   PUBMED   2462251
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            W.Leonard, 09-JAN-1989.
FEATURES             Location/Qualifiers
     source          1..696
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="Unassigned"
     gene            1..696
                     /gene="LAG2"
     mRNA            <1..696
                     /gene="LAG2"
                     /product="act-2 mRNA"
     CDS             109..387
                     /gene="LAG2"
                     /note="act-2 protein precursor"
                     /codon_start=1
                     /protein_id="AAA51576.1"
                     /db_xref="GI:178018"
                     /db_xref="GDB:G00-127-452"
                     /translation="MKLCVTVLSLLMLVAAFCSPALSAPMGSDPPTACCFSYTARKLP
                     RNFVVDYYETSSLCSQPAVVFQTKRSKQVCADPSESWVQEYVYDLELN"
     sig_peptide     109..177
                     /gene="LAG2"
                     /note="act-2 protein signal peptide"
     mat_peptide     178..384
                     /gene="LAG2"
                     /product="act-2 protein"
BASE COUNT      157 a    203 c    139 g    197 t
ORIGIN      Unreported.
        1 ttcccccccc cccccccccc ccccgcccga gcacaggaca cagctgggtt ctgaagcttc
       61 tgagttctgc agcctcacct ctgagaaaac ctcttttcca ccaataccat gaagctctgc
      121 gtgactgtcc tgtctctcct catgctagta gctgccttct gctctccagc gctctcagca
      181 ccaatgggct cagaccctcc caccgcctgc tgcttttctt acaccgcgag gaagcttcct
      241 cgcaactttg tggtagatta ctatgagacc agcagcctct gctcccagcc agctgtggta
      301 ttccaaacca aaagaagcaa gcaagtctgt gctgatccca gtgaatcctg ggtccaggag
      361 tacgtgtatg acctggaact gaactgagct gctcagagac aggaagtctt cagggaaggt
      421 cacctgagcc cggatgcttc tccatgagac acatctcctc catactcagg actcctctcc
      481 gcagttcctg tcccttctct taatttaatc ttttttatgt gccgtgttat tgtattaggt
      541 gtcatttcca ttatttatat tagtttagcc aaaggataag tgtcctatgg ggatggtcca
      601 ctgtcactgt ttctctgctg ttgcaaatac atggataaca catttgattc tgtgtgtttt
      661 ccataataaa actttaaaat aaaatgcaga cagtta
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U20158. Human 76 kDa tyro...[gi:806765] Links  


LOCUS       HSU20158                2032 bp    mRNA    linear   PRI 18-MAY-1995
DEFINITION  Human 76 kDa tyrosine phosphoprotein SLP-76 mRNA, complete cds.
ACCESSION   U20158
VERSION     U20158.1  GI:806765
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2032)
  AUTHORS   Jackman,J.K., Motto,D.G., Sun,Q., Tanemoto,M., Turck,C.W.,
            Peltz,G.A., Koretzky,G.A. and Findell,P.R.
  TITLE     Molecular cloning of SLP-76, a 76-kDa tyrosine phosphoprotein
            associated with Grb2 in T cells
  JOURNAL   J. Biol. Chem. 270 (13), 7029-7032 (1995)
  MEDLINE   95221345
   PUBMED   7706237
REFERENCE   2  (bases 1 to 2032)
  AUTHORS   Findell,P.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-JAN-1995) Paul R. Findell, Inflammation and
            Immunology, Syntex Discovery Research, 3401 Hillview Avenue, Palo
            Alto, CA 94303, USA
FEATURES             Location/Qualifiers
     source          1..2032
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     misc_signal     250..258
                     /note="Kozak sequence"
     CDS             256..1857
                     /note="76 kDa tyrosine phosphoprotein"
                     /codon_start=1
                     /product="SLP-76"
                     /protein_id="AAC50135.1"
                     /db_xref="GI:806766"
                     /translation="MALRNVPFRSEVLGWDPDSLADYFKKLNYKDCEKAVKKYHIDGA
                     RFLNLTENDIQKFPKLRVPILSKLSQEINKNEERRSIFTRKPQVPRFPEETESHEEDN
                     GGWSSFEEDDYESPNDDQDGEDDGDYESPNEEEEAPVEDDADYEPPPSNDEEALQNSI
                     LPAKPFPNSNSMYIDRPPSGKTPQQPPVPPQRPMAALPPPPAGRNHSPLPPPQTNHEE
                     PSRSRNHKTAKLPAPSIDRSTKPPLDRSLAPFDREPFTLGKKPPFSDKPSIPAGRSLG
                     EHLPKIQKPPLPPTTERHERSSPLPGKKPPVPKHGWGPDRRENDEDDVHQRPLPQPAL
                     LPMSSNTFPSRSTKPSPMNPLPSSHMPGAFSESNSSFPQSASLPPYFSQGPSNRPPIR
                     AEGRNFPLPLPNKPRPPSPAEEENSLNEEWYVSYITRPEAEAALRKINQDGTFLVRDS
                     SKKTTTNPYVLMVLYKDKVYNIQIRYQKESQVYLLGTGLRGKEDFLSVSDIIDYFRKM
                     PLLLIDGKNRGSRYQCTLTHAAGYP"
     misc_feature    1519..1797
                     /note="encodes src homology 2 domain"
     polyA_signal    2009..2015
     polyA_site      2032
BASE COUNT      581 a    579 c    468 g    404 t
ORIGIN      
        1 ctggttcggc ccacctctga aggttccaga atcgatagtg aattcgtgga agagaccata
       61 tttgttcgca gaggaagccg ttgctttctg ggatctggct acggcagaaa agacatcggc
      121 tccaacaggg gtgttccaca gggtagctgg gagttggaag agccaagaac gcctccgagc
      181 tctggatttg agcttctctg cccatgggtg aagcgcccat gctcagcttg tgagcttctt
      241 cccgggagag cagccatggc actgaggaat gtgccctttc gctcagaggt cctgggctgg
      301 gaccccgaca gccttgctga ctatttcaag aagctcaact ataaggactg tgagaaggca
      361 gtgaagaagt accacatcga tggggctcgc ttcttgaacc tgacagaaaa tgacatccag
      421 aagttcccca agctccgggt gccgattctc agtaagttaa gtcaggaaat caacaagaac
      481 gaagagagga ggagcatctt cacacgcaaa ccccaagtcc cgcggtttcc tgaagagaca
      541 gaaagccacg aagaggacaa tgggggttgg tcgtcctttg aagaagacga ttatgaaagt
      601 cccaatgatg accaggatgg ggaggatgat ggagactatg agtcccccaa tgaggaggaa
      661 gaggcacccg tggaagatga cgcggattat gagccgccac cctccaatga cgaggaagct
      721 ctgcagaact ccatcctgcc tgccaagcct ttccccaact ccaactccat gtacatcgac
      781 cggcccccct ctgggaaaac cccccagcag cctcctgtgc ccccccagag accgatggcc
      841 gccctcccgc ccccaccagc cggccggaat cactcgccac tgcccccacc ccagaccaac
      901 cacgaagaac ccagcagaag cagaaaccac aaaacggcaa agctccctgc tccttcaata
      961 gacagaagca cgaaacctcc cctagatcgt tcattagctc cgtttgatag agaacccttc
     1021 acactaggaa agaaaccacc attttctgac aagccctcga ttccagcggg aaggtcactc
     1081 ggggagcatt tacccaagat tcaaaagcct cctttaccac cgaccacgga aagacatgaa
     1141 aggagcagcc ccctgccagg gaagaagcca cctgtgccaa agcatggatg gggaccagac
     1201 agaagagaga atgatgaaga tgatgtgcat cagagacctt tgccccagcc agcactactt
     1261 cctatgagct ccaacacttt cccttcaaga tctactaagc caagtcccat gaaccctctc
     1321 ccatcctctc acatgcctgg agcattctca gaaagtaaca gcagttttcc acagagtgcc
     1381 tccctgccac catacttctc tcaaggccct agcaacagac cacctatcag agccgaaggc
     1441 agaaacttcc ccttgccact tccaaacaaa cctcggcccc catcccccgc ggaggaagag
     1501 aattcattaa atgaagagtg gtacgtttct tatattaccc gaccagaggc agaagctgct
     1561 cttagaaaga taaaccagga tggcacattt ctggtcagag acagctctaa aaaaacaaca
     1621 accaatccat atgtcctcat ggtgttgtac aaagataaag tttacaacat ccagatccgt
     1681 tatcagaagg aaagtcaagt ttacttgttg ggaactggac tccgagggaa agaggacttt
     1741 ctgtctgtgt cagatattat tgactacttc aggaaaatgc cacttctgct cattgatggg
     1801 aaaaaccgag gttccagata ccagtgcaca ttaacgcatg ctgcagggta cccatagcaa
     1861 gttatagccg agcaaatgaa ccgtcctcct gcctctgttg ccaacacgag atcaatcagc
     1921 cttggtcaat ggacaaacac ttaggactga actgaacccc tccccatgaa cacaagggtt
     1981 ttatcctttc ctttaaaaac agtgtttgaa atgaagactg tcaactatcc cc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH005786. Homo sapiens CC c...[gi:2739498] Links  


LOCUS       HSCCR5AB1               1976 bp    DNA     linear   PRI 03-JAN-1998
DEFINITION  Homo sapiens CC chemokine receptor 5 (CCR5) gene, 5' flanking
            sequence.
ACCESSION   AF031236
VERSION     AF031236.1  GI:2739496
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1976)
  AUTHORS   Mummidi,S., Ahuja,S.S., McDaniel,B.L. and Ahuja,S.K.
  TITLE     The human CC chemokine receptor 5 (CCR5) gene. Multiple transcripts
            with 5'-end heterogeneity, dual promoter usage, and evidence for
            polymorphisms within the regulatory regions and noncoding exons
  JOURNAL   J. Biol. Chem. 272 (49), 30662-30671 (1997)
  MEDLINE   98049523
   PUBMED   9388201
REFERENCE   2  (bases 1 to 1976)
  AUTHORS   Mummidi,S., Ahuja,S.S., McDaniel,B.L. and Ahuja,S.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-OCT-1997) Medicine, University of Texas Health
            Science Center at San Antonio, 7703, Floyd Curl Drive, San Antonio,
            TX 78284, USA
FEATURES             Location/Qualifiers
     source          1..1976
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     misc_feature    1..1976
                     /gene="CCR5"
                     /note="5' flanking sequence"
BASE COUNT      556 a    443 c    426 g    551 t
ORIGIN      
        1 ctgtttaaag acaaaaaggc cccaaaaagg agggatggca cgaaacaccc tccaatatgg
       61 gcatggagtc tagagtgaca aagtgatcaa aagttcattt cctatggggt gtccgaatgt
      121 acttaataat aaaaagagaa caagagccat gcaaactgag agggacaaag tagaaagagt
      181 agcagacacc aagcaactaa gtcacagcat gataagctgc tagcttgttg tcattattgt
      241 atccagaaca acatttcatt taaatgctga agaatttccc atgggtcccc actttcttgt
      301 gaatccttgg gctgaacccc cccgtcctga gtggttacta gaacacacct ctggaccaga
      361 aacacaagag tggagtaaca cacactgcaa agctgtgctt ccttgtttca gcctgtgaat
      421 cctcaccttg tttcccatct agcctatatt tttcaaacta acttggccat agaatcatgt
      481 cgtatttagg gtggaagctg ccccaggtct agcgcgtcat ttaacagatg aggaaatgga
      541 agcttgggca gtggaagtat cttgccgagg tcacacagca agtcagcagc acagcgtgtg
      601 tgactccgag cctgctccgc tagcccacat tgccctctgg gggtgagtat gtcttcacat
      661 cctccaatac ccctaatgac agacaaacag aacatggcaa agcctcagct ctgcatggtg
      721 aaagtaagaa ccagcaattg ccacaaacag aaatacagtg ttggtccggc agcctccggg
      781 ggttctgcac aagtggatta ccagtgaata caaggctatc tatcttccga aaaaccaaag
      841 ttgtatttat gctatctatt ttctataaaa ttttatatta atttacttgt cctatttttg
      901 aactctttca aaagcacact ttatatttcc cctgcttaaa cagtcccccg agggtgggtg
      961 cccaaaaggc tctacacttg ttatcattcc ctctccacca caggcatatt gagtaagttt
     1021 gtatttgggt ttttttaaaa cctccactct acagttaaga aaactaaggc acagagcttc
     1081 aataatttgg tcagagccaa gtagcagtaa tgaagctgga ggttaaaccc agcagcatga
     1141 ctgcagttct taatcaatgc cttttgaatt gcacatatgg gatgaactag aacattttct
     1201 cgatgattcg ctgtccttgt tatgattatg ttactgagct ctgttgtagc acagacatat
     1261 gtccctatat ggggcggggg tgggggtgtc ttgatcgctg ggctatttct atactgttct
     1321 ggcttttccc aagcagtcat ttctttctat cctccaagca ccagcaatta gctttacctt
     1381 ttcagcttct agtttgctga aactaatctg ctatagacag agactccggt gaaccaattt
     1441 tattaggatt tgatcaaata aactctctct gacaaaggac tgctgaaaga gtaactaaga
     1501 gtttgatgtt tactgagtgc atagtatgtg ctagatgctg gccgtggatg cctcatagaa
     1561 tcctcccaac aactcatgaa atgactactg tcattcagcc caatacccag acgagaaagc
     1621 tgagggtaag acaggtttca agcttggcag tctgactaca gaggccactg gcttagcccc
     1681 tgggttagtc tgcctctgta ggattggggg cacgtaattt tgctgtttgg ggtctcattt
     1741 gccttcttag agatcacaag ccaaagcttt ttattctaga gccaaggtca cggaagccca
     1801 gaggacatct tgtggctcgg gagtagctct ctgctgtctt ctcagctctg ctgacaatac
     1861 ttgagatttt cagatgtcac caaccgccaa gagagcttga tatgactgta tatagtatag
     1921 tcataaagaa cctgaacttg accatatact tatgtcatgt ggaaaatttc tcatag
//
LOCUS       HSCCR5AB2               6059 bp    DNA     linear   PRI 03-JAN-1998
DEFINITION  Homo sapiens CC chemokine receptor 5 (CCR5) gene, complete cds.
ACCESSION   AF031237
VERSION     AF031237.1  GI:2739497
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 6059)
  AUTHORS   Mummidi,S., Ahuja,S.S., McDaniel,B.L. and Ahuja,S.K.
  TITLE     The human CC chemokine receptor 5 (CCR5) gene. Multiple transcripts
            with 5'-end heterogeneity, dual promoter usage, and evidence for
            polymorphisms within the regulatory regions and noncoding exons
  JOURNAL   J. Biol. Chem. 272 (49), 30662-30671 (1997)
  MEDLINE   98049523
   PUBMED   9388201
REFERENCE   2  (bases 1 to 6059)
  AUTHORS   Mummidi,S., Ahuja,S.S., McDaniel,B.L. and Ahuja,S.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-OCT-1997) Medicine, University of Texas Health
            Science Center at San Antonio, 7703, Floyd Curl Drive, San Antonio,
            TX 78284, USA
FEATURES             Location/Qualifiers
     source          1..6059
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            join(AF031236.1:1..1976,1..6059)
                     /gene="CCR5"
     misc_RNA        AH005786.1:1..8035
                     /gene="CCR5"
                     /product="truncated transcript"
     mRNA            join(1..57,559..847,2751..6059)
                     /gene="CCR5"
                     /product="CC chemokine receptor 5A"
     mRNA            join(1..57,794..847,2751..6059)
                     /gene="CCR5"
                     /product="CC chemokine receptor 5B"
     mRNA            join(750..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(773..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(777..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(779..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(784..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(795..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(798..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(804..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(805..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     mRNA            join(806..847,2751..6059)
                     /gene="CCR5"
                     /product="CCR5 truncated isoform"
     CDS             2762..3820
                     /gene="CCR5"
                     /codon_start=1
                     /product="CC chemokine receptor 5"
                     /protein_id="AAB94735.1"
                     /db_xref="GI:2739499"
                     /translation="MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFG
                     FVGNMLVILILINCKRLKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMCQ
                     LLTGLYFIGFFSGIFFIILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFAS
                     LPGIIFTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGIL
                     KTLLRCRNEKKRHRAVRLIFTIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLD
                     QAMQVTETLGMTHCCINPIIYAFVGEKFRNYLLVFFQKHIAKRFCKCCSIFQQEAPER
                     ASSVYTRSTGEQEISVGL"
BASE COUNT     1753 a   1210 c   1393 g   1703 t
ORIGIN      
        1 cttcagatag attatatctg gagtgaagga tcctgccacc tacgtatctg gcatagtgtg
       61 agtcctcata aatgcttact ggtttgaagg gcaacaaaat agtgaacaga gtgaaaatcc
      121 ccactaagat cctgggtcca gaaaaagatg ggaaacctgt ttagctcacc cgtgagccca
      181 tagttaaaac tctttagaca acaggttgtt tccgtttaca gagaacaata atattgggtg
      241 gtgagcatct gtgtgggggt tggggtggga taggggatac ggggagagtg gagaaaaagg
      301 ggacacaggg ttaatgtgaa gtccaggatc cccctctaca tttaaagttg gtttaagttg
      361 gctttaatta atagcaactc ttaagataat cagaattttc ttaacctttt agccttactg
      421 ttgaaaagcc ctgtgatctt gtacaaatca tttgcttctt ggatagtaat ttcttttact
      481 aaaatgtggg cttttgacta gatgaatgta aatgttcttc tagctctgat atcctttatt
      541 ctttatattt tctaacagat tctgtgtagt gggatgagca gagaacaaaa acaaaataat
      601 ccagtgagaa aagcccgtaa ataaaccttc agaccagaga tctattctcc agcttatttt
      661 aagctcaact taaaaagaag aactgttctc tgattctttt cgccttcaat acacttaatg
      721 atttaactcc accctccttc aaaagaaaca gcatttccta cttttatact gtctatatga
      781 ttgatttgca cagctcatct ggccagaaga gctgagacat ccgttcccct acaagaaact
      841 ctccccggta agtaacctct cagctgcttg gcctgttagt tagcttctga gatgagtaaa
      901 agactttaca ggaaacccat agaagacatt tggcaaacac caagtgctca tacaattatc
      961 ttaaaatata atctttaaga taaggaaagg gtcacagttt ggaatgagtt tcagacggtt
     1021 ataacatcaa agatacaaaa catgattgtg agtgaaagac tttaaaggga gcaatagtat
     1081 tttaataact aacaatcctt acctctcaaa agaaagattt gcagagagat gagtcttagc
     1141 tgaaatcttg aaatcttatc ttctgctaag gagaactaaa ccctctccag tgagatgcct
     1201 tctgaatatg tgcccacaag aagttgtgtc taagtctggt tctctttttt ctttttcctc
     1261 cagacaagag ggaagcctaa aaatggtcaa aattaatatt aaattacaaa cgccaaataa
     1321 aattttcctc taatatatca gtttcatggc acagttagta tataattctt tatggttcaa
     1381 aattaaaaat gagcttttct aggggcttct ctcagctgcc tagtctaagg tgcagggagt
     1441 ttgagactca cagggtttaa taagagaaaa ttctcagcta gagcagctga acttaaatag
     1501 actaggcaag acagctggtt ataagactaa actacccaga atgcatgaca ttcatctgtg
     1561 gtggcagacg aaacattttt tattatatta tttcttgggt atgtatgaca actcttaatt
     1621 gtggcaactc agaaactaca aacacaaact tcacagaaaa tgtgaggatt ttacaattgg
     1681 ctgttgtcat ctatgacctt ctctgggact tgggcacccg gccatttcac tctgactaca
     1741 tcatgtcacc aaacatctga tggtcttgcc ttttaattct cttttcgagg actgagaggg
     1801 agggtagcat ggtagttaag agtgcaggct tcccgcattc aaaatcggtt gcttactagc
     1861 tgtgtggctt tgagcaagtt actcaccctc tctgtgcttc aaggtccttg tctgcaaaat
     1921 gtgaaaaata tttcctgcct cataaggttg ccctaaggat taaatgaatg aatgggtatg
     1981 atgcttagaa cagtgattgg catccagtat gtgccctcga ggcctcttaa ttattactgg
     2041 cttgctcata gtgcatgttc tttgtgggct aactctagcg tcaataaaaa tgttaagact
     2101 gagttgcagc cgggcatggt ggctcatgcc tgtaatccca gcattctagg aggctgaggc
     2161 aggaggatcg cttgagccca ggagttcgag accagcctgg gcaacatagt gtgatcttgt
     2221 atctataaaa ataaacaaaa ttagcttggt gtggtggcgc ctgtagtccc cagccacttg
     2281 gaggggtgag gtgagaggat tgcttgagcc cgggatggtc caggctgcag tgagccatga
     2341 tcgtgccact gcactccagc ctgggcgaca gagtgagacc ctgtctcaca acaacaacaa
     2401 caacaacaaa aaggctgagc tgcaccatgc ttgacccagt ttcttaaaat tgttgtcaaa
     2461 gcttcattca ctccatggtg ctatagagca caagatttta tttggtgaga tggtgctttc
     2521 atgaattccc ccaacagagc caagctctcc atctagtgga cagggaagct agcagcaaac
     2581 cttcccttca ctacaaaact tcattgcttg gccaaaaaga gagttaattc aatgtagaca
     2641 tctatgtagg caattaaaaa cctattgatg tataaaacag tttgcattca tggagggcaa
     2701 ctaaatacat tctaggactt tataaaagat cactttttat ttatgcacag ggtggaacaa
     2761 gatggattat caagtgtcaa gtccaatcta tgacatcaat tattatacat cggagccctg
     2821 ccaaaaaatc aatgtgaagc aaatcgcagc ccgcctcctg cctccgctct actcactggt
     2881 gttcatcttt ggttttgtgg gcaacatgct ggtcatcctc atcctgataa actgcaaaag
     2941 gctgaagagc atgactgaca tctacctgct caacctggcc atctctgacc tgtttttcct
     3001 tcttactgtc cccttctggg ctcactatgc tgccgcccag tgggactttg gaaatacaat
     3061 gtgtcaactc ttgacagggc tctattttat aggcttcttc tctggaatct tcttcatcat
     3121 cctcctgaca atcgataggt acctggctgt cgtccatgct gtgtttgctt taaaagccag
     3181 gacggtcacc tttggggtgg tgacaagtgt gatcacttgg gtggtggctg tgtttgcgtc
     3241 tctcccagga atcatcttta ccagatctca aaaagaaggt cttcattaca cctgcagctc
     3301 tcattttcca tacagtcagt atcaattctg gaagaatttc cagacattaa agatagtcat
     3361 cttggggctg gtcctgccgc tgcttgtcat ggtcatctgc tactcgggaa tcctaaaaac
     3421 tctgcttcgg tgtcgaaatg agaagaagag gcacagggct gtgaggctta tcttcaccat
     3481 catgattgtt tattttctct tctgggctcc ctacaacatt gtccttctcc tgaacacctt
     3541 ccaggaattc tttggcctga ataattgcag tagctctaac aggttggacc aagctatgca
     3601 ggtgacagag actcttggga tgacgcactg ctgcatcaac cccatcatct atgcctttgt
     3661 cggggagaag ttcagaaact acctcttagt cttcttccaa aagcacattg ccaaacgctt
     3721 ctgcaaatgc tgttctattt tccagcaaga ggctcccgag cgagcaagct cagtttacac
     3781 ccgatccact ggggagcagg aaatatctgt gggcttgtga cacggactca agtgggctgg
     3841 tgacccagtc agagttgtgc acatggctta gttttcatac acagcctggg ctgggggtgg
     3901 ggtgggagag gtctttttta aaaggaagtt actgttatag agggtctaag attcatccat
     3961 ttatttggca tctgtttaaa gtagattaga tcttttaagc ccatcaatta tagaaagcca
     4021 aatcaaaata tgttgatgaa aaatagcaac ctttttatct ccccttcaca tgcatcaagt
     4081 tattgacaaa ctctcccttc actccgaaag ttccttatgt atatttaaaa gaaagcctca
     4141 gagaattgct gattcttgag tttagtgatc tgaacagaaa taccaaaatt atttcagaaa
     4201 tgtacaactt tttacctagt acaaggcaac atataggttg taaatgtgtt taaaacaggt
     4261 ctttgtcttg ctatggggag aaaagacatg aatatgatta gtaaagaaat gacacttttc
     4321 atgtgtgatt tcccctccaa ggtatggtta ataagtttca ctgacttaga accaggcgag
     4381 agacttgtgg cctgggagag ctggggaagc ttcttaaatg agaaggaatt tgagttggat
     4441 catctattgc tggcaaagac agaagcctca ctgcaagcac tgcatgggca agcttggctg
     4501 tagaaggaga cagagctggt tgggaagaca tggggaggaa ggacaaggct agatcatgaa
     4561 gaaccttgac ggcattgctc cgtctaagtc atgagctgag cagggagatc ctggttggtg
     4621 ttgcagaagg tttactctgt ggccaaagga gggtcaggaa ggatgagcat ttagggcaag
     4681 gagaccacca acagccctca ggtcagggtg aggatggcct ctgctaagct caaggcgtga
     4741 ggatgggaag gagggaggta ttcgtaagga tgggaaggag ggaggtattc gtgcagcata
     4801 tgaggatgca gagtcagcag aactggggtg gatttggttt ggaagtgagg gtcagagagg
     4861 agtcagagag aatccctagt cttcaagcag attggagaaa cccttgaaaa gacatcaagc
     4921 acagaaggag gaggaggagg tttaggtcaa gaagaagatg gattggtgta aaaggatggg
     4981 tctggtttgc agagcttgaa cacagtctca cccagactcc aggctgtctt tcactgaatg
     5041 cttctgactt catagatttc cttcccatcc cagctgaaat actgaggggt ctccaggagg
     5101 agactagatt tatgaataca cgaggtatga ggtctaggaa catacttcag ctcacacatg
     5161 agatctaggt gaggattgat tacctagtag tcatttcatg ggttgttggg aggattctat
     5221 gaggcaacca caggcagcat ttagcacata ctacacattc aataagcatc aaactcttag
     5281 ttactcattc agggatagca ctgagcaaag cattgagcaa aggggtccca tataggtgag
     5341 ggaagcctga aaaactaaga tgctgcctgc ccagtgcaca caagtgtagg tatcattttc
     5401 tgcatttaac cgtcaatagg caaagggggg aagggacata ttcatttgga aataagctgc
     5461 cttgagcctt aaaacccaca aaagtacaat ttaccagcct ccgtatttca gactgaatgg
     5521 gggtgggggg ggcgccttag gtacttattc cagatgcctt ctccagacaa accagaagca
     5581 acagaaaaaa tcgtctctcc ctccctttga aatgaatata ccccttagtg tttgggtata
     5641 ttcatttcaa agggagagag agaggttttt ttctgttctt tctcatatga ttgtgcacat
     5701 acttgagact gttttgaatt tgggggatgg ctaaaaccat catagtacag gtaaggtgag
     5761 ggaatagtaa gtggtgagaa ctactcaggg aatgaaggtg tcagaataat aagaggtgct
     5821 actgactttc tcagcctctg aatatgaacg gtgagcattg tggctgtcag caggaagcaa
     5881 cgaagggaaa tgtctttcct tttgctctta agttgtggag agtgcaacag tagcatagga
     5941 ccctaccctc tgggccaagt caaagacatt ctgacatctt agtatttgca tattcttatg
     6001 tatgtgaaag ttacaaattg cttgaaagaa aatatgcatc taataaaaaa caccttcta
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M31732. Human B-cell lymp...[gi:179375] Links  


LOCUS       HUMBCL3AA               1813 bp    mRNA    linear   PRI 31-OCT-1994
DEFINITION  Human B-cell lymphoma 3-encoded protein (bcl-3) mRNA, complete cds.
ACCESSION   M31732
VERSION     M31732.1  GI:179375
KEYWORDS    lymphoma 3-encoded protein.
SOURCE      Human lymphocyte Louckes cell line (Burkitt's lymphoma), cDNA to
            mRNA, clone cLK2.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1813)
  AUTHORS   Ohno,H., Takimoto,G. and McKeithan,T.W.
  TITLE     The candidate proto-oncogene bcl-3 is related to genes implicated
            in cell lineage determination and cell cycle control
  JOURNAL   Cell 60 (6), 991-997 (1990)
  MEDLINE   90199880
   PUBMED   2180580
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by T.McKeithan, 30-JAN-1990.
FEATURES             Location/Qualifiers
     source          1..1813
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="19q13.1-q13.2"
     gene            1..1813
                     /gene="BCL3"
     mRNA            <1..1813
                     /gene="BCL3"
                     /product="bcl-3 mRNA"
     CDS             42..1382
                     /gene="BCL3"
                     /note="lymphoma 3-encoded protein (bcl-3)"
                     /codon_start=1
                     /protein_id="AAA51815.1"
                     /db_xref="GI:179376"
                     /db_xref="GDB:G00-120-561"
                     /translation="MDEGPVDLRTRPKAAGLPGAALPLRKRPLRAPSPEPAAPRGAAG
                     LVVPLDPLRGGCDLPAVPGPPHGLARPEALYYPGALLPLYPTRAMGSPFPLVNLPTPL
                     YPMMCPMEHPLSADIAMATRADEDGDTPLHIAVVQGNLPAVHRLVNLFQQGGRELDIY
                     NNLRQTPLHLAVITTLPSVVRLLVTAGASPMALDRHGQTAAHLACEHRSPTCLRALLD
                     SAAPGTLDLEARNYDGLTALHVAVNTECQETVQLLLERGADIDAVDIKSGRSPLIHAV
                     ENNSLSMVQLLLQHGANVNAQMYSGSSALHSASGRGLLPLVRTLVRSGADSSLKNCHN
                     DTPLMVARSRRVIDILRGKATRPASTSQPDPSPDRSANTSPESSSRLSSNGLLSASPS
                     SSPSQSPPRDPPGFPMAPPNFFLPSPSPPAFLPFAGVLRGPGRPVPPSPAPGGS"
BASE COUNT      298 a    714 c    486 g    315 t
ORIGIN      
        1 ccgtccccgg cggccccatg ccccgatgcc ccgcgggggc catggacgag gggcccgtgg
       61 acctgcgcac ccggcccaag gccgccggac tcccgggcgc cgcgctgccg ctccgcaagc
      121 gcccgctgcg cgcgccctcc ccggagcccg ccgctccccg cggcgctgcg ggccttgtcg
      181 tccccctgga ccctctgcgc ggcggctgcg acctgccggc ggtccccggg cccccccacg
      241 gcctggcccg gccggaggcg ctttactacc ccggagcctt actgcctttg taccccactc
      301 gggccatggg ctccccgttt cctctggtga acctgcctac acccctatac cccatgatgt
      361 gccccatgga acaccccctt tctgctgaca tcgccatggc cacccgtgca gatgaggacg
      421 gagacacgcc tctccatatt gctgtggtgc agggtaacct gccagctgtg caccggctgg
      481 tcaacctctt ccagcagggg ggccgggagc tcgacatcta caacaaccta cggcagacac
      541 cgctccacct ggctgtgatc accacattac cgtctgtggt ccggctcctg gtgacagctg
      601 gtgccagccc catggcgctg gaccgccatg gccagacggc cgctcacctg gcgtgcgagc
      661 accgcagccc gacctgcctg cgagccctgc tggacagcgc agctccgggc acgttggacc
      721 tggaggcccg caattatgac gggctcaccg ccctgcacgt ggcagtgaac accgagtgcc
      781 aagaaaccgt gcagctcttg ctagagcgcg gtgccgacat cgacgcagtg gacattaaga
      841 gcggccgctc cccgctcatc cacgccgtgg aaaacaacag ccttagcatg gtgcagctgc
      901 tgctgcagca cggcgccaac gtgaacgcgc aaatgtactc cggcagctcc gccctgcact
      961 cagcgtccgg ccgcgggctc ctcccgctgg tgcgcacgct ggtccgcagc ggcgctgaca
     1021 gcagcctcaa gaactgccac aacgacacgc cgctcatggt ggcgcgcagc cgcagggtca
     1081 tcgacatcct gagggggaag gccacccggc ctgcttccac ctcccagcca gacccctccc
     1141 ctgaccggag cgccaacacc tcccccgaga gcagcagccg cctcagctcc aatggtcttc
     1201 tctccgcatc accatcctcc tcaccctccc agtctccccc cagggacccc cctggattcc
     1261 ccatggctcc tcccaatttc ttccttcctt ccccatctcc acccgccttc ctgccctttg
     1321 ctggggtcct ccgaggccct ggccggccgg tgcccccctc cccagctcca ggaggcagct
     1381 gagggggatg ggggggcaga tcttggactc atgaggaggg gcccccctgc ccagaggggt
     1441 caacccttct ggaaactgtg aagatctgac ttcgcccccc ccccccccca tcttcgggac
     1501 caggatttgc acagaagcac atgcacctac ccatacaccc cctcttctga gcgtccctgt
     1561 tcccccatct cgctccctcc caggactctg accccagcat tctcaggcac cagtccctgt
     1621 ccggaatgcc acccacatct tccatttcca tgtcccctcc cagagctggt ggacccaggg
     1681 aacagccact cccctccact ctctaccaga taactgagga ggggagaggt gggccgtaac
     1741 gggcacggat cacgatgtaa attattaagc attttggttg gatttctttt gtaataaact
     1801 atttttgtac cat
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L32137. Human germline ol...[gi:602449] Links  


LOCUS       HUMCOMP                 2439 bp    mRNA    linear   PRI 15-DEC-1994
DEFINITION  Human germline oligomeric matrix protein (COMP) mRNA, complete cds.
ACCESSION   L32137
VERSION     L32137.1  GI:602449
KEYWORDS    germline; matrix protein.
SOURCE      Homo sapiens cartilage cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2439)
  AUTHORS   Newton,G., Weremowicz,S., Morton,C.C., Copeland,N.G., Gilbert,D.J.,
            Jenkins,N.A. and Lawler,J.
  TITLE     Characterization of human and mouse cartilage oligomeric matrix
            protein
  JOURNAL   Unpublished (1994)
FEATURES             Location/Qualifiers
     source          1..2439
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="19p13.1"
                     /cell_type="chondrocyte"
                     /tissue_type="cartilage"
                     /germline
     gene            1..2439
                     /gene="COMP"
     CDS             26..2299
                     /gene="COMP"
                     /standard_name="cartilage oligomeric matrix protein"
                     /note="putative"
                     /codon_start=1
                     /product="matrix protein"
                     /protein_id="AAA57253.1"
                     /db_xref="GI:602450"
                     /translation="MVPDTACVLLLTLAALGASGQGQSPLGSDLGPQMLRELQETNAA
                     LQDVRDWLRQQVREITFLKNTVMECDACGMQQSVRTGLPSVRPLLHCAPGFCFPGVAC
                     IQTESGGRCGPCPAGFTGNGSHCTDVNECNAHPCFPRVRCINTSPGFRCEACPPGYSG
                     PTHQGVGLAFAKANKQVCTDINECETGQHNCVPNSVCINTRGSFQCGPCQPGFVGDQA
                     SGCQRGAQRFCPDGSPSECHEHADCVLERDGSRSCVCRVGWAGNGILCGRDTDLDGFP
                     DEKLRCPEPQCRKDNCVTVPNSGQEDVDRDGIGDACDPDADGDGVPNEKDNCPLVRNP
                     DQRNTDEDKWGDACDNCRSQKNDDQKDTDQDGRGDACDDDIDGDRIRNQADNCPRVPN
                     SDQKDSDGDGIGDACDNCPQKSNPDQADVDHDFVGDACDSDQDQDGDGHQDSRDNCPT
                     VPNSAQEDSDHDGQGDACDDDDDNDGVPDSRDNCRLVPNPGQEDADRDGVGDVCQDDF
                     DADKVVDKIDVCPENAEVTLTDFRAFQTVVLDPEGDAQIDPNWVVLNQGREIVQTMNS
                     DPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFIFGYQDSSSFYVVMWKQMEQTYWQA
                     NPFRAVAEPGIQLKAVKSSTGPGEQLRNALWHTGDTESQVRLLWKDPRNVGWKDKKSY
                     RWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTTMRGGRLGVFCFSQENIIWANLRYR
                     CNDTIPEDYETHQLRQA"
     sig_peptide     26..85
                     /gene="COMP"
     repeat_region   290..892
                     /note="putative"
                     /rpt_family="thrombospondin type 2"
                     /rpt_type=tandem
     repeat_region   893..1577
                     /note="putative"
                     /rpt_family="thrombospondin type 3"
                     /rpt_type=tandem
     polyA_signal    2420..2425
                     /gene="COMP"
     polyA_site      2439
                     /gene="COMP"
BASE COUNT      503 a    758 c    809 g    369 t
ORIGIN      
        1 cagcacccag ctccccgcca ccgccatggt ccccgacacc gcctgcgttc ttctgctcac
       61 cctggctgcc ctcggcgcgt ccggacaggg ccagagcccg ttgggctcag acctgggccc
      121 gcagatgctt cgggaactgc aggaaaccaa cgcggcgctg caggacgtgc gggactggct
      181 gcggcagcag gtcagggaga tcacgttcct gaaaaacacg gtgatggagt gtgacgcgtg
      241 cgggatgcag cagtcagtac gcaccggcct acccagcgtg cggcccctgc tccactgcgc
      301 gcccggcttc tgcttccccg gcgtggcctg catccagacg gagagcggcg gccgctgcgg
      361 cccctgcccc gcgggcttca cgggcaacgg ctcgcactgc accgacgtca acgagtgcaa
      421 cgcccacccc tgcttccccc gagtccgctg tatcaacacc agcccggggt tccgctgcga
      481 ggcttgcccg ccggggtaca gcggccccac ccaccagggc gtggggctgg ctttcgccaa
      541 ggccaacaag caggtttgca cggacatcaa cgagtgtgag accgggcaac ataactgcgt
      601 ccccaactcc gtgtgcatca acacccgggg ctccttccag tgcggcccgt gccagcccgg
      661 cttcgtgggc gaccaggcgt ccggctgcca gcgcggcgca cagcgcttct gccccgacgg
      721 ctcgcccagc gagtgccacg agcatgcaga ctgcgtccta gagcgcgatg gctcgcggtc
      781 gtgcgtgtgt cgcgttggct gggccggcaa cgggatcctc tgtggtcgcg acactgacct
      841 agacggcttc ccggacgaga agctgcgctg cccggagccg cagtgccgta aggacaactg
      901 cgtgactgtg cccaactcag ggcaggagga tgtggaccgc gatggcatcg gagacgcctg
      961 cgatccggat gccgacgggg acggggtccc caatgaaaag gacaactgcc cgctggtgcg
     1021 gaacccagac cagcgcaaca cggacgagga caagtggggc gatgcgtgcg acaactgccg
     1081 gtcccagaag aacgacgacc aaaaggacac agaccaggac ggccggggcg atgcgtgcga
     1141 cgacgacatc gacggcgacc ggatccgcaa ccaggccgac aactgcccta gggtacccaa
     1201 ctcagaccag aaggacagtg atggcgatgg tataggggat gcctgtgaca actgtcccca
     1261 gaagagcaac ccggatcagg cggatgtgga ccacgacttt gtgggagatg cttgtgacag
     1321 cgatcaagac caggatggag acggacatca ggactctcgg gacaactgtc ccacggtgcc
     1381 taacagtgcc caggaggact cagaccacga tggccagggt gatgcctgcg acgacgacga
     1441 cgacaatgac ggagtccctg acagtcggga caactgccgc ctggtgccta accccggcca
     1501 ggaggacgcg gacagggacg gcgtgggcga cgtgtgccag gacgactttg atgcagacaa
     1561 ggtggtagac aagatcgacg tgtgtccgga gaacgctgaa gtcacgctca ccgacttcag
     1621 ggccttccag acagtcgtgc tggacccgga gggtgacgcg cagattgacc ccaactgggt
     1681 ggtgctcaac cagggaaggg agatcgtgca gacaatgaac agcgacccag gcctggctgt
     1741 gggttacact gccttcaatg gcgtggactt cgagggcacg ttccatgtga acacggtcac
     1801 ggatgacgac tatgcgggct tcatctttgg ctaccaggac agctccagct tctacgtggt
     1861 catgtggaag cagatggagc aaacgtattg gcaggcgaac cccttccgtg ctgtggccga
     1921 gcctggcatc caactcaagg ctgtgaagtc ttccacaggc cccggggaac agctgcggaa
     1981 cgctctgtgg catacaggag acacagagtc ccaggtgcgg ctgctgtgga aggacccgcg
     2041 aaacgtgggt tggaaggaca agaagtccta tcgttggttc ctgcagcacc ggccccaagt
     2101 gggctacatc agggtgcgat tctatgaggg ccctgagctg gtggccgaca gcaacgtggt
     2161 cttggacaca accatgcggg gtggccgcct gggggtcttc tgcttctccc aggagaacat
     2221 catctgggcc aacctgcgtt accgctgcaa tgacaccatc ccagaggact atgagaccca
     2281 tcagctgcgg caagcctagg gaccagggtg aggacccgcc ggatgacagc caccctcacc
     2341 gcggctggat gggggctctg cacccagccc aaggggtggc cgtcctgagg gggaagtgag
     2401 aagggctcag agaggacaaa ataaagtgtg tgtgcaggg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AB011406. Homo sapiens mRNA...[gi:3401944] Links  


LOCUS       AB011406                2510 bp    mRNA    linear   PRI 14-APR-2000
DEFINITION  Homo sapiens mRNA for alkalin phosphatase, complete cds.
ACCESSION   AB011406
VERSION     AB011406.1  GI:3401944
KEYWORDS    tissue non-specific alkalin phosphatase; alkalin phosphatase.
SOURCE      Homo sapiens cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Mitchell,W.J., Paula,H.S., Mary,L.A., Clive,S., Michael,R. and
            Harry,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986)
REFERENCE   2  (sites)
  AUTHORS   Naoya,S., Sadahiko,I., Yuichi,H. and Eiji,K.
  TITLE     A novel missense mutation of the tissue-nonspecific alkaline
            phosphatase gene detected in hypophosphatasia
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 2510)
  AUTHORS   Sugimoto,N. and Kajii,E.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-1998) Naoya Sugimoto, Jichi Medical School,
            Orthopaedics; Yakusiji 3311, Minamikawachi, Tochigi 329-0498, Japan
            (E-mail:nsugimot@jichi.ac.jp, Tel:+81-0285-44-2111,
            Fax:+81-0285-44-4902)
FEATURES             Location/Qualifiers
     source          1..2510
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             177..1751
                     /function="hydrolyze a variety of monophosphate esters
                     with high pH optima"
                     /standard_name="tissue non-specific alkalin phosphatase"
                     /codon_start=1
                     /evidence=experimental
                     /product="alkalin phosphatase"
                     /protein_id="BAA32129.1"
                     /db_xref="GI:3401945"
                     /translation="MISPFLVLAIGTCLTNSLVPEKEKDPKYWRDQAQETLKYALELQ
                     KLNTNVAKNVIMFLGDGMGVSTVTAARILKGQLHHNPGEETRLEMDKFPFVALSKTYN
                     TNAQVPDSAGTATAYLCGVKANEGTVGVSAATERSRCNTTQGNEVTSILRWAKDAGKS
                     VGIVTTTRVNHATPSAAYAHSADRDWYSDNEMPPEALSQGCKDIAYQLMHNIRDIDVI
                     MGGGRKYMYPKNKTDVEYESDEKARGTRLDGLDLVDTWKSFKPRYKHSHFIWNRTELL
                     TLDPHNVDYLLGFFEPGDMQYELNRNNVTDPSLSEMVVVAIQILRKNPKGFFLLVEGG
                     RIDHGHHEGKAKQALHEAVEMDRAIGHAGSLTSSEDTLTVVTADHSHVFTFGGYTPRG
                     NSIFGLAPMLSDTDKKPFTAILYGNGPGYKVVGGERENVSMVDYAHNNYQAQSPVPLR
                     HETHGGEDVAVFSKGPMAHLLHGVHEQNYVPHVMAYAACIGANLGHCAPASSAGSLAA
                     GPLLLALALYPLSVLF"
     variation       1041
                     /note="This mutation causes an amino acid substitution
                     from Leu to Phe."
                     /replace="T"
     polyA_site      2510
                     /note="21 a nucleotides"
BASE COUNT      553 a    821 c    664 g    472 t
ORIGIN      
        1 cgcgcccgct atcctggctc cgtgctccca cgcgcttgtg cctggacgga ccctcgccag
       61 tgctctgcgc aggattggaa catcagttaa catctgacca ctgccagccc accccctccc
      121 acccacgtcg attgcatctc tgggctccag ggataaagca ggtcttgggg tgcaccatga
      181 tttcaccatt cttagtactg gccattggca cctgccttac taactcctta gtgccagaga
      241 aagagaaaga ccccaagtac tggcgagacc aagcgcaaga gacactgaaa tatgccctgg
      301 agcttcagaa gctcaacacc aacgtggcta agaatgtcat catgttcctg ggagatggga
      361 tgggtgtctc cacagtgacg gctgcccgca tcctcaaggg tcagctccac cacaaccctg
      421 gggaggagac caggctggag atggacaagt tccccttcgt ggccctctcc aagacgtaca
      481 acaccaatgc ccaggtccct gacagcgccg gcaccgccac cgcctacctg tgtggggtga
      541 aggccaatga gggcaccgtg ggggtaagcg cagccactga gcgttcccgg tgcaacacca
      601 cccaggggaa cgaggtcacc tccatcctgc gctgggccaa ggacgctggg aaatctgtgg
      661 gcattgtgac caccacgaga gtgaaccatg ccacccccag cgccgcctac gcccactcgg
      721 ctgaccggga ctggtactca gacaacgaga tgccccctga ggccttgagc cagggctgta
      781 aggacatcgc ctaccagctc atgcataaca tcagggacat tgacgtgatc atggggggtg
      841 gccggaaata catgtacccc aagaataaaa ctgatgtgga gtatgagagt gacgagaaag
      901 ccaggggcac gaggctggac ggcctggacc tcgttgacac ctggaagagc ttcaaaccga
      961 gatacaagca ctcccacttc atctggaacc gcacggaact cctgaccctt gacccccaca
     1021 atgtggacta cctattgggt ttcttcgagc caggggacat gcagtacgag ctgaacagga
     1081 acaacgtgac ggacccgtca ctctccgaga tggtggtggt ggccatccag atcctgcgga
     1141 agaaccccaa aggcttcttc ttgctggtgg aaggaggcag aattgaccac gggcaccatg
     1201 aaggaaaagc caagcaggcc ctgcatgagg cggtggagat ggaccgggcc atcggccacg
     1261 caggcagctt gacctcctcg gaagacactc tgaccgtggt cactgcggac cattcccacg
     1321 tcttcacatt tggtggatac accccccgtg gcaactctat ctttggtctg gcccccatgc
     1381 tgagtgacac agacaagaag cccttcactg ccatcctgta tggcaatggg cctggctaca
     1441 aggtggtggg cggtgaacga gagaatgtct ccatggtgga ctatgctcac aacaactacc
     1501 aggcgcagtc tcctgtgccc ctgcgccacg agacccacgg cggggaggac gtggccgtct
     1561 tctccaaggg ccccatggcg cacctgctgc acggcgtcca cgagcagaac tacgtccccc
     1621 acgtgatggc gtatgcagcc tgcatcgggg ccaacctcgg ccactgtgct cctgccagct
     1681 cggcaggcag ccttgctgca ggccccctgc tgctcgctct ggccctctac cccctgagcg
     1741 tcctgttctg agggcccagg gcccgggcac ccacaagccc gtgacagatg ccaacttccc
     1801 acacggcagc ccccccctca aggggcaggg aggtgggggc ctcctcagcc tctgcaactg
     1861 caagaaaggg gacccaggaa accaaagtct gccgcccacc tcgctcccct ctggaatctt
     1921 ccccaagggc caaacccact tctggcctcc agcctttgct ccctccccgc tgccctttgg
     1981 ccaccagggt agatttctct tgggcaggca gagagtacag actgcagaca ttctcaaagc
     2041 ctcttatttt tctagcgaac gtatttctcc agacccagag gccctgaagc ctccgtggaa
     2101 cattgtggat ctgaccctcc cagtctcatc tcctgaccct cccactccca tctccttacc
     2161 tctggaaccc cccaggccct acaatgctca tgtccctgtc cccaggcgag ccctccttca
     2221 ggggagttga ggtctttctc ctcaggacaa ggccttgctc actcactcac tccaagacca
     2281 ccagggtccc aggaagccgg tgcctgggtg gccatcctac ccagcgtgcc caggccggga
     2341 agagccacct ggcagggctc acactcctgg gctctgaaca cacacgccag ctcctctctg
     2401 aagcgactct cctgtttgga acggcaaaaa aaaatttttt tttctctttt tggtggtggt
     2461 taaaagggaa cacaaaacat ttaaataaaa ctttccaaat atttccgagg 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH005272. Human liver/bone/...[gi:178461] Links  


LOCUS       HUMALPL01                806 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            1.
ACCESSION   M24428 J03929 M14168 M21959
VERSION     M24428.1  GI:178449
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     1 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 631 to 702)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 806)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..806
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     exon            560..701
                     /gene="ALPL"
                     /note="minor alternative exon; G00-118-730"
                     /number=1
     exon            561..701
                     /gene="ALPL"
                     /note="minor alternative exon; G00-118-730"
                     /number=1
     exon            562..701
                     /gene="ALPL"
                     /note="minor alternative exon; G00-118-730"
                     /number=1
     exon            563..701
                     /partial
                     /gene="ALPL"
                     /note="minor alternative exon; G00-118-730"
                     /number=1
     exon            610..701
                     /gene="ALPL"
                     /note="major alternative exon; G00-118-730"
                     /number=1
     exon            611..701
                     /gene="ALPL"
                     /note="major alternative exon; G00-118-730"
                     /number=1
     exon            612..701
                     /gene="ALPL"
                     /note="major alternative exon; G00-118-730"
                     /number=1
     exon            613..701
                     /gene="ALPL"
                     /note="major alternative exon; G00-118-730"
                     /number=1
     exon            614..701
                     /gene="ALPL"
                     /note="major alternative exon; G00-118-730"
     exon            615..701
                     /gene="ALPL"
                     /note="major alternative exon; G00-118-730"
                     /number=1
     exon            616..701
                     /gene="ALPL"
                     /note="major alternative exon; G00-118-730"
                     /number=1
     intron          702..>806
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=1
BASE COUNT      148 a    244 c    319 g     95 t
ORIGIN      Chromosome 1p36.1-p34.
        1 ggtccccttc tgcttcttct tgcggtagcc agggagggca gcccacgggc aggaagcggg
       61 ggtgggggtg cagagtcaga ggtgcacgtg gacagagaca gagagacagg gacacgtggg
      121 cagagacgga taaagacaga gacccagaga aagccagata tgttgacaga cacagagaca
      181 gacgccagag aggaaggcag acaaagagac gggtggagac aaagactccc accaagagac
      241 gcagaaggaa gatgccgacg gtaaagacaa aacaggagac gcgcgcaagg agcaggtcag
      301 agcccaggct cgctgagaga ggaagggctg ggctggggca gcccggaggc agagagaccg
      361 agagtgcggg gcgggcgagg gacgccaggg ccgcgtcacc ccagcccgtt cctagctccg
      421 ctcccggcag ggggcgccct ggcctcgtgg cacgaccggc ccgcggggcg cggggctcgg
      481 gccgggggcg gggccggggc cgggctgggg aggggttggg gccgggggcg ggggaggggg
      541 cgggctgccc gggcctcact cgggccccgc ggccgccttt ataaggcggc gggggtggtg
      601 gcccgggccg cgttgcgctc ccgccactcc gcgcccgcta tcctggctcc gtgctcccac
      661 gcgcttgtgc ctggacggac cctcgccagt gctctgcgca ggtaaggatt cgacgctgcc
      721 ccgcgccctg gttccccagg gccccagcgg acgtggtcca tccccttctg catcctccgc
      781 tggccccgtg gttgaacttt aatggc
//
LOCUS       HUMALPL02                183 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            2.
ACCESSION   M24429 J03929 M14168 M21960
VERSION     M24429.1  GI:178450
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     2 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 177)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 183)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..183
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=1
     exon            13..177
                     /gene="ALPL"
                     /note="first expressed exon; G00-118-730"
                     /number=2
     sig_peptide     117..167
                     /gene="ALPL"
                     /note="G00-118-730"
     intron          178..>183
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=2
BASE COUNT       40 a     56 c     37 g     50 t
ORIGIN      About 25 kb after segment 1; chromosome 1p36.1-p34.
        1 tttaatttct aggattggaa catcagttaa catctgacca ctgccagccc accccctccc
       61 acccacgtcg attgcatctc tgggctccag ggataaagca ggtcttgggg tgcaccatga
      121 tttcaccatt cttagtactg gccattggca cctgccttac taactcctta gtgccaggta
      181 tgc
//
LOCUS       HUMALPL03                138 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            3.
ACCESSION   M24430 M14168
VERSION     M24430.1  GI:178451
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     3 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 132)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 138)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..138
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=2
     exon            13..132
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=3
     intron          133..>138
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=3
BASE COUNT       44 a     30 c     38 g     26 t
ORIGIN      About 7.5 kb after segment 2; chromosome 1p36.1-p34.
        1 ctctgtgttt agagaaagag aaagacccca agtactggcg agaccaagcg caagagacac
       61 tgaaatatgc cctggagctt cagaagctca acaccaacgt ggctaagaat gtcatcatgt
      121 tcctgggaga tggtgagg
//
LOCUS       HUMALPL04                128 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            4.
ACCESSION   M24431 M14168
VERSION     M24431.1  GI:178452
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     4 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 7 to 122)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 128)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..128
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..6
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=3
     exon            7..122
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=4
     intron          123..>128
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=4
BASE COUNT       25 a     41 c     40 g     22 t
ORIGIN      0.4 kb after segment 3; chromosome 1p36.1-p34.
        1 ctgcagggat gggtgtctcc acagtgacgg ctgcccgcat cctcaagggt cagctccacc
       61 acaaccctgg ggaggagacc aggctggaga tggacaagtt ccccttcgtg gccctctcca
      121 aggtgagc
//
LOCUS       HUMALPL05                193 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            5.
ACCESSION   M24432 M14168
VERSION     M24432.1  GI:178453
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     5 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 187)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 193)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..193
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=4
     exon            13..187
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=5
     conflict        45
                     /gene="ALPL"
                     /citation=[1]
                     /replace=""
     intron          188..>193
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=5
BASE COUNT       40 a     68 c     60 g     25 t
ORIGIN      1.8 kb after segment 4; chromosome 1p36.1-p34.
        1 ccccacctgc agacgtacaa caccaatgcc caggtccctg acagcgccgg caccgccacc
       61 gcctacctgt gtggggtgaa ggccaatgag ggcaccgtgg gggtaagcgc agccactgag
      121 cgttcccggt gcaacaccac ccaggggaac gaggtcacct ccatcctgcg ctgggccaag
      181 gacgctggtg agt
//
LOCUS       HUMALPL06                194 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            6.
ACCESSION   M24433 M14168
VERSION     M24433.1  GI:178454
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     6 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 188)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 194)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..194
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=5
     exon            13..188
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=6
     intron          189..>194
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=6
BASE COUNT       46 a     65 c     52 g     31 t
ORIGIN      0.9 kb after segment 5; chromosome 1p36.1-p34.
        1 cctgcacccc agggaaatct gtgggcattg tgaccaccac gagagtgaac catgccaccc
       61 ccagcgccgc ctacgcccac tcggctgacc gggactggta ctcagacaac gagatgcccc
      121 ctgaggcctt gagccagggc tgtaaggaca tcgcctacca gctcatgcat aacatcaggg
      181 acattgacgt gagt
//
LOCUS       HUMALPL07                162 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            7.
ACCESSION   M24434 M14168
VERSION     M24434.1  GI:178455
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     7 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 156)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 162)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..162
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=6
     exon            13..156
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=7
     conflict        68
                     /gene="ALPL"
                     /citation=[1]
                     /replace=""
     intron          157..>162
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=7
BASE COUNT       46 a     33 c     52 g     31 t
ORIGIN      4.3 kb after segment 6; chromosome 1p36.1-p34.
        1 tgtctctttt aggtgatcat ggggggtggc cggaaataca tgtaccccaa gaataaaact
       61 gatgtggagt atgagagtga cgagaaagcc aggggcacga ggctggacgg cctggacctc
      121 gttgacacct ggaagagctt caaaccgaga tacaaggtag cc
//
LOCUS       HUMALPL08                 88 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            8.
ACCESSION   M24435 M14168
VERSION     M24435.1  GI:178456
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     8 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 82)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 88)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..88
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=7
     exon            13..82
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=8
     intron          83..>88
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=8
BASE COUNT       19 a     33 c     15 g     21 t
ORIGIN      1.9 kb after segment 7; chromosome 1p36.1-p34.
        1 ccttcctcct agcactccca cttcatctgg aaccgcacgg aactcctgac ccttgacccc
       61 cacaatgtgg actacctatt gggtaagt
//
LOCUS       HUMALPL09                153 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            9.
ACCESSION   M24436 M14168
VERSION     M24436.1  GI:178457
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     9 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 148)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 153)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..153
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=8
     exon            13..147
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=9
     intron          148..>153
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=9
BASE COUNT       33 a     42 c     48 g     30 t
ORIGIN      3.4 kb after segment 8; chromosome 1p36.1-p34.
        1 cgtcctcctc aggtctcttc gagccagggg acatgcagta cgagctgaac aggaacaacg
       61 tgacggaccc gtcactctcc gagatggtgg tggtggccat ccagatcctg cggaagaacc
      121 ccaaaggctt cttcttgctg gtggaaggta ggg
//
LOCUS       HUMALPL10                210 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            10.
ACCESSION   M24437 M14168
VERSION     M24437.1  GI:178458
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     10 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 204)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 210)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..210
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=9
     exon            13..204
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=10
     intron          205..>210
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=10
BASE COUNT       47 a     60 c     64 g     39 t
ORIGIN      2.1 kb after segment 9; chromosome 1p36.1-p34.
        1 tggtgtccca aggaggcaga attgaccacg ggcaccatga aggaaaagcc aagcaggccc
       61 tgcatgaggc ggtggagatg gaccgggcca tcgggcaggc aggcagcttg acctcctcgg
      121 aagacactct gaccgtggtc actgcggacc attcccacgt cttcacattt ggtggataca
      181 ccccccgtgg caactctatc tttggtaggt 
//
LOCUS       HUMALPL11                138 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            11.
ACCESSION   M24438 M14168
VERSION     M24438.1  GI:178459
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     11 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 132)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 138)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..138
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=10
     exon            13..132
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=11
     intron          133..>138
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=11
BASE COUNT       30 a     35 c     44 g     29 t
ORIGIN      0.7 kb after segment 10; chromosome 1p36.1-p34.
        1 ctccctgtgc aggtctggcc cccatgctga gtgacacaga caagaagccc ttcactgcca
       61 tcctgtatgg caatgggcct ggctacaagg tggtgggcgg tgaacgagag aatgtctcca
      121 tggtggacta tggtgaga
//
LOCUS       HUMALPL12               1051 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human liver/bone/kidney-type alkaline phosphatase (ALPL) gene, exon
            12.
ACCESSION   M24439 M14168
VERSION     M24439.1  GI:178460
KEYWORDS    alkaline phosphatase; orthophosphoric-monoester phosphohydrolase;
            phosphatase.
SEGMENT     12 of 12
SOURCE      Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA,
            clones pLBK 14 [1] and pS3-1 [2].
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 13 to 471; 473 to 1022)
  AUTHORS   Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M.
            and Harris,H.
  TITLE     Isolation and characterization of a cDNA encoding a human
            liver/bone/kidney-type alkaline phosphatase
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7182-7186 (1986)
  MEDLINE   87016911
   PUBMED   3532105
REFERENCE   2  (bases 1 to 1051)
  AUTHORS   Weiss,M.J., Ray,K., Henthorn,P.S., Lamb,B., Kadesch,T. and
            Harris,H.
  TITLE     Structure of the human liver/bone/kidney alkaline phosphatase gene
  JOURNAL   J. Biol. Chem. 263 (24), 12002-12010 (1988)
  MEDLINE   88298884
   PUBMED   3165380
COMMENT     Draft entry and clean copy sequence for [1] kindly provided by
            M.J.Weiss, 27-JAN-1987.
FEATURES             Location/Qualifiers
     source          1..1051
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p34"
                     /cell_line="Saos-2"
                     /tissue_type="osteosarcoma"
     gene            join(M24428.1:560..806,M24429.1:1..183,M24430.1:1..138,
                     M24431.1:1..128,M24432.1:1..193,M24433.1:1..194,
                     M24434.1:1..162,M24435.1:1..88,M24436.1:1..153,
                     M24437.1:1..210,M24438.1:1..138,1..1027)
                     /gene="ALPL"
     CDS             join(M24429.1:117..177,M24430.1:13..132,M24431.1:7..122,
                     M24432.1:13..187,M24433.1:13..188,M24434.1:13..156,
                     M24435.1:13..82,M24436.1:13..147,M24437.1:13..204,
                     M24438.1:13..132,13..278)
                     /gene="ALPL"
                     /EC_number="3.1.3.1"
                     /note="precursor"
                     /codon_start=1
                     /product="alkaline phosphatase"
                     /protein_id="AAB59378.1"
                     /db_xref="GI:178462"
                     /db_xref="GDB:G00-118-730"
                     /translation="MISPFLVLAIGTCLTNSLVPEKEKDPKYWRDQAQETLKYALELQ
                     KLNTNVAKNVIMFLGDGMGVSTVTAARILKGQLHHNPGEETRLEMDKFPFVALSKTYN
                     TNAQVPDSAGTATAYLCGVKANEGTVGVSAATERSRCNTTQGNEVTSILRWAKDAGKS
                     VGIVTTTRVNHATPSAAYAHSADRDWYSDNEMPPEALSQGCKDIAYQLMHNIRDIDVI
                     MGGGRKYMYPKNKTDVEYESDEKARGTRLDGLDLVDTWKSFKPRYKHSHFIWNRTELL
                     TLDPHNVDYLLGLFEPGDMQYELNRNNVTDPSLSEMVVVAIQILRKNPKGFFLLVEGG
                     RIDHGHHEGKAKQALHEAVEMDRAIGQAGSLTSSEDTLTVVTADHSHVFTFGGYTPRG
                     NSIFGLAPMLSDTDKKPFTAILYGNGPGYKVVGGERENVSMVDYAHNNYQAQSAVPLR
                     HETHGGEDVAVFSKGPMAHLLHGVHEQNYVPHVMAYAACIGANLGHCAPASSAGSLAA
                     GPLLLALALYPLSVLF"
     mat_peptide     join(M24429.1:168..177,M24430.1:13..132,M24431.1:7..122,
                     M24432.1:13..187,M24433.1:13..188,M24434.1:13..156,
                     M24435.1:13..82,M24436.1:13..147,M24437.1:13..204,
                     M24438.1:13..132,13..275)
                     /gene="ALPL"
                     /product="alkaline phosphatase"
                     /EC_number="1.3.1.3"
                     /note="G00-118-730"
     intron          <1..12
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=11
     exon            13..1027
                     /gene="ALPL"
                     /note="G00-118-730"
                     /number=12
     old_sequence    239..240
                     /gene="ALPL"
                     /citation=[1]
     conflict        245
                     /gene="ALPL"
                     /citation=[1]
                     /replace=""
     conflict        344
                     /gene="ALPL"
                     /citation=[1]
                     /replace=""
     old_sequence    471
                     /gene="ALPL"
                     /citation=[2]
     conflict        510
                     /gene="ALPL"
                     /citation=[1]
                     /replace=""
     conflict        632
                     /gene="ALPL"
                     /citation=[1]
                     /replace=""
     conflict        732..735
                     /gene="ALPL"
                     /citation=[1]
                     /replace=""
     old_sequence    900
                     /gene="ALPL"
                     /citation=[2]
     old_sequence    924
                     /gene="ALPL"
                     /citation=[2]
BASE COUNT      208 a    378 c    256 g    209 t
ORIGIN      0.5 kb after segment 11; chromosome 1p36.1-p34.
        1 cctggcccac agctcacaac aactaccagg cgcagtctgc tgtgcccctg cgccacgaga
       61 cccacggcgg ggaggacgtg gccgtcttct ccaagggccc catggcgcac ctgctgcacg
      121 gcgtccacga gcagaactac gtcccccacg tgatggcgta tgcagcctgc atcggggcca
      181 acctcggcca ctgtgctcct gccagctcgg caggcagcct tgctgcaggc cccctgctgc
      241 tcgctctggc cctctacccc ctgagcgtcc tgttctgagg gcccagggcc cgggcaccca
      301 caagcccgtg acagatgcca acttcccaca cggcagcccc cccttcaagg ggcagggagg
      361 tgggggcctc ctcagcctct gcaactgcaa gaaaggggac ccaggaaacc aaagtctgcc
      421 gcccacctcg ctcccctctg gaatcttccc caagggccaa acccacttct ggcctccagc
      481 ctttgctccc tccccgctgc cctttggcca acagggtaga tttctcttgg gcaggcagag
      541 agtacagact gcagacattc tcaaagcctc ttatttttct agcgaacgta tttctccaga
      601 cccagaggcc ctgaagcctc cgtggaacat tctggatctg accctcccag tctcatctcc
      661 tgaccctccc actcccatct ccttacctct ggaacccccc aggccctaca atgctcatgt
      721 ccctgtcccc aggccagccc tccttcaggg gagttgaggt ctttctcctc aggacaaggc
      781 cttgctcact cactcactcc aagaccacca gggtcccagg aagccggtgc ctgggtggcc
      841 atcctaccca gcgtgcccag gccgggaaga gccacctggc agggctcaca ctcctgggct
      901 ctgaacacac acgccagctc ctctctgaag cgactctcct gtttggaacg gcaaaaaaaa
      961 attttttttt ctctttttgg tggtggttaa aagggaacac aaaacattta aataaaactt
     1021 tccgaggaca gagctgagtc tttgtggtca g
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

ProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M15801. Human fibronectin...[gi:182686] Links  


LOCUS       HUMFN3                   742 bp    DNA     linear   PRI 21-NOV-1994
DEFINITION  Human fibronectin (FN) gene, exon 1, clone pgHF3.7.
ACCESSION   M15801
VERSION     M15801.1  GI:182686
KEYWORDS    fibronectin.
SOURCE      Homo sapiens fibrosarcoma DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 742)
  AUTHORS   Dean,D.C., Bowlus,C.L. and Bourgeois,S.
  TITLE     Cloning and analysis of the promotor region of the human
            fibronectin gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (7), 1876-1880 (1987)
  MEDLINE   87175578
   PUBMED   3031656
COMMENT     Draft entry and clean copy of sequence [1] kindly provided by,
            S.Bourgeois , 01-JUN-1987.
FEATURES             Location/Qualifiers
     source          1..742
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="2q34"
                     /cell_line="HT1080, DNA"
                     /tissue_type="fibrosarcoma"
     gene            310..742
                     /gene="FN1"
     exon            310..724
                     /partial
                     /gene="FN1"
                     /note="G00-119-135"
                     /number=1
     CDS             577..>724
                     /gene="FN1"
                     /codon_start=1
                     /product="fibronectin"
                     /protein_id="AAA53376.1"
                     /db_xref="GI:553293"
                     /db_xref="GDB:G00-119-135"
                     /translation="MLRGPGPGLLLLAVLCLGTAVPSTGASKSKRQAQQMVQPQSPVA
                     VSQSK"
     sig_peptide     577..669
                     /gene="FN1"
                     /note="G00-119-135"
     mat_peptide     670..>724
                     /gene="FN1"
                     /product="fibronectin"
                     /note="G00-119-135"
     intron          725..>742
                     /gene="FN1"
                     /note="G00-119-135"
                     /number=1
BASE COUNT      118 a    264 c    240 g    120 t
ORIGIN      147 bp upstream of SmaI site; map position 8.
        1 ccagccgctt cccatccctt cccccatccc ctaaaaagtt tgatgaccgc aaaggaaacc
       61 gaaaaaaagt tgtcttgccc cagtcctggc gggccatcag catctctttt gttcgctgcg
      121 aacccacagt cccccgtgac gtcacccggg agcccgggcc aatcgggcgc ggtcggctgc
      181 ggcggccggc gggcgggcgg gtggggtggg gcggggcggg gacagcccgg cgggtctctc
      241 ctcccccgcg ccccgggcct ccagaggggc gggagggccg tcccatataa gcccggctcc
      301 cgcgctccga cgcccgcgcc ggctgtgctg cacaggggga ggagagggaa ccccaggcgc
      361 gagcgggaag aggggacctg cagccacaac ttctctggtc ctctgcatcc cttctgtccc
      421 tccacccgtc cccttcccca ccctctggcc cccaccttct tggaggcgac aacccccggg
      481 aggcattaga agggattttt cccgcagttg cgaagggaag caaacttggt ggcaacttgc
      541 ctcccggtgc gggcgtctct cccccaccgt ctcaacatgc ttaggggtcc ggggcccggg
      601 ctgctgctgc tggccgtcct gtgcctgggg acagcggtgc cctccacggg agcctcgaag
      661 agcaagaggc aggctcagca aatggttcag ccccagtccc cggtggctgt cagtcaaagc
      721 aagcgtgagt actgaccgcg gg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AL109804. Human DNA sequenc...[gi:11121192] Links  


LOCUS       HS1009E24             185820 bp    DNA     linear   PRI 26-APR-2001
DEFINITION  Human DNA sequence from clone RP5-1009E24 on chromosome 20 Contains
            the SN gene encoding sialoadhesin, a novel gene similar to
            KIAA0417, the CENPB gene for centromere protein B, the CDC25B gene
            for Cell division cycle protein 25B, three novel genes, the 5' end
            of gene KIAA1271, nine CpG islands, ESTs, STSs and GSSs, complete
            sequence.
ACCESSION   AL109804
VERSION     AL109804.41  GI:11121192
KEYWORDS    HTG; CDC25B; CENPB; Centromere; CpG island; KIAA0417; KIAA1271;
            sialoadhesin; SN.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 185820)
  AUTHORS   Phillimore,B.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-APR-2001) Sanger Centre, Hinxton, Cambridgeshire,
            CB10 1SA, UK. E-mail enquiries: humquery@sanger.ac.uk Clone
            requests: clonerequest@sanger.ac.uk
COMMENT     On Nov 8, 2000 this sequence version replaced gi:10443466.
            During sequence assembly data is compared from overlapping clones.
            Where differences are found these are annotated as variations
            together with a note of the overlapping clone name. Note that the
            variation annotation may not be found in the sequence submission
            corresponding to the overlapping clone, as we submit sequences with
            only a small overlap as described above.
            The following abbreviations are used to associate primary accession
            numbers given in the feature table with their source databases:
            Em:, EMBL; Sw:, SWISSPROT; Tr:, TREMBL; Wp:, WORMPEP; Information
            on the WORMPEP database can be found at
            http://www.sanger.ac.uk/Projects/C_elegans/wormpep This sequence
            was generated from part of bacterial clone contigs of human
            chromosome 20, constructed by the Sanger Centre Chromosome 20
            Mapping Group.  Further information can be found at
            http://www.sanger.ac.uk/HGP/Chr20
            This sequence is the entire insert of clone RP5-1009E24 The true
            left end of clone RP11-119B16 is at 128095 in this sequence. The
            true right end of clone RP5-964F7 is at 20907 in this sequence.
            This sequence was finished as follows unless otherwise noted: all
            regions were either double-stranded or sequenced with an alternate
            chemistry or covered by high quality data (i.e., phred quality >=
            30); an attempt was made to resolve all sequencing problems, such
            as compressions and repeats; all regions were covered by at least
            one plasmid subclone or more than one M13 subclone; and the
            assembly was confirmed by restriction digest. RP5-1009E24 is from
            the library RPCI-5 constructed by the group of Pieter de Jong. For
            further details see
            http://www.chori.org/bacpac/home.htm
            VECTOR: pCYPAC2.
FEATURES             Location/Qualifiers
     source          1..185820
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="20"
                     /clone="RP5-1009E24"
                     /clone_lib="RPCI-5"
     repeat_region   644..695
                     /note="MIR repeat: matches 195..250 of consensus"
     repeat_region   1011..1288
                     /note="AluSx repeat: matches 37..312 of consensus"
     misc_feature    2195..2583
                     /note="match: GSS: Em:AQ790341"
     repeat_region   3333..3423
                     /note="MIR repeat: matches 79..169 of consensus"
     repeat_region   3643..3706
                     /note="MER69 repeat: matches 70..136 of consensus"
     repeat_region   3708..3789
                     /note="MER69 repeat: matches 2422..2512 of consensus"
     repeat_region   3708..3739
                     /note="16 copies 2 mer tt 84% conserved"
     misc_feature    6250..7126
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   6526..6555
                     /note="10 copies 3 mer agc 90% conserved"
     repeat_region   6702..6757
                     /note="28 copies 2 mer cc 73% conserved"
     repeat_region   7387..7686
                     /note="AluSc repeat: matches 1..300 of consensus"
     repeat_region   7710..8015
                     /note="AluSp repeat: matches 1..307 of consensus"
     repeat_region   8249..8302
                     /note="27 copies 2 mer tt 72% conserved"
     repeat_region   8361..8589
                     /note="AluJo repeat: matches 80..292 of consensus"
     repeat_region   8590..8899
                     /note="AluSq repeat: matches 1..310 of consensus"
     repeat_region   8900..8968
                     /note="AluJo repeat: matches 9..80 of consensus"
     repeat_region   9029..9471
                     /note="L1ME2 repeat: matches 5706..6155 of consensus"
     repeat_region   9484..9645
                     /note="MLT2CA repeat: matches 321..483 of consensus
                     MLT2CA repeat: matches 321..483 of consensus"
     repeat_region   9840..10070
                     /note="L2 repeat: matches 2417..2664 of consensus"
     repeat_region   10677..10756
                     /note="L1ME2 repeat: matches 5557..5637 of consensus"
     repeat_region   10754..10907
                     /note="L1ME3 repeat: matches 5901..6065 of consensus"
     gene            complement(11558..31714)
                     /gene="SN"
     mRNA            complement(join(11558..13205,13741..13813,14186..14288,
                     14548..14850,15926..16186,16489..16788,17107..17358,
                     17448..17717,18033..18284,18807..19118,19249..19509,
                     21172..21474,21671..21928,22384..22719,23788..24045,
                     25928..26227,27783..28037,28411..28677,30330..30626,
                     30933..31292,31666..31714))
                     /gene="SN"
                     /product="dJ1009E24.1.1 (Sialoadhesin (isoform 1) )"
                     /note="match: cDNAs: Em:Z36293 Em:Z36233 Em:AK024462
                     match: ESTs: Em:AI347674 Em:AI818995 Em:AI864580
                     Em:AI375186 Em:AI936668 Em:AI032935 Em:AI031641
                     Em:BE857558"
                     /evidence=not_experimental
     mRNA            complement(join(11558..14850,15926..16186,16489..16788,
                     17107..17358,17448..17665))
                     /gene="SN"
                     /product="dJ1009E24.1.2 (Sialoadhesin (isoform 2))"
                     /note="match: cDNAs: Em:AK024479 Em:AK024459"
                     /evidence=not_experimental
     polyA_site      complement(11558)
                     /gene="SN"
     polyA_signal    complement(11575..11580)
                     /gene="SN"
     repeat_region   12381..12733
                     /note="MLT1J repeat: matches 91..485 of consensus"
     CDS             complement(join(13146..13205,13741..13813,14186..14288,
                     14548..14850,15926..16186,16489..16788,17107..17358,
                     17448..17717,18033..18284,18807..19118,19249..19509,
                     21172..21474,21671..21928,22384..22719,23788..24045,
                     25928..26227,27783..28037,28411..28677,30330..30626,
                     30933..31292,31666..31714))
                     /gene="SN"
                     /note="match: proteins: Tr:Q62523 Sw:Q62230 Sw:P20273
                     Tr:Q08476"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.1.1 (Sialoadhesin (isoform 1) )"
                     /protein_id="CAC17543.1"
                     /db_xref="GI:11493365"
                     /translation="MGFLPKLLLLASFFPAGQASWGVSSPQDVQGVKGSCLLIPCIFS
                     FPADVEVPDGITAIWYYDYSGQRQVVSHSADPKLVEARFRGRTEFMGNPEHRVCNLLL
                     KDLQPEDSGSYNFRFEISEVNRWSDVKGTLVTVTEEPRVPTIASPVELLEGTEVDFNC
                     STPYVCLQEQVRLQWQGQDPARSVTFNSQKFEPTGVGHLETLHMAMSWQDHGRILRCQ
                     LSVANHRAQSEIHLQVKYAPKGVKILLSPSGRNILPGELVTLTCQVNSSYPAVSSIKW
                     LKDGVRLQTKTGVLHLPQAAWSDAGVYTCQAENGVGSLVSPPISLHIFMAEVQVSPAG
                     PILENQTVTLVCNTPNEAPSDLRYSWYKNHVLLEDAHSHTLRLHLATRADTGFYFCEV
                     QNVHGSERSGPVSVVVNHPPLTPVLTAFLETQAGLVGILHCSVVSEPLATLVLSHGGH
                     ILASTSGDSDHSPRFSGTSGPNSLRLEIRDLEETDSGEYKCSATNSLGNATSTLDFHA
                     NAARLLISPAAEVVEGQAVTLSCRSGLSPTPDARFSWYLNGALLHEGPGSSLLLPAAS
                     STDAGSYHCRARDGHSASGPSSPAVLTVLYPPRQPTFTTRLDLDAAGAGAGRRGLLLC
                     RVDSDPPARLQLLHKDRVVATSLPSGGGCSTCGGCSPRMKVTKAPNLLRVEIHNPLLE
                     EEGLYLCEASNALGNASTSATFNGQATVLAIAPSHTLQEGTEANLTCNVSREAAGSPA
                     NFSWFRNGVLWAQGPLETVTLLPVARTDAALYACRILTEAGAQLSTPVLLSVLYPPDR
                     PKLSALLDMGQGHMALFICTVDSRPLALLALFHGEHLLATSLGPQVPSHGRFQAKAEA
                     NSLKLEVRELGLGDSGSYRCEATNVLGSSNTSLFFQVRGAWVQVSPSPELQEGQAVVL
                     SCQVHTGVPEGTSYRWYRDGQPLQESTSATLRFAAITLTQAGAYHCQAQAPGSATTSL
                     AAPISLHVSYAPRHVTLTTLMDTGPGRLGLLLCRVDSDPPAQLRLLHGDRLVASTLQG
                     VGGPEGSSPRLHVAVAPNTLRLEIHGAMLEDEGVYICEASNTLGQASASADFDAQAVN
                     VQVWPGATVREGQLVNLTCLVWTTHPAQLTYTWYQDGQQRLDAHSIPLPNVTVRDATS
                     YRCGVGPPGRAPRLSRPITLDVLYAPRNLRLTYLLESHGGQLALVLCTVDSRPPAQLA
                     LSHAGRLLASSTAASVPNTLRLELRGPQPRDEGFYSCSARSPLGQANTSLELRLEGVR
                     VILAPEAAVPEGAPITVTCADPAAHAPTLYTWYHNGRWLQEGPAASLSFLVATRAHAG
                     AYSCQAQDAQGTRSSRPAALQVLYAPQDAVLSSFRDSRARSMAVIQCTVDSEPPAELA
                     LSHDGKVLATSSGVHSLASGTGHVQVARNALRLQVQDVPAGDDTYVCTAQNLLGSIST
                     IGRLQVEGARVVAEPGLDVPEGAALNLSCRLLGGPGPVGNSTFAWFWNDRRLHAEPVP
                     TLAFTHVARAQAGMYHCLAELPTGAAASAPVMLRVLYPPKTPTMMVFVEPEGGLRGIL
                     DCRVDSEPLASLTLHLGSRLVASSQPQGAPAEPHIHVLASPNALRVDIEALRPSDQGE
                     YICSASNVLGSASTSTYFGVRALHRLHQFQQLLWVLGLLVGLLLLLLGLGACYTWRRR
                     RVCKQSMGENSVEMAFQKETTQLIDPDAATCETSTCAPPLG"
     repeat_region   14212..14286
                     /note="25 copies 3 mer cag 65% conserved"
     CDS             complement(join(14492..14850,15926..16186,16489..16788,
                     17107..17358,17448..>17665))
                     /gene="SN"
                     /codon_start=2
                     /evidence=not_experimental
                     /product="dJ1009E24.1.2 (Sialoadhesin (isoform 2))"
                     /protein_id="CAC17542.1"
                     /db_xref="GI:11493364"
                     /translation="LALVLCTVDSRPPAQLALSHAGRLLASSTAASVPNTLRLELRGP
                     QPRDEGFYSCSARSPLGQANTSLELRLEGVRVILAPEAAVPEGAPITVTCADPAAHAP
                     TLYTWYHNGRWLQEGPAASLSFLVATRAHAGAYSCQAQDAQGTRSSRPAALQVLYAPQ
                     DAVLSSFRDSRARSMAVIQCTVDSEPPAELALSHDGKVLATSSGVHSLASGTGHVQVA
                     RNALRLQVQDVPAGDDTYVCTAQNLLGSISTIGRLQVEGARVVAEPGLDVPEGAALNL
                     SCRLLGGPGPVGNSTFAWFWNDRRLHAEPVPTLAFTHVARAQAGMYHCLAELPTGAAA
                     SAPVMLRVLYPPKTPTMMVFVEPEGGLRGILDCRVDSEPLASLTLHLGSRLVASSQPQ
                     GAPAEPHIHVLASPNALRVDIEALRPSDQGEYICSASNVLGSASTSTYFGVRGEGRGL
                     HLPGHSAQKPSS"
     repeat_region   14872..15301
                     /note="MLT1H repeat: matches 8..547 of consensus"
     repeat_region   15333..15405
                     /note="MIR repeat: matches 196..255 of consensus"
     repeat_region   15406..15711
                     /note="AluSx repeat: matches 6..311 of consensus"
     repeat_region   15712..15816
                     /note="MIR repeat: matches 47..196 of consensus"
     repeat_region   18761..18802
                     /note="21 copies 2 mer ac 100% conserved"
     repeat_region   19757..19875
                     /note="L2 repeat: matches 2617..2749 of consensus"
     repeat_region   20406..20683
                     /note="AluSp repeat: matches 1..296 of consensus"
     misc_feature    20673..21013
                     /note="match: STS: Em:HSB026XH5"
     repeat_region   20707..20752
                     /note="23 copies 2 mer ca 87% conserved"
     repeat_region   22819..22997
                     /note="MER91A repeat: matches 2..186 of consensus
                     MER91A repeat: matches 2..186 of consensus"
     repeat_region   24281..24402
                     /note="AluJo repeat: matches 1..124 of consensus"
     repeat_region   24403..24693
                     /note="AluJb repeat: matches 1..294 of consensus"
     repeat_region   24693..25003
                     /note="AluSx repeat: matches 1..311 of consensus"
     repeat_region   25004..25187
                     /note="AluJo repeat: matches 434..300 of consensus"
     repeat_region   25274..25584
                     /note="AluSc repeat: matches 1..309 of consensus"
     repeat_region   26485..26889
                     /note="L1MC4 repeat: matches 7188..7559 of consensus"
     repeat_region   26890..27187
                     /note="AluSx repeat: matches 1..296 of consensus"
     repeat_region   27188..27292
                     /note="L1MC4 repeat: matches 7559..7656 of consensus"
     repeat_region   27482..27599
                     /note="L1MC4 repeat: matches 7854..7977 of consensus"
     repeat_region   29202..29509
                     /note="AluJo repeat: matches 1..301 of consensus"
     repeat_region   29510..29569
                     /note="MIR repeat: matches 185..244 of consensus"
     repeat_region   29574..29882
                     /note="AluSx repeat: matches 1..310 of consensus"
     repeat_region   32272..32335
                     /note="L2 repeat: matches 2163..2229 of consensus"
     repeat_region   32934..33069
                     /note="L1MB5 repeat: matches 5022..5156 of consensus"
     repeat_region   33070..33350
                     /note="AluJo repeat: matches 1..296 of consensus"
     repeat_region   33352..33653
                     /note="AluSc repeat: matches 1..300 of consensus"
     repeat_region   33654..34656
                     /note="L1MB5 repeat: matches 5156..6164 of consensus"
     repeat_region   37748..38059
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   38241..38563
                     /note="AluJo repeat: matches 1..312 of consensus"
     repeat_region   39027..39221
                     /note="MLT1H repeat: matches 110..307 of consensus"
     repeat_region   39512..39809
                     /note="AluSx repeat: matches 1..294 of consensus"
     repeat_region   39860..40183
                     /note="AluJb repeat: matches 3..310 of consensus"
     repeat_region   40187..40253
                     /note="MIR repeat: matches 79..150 of consensus"
     repeat_region   40355..40547
                     /note="MLT1J repeat: matches 208..410 of consensus"
     repeat_region   40569..40724
                     /note="FRAM repeat: matches 8..163 of consensus"
     repeat_region   40736..40867
                     /note="MLT1J repeat: matches 37..195 of consensus"
     repeat_region   41138..41232
                     /note="MIR repeat: matches 5..107 of consensus"
     repeat_region   41793..41956
                     /note="AluY repeat: matches 136..299 of consensus"
     repeat_region   41965..42234
                     /note="AluSx repeat: matches 3..284 of consensus"
     repeat_region   42760..43063
                     /note="AluJb repeat: matches 1..299 of consensus"
     repeat_region   43075..43203
                     /note="MLT1J repeat: matches 37..195 of consensus"
     repeat_region   43221..43334
                     /note="AluSg/x repeat: matches 198..311 of consensus"
     repeat_region   43415..43726
                     /note="AluSx repeat: matches 5..305 of consensus"
     repeat_region   43730..43942
                     /note="MIR repeat: matches 2..221 of consensus"
     misc_feature    complement(43953..44223)
                     /note="match: STS: Em:L18465"
     repeat_region   43997..44096
                     /note="AluJo/FLAM repeat: matches 2..102 of consensus"
     repeat_region   44097..44383
                     /note="AluJb repeat: matches 1..282 of consensus"
     repeat_region   44384..44414
                     /note="FLAM_C repeat: matches 102..132 of consensus"
     repeat_region   44434..44598
                     /note="L1M4 repeat: matches 1199..1354 of consensus"
     repeat_region   44494..44609
                     /note="HAL1 repeat: matches 170..289 of consensus"
     repeat_region   45597..45902
                     /note="AluSg repeat: matches 3..308 of consensus"
     repeat_region   46249..46504
                     /note="AluJb repeat: matches 1..301 of consensus"
     repeat_region   46898..47239
                     /note="TIGGER2 repeat: matches 1..2717 of consensus
                     TIGGER2 repeat: matches 1..2717 of consensus"
     repeat_region   47351..47650
                     /note="AluSx repeat: matches 1..300 of consensus"
     repeat_region   47654..47808
                     /note="L1MC2 repeat: matches 5182..5342 of consensus"
     repeat_region   47809..48086
                     /note="AluSx repeat: matches 9..294 of consensus"
     repeat_region   48087..48432
                     /note="L1MC2 repeat: matches 5342..5675 of consensus"
     repeat_region   48434..48518
                     /note="AluJo repeat: matches 1..91 of consensus"
     repeat_region   48519..48832
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   48833..49043
                     /note="AluJo repeat: matches 91..273 of consensus"
     repeat_region   49044..49171
                     /note="64 copies 2 mer ga 61% conserved"
     repeat_region   49209..49921
                     /note="L1MC2 repeat: matches 5661..6326 of consensus"
     repeat_region   49952..50270
                     /note="AluSx repeat: matches 12..298 of consensus"
     repeat_region   50310..50476
                     /note="FRAM repeat: matches 1..165 of consensus"
     misc_feature    complement(50609..50813)
                     /note="match: STS: Em:G04713"
     repeat_region   50818..51120
                     /note="AluSx repeat: matches 1..303 of consensus"
     repeat_region   51155..51340
                     /note="AluJb repeat: matches 136..312 of consensus"
     repeat_region   51347..51645
                     /note="AluSg repeat: matches 1..299 of consensus"
     repeat_region   52013..52178
                     /note="L2 repeat: matches 1591..1754 of consensus"
     repeat_region   52189..52690
                     /note="MLT1J repeat: matches 13..513 of consensus"
     repeat_region   53124..53252
                     /note="L2 repeat: matches 2363..2493 of consensus"
     misc_feature    53729..54143
                     /note="match: GSS: Em:AQ071055"
     repeat_region   53834..53930
                     /note="MIR repeat: matches 92..191 of consensus"
     repeat_region   54064..54173
                     /note="55 copies 2 mer tg 73% conserved"
     repeat_region   55555..55726
                     /note="86 copies 2 mer ac 69% conserved"
     repeat_region   56198..56475
                     /note="AluJo repeat: matches 12..289 of consensus"
     repeat_region   57171..57398
                     /note="114 copies 2 mer gg 54% conserved"
     repeat_region   58617..58760
                     /note="AluJb repeat: matches 146..289 of consensus"
     repeat_region   59324..59623
                     /note="AluSg repeat: matches 1..300 of consensus"
     repeat_region   59830..59867
                     /note="MIR repeat: matches 107..144 of consensus"
     repeat_region   60048..60351
                     /note="AluSq repeat: matches 1..304 of consensus"
     repeat_region   60446..60505
                     /note="L2 repeat: matches 2643..2706 of consensus"
     repeat_region   60556..60761
                     /note="MER63A repeat: matches 1..209 of consensus
                     MER63A repeat: matches 1..209 of consensus"
     repeat_region   61277..61326
                     /note="MER5A repeat: matches 133..183 of consensus"
     repeat_region   61327..61670
                     /note="AluSx repeat: matches 1..311 of consensus"
     misc_feature    complement(61498..61660)
                     /note="match: GSS: Em:AQ750212"
     repeat_region   61671..61806
                     /note="MER5A repeat: matches 4..133 of consensus"
     repeat_region   61876..61980
                     /note="L2 repeat: matches 1129..1227 of consensus"
     repeat_region   61993..62054
                     /note="AluJ/FRAM repeat: matches 231..292 of consensus"
     repeat_region   62307..62607
                     /note="AluSx repeat: matches 1..301 of consensus"
     misc_feature    complement(62332..62490)
                     /note="match: GSS: Em:AQ015769"
     misc_feature    complement(62339..62515)
                     /note="match: GSS: Em:AQ028161"
     misc_feature    complement(62426..62603)
                     /note="match: GSS: Em:AQ891720"
     misc_feature    complement(62436..62607)
                     /note="match: GSS: Em:AQ317665"
     repeat_region   62744..62933
                     /note="L2 repeat: matches 2522..2735 of consensus"
     repeat_region   62938..63152
                     /note="L1MC5 repeat: matches 7558..7783 of consensus"
     gene            63244..77697
                     /gene="dJ1009E24.2"
     mRNA            join(63244..63303,65401..65498,66870..66994,69488..69674,
                     70079..70183,70501..70617,72803..72977,73818..73904,
                     74336..74440,74555..74813,75390..75493,76097..77697)
                     /gene="dJ1009E24.2"
                     /product="dJ1009E24.2 (novel protein similar to KIAA0417)"
                     /note="match: cDNAs: Em:AB007877 Em:AK012548 Em:AK006867"
                     /evidence=not_experimental
     CDS             join(63261..63303,65401..65498,66870..66994,69488..69674,
                     70079..70183,70501..70617,72803..72977,73818..73904,
                     74336..74440,74555..74813,75390..75493,76097..76752)
                     /gene="dJ1009E24.2"
                     /note="match: proteins: Tr:O43301"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.2 (novel protein similar to KIAA0417)"
                     /protein_id="CAC17544.3"
                     /db_xref="GI:13624748"
                     /translation="MLAVPEMGLQGLYIGSSPERSPVPSPPGSPRTQESCGIAPLTPS
                     QSPKPEVRAPQQASFSVVVAIDFGTTSSGYAFSFASDPEAIHMMRKWEGGDPGVAHQK
                     TPTCLLLTPEGAFHSFGYTARDYYHDLDPEEARDWLYFEKFKMKIHSATDLTLKTQLE
                     AVNGKTMPALEVFAHALRFFREHALQELREQSPSLPEKDTVRWVLTVPAIWKQPAKQF
                     MREAAYLAGLVSRENAEQLLIALEPEAASVYCRKLRLHQLLDLSGRAPGGGRLGERRS
                     IDSSFRQAREQLRRSRHSRTFLVESGVGELWAEMQAGDRYVVADCGGGTVDLTVHQLE
                     QPHGTLKELYKASGGPYGAVGVDLAFEQLLCRIFGEDFIATFKRQRPAAWVDLTIAFE
                     ARKRTAGPHRAGALNISLPFSFIDFYRKQRGHNVETALRRSSVNFVKWSSQGMLRMSC
                     EAMNELFQPTVSGIIQHIEALLARPEVQGVKLLFLVGGFAESAVLQHAVQAALGARGL
                     RVVVPHDVGLTILKGAVLFGQAPGVVRVRRSPLTYGVGVLNRFVPGRHPPEKLLVRDG
                     RRWCTDVFERFVAAEQSVALGEEVRRSYCPARPGQRRVLINLYCCAAEDARFITDPGV
                     RKCGALSLELEPADCGQDTAGAPPGRREIRAAMQFGDTEIKVTAVDVSTNRSVRASID
                     FLSN"
     repeat_region   63417..63452
                     /note="18 copies 2 mer tg 86% conserved"
     repeat_region   63631..63726
                     /note="48 copies 2 mer gt 61% conserved"
     repeat_region   65936..66232
                     /note="AluSg repeat: matches 1..294 of consensus"
     repeat_region   67275..67457
                     /note="AluSg/x repeat: matches 125..307 of consensus"
     repeat_region   67462..67770
                     /note="AluY repeat: matches 1..297 of consensus"
     repeat_region   67771..68082
                     /note="AluJo repeat: matches 1..299 of consensus"
     repeat_region   68220..68425
                     /note="L1ME repeat: matches 5620..5833 of consensus"
     repeat_region   68462..68513
                     /note="26 copies 2 mer aa 78% conserved"
     repeat_region   69072..69282
                     /note="L2 repeat: matches 1974..2184 of consensus"
     repeat_region   69390..69438
                     /note="L2 repeat: matches 2352..2400 of consensus"
     repeat_region   70848..70885
                     /note="LTR16A repeat: matches 408..444 of consensus"
     repeat_region   70886..71187
                     /note="AluYb8 repeat: matches 1..311 of consensus"
     repeat_region   71188..71523
                     /note="LTR16A repeat: matches 64..408 of consensus"
     misc_feature    74264..76883
                     /gene="dJ1009E24.2"
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   75669..75835
                     /note="MIR repeat: matches 82..251 of consensus"
     misc_feature    complement(77427..77688)
                     /note="match: STS: Em:G14972"
     repeat_region   77529..77610
                     /note="MIR repeat: matches 59..142 of consensus"
     polyA_signal    77672..77677
                     /gene="dJ1009E24.2"
     polyA_site      77697
                     /gene="dJ1009E24.2"
     gene            complement(78095..92330)
                     /gene="dJ1009E24.3"
     mRNA            complement(join(78095..78744,78983..79105,80055..80193,
                     83122..83264,84668..84772,92249..92330))
                     /gene="dJ1009E24.3"
                     /product="dJ1009E24.3 (novel protein)"
                     /note="match: cDNAs: Em:AK024220 Em:AK000557 Em:AK022713
                     match: ESTs: Em:BE795641 Em:BE786533 Em:BE793892
                     Em:BE536848"
                     /evidence=not_experimental
     polyA_site      complement(78095)
                     /gene="dJ1009E24.3"
     polyA_signal    complement(78112..78117)
                     /gene="dJ1009E24.3"
     CDS             complement(join(78644..78744,78983..79105,80055..80193,
                     83122..83264,84668..84686))
                     /gene="dJ1009E24.3"
                     /note="match: proteins: Tr:Q62523"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.3 (novel protein)"
                     /protein_id="CAC17545.1"
                     /db_xref="GI:11493367"
                     /translation="MAAANKGNKPRVRSIRFAAGHDAEGSHSHVHFDEKLHDSVVMVT
                     QESDSSFLVKVGFLKILHRYEITFTLPPVHRLSKDVREAPVPSLHLKLLSVVPVPEGY
                     SVKCEYSAHKEGVLKEEILLACEGGTGTCVRVTVQARVMDRHHGTPMLLDGVKCVGAE
                     LEYDSEHSDWHGFD"
     repeat_region   81655..81948
                     /note="AluSq repeat: matches 1..299 of consensus"
     repeat_region   82214..82427
                     /note="AluSg/x repeat: matches 87..303 of consensus"
     repeat_region   82771..82860
                     /note="MIR repeat: matches 29..122 of consensus"
     repeat_region   82976..83010
                     /note="MIR repeat: matches 110..144 of consensus"
     repeat_region   83267..83313
                     /note="L1PBa repeat: matches -1202..-1156 of consensus"
     misc_feature    83884..84178
                     /note="match: GSS: Em:AQ028334"
     repeat_region   84500..84575
                     /note="38 copies 2 mer ca 75% conserved"
     misc_feature    complement(85104..85666)
                     /gene="dJ1009E24.3"
                     /note="match: GSS: Em:AQ375029"
     misc_feature    complement(85341..85642)
                     /gene="dJ1009E24.3"
                     /note="match: GSS: Em:AQ092132"
     repeat_region   85441..85583
                     /note="L1MC4 repeat: matches 6879..7018 of consensus"
     repeat_region   85633..85880
                     /note="L1MC5 repeat: matches 7005..7228 of consensus"
     repeat_region   85917..86221
                     /note="AluSg repeat: matches 1..302 of consensus"
     repeat_region   86336..86415
                     /note="L1MC4 repeat: matches 7341..7420 of consensus"
     repeat_region   86364..86543
                     /note="L1MC5 repeat: matches 7268..7480 of consensus"
     repeat_region   86592..86885
                     /note="AluSx repeat: matches 1..310 of consensus"
     repeat_region   86970..87236
                     /note="AluJb repeat: matches 2..299 of consensus"
     repeat_region   87244..87416
                     /note="MER58A repeat: matches 50..224 of consensus"
     repeat_region   87561..87861
                     /note="AluSx repeat: matches 2..302 of consensus"
     repeat_region   87862..88162
                     /note="AluSp repeat: matches 1..302 of consensus"
     repeat_region   88958..89325
                     /note="L2 repeat: matches 2302..2708 of consensus"
     repeat_region   89944..90083
                     /note="MIR repeat: matches 25..173 of consensus"
     misc_feature    91907..93346
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   94050..94362
                     /note="AluSx repeat: matches 1..303 of consensus"
     repeat_region   94488..94564
                     /note="L2 repeat: matches 2616..2701 of consensus"
     repeat_region   95014..95314
                     /note="AluY repeat: matches 1..301 of consensus"
     repeat_region   95402..95539
                     /note="FLAM_C repeat: matches 1..133 of consensus"
     repeat_region   95604..95637
                     /note="L1MC/D repeat: matches 5604..5637 of consensus"
     repeat_region   95638..95989
                     /note="AluYb8 repeat: matches 1..303 of consensus"
     repeat_region   95992..96126
                     /note="AluJo/FLAM repeat: matches 1..133 of consensus"
     repeat_region   96127..96264
                     /note="L1MC/D repeat: matches 5474..5605 of consensus"
     repeat_region   96404..96429
                     /note="13 copies 2 mer ac 100% conserved"
     repeat_region   96433..96652
                     /note="AluSg/x repeat: matches 91..309 of consensus"
     repeat_region   96659..96885
                     /note="L1ME2 repeat: matches 5594..5826 of consensus"
     repeat_region   96908..97088
                     /note="AluSg repeat: matches 1..179 of consensus"
     repeat_region   97105..97539
                     /note="MLT2D repeat: matches 115..553 of consensus"
     repeat_region   97540..97847
                     /note="AluSx repeat: matches 1..308 of consensus"
     repeat_region   97848..97861
                     /note="MLT2D repeat: matches 103..115 of consensus"
     repeat_region   97876..98166
                     /note="AluSx repeat: matches 2..302 of consensus"
     repeat_region   98350..98472
                     /note="L2 repeat: matches 2570..2710 of consensus"
     repeat_region   98522..98673
                     /note="AluJb repeat: matches 1..124 of consensus"
     repeat_region   98674..98964
                     /note="AluSx repeat: matches 3..296 of consensus"
     repeat_region   98965..99123
                     /note="AluJb repeat: matches 124..292 of consensus"
     repeat_region   99133..99442
                     /note="AluSx repeat: matches 1..310 of consensus"
     misc_feature    complement(99437..99666)
                     /note="Single clone region. sequence from reads from a
                     short insert library derived from a clone PCR.Restriction
                     digest data confirm the assembly."
     repeat_region   99478..99646
                     /note="MLT1E repeat: matches 1..156 of consensus"
     repeat_region   100425..100800
                     /note="MLT1E repeat: matches 186..568 of consensus"
     repeat_region   100844..101151
                     /note="AluSx repeat: matches 1..308 of consensus"
     repeat_region   101270..101563
                     /note="AluSq repeat: matches 1..296 of consensus"
     gene            complement(102090..106022)
                     /gene="dJ1009E24.4"
     mRNA            complement(join(102090..102950,103007..103130,
                     103332..103576,103782..103938,104250..104361,
                     105765..106022))
                     /gene="dJ1009E24.4"
                     /product="dJ1009E24.4 (novel protein)"
                     /note="match: cDNAs: Em:AL080154
                     match: ESTs: Em:AL040853 Em:AA166662 Em:BE368954
                     Em:BE652481"
                     /evidence=not_experimental
     polyA_site      complement(102090)
                     /gene="dJ1009E24.4"
     polyA_signal    complement(102107..102112)
                     /gene="dJ1009E24.4"
     misc_feature    102146..103123
                     /note="CpG island"
                     /evidence=not_experimental
     CDS             complement(join(103466..103576,103782..103938,
                     104250..104361,105765..105873))
                     /gene="dJ1009E24.4"
                     /note="match: proteins: Tr:Q62523 Tr:Q9Y4P9"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.4 (novel protein)"
                     /protein_id="CAC17546.1"
                     /db_xref="GI:11493368"
                     /translation="MASSVDEEALHQLYLWVDNIPLSRPKRNLSRDFSDGVLVAEVIK
                     FYFPKMVEMHNYVPANSLQQKLSNWGHLNRKVLKRLNFSVPDDVMRKIAQCAPGVVEL
                     VLIPLRQRLEERQRRRKQGAGSLQELAPQDGSGYMDVGKVAFSISPSRLELSFCPSSC
                     HL"
     repeat_region   104648..104947
                     /note="AluSx repeat: matches 7..309 of consensus"
     repeat_region   105003..105044
                     /note="21 copies 2 mer ca 92% conserved"
     misc_feature    complement(105830..106256)
                     /note="match: GSS: Em:AQ705250"
     repeat_region   106540..106595
                     /note="MIR repeat: matches 88..147 of consensus"
     repeat_region   106672..106986
                     /note="AluSp repeat: matches 1..313 of consensus"
     repeat_region   107087..107188
                     /note="MIR repeat: matches 76..200 of consensus"
     gene            complement(108438..111069)
                     /gene="CENPB"
     mRNA            complement(108438..111069)
                     /gene="CENPB"
                     /product="dJ1009E24.5 (Centromere protein B (80KDa))"
                     /note="match: cDNAs: Em:U38847 Em:AB018254 Em:AF056312
                     Em:X05299
                     match: ESTs: Em:AA639474 Em:BE269163 Em:F01282"
                     /evidence=not_experimental
     polyA_site      complement(108438)
                     /gene="CENPB"
     polyA_signal    complement(108481..108486)
                     /gene="CENPB"
     misc_feature    108691..109111
                     /note="match: GSS: Em:AZ088838"
     CDS             complement(109270..111069)
                     /gene="CENPB"
                     /note="match: proteins: Tr:Q62523 Tr:Q13537 Tr:Q60976
                     Tr:Q9TT39"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.5 (Centromere protein B (80KDa))"
                     /protein_id="CAC17547.1"
                     /db_xref="GI:11493369"
                     /translation="MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLST
                     ILKNKRAILASERKYGVASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKE
                     KALRIAEELGMDDFTASNGWLDRFRRRHGVVSCSGVARARARNAAPRTPAAPASPAAV
                     PSEGSGGSTTGWRAREEQPPSVAEGYASQDVFSATETSLWYDFLPDQAAGLCGGDGRP
                     RQATQRLSVLLCANADGSEKLPPLVAGKSAKPRAGQAGLPCDYTANSKGGVTTQALAK
                     YLKALDTRMAAESRRVLLLAGRLAAQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVK
                     GHYRQAMLLKAMAALEGQDPSGLQLGLTEALHFVAAAWQAVEPSDIAACFREAGFGGG
                     PNATITTSLKSEGEEEEEEEEEEEEEEGEGEEEEEEGEEEEEEGGEGEELGEEEEVEE
                     EGDVDSDEEEEEDEESSSEGLEAEDWAQGVVEAGGSFGAYGAQEEAQCPTLHFLEGGE
                     DSDSDSEEEDDEEEDDEDEDDDDDEEDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDR
                     VQSHILHLEHDLVHVTRKNHARQAGVRGLGHQS"
     repeat_region   109457..109861
                     /note="135 copies 3 mer tcc 64% conserved"
     repeat_region   109719..109858
                     /note="70 copies 2 mer cc 61% conserved"
     misc_feature    110283..111575
                     /note="CpG island"
                     /evidence=not_experimental
     misc_feature    111113..111331
                     /note="Single clone region. Sequence from reads from a
                     short insert library derived from a single pUC clone.
                     Restriction digest data confirm the assembly."
     repeat_region   111190..111385
                     /note="98 copies 2 mer gg 57% conserved"
     repeat_region   112335..112438
                     /note="L2 repeat: matches 1554..1650 of consensus"
     repeat_region   112439..112750
                     /note="AluSg repeat: matches 1..308 of consensus"
     repeat_region   112751..112840
                     /note="L2 repeat: matches 1443..1554 of consensus"
     repeat_region   112871..113170
                     /note="L1M4 repeat: matches 5051..5389 of consensus"
     repeat_region   113294..113487
                     /note="97 copies 2 mer tt 61% conserved"
     repeat_region   113815..113913
                     /note="AluJo/FRAM repeat: matches 208..306 of consensus"
     repeat_region   113956..114256
                     /note="AluSx repeat: matches 1..300 of consensus"
     repeat_region   114281..114297
                     /note="AluJo repeat: matches 118..134 of consensus"
     repeat_region   114298..114588
                     /note="AluY repeat: matches 5..295 of consensus"
     repeat_region   114589..114765
                     /note="AluJo repeat: matches 133..312 of consensus"
     repeat_region   114782..114865
                     /note="AluSg/x repeat: matches 211..294 of consensus"
     repeat_region   114867..114962
                     /note="L1MB8 repeat: matches 6055..6170 of consensus"
     repeat_region   114963..115085
                     /note="AluSg/x repeat: matches 181..299 of consensus"
     repeat_region   115127..115191
                     /note="Alu repeat: matches 2..66 of consensus"
     repeat_region   115193..115455
                     /note="AluSq repeat: matches 1..264 of consensus"
     repeat_region   115456..115583
                     /note="AluJb repeat: matches 9..138 of consensus"
     repeat_region   115989..116286
                     /note="AluJb repeat: matches 3..303 of consensus"
     misc_feature    116336..117010
                     /note="match: GSS: Em:AQ313572"
     repeat_region   116489..116794
                     /note="AluSx repeat: matches 6..311 of consensus"
     repeat_region   116909..117046
                     /note="AluJo repeat: matches 157..301 of consensus"
     repeat_region   117047..117350
                     /note="AluSg repeat: matches 2..304 of consensus"
     repeat_region   117351..117524
                     /note="AluJo repeat: matches 6..157 of consensus"
     repeat_region   117529..117761
                     /note="MER33 repeat: matches 39..249 of consensus"
     repeat_region   117762..118061
                     /note="AluSx repeat: matches 1..301 of consensus"
     repeat_region   118129..118190
                     /note="31 copies 2 mer tt 75% conserved"
     repeat_region   118194..118970
                     /note="L1PA3 repeat: matches 5369..6146 of consensus"
     repeat_region   118196..118970
                     /note="L1PA2 repeat: matches 5369..6144 of consensus"
     repeat_region   119666..119974
                     /note="AluSg repeat: matches 1..310 of consensus"
     misc_feature    120138..121313
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   120393..120480
                     /note="MIR repeat: matches 55..145 of consensus"
     gene            120896..130701
                     /gene="CDC25B"
     mRNA            join(120896..121317,122250..122335,122998..123049,
                     124874..124915,125048..125084,125329..125451,
                     125557..125679,125840..125974,126308..126388,
                     126510..126686,126867..126962,127496..127558,
                     127694..127792,127990..128123,129155..129266,
                     129407..130701)
                     /gene="CDC25B"
                     /product="dJ1009E24.6.1 (Cell division cycle protein 25B,
                     isoform 1)"
                     /note="match: cDNAs: Em:S93521 Em:S78187 Em:Z68092
                     match: ESTs: Em:BE295234 Em:BE902692 Em:BE799234
                     Em:BE869880"
                     /evidence=not_experimental
     repeat_region   120930..121037
                     /note="54 copies 2 mer cc 66% conserved"
     mRNA            join(121038..121317,122208..122335,122998..123049,
                     124874..124915,125048..125084,125557..125679,
                     125840..125974,126308..126388,126510..126686,
                     126867..126962,127496..127558,127694..127792,
                     127990..128123,129155..129266,129407..129804)
                     /gene="CDC25B"
                     /product="dJ1009E24.6.2 (Cell division cycle protein 25B,
                     isoform 2)"
                     /note="match: cDNAs: Em:Z68092
                     match: ESTs: Em:BF310722 Em:W51993"
                     /evidence=not_experimental
     CDS             join(121118..121317,122208..122335,122998..123049,
                     124874..124915,125048..125084,125557..125679,
                     125840..125974,126308..126388,126510..126686,
                     126867..126962,127496..127558,127694..127792,
                     127990..128123,129155..129266,129407..129547)
                     /gene="CDC25B"
                     /note="match: proteins: Tr:Q13971"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.2 (Cell division cycle protein 25B,
                     isoform 2)"
                     /protein_id="CAC17548.1"
                     /db_xref="GI:11493370"
                     /translation="MEVPQPEPAPGSALSPAGVCGGAQRPGHLPGLLLGSHGLLGSPV
                     RAAASSPVTTLTQTMHDLAGLGSETPKSQVGTLLFRSRSRLTHLSLSRRASESSLSSE
                     SSESSDAGLCMDSPSPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPDGFVFKM
                     PWKPTHPSSTHALAEWASRREAFAQRPSSAPDLMCLSPDRKMEVEELSPLALGRFSLT
                     PAEGDTEEDDGFVDILESDLKDDDAVPPGMESLISAPLVKTLEKEEEKDLVMYSKCQR
                     LFRSPSMPCSVIRPILKRLERPQDRDTPVQNKRRRSVTPPEEQQEAEEPKARVLRSKS
                     LCHDEIENLLDSDHRELIGDYSKAFLLQTVDGKHQDLKYISPETMVALLTGKFSNIVD
                     KFVIVDCRYPYEYEGGHIKTAVNLPLERDAESFLLKSPIAPCSLDKRVILIFHCEFSS
                     ERGPRMCRFIRERDRAVNDYPSLYYPEMYILKGGYKEFFPQHPNFCEPQDYRPMNHEA
                     FKDELKTFRLKTRSWAGERSRRELCSRLQDQ"
     CDS             join(121118..121317,122250..122335,122998..123049,
                     124874..124915,125048..125084,125329..125451,
                     125557..125679,125840..125974,126308..126388,
                     126510..126686,126867..126962,127496..127558,
                     127694..127792,127990..128123,129155..129266,
                     129407..129547)
                     /gene="CDC25B"
                     /note="match: proteins: Sw:P30305 Sw:P30306 Tr:O43551"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.1 (Cell division cycle protein 25B,
                     isoform 1)"
                     /protein_id="CAC17549.1"
                     /db_xref="GI:11493371"
                     /translation="MEVPQPEPAPGSALSPAGVCGGAQRPGHLPGLLLGSHGLLGSPV
                     RAAASSPVTTLTQTMHDLAGLGSRSRLTHLSLSRRASESSLSSESSESSDAGLCMDSP
                     SPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPVRLLGHSPVLRNITNSQAPDG
                     RRKSEAGSGAASSSGEDKENDGFVFKMPWKPTHPSSTHALAEWASRREAFAQRPSSAP
                     DLMCLSPDRKMEVEELSPLALGRFSLTPAEGDTEEDDGFVDILESDLKDDDAVPPGME
                     SLISAPLVKTLEKEEEKDLVMYSKCQRLFRSPSMPCSVIRPILKRLERPQDRDTPVQN
                     KRRRSVTPPEEQQEAEEPKARVLRSKSLCHDEIENLLDSDHRELIGDYSKAFLLQTVD
                     GKHQDLKYISPETMVALLTGKFSNIVDKFVIVDCRYPYEYEGGHIKTAVNLPLERDAE
                     SFLLKSPIAPCSLDKRVILIFHCEFSSERGPRMCRFIRERDRAVNDYPSLYYPEMYIL
                     KGGYKEFFPQHPNFCEPQDYRPMNHEAFKDELKTFRLKTRSWAGERSRRELCSRLQDQ
                     "
     CDS             join(<121283..121317,122208..122335,122998..123049,
                     124874..124915,125048..125084,125329..125679,
                     125840..125974,126510..>126644)
                     /gene="CDC25B"
                     /note="match: proteins: Tr:O43550"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.5 (Cell division cycle protein 25B,
                     isoform 5)"
                     /protein_id="CAC32459.1"
                     /db_xref="GI:13159927"
                     /translation="TQTMHDLAGLGSETPKSQVGTLLFRSRSRLTHLSLSRRASESSL
                     SSESSESSDAGLCMDSPSPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPVRLL
                     GHSPVLRNITNSQAPDGRRKSEAGSGAASSSGEDKENVRFWKAGVGALREEEGACWGG
                     SLACEDPPLPSWLQDGFVFKMPWKPTHPSSTHALAEWASRREAFAQRPSSAPDLMCLS
                     PDRKMEVEELSPLALGRFSLTPAEGDTEEDDGFVDILESDLKDLVMYSKCQRLFRSPS
                     MPCSVIRPILKRLERPQDRDTPVQNKRRR"
     CDS             join(<121283..121317,122208..122335,122998..123049,
                     124874..124915,125048..125084,125329..125451,
                     125557..125679,125840..125974,126308..126388,
                     126510..>126644)
                     /gene="CDC25B"
                     /note="match: proteins: Tr:O43551"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.6 (Cell division cycle protein 25B,
                     isoform 6)"
                     /protein_id="CAC32458.1"
                     /db_xref="GI:13159926"
                     /translation="TQTMHDLAGLGSETPKSQVGTLLFRSRSRLTHLSLSRRASESSL
                     SSESSESSDAGLCMDSPSPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPVRLL
                     GHSPVLRNITNSQAPDGRRKSEAGSGAASSSGEDKENDGFVFKMPWKPTHPSSTHALA
                     EWASRREAFAQRPSSAPDLMCLSPDRKMEVEELSPLALGRFSLTPAEGDTEEDDGFVD
                     ILESDLKDDDAVPPGMESLISAPLVKTLEKEEEKDLVMYSKCQRLFRSPSMPCSVIRP
                     ILKRLERPQDRDTPVQNKRRR"
     repeat_region   122900..122923
                     /note="12 copies 2 mer tt 95% conserved"
     misc_feature    complement(123556..124203)
                     /note="match: GSS: Em:AQ382541"
     misc_feature    complement(123724..124197)
                     /note="match: GSS: Em:AQ762596"
     repeat_region   124071..124112
                     /note="21 copies 2 mer tg 100% conserved"
     misc_feature    124205..124409
                     /gene="CDC25B"
                     /note="match: GSS: Em:AQ198009"
     CDS             join(<126251..126388,126510..>126602)
                     /gene="CDC25B"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.4 (Cell division cycle protein 25B,
                     isoform 4)"
                     /protein_id="CAC17550.1"
                     /db_xref="GI:11493372"
                     /translation="EGFGLCLWVVTLSLLIWPQDDDAVPPGMESLISAPLVKTLEKEE
                     EKDLVMYSKCQRLFRSPSMPCSVIRPILKRLER"
     CDS             join(<127355..127558,127694..127792,127990..>128009)
                     /gene="CDC25B"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.6.3 (Cell division cycle protein 25B,
                     isoform 3)"
                     /protein_id="CAC17551.1"
                     /db_xref="GI:11493373"
                     /translation="LPWEPELNLKAKAGVASDFGNETSFFEAPGLTNLTFSPLPHPFR
                     SLQAFLLQTVDGKHQDLKYISPETMVALLTGKFSNIVDKFVIVDCRYPYEYEGGHIKT
                     AVNLPL"
     misc_feature    complement(127847..127955)
                     /note="match: GSS: Em:AQ548024"
     misc_feature    128101..128439
                     /gene="CDC25B"
                     /note="match: GSS: Em:AQ341279"
     repeat_region   128680..129071
                     /note="L2 repeat: matches 1821..2210 of consensus"
     misc_feature    129548..130698
                     /gene="CDC25B"
                     /note="match: STS: Em:G06651"
     misc_feature    complement(130195..130698)
                     /note="match: STS: Em:G22801"
     polyA_signal    130676..130681
                     /gene="CDC25B"
     polyA_site      130702
     repeat_region   130963..131020
                     /note="L2 repeat: matches 2163..2225 of consensus"
     repeat_region   131021..131316
                     /note="AluSg repeat: matches 1..296 of consensus"
     repeat_region   131317..131331
                     /note="L2 repeat: matches 2225..2238 of consensus"
     repeat_region   131597..131654
                     /note="MIR repeat: matches 117..168 of consensus"
     repeat_region   132314..132361
                     /note="L2 repeat: matches 2647..2695 of consensus"
     repeat_region   133102..133390
                     /note="L1ME3 repeat: matches 5780..6082 of consensus"
     repeat_region   133396..133575
                     /note="AluSq repeat: matches 134..313 of consensus"
     repeat_region   133576..133866
                     /note="AluSg repeat: matches 7..296 of consensus"
     misc_feature    133807..134133
                     /note="Single clone region. Assembly confirmed by
                     restriction digest data."
     repeat_region   133905..134185
                     /note="AluSq repeat: matches 1..292 of consensus"
     misc_feature    134053..134057
                     /note="weak data"
     repeat_region   134186..134207
                     /note="11 copies 2 mer ta 100% conserved"
     repeat_region   134208..134438
                     /note="AluJb repeat: matches 86..311 of consensus"
     repeat_region   134446..134769
                     /note="AluJo repeat: matches 1..312 of consensus"
     repeat_region   134840..134949
                     /note="L1ME1 repeat: matches 6040..6137 of consensus"
     repeat_region   134988..135145
                     /note="L1ME3 repeat: matches 5486..5652 of consensus"
     repeat_region   135147..135453
                     /note="AluSx repeat: matches 2..308 of consensus"
     repeat_region   135454..135756
                     /note="AluY repeat: matches 1..299 of consensus"
     repeat_region   136920..137062
                     /note="MIR repeat: matches 82..225 of consensus"
     repeat_region   137065..137192
                     /note="FLAM_C repeat: matches 1..128 of consensus"
     repeat_region   137197..137503
                     /note="AluY repeat: matches 1..306 of consensus"
     repeat_region   137705..137994
                     /note="AluJo repeat: matches 1..298 of consensus"
     repeat_region   138351..138660
                     /note="AluSg repeat: matches 1..304 of consensus"
     repeat_region   138772..138953
                     /note="FAM repeat: matches 1..175 of consensus"
     repeat_region   139041..139160
                     /note="AluSx repeat: matches 1..121 of consensus"
     repeat_region   139161..139441
                     /note="AluY repeat: matches 2..294 of consensus"
     repeat_region   139442..139621
                     /note="AluSx repeat: matches 121..298 of consensus"
     repeat_region   139917..139978
                     /note="31 copies 2 mer tt 71% conserved"
     repeat_region   140076..140371
                     /note="AluSx repeat: matches 1..298 of consensus"
     repeat_region   140568..140642
                     /note="MIR repeat: matches 185..253 of consensus"
     repeat_region   140981..141128
                     /note="L2 repeat: matches 2347..2497 of consensus"
     repeat_region   141578..141869
                     /note="AluYb8 repeat: matches 1..311 of consensus"
     repeat_region   141934..142386
                     /note="L1ME3A repeat: matches 5705..6159 of consensus"
     repeat_region   142387..142697
                     /note="AluSg repeat: matches 1..307 of consensus"
     repeat_region   142698..142823
                     /note="L1ME3A repeat: matches 5583..5705 of consensus"
     repeat_region   143764..143958
                     /note="AluSq repeat: matches 1..200 of consensus"
     repeat_region   143983..144006
                     /note="12 copies 2 mer aa 100% conserved"
     repeat_region   143984..144007
                     /note="12 copies 2 mer aa 100% conserved"
     repeat_region   144603..144681
                     /note="MER5A repeat: matches 92..171 of consensus"
     misc_feature    144701..145485
                     /note="CpG island"
                     /evidence=not_experimental
     misc_feature    144851..145589
                     /note="match: GSS: Em:AQ939180"
     misc_feature    144851..145539
                     /note="match: GSS: Em:AQ938678"
     gene            145142..149893
                     /gene="dJ1009E24.7"
     mRNA            join(145142..145344,146688..146879,148457..149893)
                     /gene="dJ1009E24.7"
                     /product="dJ1009E24.7.1 (novel protein, isoform 1)"
                     /note="match: cDNAs: Em:AK002030
                     match: ESTs: Em:BE907799 Em:AA247632 Em:R15304"
                     /evidence=not_experimental
     mRNA            join(145190..145440,146688..146757)
                     /gene="dJ1009E24.7"
                     /product="dJ1009E24.7.2 (putative novel protein, isoform
                     2)"
                     /note="match: ESTs: Em:Z44061"
                     /evidence=not_experimental
     repeat_region   145645..145951
                     /note="L1MC4 repeat: matches 7607..7922 of consensus"
     repeat_region   145952..146252
                     /note="AluSx repeat: matches 1..306 of consensus"
     repeat_region   146253..146437
                     /note="L1MC4 repeat: matches 7425..7607 of consensus"
     repeat_region   146441..146484
                     /note="22 copies 2 mer tt 84% conserved"
     CDS             join(146704..146879,148457..148883)
                     /gene="dJ1009E24.7"
                     /note="match: proteins: Tr:Q9R172 Tr:Q13577"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.7.1 (novel protein, isoform 1)"
                     /protein_id="CAC17552.1"
                     /db_xref="GI:11493374"
                     /translation="MVHAFLIHTLRAPNTEDTGLCRVLYSCVFGAEKSPDDPRPHGAE
                     RDRLLRKEQILAVARQVESMCRLQQQASGRPPMDLQPQSSDEQVPLHEAPRGAFRLAA
                     ENPFQEPRTVVWLGVLSLGFALVLDAHENLLLAEGTLRLLTRLLLDHLRLLAPSTSLL
                     LRADRIEGILTRFLPHGQLLFLNDQFVQGLEKEFSAAWPR"
     repeat_region   147088..147335
                     /note="MIR repeat: matches 20..258 of consensus"
     repeat_region   147841..148136
                     /note="AluY repeat: matches 1..293 of consensus"
     repeat_region   148293..148423
                     /note="MIR repeat: matches 71..202 of consensus"
     misc_feature    complement(148381..148792)
                     /note="match: STS: Em:G56165
                     match: GSS: Em:AQ322349"
     misc_feature    148861..149365
                     /gene="dJ1009E24.7"
                     /note="match: GSS: Em:AQ615523"
     repeat_region   149237..149533
                     /note="AluSx repeat: matches 1..298 of consensus"
     polyA_signal    149862..149867
                     /gene="dJ1009E24.7"
     polyA_site      149894
     repeat_region   150133..150201
                     /note="LTR16C repeat: matches 301..366 of consensus"
     misc_feature    complement(150573..151059)
                     /note="match: GSS: Em:AQ889255"
     misc_feature    151073..151469
                     /note="match: GSS: Em:AQ721082"
     repeat_region   151361..151437
                     /note="L2 repeat: matches 2672..2746 of consensus"
     repeat_region   151851..152146
                     /note="AluY repeat: matches 1..299 of consensus"
     repeat_region   152336..152647
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   152791..153673
                     /note="L1MC1 repeat: matches 5461..6327 of consensus"
     repeat_region   153674..153840
                     /note="AluSg/x repeat: matches 132..302 of consensus"
     repeat_region   153841..154141
                     /note="AluSx repeat: matches 1..301 of consensus"
     repeat_region   154142..154495
                     /note="L1MC1 repeat: matches 5059..5403 of consensus"
     repeat_region   154496..154795
                     /note="AluJo repeat: matches 1..301 of consensus"
     repeat_region   154796..154812
                     /note="L1MC1 repeat: matches 5044..5059 of consensus"
     repeat_region   154813..155113
                     /note="AluSp repeat: matches 1..303 of consensus"
     repeat_region   155114..155489
                     /note="L1MC1 repeat: matches 4678..5044 of consensus"
     repeat_region   155490..155790
                     /note="AluSx repeat: matches 1..303 of consensus"
     repeat_region   155791..155812
                     /note="L1MC1 repeat: matches 4766..4678 of consensus"
     repeat_region   155813..156125
                     /note="AluYa8 repeat: matches 1..308 of consensus"
     misc_feature    complement(155959..156295)
                     /note="match: GSS: Em:AQ629218"
     repeat_region   156126..156197
                     /note="L1MC1 repeat: matches 4698..4767 of consensus"
     misc_feature    complement(156184..156300)
                     /note="match: STS: Em:HSB022XB1"
     repeat_region   156198..156221
                     /note="12 copies 2 mer at 95% conserved"
     misc_feature    complement(156205..156336)
                     /note="match: GSS: Em:AZ016984"
     repeat_region   156266..156511
                     /note="AluJo repeat: matches 35..304 of consensus"
     repeat_region   156543..156841
                     /note="AluSx repeat: matches 1..301 of consensus"
     repeat_region   156842..157153
                     /note="AluY repeat: matches 1..311 of consensus"
     repeat_region   157201..157496
                     /note="AluSx repeat: matches 3..297 of consensus"
     repeat_region   157503..157799
                     /note="AluJb repeat: matches 1..299 of consensus"
     repeat_region   157803..157941
                     /note="AluJo/FLAM repeat: matches 1..133 of consensus"
     repeat_region   157944..158484
                     /note="L1 repeat: matches 3689..4328 of consensus"
     repeat_region   158591..158696
                     /note="AluSg/x repeat: matches 68..172 of consensus"
     repeat_region   158699..158934
                     /note="AluSx repeat: matches 7..242 of consensus"
     repeat_region   159172..159480
                     /note="AluSp repeat: matches 1..311 of consensus"
     repeat_region   159487..159793
                     /note="L1 repeat: matches 2839..3131 of consensus"
     repeat_region   159796..160091
                     /note="AluSq repeat: matches 1..295 of consensus"
     repeat_region   160200..160507
                     /note="AluSp repeat: matches 1..309 of consensus"
     misc_feature    complement(160512..160880)
                     /note="match: GSS: Em:AQ344470"
     repeat_region   160575..160838
                     /note="AluSc repeat: matches 36..299 of consensus"
     repeat_region   161648..161770
                     /note="FLAM_C repeat: matches 1..124 of consensus"
     repeat_region   161787..161973
                     /note="AluSg1 repeat: matches 161..298 of consensus"
     repeat_region   161974..162276
                     /note="AluSp repeat: matches 1..313 of consensus"
     repeat_region   162277..162422
                     /note="AluSg1 repeat: matches 1..161 of consensus"
     repeat_region   162432..162546
                     /note="FLAM_A repeat: matches 23..142 of consensus"
     repeat_region   162591..162887
                     /note="AluSx repeat: matches 1..296 of consensus"
     repeat_region   162895..162999
                     /note="L1MB4 repeat: matches 6081..6185 of consensus"
     repeat_region   163000..163308
                     /note="AluSq repeat: matches 1..309 of consensus"
     repeat_region   163309..163433
                     /note="L1MB4 repeat: matches 5957..6082 of consensus"
     repeat_region   163770..163937
                     /note="L2 repeat: matches 2524..2709 of consensus"
     repeat_region   164193..164495
                     /note="AluSg repeat: matches 1..307 of consensus"
     repeat_region   164625..164879
                     /note="AluSx repeat: matches 1..256 of consensus"
     repeat_region   164980..165275
                     /note="AluSx repeat: matches 1..294 of consensus"
     repeat_region   165313..165438
                     /note="AluJo repeat: matches 23..142 of consensus"
     repeat_region   165439..165735
                     /note="AluY repeat: matches 1..298 of consensus"
     repeat_region   165736..165804
                     /note="AluJo repeat: matches 142..212 of consensus"
     repeat_region   165844..166097
                     /note="AluJb repeat: matches 1..248 of consensus"
     repeat_region   166100..166187
                     /note="L1ME3 repeat: matches 6019..6114 of consensus"
     repeat_region   166188..166494
                     /note="AluSx repeat: matches 16..307 of consensus"
     repeat_region   166500..166800
                     /note="AluSc repeat: matches 1..301 of consensus"
     repeat_region   166801..166855
                     /note="L1ME3 repeat: matches 6118..6154 of consensus"
     repeat_region   167649..167811
                     /note="AluSg/x repeat: matches 124..288 of consensus"
     repeat_region   167930..168100
                     /note="FLAM_A repeat: matches 6..142 of consensus"
     repeat_region   168530..168584
                     /note="L1MC4 repeat: matches 7924..7977 of consensus"
     repeat_region   168537..168631
                     /note="L1MD3 repeat: matches 7647..7734 of consensus"
     repeat_region   168632..168936
                     /note="AluSq repeat: matches 1..305 of consensus"
     repeat_region   168937..169127
                     /note="L1MD3 repeat: matches 7471..7647 of consensus"
     repeat_region   169128..169175
                     /note="MER3 repeat: matches 1..48 of consensus"
     repeat_region   169176..169485
                     /note="AluSp repeat: matches 1..311 of consensus"
     repeat_region   169486..169550
                     /note="MER3 repeat: matches 48..112 of consensus"
     repeat_region   169551..169862
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   169863..169959
                     /note="MER3 repeat: matches 112..209 of consensus"
     repeat_region   169960..170041
                     /note="L1MD3 repeat: matches 7307..7471 of consensus"
     repeat_region   170422..170732
                     /note="AluSx repeat: matches 1..311 of consensus"
     repeat_region   171023..171056
                     /note="MIR repeat: matches 221..254 of consensus"
     misc_feature    171154..171737
                     /note="match: GSS: Em:AQ696969"
     misc_feature    171165..171970
                     /note="CpG island"
                     /evidence=not_experimental
     gene            171429..182395
                     /gene="bA119B16.1"
     mRNA            join(171429..171489,179144..179327,182221..>182395)
                     /gene="bA119B16.1"
                     /product="dJ1009E24.8 (KIAA1271)"
                     /note="match: cDNAs: Em:AK023799 Em:AB033097
                     match: ESTs: Em:AU140492 Em:BF204863"
                     /evidence=not_experimental
     repeat_region   172691..172984
                     /note="AluSx repeat: matches 1..295 of consensus"
     repeat_region   173297..173793
                     /note="L2 repeat: matches 1736..2191 of consensus"
     repeat_region   173794..174079
                     /note="AluSq repeat: matches 12..302 of consensus"
     repeat_region   174080..174463
                     /note="L2 repeat: matches 2191..2687 of consensus"
     repeat_region   174583..174889
                     /note="AluY repeat: matches 1..310 of consensus"
     repeat_region   174902..175190
                     /note="AluSp repeat: matches 2..298 of consensus"
     repeat_region   175191..175491
                     /note="AluY repeat: matches 1..301 of consensus"
     repeat_region   175519..175624
                     /note="AluSx repeat: matches 30..135 of consensus"
     repeat_region   175625..175942
                     /note="AluSp repeat: matches 1..312 of consensus"
     repeat_region   175943..176111
                     /note="AluSx repeat: matches 135..304 of consensus"
     repeat_region   176260..176560
                     /note="AluSx repeat: matches 1..299 of consensus"
     repeat_region   176577..176879
                     /note="AluY repeat: matches 1..303 of consensus"
     repeat_region   177277..177385
                     /note="FLAM_A repeat: matches 30..138 of consensus"
     repeat_region   177421..177722
                     /note="AluSg1 repeat: matches 3..302 of consensus"
     misc_feature    177660..178067
                     /gene="bA119B16.1"
                     /note="CpG island"
                     /evidence=not_experimental
     repeat_region   177791..178089
                     /note="AluY repeat: matches 1..299 of consensus"
     repeat_region   178098..178374
                     /note="AluJo repeat: matches 1..268 of consensus"
     repeat_region   178375..178683
                     /note="AluSg repeat: matches 1..297 of consensus"
     repeat_region   178684..178725
                     /note="AluJo repeat: matches 268..310 of consensus"
     repeat_region   178727..179031
                     /note="AluSx repeat: matches 1..303 of consensus"
     misc_feature    complement(178764..178905)
                     /note="match: GSS: Em:AQ715943"
     CDS             join(179211..179327,182221..>182395)
                     /gene="bA119B16.1"
                     /note="Continues in Em:AL353194 as bA119B16.1
                     match: proteins: Tr:Q9ULE9"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ1009E24.8 (KIAA1271)"
                     /protein_id="CAC17553.1"
                     /db_xref="GI:11493375"
                     /translation="MPFAEDKTYKYICRNFSNFCNVDVVEILPYLPCLTARDQDRLRA
                     TCTLSGNRDTLWHLFNTLQRRPGWVEYFIAALRGCELVDLADEVASVYQSYQP"
     repeat_region   179473..179770
                     /note="AluSc repeat: matches 5..302 of consensus"
     repeat_region   180351..180582
                     /note="AluSg/x repeat: matches 80..306 of consensus"
     repeat_region   180583..180643
                     /note="Alu repeat: matches 236..296 of consensus"
     repeat_region   180644..180938
                     /note="AluY repeat: matches 2..296 of consensus"
     repeat_region   180940..181228
                     /note="AluSx repeat: matches 1..302 of consensus"
     misc_feature    181311..182190
                     /gene="bA119B16.1"
                     /note="match: GSS: Em:AQ739964"
     repeat_region   181338..181651
                     /note="AluSx repeat: matches 1..312 of consensus"
     repeat_region   181829..182002
                     /note="MIR repeat: matches 29..234 of consensus"
     repeat_region   182506..182728
                     /note="MIR repeat: matches 30..250 of consensus"
     repeat_region   182984..183283
                     /note="AluY repeat: matches 1..310 of consensus"
     repeat_region   183375..183675
                     /note="AluJo repeat: matches 1..311 of consensus"
     repeat_region   183812..184114
                     /note="AluSg repeat: matches 1..298 of consensus"
     repeat_region   184411..184702
                     /note="AluY repeat: matches 1..291 of consensus"
     repeat_region   184819..184938
                     /note="AluSg/x repeat: matches 176..295 of consensus"
     repeat_region   184951..185260
                     /note="AluSp repeat: matches 1..310 of consensus"
     repeat_region   185261..185560
                     /note="AluSg repeat: matches 1..300 of consensus"
     repeat_region   185570..185820
                     /note="AluY repeat: matches 59..308 of consensus"
BASE COUNT    43611 a  49688 c  48794 g  43727 t
ORIGIN      
        1 gatccacagg atcgggaggg gaggagtcag gagacactgc cgaagaatgg gacttggagt
       61 tggggaaatg cggtgacctc ccccagttcc cctgcctgct gccctccttt gttgggcatc
      121 tggtcgaccc tcttgccccc acctgcccta gatccttgaa atattttcct cagacttcta
      181 gaccccacat acctcccacc tgtccttcag tgattgatgc tcaccccctg cctccagaga
      241 aaacagaatc gccacctgcc cacgctgctt ccaccctccc tgccttctcc acacccactc
      301 cggtaatgat tccatcttca ggctccatct caacaggatc tttcccacac ggatggatca
      361 atcataagtc aatctgtctt ctttaaagaa aatccttaac ccaacctcac cttggcctca
      421 ttacctccag accacccgct aatgatggct gcttcccccc tcccaggcat tccaccacct
      481 gccccagctc tgccccctac ccctgcccca cacacacccg ccaccctagg aggtaggtga
      541 tgtgaccacc ccattgaaag ggtagggacg tcgggaaaat atggttgggc acagtggaac
      601 tagagtttgt tccctgtcca tccgactcca cgagggagaa taaaatacgt gtcaagtgct
      661 cagaacagcg cctgcggtca agcactcagt aggtgatata tactgataac ataatctggg
      721 tggttttaag agcctgcgct ccagcccgga cacccacccc acccagccca aggcaccttc
      781 ctgagcacag gtctgcctgt cccttcccca gctcaaaatc tttggctctg gatggtccag
      841 gacactgagc accaaaatgg ccttctgtga tctggccctc ctgggactct ccaaattcat
      901 tcccccctgc tcccctccct gtagatggag accttccagg caggtcagac aaactgctgc
      961 ccccaagtgt agccactgca tctttttctt ttttcttttc tttctttttt tttttctttt
     1021 tctttttttt ctcttttttt gagacggagt ctcactcttt gcccaggccg gagtggtgca
     1081 gtggtgtgat cttggctcac cacaacctct acctccgggg ttcaagcgat tctcccgtct
     1141 cagcccccta agtagctggg attacaggcg ccgccaccac gcctggctaa ttttttgtat
     1201 ttttagtaga gacggggttt caccatgttg gcaaggctgg tctcaaactc ctgacctcag
     1261 gtgagccgcc cgcctcggcc tcccaaagcc accgcatctt ggtccctgcc attcccttag
     1321 cctggggtgc cggctcatct tttccctcta ggatttcttt agactcagca tatcttgcaa
     1381 atgtccacta ggtggtgctc actcatcgcc agcagggagc taacaagccg ctcctggggt
     1441 tgggagggcg gaggtgcccc acagcggggc tgacagcctc agcggtcctc ttcagcctcc
     1501 agggagccaa ccacaggcct gcgtgactct ccctgtcatc tgcaccctct ctggggtcct
     1561 ctgcccatcc agccacccgc acagatctgt gtcagtccct gccccccaac actgatcccc
     1621 tcctcccagc cctaccccag cctggcactc actggttctt ctccagctca agcaggagct
     1681 cctggccttc agcctccagg gccaccagcc ccatgtctgg cttcgagacc tgggcaagaa
     1741 aatgtgtgga gctgagatgg tggcctccag gcctcctgcc tgccagggag taggtggcct
     1801 gtggagccgg ctggggagga agttcttggg gagaacgtgg gctggggagt cagcaggacc
     1861 ccccacatac tatggagggc gtggaggagg tgagaacata caaagatgtt cccaaactca
     1921 ggatgtttgc agtcctgaca acagccactt ggaagggcgt tggcacagcc tgccaggcac
     1981 accagcatcc tccctagaga ccagaggtcc cagaaaggtg cccctcccct ggcccgccct
     2041 cttctttcat gcccagaagg ggcatcaaaa gcaggggaag acagaggggt gctgaggaca
     2101 ttatgggggc atcgggtagc catggtcagg gcctcctcag agcctctgct acctgaggct
     2161 tgtttccaaa tgagctgctg ctcattccct atagaattca aatttgactc ctccacttcc
     2221 aattttggca aactgctccc tcttccaaag ttttcctggg cctccagcag cccccgtccc
     2281 tccggctccg acacctgctt cactggaccc acgaagtaaa catggacgcc attccagcca
     2341 agagagcaca ctggctctca gctaggtgtc aggaggctgg cttggacggc cagccctctc
     2401 tccttccccc accctcttgg cgtctcccac cctgtgggaa caccccactt cccccttgtc
     2461 cactcagcct ggctgggggc ccagagttgg agccggccca ggagcttcct gggaggctgc
     2521 tgcgccttcg gaatgtttaa cccccgactc cttttctcca aaaatgcact ggcctggggc
     2581 cctgtccaag ggtctcagag tctttggagg gagttcttcc ttcgcaagtg gggagcagat
     2641 ggtccttgcc tccctggcca caggccccac aaggcctcca gcatgagctc atgaggctgg
     2701 aatgccactt gctttattgg ggaaaggtct gcaccgggaa aaaggccata ctcgaggtcc
     2761 ctgttcctct gcagcccctg ctatctttac tcttgccctc ctggtaccct gccccttgat
     2821 atatacccct catcttgaaa tgtgagtgtt tcctgccttt tggaggggac acctagcctc
     2881 tactctttct tctgtaccat cttggcaggc ttcctggggg caggggccca ccggtggggg
     2941 aagcagagcc cctttggggc tctcctcttg gtcacagccc aggccagaca gacagggagg
     3001 cccagaggca gagtgacccc agtgtgtgtc cagccttccc ctcctgggga tggggagggc
     3061 aatctcaaag ctcaggccag tgccgtgctt gaccagtgga atgggggcct tatgggccta
     3121 ggggatccca gtgagggccc tgggttggga gctgctgggt ctctgggggc ctctcagcct
     3181 tcatggcaat gctcccctgc cttccctctt gctggatttg gacagtaggg ctgaaaattc
     3241 caaacaaaga gggctctcta ggaggggcag gggtgtagcc aatggtttaa aatcgttcag
     3301 accttagtgg gtctcaggct cccagcctaa agagctgtgt gaccatggac aatttcccca
     3361 agctctctgg gcttccgttt gcccctctgt aaaatgagca tatcaaggct actgccctct
     3421 tagtttgcag cacagatatt atggcacaaa cagatggggc atggttattc tggaagcgtg
     3481 tgaagagcgg gattgggaag aggctggggc agagcgtcct gcagaagaag cacatggggt
     3541 ggtcttacat ctgggggaca tcaggagagt gaccactgcc cccccgatac cagaagtgga
     3601 ttccacagga gccagtgagg ctgaaggttc aggccttcgt ggcagggccc tgagagggac
     3661 agcagtgtgt ccacagggtc acatgttctg gtcaactttg caaaaggttt tctttttggt
     3721 gctttttttt tttttttttt agaggctcct gaaaagcttc aggacccaca aactctggac
     3781 ccatttctgc ctggtggggg tgggggtggc ccagatcatc cagggaggga gggaaagagg
     3841 gaggtggggt ggagaaagct gaaatgactt ccatgtgtgc gggctcacga gatccagatg
     3901 tccaaacccc agtgccttct tctgcccact tgaggggcag gggaggcagg ggcctatagg
     3961 agtagtgact tggtggttct ggggacccca gcaaaactag aagctgtaat gtagggagag
     4021 acaaaagggc tgggaggttc agggcccctg tggagggcgg ggagacatgg cactgaccgg
     4081 ctcctccagg ctgacggtgc gccagggttg tccatccagg acccagtgcg gggtgactgg
     4141 ctgcccaggg atatgtcctg gagtaaagac agagcacagg gtgaggggga cctgaggaac
     4201 acaggggcat gggacaaagc agagggaggg gggtagagga catccccagg gaggcactgg
     4261 aggccttttg gggcagactt caccttcaac acgcgtgggc tcagcctgga gaaggaggga
     4321 cgcccgtggg catccttgga tctgaggagc tatcaaggag gaggaaaaga gaaggctgga
     4381 aagggacagc tcagctgggg acacgggagt cccctgacct ttgtcggggg gcaggcttgg
     4441 gctcggcgat cacaaggaag aggccaaggc cgccagtgca gaggggaagg aaagagcgcg
     4501 gcagccttag ggatttttag atgggcagca gatgccttta gggtgagaga tgtacgaaga
     4561 gaggacactt gtgccccccc catcatctga gaaaaacaac agccagatgt tgccttgcga
     4621 ggtccacctt gcccagagct ccctcgggga ctctgtcctg gtggcagggt tttggtaccc
     4681 tggcccagaa ggcccctcct catctcttca aggggagggg acgcttctgg acggagcctt
     4741 ggtgctccct ggccgggtgt gcctaagggg gctctaggag gaatcccaga gccaagcatt
     4801 actcagaggg cgcctggaat gttcccctgg aatgctccca gccctccact ggccccaacc
     4861 actctcacag gcccgccctg caggagccag gccccaggca cccagagcct gcagcagccc
     4921 tccttccccc ggtacccagt cccagctccc agaacagaca gcctcccccc tccacgcagc
     4981 cctggcctca gtcctgctgg gctgatggct gcctgtggaa gtgactcagc tcctgctagg
     5041 ccaccccaac tccttttttc tcctccacct tctctcccag actacaaaca tcaaagaccc
     5101 ttcctccaag aagccctcct tgattggatg agtgaattgc catcaggcag atgagggccg
     5161 agaggagtct gccaccttgg aaaggaggct agaggggcca gtgcagggag ggctctgagt
     5221 ggatgtgggg gaggggaagg aggggaggtc tctcagccca gagagcactt aactgagagt
     5281 agagaaccaa gctttgctgc tcctaggcct ctaagggttt ggggaagagg tagggtgggc
     5341 ccgggcacag gtgtggtgtg ggtgcagtgt ggtgtgtggg tgctgtccac atggccttgc
     5401 gcgcacgtgc tggccacggg caccctgacc ccaatgaggg agagaggggc agagctggag
     5461 ctggagctgg agctccggtg accgggtgaa tgggggtgga acccgaggga gccaggctgg
     5521 tattgggcac atagacgccc ctctcccagg ggtcccatca cctcccctga ccccaggata
     5581 gggctcagag gggagggagc agtggaccgc ctggggccct cccctggggc cagaacagac
     5641 caggcccctg tacctgtttg gtccccacac agtgctgtgg aagccaccgc ccagtctgca
     5701 tagcacagcc cagccccgca tgccccctcc ctggttgccc tccctgttcc cggccaggca
     5761 cttgctgtgc aggactggct aatcctccca cccgcttgca gaggttgttc cagccccatc
     5821 ttaacatctt tgtttggagg ggttaccccg aggagacagc tgcagtcttc ccagagcact
     5881 gctaaacaga caccttctat ctggagaggc ccttctctat ctcaccaaac aaggcaacaa
     5941 tataaacaac atacacactg ccctgctgcc ctgggaggag ggacgagggg tgagcagggt
     6001 ggaggccaca gctagttctg cagcctgaga gcaaagcagg gactctgggg gactcttggg
     6061 catgggggct tcctagagga cggagccccg ctgagtccta aggggtggag gagcaggagc
     6121 gggtcacacg gtggcctgcg gatggaagct ggttgtgaga gcgagaatcc aggaagaggg
     6181 ggctacggct atgggctggg ggctgggctg cccccgaggg ggagggagcc atgccctctg
     6241 ctttgccagc ggagtggcag ccgggcagtg tgggcaagtc cgggcccggg gccagcccaa
     6301 gcacacttga gcgtccctgg gcaggtccca cggagacccc cccaaagagt ccccacgccc
     6361 tgacctactg gccgtatggt gccggggccg tgagaccctc cgcgcgctga cccgagctct
     6421 gagcagaacc catccccgcc accaccaccg cgcctagcct gcccctcagg gcgcaccccg
     6481 cccgcgtcct caccttgaag caccccggcg cctggcactg gccagagcag cagcagtagt
     6541 agcagcagca gcaacggggt cccccgagct ctccggggcc tccagcccat agctgtgagc
     6601 tcctcggcct ctaggcagcg gctcgcaact ccggctccgc ccaggctgga ttgcggccga
     6661 cccgtgcccg gtgcagcctc aggccgccgc cttcggacct tcccgccccc acctcccacc
     6721 gcccgccctc gctcccgcct cccctccccg ccaaccccgc tcggagcctg gccaggggcc
     6781 ccgacggcgc gcgccatggg ggagccgggt cgccactccc ggaccgccgc ccctcgaggg
     6841 ggtggagctg ggcggaggag ggaatccgtg cggcccctcg gatgaccggc ccgagccgtc
     6901 cctccccgtc ggtctcagag ggcctctact cctgagagga ggagagaacc gctgggaagg
     6961 ttcttggagg accgcggcgt ggtgggatga ggcggtgggc aaaggccgcc tctcgctgct
     7021 gaagttggcc ccaggagcgc gatcttccgt ggtctcctgg ggccgatctc tgtcccctcc
     7081 ttgctacccg tcctgccccg agggtgccct ggcggaggtt gagtcgggtc atccacctgc
     7141 actgggtgcc cccaaggata ggaaggttca ggcaaccggc tgccgctgtc ttgggggctt
     7201 cattgctggg caaaggcgat gcagcagacg gagacaacct ttcttccctg gcggtggcca
     7261 gagggcagaa ttgcataaaa gctgcagact cccaggcctg ggagaccctt tcggcctcag
     7321 taacatctgt ttcatgtttt aaacttttgt tttcctactc ggtgcaaatt tggatgagat
     7381 gttaactttt tttttttttt tttttttgag atggagtctc cctctgtcgc caggctggag
     7441 tgcagcggcg cgatcttggc tcactgcaac ctccgactcc ctggttcaag cgattctcct
     7501 gcctcagcct cccgagtagc tgggactaca ggcgcgcgct accaccccca gctaactttt
     7561 gtatttttag cagagacgag gtttcaccat tttggccagg atggtctcaa tctcctgatc
     7621 tcgtgatcca cccgcctcgg cctctcaaag cgctgggatt acaggcatga gccaccgcgc
     7681 ccggccggag atgttaactt ttaagcaaat cttttttttt tttttttttt tttgagacag
     7741 agtttctctc ttgttaccca gactggagtg caatggcatg atctgggctc actgcaacct
     7801 ctgcctccca gattcaagtg attcttctgc ctcagcctcc cgagtagctg ggattacagg
     7861 cattcgccac cacgcctggc taattttgta tttttagtag agatggggtt tctccatgtt
     7921 ggtcaggctg gtctcgaact cccgacctca ggtgatctgc ccgcctcggc ctcccaaagc
     7981 gctggaatta caggcgtgag acaccgcacc cagcctactt ttaagtaaat ctatttgttt
     8041 ttgagaattt ggaatgtagt aatttggtta gtgaaagttc gagcagtgag agaaacctac
     8101 attcacatat ctcaaaatca aaaagtacag aaagcatagg gaaaagtctc cgtgctctta
     8161 gccctcctca ccaacaggaa accaatatga ttagtttctt tcataggctt ttagattatt
     8221 ttttcacact caagacaata cagacatatt tttttctctt attaacgttt ttctgcactt
     8281 tgattttctt tttttttttt ttggtcgctt aatacacctt agatatcagt gcgtttagag
     8341 ggtccttgtt gttcttatga ttattattta gagacagggt ctcactctgt cacccacgct
     8401 agaggacagt ggcctgatca tgcctcattg cagccttgaa atcctgggct caaggtatcc
     8461 tcccacctca gcctcctgag tagctggaac tacaggcaca cggcaccagg cccagctaaa
     8521 atttttaatt tttctgtaga caggggggtc tcactttgtt tcccaggctg gtctcaaact
     8581 cctggtcttg gccaggcgca gtgtctcatg cctgtaatcc cagcactttg ggaggccgag
     8641 gcgggcagat cactggaggt caggagttca agaccagtct ggccaacatg gtgaaacccc
     8701 atctctacta aaaatacaaa aattagccgg gcatggtggt gagtgcctgt agttccagct
     8761 acttgggagg ctgaggcagg aaaatcgctt gaactcagaa ggtggaggtt gcagcgagcc
     8821 gagatcatgc cattgcactc cagcctgggc aacaagagcg aaactccgtc tcaaaaaata
     8881 aaaataaaaa taaaaagaac tcctgatctt aagtgatcct cctgcctcag cttctcaaat
     8941 cgctggaatt acaggagtga gtcaccacag ctgtccagct acgagattat tacttattat
     9001 tactactttg gattttcaaa tcaacttcat taaggtataa tttacacaca ataaaatgca
     9061 cttattttaa gtggccagta agatgagttt cgataagtgt atataactac ataagcatca
     9121 ctataatgca gacacattcc ctcactcaca gaaagagccc tgtgcccttc cagccaaact
     9181 tgcccactcc caaccccaga cagccactga tctgttgttc tctgtctata gataagtttt
     9241 gcctgttcta gaatttcata taaatggaat catgcggcat gcactcttct gtgtctggct
     9301 tccttccctc tttccgatgt ttttgagatt catttacact attttgcata tcaatagttt
     9361 gttccttcgt attgctgaat agtgttcggt ggtttgaggg aaccacagtt tctctactca
     9421 ccagtgcacc atagggttat tttccagtta ggggctctta taattggaac tatatttgca
     9481 cagagagaga gagaggaaga aagagggaga gagatattta ttatagcaat tggctcacgt
     9541 gattatggag gccaaaaagt tcccgaatct gccatctgca agctggagaa cgaggaaagc
     9601 cagtggtgtg attcagtttg agttcaaagg cctgagaacc aggagcacca gtatggaggt
     9661 ggctcgagct cagaacaagt tggggacagg aaagcagagc agcgccccag agcagcccct
     9721 cagcgacacc tcttcagtaa agcaaggctg aacacagagg ggctggcttc agtgtggatg
     9781 tcaggtacag aaggcagctc gaggagctac tctggcgttc ttgcttactg gtattcttac
     9841 ctcgaactgg ccaactccta cttaaactgc aggccatggc tttaatgtcc tgtcattcag
     9901 aggctgtccc ttacccaaag ccaggttagc atcccctgac tgacacttct ccctgcaaca
     9961 cgtttcagaa ggccctgtag tcgtccactt ccctgtctct ctccccaagc tcctgagctc
    10021 catgtggtct gggaatatgt gtgttgctca cttcctagca cagtcagtgc taataactga
    10081 ctgtagaggg gacacagtcg aaaagccaca tggggatcag agtcatcctt acacagttga
    10141 cacctcccaa acccagatga gctgtgtcca agtgcaggtc agaggaattt tctgccgaag
    10201 tctctgagaa agggtttatt tacattttga ggttgcaggg gaggagatga ggccatcaaa
    10261 ccaaagctga ggaagaggga tcctaggatg caccgagcag ctccgggggc gcctgacagc
    10321 acctgggaaa gatggcttct ccactggctt gttggcgtca ccctccagag gggcatcagg
    10381 aaatgtcctg ggaaccaggc aaaccagtga gcattaaccc ttaggagtgc ttggcatggg
    10441 tgacacccac catctgtaaa cacgacttct cccaaggagt gacgcagaac aggatgtctg
    10501 agggaggcac tccgactcca gccttcagag atcgccaggg tggcacctgg tgacgacagg
    10561 ctgatgcttg ggtgccccgg aaaaagtcat gtgtgtgaat gggggcccca aagccaacgc
    10621 ttcatccctg acagcctggt gcatttagag gggaactttt tgtcccttgg caaggtgggt
    10681 ggaatttcag gttcataggg caagggtatt ttagctttaa tagatattgt caaacagttt
    10741 tccaaagtca ttgtacacac tctgtgattc tacttaagta aagtttaaaa acaggcaaat
    10801 caaatctatg gtgttcgaag tcaagacagt agttaccctt gtgggggctg caactggtac
    10861 agagtgtaag gggggactgt aggatggtct atttcttgat ctgggtgtgt tcgcttttgg
    10921 aaaagtcctt gagttgcatt tataatgtgt gaacttttct gtatgttaca ctttaattga
    10981 atgtacaaaa agtctcagga ggcctcagac cactggaagc ggacacaact aacccctctg
    11041 agagcctcca atccaagatg gacatatgtc cccttggaag tatgcagaag caggtgaaga
    11101 ctcctaagcc ggatattccc aaatcccccc agtagctgca gcttcagcag ctgcttatgg
    11161 tcctccctac accctctctt ccccagacag cccccaaaca tctggctgca tttgacttgc
    11221 tctctccctg tcccacctct ggatttagtc catgttctcc accctcccca ctgtcagcaa
    11281 tgtagacaag acaaacgctt agttcacgtg cccacctact gcgtgccatg cacggggctg
    11341 gtcattgtgg gtggcaaatg tgagcaacac acgaagcctc aaggagcaga aagggacaca
    11401 aatcacttca gcgtaaggta atttgtgata aatgtcatgt aacttgcagc ccctggcccc
    11461 ctcctacaga tggtgtctaa gaataaaccc cactaacatg tgactcctct gttctagccc
    11521 agctgtttgg gttgcaagaa agagactcac tccagttgcg tcataggatg gagttttatt
    11581 gggaggacat tctggacggg ctcccagcaa gagtctggca atggagcatg aaaatgaatg
    11641 agcctggaat gagggagggt gcggcctcgg ctacacaaag ttcacccgcc cccaactgcc
    11701 tcccagcgtg ttagctcctg tggcaactcc ccacttctct ctctgttttt caacccaaat
    11761 cctagagcag agggctcttc gctcctcaca ccatctccac caggctgcag ggggagtcac
    11821 tagcccactc aagagctctc tcctgttggt ccctgattcg caggggcacg tcattccttc
    11881 ttggtaactc attctcttta acagcccgac acagggtctc caggaaaagg acaggagctc
    11941 acacagtcat gaacagagat gcatctggag caggaataag gatgctgaca acatctgtgt
    12001 ctgccctcca ctcttgtaag ggccagtaag gtagtaccca gtccctgggg gtaggggggg
    12061 tggtggtggg aatgagggca tctcttcttt caggggccaa atctggcaca aggatgtctg
    12121 cttaggcaac tccactgcct ggcatgctct ctccctagga aaacaccaaa ccttttttat
    12181 ttcctcagtc ttagtgtgag agtgatcacc tcttccagga agcccacaac cagaatggcc
    12241 aggggcacac cctggccctc agtagatggg cactgagaca aaacatggcc tgctggtggc
    12301 attcccaaca atgtcaaaag tctcagggat gagctcacag ttgggagtcc tttggaagtc
    12361 caattccaaa tgtcctctgg actcaaaata aaaactacat ttcccagtct cccttgtagc
    12421 ttgcagtagc catgtgacaa tactccagcc aatgggaagt gagtggaaaa gtcctgtgca
    12481 gcttctgagt tgcgtccacc atgggcatgg gtgtgcttct acctccactc ttccttcttc
    12541 tggctggagg gcatgtggac aaagcggggc catggtgagc ctcacaatca aaggcatcat
    12601 tttagggata gagaaaagga agatagaagg atcctgagcc ccaacatcat aaaccatcgt
    12661 aacagccaga ctagcgtggg agagaaaaat aaaattctat catgttgaag tcactgtatt
    12721 tggtctcttg ttagagcagg ctgactgata ctctgttgaa tacagaatgc cttggtgagc
    12781 ctctgaggat gcaggatgct gcaatccaac caaagtccag ggtgactttc gaggagaagg
    12841 atgtgaaagg gcagggcttc ctttgaggga gaaatggaga gaagggagga gatccagaga
    12901 agagagagcg agatcagctc agtgctcttc agtgtcctcc agggtcaaca ccctcctggg
    12961 cagacctctc accacccatt tttggggggg caagaggctt ggggccaggt cataaaaagt
    13021 cagatgtcac agagctgttt tcgtagaggc gggcaggact caacactgcc tcattcacat
    13081 tcataggctg gagtcatcac agattctggc cactttctcc tccggagggc aggcaacacc
    13141 actggtcagc ccaggggtgg ggcacaggtt gaggtctcac atgtggctgc atcaggatca
    13201 atgagcttcc cacagagaaa accctggtca caggaggcct ctgggagcca gggcaggccc
    13261 ccaggccccc agcctcaggt ggggtcccaa gaggattggt gggagtgtgc atggctggaa
    13321 gtgtgacttg gagccctggc ctaccaggtg aatcagctgc agaagggaca caccacaggg
    13381 gaggagaaac tgactttcac ttctgcccag gtgacgggca ggcttgggag tggggaagag
    13441 gcctcagaac aaaggtgggg gagtcagaac atacgtgtct tggaaatgga ggggatgctg
    13501 ccaagtccag aaagtctccc ctagtcctct aagccagccg ccagctgaca cagggattcc
    13561 ctctgccccg agcagaacag ggctctccta tctcctgagg ggctcctggt tccctgcaca
    13621 ctctccccaa cctccccttg gtcccaggca ccatcctcta aatgacagca gtctccagat
    13681 gccagttggc actgtgggtt agctgggtat tatatctgcc cgtgtgtgtt gaatggatac
    13741 ctgcgtggtc tctttctgaa aagccatctc caccgaattc tcgcccatgc tctgcttaca
    13801 aacacgcctc cttctgcaag gcccggagtg gactcagaga gcctctcccc tccccacctc
    13861 ctgcccccat tcctctcccg ctccccacag gctggctcca gaccacaaga gcgtgggaac
    13921 ctcatggaga gtgggtgctt cctctgtctt cctccctagc ccttgcttta gcacagccag
    13981 ggcagagagg aggaaagaag gaaactacag caaggaggat ttagggacaa ttctagaagg
    14041 gatttccaat ttggaggggc agtctgggaa gtaggcagcc tgtgaactta ataggaggtc
    14101 ctaaaggacc tgggggaatt tggggagaag acgaggtggg ctgccttacc ccaatcttct
    14161 ggcccactcc cagctccaga cccacctcca ggtgtagcag gcccccaggc ccaacagcag
    14221 gagcaggagg cccaccagca gtcccaggac ccagagcagc tgctggaact gatgcaggcg
    14281 gtgcagggct ggaacacaga gcgggactca gagcagccac agctgcaggc ccccagtgct
    14341 tccttccaag gggacccacc caagatgaca ccttctaact tttgctttat cgctgaagct
    14401 ctccacacaa gggtgccccg agcaatcagt ttagagtcga caggcaaatc atgctgcagc
    14461 aggagggata cagggaagga agcctttggg cctaagagct gggcttttgt gcagagtggc
    14521 ctgggagatg gagccccctc ccctcacctc tgaccccaaa gtaggtggag gtagaggcag
    14581 agcccaggac atttgaggca gaacagatgt attccccttg gtcgctgggc ctcagcgcct
    14641 cgatgtccac cctcagggca ttgggggaag ccaggacatg gatgtgtggc tctgcaggag
    14701 caccctgggg ctgactggag gccaccagtc gactgccaag gtggagagtc aggctggcga
    14761 gcggctcgct gtccactcgg caatccagga tgccccggag gccaccctca ggctccacga
    14821 agaccatcat ggtgggcgtc ttgggagggt ctgtggggag gaagggagtg tggtgattgc
    14881 agacaatgac cacaaatttc tccctccctg tacccatgca cagtgacttt gctgctcctc
    14941 ccattaagag gcagaaccta tttcctcact ccttgaatct gtgactcagt ttgaccaatt
    15001 gaagaaagaa gtgatgttgt gcaacttcag agccagacct caagaggcct tgtagcttct
    15061 gctctacttg ttggaacaca acaatgtggg aagcccaggc cagcctgctg ggcacatgtg
    15121 gcccaactgg cagccaggag cgtgcagtca tctgagacca ctgggcaact gcagccacat
    15181 gaggtgtgaa caacagaaga actccccagc tgagcccagc ccacagaatt gagcagataa
    15241 catgactttt gtttgaagca gtaagttttg ttgtggtcaa tagcaatagc tgactgatac
    15301 agtgtaggcc ttggtgaagg ctggagtggg agaccaagac tgatgagggc ttatcgtgtg
    15361 ccagacacac tcagcacgct ttatttcttt tggttttttc tcttcttttc ttttcttttt
    15421 tttttttttt tcttgagaca gagtctcagt ctgtcaccca ggctggagtg cgttgtggtg
    15481 atcttggctc actgcaacct ccacctccca ggctcaagtg agtctcctgc ctcagcctcc
    15541 tgaatagctg ggattacagg tgtgtgcctt catgccgtgc taattcttgt atttttagta
    15601 gagacggagt ttcgccatat tgaccaggct agtcttgaac tcctgacctc aagtgatccg
    15661 cccacctcag cctctcaaag tgctgggatt acaggcataa gcccctgcac cagccattta
    15721 tttcttatac aacttgcagg atgaggcgaa tgatgtattc cccattttac aaggtcaccc
    15781 aaccagacag cgggagagcc aggattccaa cccaggactc aatatccgtg aagccttcac
    15841 tcttggcaga cagaagagga ggaagaactg aggtgggcgt gtgagatgtg gggagactaa
    15901 ggcggaggag ggggttcaca ctcacagagc acacggagca tgactggagc agaggcagca
    15961 gccccagtgg ggagctcagc caggcagtgg tacatcccag cttgagcacg agccacgtgg
    16021 gtgaaggcga gagtgggcac aggctccgcg tgcagccgcc ggtcattcca gaaccatgca
    16081 aaggtggagt tgcccacagg cccagggcca cccaggaggc ggcagctgag gttcagggca
    16141 gcgccctcag gcacgtccag gccaggctct gccaccacgc gtgcacctgc gggcggagga
    16201 tagagagatg attggggatc tgtaggcctt gggggctgga gcctctgggt gggacctctg
    16261 aggggcctct gcacattgtg ggaaggtctc agtctgactt ggtatagggc tttccaaggt
    16321 ggaaccatgc cccctccagt gggagctgtg ccccctccac cagattaggc ttctcaaggc
    16381 agagctttct cagaccagac agcccgctct agtgggagct ccagcccctc tcgtggactt
    16441 ggacctgaga gcatcagagc ccctcctcct cccctctctt ccactcacct tctacctgca
    16501 accgcccgat ggtgctgatt gagcccagca agttttgggc tgtgcaaaca taggtgtcat
    16561 cacctgcagg cacatcttgc acctgcagcc gtagggcgtt tcgggccacc tggacatggc
    16621 ctgtccctga tgccaagctg tggaccccgc tgctcgtggc cagcaccttg ccatcatgag
    16681 atagggccag ctcagcaggt ggctcactgt ccacagtgca ctgtatcaca gccatggatc
    16741 tggccctgga gtcccggaag gaggacagga cagcgtcctg aggggcatct gtgggcaggc
    16801 agggcacaga tggggacctt gcttaggcac cctgtactgc tcattgccta gctgcctggg
    16861 gttggtcagg ctgaggcctc tggaccaatg gccacttcct agaagtgaca ccttcccagg
    16921 agtcctgtgg gcagagtgct ccaggcaggg cttataggac tgtctctttc tctagcacct
    16981 atgtcctctt tccccaactg cccattcctg ggcccactgc agtcctttgc ccactcccac
    17041 cagagcccag aagcaccctc caccaccctc ctcctgggca gccctggcca aggaccctct
    17101 gctcacagag gacttgcagg gcagcaggac gggagctgcg ggtgccctgg gcatcctggg
    17161 cctggcaaga gtaggcgcct gcatgagccc gcgtggccac caggaatgag agtgaggcag
    17221 ctggaccctc ctgcagccaa cgaccgttgt ggtaccaagt atagagtgtg ggtgcgtggg
    17281 cagcagggtc cgcacaggtc actgtgatgg gggcaccttc aggcacggca gcctccggag
    17341 ccaggatcac ccgcacacct gtgccaagga ggccaggatc agccgggccc agccggacac
    17401 cacagtgggc ttccatccca gtgggcttgg cctaggccct gccttaccct ccagccgcag
    17461 ctccagggac gtgttggcct ggcccagagg gctgcgggca gagcagctgt agaaaccctc
    17521 atccctgggc tgtggccctc gcagctccag gcgcagggtg ttggggacag aggctgctgt
    17581 cgaggaggcc aagaggcgac cggcgtggct gagggccagc tgggcgggcg ggcggctgtc
    17641 cacagtgcac agtaccaggg ccagctgccc gccatggctc tccaggaggt aggtcaggcg
    17701 caggttgcgg ggcgcgtctg cagggcatga gaggcttaca cggagtcaca ggcagcagcc
    17761 tgcaggaggc cagtttgcag gtggcattca aactcaggag ggccctggct ccccacccct
    17821 atggctccca accacccagt ctgaggcact gctcctctgc ccagacagga ctcaccccag
    17881 cgccccctac tcttaatgct ccttgcccag tgctccacta gctactgggt accgagttcc
    17941 ccactggaca cctgtcgggt tccggccatg ccgtgatccc tgtgggcttc tgtcattcct
    18001 gtgagtccca ccctgcatcc agccagactc acagaggacg tccaaggtga taggtctgga
    18061 gaggcggggt gcccgaccag gggggcccac accgcagcgg taggaggtgg catccctgac
    18121 tgtgacgttg ggcaggggga tggagtgggc atccaggcgc tgctgcccat cctggtacca
    18181 tgtgtaggtg agctgggccg ggtgagtggt ccacacaagg caggtcaggt tcaccagctg
    18241 cccctcccgc acggtagccc cgggccacac ctgcacattc acagctgggg agagggaggg
    18301 cacaggacac cagtgagggt cttgaggctg tgtacagcag gccacctgtc tccatgtggg
    18361 tgagctactc ctactgctgg ggagcccctt ggcctttggg agccttgggt ggcccaccta
    18421 taaatgtggc ttagaagtca agcttgggaa tagcccactg gctcctgagt ggtcccaagt
    18481 agagttccac agagtcctgg aaagccctga gcccctccct gaccctggct gggggcaaat
    18541 ggggagacca cagacctccc ctcccctgct gcactttctc ctgaaattcc ctgtttagtt
    18601 ttatccactg ggcatacttg taaaatttgg tgtttataaa aggttctcct aagaaactca
    18661 agtctggagt ccaccatcgt taggaagggt gggaatgggg actattcccg tgcccccagg
    18721 cttctagaac cctgtcccac ctcctctcct cttttaaagg acacacacac acacacacac
    18781 acacacacac acacacacac actgaccttg agcgtcgaag tcagctgagg ccgaggcctg
    18841 gcccagggtg ttggaggcct cacagatata gacaccctca tcctccagca tagccccgtg
    18901 gatctccaga cgcagtgtgt tgggggccac agccacatgc agcctgggag agctgccttc
    18961 gggtcccccc acaccttgta gggtggaggc cacaaggcga tccccgtgga gcagccgcag
    19021 ctgggccgga gggtcactgt ccacacggca caggaggagg cccagtcgtc cagggcctgt
    19081 gtccatcagg gtagtgagtg tgacgtggcg tggggcatct acacagggtg gggagtggtc
    19141 aggacccagc caggggggtc cctcatccag ggctctgtca tgtgacacag cccaggactg
    19201 ggggactgca ttccttcccc cacacctgga tgggacaaaa gtccttacag gacacgtgga
    19261 ggctgatggg tgcagctagg ctcgtggtgg ctgagcctgg ggcctgggct tggcaatgat
    19321 aggccccagc ttgtgtcaaa gttatggctg caaagcggag cgtggccgag gtcgactcct
    19381 ggaggggctg gccatcccga taccaacgat atgaggtccc ctctgggact cctgtgtgta
    19441 cctggcagct caggaccaca gcctggccct cttggagctc aggtgatggt gacacctgga
    19501 cccaggctcc tgcaggggaa aaccaagagc aggtgagggc tctccaccac acctctcaca
    19561 gtctgggacc atgtgcgtgt ccacctagga ctacacaacc caaccccatg tgctggagcc
    19621 gcagcccctt ccagaccacc gccagggccc actcagccct atgccttggc ccctcctgct
    19681 ccccactctc ctgcacaggc tccatcctgg catttacctc ccaccacctg gtgggatggt
    19741 tttactgtgt cagtcatatc tcgcctacca aagcgtaagc cttatgggca ctggacttgt
    19801 gtgacctgta accccagttc caggcactgt gcctgcatgt gttagctcct gacaaatgtt
    19861 ggttcaaccc ataaagtact gaaaagggag ggatttagat catcccgggg ccattgtcct
    19921 aactaggatc tggactcagc atgggtgacc gagggcttgg aagggaatgt tagcagctat
    19981 gttatcttga gcgctctcgc caggccagcc ctgcatccag actccaggcc acaagagcac
    20041 aggacctggg gaccaggcaa agggcagctc ctcagtggcc cctgggtgtg aagcaggaga
    20101 accccaggtt gcagggagtg agatggaggg acagctggaa tgctaagcaa gaacacagac
    20161 catgcctgag accacagcta gcagggtcat gcaactgggg aaaagctcct aaaactcaaa
    20221 ccttcaattt cttcctgcga aataggtatg ctaagtaaga aaagatacac agaaccaagg
    20281 ctcaaagtaa atgtcccaaa attggtgctg tcggtcaccg tggtggaaga aacactcatg
    20341 ttcctttgac cttcgcatta tgggcattat ctttcctttt tcctggttta tttgtttagg
    20401 gagaattttt ttttttttaa gacagagtct cactctgtca tccaggctag agtgaagtac
    20461 aattttggct cactgcaacc tccatctccc aggttcaaac gattcgcctg ccttagcctc
    20521 ccgagtaact gggattacaa atgcccacca ccacacctag ctaatttttg tatttttagt
    20581 agagtgttgg ccaggctggt cttgaactcc tgacctcaag tgatccgccc gcctcagcct
    20641 cccaaagtgc tgggattaca ggcatgagcc atggtgcccg gcctgaggga ggatttttat
    20701 agagactata tatatacaca cacacaaaca cacacacaca cacacacaca cacgtatgtg
    20761 tatataaata aagaatatga atatattatg cataagaata cacataatgt atatttgtat
    20821 gtgtatatac tcatacataa acatatatgc ataatataaa tatacatatg tatgagaatg
    20881 tttactatgt aacaaaatta tatgatcaca aaacaagtta tttcagaaag tgctcccacc
    20941 aagcaagcca ctctgtgaat tgccattgcc tcctagtggc tgctgcagtt attacaggtg
    21001 aaaatatcta gctggaggca aagggaggcc ttctgctggc tggaaactgc ccttaccagg
    21061 actgaaaaag acatatttct gcacaaaatt cacagcaccc agcccaaagt ctgaccagag
    21121 tagtccccag ggcatcagag tctaccatat cttcagaaga ctgacactca cctcggacct
    21181 ggaagaagag tgaggtgttg gatgatccaa gaacatttgt ggcctcacag cggtagctgc
    21241 cagagtcccc aaggcccagt tctcggacct ctaacttcag ggagttggcc tcagctttag
    21301 cctggaaccg accatgggat gggacctggg gacccaggct ggtggccagg aggtgctccc
    21361 catggaacaa ggccagcaag gccagggggc ggctgtccac agtgcagatg aacagagcca
    21421 tgtggccctg gcccatgtct aggagggctg acagctttgg acggtccggg ggatctgcag
    21481 gaacagaggg agctgaggcc acccagccta gctcaccaca ttacaactgc cacactatgt
    21541 cccccacctc ctgccacaca cacagcctaa gccatgctct ttcctcccca aaccccagcc
    21601 cggcccctgc ttctccccag accacccctc caccctccaa gtccccaggc tgcccatgcc
    21661 agtcacccac agagtacact caggagcacg ggagtggaga gctgggcacc agcctcagtc
    21721 aggatgcggc aggcgtaaag ggcagcatca gttctggcca cgggcagcag tgtcacggtc
    21781 tccaggggac cctgggccca cagcacccca tttcggaacc aggagaagtt agcagggctg
    21841 ccagcagctt cccggctcac gttgcaagtc aagttggctt ctgtgccctc ctgaagtgtg
    21901 tgtgatggtg caatggccag gacagtggct ggagagcagg cggcacagct tactgaccac
    21961 ccccggcccc caggagccag gggtccagcc gcccagcctg gaaggagcag gaaggaggcc
    22021 ccctggggtg cccacagggt tggggagaaa agcaagctag ctcacagggc agcctaggag
    22081 aagtgggccc tggggactgt gggggccctg ggcagagagg gttgcagtac agggaaggag
    22141 gacaaggcag cccaactgtc ccagctgggg agtccttcct cagagaccag ctgtgttgcc
    22201 tatccccggc ctgaacagag atcatgggaa ggagcagctc cgctcttcct gaatgtggag
    22261 gagggcagcg aagggccccc actgctgcac agagtggttt tgcctgatta gatcctcctc
    22321 ggagcagaac agtccagagc tccagcccct gcccccaggc cacccattcc ctgcctgact
    22381 caccctggcc attgaaggtg gctgaggtgg aggcgttgcc cagggcattg ctggcctcac
    22441 agaggtacaa gccctcctct tccagcaaag ggttgtgaat ctccacacgc agcaagttgg
    22501 gggctttggt gaccttcatg cgtggggaac agcccccaca ggtgctgcag ccaccccctg
    22561 atggcaggga agtggccaca acacggtcct tgtggagcag ctgcagcctg gcgggggggt
    22621 cgctgtccac acggcacaaa aggaggcctc gccgtccagc cccggcccca gcggcatcaa
    22681 ggtccagcct ggtggtgaat gttggttgtc gaggggggtc tgcagggagg aagaacatgg
    22741 gcactcatcc cacggatgct ccagggcccc acaagcctgg ctaggctccc agaatacact
    22801 ggacaaaggc agatcccgaa ggctgcccca agcagctgtg cagactgtgc actgcacaag
    22861 ggcagccaga ccagggtggg tggaactcaa gctaggctca tgctcctcaa gctaagctgt
    22921 gccctgatgc ctactgagtc ttcccagaag aaggagcacc attttctagc tcagacaaag
    22981 gtgctgtatg ggctggctgc caccttgcat aagtcccatc tgctgctgag cagggtgccc
    23041 acctagcaat ccctgggggt catgctgtct gctcccaccc tgcacccaat gccttaccac
    23101 ggagcctccc ctcaagatct gccctgcagc ttatctgtga tctccaagcc agctcctgtg
    23161 atactagcga atgcatagaa tggcctcttc ctgaaggctt ctggagacag ttggccaaaa
    23221 gagagcaatc agccaaggcc cccatgtgat ctcactcaga caccagaaaa ctttgcagag
    23281 gcagctgccc tctgacagtt ccgggacgtg gctccggcac acatgtgtca ggggaagcct
    23341 ctggaatcag gcccttggtc ttacacggag cctccccgct tggcggatgc agaggggctg
    23401 tctgtccctg tgctgagggg ctttgccttg gtgtctatgg gagcccagca cagcctgtgg
    23461 ggcatgtgca caaatagagg agtgtggtgt cccagggctc acccatgagg acaaagcatg
    23521 gatgtggtgc ccatgctgag cttgaggctg ggacagagga gcccacaggg ctggcattgg
    23581 agggtaaaga catggggagg tagagaaggc cccagcctca tttcctaaac ttcagcccca
    23641 catggaaaga cccactgtgg ccaggtctgt gacccagccc tcccgatata gaccgtgggg
    23701 cagggcagga cagggctggg gggcctagag gaggtggggt ggctggtggg tcccggcagg
    23761 aggctgcaac aggtggtggc tgctcacaga gcacagtgag aacagctggc gaagaggggc
    23821 cactggcact gtggccgtcc cgggcccggc agtggtatga gccggcgtca gtgctggagg
    23881 ccgcggggag caggaggctg ctgccgggac cctcgtgaag cagggctcca ttcaggtacc
    23941 aggagaagcg ggcatcaggt gtggggctta ggccgcttct gcagctcagt gtcactgcct
    24001 gtccttccac cacctcggct gccgggctga tgaggagacg ggcggctgcg gggagaggaa
    24061 gaggctggga agggtccctc ctctcaaccc cacatgctgc cctatataga aagccttcca
    24121 gggttctcct ttccccacac tactgtaggt agctctgggc tttgtggtgc atcaggaggg
    24181 tttgcccttt aatgccaaat cagatctatt ggtagtagat ggctgcagca tggttgaaga
    24241 ttcagagcca gagcccaggc ttatgtccaa cacctggctt ggctgggcat ggtagctcat
    24301 gcctgtaatc ttagcacttt gggagactga ggcagaagga tcgtttgagg ccaggagttc
    24361 cagaccagcc taggcaacat agtgagactc catctcaaac aatttttttt ttttgagact
    24421 gagtctcact ctgtctccca ggctggagtg cagtggtgtg atcctggctc actgcaacct
    24481 ctgcctccca ggttcaagcg attctcctgc ctcagcctcc caagtagctg ggattacagg
    24541 tgtgccacca cacccagcta atgttataca tgtagtagag atagggtttc accatgttag
    24601 ccaggcagat ctcgaactcc cgacctctgg tgatccaccc gcctcagcct cccaaagtgc
    24661 tgggattaca ggtgtgagcc actgtggcca gctttttttt tttttttttt tttttttttt
    24721 tgagacggag tcttgctccg tcagccaggc tggagtgcag tggcgcaatc tcagctcgct
    24781 acaacctctg cctcccaggt tcaagcaatt atcctgcctc agcctcccta gtagctggga
    24841 ccacaggtgt gcgccaccac acccggctaa tttttgtatt tttagtggag acggggtttc
    24901 accacgttgg ccaggctgat ctcaagctcc tgacctcagg tgatctgcct gcctcggcct
    24961 cccaaagtgc tggaattaca ggcatgagcc accatgcctg gccacaattt ttttttttta
    25021 attagctggg tgtggtgaca tgggccgtag tctcagctat ttgggaggct gagatgggag
    25081 gatggcttga gcccaggagt ttgaggctgc agtgagccat gaacatacca ttgcactccg
    25141 gcctgggcaa cagagcaaca ccctatctca aaaaacaaaa agaaaaacct ggcttgatca
    25201 attagctacc atgccctcag gaggagggaa ggacagtgca cataccgaaa gttggaagac
    25261 cgtacttttc ttttttcttt ttcttttttt tttttttttt tttgagacag agtctcactc
    25321 ttgtcaccca ggctggagtg cagtggcgct atcttggccc actacaactt ccacctcctg
    25381 ggttcaagcg attctcctgc cttagcctcc caggtagctg ggactacagg aactcaccac
    25441 catgcccagc taatttttgt atttttagta gagatgggat ttcaccatgt tggccaggat
    25501 ggtctcgatc tcttgacctc gtgatccacc cacctcggcc tcccaaagtg ctgggattac
    25561 aggagtgagc caccacgccc agccagaaga ccctactttt ctatttggct tcccacatct
    25621 gactgctagc atagagcctg ctcccagagt ttcataatta aaaaacaatg aatgcttctg
    25681 agggactctc caagtttagg gtcagggtag gtgcaaaagg aatgatgtga cctgttgtgt
    25741 ttcccttttt cccttgactt ccaggaagct ctgccgttgg gtcactgcac agcccctgtc
    25801 ttttatgtgg cgtagccagt tagctcagtc ctgcggttga gtccactaga cttctagaag
    25861 gaacagactg gagcaggctc ctcctcaggc tccctccact ctccctggct gccagtgccc
    25921 atcttaccat tggcatggaa gtccagggtg gaggttgcat ttccaaggga gttggtggct
    25981 gagcacttgt actccccact gtcagtttcc tccaggtctc ggatctccag gcgcagggag
    26041 ttgggaccag aggtaccact gaagcgtggg ctgtgatcac tgtccccgga ggtggaggcc
    26101 aggatatgac ccccatgtga cagcaccagt gtggccaggg gctcactgac cacagagcag
    26161 tgaaggatgc ccacaagtcc cgcctgggtc tccaggaagg ctgtcaggac tggagtgaga
    26221 ggcgggtctg tgtggagacg agaggtgggc ctgtcaccct cagacaaggg cattccctgg
    26281 ataccctgat acaacccgtg acctctgcac cgctttgtcc cacactgccc tgtgagaagg
    26341 gggtgatccc aaagtgcctg agtgcctgct gaatacactt ttgtccttgg ctgggctggg
    26401 tacgtcactc tgttgtccca gtcagtgttc aaggccaccc tgcaagtggg gataaaagcc
    26461 ccacttcaaa gatgagaaaa ctaatagaga gacctggtga gggacagcac ctcagccagg
    26521 tgaccaaagt gagcatcatc agcaatggga caagctgaca cagagtgcct cctgacaggg
    26581 cagagcagca tgtccgcggc cttcccaccc agagggcatg gtctcaggcc attcaagtca
    26641 ggcaaatgtg gagtgaggga cctcctcaga acagcaggcc tgcactctgc aagtttcaat
    26701 gtcaagaatg acaaggaaag actgaggaac agtctcagac taacggagaa tgaagagacg
    26761 caacgaccca atgcaatatg tgaactgtga ttgtattctg gaccagaaaa aaaatggcta
    26821 caaaagacag tattaggtcc actggtaaaa tgtgaatata gattatagct tagataacag
    26881 tcttctatca gccaggtgta gtggctcata cctgtaatcc cagtactttg ggaggccgag
    26941 gtaggtggat cacgtgaggt caggagttca agaccatcct ggccaacatg gtgaaaccct
    27001 gtctctacaa aaacacaaaa attagccagg caggatggtg ggtgcctgta atcccagcta
    27061 ctcaggaggc tgaggcagga ggagaatcgc ttgagcccca gaggcggaca ttgcagtgag
    27121 ccaagatagc accattgcac tccagcctgg gcaacacagt gagactccat ctcaaaaaaa
    27181 aaaaaaagtc ttctatcaat gctaattttc ctgatcatga tcattgcatt gtggatatgt
    27241 aagagaatga tcttaggaat ttagcggtaa agaagcctca tgtctgcaac ttcaggaata
    27301 taaatatgta ggtagataca taaataacat atgcatatgt atatagagtg tccatatatg
    27361 tataagtgca catgtccaca tagagtggat gtgcatacac acaagtgcac ctgtatatat
    27421 gcaagtctat atccatacat ttatatgtat gtatgtgcgt gtgtgtgtct gtgtagacag
    27481 cgagaaagat taagcaaatg tgccaaaatg ttaataaatg ggaaatctag gttaaaggta
    27541 tataatcatt atttgtcttt ctctgcaact tttcataagt ttaaacttcc aaatataaaa
    27601 taggagggaa ctagggaggt caaagcattg cccaggtgtg cagagctggg acatgagctc
    27661 aaggccacct ccaggtaagt ggccttgaag ttcccatgcc caggacccca acccttcctt
    27721 ggggctccac catctggtcc tgctctgact gtgtccagtg ccaccccacc ctggcctctt
    27781 acggttgact accacgctga cagggcccga gcgctcgctg ccatggacgt tctgcacctc
    27841 acagaagtag aagccagtat cagccctagt ggccaagtgc agccggaggg tatgggagtg
    27901 ggcatcctcc agcaggacat ggttcttgta ccagctgtag cggagatcac tgggtgcctc
    27961 attgggtgtg ttgcagacta gtgtcactgt ctggttctcc aggatgggac ctgctgggct
    28021 cacctggacc tcagccactg caagggcagc atagggagtg ctggggggtc ccagccaact
    28081 tccagcccca gccccataca gtccccgggt ctcagccaat cagcctcaat ctctgcacat
    28141 cctacaccac ccgcctgctg ctacagggga gcctcctgga gtgctctgca tctctttgtc
    28201 ctgctcacca ggatccccct gacaccccac ccttcgtggg ggtattgtca cctgtctcat
    28261 ggtaaggcag ggctgggact ccacacctgc atccaggaag cactgggtta tgccagttgc
    28321 tggccctgcc ctgccctgtc tcccctccgt ccctgaggcc tgagctcctc cctattctct
    28381 ggccaatacg caaccttccc aagaactcac tgaagatgtg gaggctgatg gggggtgaga
    28441 ccaaagagcc cacgccgttc tcagcttggc aggtgtagac gccagcatcg ctccaggctg
    28501 cctggggcag gtgcagcaca ccagtcttgg tttggaggcg taccccatcc ttgagccact
    28561 taatggaact gactgcaggg tagctgctgt tcacctggca ggtgagtgtg accagctcac
    28621 ctggaaggat gttcctcccc gaggggctga ggaggatctt cacacccttg ggggcatctg
    28681 caagtcacag tagggggtat tgggtaaggt gcttggggag ggcagaggat ggcacacttc
    28741 ttcttgcccc cctttaaaag ctcagtccta aggaagtatg cccagataaa acagcagtcc
    28801 ccttcaatcc tcacccaggg catgctctgt ctgcccatgt ccgtcccttt cccgccccct
    28861 gcgcctgatg cattcctatg cccattgaga gctgatcatg tgacgcttgg ccagcgtcca
    28921 gcctacccca ccagctgtag ttttctgctt cccaatgctg tccattgctc ctgctctatt
    28981 tgggatgagc tccacacaca gggtcatggg taccccactg ctagctttag tggcctgtgg
    29041 aagctattgg taagggacca actacctagt gggagggggc caaaggcagc atcaaactag
    29101 ctctgaaaat agttacccag tttgtaagca agaggccaac aacacaaaga actgcatttc
    29161 cttaaatctt gttccaaagc ctctctctgt gtattcaagt gttttattct tattttttta
    29221 gaaagaggat ctggctcagt cagccaagct ggagtgcaga ggcacaatca tagctcactg
    29281 cagcctgaaa ctcctgggct caagtgatcc tcctgcctca gcctcccgag gaactgggac
    29341 tacaggtgca agccaccaca tccagctaat ttttgttttt taattttttt gtagagacag
    29401 ggtctcacta tgttgcccag gctggtctcg aactccttgc ctcaagcaat tctcctgcct
    29461 tggcctccca aagcactggg gttacaggtg tgagccactg tatccggcct caagtgttta
    29521 atatgtgcca ggcactcttc taaatccttg acctgggtca tctcctttat ttgtttttgt
    29581 tgttgttgtt gttttttgtt tgagacacag tctcactctg tcacccaggc tggagtgcag
    29641 tggcacaatc acaactcaat acaactccac ccccggggtt caagcaatcc tagtgcctca
    29701 gcctccttag taactgggat tacaggcata tgccaccaca cccggctaat ttttgtattt
    29761 ttagtagaga cgaggtttca ccatgttggc caagctggtc ttgaatccct ggcctcaagt
    29821 gatccacccg cctcagcctc ccaaagtgat gggattacag gcataagcca ccgcgcccgg
    29881 cccatctcct ttaattttta tcataaatct gtgagacagg aaccatctat tgttatcttc
    29941 gtcatagact gagaaaacag agccaggcag gaaagggata aatcccacct ccccagcatc
    30001 ctccagcacg gggtaaatcc cggggcactt gctgccaacc ctgaagctgc atgggagctc
    30061 ctgttcccca accctttgtg tctccctttc tctgcccagc ctggcctcaa ggacaagccc
    30121 cttcgaagca gtaggaggtg gagggaacca tttaacgaag tccttggggc tcagcagtgt
    30181 ctctccactt gccctaccct ggaggcccag gagcagcctg gttttgcatc aggagcaagg
    30241 gttccgtttc tgtgggctgg agaggggctg gtttctgtca ggagcaacag atgcgctcag
    30301 ccacaagggt gtgtccccag acaactcaca cttcacttgg aggtgaatct cgctctgagc
    30361 cctgtgattg gccacggaga gctggcagcg caggatccgg ccgtggtcct gccaggacat
    30421 ggccatgtgg agggtctcca ggtggccgac gccggtgggc tcaaacttct ggctgttgaa
    30481 ggtgacagag cgagcagggt cctggccttg ccactgcagt ctgacctgct cctgcaggca
    30541 tacgtaggga gtggagcagt tgaagtccac ctctgtgccc tcgagaagct ccaccgggga
    30601 ggcaatggtg ggcaccctgg gctcctctga ggacagagac agcagtgctc aggacccgct
    30661 tttgccaccc ctgagatccc tcgccctgga aaccccagct gaggagagag ccctggggag
    30721 ggggctttta ggggaaggaa agacgctttc tactccagcc ccacttgggt gaatacaagg
    30781 gagaaccagg cctgggccta cgccggggct ggaacagagg ctgagactgg ctggggttag
    30841 attcaggaca agggctgggg ctgagagcca aggggtccag aagcagcttg ggaatccctc
    30901 ccggggggca gccaggccac cccacttatc acctgttact gtgaccaagg tgcctttcac
    30961 atctgaccag cggttgacct cactgatctc gaagcggaag ttgtaggaac cagagtcctc
    31021 gggctgcagg tccttcagca gcaggttgca caccctgtgc tcggggttcc ccatgaactc
    31081 ggtgcggccg cggaagcggg cctccaccag cttggggtcc gccgagtggc tcaccacctg
    31141 ccgctggccc gagtagtcgt agtaccagat ggccgtgatg ccgtcgggca cctccacgtc
    31201 ggcagggaag ctgaagatgc aggggataag caggcaagac cccttcacac cctgcacgtc
    31261 ctggggactg gagacgcccc atgaggcctg gcctggggga agaacggcag ggggacagag
    31321 gggagggtga tacaggcctc agggtgccac agagccctgc gacctgcccc agagaaggtg
    31381 ccccagctgg gctcccaaat tctgccctgc cccgagaaat gcacacttag agcagccctt
    31441 ctcagtgccc caggggtcag tccactgccc gaggctgctc agaggtttgg tagggtggct
    31501 cgaagacaga tggcagcttc ctgcccagct cctggccact gcccgattgg gccctccctt
    31561 gacctcagca accaaacatg tgacccaggc atagtctaat ctgccaaggg ctggacttat
    31621 gaaggctgga ccctggaaga caggcactag gcccgcaaag cttacctgct gggaagaatg
    31681 aggccaggag gagaagcttg ggcaagaagc ccatagcagg ttcttgtgct gctcctgttg
    31741 cctaagaggg tggtgcgcac tgcgctggct gggctcacag gggcctccag ggacacctct
    31801 gggcacttta gccccagcac ctgctagaag tccgagcctg tgtccccacc tcctctgctg
    31861 gccaacccaa taagagggca gggctcttaa agacctctga gtcagacacc agcagagagc
    31921 caggaggcca cgttcccagc tcaggctgtg cccaggaatg ccctcacttg gtggcctgcc
    31981 tcagaaaagc ctgtgtgtcc cttggcccta tcccaagttc tgctttccca gcccctcaag
    32041 gataccaccc taaggcagat gaacagctgt ttctccctct tccccacttc cctgccccct
    32101 ccccaccacc caaaccaaca ggaactggag cccagagatg cccagttact cactccaaga
    32161 cacccagcta gaatgatggt ttcttcctga ggcttgtctc ctaccacctg ccttactaac
    32221 tatagaccat aatggggctt tactgaactt gccgaagtgc tgcctttaac agtcactccc
    32281 ctgctcaaaa accttctgtg gctccccatt gcccgtgaga tgtgaaaagt cctcatttcc
    32341 tgcccccagc tctgtcccca ttccctgctc tccgcagacc cactctgggg gcagttctgt
    32401 ctgctcaagg gctccctagc tgcccagctc tatctccacc acagataatc tttgcctgct
    32461 gaaaccttat tcaaccttca aggagcagca tgaatttggc ttccagctgg aacttctcct
    32521 ttgaggttcc tgtagctacc ccagagctat ctctattttc cttgccttgt tttttacagc
    32581 ttgtgagagc ccatttctca ggacctagaa ctgaagatat gtgccccata gcagtgcgga
    32641 gcctaccagg cattcagcaa accccttagt gactaagaga ggggtgaggt ctttaggggt
    32701 tcagagctga ggttcagagt tggagtgggg aggtggcaag gcaagtctag gtttgaaagg
    32761 tagcatgaga gcgctgtgga acacataccc cacaaatatg agctcaatgt gtgcggagtg
    32821 taccatatcc aaaaaggcag gccctcaacc atggagtgcc cctggtcagg gagtgtctaa
    32881 ggggtaccat agacctgagc ccaaaaggaa gagatgccag aaacacatat aagtgaaact
    32941 ataaaactct tagaagaaaa caggtgaaaa tcttcatgac cttggattag gcaatgcttt
    33001 cttaagtatg aaattaaaag cacgaaaaaa aaaaataggt aagttggact tcatcaaaat
    33061 ttaaaacttg gccagacaca gtggctcatg cctgtgaacc caacactttg ggaggctgag
    33121 gcaggaggat cactttagcc caggagttca agacgagcct gggcaatact gcaagactct
    33181 gtctctacga aaaattaaaa aacaggcctg tggtcccagc tactctggag gctgaggtgg
    33241 gaagatcgct tgagcccagg ggaggggtcg aggctgcagt gagccatgat tgcaccactg
    33301 cactccagcc tgggtgacag agcaagaccc tgtctgaaaa gaacaaacaa cagctgggtg
    33361 cggtggctca cgcctgtaat cccagcactt tgggaggctg aggcatgcag atcacaaggt
    33421 caagagattg agaccatcct ggccaacatg gtgaaacccc gtctctacta aaaatacaaa
    33481 attagctggg tgtggtggtg tgcacctcta gtcctagcta ctcgggaggc tgaggcagga
    33541 gaatcacctg aacccaggag gcggaggtca cggtgagcca ggatcacgcc actgtactcc
    33601 agcctggtga cagagtgaga ctcttctgtc tcaaaaaaaa aaaaaaaaaa aaaggatatg
    33661 aatcaaagga cactatccag agagtgaata aaaggaggac aacccacaga atgggagaaa
    33721 atatttgtaa atcatttatc tgataaagga ctaatatcca gaatatataa agaattccta
    33781 caatgaacaa caacaaccac caaaaaaacc atgaaatcca actccaaaaa tgggcaaaac
    33841 acttgaataa acatttcttc aaagaagata tataactggc tgataaggac atgaaaagat
    33901 gctcaacatc actaggcatt aggaaatgca aatcaaaacc aaaccacagt gagataccac
    33961 ttcacatccg ttagaatggc tattcacaaa caaacaaagc aacacagaaa acaataaata
    34021 ttggtgagga tgcgaagttg aaattcttgt gtattgctgg tgggaatata aaatggttcc
    34081 gtcactgtgg aaaacaattg ggtcattcct caaaaagtca acataggatt accatatgat
    34141 ccagcaattc cactcctagg tatataccca aaagtactga aaacagggac tctaacagag
    34201 tacaccaatg ttcacggcag cactattcca ctaaaaggtg gaaacaggtc aagtgtccat
    34261 cagtgaatgg atgtggataa acaaactgtg gtatgtacat acaatggaat atcaatcggc
    34321 cataaggagg aatgaattct aacatatgtg aaccttgaaa acattatgct cagtgaaacc
    34381 agccagacac aaaagggcaa atattgtagg gttccaatta catgaaatat ctagactacg
    34441 tatattcaga gactgaaagt agaataggat agaggtaacc aggggctgca gggaggggga
    34501 gctaatgttt aatgattgct gagtctctca gataatgaaa aagttctgga aatagtggtg
    34561 atggttacac aatattgcaa atgtacctaa tgtcatgggg tgtacactta aagacagtta
    34621 aaacagtaaa ttttacatta tgtatatttc accacataca cacccatgtt gccaactttg
    34681 caatcctccc tggtcctaaa tgctgacttg gccaagtgaa cgaggaggct ggaagtgggg
    34741 acaggaaact catgacctcc cagctcccag cccatccgcc tcaggggctg ggctcagcag
    34801 attccaatga ctaccagggg tcacacctgg gaagggggtg agccgaggcc cagggccagt
    34861 caggctgacc aggtgggact tagcctgctg cagaaggcag aaggtgcccc agcagggggc
    34921 acagtacagg gcgggattgg gacaggaagg acaccgctcc ccaggggacc cagccctctc
    34981 gcaggctgct ggagtggact gatctggcca tttatggagg cccaagggct catctccagt
    35041 tctctaggaa gccctaggcc tcctcctctt ctgggaagat gcacccccag cctccacacc
    35101 aggttcttgg ccactggaga atgatatagc tggggccctg ggacctggac acctcaccgt
    35161 gaagaaaagc agcctgctgg gcacactggg ggtcagatgt gtccctggcc acaggggatg
    35221 tcagggtcag ctctgctatg gccagggcag ctatcttgtc ccagctcccc tgttcctccc
    35281 atttggggtc ctgaaaaggg caatcgtgaa cctgatggaa gaatggtggg gttctggaca
    35341 cagcgaccct ggaacagggc gcgggggagg accctttcca ggaccactcc catcacataa
    35401 tgtagaggcc acctatgctt agcccggccc taaccccaaa ggggtcagcc ccaccggaat
    35461 ccagcctatt ggctcagcct gtcaccacaa aggccagctt cagcccagat aactgttctg
    35521 gaaacagaaa gagcagggac cgctcagaaa ggagatctct gtccctgttt gaaagcctgg
    35581 agttgagggg acagtgcccc gccccccgcc ccccgcaact tgggttgcag ctgtggccta
    35641 gtgagcacgc agcgccccct ggtggtcgag ggggaattgc gggtcccggg aaagggggcg
    35701 gtgtgccagc aacagggagc aggcagctct gcagccctga accatccctc ccttgggtga
    35761 ctcttttgga aatcattgtt ccccagacag gaggttcctg aggttcatac ttgggtctcc
    35821 aagtcttggg tgctctgaag acaggatttt aaatccccac tcctactatt ggtatgtgtt
    35881 gcatcagggt ggcttgagct ggcctggtac acagtgggca ctcacgtttg ctgcctgttt
    35941 gagaccaagt gcctcaggag gtcttggagg gttggctggg gccccaagtc cctgacctct
    36001 gattccagag gccaagttta gctggggaag aaagggcaga ggcagtttcc ctatggacag
    36061 ctaggcccgg gtgtaggatt cagtttctgt ttcctgacac caaggcttct ccccaacttc
    36121 cccattgggc tagagaagga agaacacagg gtgacatggc cagctggagg ttactggccc
    36181 acagataggg agtcagggta cggatgggca attcctggag cagattatgg tcaaaatagt
    36241 ggaaatcccc aatcacaggc caaatgttta attctcagag ccatagaatc cataactaat
    36301 gcttattggc tttgacatgg gcagtagaaa tttcacactt cattccttaa tctggctcta
    36361 aatgcttctg gctggagtgc tcactctcca aactgtgctg gacagcacca gaagccctcc
    36421 tgtggacgga cgaagtggtg atggatgaag tggaattgtg ctggggttag agtaaggaga
    36481 gattatcgtt gggctgggct ggggctgagg tcagggtgat gatcaggttg gaattctgga
    36541 agcaaattaa ggctgggatt ggggtggaca ttgaggttga gtgatgggtg ggggtaaggg
    36601 tgaaggttgg agttgggtgc aggtgatggt taagatacgg tggaggctgg gttgagatga
    36661 ggatgataga gtcggagttg tggttagggt tgggatgatg gatggttggg attagatgag
    36721 atgagtaaat ggttagggtt ggggtgcaag tgaggtgagg gtgagataag gttggaatag
    36781 gggctggggc tggggctgag gtagggtcag agcagggtga tggtcgggac tgggattggg
    36841 atgcaggttg gaaggatttg ggggtggtgg aagggtttca gttgagtcca tctttgatta
    36901 gtgtcttgga cttgggttgg tttggggtcc accactcgca cccagatgga gccccccccc
    36961 gacccctgcc cctatcccgc tcagccagtt tcagcccagc cccgctccta atgctccact
    37021 caccctctgg gcccagggac cagggacagg ggtacctgct ccacagaagg aagtggctgc
    37081 ggcggtgctg gacctgcgga gaggagaaca ggaaggacgg ccaagagctc ctggtgcagc
    37141 tggctcccca gggctctgcc ggctcaagag agaaggatcc cgtatcaggg gctgcttcct
    37201 ctttcccaaa gcctcagctc tactgtccaa cccagaggct ggtcagggag gcagctgcag
    37261 gccttgcgca atgccaaaac gggaaagacc tccatagggg aaggccctcg gcaaggccag
    37321 ggacttaggg actccagcaa gcagaagtgg gaccgctgca acgctggagc ctccccaggc
    37381 aaagtgagaa atggagtggg ggactcccat tcaccagcca aatccacacc ccactctctc
    37441 tgagcccctt agggaggctg ggggaggtgg aagggggctt cctgcacaca gctctcctcc
    37501 ctgacacctg agagggaggc gcgcccagcc tggggtgggg atgcactaca cgatgcccag
    37561 accaaggcag ttattctcca agccattcag gagcctccct gtaccactga gtactcttaa
    37621 gaacccccaa gaggcagtcc cgtgttgtgg gggacataag gccccaatgt ccaggacact
    37681 cctcaggttt tctctttccc ctctcactgt cctgtacgtt ttttggtttt tggcttcttg
    37741 ttgttggttt ttttcttttt gttttgtttt tgcttttgag atggagtctc actctgtcgt
    37801 ccaggctgga gtgcggtggc acaatctcag ctcacggcaa cctccacctc ccgagttcaa
    37861 gtgatcatct tgcctcagcc tctcgagtag ctgggattac aggcatgcac caccacatcc
    37921 ggctaatttt tgtatttttg gtagagacgg ggtttcacca tgttgtccag gttggtcttg
    37981 aactcctgac cccaagtaat ctgcctgcct cggcctccca aagttctggg attacagagg
    38041 tgagccgctg cacccggcca ctatcctata ccttcacccc caccttggga caggaaagga
    38101 aagcccccac cacagctagt tactgttaca ttactgcagc ccatcttatt gagggcctag
    38161 tttgtgcagg cacttgacat gtgaagcaga tgttaattgc tccaagcaat cccagataca
    38221 agcaacaaga ctttttttgg tttttgtttg tttgtttgtt tttgtttttt gagacagtgt
    38281 cttgctctgt cgcccaggct ggggtgtcct ggtgcaatct tggctcactt tggctcactg
    38341 cagcttcgaa attctaggct taagcaatcc tcctgcttca gcctcctaag aagccgggat
    38401 tacaggcgct tgccactatg cacggctaat tttttaagtt attttgtaga gacggagtct
    38461 ccctatgttc ccaggctggt ctcaaacacc tgggctcaag tgactctccc acctcgacct
    38521 cccaaagtgc tggaactaca ggcgtaagcc accactcctg gccatttttt tttttaattt
    38581 cacaactcaa acttaattca actcagtccc tgcctacctg tactggtggg gagctggcca
    38641 cagcaaatta ggcccctgta ccctgagggc aaggccacgt gtgcaggtgt aaaaggatgt
    38701 gaaaccctaa atcagggtgg atccagaatc gcaggccatg gtgccccaaa gcagatgtct
    38761 ggtgacattc caccctgaaa tgctcaggct acagagatat taggtctcta tcactctgtt
    38821 cctctttata gctcctgtgt ccactcctat gcttgggcca tttcctcttt cggccaaaac
    38881 acaaagggtt catcccatta cttcctccct caacagctgg tccggagaca cccagctcta
    38941 ggcctgtggg gttgtgacac atgggtacca atccttcagt ccactgggac tctatacatc
    39001 caccccttgg cttcatggtg ggaaagcatc ccttgacttt gaccttagtc atatgacttg
    39061 ctctggccaa tggatgtgca cggacatgac acaaactaag tcttgaaacg tacttaagca
    39121 gtttgccttt ccccctgaat ttctgtcatt gccatgcaag gggcaggctc caactaacct
    39181 gctggtccaa agaggatgac gaacacatgc agatgacctg aatcagaccc atggattgaa
    39241 acaaaactca gctgagccca gcctacgtcc accagaccag tcaacctgtg gatgcgtgaa
    39301 tttattgctg gatgctgctg agaattttgt ggctacatta gcaagatgat acaaggccta
    39361 agtcccagaa caacacaccc agaacttgct tacctttcct tagcatgagg agagcaaaga
    39421 cttgtctacc ttgattagtc agggagcact gcttcctgtc atttccttga gtatacagca
    39481 aactaggtaa ataaataaaa ataactaggt aggctgggca cggtggctca cgcctgtaat
    39541 ctcagcactt taggaggccg aggtgggcag atcgcttgag gccaggagtt caagaccagc
    39601 ctggtcaacg tggcgaaacc ctgtctctac gaaaaataca aaaattagct gggcctggtg
    39661 gcaggcgcct gtagtctcag ctactcagga ggctgaggca cgagaatcga ttgaacccgg
    39721 gaggtggaga ttgcagtgag ccgagatcac actactgcac tccagcctgg atgacagagc
    39781 gagactctgc ctcaaaatat tttaaaaaat gtaatttctc agtaggtcac actgttacac
    39841 attcatctaa taacaattat tctttttctt tttttttttt ttttttggag atagggtctc
    39901 actgtcaccc aggctggaca ggctggagtg caatggcaca atctcggctc actgcaactt
    39961 tgacctccta ggctcaagtg atcctcctgc ctcagcctcc caagttgctg ggactacagg
    40021 tgagtaccac cagacgcagc caatttttgt atttttttgt agagatgggg cttcaccatg
    40081 ttgcccaggc tggtctcaca ctcctgagag ttcccagtca agtgatccac ccaccttggc
    40141 ctcccaaagt gctgggatta caggtatgag ccaccatacc cagtgaatta ttctcgttcc
    40201 agataggaaa accaagtcat agggaggtta gagaatttgc caaagacaaa actttttggt
    40261 tgaaaaaaaa taagttttgc tacaagtata gaaaacacca aataacggtg ttttaaataa
    40321 aatagaagtg tttcaccctc tccctcaagt aagtgttggc atccaagatg atatgacaac
    40381 tccacaatca tgaaactaga tcccttttta ttttttggct cagtcattgt caatgggctg
    40441 cttccagtca ttgtccaaag tggctgcttg cgctccagcc actgtatctg cattcgggaa
    40501 agcaggatgg aggaaaggac aaagtaggcc atgccctcta ctttaagata actccttaaa
    40561 ttgctgatgt ggtgggtcac tcctgcaata ccagccactc aggaggctga agcaggaggt
    40621 tcacttgaac ccaggagctt gaggctgcag tgaactataa ttgtgtcact gcattccagc
    40681 ctgagtgaca gagtgagatc ttgtctctta acaacaacaa aaaaggtaat tctttaaatt
    40741 gtacacttta ttgctgctta tattccacta gaagctagaa gttagtcaca tggtcttaag
    40801 tctttattct aacaggtgat gtgcccgagt gaatcctggg ggcccagtgc taagaagtag
    40861 agggagatga agccctgccc tcctgtggag caaccaggac cttaattcag tcattcattc
    40921 cttggaccta cccagattcc agcctcaggc ccctgcccac ctccttgcca gtatctctgc
    40981 agagcctcct gcctccccca gagggtgcca gagccctctg cttcctcagc cttcagaccc
    41041 acttgctgtg tctcaggtgg ggacaactca gtctatagag gcccaaggga gatctttgag
    41101 acccatactg tagtcagctt ggggcagatg ggagtcatgt agctcactgg ctaaaagcct
    41161 aggtcctgca tagagctggg ttcaaatcca ggcatatcca tcacctgctc tacgatccca
    41221 ggagtcactc aatgactcag aagctagagt cctctggaaa gcttgtatga agagttccca
    41281 gggcctggaa catcaaaaga gctctgagaa tgttggctct gctgttgctt ctcttattgg
    41341 gtatctgaag gccgtactct cctctctcct ccacccgagg tgacctctac tctcatagcc
    41401 acaccctgga ccctcactca ttcagtcacc tcacccagtc tgtcatctgt ccctctggct
    41461 ctcctctcct gttcatgcat cttgtactca cccacttctg tccctgagtg tttcaaggct
    41521 ctggggtttt cagagacact gaagggacct ccctcctcaa cacaaccaca aggtctaggt
    41581 ggaatgaccc actaagggac cggctctgaa gccagagact tccagggaaa gtcaacaagc
    41641 ccaaggatgc ccgttacaag aaagttaaaa gacccatgta catgtcctcc cgttttattc
    41701 cctgctcagg gtctgggcat acagtggaac acatgcagtc cccaaaggac accgtctgta
    41761 cagagtcaga tggagttaag aacatttagc cggctgggcg cggtggctca cgcctgtagt
    41821 cccagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc ggagcttgca
    41881 gtgagccgag atcgcgccgc tgcactccag cctgggccac agagacagac tctgtctcaa
    41941 aaaaaaaaaa aaagaatatt tagtctgggc acggaagctt atgcctgtaa tcccagcact
    42001 ttgggaggct gaggtgggcg gataacctga ggtcagaagt ttgagaccag cctggccaac
    42061 atggtgaaac cccatctcta ctaaaaatac aaaaattagc caggcatctg taatcccagc
    42121 tactcaggag gctgaggcag gaaaatcact tgaacccagg aggtggcggt cacagtgagc
    42181 caagattgtg ccagtgcact ccagcctggg tgacagagca agactccatc taaacacaca
    42241 cacgcacacg cacacatatt taaaccacca caccaacatc tagttcaaga tggtggactg
    42301 agaacttgtc tctgccattc ctggcccatc caataccact gagagcacag taagcaaagg
    42361 gaaaaggaga cagaagggct gggaacagga tggctggggg atgggaagta tccactgcac
    42421 aggattttga tttaattcta gaagatagaa agagggagga tcacgttcag gaacagatgc
    42481 gggtaaagga aaccagagcc aaagcacgct gagagaaagc tgccccagag gccggagcag
    42541 aagtggactc tctacaggga ctcaatacac cccaaagggt tggtagctgg cacacgtacc
    42601 tctccacccc cacatgtaac actgcagggc agaggaaata ccctaggtga gttccagtat
    42661 caatcaacct gccctttgtt cataaagatg aattggttac caggtattac cagacaggtg
    42721 aggaagacca acacaaagag aaagatccag aaacaaacag gccaggcgca ttggctcacg
    42781 cctgtaatcc cagcactctg ggaggccaag atgggtggat cacctgaggt caggagttca
    42841 agaccagcct tgccaacatg gtgaaaccct gtctctacta aaaatacaaa aattagctgg
    42901 gtgtggtggg gttctcctgc ctcagcctcc cgagtagctg ggaggctgaa gtaggaggtt
    42961 cacttgaacc caggagctcg aggctgcatt gaactatgat tatgtcacta cattccagcc
    43021 tgagtgacag agtgagatct tgtctcttaa caacaacaaa aaaggtaatt ctttaaattg
    43081 cacactacat tgctgcttat attccactgg ctagaggtta gtcacatggt ctcatgtctt
    43141 tattctaaca ggtgatgtgc ccgagtaaat cctggggccc cagtgctaag aagtaggggg
    43201 agatgaagcc ctgccctcct gcttgaaccc tagaggtaga ggttgcagag agcggagatc
    43261 atgccactgc actccagcct gggtgacaga gtgagactcc atctcaaaaa ataaaaaata
    43321 aataaataaa taaataaata aataaataaa aacctacagc aaagaacaaa gagctattgc
    43381 accgttactg tgggcctggc agtattatga caactttttt tttttttttt ttttttggag
    43441 atggagtatt tttctgtcac ccaggctgga gtgcagtggc tcaatctcag ctcactgcaa
    43501 cctctgcctc ctgggttcaa gcgattctcc tgcttcagac tcccgagtac ctggtattat
    43561 aggcacatcc caccacaccc ggctaatttt tgtattttta gtagagaccg ggttacaaca
    43621 tggtttcacc atgttggcca gtctagtatc aaactcctga cctcaggtga tccacccgcc
    43681 tcagcctccc aaagtgccag gattacaggc atgagccacc acacccatcc agtgttctga
    43741 caattttata cttattaact catttagcaa ccctctgagc tagatgctat tgttatctcc
    43801 acttacagga aactgagtca cagaatggta cagtaaaata accttgaccg aggtttacat
    43861 ggctggtaag tggcagagaa atgaaacttg aacccaggca acctgtttcc tgagttggac
    43921 tctgaactac tcagccatac tgcctattta tttatttatt tatttactta cttacttatt
    43981 tattgagatg gggtcagccg ggcatggtgg cttacgcctg taatcccacc actttgggag
    44041 gctgagggga acagatcact tgaggccagc agttcgagac cagcctggac aacaaggaga
    44101 cagggtctcg ctctgtcacc caggctggag tgcaatggtg caatcatggt tcactgcagc
    44161 ttcgacctcc caggctcaag cgatcctccc acctcagcct cccaagtagc tgggaccata
    44221 ggcacccact atcatgtcca gctaatcgaa aaaaaaaaat tatatagaga tgggggtctc
    44281 actatattgc ctagactggt ctctaactcc tgggcttaag caatccaccc accttgccct
    44341 cccaaagtac tagaattaca ggtgtgagcc accgtgcctg accaatactt cctctttaat
    44401 aaactaaaca aaaagtgaac aaaccaatcc cggagggaac agataattca aggaatacaa
    44461 gaaaacttat gaaaagaaag attctagtgc caatatgttc ataaaacaaa aaaaaagcag
    44521 ctatgaaaag aaagcaaaaa ataattcttg gaaatttaaa atagaattgc tgaaataaaa
    44581 agaaattcaa tagaaaaggg tgtgaacatc tgtggttttg gtttccaaaa ttagttcact
    44641 cttctttggg taaaaatacc ctgattttcc tgctgtattc tcaagccctg tgggttggtg
    44701 gaattgacac ctcttccatt tactgcacga gagaagcatg ttgctgctca cagttcatta
    44761 tgaggagcca gcaggtaaag acaacactga gggcagatga aagatatata ctgaccctgg
    44821 gtccttgtgg acatcattga gtccctggat caagccttac ctgaagctaa agggatgagt
    44881 gcttctcact gttaatagcc actgatcaaa ttccatagaa ggactccaat tatccttgcc
    44941 tcagttgtgt gtccagccct cggatgagct accatcaagg gaaactgcca gaccttggtg
    45001 acaagcccac ccactgaaaa tggggaaaaa acccaagtca catcattgaa ctgaaagagc
    45061 tgcagttccc cgaaagacag gagagaagga atgctggata aacacaacaa ctacttcatc
    45121 tcactgcacc tgcaaagaat ccaggcacag gaaggtgggc acagcaagta gtctttctgg
    45181 ttgacataac aaagacaaca tctgtttctt ctctcagaaa gaagtaagaa aggcaattgg
    45241 agaaccagaa acgacaaaat ctattccagg accaaataga aagaaaacac atctgaagca
    45301 tgattttgaa caattagtgg ggtgtaagaa aagagaatcc atttgacctt gacatctgaa
    45361 acagcaaata ttctccatca aggcacttta gggtagagga aatgatacta tgtttccttc
    45421 tcaaaggagc tatctggtta cgtggttctg cagtaaatga cgtttataca atcataacat
    45481 ccgttggttt tcaattttca gaatcaacct gaagataaag catggaagat ttcattataa
    45541 ttaccgaaca gcatgcaaat gttacaaact ttgaccatga aaatgtaaaa aagagcccag
    45601 gcgcagtggc tcacgcctgt aatcctagca ctttcggagg ccgaggcggg tggatcacaa
    45661 ggtcaggagt ttgagaccag cttggccaac atagtgaaac cccatctcta ctaaaaatac
    45721 aaacattagc taggcatggt ggcacgcacc tgtagtccca gctacatggg aggctgaggc
    45781 aggagaatca cttgaaccca ggaggcggag tttgtggtga gccaagatca cgccgcggca
    45841 ctacagcctg gtcaacagag caagactcca tctcaaaaaa aagaaaagaa aagaaaagaa
    45901 aagtagacag cagaaattag agggagatgg caggtgacct ggggctgggg gaggcacaat
    45961 ttttcttaca tagtgatagg gctcaagaaa ttctttctaa aattgatgta tgaggaacga
    46021 aattttaaat atagattgtt tagaactata agtagcacca acacaggact ccagtaatca
    46081 cacagcaaac aaaactgaga aaacagacat gggcggggag atgggaatgg gtgatttcca
    46141 ttccctgctt taatgatgag gtgtcagtag atgctctctt gagttactaa attgagaaac
    46201 aaagataaag tatattgttt agaggagtga tgttcaaact ccagaaaagg ctgggcgcgg
    46261 tggctcatac ctgtaatcct ggcactttgg gaggccgagg tgggcagatc acttaagccc
    46321 aggagtttga gaccagccta ggcaacatga tgaagcccta tctctactaa aaaaaaaaaa
    46381 aatacaaaaa attagcaaag catggtggtg cacacctgta gtcccagtta ttcaggatct
    46441 cactactgta ctccagcctg ggtaatgaga gtgagaccct ctctcaaaaa aaaaaaaaaa
    46501 aaaagaagaa gaagaagaaa actccagaat atataaaaat gaaaagaggg cggaagagtg
    46561 gaactgcggg gtagaaaatt ttgcattttg tttcatctct ctgtatatgt tgaacttttt
    46621 ttttaaccta catgtactac gtcatatcca cagcccccaa ccatccacca ggcatcaact
    46681 gctgtgttta tatggagggt tgagcaatca ttcctgcctc cttccagttc gtcttcctgt
    46741 actgcagagg ctagaaaact aaatttatat cacccagatt cccttccagc taccttaatg
    46801 ctaccacctc attcagccat cattgattct ccacttcacc aaactggtca ataattgtct
    46861 cttctaaaaa tattagaatt gggactaaga gacactacag ctgaccctta aacaacatgg
    46921 gtttgaactg tgctagtctg cttatacaca gatttttaaa aaaataaaca tattgaaaaa
    46981 aattttggag agatgtgaca atttgaaaaa actcacaaac cacatagcta gaaatatcaa
    47041 aaaaaaaaaa actgaaacag agttaggtag gtcaggaatg cataaatata tatgttaatt
    47101 ggctgtttat attattagta gggcttccag tcaacagaag gctattagta attaagtatt
    47161 tgagtcaaaa gttatacaca gatttttgat gatgaggagg attagtgccc ccaaccccca
    47221 tgttgttcaa gggttagcta tatttgaaat ctgctggtag ctggaattgg gcaactaaat
    47281 tcacgagatg tggaacggtc gtatacatgg agaataaata aataaataaa taaataaata
    47341 agtatttgtt ggcctggtgc agtggctcat acctgtaatc ccaacacttt gggaggccaa
    47401 ggcaggcaga tgacttgagg tcaggagttc gagaccagcc tggccaacat ggtgaaatac
    47461 cgtctctact aaaaatacaa aaatcagctg ggcgtggtgg tgtgcacctg taatcccagc
    47521 tgcttgggag gctgagggat gagaattgct tgaacctggg agtcagaggt tgcagtgagc
    47581 cgagattgca ccactgcact ccagcctggg cgaaagagtg ggactctgtc tcaacaacaa
    47641 caacaaaaaa gcaattgtta aaagaataag aaaacaagcc acagattggg agaatatatt
    47701 tgcaaaacaa atatctgatg acccaaaata cacaaagaac tcttaaaacc caacactaag
    47761 aaatcaaaca accccattaa aaaatggaca aacgggccag gcatggtggc ggtggctcac
    47821 atctgtaatg ccagcacttt gggaggccga gcagggcaga tcaccttagg tcaggagttc
    47881 tcaactagcc tggccaacat agtgaaaccg tctctactga aaatacaaaa attagccgac
    47941 cgtggtgacg tgcatctgtg gtcccagcta cttgggtgtc tgcggcagga aaatagcttg
    48001 aaccagggag gttgcaatga gctaagatcg cgccactgca ctccagcctg ggcaacacag
    48061 tgagactccg tctcaaaaaa aaaaaatggg caaaagatgt gaacagacac ctcatcaaag
    48121 aagatataca gatggaagat aagcatatga aaatatgctt aacctgatgt cattaaggaa
    48181 ttacaaattg aagcaacaag gtaccactaa acccctatta aaatggtcaa aatccagaat
    48241 gctaaaaaca ccaaatgctg gtgaggatgt ggagtgacag gactctcatt cgttgctggt
    48301 ggaaatgtaa aatggtatgg ccatttttaa gacagtttgg cagtttctta caaaactaaa
    48361 catagtctta ccatacaatc taacaattac actcctagat agttacccaa ttgaggtgaa
    48421 aacttatatc cagggccgag cacagtggct cacacctgta atcccatcac tttgggaggc
    48481 caaagaggaa ggattgcttg aggccaggag ttctttcttt ttatttattt atttatttat
    48541 tttcttttga gacagagttt cgctctgtca cccaggctgg agtgcagtgg agtgatctca
    48601 tctcactgca atctctgcct cccggcttca agcgattctc ctgtctcggc ctctgagtag
    48661 ctgctgggat tacaggtgca cgccaccatg cccagctaat ttttatattt ttagtagaga
    48721 cagggtttca ccatgtcagc tagactggtc ttgaactcct gacctcaagt gacctgcctg
    48781 cctcggtctc ccaaagtgct gggattacag gtgtgagcta ctgtgctcag ccaaggccag
    48841 gagttccaga ccagtcctgg caacacagtg atactctgtc tctacaaaaa aaaatttttt
    48901 taattagtca catgtggtgg cacacacttg tagtgtcagc tactggggag gctgaggctg
    48961 aggatcactt gagccccagg agtttgaggt tgcagtgagc cacgattatg cctctgcact
    49021 ccagcgtggg caacagagta agagaaagaa agagagagag agagagagag agaggaagga
    49081 aggaaggaag gagaaaagaa acgaaaagaa aagaaggaag gaagggaagg aagggaggga
    49141 aggaagggag gaaggaagga agaaagaaag aaaggaaaga aataaaaaag gaaaagaaaa
    49201 gagagagaga aaccttatat ccatacaaaa ccctgcgcaa gaatgtttat agctgcttta
    49261 ttcataattg ccaaaaactg gaagcaacta agaagctctc caatgggtga aaggataaac
    49321 aaactgtggt atatctgtgc aatggagtat cattaagtaa taaaaagaag acttagcaaa
    49381 ccacaaaaag taatatgcat actaaatatg catttagtaa tattaatatg cattaagcaa
    49441 taaaaaaaga cgtgtcaagc cacaaaaagt aatatccata ttactaaatg aaagaagcca
    49501 gtctgaaaag gctacgtgct gtgtgatttc aactatttga ctttctggaa aaggcaaaac
    49561 tacagacagt aaaaagatca agggttgccg ggggttctgg ggacaggaaa ggatgaatag
    49621 gtggagcacg ggaattttag gtcagtgata ttattctata tgatactata atggtggatc
    49681 catgatatgc ttttgtcaaa acccatagaa agtacaacac aaagagtgat tcttaatgta
    49741 aactatgtgc tttagtcaac aatgtattga tattagttca ttaattttaa caatgtacca
    49801 cactaatcca agatgttaat aatggggaaa ctgcgtgtga gggaggggat acatgggaac
    49861 tctgtactac agctcaataa ttttataaat ctaaaacttt ttttttaaat aaagtctctt
    49921 agaaaacaat aaaaataaaa taaaaagtca attttttttt tttttttgag agggaatctt
    49981 gctctgtcac ccaggctgga gtgcagtggc acaatctctg ctcactgcaa cctctgcctc
    50041 cttagttcaa gcgattctcc tgcctctacc tcccaggttc aatcaattct cccacctcag
    50101 cctcccaagt agctgggact acaggcatgc gccaccacgc ctggctaatg tttgtatttt
    50161 tagtagagat ggggtttcgc catgttggcc aggctggtca agaactcctg acctcaggtg
    50221 atccgcccgc ctcagccttt caaagtgctg ggattacagg gttgaaccac atgcctggtc
    50281 taaaatgtcc atttttaaaa ggcagtctgg ctggggcagt ggctcacgcc tgtaatccca
    50341 gcaccatggg agaaggctga gcagggtgta tctcttgagc ccaggagttc gaggctgtag
    50401 tgtgctatga tggtgccact gcactccagc ctgggtgaca gagtgagact ctgtctctaa
    50461 aataaataaa taaaaataca ataaaatttt aaaaggcaat ctgcagcaag agaagatgaa
    50521 gttagcagga gagcaggaca gaggagagtc catgaacaac agggagagga aagtggtcat
    50581 ggtggcagct cccaggctcc tggcatatgt tcaattccct gttcccagcc ctcaggaagc
    50641 ccagatgtcc ccacctgccc ctacatacaa accctggatc cttgacatca acttctcctt
    50701 tccttcatgt gctctaatga gtttttgtta cttgtggaac atttaactaa catcattaac
    50761 caagtggacc tgtctcagag tggtgttatg agaagacagc agccaaaaga cagctgcagc
    50821 caagcacagt ggctcatgcc tgtaatccta gcattttggg aggccgaggt gggtggatca
    50881 cctgaggtca ggagttcgag accagcatgg ccaacatggc gaaaccccct ctccactaaa
    50941 aatataaaaa ttagccgggt gtggtggcga gcgcctataa tcccagctac ttgggaggtt
    51001 gaggcaggag aattgcttga acccaggggg cagaggtggc agtgagccgg gatcatgcca
    51061 cttcactcca gcctgggtga aagagcaaaa ctctgtctca aaaaaaaaaa aaaaaaaaga
    51121 cagctgcaac aaatgtcaag ttctgtgtgt tttcttttct tttctttttt ttctatttaa
    51181 ttaatttatt ttagagtcag agcctcccta tgtcacccag gctggagtgc agtggcacag
    51241 tcacagctca ctgtagcctc aacctcctgg gctcaggcga tccttccacc tcagcctcct
    51301 tcctagctgg gactacaggt gtgtgccacg acatctggct tgtgtgtttt cttttctttt
    51361 tttttgagac ggagtcttgc tctgccaccc aggctggagt gcagtggcgc gatcttggct
    51421 cactgcaacc tctgcctcct gggctcaagc aattctcctg cctccgcctc ctgagtagct
    51481 gggaatacag gcgcacacca ccatgcccag ctaatttttg tatttttagt agagacgggg
    51541 tttcaccatg ttggccagga tggtctcgat ctcctgacct tgtgatccac ctgactcggc
    51601 atcccaaagt gctgggatta caggcgtgag ccactgcacc ctgcctggct tgcatgtttt
    51661 ctgacatact gtcaaaagga tactcatact aaatggcaac acattctcaa gccccttcct
    51721 tttcttctcc tatctgcttt accacacacc atagctgctt ttaccgtttt ctccttaaaa
    51781 actcaaaaaa accttccccc aacctatcta tccacttctt gtcctcccct caaacccact
    51841 ccattttgat tctgcatcat aaaaaccaag cagagttctg gggaggcaga agcctcctct
    51901 tatgatattg gaagggggtg gatgaaattg agcacacaaa gccagaatcc tcttttgtgg
    51961 aaagatgggg gcagtgggca gagggaagca ggctcatctc tttccctctc ttcccttccc
    52021 ttctctttca caccacacgc tccgcctggg tgagctcatc gtcctcgtgt cttcaatttc
    52081 cacctccagg agatcgactg ccaaatttct acccgcaatc ccaaactcca ctctgagctc
    52141 cagacccatc tactaattgc ctagtcaaca ttttctctgt gggatcgaat cagataggat
    52201 tatattctgc tgtgactaac agagactgaa aattcagagt cctaaacaaa agactttctt
    52261 tttcttgtgc tgaaaagtct ggaagtacac agccctaggc tggaataggg atactgctgg
    52321 ggggctccgc ctccttgcag gctgcccctc tgctatctcc agggtgtgtc cctcaaccac
    52381 actgtccaaa gtggtgactg aaggaccagt cctcccactg acattccagg caacagaatg
    52441 gagaagaaca ggctgaggaa ggaggaacca caggcgccct ccagctgttt caaggaaggt
    52501 tcctggaaga tgctaaataa caattccgtt tatatctcat tagccaggct tagtcacgtg
    52561 gcaacaccta gaagcaagaa aggctgggaa atatactgtt tctgctgggt taccgtatgc
    52621 atgactaaaa agagagcgct ctgttcttag gaaaatgaga atagactggg ttgggggagg
    52681 ataaccagca ttggctacac tgcagcccca caacctcctc atgcttgatg tggctgaaat
    52741 aaaaagagca tctcccccta acatttcctt tcccttagtc ttgttccttt tctgtacccc
    52801 ataccctagc agaggggcaa ttctgtgggg tgggctcact ggcaagggga ggtctcaccc
    52861 aagctcagtc aagggtgagc agagagaatt gtacaaggaa gctgaatttc cagctggtct
    52921 ggaggaaaca ctgaaaaccc caaacagaag atgtccaggt ggagcaaatg cagctgagag
    52981 ttcttgtcca aggggacaga atgtcagcag agtaggacac aagggcaggt ctctactgaa
    53041 agcaccaggg gcaaagtcac ccagcccttc aaccggctgg ccaccaagca catcggctcc
    53101 actctgcctc ctgcccgtgc caccctttgc acttttgtct gtaccatctc ctccaccagg
    53161 agtgcccttc ccgctcctcc atgtggtata ctctcagcca cccctcaggc tcagatccag
    53221 agccacctcc tccaagaagt ctccacctgt ctgctgggaa ccctcacatc acactgggag
    53281 acgaagccaa aagtgggggg gtcagtacag aggtgtggga ggtaggtcca atcctggctg
    53341 gacctggctt tgggtgccat ttgggacagg gtattgggga ggttcagcag gaaggtggga
    53401 cctcaaggct ttctttttat gaaggaggag gaaccagctg ttgacagagc agcaacagca
    53461 gggagccacc aaggctgagc cttgaaccct gggccaccag gctcctgcag gcaactggga
    53521 gcagctcagc ctctcacagt cccatttcca gggaggaaga tgatgcagcc tgaggcaatt
    53581 ggccacctgg aggaagttca gctgggtggg gcaggcagag gaattcctgg aaggaagctc
    53641 acccctcagc actggggtag agggacacta gagacagggg tggcccagga gctgcccacc
    53701 tcctatcccc atgaataagc aggtgggccc gggtgaccag ctgggagcca cccctgggtg
    53761 ctcaggtacc atgcttctgg cccagcctcc tgagctgtgg cacaggcaag agcaccggcc
    53821 cacggggtct gcatggggca gtgacctgtt ctctctaagc ttcagctttc tcatctgtag
    53881 aaagagggtg aagagtcctt ccccaccatg gttgtggtga gggtaactga agtgtcaggg
    53941 taggggaccc agtggggcac cagcacacct gaggttcagt caacaaggcc cactggtgaa
    54001 gaaaacctcg cccaccctcc cgtccctcct gaggcaggtg gtggaggcac atgccctctc
    54061 cattgtgtgt gcatgagtgt gtgggtgcat gcatgtgtgt gcatgcaagt gtgtgtgcat
    54121 gtgtgtgcat gcatgtgtgt gcgaatgtgt gtgtgtgcgc acatgcatgt gtgctaagct
    54181 ctggagaaca agcagccagg cttgggcagg agtccggggg caaggggggg acatgcaaag
    54241 ttctctccac agcccctcag tgctaacagg ctgaggtggg ggatgacatg tcttagaaac
    54301 aaacttcacc ctctgcccta aaactgcctg ctggtctgcc acatcgggcc aagctgtggg
    54361 tacttggaaa gagggccctg ctcctgcctt ccaaacccag gaagtcttga aatccccatg
    54421 gaggggacct ggggccaggg gactccccaa gcagacacag cacccacaca agggtctaag
    54481 ggaccccttt ctccccaacc gcctgaggta gggatggatg tggacaagca gctcagggct
    54541 ggggctggcc taggtggaag gttcagaagc acagaaggca ggacgtgccc aaggggctgg
    54601 acatggggtt ggaggctagg ggctggaaaa agagtttggg aggtcagcca agggcagtcc
    54661 acgccttggc tcttctctgg gtcatgggga gccacggaga gtataggagt ggtgatggga
    54721 tggccccaga gttgtgcctt tgaaagtcac ctctggttgc agatagagat ggactggggg
    54781 cagggctgag aggaagggca tctcatgaga ggggaaatcc ctgaccacag tccagaggaa
    54841 aacagcgggc ctggggagag gctgctcctg gggaggacct actcagaggc tgattcttcc
    54901 tcgtgctacc ccacccaggg agatgcagga agctggtgag gtcagtgggt gggcccacaa
    54961 ccatgctgga gaggagcagg agggagacgg gagggcacag agaggtagag ggcagggact
    55021 ctgaggaccc tcatccatgc ccacctggac aattgaatcg agcttcctct agtgtggaat
    55081 gtgggctcac tggggctatt tgctatcctt ctctcatgtc agactggatg cggcccaagg
    55141 gtggggtctg tgactccctc atcagactga ggggggccga aatgtggggg tgggggaagg
    55201 agggagcttc caaacaggaa ggtttggagg gcgctggcac tgaggacagg ggatacgccc
    55261 cccatgagtc ttcacccctg ctcagagcct gctgccgtgg ccctcagaga cgagcccatc
    55321 tttggagctg gagttgggga cagggaaacc aggtggaagg gaaaaagaaa aaaggggcag
    55381 gcatgagtgt gaggggccag accctgcaga ccccaggaca tgcacagcag acacacactc
    55441 caatccctcc tggctctgct cagcctcaaa cgaggtgcag gcctgcaggc cccaaccaag
    55501 gacccctgtt acacacagac acaaagacac aagatgcagc aacacacaca cgtaatacag
    55561 aaacacagtc acacactagc gcacacacag acacagtcag acaatccaag aaacacactc
    55621 acacatgaac agacatataa gcaaagtctt acacatacaa agacacactt aagcacagac
    55681 acaaacacac caacagatgc acagacggac acacatagac gcacactggg aggcggcctg
    55741 tggagactgc tgattggata ccatgttggc ttggtatcca gaattgttgg tgcagcgcac
    55801 tcaccatgcc tgacctcatt aaggaggccc tctggggcag tggctctgag aagaaaggca
    55861 gcaagcctag ggaggtgaca ctgtagctgg ggagcatttt ggctgggcag agaggaggaa
    55921 ggagagcatt tcagggaggg aaaatagccc ttgcaaaggt ttggagtgga agagcaaaag
    55981 ggatgtccca agaactgtgc ctcatttgat ggggctgtgg ggcccttctc tgacgggcag
    56041 ccttcccagg cacaaaggga ggtggactga gaagcatctg gaacttctgt gggggcacat
    56101 ggggcagagg ggaaggggaa caggcagaac gagtcagctg cctttgaaag ctccacgagg
    56161 gacctagcag aaattcattc attcatccgt tcattcattt atttatgaca ggttctctct
    56221 ctgtcaccca agctggagtg cggtggtgtg atcacactta ctgcagcctc aaactcttgg
    56281 gctcaagcaa tcctccaccc cagcctcctg agtacctagg actacagacg tgagctacta
    56341 cacctggcta atttttaaaa ctttttagag acagggtctc cctttgttgc ccaggctggt
    56401 ctttgttgcc cagccagctt caagccatcc tcccacctca gactcccaaa gcgctggatt
    56461 acaggcatga gccacgggtt cccagcccag aaagagacct tcaccccaca aaccccctgt
    56521 ctggaggcca agtcccagca tccccagaag ggccccggtt aggacaagaa aaaggctgtg
    56581 tgcctcggag gagaggctgg gtgccccctg gtgcggagtg gtccccttgc cgggcatggg
    56641 gcctcagtct gtgagactcc gcgcctccgt cgtctgaact tgagcaaaga cagagctagg
    56701 cctagcctgg ggacccgagg gcctgccctc atccgcaatt ctgcctcttt tctggccact
    56761 gaggaccggg ctagggaagc tgtaggggca gtaggggtgg gaggagagag ggttccgggt
    56821 caagggactg gaggggtctg gggcagggaa gtggaggggt ctagggcggg cccttccccc
    56881 tccgctgttg ggccctcgct gaaggggccg aggctgagcc ccaaggtggg cctgtggggg
    56941 cggggcgggg gagcccctca gcagctggac cgcaggaggc cgaccagggt ctggaactcc
    57001 tcttcccctc cctccgccgc cccctctccc ttctcccacc ccctgggctg ctaggggagc
    57061 gggggaggcg caggaggggc tcagggaggg gccctggacg cgggaccagg ctgggcccct
    57121 cggcggaggc ccgcgcaggc aggccccgcc ccggcctcgc acatctgggc cgggagcgcg
    57181 ggccgacccg gcggcgcagg cggcgcggcc atccggcctg ggggaggggg cggcgggcgg
    57241 ggtgggcggc gcgaggaggc gggaggcctc agggccgggc gcacgtcgag ggctgcggcc
    57301 gccgcagcgg gcacggccaa cgagctgcgg gcccgggatc gcggcggctg gacggggctg
    57361 gagctgtcgg gagggcggag gtgagttctg gggcgggggc tgccgggcgc cccgagtgga
    57421 gaaaggcgag gaggtttgcc gtcccgggct gtcggctgag accccaccaa aaacctccag
    57481 actttgtctg gtggggacag atctgcaagc ccctctctgc agcgtggctg cgggctcggg
    57541 aaggcacttg gggcagcgac cttggtctcc cacctgggct tcgggggacc acctcccctg
    57601 tccctcgcac agctgctcag aggcttaggc tggagaaact cctgtgtctt aggggctggc
    57661 aggactggtg tgggctgggg cctgctccag ggagccaaac tgagggaccc ggtgcctagg
    57721 tctgaggtgg aaacctggga gttgagtgta tgtgtgtgtg ttggggggtg cctcaggggt
    57781 cccaggccct gtggtcccga agcgcaacag agtactgagt ggcccaggtg ccagggtctg
    57841 aggctgggac ctggtcttcc gggccagctt gaggtccagg tggggtgggg tccaggttcc
    57901 cgctagggca gtggagccta tctgtgaggt tagcgtgtgt tgatgagaga gactggaagg
    57961 gagtgatcac gaaactaggg accctccgga gagcagacgc agcgcaggaa tcccaggcca
    58021 gaggggtaat ggggggctgt gggcagagca tgggaggggc tctgttgttt gtgaactggc
    58081 tgctcccatg atgaggaccc aagggggtgc ggcaggatgt atggaaagca aagatgagga
    58141 cagggagggg ccatgggagg ccatttctgt cactccctgc cccagcgcag gatggggcag
    58201 ccacctcact cagccacaca tggtcccatc ttcttaaggg ctcctctcca gaacagagtg
    58261 gggcagccct ttgggtgccg tttctgatct gtgctttggg ccctggttgg cccagtctgt
    58321 ggcttggaag aaacgcctgt ggcctcactt agcccgctga cggagtcctg ttccattaca
    58381 gggctggctg ggtgctgagc agagagcagc aggcggaaga ggcagggtcc tagtgaggag
    58441 gggagctgga atgacctggg tgggctcttt gtctacacag gcatcgtgtg gagccctgaa
    58501 acccagaaca ctgcattccc atagtgctcc tggggacatg agcgcccagc tgaaacaccc
    58561 cacacaggaa gagatcctcc gaggattctg tcaggcaaag ggttgggccc ctgcaaggtg
    58621 gctcacgcct gtaatcccag cactttggga ggatgaagtg ggaggatcgc ttgaggccag
    58681 gtggtcgagg ctgcagtgag ccgtgatcgt gccactgcac tccagcctag gcaacagagc
    58741 aagaccctgt ctcaaaaaaa ggagcgggag tgggggcagg gttcgtccta ggctattgta
    58801 agaccagggg atactctggg atgggagtct ctggttgtga gttcctgctg ggccacaccc
    58861 tatcctggtc cccagtgact aatcggccct gcctattggg tagaagtcag ggccaggggt
    58921 cagggtgaag ctgatgacta gaggtcaata tgaagccaag agttaggggt cagtccatgg
    58981 gcagggttag ggatcaggct ggggccagag tttggcatgt gcttgtagcc aggaaatgga
    59041 gttcctttct gtgtaaatca aactgtccca ctcaattact ctcaaaccca taggcaccac
    59101 cagtctgaaa ttagtaggcc cctgaagcgg ggctcagcct cccctcattt tcagctgtgg
    59161 actctgccca gacctagctt tcgggagctg ggcttgtccc gtgttctgtg gtcagcactc
    59221 tccttaagtg tccctgggtg tgtgtgctcg cttggcagtc agccacctct tggcttacct
    59281 ccacctctcg gggaatccgt gggatcctcc ttctttaaca caattttttt tttttttttt
    59341 ttggagacag ggtctcactc tgtcacccag gctggagtgc agtggtgtga tcatgactta
    59401 ctgcagcctc aaccttccgg gctcaagcaa ttctcccacc tcagcgtcca aagtggctgg
    59461 gactctaggc atgcactacc acgctgggct aatatttgta tttttcgtag agatggggtt
    59521 ttgctgttgc ccaggctggt ctcgaactcc tgagctcaag caatccaccc actttgacct
    59581 cccaaagtgc tagaattaca ggcatgagcc accacaccca gcccttaaca cagattgatt
    59641 tagttcctac ctccttcctt aaagaactca tgatgactag aatgcgattc tcagtaagac
    59701 caaaatatct aacaagaaaa ttcagaatct gcatcatgaa gggaaagaac actgctcaaa
    59761 cctgcccaac tccctgcttt tatcaatacc ttgcatatct ggacttgtta acataagccc
    59821 tctgtgtcac ccattctaca ggaaagggca ctgaggcaca gagatgtggc atgatgcact
    59881 cacgctaggt gccataatag gctcttgcca tccctgaaac cccagcatag gggcactcag
    59941 catatcgcac aatgagagac ggtagataac tggacaagct tccgatagct gggaagagta
    60001 taggcccatg gtttggatgt ggaatccagt gtagagcgag gactcagtct tttttttttt
    60061 tttttcttga gatggggtct cactcttctc gcccaggctg gagtgcaatg gcgcgatctt
    60121 ggctcactgc aacctctgct tcccaggttc aaacgattct cctgcctcag cctcctgagt
    60181 agctggaatt acaggcgtct gccactacgc ttggctaatt tttgtatttt tagtagagac
    60241 aaggtttcac catgttggcc aggctggtct cgaattcctg acctcaggtg atccacctgc
    60301 ctcagcctct gaaagtgctg ggattacagg agtgagccac ggcgcccggc ccagagtctt
    60361 ttttttctgg tggttctagg ttgtttctcc tccagataca atagcctcgg tccagaaaat
    60421 ccattcattc attcaacaaa gtggttcccc agtgcttagc acagagcctg gcatgaagaa
    60481 tctcgataag tgtttgctga agaatcagaa caacatacaa ataaggctgc cagctcctcc
    60541 ctcgagacct tgatccagta gtatgctggt taatggtttc caatcagctc tggaagaagg
    60601 ataaaatcct gatttatggt gtttgccaat ttctgtcttg taaatactcc cagccatgac
    60661 tgatttgaag ctaccaatgt gatgtcattg aatgcagaat tgggaagaga tagaaagaat
    60721 cagctcgcat gagctggtgg gacccagctc agcatcccac tccatcatcc tgagctgggc
    60781 tctgggctcc tgcccctggc tcctaagaac cctcccctcc ccctctctgt gctttccctt
    60841 ttctcctcaa cagccagctg ctatatgcat gcccacaacc agcacccatc cttcttctga
    60901 ttctgattcc tgagcatcca actccaatgg tacagaggac agcaagagtg tctggaaatc
    60961 tccatggtgc tggccatgca ggagaatcct ggacatacaa gagtgtgtgg gagtccaggt
    61021 gtctgtccac aatggggctg gtttgtgtga actgtgtgag caagggtgtg tgcatgtgtc
    61081 tatgggcatg ccagagccag ctctgtgtct gcctgtgttc tatagcaaaa ctttcagatt
    61141 gggccgctgg tgacagagaa ggaatgcatt ggccctgcgt agcctcctcc tttgtttaga
    61201 ggagcagcca ggcccaccaa tgagaacata gtgtcagtgg tctgcagact ggctgagaga
    61261 cactgttgag caacacttct caaacttcaa tgtgcacctg aatcacctgg gaatcttgtt
    61321 aaaataggcc agacgcggtg gcttacgcct gtaatcccag cactttggga ggctgagacg
    61381 ggcggatcac ctgaggccag gagttcgaga ccagcctggc caatatggtg aaaccccgac
    61441 tctactaaaa atacaaaaat tagccaggca tttttaggtg gcattttagg caaattaggt
    61501 ggcgcatgcc tgtggcccca gctacttggg aggctgaggc aggataatca cttgaaccca
    61561 ggagatggag attgcagtga gctgagaact gagatcacgc cactgcactc cagcctgggg
    61621 gacagagtga gactctgcct caaaaataaa taaataaata aataaataaa taaataaata
    61681 aataaataaa tatgaaatac atgttctgat tcagcaggtc tgggctgggg cctgaggttc
    61741 tgcatttcta acaagcttcc aggtgatacc aatgctgctg atcctcagcc cacactttga
    61801 gtaacaaagc tgttgatggt ttgaaatatc ctgcctatag cctggcccca gaacttcata
    61861 gacaggatcg agcccctccc atcttagaaa ataaatattc ttcctcacct cccttcagct
    61921 ttagcgccct ccaactactg ccctcacatg tattcctgat ttcagccaag cttttcagaa
    61981 attctagctc aatatgattg cgtcactaca ctccagcctg ggcaacagag tgaaactcca
    62041 tcttaagaaa agaagttcta gctcatgtca cctcttgctc actcctcacc taagcttacc
    62101 tcccttgggc cactaaaaca gtgtttccag tctcatctgg ggacctttta cagggatcta
    62161 gctttaccca tctcttggta caatcttcat gagaccatga ggcatctcag tctcctgggc
    62221 cggtatatat tcctctgttc agcctttaaa tgtaggttct cctcgtgaca gagtgacacc
    62281 ctatctcaaa acaaacaaac aacgaaggcc aggcacggtg gctcatgcct gtaatcccag
    62341 cactttgaga ggccgaggtg ggcggatcac ctgaggtcag gagttcgaga acagcctggc
    62401 caacacggcg aaaccccgtc tctactaaaa atacaaaaat tagctgggca tggaggcact
    62461 cacctgtaat cccagctact gaggaggctg aggtaggaga atggcttgaa cctgggaggc
    62521 agaggttgca gtgagccaag atcgtgccac tgcattccag cctgggcaac agagtgagac
    62581 cctgtctcag aaaaaagaaa agacaaattg agtcatgccc ccaccccacc ctgccacctc
    62641 aaacctttca agatttgctg ttgcccttca tataaaatca aatctgaaat agggtgcaca
    62701 ggccctgcct gcagggtaag cccagcccag ccctctgtcc agacttacca tgtcctcttc
    62761 ccaccccgca ttatccaact ttgaccttcc tagcactcag caacgcttgt catttcaaaa
    62821 tcacttgcct gattatctcc tccatcagac cgttggtttc tagggcaggg acagtgtctc
    62881 tctgggtcac tatctcccag cacctggcac agagcaggca ccagaaaaca tttaggagga
    62941 tcctgattcc aagaaagtaa agccattttt tgacacaagc tgggaactgt gggactttag
    63001 atgagattta aggaattact gttaatttca ttagacatga gaatggtgtt gcgatgacat
    63061 tgtttaaaac gtctttatct gttggagaga tgtattaaag agttcacggg tgaaacaaca
    63121 tgatggctgg gatttgcttc aaaggactcc agccaaaagt aaacaaataa aataaataag
    63181 cattcagatg agggaacaaa gggacaaata aatcccactc tgcttgtctg ttcctgttga
    63241 cagctacagg gcctgcaagg atgttggctg tcccggagat gggcctgcag gggctgtaca
    63301 tcggtaagaa cccccaccat tctgccctga cccatcaccc actcacccga gcaggcactg
    63361 agacccacct acaggttgtg caatgtacct ctcccacaga gcttcccaac agctcttctg
    63421 tgagtgagtg tgtgtgtgtg tgtgcatgtg tgcagggcag gtctgtgttt actggacccc
    63481 cctgtgtgtg cagaaagggt ctgtgtgcac cggtctgtgt atgtgagtat gtgtgcacac
    63541 aggactggtc tattcacagg accctcatgt gtatatgagt gtgggctgag tgcgtgtgtt
    63601 tgtgtgtgcc agaagggtct gtgcacacag gagtgtatgc atgtgcgtgt gttttgttca
    63661 ggctctgtcc acaggaccct gtgtgagtgt gtcagtgagt gtgtgtctct gcgggcaggt
    63721 ctctgtgcac gggatctgtg tacgggagta tagtctgcag gtgggcatgc atacagagag
    63781 ttctgcgtgt gcatggaggg tctgtgtgca caggagtgtt tgaatgtcca tgtgttttgt
    63841 tcaggccatg tgcacaggac cccgtgtgag tgcatcagtg agtgtgtgtc tctgcgggca
    63901 ggtctctgtg cacaggatct gtgtacggga gtatagtctg caggtgggca cgcatgcaga
    63961 gagttctgcg tgtgcatgtg cgtctggcca tagcctggtc tgggagcagg cttgggttct
    64021 ccgcaggggt gggaggccgg aggcaagagg gacactcctc aggaagtcca ttcacaaaga
    64081 cctggtgggc tgggcccttc cttcctgcct cccttcccaa gctcgagcca cagcaacttg
    64141 gcctctcggg ctgcagctgg tggcagggct ggagcctgcc ctggctgcct gaggagaggg
    64201 agcccttgaa atgtggccac tttttccctg cccctccagc acctcagact ctggccctgt
    64261 ccaggcgctt cccagagtcc ccctcatgcc tggcccttcg cctgtgttcc caggttaccc
    64321 ccaaggccag ggagagtgga gacctgcctg gcagcacgtg acaggactgt ggctgaggga
    64381 gcacagggcc tgcctggctc agcctaaggc gagccagacc tgcctgattt gtccatggag
    64441 cttcctacca gatctcactc tccctcccta gccagcctcc ctggcctgtc catggatatc
    64501 aaggctgccc cagtactcct cacatgcccc ttagcaatgc ccagtgtggt ggtggcagct
    64561 accagccagg caggccctcc tgccccataa gccgcctgat cctcggggcg aggcctgggg
    64621 ctgtcctcca gggctactcc actccctccc accaaaggtc cagctcagtc ccaggcgtgc
    64681 aagagggtct tgatggggcg tcctgccctg aaatgctggc tcagggtgtg tgtatgtgtg
    64741 tgtgtggggg gtgggaatca gacttcctaa tgttgtggac acccccagtt caggactggg
    64801 ctatctctat gggtgtggga ggaggtgctc cccctcctcg atcctcctca agcagccagc
    64861 tgggcctgtc tgtggccctt gtgggcatgg gagctgcaga gttccctcaa gtcctgatta
    64921 gaaccacgtg gcccaacttt agggtggagg ggagcaaccc cacctcctag ccagttagct
    64981 ctgtagctcc accttttgaa ggaggagggc ttctctctcc tctactccct cccaggagct
    65041 cacctgccct atctcccctg cccttccatc tctggactga gtctccctgc cacacaatgc
    65101 ttttctgtct ctctcccctc ccacactgtg tgtctgaggg gccctagtgg ggagacagtg
    65161 agctcctggg gaaagcatgg gtcatctgct agggaaacgt tgcctctgac agcagaccct
    65221 cagctcctgc ccctgcgtca tgtgtgtctt tcctacaccc cgcagtcctc ctccccagca
    65281 gagagccctt tgccttagcc cagcaaggac tgatggagaa cctttgggtg cctggcttgg
    65341 ggccccgctg ccatacctac cctgcttgtg ccaggatgaa ctgccgtctc ttctctgcag
    65401 gctccagccc ggagcggtcc ccagtgccta gcccacccgg ctccccgagg acccaggaaa
    65461 gctgcggcat tgcccccctc acaccctcgc agtctccagt aagcccagag cagggaccag
    65521 gtggtgggtg acctggctgg tgtggacagg gtcgtgcgtg gcaaagtcat gacagggcct
    65581 cagctaggaa ggaggagggg atgggggtag caccatgcct cttgtccccg tacactccag
    65641 tcatgtgccc cccagaagga ttggccttgg gtgaccctgg actataaagg gtaacaattc
    65701 tactgagaag gcgagcccca tgtaactaac tcccaacccc cccacacagg aggcacccac
    65761 ggccattgaa gctggcctgg ctgaggcaag tgttcccctg agcatgtggc ttttctaggt
    65821 ccccagagca ctgggcctct tgtggaccac agcccacctt aatccaagtc atatctgaag
    65881 gagaggagat gtgttgccaa aaaaaaaaaa aaatagtctg gaaaccatta gcttgggctg
    65941 ggcatggtgg ctcatacctg taatcccagc actttgggag gccgaggtgg gcagattgct
    66001 ttgagctcag gagttcaaga acagcctgag caacacagcg aaaccccatc tctacaaaaa
    66061 atacaacaat tagctgggcg tggtggtgca cacctgtaat cccagctact cgggaggctg
    66121 agggtggaga attgcttgaa cctgggaagc ggaggttgca gttagccaag atcgcaccac
    66181 tgcatttcag cctaggtgac agagtgagac cctgtctcaa ataaaaagga aactactagc
    66241 ttgaagtatc caggacacca cctccccacc cccaagctcc ctgggtcacc ccacctggat
    66301 gaccttgccc tggtgatcca gcttagggac ccccttattc ttaggggagt ccagccctgg
    66361 cataggagaa ggcacacact tctctgggga catgcacgtg ccctctcctc tctgccacag
    66421 aatcagacag tctggtttgt gtcactatgg ccacaaagag caagaggata aaatatatac
    66481 actggtccat gtgctgtaaa actgcctgga ggagcaaaat tgctcccagc tgggcccaga
    66541 ctaatggctg cctatgaaga tggtgtgagg tgggggtggg gctaaggcag tccagaggga
    66601 gggtgggcag aggctggaag caggaaagca ggagcttgag ccaaaccctg cctggggcta
    66661 cctgagagac acgtccaagg ctcagcctgg agcctgggag ggcaagggag cccgagcagg
    66721 tgggcaggtg gggtgggcca ggtccagggc tggtctggac cacagtcagt ggaggggagt
    66781 gtcttcccca tgcagaggaa gcattggggc tgtgggggat gggggtgtcc ctgctggctg
    66841 cttgctaatt ctaagtcttg cacttgcaga aacccgaggt ccgagccccc cagcaggcct
    66901 ccttctctgt ggtggtggcc attgacttcg gcaccacgtc tagtggctat gctttcagct
    66961 ttgccagtga ccctgaggcc atccacatga tgaggtgagg tcggctgggc tgagagagtg
    67021 aggtggggaa ggtggggagt tcctcatacc ttggtcccaa aagtactgtc accgagacat
    67081 ggggtctcct tggaggccct gccaccccca gtctggggct ccccactggg gtaaaagttg
    67141 gaggatgggc cccaggcctg ggtccttggc ctcaactgga gcagctccca accctctgta
    67201 ggaaaagtca gactttgttt ttaattttga atttttttga gatataacaa acataacaat
    67261 aaaatatgca aatgtttttt attttccttt tttttttttt tgaggcggag tctcactctg
    67321 tcgcctaggc tggagtgcaa aggcacgatc ttagctcatt gcaacctcca cctctcgggt
    67381 tcaagcgatt ctcctgcctc agcctcccaa gtagctggga atacaggtgt gcgccactat
    67441 gcctggctaa cttttgtggg gttttttgtt tttttttgag acggagtctc actctgttgt
    67501 ccaggctgga gtgcagtgac gcaatctcag ctcactgcaa gctctgcctc ctgggttcat
    67561 gccattctcc tgcctcagcc tcttgagtag ctgggactac aggcgaccgc caccacaccc
    67621 agctaatttt tttttttttt tttgtatttt tagtagagac ggggtttcac cgtgttagcc
    67681 aggctggtct caaattcctg acttcgggtg atccgccagc ctcggcctcc cagagtgcta
    67741 ggattacaga tgtgagccac caacctggcc tttttttttt tttctttgag acagggtctc
    67801 gctctgtcac acaagctgga gtgcagtggt gtgatcatag ctcactgcag cctcaaactc
    67861 ctgggctcaa gcaatcctcc cacttcagcc tcctgagtag ctaggactac agacagcacc
    67921 accacacctg gctaatttaa aaaaaaaaaa aaattttttt ttggagagac agggtctcac
    67981 tgtgttgcca gggctggtct tgaattcctg gcctcaagtg attatcccac tttagcatcc
    68041 caaaaatgct ggaattgcag gcgtgagcca ccatgcccag cccacaaata tttttctaag
    68101 cttgtagaca aacacacaca tatacaaaca tatatgtgtt tttttaagta aaagtgagat
    68161 atacaacttg ctttgtttta aagcttcaac actattttgt agatatttca tgtcagcaca
    68221 tacgaagcta ccactttctt tgaacggcct agtattccat agtacagatg tgtcataatt
    68281 tatccattcc cctattgaca gttttagggg ggttttttcc tcctctcatt tttcactatt
    68341 acaaaaagtg cagcagtaat atccttcttt gaaaacttgt gccgcaatct ccatggggta
    68401 gattcctaga ggtgattcct aggtccaaca tctcaaatct tttttctttt ccttttttag
    68461 taaaaaggaa aagaaaaaat aataaaacaa gaaataaaaa taaaaacaag aaaatgaagg
    68521 ttctaagggc tgagttgaca agtcacacag ttgttttgaa agacatgttt caaaaatcca
    68581 ttcaaatgtt gggttttatg tgggacctca aggaccttgg gacctcaaga ccccagtcct
    68641 ctgaatatgt cctagagggt cggcaggcag taggtgaagc tacaggagac ctcaaagctc
    68701 tcaacccaga ctgggatggg aggtaagagg ggcttcccaa ggcctggaga ctggatggaa
    68761 gggtgaatgg agacatagac agacaggtaa ctgctcccaa aatacatggg acttgagctt
    68821 catgagactg gactgtcctt ccttccacat cacaggaaac cacaaaggct ctgagcactt
    68881 acagccctct aggagctgca gggcatccca gatgtcttgc tgatcaacac acatggctga
    68941 ggccatgctg gggctaagga cacatccagg aacactgtct ctgtcctgtg gaggtgacag
    69001 tcacaggaaa taactggggg acattccagc ctcctcccaa gctctctctt cccaccagac
    69061 accaagcccc taacatctct agaataccca tctacttctc tccatggcca ctgatcgtgc
    69121 ccaagttcag ccccgacctt ctctcacctg gatggctgca atatcctcct ccttggtttc
    69181 ctttctccat gctcccttcc ctgcattctg ttgctccatg ggagccagtg acctttcttt
    69241 ctaaaggcaa atgtggtcat acctcttggc tactttaaat ccacgaagac agaattctga
    69301 atggccttct gtgctcttca tgatctcacc ttagcattct tgtgttttct cctgctgcct
    69361 atggagccga cccatagcta gttctccctg ctctttcaca tctgtaagtt tttgcacatg
    69421 ctgttccctc tgcctggatt cccacccaag aagggagggt tgcactgact gccctgtgcc
    69481 tccgcaggaa atgggagggc ggagacccgg gcgtggccca ccagaagacc ccgacctgcc
    69541 tgctgctgac tccggagggc gccttccaca gctttggcta caccgcccgc gattactacc
    69601 atgacctgga ccccgaagag gcgcgggact ggctctactt cgagaagttc aagatgaaga
    69661 tccacagcgc cacggtgagt cacagggctc cagacaggga ggcggggcca gcatggaaaa
    69721 gggcagggct aatgggggtg ggtgggacaa aaccaaaacg tgtgaggacc ggcccgatgg
    69781 agtcgtggct gagagggggc ggggctaaag ggagacgtcg gactccggtg tgggcggagc
    69841 tcagaaatga ggtggaggcg gggctaatgt gggtggggct aatagtgaag ctggggttgc
    69901 aggaggggtg gggctaagga gaggggtcgg ggcagagcta atgtcacatg gggcaagagt
    69961 gggacggtgg taaagaggag gggaagctcc aggaaacggg gtgattttaa gagcgaggtc
    70021 gtcaggaaat gagtgccaag ctgaggcctc ctgcagagcc gccctgtgtc cctgccagga
    70081 tctcaccttg aagacccagc tagaggcagt aaatggaaag acgatgcccg ccctggaggt
    70141 gttcgcccat gccctgcgct tcttcaggga gcacgccctt caggtgcgct gcggccccac
    70201 ctctgccgac tgtggcaggg accccctatt ttcccctcat ccgaaaccgc tcccccatcc
    70261 cgtccccgac attggatggg tagccaccgc cggagctcag aggtcatctt ctccagtacc
    70321 ctcctccctt tttgtctggt agagcctgca ccaagccata ctgatgggag gggggccgat
    70381 tcttccagct ctgctgggaa gtccttcctg tgatttgatt agtacctcca gttccgcaga
    70441 gggctgaaga ccaccctccc tccaagccag ctttcctctc actgccccct cctgtaccag
    70501 gagctgaggg agcagagccc atcgctgcca gagaaggaca ctgtgcgctg ggtgttgacg
    70561 gtgcctgcca tctggaaaca gccagccaag cagttcatgc gggaggctgc ctacctggtg
    70621 aggacgtgca ggcgggcccg agaacactgc tcaggaaggg ccaggcctgt ccccatgctt
    70681 gcatgcaccc caccaccctt gagaccacag agtcattgtg gaaagaactt cagcctgctc
    70741 ctgatgggag tttgtagagt tgctcccacc agaagaggga gtgggcctgc aggaacaggg
    70801 gacagaggga caaaagacac agcccaggcc agtgtagaca gtgccttaag tcaggtgtcc
    70861 taaaagcaga gcccaagatg gagagtcttt tttttttttt ttttttttga gacagtctcg
    70921 ctctgtcgcc caggctggag cgcagtggcg tgatctcggc tcactgcaag ctctgcctcc
    70981 caggttcacg ccattctcct gcctcagcct cccgagtagc tgggactaca ggtgcccgcc
    71041 accacgcccg gctaattttt tgtattttta gtagagacgg ggtttcatcg tgttagccag
    71101 gatggtttca atctcctgac cttgtgatcc gcccgcctcg gcctcccaaa gtgctgggat
    71161 tacaggcgtg agccaccgcg cccggccaag atggagagtc ttatatgtga ggtctattga
    71221 agaaggtttc tcaggagaaa ggcgagggag ggaggcggga caggagaagg agctatgaag
    71281 gatgtggtct ttgctggagt ctagcctcag gtttagcctc agctagatcc catggggagc
    71341 tgcagaggga gaactgtagc acagagttgg ggccggcttc ttgcacatcc gtatcggtct
    71401 gtcactggtt ctggggaaat gggagggaaa cctctccaag tgaggccatt cccattcagc
    71461 tgagggcagt tatccagagg aggtagcagc tgtgagccaa tagcagccaa cactcacagt
    71521 ggcaggcagg gcacccagaa cattcacttc acttggcatc agtattttga gggacacccc
    71581 caccaccgcc cacctttgtt ggttcttgag caggggaaat aggactatga aataaaggga
    71641 ggcgattcta gccaatgcat acaggtcttc ctaccactga gatgctcaaa ggtctagagc
    71701 aaggagttgg ggggggctcc cgtggagaca gagagggcat agcctcaaga ctcattgctt
    71761 ggtcgtaacc ctagagcctt ggatacagac cctaagctgt ccaaaaggga tctcaagcct
    71821 gcctgtggac accctggagg gacccccaat cctcccatac ctatgggcca ggcatgaccc
    71881 agggccaggt ctgtaaatgt ctacactttt cactggaagg agctaaccct gagcacagtg
    71941 gacatccctg tgcaaagatc aggacggaag caaggctgtc cccagccaga acaaacacct
    72001 gctcccccag cgtccccttg cctattttcc caccctcctt ctcacgctgt tccctggctg
    72061 ctcaggaaag tctggccctg ggtaaaagtc tccattcaca cccttatccc tctcccctat
    72121 ctggtcagaa tctggggaac cctacaaaac cataatatac ttcccacagg gatcccctac
    72181 cctgaaggaa ataactccag atctcaagtg tttttctcca cccaactggc ctgtcttggt
    72241 gcctggaatt tcagcttcct ccctgctgta tcagattccc tgggaacaaa gttctcctgg
    72301 aagtggggtg gcctatggct accgtctgac accagcctag tggagaaaga gggttcaatg
    72361 agctaaaagg gctccccacc acctcctctg agccatcaca cacccccaca ttgtgctaga
    72421 gtctctgcca atacccccag ccagcgctca gctggccgct tgtacctgtg aaggggaacc
    72481 ttgctgcagc cccgctatct tgggaaattt gaaggggcag atcccagggg ttctaggtct
    72541 gcattctgtc tagagtcttt tcctgctggg caacttcttg gcatatttgg ccagggccat
    72601 tccctcccca gcctggtcac agcgtggcta tgaggcagct ccaaatttgt gcagcacaga
    72661 ggggcctgag aggcctaaca tgggtggggt gtgatggagg aggaggtggg agccccacag
    72721 gccgcggttc cccacctcac agtgccatct taggtgtgac agcccacagt gctgcctgac
    72781 cctgcccacc acccatcccc aggctggact agtgtcccga gagaatgcag agcagctact
    72841 catcgccctg gagcccgagg ccgcctcggt atactgccgc aagctgcgcc tgcaccagct
    72901 cctggacctg agtggccggg ccccaggtgg tgggcgcctg ggtgagcgcc gctccatcga
    72961 ctccagcttc cgtcagggtg agctgccccc ggggacacca cccacccctg gagggtcaga
    73021 gggtcactga agccagaagc tcagccatgt ctagtatgaa ggggagaggg tacccaccct
    73081 ggaggagccc aatttgagca ggcagaaaca tggctggtgg agcctgtctg aggaagggag
    73141 gcacagccct gcccaagggc aacctcgtct cagggtggga catgccctac ctagggcagc
    73201 ccaatctcag ggtggagatt tcaccttgcc ctaggggagc aaagtctgag gagggcagga
    73261 ctccgcccag ccctgagtct gagggttggg ggagacacag actaccccag gaggaaactg
    73321 ctggggtact gacaaaggag gaagtcagcc cactgcatca gctgtcccct tcactcccct
    73381 catcttcccc gacacacatc agccccaagc tcctgctgcc aggaccaggc acccaggcca
    73441 aggacagcta tggtcatcct cccaaatgcc atctcccagg gcagggagaa gccctaagtc
    73501 ctgagtcccc tctgagactc caaagaccta cctgcctccg ctcctccaaa cccctctaac
    73561 cctgattttg ccatgacctg agacctcctg cgttaaagga aggccctgtg tccataaata
    73621 tttccccaca gctgttggat acagggtggg agtttggggt tcaggattgc cctctcccag
    73681 tcaggagcag gttggagttt caggagcact ggctgctccc agtgcccatg gaggtcctgg
    73741 gcaggaggat gggagttgaa cgccatagct ggagcacctc cttctaatct cactccctgc
    73801 tgtctcctga cccccagctc gggagcagct gcgaaggtcc cgccacagcc gcacgttcct
    73861 ggtggagtca ggcgtaggag agctgtgggc agagatgcaa gcaggtaggg ggaaaggggg
    73921 acggagtgtt atccttggcc cctaccgggc accatatact gatgggggga agggcatgtt
    73981 tgcaaagccc gtctcttcct cctccattcg ctgtacccaa cctggccgtc ccctcacagt
    74041 cacccgcacc cccaccccac tcacagcggc gcccctaact cccactcctc caggggattc
    74101 tccgcggacg ctcgggtgga gttgcagagc ctctggaacc atttctgccc cacaccctgc
    74161 gcccatatgt ggtggtctga ggttcaagca cctgaagccc ctcacgtccc tcccccgacc
    74221 ctgcagacag gccttgggac ccggggcagg gctggaggct gggcgaggct ggagggggcg
    74281 cagggctgag ggtgcgaggc cgcccacgag tgtgtgcccg cgctcgccgc cgcaggagac
    74341 cgctacgtgg tggccgactg cggcggaggc accgtggacc tgacggtgca ccagctggag
    74401 cagccccatg gcaccctcaa ggagctctac aaggcatctg gtgagtagcc aggcggcgcc
    74461 ccggtaccca gcgcgacccg ggctccggcc ccgccactgc cccctggcgg cccggcgagc
    74521 gctgacgccc tcttcgcccc ctgctccacc ccagggggcc cttatggcgc ggtgggcgtg
    74581 gacctggcct tcgagcagct gctgtgccgc atcttcggcg aggacttcat cgccaccttc
    74641 aaaaggcaac ggccggcagc ctgggtagat ctgaccatcg ccttcgaggc tcgcaagcgc
    74701 actgctggcc cacaccgtgc aggggcgctc aacatctcgc tgcccttctc cttcattgac
    74761 ttctaccgca agcagcgggg ccacaacgtg gagaccgctc tgcgcaggag caggtgggtc
    74821 ctgagcccgc gggctcaggc agggtttgcc gacccgggaa tgaccgtgca ctggagggtc
    74881 ccgggcccca aggaacggtg ggggtctgcc tgattcatcc cacatataca ctaagccagc
    74941 agggcgtcgg ggtggggcgg cggggagcgg cgagtgagtg ccccagccca gcaggctcca
    75001 cccacggaat ccgcagcccg aactggggca agacagagaa tcatagcggg gaggcggcaa
    75061 tgcctatctc ctcccagcct tctctacacc cccaccccgg gccctgcggg cccatgctcc
    75121 tcggtttccc tgcaccaaag caaggggagg cccctcccag gacctcgtac ctggaacctg
    75181 gagcaggctg gcaactaaat cctctgagtg agtagggtgg agataaggga ctaacatccc
    75241 gcaggtccag tcctccagac accacgtgca gtcggtgccc aggcacttct gcctggaggc
    75301 agaggtagag aataaggacc acggacccca aactggggca agcagctggg ccctgaccga
    75361 tggatatttg cccctttcac caccaacagc gtgaacttcg tgaagtggtc ctcacagggg
    75421 atgctccgaa tgtcttgtga agccatgaac gagctctttc agcccaccgt cagcgggatc
    75481 atccagcaca taggtgagca cctgagcttg gtcccccacc cgcccctaca tgaacaaaca
    75541 gatgcagaat aattcccccc catcagtgcc tagatacctc cacacatcca tacactgtga
    75601 tgagacctag aatcatctag aacacctgcg ggatgaagtg cagtggtgat taagagctag
    75661 agggttgata tgtagtcttg ccaaggcaaa aaacttctgg tgcctcagtt tccccattta
    75721 taaaatgggg tgatagtatt gggttctcaa aacgttatca cagggataaa atgagctgaa
    75781 gtacctagag tgagcacaat gtcttgcaca caatgtctag gtgtttaata cgtgtaaaat
    75841 gcatatcctt atctctcgtc ctccacgtcg tggtgggaga gaagtgggga gcgtgagtgt
    75901 tggggaggcg aagccctcga ggactcccgt gagctctcaa agaaagtgct caaatggcta
    75961 ctttctagtc gccaggtagg tacaggctag ggaggggagg cgccggtggc cgcctagtgg
    76021 tggcctcagt ggctctctct cccccgcccc ttctcctctg cccccttcac ccgcgtcccc
    76081 ccgtcctgtc ccgcagaggc cctgctggca cggccggagg tgcagggtgt gaagctgctg
    76141 ttcctagtgg gcggcttcgc cgagtcagcg gtgctgcagc acgcggtgca ggcggcgctg
    76201 ggcgcccgcg gtctgcgtgt cgtggtcccg cacgacgtgg gcctcaccat cctcaaaggc
    76261 gcggtgctgt tcggccaggc gccgggcgtg gtgcgggtcc gccgctcgcc gctcacctat
    76321 ggcgtgggcg tgctcaaccg ctttgtgcct gggcgccacc cgcccgaaaa gctgctggtt
    76381 cgcgacggcc gccgctggtg caccgacgtc ttcgagcgct tcgtggccgc cgagcagtcg
    76441 gtggccctgg gcgaggaggt gcggcgcagc tactgcccgg cgcgtcccgg ccagcggcgc
    76501 gtactcatca acctgtactg ctgcgcggca gaggatgcgc gcttcatcac cgaccccggc
    76561 gtgcgcaaat gcggcgcgct cagcctcgag cttgagcccg ccgactgcgg ccaggacacc
    76621 gccggcgcgc ctcccggccg ccgcgagatc cgcgccgcca tgcagtttgg cgacaccgaa
    76681 attaaggtca ccgccgtcga cgtcagcacc aatcgctccg tgcgcgcgtc catcgacttt
    76741 ctttccaact gagggcgcgc cggcgcggtg ccagcgccgt ctgcccggcc ccgccctctt
    76801 tcggttcagg ggcctgcgga gcgggttggg gcgggggaaa cgatagttct gcagtctgcg
    76861 cctttccacg ccctccagcc ccgggggaga taaggtcatg ggagagtggg tggggacaca
    76921 cccagagact ggctttggga ttgggcactg gtccgctgac tgccaggctg aagggacccg
    76981 ccaaggactg aacgggtaag agaagaggtt tgcaagacag agcgcgcagc ccggcaaggg
    77041 gcatgtgacc ccgaaggaag aacgcaacag aagagtcctg gtctgaactt ggccgagtag
    77101 gggtgggggt gggatggcag gaggagccgc aggaggaagg aggttgtgca gggtctggac
    77161 ctgcagggct gaagttcact catcgaccga ctcagcccca accgggagcc aggcagaaaa
    77221 accctgtgcc gtaggaaagt gactggaagt ggactccaga gggacaggtg tggtggcaca
    77281 gtcctggtgt ggtgctgacc acccaaatat gactgtgaat tgtggaaagg gcagtagatc
    77341 tctaatgtgg aggtgggaac attattgtgg tggaggcaat tatgagggta gcatttcttt
    77401 cgagacaaaa cacccgtctg ggaaggcccc aaggtcagct tatgaaggac cccacttgca
    77461 ccccacccca gccatggaag agcagctgga gggtggatgg ggaggccaga gggagcaatg
    77521 aggggtggtc ccagctctgc tattgactcg gtatgccttt aggacattct cttaccgctc
    77581 atgggcctca gtttcctaaa gtgtgaaatg tcaggcactt ccctctaact ggcatgcaac
    77641 agccccacct gcctgagagc cctgaggtga caataaaaca tttatgctca aggggaagcc
    77701 acagcctgct gatatggcgt ggagacccta atagtgggag gaatgcaagg gttcccggtg
    77761 ctagagagag aagggagaaa gctttcagct gtgcataggg aactgaccag aagggggtgc
    77821 tgctgtctcc catcaagcat cccaaacaac tccactgctt aagacctctc tggcctacac
    77881 atgaggtccc tctctcctca ttcaaattaa ttgtcttggc agccagcttc tggcctaaaa
    77941 tgccaccacc tgtgcatacc tcttgtgggg ctaggtgcta taataccacg cggtgcccct
    78001 gcctcctgag tgagtctacc caagtctttc cctggcccat ctgcaaagga gtaggcatta
    78061 ccccaacccc agagaacaaa aatccacctg gcctccggta tccactggaa gtttatttct
    78121 ttagggttct atcccaacca gtcgcttaaa aaccaagtaa cacagacctg aggggtgggg
    78181 gctggggact gcacctccct cctactcatg gtggacagca gtggggacta gggaggggca
    78241 ggagaggtgg ctgaagcaag gcagcagtaa tggggccacg acgccacaga gccagctccg
    78301 tcctctccca gaccctggtg ggagtccctg tggcttgggg tggggagtgg gggacccacc
    78361 ccaggccctc cctctccctt cctcagacag cctcctttcg ggctcaaccc atttcttccg
    78421 gcaggagact gaggcacaca gagaggagga agtgggagag gaggacgagg gaggggcagg
    78481 gtggcagcac aaatgaaggc agaggtgaga ggcgtgggca aggccactcc acccccacac
    78541 ccaccccaga gaggggcgag gaagccacac catcacgcag catgtcgggg ggacaaggcg
    78601 gggtttaagg ctgaggggcc cggggcaggc ggggcctcgg gcctcagtca aagccgtgcc
    78661 agtcgctgtg ctctgagtcg tattccagct cggcgcccac acacttgaca ccatccagca
    78721 gcatgggcgt gccgtggtgc cggtctgcaa ggcagggtgc aagtcagtgc catgctggcc
    78781 cccggcccca cccatgcggc ccactaaggg gacccctccc cttccctcag ggatcagctg
    78841 gaggtaggga cctgccaagg aggttgagaa cccctgagcc gggcaaggat cccttgttca
    78901 gccttggttc cctgaggagg acagaaaccc tcgcagtcga gcttgtgcat ccctcctcca
    78961 accaggagcc tcacgctctc acccatgacg cgggcctgca ccgtcacgcg cacacaggtg
    79021 ccagtgccac cttcgcaggc tagcagtatc tcctctttga ggacgccctc tttgtgcgcc
    79081 gagtactcac acttgacact ataacctgcc cggggacagg ggcaattggt cagcacctcc
    79141 cccagcctcc ccgatcctgc ccctggcact cacgggcccc tacccaccac catgggggga
    79201 ctgcatgcta agccccccca agaaaggggc agggatggcc cttctggagc ccagagaccc
    79261 attcccccta gcaggggtgc acaggtctca aaaacccatt cttcagtgag ctggacatgc
    79321 ctccagcctg tgggccaccc caatgtggct ccaagagtga atgaagtagg gtccaattca
    79381 ggcttccaaa agaaggtctg gctgttttct cccaaaagga aggcagggag aggcggtgat
    79441 gaggagtgag gggggcaggg cagggtaggc tttgagcaga tccgatggca agaggtaaag
    79501 gcctgagggg tgtcaatcca ttgaagggca aggtcatgaa gaggctggtg acggggacat
    79561 gaaactgaag ctggagctcc acgaaactga caccctagcc tctgccaggc tgggagtggg
    79621 gcatggggca gggcctgaag tgagcctgat aggaacagca gcctgaaacc caaggtctgg
    79681 gactggtggg tgctaggagt tatccacacg gtgtgtgtag ctccagcagg tttttctaga
    79741 gggttaggga ggcaggaggg aaggctggag gcttcaaacc agttcctcag cagctcccat
    79801 cttggttact gccccacgga ggtaaccatc acaccatggg caggtacagg gaagtatgag
    79861 ctcatggact tctgttcagg acagggagca aggcctggag tgtgggacct gccatgctgc
    79921 cacagtgcaa gctcacaaag aggtgccaca tccccgactt gctgagctgc ccatccaccc
    79981 tagttggaag taagggagca ccccatgttc tcactcccac acccatccag gccctgctgg
    80041 aggaggggac gcaccttcag ggacgggcac cacgctgagg agcttgaggt gcaggctggg
    80101 gacaggtgcc tcgcggacat ccttgctcag cctgtgcact gggggcagag tgaaggtaat
    80161 ctcatacctg tgcaggatct tcaggaagcc aacctacagc aggagaggtg agggagggag
    80221 gcaccttcac ttcctgcttc ccgcaggccc acccagttgt gccacctatg atcccatgcg
    80281 ccccagcacc tgcttcccca gagagcccag aaagcccaac tccctccatc actccagctg
    80341 atcgaccccc agcccagtgg ccccagccct ttcacagacc ccttgagggt cccagcccta
    80401 cagcttggcc agcaagatcc tcagcatctg tctcacgggc cccaaagtcc tcagtggctc
    80461 atttcataga caggaaaatg gaaaccacaa aatgactggc caaaggctat gtagtgtggc
    80521 cagggtcaag tcctccctag acttcttggc cattcctgct atgcatggca cccagcaagg
    80581 caccacccac tgcctacaag ggaaagttca aagtcccatg ggtcattcac ctgtccccat
    80641 cactgccagt atcccagggt cagggatgct ggtcctccat tcgctcatag gatgacacca
    80701 ggccacagat agcgtgggat gagggatggg atcaaattag agatatgaca gtagaccagc
    80761 tgtgtcagcc cagagctggc ccacaggcct ggcaccagcc actgaactgg ggctcctggc
    80821 tgccctgctg gcactggact ggggatgcct gcctgcccac tcaaaggctg tgtccaatgg
    80881 cccatactca cgacccaagc aatttctgtt gttcattcac agtgcttggt gacaaccacc
    80941 ctctgggaca aatgccattc ttggaaactc cagtagtatg aagggtcaca agccagggtg
    81001 gttgctgagc aggggggctg gtgggggtgc cacgccagca ggcaaagggt gcctgctgta
    81061 ctcccagctt gcatgggcac acagagtcct gctcacagtt actggggctg ggcagcccca
    81121 tccctggggg ccaactggga ctggctgcag agagttttag ccattcaatg ggaccaggtt
    81181 gatattgcta catgacaaac ttcaatccat gtgcccccct catgctctgc tggtaggggg
    81241 cacatccctc aagaaggctt tctggctgta cctgctaact gtgaccctgt gacccagggg
    81301 tactacttat agaattttcc aggggaaata gagatgtgca gaaaaatact cttcgctgcc
    81361 ctatttataa caatgaaaac aaaaacagcc gtaacaccca ggaacaggga gccagctaag
    81421 tcaatgatga cacgtatcat ggaccagtag gcagttgtaa aatctcgtgg aaaattattt
    81481 actgacacag gtaggagaca gttcacaaca gaagcaagca aaaaccagtg tggacacggt
    81541 gaggccagct tttgtttttg gaaacaaagc tttttaaaat aagctggatc catcaccata
    81601 gtgttttcag aaaaataaat aaataaataa ataacatttt aaaaagctgg aataagccag
    81661 gtgtggtggc tcacgcctgt aatcccagca ctttgggagg ccgacacgga cggatcacga
    81721 ggtcaggaga tcgagaccat cctggccaac agagtgaaac cccgtctcta ctaaaaatat
    81781 aaaaaattag ctgagagtgg tggcacgcgc ctgtaatccc agctactctg gaggctgagg
    81841 caggagaatt gcttgaacct gggagctgga ggttgcagtg agctgagatg gcgccactgt
    81901 actccagcct ggagacagca agactccgtc tcaaaaagaa aaaaaaaagc tgaaataaca
    81961 cacatcagaa tattaaccat ggttattttg ggatagtatg actgtgggta cttttaaatt
    82021 tcttcatatt ttctctgtct tccaaatttt ctcatgtcta ttaagacatg agtggatttt
    82081 gcaatgaggg gagagagcta tttttaagtg ttggtttgtt ccatttactc tcttggggaa
    82141 gctgtcagct gtaggacaaa gagtgggaac ttcagagttg gacagaaatg gatacaaacc
    82201 tgggcccccc aggttctttt ttgtttgttt gtttgagaca gagtctttct gttgcccagg
    82261 ctggagtgca gtggcacaat ctcagctcac tgcaacctca gcctcctggg ttcaagcaat
    82321 tcttctgcct cagcctccca agtagctggg actacaggcg tgcgccacca cgctcagcta
    82381 acttttgtat ttttagtaga gacgggtttc gacccattgg ccaggctaaa cctgggcccc
    82441 ttttgagcag cgcagcagac cccccactca gggccatctc aatgggccag acctcccccg
    82501 acccaagggc actcctgttc aactctggac ccctgattta ttgaccaacc tgaaggcctc
    82561 agtctccatt ttccccaatc cagctccacc aagcagaaaa cagggggtga taatactact
    82621 tcaaagagcc taactgagcc agagatttgg caaagaaggg ttaaaaaaaa agttgcacct
    82681 tgttattgcc atagttagct tcacacctgt atcacataca tacattcttc cattctcaca
    82741 ccaactctag gggatgtgat tgtctccact tgagagaaaa gaaactcagg tgaccttccc
    82801 aaggtcacag agccagaatg gctggcctag acctgaactc aggccactgg caccacagcc
    82861 acgacactgc cttccattgc tattgtctgg gtgaagcaga tgacagcagt ggctctgctc
    82921 atcttgatct ggatgacatt ctaaatgctc tcatctcatt gaaatctcac tgcctcccat
    82981 ctgacagatg gggcaaccaa ggcacaggga aaggcacaga catgcctata agcccggctg
    83041 agaggaacgg ggataggcac ccagaagcca ggcagagccg gggaggggaa gaagctgctg
    83101 atggggtagg tgtgcatgta ccttgaccag aaagctgctg tcactctcct gggtgaccat
    83161 gaccaccgag tcatgcagct tctcatcaaa gtggacgtgg ctgtgggatc cttctgcatc
    83221 gtggcctgcc gcaaagcgga tactccggac tctgggcttg ttgcctagga agagtgggag
    83281 agggctctga gggctttccc tgtccccagg aaactcctcc cccttgtccc ctcgtcaccc
    83341 ccaagactgc ccttgacatc atccagctcc cactggctgg gctcttttca gggctagatg
    83401 gacactagga tcatgaggct tgaggcctcc ccgccggccc cgacctgccc ctcccacaaa
    83461 aatccatccc ctgagagtga gcgtgggggg actcagggac tggctaatat gacccttgct
    83521 tggagggggt ggggctggtg ccagagccag gaagggacag tcttccagac tcccaccaag
    83581 ccagggcagc gtgggactca gcccagtcct ctggataccc tgcccagtgc tctccttcac
    83641 agtgcaaagc tccccatccc tggggcctac ccctaagctg tgtcagtgca tgaggcccct
    83701 gcctgcccat cctcatgacc caaacctgaa agggcaggga caggaacagg ctctggggga
    83761 tctgcctggg gtggcagaaa caggggggat caaaaaacac actcaggcca cctgggcgag
    83821 gctgcagctg ccagtaaacc tcactggccc ctggcagcaa gaagagagaa tccaaggaag
    83881 cttccagacc cacctcaaga cccccttcct tcctctgtct agcccaccat caaggccctc
    83941 agctgtctct gagctgggtc gggttctcca aggccagaaa gggcagacgg atgagacagg
    84001 cagggtaggc gggagccgag ctgaaggcct ggactggatg aatgctcagg ctgtagaggc
    84061 cgagagcacg gccctagtcc tctctgaccc ctggccccta ggccctccca ccaagagccc
    84121 tgcccagggt cctcttggcg ggaggacctg atccagcttt gagccggcct cttggatact
    84181 cggaggccca ggggaccagc ctggcacatc cacaatccct ggtgagcccc acagaaccct
    84241 caaatctcac cagatcaccc tggcctaaaa ccctccctta tcccttctgg actctgaaag
    84301 aaacccaaac tcccttccac agcctgctgt ggccctccca catcaccagt ctcagtgctc
    84361 accataagcc cctcggctcc cccaaccccc acagtggaac ggcagggaca caagagcaca
    84421 cacctccaca gaccagccta aggagtcagc agggctctgc aggggtcctg ctagggcttg
    84481 cgtgagcccc agaacctggc acaaacactg acatagtccc aatcacattc gcacaccagc
    84541 acacacgcac tcacacatgc acacacacac acacaggtgg tgcgggccct acctgcagca
    84601 atggccctcc taccagggcc caccggaaga gcttgtgcaa gtcctaccac gccaggaagg
    84661 cacttaccct tgttggctgc agccatggac gccctccctg ccacgcagct cctgccagac
    84721 accgccactc acgctcagca gcctcccatg ctccagggac accagcggga gcctgaagga
    84781 tagggagtgg ggagggcagg ggtcagggcc actcatggac ccatggctca gggacaccag
    84841 aggagctccc tttaggaagg actcaaaccc ctaccgctct cagcagccca agtggtgcct
    84901 gtactctcta gcacgggggc ttctgcccac cctctaccga ctgcccacat tcaccaagag
    84961 ccctcacccc ttcctgatac cccagtgact cagggtcctg ccagcatgtg ctgagctgca
    85021 gcttctaggt gccaaagcaa cagcatagga ctcctatccc cagcaccccc agagactggc
    85081 aggaggggca gcctggaaag gaggacttta ttggtatttt ccagggacca gttctggtgc
    85141 tgctgacccc aaaggctggg cctggagcta ccttattcag gtcacacaag caatgcagct
    85201 gggtgcggtc agaggcaaca gggtgctcga tggtgggcaa gaacagggga ccacataaag
    85261 taagcgtgcc cttagagctt tccctcctgg tgatcggtca gggccatatg caaaacagat
    85321 gcctgtggag ggggtgtgcc cccacctgaa ccccattcag accctgccct gagtctcagg
    85381 cctgcctccg cctcagctcc ccaacggcag cctcagcaca ggggcagtga gaggcgcccc
    85441 aggaagcctc caatgggccg agctgggacc atctgagcat caaaaagaat aatgagtgca
    85501 actgatggaa acagatggaa cacataaaag cagtgcattc acagagatta ttagaaaaga
    85561 agagagaaag caaaacaaac aaacccaagc cctcaaaagc cctcatttgt caccactgac
    85621 ggtagcagtg ccctctttac tctgaaagct gaggattaaa gaggaagaat tcagcctgtc
    85681 tcttggcctt tcttgtaacc aaattgccct ggtggttgat ggaaagctct tctttcggaa
    85741 agaattcttg ctaataaata acaaagaaat gaccaaatgc taagtcattc tgcaacccct
    85801 aaaggaacaa atgctggagg caacaagaac cagtggatgc taaaccagag gggaaggttg
    85861 acaggaagca gatactcaca ggcgcccaag cgtcactgac agaggacctg gtatggtttt
    85921 tgtttttttt ttttttctga gacggagttt cactcttgtc acccaggctg ggatgcaatg
    85981 gcacgatctc ggctaactgc aacatctgcc tcctgggttc gagcgactct cccacctcag
    86041 catcccgagt agctgggact acaggcgccc gccaccacgc ctagctaatt tttgtatttt
    86101 tagtagagac agggtttcac catgttggcc aggctggtct cgaactcctg acctcaagca
    86161 atccgctcac ttcggcctcc caaagtgttg ggattacagg tgtgagccac catgcctggc
    86221 cgaggacctg gtgtttaaga ggaatggcac aagtttgcaa tggaggtatg tgacttctcc
    86281 cagtcacaca ctggtctatc tctctactct ctgcaggaca gcctgtcaca ctgatgcctc
    86341 ctgaagtaag gcagcccaaa gtcacagcat cactgacaag ggattcctgc caaaaaggtt
    86401 taacctggaa tctaaaccag cctctaactt caacttctca tttgctaaaa acacagagga
    86461 gaggggaaca aatttaatga ctccatgaga aagcaatgag acgagcccag aaggcaggat
    86521 attctacagg atgactgacc tgttttttgt ttttgttttt gttttcaaat gtcaagaaga
    86581 aaaagaaagc aggctgggaa cggtggctca cgcctgtaat cccagcactt tcacggatca
    86641 caaggtaagg agttcaagac cagcctggcc aacatagtga aacccagtct ctaccaaaag
    86701 tacaaagatt agccgggtgt ggtggcgggc agctgtagtc ccagctagtt gggaggttga
    86761 ggcaggagaa tcacttgaat ccggaaggcg gaggttgcag tgagccaaga tcgcgccacg
    86821 gcacttcagc ctgggtgaca cagcgagact ctgtctcaag aaaaaaaaaa aagaaagaag
    86881 aaaaataaaa cgagggagat tattctagat aggaaagcag caaactacag ctgattacac
    86941 agctgatggt aaggttgctg cctgttgtgt ttttgttttt gtttttgaga cagggtctca
    87001 ctctgtccag cccaggctgg agtccagctc actgccaccg taatctccca ggctcaagca
    87061 atcctcccac cccagcctgc caagtagctg agaccacagc cacgcaccac catgcccggc
    87121 taatttttgt agagacagag ttgcccaggc tggtctcaaa ctccggagct taagcgatcc
    87181 acccacctca gcctcccaaa gtgctgggat tacagacgtg agccgccagg cctggcaaat
    87241 aaatttttgc aaataaagtt ttactaatag aacactacac ttcattcact tacatgttgt
    87301 ctgtttttgc actaaaacaa agttacttcc aagtagttat gatagagacc atatggcctg
    87361 caaagtctaa accatttatt atttggccct tcagagcaaa agtttgccaa cccttgttct
    87421 agatcaacac acacttaaca gaccaaaaaa ggataatctt caggacaact gacctgtttt
    87481 tttcaccaca tcagctgcaa gaagaaaaat aaaggagagg gtgtttcgaa ctacctatca
    87541 aagaagacat tttggggggt gccgggcgct gcggctcaca cctgtaatcc cagcactttg
    87601 ggaagccaag gcgggtagat cacctgaggt caggagtttg ggaccagcct ggccaacatg
    87661 gcgaaacccc atctctacta aaaatacaaa aatcagctag gcatggtggt gtgcacctgt
    87721 aatcccagct actcaggagg ctgaggcagg agaatcgctt gaacctggga ggcggaggtt
    87781 gcagtgagcc aagatcccgc cactgcactc cagcctgggt gacagagcga gactctgtct
    87841 caaaaaaaaa aaaaaaaaaa aggccgggtg aggtggctca cgcctgtaat cacagcactg
    87901 tgggaggctg aggcgggcgg atcacctgag gtcgggagtt caagaccagc ctggccaaca
    87961 tggagaaacc ctgtctctac taaaaaatac aaaattagcc aggcgtggtg gcgcatgcct
    88021 gtaatccagc tactcgggag gctgaggcag gagaatcgct tgaacccagg aggcggaggt
    88081 tgcggtgagc cgagattgcg ccactgcact ctagcctggg caaaaggagc aaaactccgt
    88141 ctcaaaaaaa aaaaaaaaaa gacatttggg ggatctggaa aatgtaaatg gcctggatat
    88201 gaggtaacac taagaaatga gttgatcatg gaattgtatt atgtaaagaa ttatgtcctt
    88261 attttgtcag ggaaacatac tgaacaactt agagatgagt gacactggct catcagtcca
    88321 gggacagaag aggaactaca gggtggcctc atagtttagg gctgggcctg tggagggcat
    88381 gatctcaagg tccctctggt gccaagctgc tctgggagca gatgtggcct cacctcgctc
    88441 cccacttctc aatgtgggca aagtccaccc aggcccaagc cctgcctcat gggcagggtg
    88501 ttcacagttt gggtaccacc aggagccccc tttggccaat ggaccaggcc agcaaccccc
    88561 tcttggaggt gagcttccag gccccagccc agggtgcagg aagtggagct acacccattc
    88621 acctctggcc agggccactg tggggtcagt tgcctcagcc ctggaaagct cagcttgtcc
    88681 ccagggaact tggtcttgga gagcagcctg ccgtaaccgc cccccgacta cagctgcccc
    88741 caaagctgtg aactcagagt atatgacaaa tggccaagca caagagtgtg aggccctcct
    88801 atcctgcaga ggctgctcca gccacacctc tgcagccagg aggcagtgac agcccagtgt
    88861 cctcaaggtc cccccatctc ccaggatgct tcctgaaggc tctggttaaa ctgcaccagt
    88921 ggctggtgag caaaggctgc aggccctttt cacaaaacac tctgaactgg tccatgcagg
    88981 gcaccctccc tgtcttcctc ccaacctact gcctgaccgt ccccccgccc tgcctttgcc
    89041 caggtcatcc cttcagccag gaacaatccc gtccctccca gccacacctt cgacagggtc
    89101 tcagccccaa catttctttt ttaaggatgc acaccctgca aatcccagga caggactaca
    89161 gctctttttg gaatcccctt tctcatgtct gggatcattt gatatccctc ttcctcacct
    89221 tacaagggta gtgactgtgt ttttgtcaaa ccctgtgtcc ccggcactca gcgcagcacc
    89281 agacagggag gagctgtgtt tggtttatgt ttgttgaatg aatgaccaca tcacatttgc
    89341 ttggagggcc tggccaaggc ccgtacttca gccctaaatg atatggtcag ggccagtgac
    89401 tgcactgggg gctctgagga agcagaactc cccaccccac tcttttgcct cacccctgcc
    89461 ctggggggca cttccctctc cctggccccc accaagtggc ctcccactgg gactcctcag
    89521 gccttgctgc agtcagctgg ttacctgtcc atgcctgctt tgagagggac atgccccatg
    89581 cccactgagg gtaggaccca catctcatgc agtgccagta ctcagtaaag ggcaaacact
    89641 gagggcctgt aaccctctgg atagtgacaa catagaggca ggaagcaagg gacttcagga
    89701 acccaaagga aactgggaaa accaaacctc ctttctcaat ggagaacctg ctggtcgctg
    89761 tcctcgggga gctagactct tgtgcacaca caattcctca tcagaaacag gcaaagaggg
    89821 cactgggcct cccctgctgc agctgtgtcc acagagaaca gctaggccac cacacaaccc
    89881 caagcctggc cctgcctcca gcctcctcct gtgccaacca aggcctcacc atggctcaat
    89941 ccacatggcc cccagaggca gcagtcctgg ttcaaattca ggatctgccc ttgctagctg
    90001 catgggtgag ttctttcact tctctgtccc tgtttcctaa tgtgtaacac aggaataagc
    90061 agtggcctct tccttccagg gtttgctgca actgtgcctg aagctctgtg acaatgcacc
    90121 ccccatctct gcaaaagcaa acccccaaag gcctggtttc taaagcaaca gcagtttcag
    90181 cagcaccgtc agagaggcac ttcaggccaa tcctggagga gccaggagtg acccttagag
    90241 tgggccccag ggacgtcagc cttcttggaa aacagctcaa ggggtgaggg ggcctccctc
    90301 ctcctgcctc ccccttctcc cactcccaaa gcagccaggt ccctagggag ggtcagagaa
    90361 cagatgctgg gagtttccag tccccctaac cagagggggt cacaaggaag atgtgcagaa
    90421 tgaacatcct gggaaactgg gaaatgacta gggaggaaca tggtgcctcc ccgccagcaa
    90481 aaaaaattat acccttcccc atgagatgga gtgtcagcaa gcttccaggc cccagcccag
    90541 ggtgcaggaa gaggagctac acccattcac ctctggtcag catgctccca acgtctgaga
    90601 cctcatttct cattccttct tcagtcccca ttctcattgt ggttcgaggt ctttctctct
    90661 gagccaggaa ctccgcaacc ttccaccctc cctacctctt cctctcctac cccagcctgg
    90721 ggctcagtcc tggctcaagc actcagtcca gcagaagcac tgtgtagcct cccattaaag
    90781 ctcacgcctg tgaaaagaac agccattgag gccttgagat ggggccacac tgacccgctg
    90841 actctcagga ctggacacag cagaggccac acatactcag aacaaagcct ggaaaggcaa
    90901 ggctggaggt cagtagttgt ggcagcttca catcaactca gctttaatgt gatttaattt
    90961 ccttctccct ccagtgggcc aaaggtgcaa agataagtat ggctgttctc tctccttcta
    91021 acagtgaggt gctgggggtg ggggtggggg aatatggaga agggaccctc accacccaca
    91081 ccttcctgcc tccccaacaa gtgctgccct cctctgccca gcattctccc cactttgccc
    91141 tcagctagtg ggtgcttagc ctccagatag catgccccac ctaggccctg ccctgggcct
    91201 gtgatccaga ggtcccaaga agcagaggcc aggctggatc cagggggtca gccaaggtga
    91261 gggtgggagc acacaggatt atctcccagg gacagggctg ctgcctcgta gctcaggatg
    91321 gatagaatgt ggggggatat ccagctacat tttccctcca caaaagacca gaatgggaga
    91381 gggatggggt gctgccccga ctttcttcaa ctccccggag cagaaaaatg ccctacctcc
    91441 actttccagt gccaagattc aagaagaaag gcaagcggag acttcccttt ctcagtccct
    91501 gcttactaat ggaaacacgg gtccagaacc taaatccagt ccctcctcct tcataccacc
    91561 gggagggagg tgcagcccaa gcccccgagg ccccaagggt ccaggtgtag gaccctttat
    91621 cctctccggc agccatccct gtgggtgtgg cacccccgcc acaccccatt cttgtcatct
    91681 cagggggagg ggggaaatgt aatcggacat cccccccatc caatccatcc tgagctgcga
    91741 ggcggcggct ctgtccctcg gagataatcc tgtgcactcc ccaccttcac tcacctcggc
    91801 tgacgcagga gtctccggag cccgcactcc cagacatcac tgccctcctt cctgagggtg
    91861 ctgaggctcg gaggctcaga gatgctactg gtccaaggtc atgcagcgag gcagcggcaa
    91921 tgaacagggt cgtggggagg agggggcgct gaccccatta cgcccccgcc ctcactaccg
    91981 cactcatgcc cccggacaat cgcttcgcgg aaaacacccc agctgcacca gctaacggtc
    92041 actgcgcccc gggaacctga catcactcta gttcaggtgc tggggagtcc tccaggcccg
    92101 gccccgacag cggagccccc caccccagcc ctgccatcac ggccgctggg gtcccaggca
    92161 ctgactgctc aggacagggc cgggaccggg aggggacctc ggcgaacggg aaggaaccgg
    92221 gaggcaggga gcagaggggt ggcctcacct tgcggggtgc accccggggc cggggagggc
    92281 agggacgacc actgcagcgg cggcggctgc aggagctcaa cgccgagcac gaggaaggga
    92341 gccccgcgcc gcggccgccc tcccgtcggc acgcccccgc ctccgcccat tggttgatct
    92401 gggagggtgg ggcgagggac gctccggacc aatgagcggg ctccaaagaa cggccaactg
    92461 gcgagggccg cctacgtcac gtgccagggt cgccgaggca gcgccctgct agtccgcgcc
    92521 tgccgggcga gctctcgcga ggaagacggg caggcggccc aactaggcca ggggccagaa
    92581 ccgaccactc gaagagggag aaggagggcc tcggataggc cccgcccccg ctccttcttc
    92641 cgcctggggg atagcgcctc tagccttgaa ccttgcttag gacgcacctc ccttgggccc
    92701 ttcgctctcg ggagggctgt cgggcgcgtc tcggggctgg gtggagctcc cgaaggtggc
    92761 ctttctccct gggcttccac gccggcttcg gcca


&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U05291. Human fibromoduli...[gi:450854] Links  


LOCUS       HSU05291                1892 bp    mRNA    linear   PRI 26-JAN-1994
DEFINITION  Human fibromodulin mRNA, partial cds.
ACCESSION   U05291
VERSION     U05291.1  GI:450854
KEYWORDS    .
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1892)
  AUTHORS   Sztrolovics,R., Chen,X.N., Grover,J., Roughley,P.J. and
            Korenberg,J.R.
  TITLE     Localization of the human fibromodulin gene to chromosome 1q32 and
            completion of the cDNA sequence
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1892)
  AUTHORS   Sztrolovics,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JAN-1994) Robert Sztrolovics, Department of Surgery,
            McGill University, Genetics Unit, Shriners Hospital for Crippled
            Children, 1529 Cedar Avenue, Montreal, Quebec, H3G 1A6, Canada
FEATURES             Location/Qualifiers
     source          1..1892
                     /organism="Homo sapiens"
                     /isolate="patient A10/03/93"
                     /db_xref="taxon:9606"
                     /chromosome="1"
                     /map="1q32"
                     /clone="pHFM-3'UT"
                     /sex="female"
                     /cell_type="chondrocyte"
                     /tissue_type="cartilage"
                     /clone_lib="PCR product"
                     /dev_stage="neonate"
     CDS             <1..178
                     /note="Encodes only the most carboxy terminal 58 amino
                     acids of fibromodulin."
                     /codon_start=2
                     /product="fibromodulin"
                     /protein_id="AAA16153.1"
                     /db_xref="GI:450855"
                     /translation="YLQGNRINEFSISSFCTVVDVVNFSKLQVLRLDGNEIKRSAMPA
                     DAPLCLRLASLIEI"
     3'UTR           179..1892
                     /note="Identity as the fibromodulin cDNA 3'-untranslated
                     region is based on Northern blot studies and comparison
                     with published bovine fibromodulin cDNA sequence."
     polyA_signal    1871..1876
                     /note="This is the only consensus poly-A signal in the
                     sequence entry and correlates with mRNA size observed on
                     Northern blot analysis."
                     /evidence=experimental
     polyA_site      1892
                     /note="3'-untranslated sequence was cloned from
                     oligo-dT-primed cDNA. Cloned sequence contained poly-A
                     sequence adjacent to last base of sequence entry."
                     /evidence=experimental
BASE COUNT      453 a    493 c    476 g    470 t
ORIGIN      
        1 ctacctccaa ggcaatagga tcaatgagtt ctccatcagc agcttctgca ccgtggtgga
       61 cgtcgtgaac ttctccaagc tgcaggtgct gcgcctggac gggaacgaga tcaagcgcag
      121 cgccatgcct gccgacgcgc ccctctgcct gcgccttgcc agcctcatcg agatctgagc
      181 agccctggca ccgggtactg ggcggagagc ccccgtggca tttggcttga tggtttggtt
      241 tggcttttgc tggaaggtcc aggatggacc atgtgacaga agtccacggg caccctctgt
      301 agtcttcttt cctgtaggtg gggttagggg gggcgatcag ggacaggcag ccttctgctg
      361 aggacatagg cagaagctca ctcttttcca gggacagaag tggtggtaga tggaaggatc
      421 cctggatgtt ccaaccccat aaatctcacg gctcttaagt tcttcccaat gatctgaggt
      481 catggaactt caaaagtggc atgggcaata gtatataacc atacttttct aacaatccct
      541 ggctgtctgt gagcagcact tgacagctct ccctctgtgc tgggctggtc gtgcagttac
      601 tctgggctcc catttgttgc ttctcaaaat atacctcttg cccagctgcc tcttctgaaa
      661 tccacttcac ccactccact ttcctccaca gatgcctctt ctgtgcctta agcagagtca
      721 ggagacccca aggcatgtga gcatctgccc agcaacctgt ggagacaacc cacactgtgt
      781 ctgagggtga aaggacacca ggagtcactt ctatacctcc ctaacctcac ccctggaaag
      841 ccaccagatt ggaggtcacc agcatgatga taatattcat gacctgatgt gggaggagac
      901 agccaacctc aggcttagat caatgtatag ggctatattt tggcagctgg gtagctcttt
      961 gaaggtggat aagacttcag aagaggaaag gccagacttt gcttaccatc agcatctgca
     1021 atgggccaaa cacacctcaa attggctgag ttgagaaagc agccccagta gttccattct
     1081 tgcccagcac tttctgcatt ccaaacagca tcctacctgg gtttttatcc acaaaggtag
     1141 cggccacatg gtttttaaag tatgagaaac acagtttgtc ctctcctttt atccaagcag
     1201 gaagattcta tatcctgatg gtagagacag actccaggca gccctggact tgctagccca
     1261 aagaaggagg atgtggttaa tctgtttcac ctggtttgtc ctaaggccat agttaaaaag
     1321 taccagctct ggctggggtc cgtgaagccc aggccaggca gccaaatctt gcctgtgctg
     1381 ggcatacaac cctctgcttt cacatctctg agctatatcc tcattagtga aggtggcttt
     1441 tgctttatag tttggctggg gagcacttaa ttcttcccat ttcaaaaggt aatgttgcct
     1501 ggggcttaac ccacctgccc tttgggcaag gttgggacaa agccatctgg gcagtcaggg
     1561 gcaaggactg ttggaggaga gttagcccaa gtataggctc tgcccagatg ccatcacatc
     1621 cctgatactg tgtatgcttt gaagcacctt ccctgagaag ggaagagggg atctttggac
     1681 tacgttcttg gctccagacc tggaatccac aaaagccaaa ccagctcatt tcaacaaagg
     1741 agctccgatg tgaggggcaa ggctgccccc tgccccaggg ctcttcagaa agcatctgca
     1801 tgtgaacacc atcatgcctt tataaaggat ccttattaca ggaaaagcat gagtggtggc
     1861 taacctgacc aataaagtta ttttatgatt gc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





&&&&&&&

    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: S62077. HP1Hs alpha=25 kd...[gi:386086] Links  


LOCUS       S62077                   876 bp    mRNA    linear   PRI 25-AUG-1993
DEFINITION  HP1Hs alpha=25 kda chromosomal autoantigen [human, mRNA, 876 nt].
ACCESSION   S62077
VERSION     S62077.1  GI:386086
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 876)
  AUTHORS   Saunders,W.S., Chue,C., Goebl,M., Craig,C., Clark,R.F.,
            Powers,J.A., Eissenberg,J.C., Elgin,S.C., Rothfield,N.F. and
            Earnshaw,W.C.
  TITLE     Molecular cloning of a human homologue of Drosophila
            heterochromatin protein HP1 using anti-centromere autoantibodies
            with anti-chromo specificity
  JOURNAL   J. Cell. Sci. 104 (Pt 2), 573-582 (1993)
  MEDLINE   93280259
   PUBMED   8505380
  REMARK    GenBank staff at the National Library of Medicine created this
            entry [NCBI gibbsq 133160] from the original journal article.
            This sequence comes from Fig. 3.
FEATURES             Location/Qualifiers
     source          1..876
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..876
                     /gene="HP1Hs alpha"
     CDS             135..710
                     /gene="HP1Hs alpha"
                     /note="25 kda chromosomal autoantigen; This sequence comes
                     from Fig. 3"
                     /codon_start=1
                     /product="HP1Hs alpha"
                     /protein_id="AAB26994.1"
                     /db_xref="GI:386087"
                     /translation="MGKKTKRTADSSSSEDEEEYVVEKVLDRRVVKGQVEYLLKWKGF
                     SEEHNTWEPEKNLDCPELISEFMKKYKKMKEGENNKPREKSESNKRKSNFSNSADDIK
                     SKKKREQSNDIARGFERGLEPEKIIGATDSCGDLMFLMKWKDTDEADLVLAKEANVKC
                     PQIVIAFYEERLTWHAYPEDAENKEKETAKS"
BASE COUNT      308 a    158 c    221 g    189 t
ORIGIN      
        1 gcgcagaagc ggcggcggtg gtggcttgtg gtgcggcctc accatacagg aacagggcag
       61 acgttagcgt gagtgatcac tctcaatccc ggggacctgg tggccttagt ctttcaggtg
      121 gaacggtgtg cgacatggga aagaaaacca agcggacagc tgacagttct tcttcagagg
      181 atgaggagga gtatgttgtg gagaaggtgc tagacaggcg cgtggttaag ggacaagtgg
      241 aatatctact gaagtggaaa ggcttttctg aggagcacaa tacttgggaa cctgagaaaa
      301 acttggattg ccctgagcta atttctgaat ttatgaaaaa gtataagaag atgaaggagg
      361 gtgaaaataa taaacccagg gagaagtcag aaagtaacaa gaggaaatcc aatttctcaa
      421 acagtgccga tgacatcaaa tctaaaaaaa agagagagca gagcaatgat atcgctcggg
      481 gctttgagag aggactggaa ccagaaaaga tcattggggc aacagattcc tgtggtgatt
      541 taatgttcct aatgaaatgg aaagacacag atgaagctga cctggttctt gcaaaagaag
      601 ctaatgtgaa atgtccacaa attgtgatag cattttatga agagagactg acatggcatg
      661 catatcctga ggatgcggaa aacaaagaga aagaaacagc aaagagctaa aggaggggat
      721 ggtctctgtc atttctcttt gtacataata cattcacctc cctgcctcct ctcctttcta
      781 cccacccctt tctatcctaa acacatccat aaaaaatgtg cttatcactg tgctccacaa
      841 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M77829. Human channel-lik...[gi:180500] Links  


LOCUS       HUMCHIP28A              1340 bp    mRNA    linear   PRI 31-DEC-1994
DEFINITION  Human channel-like integral membrane protein (CHIP28) mRNA,
            complete cds.
ACCESSION   M77829
VERSION     M77829.1  GI:180500
KEYWORDS    channel-like integral membrane protein.
SOURCE      Homo sapiens male adult bone marrow cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1340)
  AUTHORS   Preston,G.M. and Agre,P.
  TITLE     Isolation of the cDNA for erythrocyte integral membrane protein of
            28 kilodaltons: member of an ancient channel family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 88 (24), 11110-11114 (1991)
  MEDLINE   92107900
   PUBMED   1722319
FEATURES             Location/Qualifiers
     source          1..1340
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /sex="male"
                     /tissue_type="bone marrow"
                     /dev_stage="adult"
     gene            1..1340
                     /gene="CHIP28"
     5'UTR           1..38
                     /gene="CHIP28"
     CDS             39..848
                     /gene="CHIP28"
                     /codon_start=1
                     /product="channel-like integral membrane protein"
                     /protein_id="AAA58425.1"
                     /db_xref="GI:180501"
                     /translation="MASEFKKKLFWRAVVAEFLATTLFVFISIGSALGFKYPVGNNQT
                     AVQDNVKVSLAFGLSIATLAQSVGHISGAHLNPAVTLGLLLSCQISIFRALMYIIAQC
                     VGAIVATAILSGITSSLTGNSLGRNDLADGVNSGQGLGIEIIGTLQLVLCVLATTDRR
                     RRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNHWIFWVGPFI
                     GGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK"
     3'UTR           846..1340
                     /gene="CHIP28"
BASE COUNT      250 a    411 c    397 g    282 t
ORIGIN      
        1 gcacccggca gcggtctcag gccaagcccc ctgccagcat ggccagcgag ttcaagaaga
       61 agctcttctg gagggcagtg gtggccgagt tcctggccac gaccctcttt gtcttcatca
      121 gcatcggttc tgccctgggc ttcaaatacc cggtggggaa caaccagacg gcggtccagg
      181 acaacgtgaa ggtgtcgctg gccttcgggc tgagcatcgc cacgctggcg cagagtgtgg
      241 gccacatcag cggcgcccac ctcaacccgg ctgtcacact ggggctgctg ctcagctgcc
      301 agatcagcat cttccgtgcc ctcatgtaca tcatcgccca gtgcgtgggg gccatcgtcg
      361 ccaccgccat cctctcaggc atcacctcct ccctgactgg gaactcgctt ggccgcaatg
      421 acctggctga tggtgtgaac tcgggccagg gcctgggcat cgagatcatc gggaccctcc
      481 agctggtgct atgcgtgctg gctactaccg accggaggcg ccgtgacctt ggtggctcag
      541 ccccccttgc catcggcctc tctgtagccc ttggacacct cctggctatt gactacactg
      601 gctgtgggat taaccctgct cggtcctttg gctccgcggt gatcacacac aacttcagca
      661 accactggat tttctgggtg gggccattca tcgggggagc cctggctgta ctcatctacg
      721 acttcatcct ggccccacgc agcagtgacc tcacagaccg cgtgaaggtg tggaccagcg
      781 gccaggtgga ggagtatgac ctggatgccg acgacatcaa ctccagggtg gagatgaagc
      841 ccaaatagaa ggggtctggc ccgggcatcc acgtaggggg caggggcagg ggcgggcgga
      901 gggaggggag gggtgaaatc catactgtag acactctgac aagctggcca aagtcacttc
      961 cccaagatct gccagacctg catggtcaag cctcttatgg gggtgtttct atctctttct
     1021 ttctctttct gtttcctggc ctcagagctt cctggggacc aagatttacc aattcaccca
     1081 ctcccttgaa gttgtggagg aggtgaaaga aagggaccca cctgctagtc gcccctcaga
     1141 gcatgatggg aggtgtgcca gaaagtcccc cctcgcccca aagttgctca ccgactcacc
     1201 tgcgcaagtg cctgggattc taccgtaatt gctttgtgcc tttgggcacg gccctccttc
     1261 ttttcctaac atgcaccttg ctcccaatgg tgcttggagg gggaagagat cccaggaggt
     1321 gcagtggagg gggcaagctt 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U41517. Human channel-lik...[gi:1314303] Links  


LOCUS       HSU41517                1657 bp    mRNA    linear   PRI 23-AUG-1996
DEFINITION  Human channel-like integral membrane protein (AQP-1) mRNA, clone
            AQP-1-1656, complete cds.
ACCESSION   U41517
VERSION     U41517.1  GI:1314303
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 269)
  AUTHORS   Preston,G.M. and Agre,P.
  TITLE     Isolation of the cDNA for erythrocyte integral membrane protein of
            28 kilodaltons: member of an ancient channel family
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 88 (24), 11110-11114 (1991)
  MEDLINE   92107900
   PUBMED   1722319
REFERENCE   2  (bases 1 to 1657)
  AUTHORS   Moon,C., Preston,G.M., Griffin,C.A., Jabs,E.W. and Agre,P.
  TITLE     The human aquaporin-CHIP gene. Structure, organization, and
            chromosomal localization
  JOURNAL   J. Biol. Chem. 268 (21), 15772-15778 (1993)
  MEDLINE   93340184
   PUBMED   8340403
REFERENCE   3  (bases 1 to 269)
  AUTHORS   Ruiz,A. and Bok,D.
  TITLE     Characterization of the 3' UTR sequence encoded by the AQP-1 gene
            in human retinal pigment epithelium
  JOURNAL   Biochim. Biophys. Acta 1282 (2), 174-178 (1996)
  MEDLINE   96326579
   PUBMED   8703970
REFERENCE   4  (bases 1 to 1657)
  AUTHORS   Ruiz,A.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-NOV-1995) Alberto C. Ruiz, Neurobiology, University
            of California at Los Angeles, 10833 Le Conte Avenueue, CHS RM
            73-235, Los Angeles, CA 90024, USA
FEATURES             Location/Qualifiers
     source          1..1657
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="AQP-1-1656"
                     /sex="male"
                     /tissue_type="retinal pigment epithelium"
                     /dev_stage="fetus"
     gene            1..1657
                     /gene="AQP-1"
     5'UTR           1..33
                     /gene="AQP-1"
     CDS             34..843
                     /gene="AQP-1"
                     /function="water channel"
                     /note="aquaporin-1"
                     /citation=[1]
                     /citation=[2]
                     /codon_start=1
                     /product="channel-like integral membrane protein"
                     /protein_id="AAC50648.1"
                     /db_xref="GI:1314304"
                     /translation="MASEFKKKLFWRAVVAEFLATTLFVFISIGSALGFKYPVGNNQT
                     AVQDNVKVSLAFGLSIATLAQSVGHISGAHLNPAVTLGLLLSCQISIFRALMYIIAQC
                     VGAIVATAILSGITSSLTGNSLGRNDLADGVNSGQGLGIEIIGTLQLVLCVLATTDRR
                     RRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNHWIFWVGPFI
                     GGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK"
     3'UTR           844..1656
                     /gene="AQP-1"
     polyA_signal    1631..1636
                     /gene="AQP-1"
     polyA_site      1656
                     /gene="AQP-1"
BASE COUNT      319 a    499 c    473 g    366 t
ORIGIN      
        1 cggcagcggt ctcaggccaa gccccctgcc agcatggcca gcgagttcaa gaagaagctc
       61 ttctggaggg cagtggtggc cgagttcctg gccacgaccc tctttgtctt catcagcatc
      121 ggttctgccc tgggcttcaa atacccggtg gggaacaacc agacggcggt ccaggacaac
      181 gtgaaggtgt cgctggcctt cgggctgagc atcgccacgc tggcgcagag tgtgggccac
      241 atcagcggcg cccacctcaa cccggctgtc acactggggc tgctgctcag ctgccagatc
      301 agcatcttcc gtgccctcat gtacatcatc gcccagtgcg tgggggccat cgtcgccacc
      361 gccatcctct caggcatcac ctcctccctg actgggaact cgcttggccg caatgacctg
      421 gctgatggtg tgaactcggg ccagggcctg ggcatcgaga tcatcgggac cctccagctg
      481 gtgctatgcg tgctggctac taccgaccgg aggcgccgtg accttggtgg ctcagccccc
      541 cttgccatcg gcctctctgt agcccttgga cacctcctgg ctattgacta cactggctgt
      601 gggattaacc ctgctcggtc ctttggctcc gcggtgatca cacacaactt cagcaaccac
      661 tggattttct gggtggggcc attcatcggg ggagccctgg ctgtactcat ctacgacttc
      721 atcctggccc cacgcagcag tgacctcaca gaccgcgtga aggtgtggac cagcggccag
      781 gtggaggagt atgacctgga tgccgacgac atcaactcca gggtggagat gaagcccaaa
      841 tagaaggggt ctggcccggg catccacgta gggggcaggg gcaggggcgg gcggagggag
      901 gggaggggtg aaatccatac tgtagacact ctgacaagct ggccaaagtc acttccccaa
      961 gatctgccag acctgcatgg tcaagcctct tatgggggtg tttctatctc tttctttctc
     1021 tttctgtttc ctggcctcag agcttcctgg ggaccaagat ttaccaattc acccactccc
     1081 ttgaagttgt ggaggaggtg aaagaaaggg acccacctgc tagtcgcccc tcagagcatg
     1141 atgggaggtg tgccagaaag tcccccctcg ccccaaagtt gctcaccgac tcacctgcgc
     1201 aagtgcctgg gattctaccg taattgcttt gtgcctttgg gcaggccctc cttcttttcc
     1261 taacatgcac cttgctccca atggtgcttg gagggggaag agatcccagg aggtgcagtg
     1321 gagggggcaa gctttgctcc ttcagttctg cttgctccca agcccctgac ccgctcggac
     1381 ttactgcctg accttggaat cgtccctata tcagggcctg agtgacctcc ttctgcaaag
     1441 tggcagggac cggcagagct ctacaggcct gcagccccta agtgcaaaca cagcatgggt
     1501 ccagaagacg tggtctagac cagggctgct ctttccactt gccctgtgtt ctttccccag
     1561 gggcatgact gtcgccacac gcctctgcat atatgtctct ttggagttgg aatttcatta
     1621 tatgttaaga aaataaagga aaatgacttg taaggtc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF079806. Homo sapiens cyto...[gi:4731862] Links  


LOCUS       AF079806                1066 bp    mRNA    linear   PRI 24-NOV-2000
DEFINITION  Homo sapiens cytokine responsive protein (CR6) mRNA, complete cds.
ACCESSION   AF079806
VERSION     AF079806.1  GI:4731862
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1066)
  AUTHORS   Beadling,C., Johnson,K.W. and Smith,K.A.
  TITLE     Isolation of interleukin 2-induced immediate-early genes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 90 (7), 2719-2723 (1993)
  MEDLINE   93219355
   PUBMED   7681987
REFERENCE   2  (bases 1 to 1066)
  AUTHORS   Zhang,W., Bae,I., Krishnaraju,K., Azam,N., Fan,W., Smith,K.,
            Hoffman,B. and Liebermann,D.A.
  TITLE     CR6: A third member in the MyD118 and Gadd45 gene family which
            functions in negative growth control
  JOURNAL   Oncogene 18 (35), 4899-4907 (1999)
  MEDLINE   99422022
   PUBMED   10490824
REFERENCE   3  (bases 1 to 1066)
  AUTHORS   Fan,W., Beadling,C., Richter,G. and Smith,K.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-JUL-1998) Dept. of Medicine, Div. of Immunology,
            Cornell Medical College, 525 E 68th St. LC-907, New York, NY 10021,
            USA
FEATURES             Location/Qualifiers
     source          1..1066
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..1066
                     /gene="CR6"
     CDS             98..577
                     /gene="CR6"
                     /note="interleukin-2 responsive protein; MyD118/Gadd45
                     related protein"
                     /codon_start=1
                     /product="cytokine responsive protein"
                     /protein_id="AAD28544.1"
                     /db_xref="GI:4731863"
                     /translation="MTLEEVRGQDTVPESTARMQGAGKALHELLLSAQRQGCLTAGVY
                     ESAKVLNVDPDNVTFCVLAAGEEDEGDIALQIHFTLIQAFCCENDIDIVRVGDVQRLA
                     AIVGAGEEAGAPGDLHCILISNPNEDAWKDPALEKLSLFCEESRSVNDWVPSITLPE"
BASE COUNT      199 a    293 c    378 g    196 t
ORIGIN      
        1 gtgggtgcgc cgtgctgagc tctggctgtc agtgtgttcg cccgcgtccc ctccgcgctc
       61 tccgcttgtg gataactagc tgctggttga tcgcactatg actctggaag aagtccgcgg
      121 ccaggacaca gttccggaaa gcacagccag gatgcagggt gccgggaaag cgctgcatga
      181 gttgctgctg tcggcgcagc gtcagggctg cctcactgcc ggcgtctacg agtcagccaa
      241 agtcttgaac gtggaccccg acaatgtgac cttctgtgtg ctggctgcgg gtgaggagga
      301 cgagggcgac atcgcgctgc agatccattt tacgctgatc caggctttct gctgcgagaa
      361 cgacatcgac atagtgcgcg tgggcgatgt gcagcggctg gcggctatcg tgggcgccgg
      421 cgaggaggcg ggtgcgccgg gcgacctgca ctgcatcctc atttcgaacc ccaacgagga
      481 cgcctggaag gatcccgcct tggagaagct cagcctgttt tgcgaggaga gccgcagcgt
      541 taacgactgg gtgcccagca tcaccctccc cgagtgacag cccggcgggg accttggtct
      601 gatcgacgtg gtgacgcccc ggggcgccta gagcgcggct ggctctgtgg aggggccctc
      661 cgagggtgcc cgagtgcggc gtggagactg gcaggcgggg ggggcgcctg gagagcgagg
      721 aggcgcggcc tcccgaggag gggcccggtg gcggcagggc caggctggtc cgagctgagg
      781 actctgcaag tgtctggagc ggctgctcgc ccaggaaggc ctaggctagg acgttggcct
      841 cagggccagg aaggacagac tggccgggca ggcgtgactc agcagcctgc gctcggcagg
      901 aaggagcggc gccctggact tggtacagtt tcaggagcgt gaaggactta accgactgcc
      961 gctgcttttt caaaacggat ccgggcaatg cttcgttttc taaaggatgc tgctgttgaa
     1021 gctttgaatt ttacaataaa ctttttgaaa caaaaaaaaa aaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M20776. Homo sapiens, alp...[gi:180909] Links  


LOCUS       HUMCOLTHA               1008 bp    mRNA    linear   PRI 01-NOV-1994
DEFINITION  Homo sapiens, alpha-1 (VI) collagen.
ACCESSION   M20776 J04211
VERSION     M20776.1  GI:180909
KEYWORDS    alpha-1 type VI collagen; collagen; triple-helical domain; type VI
            collagen.
SOURCE      Human placenta fibroblast, cDNA to mRNA, clones P18, P6, P101,
            F108, F113, and F157.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1008)
  AUTHORS   Chu,M.L., Conway,D., Pan,T.C., Baldwin,C., Mann,K., Deutzmann,R.
            and Timpl,R.
  TITLE     Amino acid sequence of the triple-helical domain of human collagen
            type VI
  JOURNAL   J. Biol. Chem. 263 (35), 18601-18606 (1988)
  MEDLINE   89066644
   PUBMED   3198591
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted by
            M.-L.Chu 28-SEP-1988.
FEATURES             Location/Qualifiers
     source          1..1008
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q22.3"
     gene            1..1008
                     /gene="COL6A1"
     misc_feature    1..1008
                     /gene="COL6A1"
                     /note="G00-119-065"
BASE COUNT      225 a    281 c    413 g     89 t
ORIGIN      123 bp upstream of KpnI site; chromosome 21q22.3.
        1 ggacctccgg gcctccgggg cgaccccggc tttgagggag aacgaggcaa gccggggctc
       61 ccaggagaga agggagaagc cggagatcct ggaagacccg gggacctcgg acctgttggg
      121 taccagggaa tgaagggaga aaaagggagc cgtggggaga agggctccag gggaccaaag
      181 ggctacaagg gagagaaggg caagcgtggc atcgacgggg tggacggcgt gaagggggag
      241 atggggtacc caggcctgcc aggctgcaag ggctcgccgg gttttgacgg cattcaagga
      301 ccccctggcc ccaagggaga ccccggcgcc tttggactga aaggagaaaa gggcgagcct
      361 ggagctgacg gggaggccgg gagaccagga gctcggggac catctggaga cgaggggcca
      421 gccggagagc ctgggccccc cggagagaaa ggagaggcgg gcgacgaggg gaacccagga
      481 cctgacggtg cccccgggga gcggggtggc cctggagaga gaggaccacg ggggacccca
      541 ggcccgcggg gaccaagagg agaccctggt gaagctggcc cgcagggtga tcagggaaga
      601 gaagggcccg ttggtgtccc tggagacccg ggcgaggctg gccctatcgg acctaaaggc
      661 taccgaggcg atgagggtcc cccagggtcc gagggtgcca gaggagcccc aggacctgcc
      721 ggaccccctg gagacccggg gctgatggga gaaaggggag aagacggccc cgctggaaat
      781 ggcaccgagg gcttccccgg cttccccggg tatcccggga acaggggcgc tcccgggata
      841 aacggcacga agggctaccc cggcctcaag ggggacgagg gagaagccgg ggaccccgga
      901 gacgataaca acgacattgc accccgagga gtcaaaggag caaaggggta ccggggtccc
      961 gagggccccc agggaccccc aggacaccaa ggaccgcctg ggccggac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X15879. Human mRNA for co...[gi:30031] Links  


LOCUS       HSCOL1N                  816 bp    mRNA    linear   PRI 31-MAR-1995
DEFINITION  Human mRNA for collagen VI alpha-1 N-terminal globular domain.
ACCESSION   X15879
VERSION     X15879.1  GI:30031
KEYWORDS    collagen; collagen alpha 1 type VI; glycoprotein.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 816)
  AUTHORS   Chu,M.L., Pan,T.C., Conway,D., Kuo,H.J., Glanville,R.W., Timpl,R.,
            Mann,K. and Deutzmann,R.
  TITLE     Sequence analysis of alpha 1(VI) and alpha 2(VI) chains of human
            type VI collagen reveals internal triplication of globular domains
            similar to the A domains of von Willebrand factor and two alpha
            2(VI) chain variants that differ in the carboxy terminus
  JOURNAL   EMBO J. 8 (7), 1939-1946 (1989)
  MEDLINE   90005396
COMMENT     See  for collagen VI alpha-1 C-terminal domain.
FEATURES             Location/Qualifiers
     source          1..816
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="F157"
                     /tissue_type="fibroblast"
                     /clone_lib="lambda ZAP"
     CDS             49..>816
                     /codon_start=1
                     /product="precursor polypeptide (AA -19 to 237)"
                     /protein_id="CAA33888.1"
                     /db_xref="GI:30032"
                     /db_xref="SPTREMBL:Q14041"
                     /translation="MRAARALLPLLLQACWTAAQDEPETPRAVAFQDCPVDLFFVLDT
                     SESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQG
                     LTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGGSHLKENKYLIVVTDGH
                     PLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADW
                     GQSRDAEEAISQTIDTIVDMIKNNVEQVCCSFECQPAR"
     sig_peptide     49..105
                     /note="signal peptide (AA -19 to -1)"
     mat_peptide     106..>816
                     /product="alpha-1 collagen VI chain (AA 1-237)"
BASE COUNT      168 a    255 c    262 g    131 t
ORIGIN      
        1 ggccctctct gccctggccg cgctgtgtgg tgaccgcagg cccgagacat gagggcggcc
       61 cgtgctctgc tgcccctgct gctgcaggcc tgctggacag ccgcgcagga tgagccggag
      121 accccgaggg ccgtggcctt ccaggactgc cccgtggacc tgttctttgt gctggacacc
      181 tctgagagcg tggccctgag gctgaagccc tacggggccc tcgtggacaa agtcaagtcc
      241 ttcaccaagc gcttcatcga caacctgagg gacaggtact accgctgtga ccgaaacctg
      301 gtgtggaacg caggcgcgct gcactacagt gacgaggtgg agatcatcca aggcctcacg
      361 cgcatgcctg gcggccgcga cgcactcaaa agcagcgtgg acgcggtcaa gtactttggg
      421 aagggcacct acaccgactg cgctatcaag aaggggctgg agcagctcct cgtggggggc
      481 tcccacctga aggagaataa gtacctgatt gtggtgaccg acgggcaccc cctggagggc
      541 tacaaggaac cctgtggggg gctggaggat gctgtgaacg aggccaagca cctgggcgtc
      601 aaagtcttct cggtggccat cacacccgac cacctggagc cgcgtctgag catcatcgcc
      661 acggaccaca cgtaccggcg caacttcacg gcggctgact ggggccagag ccgcgacgca
      721 gaggaggcca tcagccagac catcgacacc atcgtggaca tgatcaaaaa taacgttgag
      781 caagtgtgct gctccttcga atgccagcct gcaaga
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH006115. Human retinoid X ...[gi:1724040] Links  


LOCUS       HSCOLLA1                2255 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human retinoid X receptor beta (RXRbeta) gene, partial 3'
            transcript, and collagen alpha2(XI) (COL11A2) gene, exon 1.
ACCESSION   U41065
VERSION     U41065.1  GI:1724035
KEYWORDS    .
SEGMENT     1 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2255)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 2255)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..2255
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
                     /clone="c11a2e1"
     gene            1..156
                     /gene="RXRbeta"
     mRNA            <1..156
                     /gene="RXRbeta"
                     /product="retinoid X receptor beta"
     promoter        157..1230
                     /gene="COL11A2"
     mRNA            1231..>2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
     exon            1231..2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
                     /number=1
     mRNA            1233..>2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
     exon            1233..2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
                     /number=1
     mRNA            1235..>2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
     exon            1235..2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
                     /number=1
BASE COUNT      364 a    647 c    759 g    461 t     24 others
ORIGIN      
        1 aattgagggg aggtggccag cccgcagagg tggggtgctg gggctgcatg atttttgccc
       61 tgcgtccctt ctcttcgggg ctcctttccc ctctcataca taaaatcgct ttcaaattaa
      121 aatcgctgtt ttctggactg aggtgactgt atgaggatgg gcagcgcgtt tcgggcttgg
      181 ggggggtatc cggttgggag ctccggacgc atccctggac gccgccgccg aggttccgtg
      241 gggcccaaga ggtgcacgca gggtggggga cacaaggcaa gctttgctag gaggggggag
      301 agggtgtgcc aggnnggggg nggggccggc cggaggagga agggggnncg gtggattctc
      361 aaaggcgcct tgttcgctcg tgccctctcg cccagcgggc ggcggcgcct cggctcgctg
      421 cggccttcct ccgggcgctg cgggctccgg gcagacccgg cgccgtgccc gctcgtgggg
      481 cgcactggcg ccgagcgctg cgtgcgtctc agcgttgcgc ccggaggcgg cccacgtccc
      541 cgagtgaccc catttccctg gacccttcca accaggtcga ctnctcgctc atcttaccac
      601 tgcnncccct ctacntcggc tctgggactc cctaanngcg tcccccattt gcatctcctt
      661 ggacctgact atcctgtgtt tgtcggtggg tcccagtctc tnnactctnn ctctccgtag
      721 aacaccttna accacatacc ccnnaccagt tagncgctat ctatagctgt ctataaatac
      781 cccgcccgcc gctctgtaat tacacggcgg gtgagtaagt aattatggag acctgattat
      841 gggtggggtg ggactgcgac ccccaggccc gcctctcccc ctcccactgt gccctaaatc
      901 ccgcccaggc ctctgctgaa aggctctggg ccccnaagag ggagggaatg ggagggagtg
      961 tgtgtgactg gacgtttggg tcctagaaaa ggaaggggct agggaagata ttggggttcc
     1021 cgaaaagaga atcttagggt acaggccgtt gagacctaca aggggcagga gagagcgagc
     1081 gatagaggag gttcccgcct ccctccccag gtggagactg aggggtgggg gtttccttcg
     1141 gcggctgtgg gcgggcggct ggagtgcatg ggggccgnnc gggggcgggg gcgaggtggg
     1201 ggctctgggg gccagggtgg ccggggacac acagaagcgg cagccaccga ggagggagca
     1261 gtgccgggag ccccgacggc gccttgctgc atggagctgg gccgctgaca gctgtngtgc
     1321 ccgcagcctc tgacctccct gggaccccgg ngtctgaggc tcatagtctg ctccctgtct
     1381 tctgtcagcc tcagggcatc cagcgtctca ggccgacctg tccctgggac ccggcgtttc
     1441 gcttctcagc catggagcgg tgcagccgct gccatcgcct cctcctcctc ctacctctgg
     1501 tgctggggct gagcgcggcc ccaggctggg caggtaagag gagtcctgat gcctgggtcc
     1561 tcggagtcag gctggagctc tgagtgtctc tgggaagggg ccagctgatg cctggggcag
     1621 cagcttctga gtctaacaag gagatttggg gcttcaaggc tacaggctgg agggctggac
     1681 acctgtgttc cttggcatca gggtaggaat ctctttgtcc ttgaagtgac tgggggaggg
     1741 cagatagggc tgaaaggtgg aggaccactt tagagacctg aatggggatg gatgcttgtc
     1801 ttgagtcctt gggaagagct gaagatggcc agggccagcc cggtccagtc attgctggtg
     1861 tggggatggt agacgccaat gttgatgggt gaggcgtgtg tgtcttatgg cctttctgtc
     1921 gatatctctg ggataagggg tgagcacctg tgattcagag cagagcccag taagcccaga
     1981 aagaaaggaa gccttgacta cctgtctcat tggcctcctg gacagggtcc ccctccccct
     2041 ccctgtgccc tcgatcgtgc acgcagctgt ctggggagca ctggggctgg ctgccaacag
     2101 gacggcctcc ggacaggcca ggatcaggtg tctgagcaga gccaactctt tgcattcctg
     2161 cctagcccct cctccctctg gccagccagg ctcccctggg accctggtgc ccacaaccct
     2221 gatcttttcc ttctttttcc caggacccag aattc
//
LOCUS       HSCOLLA2                1667 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human collagen alpha2(XI) (COL11A2) gene, exons 2, 3, and 4.
ACCESSION   U41066
VERSION     U41066.1  GI:1724036
KEYWORDS    .
SEGMENT     2 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1667)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 1667)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..1667
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
                     /clone="c11a2e2e4"
     exon            411..560
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=2
     exon            692..902
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=3
     exon            1341..1503
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=4
BASE COUNT      380 a    446 c    419 g    419 t      3 others
ORIGIN      
        1 tggagnnaga cagaagagac agagcagaaa gaccaacata aagagacaga gagngtaaga
       61 gagtgagaga gacaggagac agatgaagag ttgagagaag tattcagaga gtacagagag
      121 agacagattc ataagctaaa agacaaagag agagcgtaag agaggcagat gctaggagag
      181 acaggggcag gagtgtcatg gggttgaatg ccagctgatg ctgccctgcc tccaggtacc
      241 tttgcatttc ccatccattc aggtactgtt cagaagcaag tgagggtgag gtgacctttc
      301 ccttgcaggg agtttgagcc ctggaaagct gagttctaga tcccaaatgt cccctattta
      361 tgccctcctg agggcatgtc ccttctcctg ataatcatgg actctcccag gtgcaccccc
      421 tgtggatgtg ctccgggccc tgaggttccc ctccctccct gatggtgtcc ggagagcgaa
      481 aggcatctgt ccagctgatg tggcctaccg agtggcacga cctgcccagc tcagtgcacc
      541 cactcgccag cttttcccag gtatgggtga catggtgggg taggcctggg gggaggtaat
      601 gggatggggc ctaggatcag acaccaggag gaaaggggtt gtggcggctc cctttgcctc
      661 tcactctgtg tgtatctctc tcggttacta ggaggatttc caaaagattt tcctctgctg
      721 actgttgtcc gcacccgccc tggtctccga gctcccctcc tgactctcta cagtgcccag
      781 ggtgtccgac agctgggcct ggagctgggc cgacctgtcc gcttcctgta tgaagaccag
      841 acagggcggc ctcaacctcc ctctcagcca gtcttccgag gcctcagcct agcagatggc
      901 aagtaagttt gtttgctcct ctggtctgcc tggcccacac tttcaggagg aagtgccccc
      961 aaacccctac actctaaact gtgaaaccct tgagaccctt tgggccacac cacacctacc
     1021 cactgcccaa acttcagtca cttctagtcc agagtttggg ctttagagtg tacagccctt
     1081 ctctgcatgt tgaactagcc tgtaccttgg gcaagttact taaaattttt gagcctcagt
     1141 ttctacatct gtaaaataga cattaaatag aattggcaca aataagaaag tgacggcatg
     1201 gtgtctggtg cactgtaaac actcaataaa tggtagccat tgttaccatt ggcttcaact
     1261 cttatcactg ctctccaacc atgttgactc ccctacttca gtcagccaag agttcacttg
     1321 aacctcttcc accatttcag gtggcaccgt gtggctgtgg ctgtgaaggg ccagtctgtc
     1381 accctcattg ttgactgcaa gaagcgagtc acccggcctc tcccccgaag tgctcgtcca
     1441 gtattggaca cccatggagt gatcatcttt ggtgcccgta ttctggatga agaagtcttt
     1501 gaggtaacca gagcaatcag aggcaggatt gacttctggt cccctatctt gtgcccacta
     1561 cccttctggc cccagcatgt catcttccta ttccctaggc cttcattttt ttttcttatg
     1621 aattttcatt tctattttct atcttcaagg gctactgttc tctgcag
//
LOCUS       HSCOLLA3                 966 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human collagen alpha2(XI) (COL11A2) gene, exon 5.
ACCESSION   U41067
VERSION     U41067.1  GI:1724037
KEYWORDS    .
SEGMENT     3 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 966)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 966)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W., Nicholls,J. and Cheah,K.S.
  TITLE     Extensive alternative splicing within the amino-propeptide coding
            domain of alpha2(XI) procollagen mRNAs. Expression of transcripts
            encoding truncated pro-alpha chains
  JOURNAL   J. Biol. Chem. 271 (28), 16945-16951 (1996)
  MEDLINE   96279277
   PUBMED   8663204
REFERENCE   3  (bases 1 to 966)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..966
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
                     /clone="c11a2e5"
     exon            328..519
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=5
BASE COUNT      230 a    238 c    257 g    238 t      3 others
ORIGIN      
        1 gaattctgga aggcctggga tctggtcctg attctgccac tctgggtgac cttggggcag
       61 tcattgcctt tttgggcttt tctgaatgga acgtatctga aaaatgggta gaaagaaaga
      121 tccttgttct gccaactttc ctggttgtta tgaaggtcct atgagatgcc ttgaaaactg
      181 aaaagggcta tgcatatgga ggactgacag tgaccttctc ccatcttagg gttttaccta
      241 ggattggcct cctatactct ttctcccgga tgataccctc tgcctctctc tgaatctctc
      301 cgctccatct ctctcatgtc tttgcagggt gatgtccagg agctggccat tgtcccaggg
      361 gtccaggcag cctatgaatc atgtgaacag aaggagctgg aatgcgaggg gggccagagg
      421 gaaagacccc aaaaccaaca gcctcacaga gcccagagat ctccacagca gcaaccatca
      481 agacttcaca ggccacaaaa tcaggaaccc cagagccagg tgagggagct gggagaaccc
      541 ccaantgcag ctctatcacc ccacgagtat gggaatgaca cccatgtgac actctctcct
      601 ccttagttgt cctccccacc ctcatctccc tattcccatc acccccctcc tcctagtcct
      661 cattcatcag cctttttatt ttcatctaaa aataaaaaga gttgttatga agaatttgtg
      721 gctggagctg tgttccctgc tctgcgtctc ctcttccatc ttcctggaac tgtgtccctg
      781 ggttttccct tccttctttg aattaaggat ggtattggga nacagtggga gaggggtgag
      841 agctggggag gcaggtagga gaggggactg aagccaggag aaagcagtcg ggatggtgag
      901 accaaggagg aggaatggga aggagtaggg atggagggaa tatcaaagga gggngtgggt
      961 ctgcag
//
LOCUS       HSCOLLA4                7784 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human collagen alpha2(XI) (COL11A2) gene, exons 6 through 16, and
            partial cds.
ACCESSION   U41069
VERSION     U41069.1  GI:1724038
KEYWORDS    .
SEGMENT     4 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 7784)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 7784)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W., Nicholls,J. and Cheah,K.S.
  TITLE     Extensive alternative splicing within the amino-propeptide coding
            domain of alpha2(XI) procollagen mRNAs. Expression of transcripts
            encoding truncated pro-alpha chains
  JOURNAL   J. Biol. Chem. 271 (28), 16945-16951 (1996)
  MEDLINE   96279277
   PUBMED   8663204
REFERENCE   3  (bases 1 to 7784)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..7784
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
     mRNA            join(U41065.1:1231..1533,U41066.1:411..560,
                     U41066.1:692..902,U41066.1:1341..1503,U41067.1:328..519,
                     425..502,1144..1206,1864..2043,5003..5062,5168..5209,
                     5443..5505,5835..5909,6361..6447,6682..6738,6900..6953,
                     7058..>7111)
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="contains exon 7C; alternatively spliced exon 7"
     CDS             join(U41065.1:1452..1533,U41066.1:411..560,
                     U41066.1:692..902,U41066.1:1341..1503,U41067.1:328..519,
                     425..502,1144..1206,1864..2043,5003..5062,5168..5209,
                     5443..5505,5835..5909,6361..6447,6682..6738,6900..6953,
                     7058..>7111)
                     /gene="COL11A2"
                     /note="contains exon 7C; alternatively spliced exon 7"
                     /codon_start=1
                     /product="collagen alpha2(XI)"
                     /protein_id="AAC17464.1"
                     /db_xref="GI:1724041"
                     /translation="MERCSRCHRLLLLLPLVLGLSAAPGWAGAPPVDVLRALRFPSLP
                     DGVRRAKGICPADVAYRVARPAQLSAPTRQLFPGGFPKDFPLLTVVRTRPGLRAPLLT
                     LYSAQGVRQLGLELGRPVRFLYEDQTGRPQPPSQPVFRGLSLADGKWHRVAVAVKGQS
                     VTLIVDCKKRVTRPLPRSARPVLDTHGVIIFGARILDEEVFEGDVQELAIVPGVQAAY
                     ESCEQKELECEGGQRERPQNQQPHRAQRSPQQQPSRLHRPQNQEPQSQPTESLYYDYE
                     PPYYDVMTTGTTPDYQDPTPGEEEEILESSLLPPLEEEQTDLQVPPTADRFQAEEYGE
                     GGTDPPEGPYDYTYGYGDDYREETELGPALSAETAHSGAAAHGPRGLKGEKGEPAVLE
                     PGMLVEGPPGPEGPAGLIGPPGIQGNPGPVGDPGERGPPGRAGLPGSDGAPGPPGTSL
                     MLPFRFGSGGGDKGPVVAAQEAQAQAILQQARLALRGPPGPMGYTGRPGPLGQPGSPG
                     LKGESGDLGPQGPRGPQGLTGSLGKAGRR"
     exon            425..502
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=6
     exon            1144..1206
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="exon 7C; potential alternatively spliced exon 7"
     exon            1864..2043
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=8
     exon            5003..5062
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=9
     exon            5168..5209
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=10
     exon            5443..5505
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=11
     exon            5835..5909
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=12
     exon            6361..6447
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=13
     exon            6682..6738
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=14
     exon            6900..6953
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=15
     exon            7058..7111
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=16
BASE COUNT     1536 a   2078 c   2064 g   2085 t     21 others
ORIGIN      
        1 ctgcagaagt gtgggagtgg ggaaccagtc tctggccccc acctacctgg aggggcagga
       61 aaaggggagg tagtaggggg aagggaaggg aagggaaggg aagggagcct gggttggatg
      121 gggtctggag agtaagaata cagataggct ctggggtcag tggtctaaag atcagagggt
      181 caatctctta atcatgtctg cttctgcccc acatgtcgaa ccccttccct ctnnnnccac
      241 ctctccccac ctccctcctg accctcacct ctcctttcat cttcaactct ctgtttccca
      301 actccacacc tccccnattt cccaactccc nanaccatct ttatatctcc ctctcccgcc
      361 cttcacccca caccttccaa ttaccccatc cccttcctgt ctatacccac cccccaatcc
      421 ccagcccact gagtctctct actatgacta cgagcccccc tattatgatg tgatgactac
      481 ggggacaacc cctgattatc aggtaaacnn aagggacccc ttcatttctt ccctttctcc
      541 cagcaccccc agcaatcatc tcctaggaca ccttcctccc ctcccccatg ccgtggggag
      601 aggcacagtg tgtttagatt atggggattt acttagttcc tgccccctcc ctgtctctct
      661 gctgcagcgc cctgggagcc cccaccacag gcctccccct tcccctaggg gcatcggacc
      721 cctccctgaa agcctacccc acctgccaag aggtcaatga cctggcaggc gtccccctca
      781 ccaggccgga ggccctctcc aactgcaggc tttgacccag catggagcag cctccccttt
      841 cccgatgccc agctcggagg gcagcttccc ccactgcctg gccccgcccc aggaccagag
      901 gcagcagtta ccggcagtag agccgggatc cagatctctg cccccaaacc tctatgacct
      961 tggaggttca ctctccccaa actatccccc tccaaagggg ctgcaagatc tgagatttcc
     1021 cccatccccc agcacagaca gaccctgccc tcgggggaca ggtgctgctt tgcctccctt
     1081 ctgctaacct ttctccttcc ttcccacccc gtgctactct tctccttctc tccttggggt
     1141 caggacccca ccccaggtga agaggaagaa atcctggagt cgagcctctt gccacccctt
     1201 gaggaggtaa ctctatgccc caccctcacc catctggggg cactgggnac ctgtgactca
     1261 ctcccctcag cccatgacgt cttgatggac cagcccctcc tctggccgga gggttcttac
     1321 catgcatttc catcgatggc ctctgtcttt gggtcactct agaactatgg gcctgcttga
     1381 ctctctaagt ccagcccgtt cccttcagta tccaggcctc cttccatgat tctttgattc
     1441 ctgagtgact tcgtctccat gttggtgtct aatggtctta ctttctatgt ggtatttctg
     1501 ttcccgggag ccacttnctg tccagtgttg tcaccttttg tctnccctat ctctttggtc
     1561 attcctggac tcctttgtct gtcattgtgt ccatggactg tccactctgc atccagtcct
     1621 ctggccaccc ttcattttat ccacactctc cactgaccac ttctgacatc tctcccattt
     1681 tgcctttgat ggcccttcca tttatataca accactccca tggctattct tggagaaaaa
     1741 atccagttct gattaacctt ggaatccttt gggctagaag ttgtcacacc ctggggtcca
     1801 gggtaagcct cctctcagcc ctaacatccc catctacccc tctccctccc catccctggg
     1861 aaggagcaga cagatctcca ggtccccccc acagccgaca ggttccaggc agaggaatat
     1921 ggggagggtg gcacagaccc ccctgaaggg ccctacgatt acacctatgg ctatggggat
     1981 gattatcgtg aggagacaga gcttggccct gccctctctg cggagacagc ccactcagga
     2041 gccgtaagtg aaaggagctt ctctttcatt cttgactacc agtgggagct acaagaattg
     2101 cttcttgttg gtttcgatcc ttggctgttg gctgtgagct atttcccatc tgtgacttgt
     2161 catgatgtat atgttgcttg aaaagattac tacctgcctt cctttgccaa atgtcttacc
     2221 ttgtgactct tgtggatact tctgaagctc agggcaattg ggaattaaaa aaaaaaaata
     2281 tgtagccact tcctgtcttt cttgatcact tcctgtctaa tcgaagctgg atttcataaa
     2341 gcttttggtc aataaactac taccttgtgt ccattttcag tgtttctcta tcatttcttc
     2401 ttggtgaaat aattttttgt tcatgtttct accaaacatc taacttccta tgtcaccttt
     2461 ttttaaccta caggccactt attatctgtc ccttcgctgc cttctccagc tgtggcatcc
     2521 atcttggtaa ctgtgtcagg ttggactgca aaggatttat tatttctgtt ctggctctag
     2581 tttttcatgc tatcttgtga caatttcctt tgacaataag ccacttcctg tatgtctcag
     2641 gtcatttaaa tcataaagtt tccattgtgg gcctcgatat ttctcatggc tacttcctgt
     2701 gtgtcctagc cttggatgtt tgtgttttct ggaagaatta cccttcactg tcatgaattc
     2761 cctttgcagg agtcacattc tacctgtttt gtgcctttct tttctctctg ggtcacggta
     2821 gccatgttgt tgttgaaggt tcacttcctg gatcctcttt ccttcactgc ttcctgtctg
     2881 ttcagccact tcctgtttgt ccagtttgtt ttctagccat gttggccatg tttaattttc
     2941 ttcggccctg gcatcattct tgcttccaag tgtttgtcat tttgttctgt gtcccttagt
     3001 tgttctcata tcttccagta ttccctgtca gtgggactgc ttcctgtcta ctccgctccg
     3061 tcttctgcct gccttgcagt tctggctgcc tcctctaccc tgctgccgca tcccatacct
     3121 cccctctctt gacctccctt gagcttctac tgcctttcct ttatgctgct ctctgggtat
     3181 ttggggttga tatgggattt cagagaattt tgaaacggat cattggctgt ttggtcagaa
     3241 tgaaggtgga gaggggcatt ctgaagatct ttgttggggg gagttggggt ggtggtgaat
     3301 gtctcccatt ctccatgttc ttattttgtt tctggcccta gagttgggaa caggtatctg
     3361 agtggaggtg tgcaagggag ggaactggtg ggcttggaat gtacttgagg gccagaggaa
     3421 gggcaatgca ttcagtgggg actgctctcc aatttttttt ttttttgagg cagggtctca
     3481 ctctgtcacc caggttggag tgcagtggca ggatcttggc ttactgcaac ctctgcctcc
     3541 tggnnntcaa gcgatcctct gcctcagnct cccgagtagc tggggttaca ggcatgtgca
     3601 ctgcgcccgg ctattttttg tatttttagt agagacgggg tttcacatgt tggccaggct
     3661 ggtctcaaac tactgacctc aaataagctg cctgtttcgg ccccacaaag tgctgggatt
     3721 acaggcgtgc gccaccgtgc ctggcctcca aatttactta gagtatggat gcttaacctt
     3781 tggggcatat ttaaacttgt gaaccctttg aaatcctaaa taaaattctg tgcataatag
     3841 catatacttt ttcatttctt ggagaaagac tcacagcttt catcagattc ttttttttct
     3901 ttgagatgga gttttgctct tgttgcctag gctggagtac gatggcataa tcttggttca
     3961 cttcaacctc cacctcctgg gttcaagcta ttctcctgct tcagcctccc aagtagctgg
     4021 gattacaggc atgtgccacc acacccagct aattttgtat ttttattaga gacggggttg
     4081 caccatgttg gtcaggctgg tctcaaactc ctgacctcag gtgatctgcc tgcctccgcc
     4141 tcccaaagtg ctgggattat gggcgtaacg acctcgcccg gcctagcttt tatcaggttc
     4201 tcatgtgggt ccatgccctc ctcttacccc aaataaaaat ctacccacaa tcttagagag
     4261 attcttttat gatgtattgg ggtaaatgtg tggaaggttt cttaaggttc tagggaggga
     4321 gtttgggaga tgggataggg tcctctagaa gagggtttgg gaaggtgggt ggtttaaaat
     4381 ccctgaaagg ggccaggtgt ggtgctcacg cctgtaattc cagcacttta ggaggccaag
     4441 gcaggcggat cacctgaggt caggactttg agaccagcct ggccaacacg gtgaaacccc
     4501 atctctaata aaaatacaaa aattagccgg gcatggtggt gcacatctgt aatcccagtg
     4561 taatcccagc tacttgggag gctgaggcag gagaatcact tgaacctgga aggcagaggt
     4621 tgtagtgagc cgagatcgcg ccactatact tcagcctggg tgacagagca agacttcgtc
     4681 tcaaaaaaaa agaaaaacaa acaaacaaac aaaaacactg gaagtttctg ggaccttcct
     4741 gaaagaggct ctcagggaga tggagaggtt gtgtttgctg aggtggggct gggaactggg
     4801 ggggcaggga accctgaggt ctccttggca ggcttgaaag gtttgtgaag agggagtttt
     4861 aggngagggc tgtggggctt tcgggtaggg ctgaagtggt ctcgagagga tctaagaatc
     4921 aaggttggag ttgagatgga gaaaggccta gtgtcctggc tgctcacggg ccccactggg
     4981 gctacatgtg tgtctccttc aggctgccca tggaccccga gggctgaagg gagagaaagg
     5041 agagcctgca gtgttggaac ctgtaagtta tgctggtcac agggctgagg cagtggaaat
     5101 aggagaagca agtggggttg agtgtgctgg tcctgtcgcc tctgattttt aacctttgac
     5161 tccacagggt atgctcgtgg aggggccccc tggcccagaa gggcctgcgg taagtctagc
     5221 agtgacctgg tggccattct tttcttagaa accccttcta tgtgctcatc tgagccttcc
     5281 ccacatatgc ccaggcctcc tgctcagaac tcgggagcat ccctccaatc aacgcttccc
     5341 agattcccag atctgttctg cacagaccac tcctcagcca caggctggat gtccacacct
     5401 gtctgaatgc ccacacctga cccctactct tttgttcctc agggattgat tggtccccct
     5461 ggcatccagg ggaacccagg cccagttgga gaccctggag agagggtaag ggggtgtcct
     5521 ccatgtgacg ggggagcttc gggggagcta gtggtatcca agcggggggc taccatgccc
     5581 agtaatttcc tgttaggtgg cttgttaatg aatgagggga tggttgtgct tcccagcaac
     5641 cagaaaggag ggcagactct ggttggggag ggtgctgatc tggtttctca ccctcaattt
     5701 tccccatggg gtgggagact caggaaagag ctagagcaat ctacacttat tctgttgaag
     5761 aggatatttc aaaatcatgc taagatcttg atgtccctta tctnacccca tctttgcatc
     5821 tcttcccact ccagggcccc cctggccgag cagggctccc tggatcagat ggggctcctg
     5881 gtcctcctgg cacatctctc atgctcccag tgagttgtct cttgggtttt ggaacatgct
     5941 gatggggaag acaaggaatt gtgtcatgtt accaagaacc agatgggcag gaaagatatg
     6001 gaggagtctc taaagatcat caaagatcat cactccaaag ggattcgtct ttggggntgg
     6061 ggagatgggg cctcattagg tgacctagaa caaaacacca gtcttctagg actgggcaga
     6121 acagcctata tttcaggagc tctagaacca aaagagagtc tttctggagg cagaggctgg
     6181 acagtggagg agggcaggag ttgagtttct ggaatcctct aagtgattat ctggaagcca
     6241 tgggaaatgc tggatgggca gaacctgggg caaggggcaa aacgctggag aaaccgtctg
     6301 gctgggaggg tcccatggct ttcttggaag agaacctgga ccttctctcc ttgccctcag
     6361 ttccggtttg gcagtggtgg gggtgacaag ggccctgtgg tggcggccca ggaggctcag
     6421 gcccaggcga tcctgcagca ggcgagggtg agtggggctg cttccctgga aaggaaggct
     6481 ctggggggca ctggaagggt tggctggagt gagtgtgccg caggggaggc ctgggttggt
     6541 ttggggttaa cagggaggtt ggggagaatc ctgggcagtg gatgaggatc aggagatgga
     6601 agggcttgaa ttttggagag tgctggggct gggggttccc actgcctgtc cctcagctct
     6661 gctgtccccc tgctccccta gctggcgctc cgtggacccc ctggccccat gggatacaca
     6721 gggcgccctg gacccttggt gcagtgagca gggtgctcgg gtggaggatg ctttaattgt
     6781 gtgtggggtg tggatagttc tggaaagggg cttcctggaa aggggcttct tttggggaac
     6841 actccgaggc cactgttggc tgtgtccttc ttacggccat cctctcttct ctctcccagg
     6901 gccaacctgg gagccctggc ctgaaaggag agtctggaga cttaggacct caggtgacna
     6961 ctttcctcca ctgnaccccc atatctgttc accagctcag tctccctcac tcccctccct
     7021 atgcctccta accccacccc atctctcctc tcaccagggc cccagaggac ctcagggcct
     7081 cacaggctcc ctgggcaagg ctgggcgaag ggtgagtgcc cctggggtgg atgggtgttg
     7141 tggggagaca gggcctgcag ggtaggggac tgcggcctgc ttgttctgac acttcccttg
     7201 ttctcccagg gccgggcagg tcctgatgga gcccgaggga ccctgggaga tcctggagtg
     7261 aaggtaacag gcttgggccc ctccctgaag cctgtagcct tcagcccacg ctgggctcag
     7321 ttgtttcttg gggatgacct agttctccag gcttccccag gatcagcacc cccccagctc
     7381 agggtgcagt agggaggggt gggcttggac agggtcgtgt cctcactctg gctatccttc
     7441 ctcctcctag ggtgaccgag gttttgatgg actcccaggg ctccctggag agaagggcca
     7501 tagggtgtga tacattagtg ggtgtgtgta attggggatc ttctatggag tgaggatagg
     7561 taggcaagga ggctggaggc ttggcaggag ctcagtgaaa gtaatggatg ggctaagtgc
     7621 agaggttcgg tgtctgcctg tgttgggctc tgaagcccct cttatgagtt cccccatttt
     7681 gcagggtgat actggtgccc agggccttcc tgtccccctg gtgaggatgg agagagggta
     7741 agtgtagtgg caaatgtagg ggctgggtgc aggggaggtc taga
//
LOCUS       HSCOLLA5                2199 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human collagen alpha2(XI) (COL11A2) gene, exons 61 and 62, and
            partial cds.
ACCESSION   U41068
VERSION     U41068.1  GI:1724039
KEYWORDS    .
SEGMENT     5 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2199)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 2199)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..2199
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
     gene            join(U41065.1:157..2255,U41066.1:1..1667,U41067.1:1..966,
                     U41069.1:1..7784,1..2199)
                     /gene="COL11A2"
     CDS             join(<143..349,799..939)
                     /gene="COL11A2"
                     /codon_start=1
                     /product="collagen alpha2(XI)"
                     /protein_id="AAC17465.1"
                     /db_xref="GI:1724042"
                     /translation="FSYVDSEGSPVGVVQLTFLRLLSVSAHQDVSYPCSGAARDGPLR
                     LRGANEDELSPETSPYVKEFRDGCQTQQGRTVLEVRTPVLEQLPVLDASFSDLGAPPR
                     RGGVLLGPVCFMG"
     exon            143..349
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=61
     exon            799..939
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=62
     3'UTR           940..2199
                     /gene="COL11A2"
BASE COUNT      460 a    606 c    642 g    489 t      2 others
ORIGIN      
        1 caagtctgct cagcatcctg ctcatgactc tgctcagctg tcctgatcat gtagatgact
       61 cctgaggcca tggtctctgt ctcctctgag gccatagacc tctctctccc cctctgaggc
      121 cagccctctc tctcctttcc agttctctta cgtggactca gagggctccc cagtgggtgt
      181 ggtccagctn accttcctgc ggctgctcag cgtctcagcc caccaggacg tctcctaccc
      241 ctgctctgga gcagcccgtg acggtcccct gagactccgt ggggccaatg aggatgagct
      301 gagcccggag actagcccct atgtcaaaga attcagagat ggctgccagg tgggaacagg
      361 aagagctggg ttgggggctg atctcagact caggctggag gaaggaggtg ggaagacccc
      421 ttgggcaggg cacccagggg gagcagggag gagtccgtcc tgctggattg tcagggatgc
      481 ctaggggagc tcaaaagccg gtgaggaagg tggaaaggat ggaagccatc gggtagggtt
      541 cacttgggta caaacaccac gtgacacacg gttaatgcag acgtgtacac agactaacac
      601 atggctgccc agcagaggca cacggtgggg gagaagcaaa cacacacgca tgggcacata
      661 gacacatgcc agtctgtaca tcctgagtca ccctgatggg gccaaggtgc tcagaggaga
      721 ggtgagcctg ggcctgaggt tagagggtgg ggggtgcctc catcctgctc acactttctt
      781 ccttgtctcc ccctgaagac acagcaaggc cggacggtgc tggaggtgcg aacgcctgtg
      841 ctggagcagc tgccagtgct ggatgcctcc ttctcagacc tgggagcccc accgcggcgg
      901 ggaggggtgc tgctggggcc tgtctgcttc atgggctagg accgtctctg tctgatcctg
      961 tccattcgga accaggccca cctggaatcc cacaacatca gctctgtgcc acctcccaag
     1021 agggctcctc actatctagg gagccctggg ccagggcgtg gagagccctc agtcggggca
     1081 ggccagggga ggggtgaagt ggttgcctgg acaccccacg ggaggagtgg catctggggc
     1141 tcttggccct cccacctgga gcctgttacc cgttagagag ctgagaccct tatttaaaac
     1201 tcacctccca atcaccccaa acaaatggaa gagaagagaa aggacatggc gtattttgta
     1261 tttaaaagta attgtattaa ttatttaaag tgtggaaagc aaaataacaa aaaagagana
     1321 cgccaacaaa aaatcagcag atgttgaaga caggggtctc gggggtgggc tccggcaccc
     1381 acatctgagt caggactttc ctcagtgact gtgtgtaggg gggttcaggg ctgaacccac
     1441 ctccctccca ccttcctccc acctcacctg tcgcacccac tgtgaaagtt ggaatatgtg
     1501 gtctccctgg cctcagggct ctgactctgc cagggtgggg ctctctaacc cacaggtgtt
     1561 ggctgcctgg cccatgtgcc cactgtctct tccacttggt ctgggtttgg caggcactgc
     1621 tgctacttga gggccaggat gctcccccag ggaagaaacg gaatagtgtg gggtgtgtgc
     1681 agggctgcat ccgcagatgg ctggaatatt aaaattcttc tatattggct ggtaaattgc
     1741 catggccctg agccactgag tatgttcatt gccacccctg tcctcccctg ggcacccctc
     1801 actttccctg atcctgcaat taaagggtta atgtgtggca tatggaaggg actcccagga
     1861 ccctgtgccc agcttccatg ctgactgatg gttaaataat gtgattgtct cctcccaggt
     1921 gtctgtgtca ctgcttgtgt tgttatttca gtctcccccg acacccatct gatgcttcct
     1981 cttcccagct aagtggttac cagaattgta tggcttaatc cagataccct gcaagccctg
     2041 tccagctggg gtgtcagggc ccagagttta caaagcctgg gtgactagca taaaattaca
     2101 aacaatgccc cagggacacc aggtaggggt ttggggtcct atggctccag tttctgacac
     2161 gggtgtgcga ttgcttctgg gtgtgggtgg gacgtgcta
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH006115. Human retinoid X ...[gi:1724040] Links  


LOCUS       HSCOLLA1                2255 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human retinoid X receptor beta (RXRbeta) gene, partial 3'
            transcript, and collagen alpha2(XI) (COL11A2) gene, exon 1.
ACCESSION   U41065
VERSION     U41065.1  GI:1724035
KEYWORDS    .
SEGMENT     1 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2255)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 2255)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..2255
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
                     /clone="c11a2e1"
     gene            1..156
                     /gene="RXRbeta"
     mRNA            <1..156
                     /gene="RXRbeta"
                     /product="retinoid X receptor beta"
     promoter        157..1230
                     /gene="COL11A2"
     mRNA            1231..>2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
     exon            1231..2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
                     /number=1
     mRNA            1233..>2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
     exon            1233..2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
                     /number=1
     mRNA            1235..>2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
     exon            1235..2255
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="alternative transcription start"
                     /number=1
BASE COUNT      364 a    647 c    759 g    461 t     24 others
ORIGIN      
        1 aattgagggg aggtggccag cccgcagagg tggggtgctg gggctgcatg atttttgccc
       61 tgcgtccctt ctcttcgggg ctcctttccc ctctcataca taaaatcgct ttcaaattaa
      121 aatcgctgtt ttctggactg aggtgactgt atgaggatgg gcagcgcgtt tcgggcttgg
      181 ggggggtatc cggttgggag ctccggacgc atccctggac gccgccgccg aggttccgtg
      241 gggcccaaga ggtgcacgca gggtggggga cacaaggcaa gctttgctag gaggggggag
      301 agggtgtgcc aggnnggggg nggggccggc cggaggagga agggggnncg gtggattctc
      361 aaaggcgcct tgttcgctcg tgccctctcg cccagcgggc ggcggcgcct cggctcgctg
      421 cggccttcct ccgggcgctg cgggctccgg gcagacccgg cgccgtgccc gctcgtgggg
      481 cgcactggcg ccgagcgctg cgtgcgtctc agcgttgcgc ccggaggcgg cccacgtccc
      541 cgagtgaccc catttccctg gacccttcca accaggtcga ctnctcgctc atcttaccac
      601 tgcnncccct ctacntcggc tctgggactc cctaanngcg tcccccattt gcatctcctt
      661 ggacctgact atcctgtgtt tgtcggtggg tcccagtctc tnnactctnn ctctccgtag
      721 aacaccttna accacatacc ccnnaccagt tagncgctat ctatagctgt ctataaatac
      781 cccgcccgcc gctctgtaat tacacggcgg gtgagtaagt aattatggag acctgattat
      841 gggtggggtg ggactgcgac ccccaggccc gcctctcccc ctcccactgt gccctaaatc
      901 ccgcccaggc ctctgctgaa aggctctggg ccccnaagag ggagggaatg ggagggagtg
      961 tgtgtgactg gacgtttggg tcctagaaaa ggaaggggct agggaagata ttggggttcc
     1021 cgaaaagaga atcttagggt acaggccgtt gagacctaca aggggcagga gagagcgagc
     1081 gatagaggag gttcccgcct ccctccccag gtggagactg aggggtgggg gtttccttcg
     1141 gcggctgtgg gcgggcggct ggagtgcatg ggggccgnnc gggggcgggg gcgaggtggg
     1201 ggctctgggg gccagggtgg ccggggacac acagaagcgg cagccaccga ggagggagca
     1261 gtgccgggag ccccgacggc gccttgctgc atggagctgg gccgctgaca gctgtngtgc
     1321 ccgcagcctc tgacctccct gggaccccgg ngtctgaggc tcatagtctg ctccctgtct
     1381 tctgtcagcc tcagggcatc cagcgtctca ggccgacctg tccctgggac ccggcgtttc
     1441 gcttctcagc catggagcgg tgcagccgct gccatcgcct cctcctcctc ctacctctgg
     1501 tgctggggct gagcgcggcc ccaggctggg caggtaagag gagtcctgat gcctgggtcc
     1561 tcggagtcag gctggagctc tgagtgtctc tgggaagggg ccagctgatg cctggggcag
     1621 cagcttctga gtctaacaag gagatttggg gcttcaaggc tacaggctgg agggctggac
     1681 acctgtgttc cttggcatca gggtaggaat ctctttgtcc ttgaagtgac tgggggaggg
     1741 cagatagggc tgaaaggtgg aggaccactt tagagacctg aatggggatg gatgcttgtc
     1801 ttgagtcctt gggaagagct gaagatggcc agggccagcc cggtccagtc attgctggtg
     1861 tggggatggt agacgccaat gttgatgggt gaggcgtgtg tgtcttatgg cctttctgtc
     1921 gatatctctg ggataagggg tgagcacctg tgattcagag cagagcccag taagcccaga
     1981 aagaaaggaa gccttgacta cctgtctcat tggcctcctg gacagggtcc ccctccccct
     2041 ccctgtgccc tcgatcgtgc acgcagctgt ctggggagca ctggggctgg ctgccaacag
     2101 gacggcctcc ggacaggcca ggatcaggtg tctgagcaga gccaactctt tgcattcctg
     2161 cctagcccct cctccctctg gccagccagg ctcccctggg accctggtgc ccacaaccct
     2221 gatcttttcc ttctttttcc caggacccag aattc
//
LOCUS       HSCOLLA2                1667 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human collagen alpha2(XI) (COL11A2) gene, exons 2, 3, and 4.
ACCESSION   U41066
VERSION     U41066.1  GI:1724036
KEYWORDS    .
SEGMENT     2 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1667)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 1667)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..1667
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
                     /clone="c11a2e2e4"
     exon            411..560
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=2
     exon            692..902
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=3
     exon            1341..1503
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=4
BASE COUNT      380 a    446 c    419 g    419 t      3 others
ORIGIN      
        1 tggagnnaga cagaagagac agagcagaaa gaccaacata aagagacaga gagngtaaga
       61 gagtgagaga gacaggagac agatgaagag ttgagagaag tattcagaga gtacagagag
      121 agacagattc ataagctaaa agacaaagag agagcgtaag agaggcagat gctaggagag
      181 acaggggcag gagtgtcatg gggttgaatg ccagctgatg ctgccctgcc tccaggtacc
      241 tttgcatttc ccatccattc aggtactgtt cagaagcaag tgagggtgag gtgacctttc
      301 ccttgcaggg agtttgagcc ctggaaagct gagttctaga tcccaaatgt cccctattta
      361 tgccctcctg agggcatgtc ccttctcctg ataatcatgg actctcccag gtgcaccccc
      421 tgtggatgtg ctccgggccc tgaggttccc ctccctccct gatggtgtcc ggagagcgaa
      481 aggcatctgt ccagctgatg tggcctaccg agtggcacga cctgcccagc tcagtgcacc
      541 cactcgccag cttttcccag gtatgggtga catggtgggg taggcctggg gggaggtaat
      601 gggatggggc ctaggatcag acaccaggag gaaaggggtt gtggcggctc cctttgcctc
      661 tcactctgtg tgtatctctc tcggttacta ggaggatttc caaaagattt tcctctgctg
      721 actgttgtcc gcacccgccc tggtctccga gctcccctcc tgactctcta cagtgcccag
      781 ggtgtccgac agctgggcct ggagctgggc cgacctgtcc gcttcctgta tgaagaccag
      841 acagggcggc ctcaacctcc ctctcagcca gtcttccgag gcctcagcct agcagatggc
      901 aagtaagttt gtttgctcct ctggtctgcc tggcccacac tttcaggagg aagtgccccc
      961 aaacccctac actctaaact gtgaaaccct tgagaccctt tgggccacac cacacctacc
     1021 cactgcccaa acttcagtca cttctagtcc agagtttggg ctttagagtg tacagccctt
     1081 ctctgcatgt tgaactagcc tgtaccttgg gcaagttact taaaattttt gagcctcagt
     1141 ttctacatct gtaaaataga cattaaatag aattggcaca aataagaaag tgacggcatg
     1201 gtgtctggtg cactgtaaac actcaataaa tggtagccat tgttaccatt ggcttcaact
     1261 cttatcactg ctctccaacc atgttgactc ccctacttca gtcagccaag agttcacttg
     1321 aacctcttcc accatttcag gtggcaccgt gtggctgtgg ctgtgaaggg ccagtctgtc
     1381 accctcattg ttgactgcaa gaagcgagtc acccggcctc tcccccgaag tgctcgtcca
     1441 gtattggaca cccatggagt gatcatcttt ggtgcccgta ttctggatga agaagtcttt
     1501 gaggtaacca gagcaatcag aggcaggatt gacttctggt cccctatctt gtgcccacta
     1561 cccttctggc cccagcatgt catcttccta ttccctaggc cttcattttt ttttcttatg
     1621 aattttcatt tctattttct atcttcaagg gctactgttc tctgcag
//
LOCUS       HSCOLLA3                 966 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human collagen alpha2(XI) (COL11A2) gene, exon 5.
ACCESSION   U41067
VERSION     U41067.1  GI:1724037
KEYWORDS    .
SEGMENT     3 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 966)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 966)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W., Nicholls,J. and Cheah,K.S.
  TITLE     Extensive alternative splicing within the amino-propeptide coding
            domain of alpha2(XI) procollagen mRNAs. Expression of transcripts
            encoding truncated pro-alpha chains
  JOURNAL   J. Biol. Chem. 271 (28), 16945-16951 (1996)
  MEDLINE   96279277
   PUBMED   8663204
REFERENCE   3  (bases 1 to 966)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..966
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
                     /clone="c11a2e5"
     exon            328..519
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=5
BASE COUNT      230 a    238 c    257 g    238 t      3 others
ORIGIN      
        1 gaattctgga aggcctggga tctggtcctg attctgccac tctgggtgac cttggggcag
       61 tcattgcctt tttgggcttt tctgaatgga acgtatctga aaaatgggta gaaagaaaga
      121 tccttgttct gccaactttc ctggttgtta tgaaggtcct atgagatgcc ttgaaaactg
      181 aaaagggcta tgcatatgga ggactgacag tgaccttctc ccatcttagg gttttaccta
      241 ggattggcct cctatactct ttctcccgga tgataccctc tgcctctctc tgaatctctc
      301 cgctccatct ctctcatgtc tttgcagggt gatgtccagg agctggccat tgtcccaggg
      361 gtccaggcag cctatgaatc atgtgaacag aaggagctgg aatgcgaggg gggccagagg
      421 gaaagacccc aaaaccaaca gcctcacaga gcccagagat ctccacagca gcaaccatca
      481 agacttcaca ggccacaaaa tcaggaaccc cagagccagg tgagggagct gggagaaccc
      541 ccaantgcag ctctatcacc ccacgagtat gggaatgaca cccatgtgac actctctcct
      601 ccttagttgt cctccccacc ctcatctccc tattcccatc acccccctcc tcctagtcct
      661 cattcatcag cctttttatt ttcatctaaa aataaaaaga gttgttatga agaatttgtg
      721 gctggagctg tgttccctgc tctgcgtctc ctcttccatc ttcctggaac tgtgtccctg
      781 ggttttccct tccttctttg aattaaggat ggtattggga nacagtggga gaggggtgag
      841 agctggggag gcaggtagga gaggggactg aagccaggag aaagcagtcg ggatggtgag
      901 accaaggagg aggaatggga aggagtaggg atggagggaa tatcaaagga gggngtgggt
      961 ctgcag
//
LOCUS       HSCOLLA4                7784 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human collagen alpha2(XI) (COL11A2) gene, exons 6 through 16, and
            partial cds.
ACCESSION   U41069
VERSION     U41069.1  GI:1724038
KEYWORDS    .
SEGMENT     4 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 7784)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 7784)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W., Nicholls,J. and Cheah,K.S.
  TITLE     Extensive alternative splicing within the amino-propeptide coding
            domain of alpha2(XI) procollagen mRNAs. Expression of transcripts
            encoding truncated pro-alpha chains
  JOURNAL   J. Biol. Chem. 271 (28), 16945-16951 (1996)
  MEDLINE   96279277
   PUBMED   8663204
REFERENCE   3  (bases 1 to 7784)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..7784
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
     mRNA            join(U41065.1:1231..1533,U41066.1:411..560,
                     U41066.1:692..902,U41066.1:1341..1503,U41067.1:328..519,
                     425..502,1144..1206,1864..2043,5003..5062,5168..5209,
                     5443..5505,5835..5909,6361..6447,6682..6738,6900..6953,
                     7058..>7111)
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="contains exon 7C; alternatively spliced exon 7"
     CDS             join(U41065.1:1452..1533,U41066.1:411..560,
                     U41066.1:692..902,U41066.1:1341..1503,U41067.1:328..519,
                     425..502,1144..1206,1864..2043,5003..5062,5168..5209,
                     5443..5505,5835..5909,6361..6447,6682..6738,6900..6953,
                     7058..>7111)
                     /gene="COL11A2"
                     /note="contains exon 7C; alternatively spliced exon 7"
                     /codon_start=1
                     /product="collagen alpha2(XI)"
                     /protein_id="AAC17464.1"
                     /db_xref="GI:1724041"
                     /translation="MERCSRCHRLLLLLPLVLGLSAAPGWAGAPPVDVLRALRFPSLP
                     DGVRRAKGICPADVAYRVARPAQLSAPTRQLFPGGFPKDFPLLTVVRTRPGLRAPLLT
                     LYSAQGVRQLGLELGRPVRFLYEDQTGRPQPPSQPVFRGLSLADGKWHRVAVAVKGQS
                     VTLIVDCKKRVTRPLPRSARPVLDTHGVIIFGARILDEEVFEGDVQELAIVPGVQAAY
                     ESCEQKELECEGGQRERPQNQQPHRAQRSPQQQPSRLHRPQNQEPQSQPTESLYYDYE
                     PPYYDVMTTGTTPDYQDPTPGEEEEILESSLLPPLEEEQTDLQVPPTADRFQAEEYGE
                     GGTDPPEGPYDYTYGYGDDYREETELGPALSAETAHSGAAAHGPRGLKGEKGEPAVLE
                     PGMLVEGPPGPEGPAGLIGPPGIQGNPGPVGDPGERGPPGRAGLPGSDGAPGPPGTSL
                     MLPFRFGSGGGDKGPVVAAQEAQAQAILQQARLALRGPPGPMGYTGRPGPLGQPGSPG
                     LKGESGDLGPQGPRGPQGLTGSLGKAGRR"
     exon            425..502
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=6
     exon            1144..1206
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /note="exon 7C; potential alternatively spliced exon 7"
     exon            1864..2043
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=8
     exon            5003..5062
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=9
     exon            5168..5209
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=10
     exon            5443..5505
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=11
     exon            5835..5909
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=12
     exon            6361..6447
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=13
     exon            6682..6738
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=14
     exon            6900..6953
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=15
     exon            7058..7111
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=16
BASE COUNT     1536 a   2078 c   2064 g   2085 t     21 others
ORIGIN      
        1 ctgcagaagt gtgggagtgg ggaaccagtc tctggccccc acctacctgg aggggcagga
       61 aaaggggagg tagtaggggg aagggaaggg aagggaaggg aagggagcct gggttggatg
      121 gggtctggag agtaagaata cagataggct ctggggtcag tggtctaaag atcagagggt
      181 caatctctta atcatgtctg cttctgcccc acatgtcgaa ccccttccct ctnnnnccac
      241 ctctccccac ctccctcctg accctcacct ctcctttcat cttcaactct ctgtttccca
      301 actccacacc tccccnattt cccaactccc nanaccatct ttatatctcc ctctcccgcc
      361 cttcacccca caccttccaa ttaccccatc cccttcctgt ctatacccac cccccaatcc
      421 ccagcccact gagtctctct actatgacta cgagcccccc tattatgatg tgatgactac
      481 ggggacaacc cctgattatc aggtaaacnn aagggacccc ttcatttctt ccctttctcc
      541 cagcaccccc agcaatcatc tcctaggaca ccttcctccc ctcccccatg ccgtggggag
      601 aggcacagtg tgtttagatt atggggattt acttagttcc tgccccctcc ctgtctctct
      661 gctgcagcgc cctgggagcc cccaccacag gcctccccct tcccctaggg gcatcggacc
      721 cctccctgaa agcctacccc acctgccaag aggtcaatga cctggcaggc gtccccctca
      781 ccaggccgga ggccctctcc aactgcaggc tttgacccag catggagcag cctccccttt
      841 cccgatgccc agctcggagg gcagcttccc ccactgcctg gccccgcccc aggaccagag
      901 gcagcagtta ccggcagtag agccgggatc cagatctctg cccccaaacc tctatgacct
      961 tggaggttca ctctccccaa actatccccc tccaaagggg ctgcaagatc tgagatttcc
     1021 cccatccccc agcacagaca gaccctgccc tcgggggaca ggtgctgctt tgcctccctt
     1081 ctgctaacct ttctccttcc ttcccacccc gtgctactct tctccttctc tccttggggt
     1141 caggacccca ccccaggtga agaggaagaa atcctggagt cgagcctctt gccacccctt
     1201 gaggaggtaa ctctatgccc caccctcacc catctggggg cactgggnac ctgtgactca
     1261 ctcccctcag cccatgacgt cttgatggac cagcccctcc tctggccgga gggttcttac
     1321 catgcatttc catcgatggc ctctgtcttt gggtcactct agaactatgg gcctgcttga
     1381 ctctctaagt ccagcccgtt cccttcagta tccaggcctc cttccatgat tctttgattc
     1441 ctgagtgact tcgtctccat gttggtgtct aatggtctta ctttctatgt ggtatttctg
     1501 ttcccgggag ccacttnctg tccagtgttg tcaccttttg tctnccctat ctctttggtc
     1561 attcctggac tcctttgtct gtcattgtgt ccatggactg tccactctgc atccagtcct
     1621 ctggccaccc ttcattttat ccacactctc cactgaccac ttctgacatc tctcccattt
     1681 tgcctttgat ggcccttcca tttatataca accactccca tggctattct tggagaaaaa
     1741 atccagttct gattaacctt ggaatccttt gggctagaag ttgtcacacc ctggggtcca
     1801 gggtaagcct cctctcagcc ctaacatccc catctacccc tctccctccc catccctggg
     1861 aaggagcaga cagatctcca ggtccccccc acagccgaca ggttccaggc agaggaatat
     1921 ggggagggtg gcacagaccc ccctgaaggg ccctacgatt acacctatgg ctatggggat
     1981 gattatcgtg aggagacaga gcttggccct gccctctctg cggagacagc ccactcagga
     2041 gccgtaagtg aaaggagctt ctctttcatt cttgactacc agtgggagct acaagaattg
     2101 cttcttgttg gtttcgatcc ttggctgttg gctgtgagct atttcccatc tgtgacttgt
     2161 catgatgtat atgttgcttg aaaagattac tacctgcctt cctttgccaa atgtcttacc
     2221 ttgtgactct tgtggatact tctgaagctc agggcaattg ggaattaaaa aaaaaaaata
     2281 tgtagccact tcctgtcttt cttgatcact tcctgtctaa tcgaagctgg atttcataaa
     2341 gcttttggtc aataaactac taccttgtgt ccattttcag tgtttctcta tcatttcttc
     2401 ttggtgaaat aattttttgt tcatgtttct accaaacatc taacttccta tgtcaccttt
     2461 ttttaaccta caggccactt attatctgtc ccttcgctgc cttctccagc tgtggcatcc
     2521 atcttggtaa ctgtgtcagg ttggactgca aaggatttat tatttctgtt ctggctctag
     2581 tttttcatgc tatcttgtga caatttcctt tgacaataag ccacttcctg tatgtctcag
     2641 gtcatttaaa tcataaagtt tccattgtgg gcctcgatat ttctcatggc tacttcctgt
     2701 gtgtcctagc cttggatgtt tgtgttttct ggaagaatta cccttcactg tcatgaattc
     2761 cctttgcagg agtcacattc tacctgtttt gtgcctttct tttctctctg ggtcacggta
     2821 gccatgttgt tgttgaaggt tcacttcctg gatcctcttt ccttcactgc ttcctgtctg
     2881 ttcagccact tcctgtttgt ccagtttgtt ttctagccat gttggccatg tttaattttc
     2941 ttcggccctg gcatcattct tgcttccaag tgtttgtcat tttgttctgt gtcccttagt
     3001 tgttctcata tcttccagta ttccctgtca gtgggactgc ttcctgtcta ctccgctccg
     3061 tcttctgcct gccttgcagt tctggctgcc tcctctaccc tgctgccgca tcccatacct
     3121 cccctctctt gacctccctt gagcttctac tgcctttcct ttatgctgct ctctgggtat
     3181 ttggggttga tatgggattt cagagaattt tgaaacggat cattggctgt ttggtcagaa
     3241 tgaaggtgga gaggggcatt ctgaagatct ttgttggggg gagttggggt ggtggtgaat
     3301 gtctcccatt ctccatgttc ttattttgtt tctggcccta gagttgggaa caggtatctg
     3361 agtggaggtg tgcaagggag ggaactggtg ggcttggaat gtacttgagg gccagaggaa
     3421 gggcaatgca ttcagtgggg actgctctcc aatttttttt ttttttgagg cagggtctca
     3481 ctctgtcacc caggttggag tgcagtggca ggatcttggc ttactgcaac ctctgcctcc
     3541 tggnnntcaa gcgatcctct gcctcagnct cccgagtagc tggggttaca ggcatgtgca
     3601 ctgcgcccgg ctattttttg tatttttagt agagacgggg tttcacatgt tggccaggct
     3661 ggtctcaaac tactgacctc aaataagctg cctgtttcgg ccccacaaag tgctgggatt
     3721 acaggcgtgc gccaccgtgc ctggcctcca aatttactta gagtatggat gcttaacctt
     3781 tggggcatat ttaaacttgt gaaccctttg aaatcctaaa taaaattctg tgcataatag
     3841 catatacttt ttcatttctt ggagaaagac tcacagcttt catcagattc ttttttttct
     3901 ttgagatgga gttttgctct tgttgcctag gctggagtac gatggcataa tcttggttca
     3961 cttcaacctc cacctcctgg gttcaagcta ttctcctgct tcagcctccc aagtagctgg
     4021 gattacaggc atgtgccacc acacccagct aattttgtat ttttattaga gacggggttg
     4081 caccatgttg gtcaggctgg tctcaaactc ctgacctcag gtgatctgcc tgcctccgcc
     4141 tcccaaagtg ctgggattat gggcgtaacg acctcgcccg gcctagcttt tatcaggttc
     4201 tcatgtgggt ccatgccctc ctcttacccc aaataaaaat ctacccacaa tcttagagag
     4261 attcttttat gatgtattgg ggtaaatgtg tggaaggttt cttaaggttc tagggaggga
     4321 gtttgggaga tgggataggg tcctctagaa gagggtttgg gaaggtgggt ggtttaaaat
     4381 ccctgaaagg ggccaggtgt ggtgctcacg cctgtaattc cagcacttta ggaggccaag
     4441 gcaggcggat cacctgaggt caggactttg agaccagcct ggccaacacg gtgaaacccc
     4501 atctctaata aaaatacaaa aattagccgg gcatggtggt gcacatctgt aatcccagtg
     4561 taatcccagc tacttgggag gctgaggcag gagaatcact tgaacctgga aggcagaggt
     4621 tgtagtgagc cgagatcgcg ccactatact tcagcctggg tgacagagca agacttcgtc
     4681 tcaaaaaaaa agaaaaacaa acaaacaaac aaaaacactg gaagtttctg ggaccttcct
     4741 gaaagaggct ctcagggaga tggagaggtt gtgtttgctg aggtggggct gggaactggg
     4801 ggggcaggga accctgaggt ctccttggca ggcttgaaag gtttgtgaag agggagtttt
     4861 aggngagggc tgtggggctt tcgggtaggg ctgaagtggt ctcgagagga tctaagaatc
     4921 aaggttggag ttgagatgga gaaaggccta gtgtcctggc tgctcacggg ccccactggg
     4981 gctacatgtg tgtctccttc aggctgccca tggaccccga gggctgaagg gagagaaagg
     5041 agagcctgca gtgttggaac ctgtaagtta tgctggtcac agggctgagg cagtggaaat
     5101 aggagaagca agtggggttg agtgtgctgg tcctgtcgcc tctgattttt aacctttgac
     5161 tccacagggt atgctcgtgg aggggccccc tggcccagaa gggcctgcgg taagtctagc
     5221 agtgacctgg tggccattct tttcttagaa accccttcta tgtgctcatc tgagccttcc
     5281 ccacatatgc ccaggcctcc tgctcagaac tcgggagcat ccctccaatc aacgcttccc
     5341 agattcccag atctgttctg cacagaccac tcctcagcca caggctggat gtccacacct
     5401 gtctgaatgc ccacacctga cccctactct tttgttcctc agggattgat tggtccccct
     5461 ggcatccagg ggaacccagg cccagttgga gaccctggag agagggtaag ggggtgtcct
     5521 ccatgtgacg ggggagcttc gggggagcta gtggtatcca agcggggggc taccatgccc
     5581 agtaatttcc tgttaggtgg cttgttaatg aatgagggga tggttgtgct tcccagcaac
     5641 cagaaaggag ggcagactct ggttggggag ggtgctgatc tggtttctca ccctcaattt
     5701 tccccatggg gtgggagact caggaaagag ctagagcaat ctacacttat tctgttgaag
     5761 aggatatttc aaaatcatgc taagatcttg atgtccctta tctnacccca tctttgcatc
     5821 tcttcccact ccagggcccc cctggccgag cagggctccc tggatcagat ggggctcctg
     5881 gtcctcctgg cacatctctc atgctcccag tgagttgtct cttgggtttt ggaacatgct
     5941 gatggggaag acaaggaatt gtgtcatgtt accaagaacc agatgggcag gaaagatatg
     6001 gaggagtctc taaagatcat caaagatcat cactccaaag ggattcgtct ttggggntgg
     6061 ggagatgggg cctcattagg tgacctagaa caaaacacca gtcttctagg actgggcaga
     6121 acagcctata tttcaggagc tctagaacca aaagagagtc tttctggagg cagaggctgg
     6181 acagtggagg agggcaggag ttgagtttct ggaatcctct aagtgattat ctggaagcca
     6241 tgggaaatgc tggatgggca gaacctgggg caaggggcaa aacgctggag aaaccgtctg
     6301 gctgggaggg tcccatggct ttcttggaag agaacctgga ccttctctcc ttgccctcag
     6361 ttccggtttg gcagtggtgg gggtgacaag ggccctgtgg tggcggccca ggaggctcag
     6421 gcccaggcga tcctgcagca ggcgagggtg agtggggctg cttccctgga aaggaaggct
     6481 ctggggggca ctggaagggt tggctggagt gagtgtgccg caggggaggc ctgggttggt
     6541 ttggggttaa cagggaggtt ggggagaatc ctgggcagtg gatgaggatc aggagatgga
     6601 agggcttgaa ttttggagag tgctggggct gggggttccc actgcctgtc cctcagctct
     6661 gctgtccccc tgctccccta gctggcgctc cgtggacccc ctggccccat gggatacaca
     6721 gggcgccctg gacccttggt gcagtgagca gggtgctcgg gtggaggatg ctttaattgt
     6781 gtgtggggtg tggatagttc tggaaagggg cttcctggaa aggggcttct tttggggaac
     6841 actccgaggc cactgttggc tgtgtccttc ttacggccat cctctcttct ctctcccagg
     6901 gccaacctgg gagccctggc ctgaaaggag agtctggaga cttaggacct caggtgacna
     6961 ctttcctcca ctgnaccccc atatctgttc accagctcag tctccctcac tcccctccct
     7021 atgcctccta accccacccc atctctcctc tcaccagggc cccagaggac ctcagggcct
     7081 cacaggctcc ctgggcaagg ctgggcgaag ggtgagtgcc cctggggtgg atgggtgttg
     7141 tggggagaca gggcctgcag ggtaggggac tgcggcctgc ttgttctgac acttcccttg
     7201 ttctcccagg gccgggcagg tcctgatgga gcccgaggga ccctgggaga tcctggagtg
     7261 aaggtaacag gcttgggccc ctccctgaag cctgtagcct tcagcccacg ctgggctcag
     7321 ttgtttcttg gggatgacct agttctccag gcttccccag gatcagcacc cccccagctc
     7381 agggtgcagt agggaggggt gggcttggac agggtcgtgt cctcactctg gctatccttc
     7441 ctcctcctag ggtgaccgag gttttgatgg actcccaggg ctccctggag agaagggcca
     7501 tagggtgtga tacattagtg ggtgtgtgta attggggatc ttctatggag tgaggatagg
     7561 taggcaagga ggctggaggc ttggcaggag ctcagtgaaa gtaatggatg ggctaagtgc
     7621 agaggttcgg tgtctgcctg tgttgggctc tgaagcccct cttatgagtt cccccatttt
     7681 gcagggtgat actggtgccc agggccttcc tgtccccctg gtgaggatgg agagagggta
     7741 agtgtagtgg caaatgtagg ggctgggtgc aggggaggtc taga
//
LOCUS       HSCOLLA5                2199 bp    DNA     linear   PRI 27-MAY-1998
DEFINITION  Human collagen alpha2(XI) (COL11A2) gene, exons 61 and 62, and
            partial cds.
ACCESSION   U41068
VERSION     U41068.1  GI:1724039
KEYWORDS    .
SEGMENT     5 of 5
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2199)
  AUTHORS   Lui,V.C., Ng,L.J., Sat,E.W. and Cheah,K.S.
  TITLE     The human alpha 2(XI) collagen gene (COL11A2): completion of coding
            information, identification of the promoter sequence, and precise
            localization within the major histocompatibility complex reveal
            overlap with the KE5 gene
  JOURNAL   Genomics 32 (3), 401-412 (1996)
  MEDLINE   96435918
   PUBMED   8838804
REFERENCE   2  (bases 1 to 2199)
  AUTHORS   Cheah,K.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1995) Kathryn S. Cheah, Biochemistry, The
            University of Hong Kong, Sassoon Rd., Hong Kong, Hong Kong
FEATURES             Location/Qualifiers
     source          1..2199
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6p21.3"
     gene            join(U41065.1:157..2255,U41066.1:1..1667,U41067.1:1..966,
                     U41069.1:1..7784,1..2199)
                     /gene="COL11A2"
     CDS             join(<143..349,799..939)
                     /gene="COL11A2"
                     /codon_start=1
                     /product="collagen alpha2(XI)"
                     /protein_id="AAC17465.1"
                     /db_xref="GI:1724042"
                     /translation="FSYVDSEGSPVGVVQLTFLRLLSVSAHQDVSYPCSGAARDGPLR
                     LRGANEDELSPETSPYVKEFRDGCQTQQGRTVLEVRTPVLEQLPVLDASFSDLGAPPR
                     RGGVLLGPVCFMG"
     exon            143..349
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=61
     exon            799..939
                     /gene="COL11A2"
                     /product="collagen alpha2(XI)"
                     /number=62
     3'UTR           940..2199
                     /gene="COL11A2"
BASE COUNT      460 a    606 c    642 g    489 t      2 others
ORIGIN      
        1 caagtctgct cagcatcctg ctcatgactc tgctcagctg tcctgatcat gtagatgact
       61 cctgaggcca tggtctctgt ctcctctgag gccatagacc tctctctccc cctctgaggc
      121 cagccctctc tctcctttcc agttctctta cgtggactca gagggctccc cagtgggtgt
      181 ggtccagctn accttcctgc ggctgctcag cgtctcagcc caccaggacg tctcctaccc
      241 ctgctctgga gcagcccgtg acggtcccct gagactccgt ggggccaatg aggatgagct
      301 gagcccggag actagcccct atgtcaaaga attcagagat ggctgccagg tgggaacagg
      361 aagagctggg ttgggggctg atctcagact caggctggag gaaggaggtg ggaagacccc
      421 ttgggcaggg cacccagggg gagcagggag gagtccgtcc tgctggattg tcagggatgc
      481 ctaggggagc tcaaaagccg gtgaggaagg tggaaaggat ggaagccatc gggtagggtt
      541 cacttgggta caaacaccac gtgacacacg gttaatgcag acgtgtacac agactaacac
      601 atggctgccc agcagaggca cacggtgggg gagaagcaaa cacacacgca tgggcacata
      661 gacacatgcc agtctgtaca tcctgagtca ccctgatggg gccaaggtgc tcagaggaga
      721 ggtgagcctg ggcctgaggt tagagggtgg ggggtgcctc catcctgctc acactttctt
      781 ccttgtctcc ccctgaagac acagcaaggc cggacggtgc tggaggtgcg aacgcctgtg
      841 ctggagcagc tgccagtgct ggatgcctcc ttctcagacc tgggagcccc accgcggcgg
      901 ggaggggtgc tgctggggcc tgtctgcttc atgggctagg accgtctctg tctgatcctg
      961 tccattcgga accaggccca cctggaatcc cacaacatca gctctgtgcc acctcccaag
     1021 agggctcctc actatctagg gagccctggg ccagggcgtg gagagccctc agtcggggca
     1081 ggccagggga ggggtgaagt ggttgcctgg acaccccacg ggaggagtgg catctggggc
     1141 tcttggccct cccacctgga gcctgttacc cgttagagag ctgagaccct tatttaaaac
     1201 tcacctccca atcaccccaa acaaatggaa gagaagagaa aggacatggc gtattttgta
     1261 tttaaaagta attgtattaa ttatttaaag tgtggaaagc aaaataacaa aaaagagana
     1321 cgccaacaaa aaatcagcag atgttgaaga caggggtctc gggggtgggc tccggcaccc
     1381 acatctgagt caggactttc ctcagtgact gtgtgtaggg gggttcaggg ctgaacccac
     1441 ctccctccca ccttcctccc acctcacctg tcgcacccac tgtgaaagtt ggaatatgtg
     1501 gtctccctgg cctcagggct ctgactctgc cagggtgggg ctctctaacc cacaggtgtt
     1561 ggctgcctgg cccatgtgcc cactgtctct tccacttggt ctgggtttgg caggcactgc
     1621 tgctacttga gggccaggat gctcccccag ggaagaaacg gaatagtgtg gggtgtgtgc
     1681 agggctgcat ccgcagatgg ctggaatatt aaaattcttc tatattggct ggtaaattgc
     1741 catggccctg agccactgag tatgttcatt gccacccctg tcctcccctg ggcacccctc
     1801 actttccctg atcctgcaat taaagggtta atgtgtggca tatggaaggg actcccagga
     1861 ccctgtgccc agcttccatg ctgactgatg gttaaataat gtgattgtct cctcccaggt
     1921 gtctgtgtca ctgcttgtgt tgttatttca gtctcccccg acacccatct gatgcttcct
     1981 cttcccagct aagtggttac cagaattgta tggcttaatc cagataccct gcaagccctg
     2041 tccagctggg gtgtcagggc ccagagttta caaagcctgg gtgactagca taaaattaca
     2101 aacaatgccc cagggacacc aggtaggggt ttggggtcct atggctccag tttctgacac
     2161 gggtgtgcga ttgcttctgg gtgtgggtgg gacgtgcta
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L38969. Homo sapiens thro...[gi:886298] Links  


LOCUS       HUMTHBS3A               3127 bp    mRNA    linear   PRI 22-FEB-2001
DEFINITION  Homo sapiens thrombospondin 3 (THBS3) mRNA, complete cds.
ACCESSION   L38969
VERSION     L38969.1  GI:886298
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3127)
  AUTHORS   Adolph,K.W., Long,G.L., Winfield,S., Ginns,E.I. and Bornstein,P.
  TITLE     Structure and organization of the human thrombospondin 3 gene
            (THBS3)
  JOURNAL   Genomics 27 (2), 329-336 (1995)
  MEDLINE   96044440
   PUBMED   7558000
FEATURES             Location/Qualifiers
     source          1..3127
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1q21"
                     /tissue_type="lung"
                     /dev_stage="fetal"
     gene            <1..>3127
                     /gene="THBS3"
     exon            <1..100
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=1
                     /evidence=experimental
     5'UTR           <1..21
                     /gene="THBS3"
                     /note="G00-264-170"
                     /evidence=experimental
     CDS             22..2892
                     /gene="THBS3"
                     /codon_start=1
                     /evidence=experimental
                     /product="thrombospondin 3"
                     /protein_id="AAC41762.1"
                     /db_xref="GI:886299"
                     /db_xref="GDB:G00-264-170"
                     /translation="METQELRGALALLLLCFFTSASQDLQVIDLLTVGESRQMVAVAE
                     KIRTALLTAGDIYLLSTFRLPPKQGGVLFGLYSRQDNTRWLEASVVGKINKVLVRYQR
                     EDGKVHAVNLQQAGLADGRTHTVLLRLRGPSRPSPALHLYVDCKLGDQHAGLPALAPI
                     PPAEVDGLEIRTGQKAYLRMQGFVESMKIILGGSMARVGALSECPFQGDESIHSAVTN
                     ALHSILGEQTKALVTQLTLFNQILVELRDDIRDQVKEMSLIRNTIMECQVCGFHEQRS
                     HCSPNPCFRGVDCMEVYEYPGYRCGPCPPGLQGNGTHCSDINECAHADPCFPGSSCIN
                     TMPGFHCEACPRGYKGTQVSGVGIDYARASKQVCNDIDECNDGNNGGCDPNSICTNTV
                     GSFKCGPCRLGFLGNQSQGCLPARTCHSPAHSPCHIHAHCLFERNGAVSCQCNVGWAG
                     NGNVCGTDTDIDGYPDQALPCMDNNKHCKQDNCLLTPNSGQEDADNDGVGDQCDDDAD
                     GDGIKNVEDNCRLFPNKDQQNSDTDSFGDACDNCPNVPNNDQKDTDGNGEGDACDNDV
                     DGDGIPNGLDNCPKVPNPLQTDRDEDGVGDACDSCPEMSNPTQTDADSDLVGDVCDTN
                     EDSDGDGHQDTKDNCPQLPNSSQLDSDNDGLGDECDGDDDNDGIPDYVPPGPDNCRLV
                     PNPNQKDSDGNGVGDVCEDDFDNDAVVDPLDVCPESAEVTLTDFRAYQTVVLDPEGDA
                     QIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFLFSY
                     QDSGRFYVVMWKQTEQTYWQATPFRAVAQPGLQLKAVTSVSGPGEHLRNALWHTGHTP
                     DQVRLLWTDPRNVGWRDKTSYRWQLLHRPQVGYIRVKLYEGPQLVADSGVIIDTSMRG
                     GRLGVFCFSQENIIWSNLQYRCNDTVPEDFEPFRRQLLQGRV"
     exon            101..307
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=2
                     /evidence=experimental
     exon            308..564
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=3
                     /evidence=experimental
     exon            565..667
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=4
                     /evidence=experimental
     exon            668..694
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=5
                     /evidence=experimental
     exon            695..787
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=6
                     /evidence=experimental
     exon            788..829
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=7
                     /evidence=experimental
     exon            830..978
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=8
                     /evidence=experimental
     exon            979..1119
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=9
                     /evidence=experimental
     exon            1120..1197
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=10
                     /evidence=experimental
     exon            1198..1350
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=11
                     /evidence=experimental
     exon            1351..1461
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=12
                     /evidence=experimental
     exon            1462..1569
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=13
                     /evidence=experimental
     exon            1570..1729
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=14
                     /evidence=experimental
     exon            1730..1848
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=15
                     /evidence=experimental
     exon            1849..1901
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=16
                     /evidence=experimental
     exon            1902..2095
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=17
                     /evidence=experimental
     exon            2096..2274
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=18
                     /evidence=experimental
     exon            2275..2323
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=19
                     /evidence=experimental
     exon            2324..2520
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=20
                     /evidence=experimental
     exon            2521..2693
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=21
                     /evidence=experimental
     exon            2694..2833
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=22
                     /evidence=experimental
     exon            2834..>3127
                     /gene="THBS3"
                     /note="G00-264-170"
                     /number=23
                     /evidence=experimental
     3'UTR           2893..>3127
                     /gene="THBS3"
                     /note="G00-264-170"
                     /evidence=experimental
BASE COUNT      725 a    857 c    893 g    652 t
ORIGIN      
        1 gtagtgagcc ggctgagagg catggagacg caggaacttc ggggggccct ggctcttctc
       61 ctcctttgct ttttcacatc tgccagtcag gatctgcagg taattgacct gctgactgtg
      121 ggcgagtctc ggcagatggt agctgtggca gagaagatcc ggacagcctt gctcactgct
      181 ggggacatct acctcttatc caccttccgc ctgcccccca agcagggtgg tgtcctcttt
      241 ggcctctatt ctcgccaaga caacactcga tggctggagg cctctgttgt aggcaagatc
      301 aacaaagtac tggtgcgata ccagcgggag gatggcaaag tccacgccgt gaacctacag
      361 caagcgggcc tggctgatgg gcgcacacac acagttctcc tgcgactccg aggtccctcc
      421 agacccagcc ctgccctaca tctctacgtg gactgcaaac tgggtgacca acatgcaggc
      481 cttccagcac tggcccccat tcctccagcg gaggtcgatg ggctggagat taggactgga
      541 cagaaggcgt atttgaggat gcagggcttt gtggaatcta tgaaaattat tctgggtggg
      601 tccatggccc gggtaggagc cctgagtgag tgtccattcc aaggggacga gtccatccac
      661 agtgcagtga ccaatgcact gcactccatt ctaggggagc agaccaaggc gctggtcacc
      721 caactcaccc tcttcaacca gatcctggtg gagctgcggg atgatatacg agaccaggta
      781 aaggaaatgt ccctgatccg aaacaccatt atggagtgtc aggtgtgcgg cttccatgag
      841 cagcgttccc actgcagccc caatccctgc ttccgaggtg tggactgcat ggaagtgtac
      901 gagtacccag gctaccgctg tgggccctgc ccccctggcc tgcagggcaa cggcacccac
      961 tgcagtgaca tcaatgagtg tgctcacgct gacccctgtt tcccgggctc cagctgcatc
     1021 aacaccatgc ccggcttcca ctgtgaggcc tgtcctcgag ggtacaaggg cacacaggtg
     1081 tctggtgtgg gcattgacta tgcccgggcc agcaaacagg tctgcaatga catcgatgaa
     1141 tgcaacgatg gcaacaatgg tggctgtgac ccaaactcca tctgcaccaa cactgtgggc
     1201 tctttcaagt gtggtccctg ccgcctgggt ttcctgggca accagagcca gggctgcctc
     1261 ccagcccgga cctgccacag cccagcccac agcccctgcc acatccatgc tcactgtctc
     1321 tttgaacgca atggtgcagt gtcctgccag tgtaacgtgg gctgggctgg gaatgggaac
     1381 gtgtgtggga ctgacacaga catcgatggc tacccagacc aagcactgcc ctgcatggac
     1441 aacaacaaac actgcaaaca ggacaactgc cttttgacac ccaactctgg gcaggaagat
     1501 gctgataatg atggtgtggg ggaccagtgt gatgatgatg ctgatgggga tgggatcaag
     1561 aatgttgagg acaactgccg gctgttcccc aacaaagacc agcagaactc agatacagat
     1621 tcatttggtg atgcctgtga caattgcccc aacgttccca acaatgacca gaaggacaca
     1681 gatggcaatg gggaaggaga tgcctgtgac aacgacgtgg atggggatgg catccccaat
     1741 ggattggaca attgccctaa agtccccaac ccactacaga cagacaggga tgaggacggg
     1801 gtgggagatg cttgcgacag ctgccctgaa atgagcaatc ctacccagac agatgcagac
     1861 agcgacctgg tgggggatgt ctgtgatact aatgaagaca gcgatgggga tgggcatcag
     1921 gacaccaagg acaactgccc acagctgcca aatagctccc agctggactc tgataacgat
     1981 ggacttggag atgagtgtga tggggatgat gacaatgatg gcatcccaga ttatgtgcct
     2041 cctggtcccg ataactgccg cctggtaccc aatcccaatc agaaggactc agatggcaat
     2101 ggcgttggtg atgtgtgtga ggatgacttt gacaatgatg ctgtggtcga ccccctggat
     2161 gtgtgtcctg aaagtgcaga ggtaacgctt acggattttc gggcctatca gaccgtcgtc
     2221 ctggatcctg agggtgatgc tcagattgac ccaaactggg ttgtgctcaa ccagggcatg
     2281 gaaatcgttc agaccatgaa cagtgaccct ggcttggcag ttggatacac ggccttcaat
     2341 ggtgtggact ttgaaggcac cttccatgtg aacacagtga ctgatgatga ctacgcaggc
     2401 tttctcttca gttatcaaga cagtggccgc ttctacgtag tcatgtggaa gcagaccgag
     2461 cagacctact ggcaggctac acccttccgg gcggttgccc agcccgggct gcagctcaag
     2521 gcagtgacat cagtgtctgg cccaggtgag cacctccgaa atgccctgtg gcatactggc
     2581 cacacccctg atcaggtacg actcctgtgg acagacccac gaaatgtggg ctggcgggac
     2641 aagacctcct atcgctggca gcttctgcac cggcctcaag ttggctacat tcgggtgaag
     2701 ctctatgagg gaccccagct tgtggcggat tctggggtga tcattgacac atccatgcga
     2761 ggggggcgtc ttggtgtatt ctgcttctcc caagaaaaca taatttggtc caatctccag
     2821 tatcgatgca atgacacagt gcctgaggac tttgagccat tccggaggca gctgctccag
     2881 ggaagggtgt gaggaggagg ccaccagatt cagaattcag aattttagac cctttggcct
     2941 tggggtccat cctggagacc ctgaggtcta agctacagcc cctcagccaa ccacagaccc
     3001 ttctctggct cccaaaagga gttcagtccc agaggggtgg tcaccccacc cttcagggga
     3061 tgagaagttt tcaaggggta ttactcaggc actaacccca ggttagatga cagcacattg
     3121 ccataaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF177396. Homo sapiens dick...[gi:6049607] Links  


LOCUS       AF177396                2479 bp    mRNA    linear   PRI 16-OCT-1999
DEFINITION  Homo sapiens dickkopf-3 (DKK-3) mRNA, complete cds.
ACCESSION   AF177396
VERSION     AF177396.1  GI:6049607
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2479)
  AUTHORS   Krupnik,V.E., Sharp,J.D., Jiang,C., Robison,K., Chickering,T.W.,
            Amaravadi,L., Brown,D.E., Guyot,D., Mays,G., Leiby,K., Chang,B.,
            Duong,T., Goodearl,A.D., Gearing,D.P., Sokol,S.Y. and McCarthy,S.A.
  TITLE     Functional and structural diversity of the human Dickkopf gene
            family
  JOURNAL   Gene 238 (2), 301-313 (1999)
  MEDLINE   20035735
   PUBMED   10570958
REFERENCE   2  (bases 1 to 2479)
  AUTHORS   Krupnik,V.E., Sharp,J.D., Jiang,C., Robison,K., Chickering,T.W.,
            Amaravadi,L., Brown,D.E., Guyot,D., Mays,G., Leiby,K., Chang,B.,
            Duong,T., Goodearl,A.D.J., Gearing,D.P., Sokol,S.Y. and
            McCarthy,S.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-AUG-1999) Cell Biology, Millennium BioTherapeutics
            Inc., 640 Memorial Drive, Cambridge, MA 02139, USA
FEATURES             Location/Qualifiers
     source          1..2479
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..2479
                     /gene="DKK-3"
     CDS             38..1090
                     /gene="DKK-3"
                     /note="secreted protein"
                     /codon_start=1
                     /product="dickkopf-3"
                     /protein_id="AAF02676.1"
                     /db_xref="GI:6049608"
                     /translation="MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEE
                     ATLNEMFREVEELMEDTQHKLRSAVEEMEAEEAAAKASSEVNLANLPPSYHNETNTDT
                     KVGNNTIHVHREIHKITNNQTGQMVFSETVITSVGDEEGRRSHECIIDEDCGPSMYCQ
                     FASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHCTKMATRGSNGTICDNQRDCQPGL
                     CCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDGALDRCPCASGLLCQPH
                     SHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLERSLTEEMAL
                     GEPAAAAAALLGGEEI"
BASE COUNT      625 a    618 c    668 g    567 t      1 others
ORIGIN      
        1 ggcacgaggg ggcggcggct gcgggcgcag agcggagatg cagcggcttg gggccaccct
       61 gctgtgcctg ctgctggcgg cggcggtccc cacggccccc gcgcccgctc cgacggcgac
      121 ctcggctcca gtcaagcccg gcccggctct cagctacccg caggaggagg ccaccctcaa
      181 tgagatgttc cgcgaggttg aggaactgat ggaggacacg cagcacaaat tgcgcagcgc
      241 ggtggaagag atggaggcag aagaagctgc tgctaaagca tcatcagaag tgaacctggc
      301 aaacttacct cccagctatc acaatgagac caacacagac acgaaggttg gaaataatac
      361 catccatgtg caccgagaaa ttcacaagat aaccaacaac cagactggac aaatggtctt
      421 ttcagagaca gttatcacat ctgtgggaga cgaagaaggc agaaggagcc acgagtgcat
      481 catcgacgag gactgtgggc ccagcatgta ctgccagttt gccagcttcc agtacacctg
      541 ccagccatgc cggggccaga ggatgctctg cacccgggac agtgagtgct gtggagacca
      601 gctgtgtgtc tggggtcact gcaccaaaat ggccaccagg ggcagcaatg ggaccatctg
      661 tgacaaccag agggactgcc agccggggct gtgctgtgcc ttccagagag gcctgctgtt
      721 ccctgtgtgc acacccctgc ccgtggaggg cgagctttgc catgaccccg ccagccggct
      781 tctggacctc atcacctggg agctagagcc tgatggagcc ttggaccgat gcccttgtgc
      841 cagtggcctc ctctgccagc cccacagcca cagcctggtg tatgtgtgca agccgacctt
      901 cgtggggagc cgtgaccaag atggggagat cctgctgccc agagaggtcc ccgatgagta
      961 tgaagttggc agcttcatgg aggaggtgcg ccaggagctg gaggacctgg agaggagcct
     1021 gactgaagag atggcgctgg gggagcctgc ggctgccgcc gctgcactgc tgggagggga
     1081 agagatttag atctggacca ggctgtgggt agatgtgcaa tagaaatagc taatttattt
     1141 ccccangtgt gtgctttaag cgtgggctga ccaggcttct tcctacatct tcttcccagt
     1201 aagtttcccc tctggcttga cagcatgagg tgttgtgcat ttgttcagct cccccaggct
     1261 gttctccagg cttcacagtc tggtgcttgg gagagtcagg cagggttaaa ctgcaggagc
     1321 agtttgccac ccctgtccag attattggct gctttgcctc taccagttgg cagacagccg
     1381 tttgttctac atggctttga taattgtttg aggggaggag atggaaacaa tgtggagtct
     1441 ccctctgatt ggttttgggg aaatgtggag aagagtgccc tgctttgcaa acatcaacct
     1501 ggcaaaaatg caacaaatga attttccacg cagttctttc catgggcata ggtaagctgt
     1561 gccttcagct gttgcagatg aaatgttctg ttcaccctgc attacatgtg tttattcatc
     1621 cagcagtgtt gctcagctcc tacctctgtg ccagggcagc attttcatat ccaagatcaa
     1681 ttccctctct cagcacagcc tggggagggg gtcattgttc tcctcgtcca tcagggattt
     1741 cagaggctca gagactgcaa gctgcttgcc caagtcacac agctagtgaa gaccagagca
     1801 gtttcatctg gttgtgactc taagctcagt gctctctcca ctaccccaca ccagccttgg
     1861 tgccaccaaa agtgctcccc aaaaggaagg agaatgggat ttttcttttg aggcatgcac
     1921 atctggaatt aaggtcaaac taattctcac atccctctaa aagtaaacta ctgttaggaa
     1981 cagcagtgtt ctcacagtgt ggggcagccg tccttctaat gaagacaatg atattgacac
     2041 tgtccctctt tggcagttgc attagtaact ttgaaaggta tatgactgag cgtagcatac
     2101 aggttaacct gcagaaacag tacttaggta attgtagggc gaggattata aatgaaattt
     2161 gcaaaatcac ttagcagcaa ctgaagacaa ttatcaacca cgtggagaaa atcaaaccga
     2221 gcagggctgt gtgaaacatg gttgtaatat gcgactgcga acactgaact ctacgccact
     2281 ccacaaatga tgttttcagg tgtcatggac tgttgccacc atgtattcat ccagagttct
     2341 taaagtttaa agttgcacat gattgtataa gcatgctttc tttgagtttt aaattatgta
     2401 taaacataag ttgcatttag aaatcaagca taaatcactt caactgctaa aaaaaaaaaa
     2461 aaaaaaaaaa aaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AB003184. Homo sapiens mRNA...[gi:2554603] Links  


LOCUS       AB003184                2110 bp    mRNA    linear   PRI 05-FEB-1999
DEFINITION  Homo sapiens mRNA for ISLR, complete cds.
ACCESSION   AB003184
VERSION     AB003184.1  GI:2554603
KEYWORDS    ISLR.
SOURCE      Homo sapiens (isolate:Caucasian) retina cDNA to mRNA,
            clone_lib:human retina 5'-STRECH cDNA library (CLONTECH).
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Nagasawa,A., Kubota,R., Imamura,Y., Nagamine,K., Wang,Y.,
            Asakawa,S., Kudoh,J., Minoshima,S., Mashima,Y., Oguchi,Y. and
            Shimizu,N.
  TITLE     Cloning of the cDNA for a new member of the immunoglobulin
            superfamily (ISLR) containing leucine-rich repeat (LRR)
  JOURNAL   Genomics 44 (3), 273-279 (1997)
  MEDLINE   97468140
REFERENCE   2  (bases 1 to 2110)
  AUTHORS   Shimizu,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-APR-1997) Nobuyoshi Shimizu, Keio University School
            of Medicine, Department of Molecular Biology; 35 Shinanomachi,
            Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp,
            Tel:03-3351-2370, Fax:03-3351-2370)
FEATURES             Location/Qualifiers
     source          1..2110
                     /organism="Homo sapiens"
                     /isolate="Caucasian"
                     /db_xref="taxon:9606"
                     /chromosome="15"
                     /map="15q23-24"
                     /tissue_type="retina"
                     /clone_lib="human retina 5'-STRECH cDNA library
                     (CLONTECH)"
     CDS             99..1385
                     /codon_start=1
                     /product="ISLR"
                     /protein_id="BAA22848.1"
                     /db_xref="GI:2554604"
                     /translation="MQELHLLWWALLLGLAQACPEPCDCGEKYGFQIADCAYRDLESV
                     PPGFPANVTTLSLSANRLPGLPEGAFREVPLLQSLWLAHNEIRTVAAGALASLSHLKS
                     LDLSHNLISDFAWSDLHNLSALQLLKMDSNELTFIPRDAFRSLRALRSLQLNHNRLHT
                     LAEGTFTPLTALSHLQINENPFDCTCGIVWLKTWALTTAVSIPEQDNIACTSPHVLKG
                     TPLSRLPPLPCSAPSVQLSYQPSQDGAELRPGFVLALHCDVDGQPAPQLHWHIQIPSG
                     IVEITSPNVGTDGRALPGTPVASSQPRFQAFANGSLLIPDFGKLEEGTYSCLATNELG
                     SAESSVDVALATPGEGGEDTLGRRFHGKAVEGKGCYTVDNEVQPSGPEDNVVIIYLSR
                     AGNPEAAVAEGVPGQLPPGLLLLGQSLLLFFFLTSF"
     sig_peptide     99..152
     misc_feature    153..260
                     /note="amino-flanking region"
     repeat_region   261..638
                     /note="leucine-rich repeat"
     misc_feature    639..788
                     /note="carboxy-flanking region"
     misc_feature    789..1148
                     /note="immunoglobulin like domain"
     misc_feature    1335..1382
                     /note="transmembrane domain"
     polyA_signal    2089..2094
     polyA_site      2110
                     /note="13 A nucleotides"
BASE COUNT      379 a    693 c    592 g    446 t
ORIGIN      
        1 caggccgagg cagggagaac tctccactcg gaggaggagc tggggtcctc ttccatcccg
       61 tcttcatcct gcctggctgc gtgacctcgg gaggcaccat gcaggagctg catctgctct
      121 ggtgggcgct tctcctgggc ctggctcagg cctgccctga gccctgcgac tgtggggaaa
      181 agtatggctt ccagatcgcc gactgtgcct accgcgacct agaatccgtg ccgcctggct
      241 tcccggccaa tgtgactaca ctgagcctgt cagccaaccg gctgccaggc ttgccggagg
      301 gtgccttcag ggaggtgccc ctgctgcagt cgctgtggct ggcacacaat gagatccgca
      361 cggtggccgc cggagccctg gcctctctga gccatctcaa gagcctggac ctcagccaca
      421 atctcatctc tgactttgcc tggagcgacc tgcacaacct cagtgccctc caattgctca
      481 agatggacag caacgagctg accttcatcc cccgcgacgc cttccgcagc ctccgtgctc
      541 tgcgctcgct gcaactcaac cacaaccgct tgcacacatt ggccgagggc accttcaccc
      601 cgctcaccgc gctgtcccac ctgcagatca acgagaaccc cttcgactgc acctgcggca
      661 tcgtgtggct caagacatgg gccctgacca cggccgtgtc catcccggag caggacaaca
      721 tcgcctgcac ctcaccccat gtgctcaagg gtacgccgct gagccgcctg ccgccactgc
      781 catgctcggc gccctcagtg cagctcagct accaacccag ccaggatggt gccgagctgc
      841 ggcctggttt tgtgctggca ctgcactgtg atgtggacgg gcagccggcc cctcagcttc
      901 actggcacat ccagataccc agtggcattg tggagatcac cagccccaac gtgggcactg
      961 atgggcgtgc cctgcctggc acccctgtgg ccagctccca gccgcgcttc caggcctttg
     1021 ccaatggcag cctgcttatc cccgactttg gcaagctgga ggaaggcacc tacagctgcc
     1081 tggccaccaa tgagctgggc agtgctgaga gctcagtgga cgtggcactg gccacgcccg
     1141 gtgagggtgg tgaggacaca ctggggcgca ggttccatgg caaagcggtt gagggaaagg
     1201 gctgctatac ggttgacaac gaggtgcagc catcagggcc ggaggacaat gtggtcatca
     1261 tctacctcag ccgtgctggg aaccctgagg ctgcagtcgc agaaggggtc cctgggcagc
     1321 tgcccccagg cctgctcctg ctgggccaaa gcctcctcct cttcttcttc ctcacctcct
     1381 tctagcccca cccagggctt ccctaactcc tccccttgcc cctaccaatg cccctttaag
     1441 tgctgcaggg gtctggggtt ggcaactcct gaggcctgca tgggtgactt cacattttcc
     1501 tacctctcct tctaatctct tctagagcac ctgctatccc caacttctag acctgctcca
     1561 aactagtgac taggatagaa tttgatcccc taactcactg tctgcggtgc tcattgctgc
     1621 taacagcatt gcctgtgctc tcctctcagg ggcagcatgc taacggggcg acgtcctaat
     1681 ccaactggga gaagcctcag tggtggaatt ccaggcactg tgactgtcaa gctggcaagg
     1741 gccaggattg ggggaatgga gctggggctt agctgggagg tggtctgaag cagacaggga
     1801 atgggagagg aggatgggaa gtagacagtg gctggtatgg ctctgaggct ccctggggcc
     1861 tgctcaagct cctcctgctc cttgctgttt tctgatgatt tgggggcttg ggagtccctt
     1921 tgtcctcatc tgagactgaa atgtggggat ccaggatggc ttccttcctc ttacccttcc
     1981 tccctcagcc tgcaacctct atcctggaac ctgtcctccc tttctcccca actatgcatc
     2041 tgttgtctgc tcctctgcaa aggccagcca gcttgggagc agcagagaaa taaacagcat
     2101 ttctgatgcc 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X70940. H.sapiens mRNA fo...[gi:38455] Links  


LOCUS       HSEFAC1A2               1755 bp    mRNA    linear   PRI 18-AUG-1993
DEFINITION  H.sapiens mRNA for elongation factor 1 alpha-2.
ACCESSION   X70940
VERSION     X70940.1  GI:38455
KEYWORDS    elongation factor; elongation factor 1-alpha-2.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Knudsen,S.M., Frydenberg,J., Clark,B.F. and Leffers,H.
  TITLE     Tissue-dependent variation in the expression of elongation factor-1
            alpha isoforms: isolation and characterisation of a cDNA encoding a
            novel variant of human elongation-factor 1 alpha
  JOURNAL   Eur. J. Biochem. 215 (3), 549-554 (1993)
  MEDLINE   93358875
REFERENCE   2  (bases 1 to 1755)
  AUTHORS   Leffers,H.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-FEB-1993) H. Leffers, Inst of Medical Biochemistry &
            Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus
            University, 8000 Aarhus C, DENMARK
FEATURES             Location/Qualifiers
     source          1..1755
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone_lib="lambda ZAPII; AMA"
     CDS             84..1475
                     /codon_start=1
                     /product="elongation factor 1 alpha-2"
                     /protein_id="CAA50280.1"
                     /db_xref="GI:38456"
                     /db_xref="SWISS-PROT:Q05639"
                     /translation="MGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEK
                     EAAEMGKGSFKYAWVLDKLKAERERGITIDISLWKFETTKYYITIIDAPGHRDFIKNM
                     ITGTSQADCAVLIVAAGVGEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEP
                     AYSEKRYDEIVKEVSAYIKKIGYNPATVPFVPISGWHGDNMLEPSPNMPWFKGWKVER
                     KEGNASGVSLLEALDTILPPTRPTDKPLRLPLQDVYKIGGIGTVPVGRVETGILRPGM
                     VVTFAPVNITTEVKSVEMHHEALSEALPGDNVGFNVKNVSVKDIRRGNVCGDSKSDPP
                     QEAAQFTSQVIILNHPGQISAGYSPVIDCHTAHIACKFAELKEKIDRRSGKKLEDNPK
                     SLKSGDAAIVEMVPGKPMCVESFSQYPPLGRFAVRDMRQTVAVGVIKNVEKKSGGAGK
                     VTKSAQKAQKAGK"
     polyA_signal    1736..1741
BASE COUNT      368 a    585 c    548 g    254 t
ORIGIN      
        1 cctcggctcc ggaatcactg cagcccccct cgccctgagc cagagcaccc cgggtcccgc
       61 cagcccctca cactcccagc aaaatgggca aggagaagac ccacatcaac atcgtggtca
      121 tcggccacgt ggactccgga aagtccacca ccacgggcca cctcatctac aaatgcggag
      181 gtattgacaa aaggaccatt gagaagttcg agaaggaggc ggctgagatg gggaagggat
      241 ccttcaagta tgcctgggtg ctggacaagc tgaaggcgga gcgtgagcgc ggcatcacca
      301 tcgacatctc cctctggaag ttcgagacca ccaagtacta catcaccatc atcgatgccc
      361 ccggccaccg cgacttcatc aagaacatga tcacgggtac atcccaggcg gactgcgcag
      421 tgctgatcgt ggcggcgggc gtgggcgagt tcgaggcggg catctccaag aatgggcaga
      481 cgcgggagca tgccctgctg gcctacacgc tgggtgtgaa gcagctcatc gtgggcgtga
      541 acaaaatgga ctccacagag ccggcctaca gcgagaagcg ctacgacgag atcgtcaagg
      601 aagtcagcgc ctacatcaag aagatcggct acaacccggc caccgtgccc tttgtgccca
      661 tctccggctg gcacggcgac aacatgctgg agccctcccc caacatgccg tggttcaagg
      721 gctggaaggt ggagcgtaag gagggcaacg caagcggcgt gtccctgctg gaggccctgg
      781 acaccatcct gccccccacg cgccccacgg acaagcccct gcgcctgccg ctgcaggacg
      841 tgtacaagat tggcggcatt ggcacggtgc ccgtgggccg ggtggagacc ggcatcctgc
      901 ggccgggcat ggtggtgacc tttgcgccag tgaacatcac cactgaggtg aagtcagtgg
      961 agatgcacca cgaggctctg agcgaagctc tgcccggcga caacgtcggc ttcaatgtga
     1021 agaacgtgtc ggtgaaggac atccggcggg gcaacgtgtg tggggacagc aagtctgacc
     1081 cgccgcagga ggctgctcag ttcacctccc aggtcatcat cctgaaccac ccggggcaga
     1141 ttagcgccgg ctactccccg gtcatcgact gccacacagc ccacatcgcc tgcaagtttg
     1201 cggagctgaa ggagaagatt gaccggcgct ctggcaagaa gctggaggac aaccccaagt
     1261 ccctgaagtc tggagacgcg gccatcgtgg agatggtgcc gggaaagccc atgtgtgtgg
     1321 agagcttctc ccagtacccg cctctcggcc gcttcgccgt gcgcgacatg aggcagacgg
     1381 tggccgtagg cgtcatcaag aacgtggaaa agaagagcgg cggcgccggc aaggtcacca
     1441 agtcggcgca gaaggcgcag aaggcgggca agtgaagcgc gggccgcggc gcgaccctcc
     1501 ccggcggcgc cgcgctccga accccggccc ggcccccgcc ccgcccccgc cccgcgcgcc
     1561 gctccggcgc cccgcacccc cgccaggcgc atgtctgcac ctccgcttgc cagaggccct
     1621 cggtcagcga ctggatgctc gccatcaagg tccagtggaa gttcttcaag aggaaaggcg
     1681 cccccgcccc aggcttccgc gcccagcgct cgccacgctc agtgcccgtt ttaccaataa
     1741 actgagcgac cccag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AL161958. Homo sapiens mRNA...[gi:7328010] Links  


LOCUS       HSM802531               1791 bp    mRNA    linear   PRI 23-MAR-2000
DEFINITION  Homo sapiens mRNA; cDNA DKFZp761B15121 (from clone DKFZp761B15121);
            complete cds.
ACCESSION   AL161958
VERSION     AL161958.1  GI:7328010
KEYWORDS    .
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1791)
  AUTHORS   Poustka,A., Wellenreuther,R., Mewes,H.W., Weil,B. and Wiemann,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-MAR-2000) MIPS, Am Klopferspitz 18a, D-82152
            Martinsried, GERMANY
COMMENT     Clone from S. Wiemann, Molecular Genome Analysis, German Cancer
            Research Center (DKFZ); Email s.wiemann@dkfz-heidelberg.de;
            sequenced by DKFZ (German Cancer Research Center,
            Heidelberg/Germany) within the cDNA sequencing consortium of the
            German Genome Project.
            This clone (DKFZp761B15121) is available at the RZPD in Berlin.
            Please contact the RZPD: Ressourcenzentrum, Heubnerweg 6, 14059
            Berlin-Charlottenburg, GERMANY; Email: clone@rzpd.de Further
            information about the clone and the sequencing project is available
            at http://www.mips.biochem.mpg.de/proj/cDNA/.
FEATURES             Location/Qualifiers
     source          1..1791
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="11q22.3-q23"
                     /clone="DKFZp761B15121"
                     /tissue_type="amygdala"
                     /clone_lib="761 (synonym: hamy2). Vector pSport1; host
                     DH10B; sites NotI + SalI"
                     /dev_stage="adult"
     gene            1..1791
                     /gene="DKFZp761B15121"
     CDS             57..542
                     /gene="DKFZp761B15121"
                     /note="THY1 (Homo sapiens)"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="CAB82306.1"
                     /db_xref="GI:7328011"
                     /translation="MNLAISIALLLTVLQVSRGQKVTSLTACLVDQSLRLDCRHENTS
                     SSPIQYEFSLTRETKKHVLFGTVGVPEHTYRSRTNFTSKYNMKVLYLSAFTSKDEGTY
                     TCALHHSGHSPPISSQNVTVLRDKLVKCEGISLLAQNTSWLLLLLLSLSLLQATDFMS
                     L"
     polyA_signal    1748..1753
                     /gene="DKFZp761B15121"
     polyA_site      1771
                     /gene="DKFZp761B15121"
BASE COUNT      415 a    544 c    471 g    361 t
ORIGIN      
        1 ggaggctgca gcagcggaag accccagtcc agatccagga ctgagatccc agaaccatga
       61 acctggccat cagcatcgct ctcctgctaa cagtcttgca ggtctcccga gggcagaagg
      121 tgaccagcct aacggcctgc ctagtggacc agagccttcg tctggactgc cgccatgaga
      181 ataccagcag ttcacccatc cagtacgagt tcagcctgac ccgtgagaca aagaagcacg
      241 tgctctttgg cactgtgggg gtgcctgagc acacataccg ctcccgaacc aacttcacca
      301 gcaaatacaa catgaaggtc ctctacttat ccgccttcac tagcaaggac gagggcacct
      361 acacgtgtgc actccaccac tctggccatt ccccacccat ctcctcccag aacgtcacag
      421 tgctcagaga caaactggtc aagtgtgagg gcatcagcct gctggctcag aacacctcgt
      481 ggctgctgct gctcctgctc tccctctccc tcctccaggc cacggatttc atgtccctgt
      541 gactggtggg gcccatggag gagacaggaa gcctcaagtt ccagtgcaga gatcctactt
      601 ctctgagtca gctgaccccc tccccgcaat ccctcaaacc ttgaggagaa gtggggaccc
      661 cacccctcat caggagttcc agtgctgcat gcgattatct acccacgtcc acgcggccac
      721 ctcaccctct ccgcacacct ctggctgtct ttttgtactt tttgttccag agctgcttct
      781 gtctggttta tttaggtttt atccttcctt ttctttgaga gttcgtgaag agggaagcca
      841 ggattgggga cctgatggag agtgagagca tgtgaggggt agtgggatgg tggggtacca
      901 gccactggag gggtcatcct tgcccatcgg gaccagaaac ctgggagaga cttggatgag
      961 gagtggttgg gctgtgcctg ggcctagcac ggacatggtc tgtcctgaca gcactcctcg
     1021 gcaggcatgg ctggtgcctg aagaccccag atgtgagggc accaccaaga atttgtggcc
     1081 taccttgtga gggagagaac tgagcatctc cagcattctc agccacaacc aaaaaaaaat
     1141 aaaaagggca gccctcctta ccactgtgga agtccctcag aggccttggg gcatgaccca
     1201 gtgaagatgc aggtttgacc aggaaagcag cgctagtgga gggttggaga aggaggtaag
     1261 gatgagggtt catcatccct ccctgcctaa ggaagctaaa agcatggccc tgctgcccct
     1321 ccctgcctcc acccacagtg gagagggcta caaaggagga caagaccctc tcaggctgtc
     1381 ccaagctccc aagagcttcc agagctctga cccacagcct ccaagtcagg tggggtggag
     1441 tcccagagct gcacagggtt tggcccaagt ttctaaggga ggcacttcct cccctcgccc
     1501 atcagtgcca gcccctgctg gctggtgcct gagcccctca gacagccccc tgccccgcag
     1561 gcctgccttc tcagggactt ctgcggggcc tgaggcaagc catggagtga gacccaggag
     1621 ccggacactt ctcaggaaat ggcttttccc aacccccagc ccccacccgg tggttcttcc
     1681 tgttctgtga ctgtgtatag tgccaccaca gcttatggca tctcattgag gacaaagaaa
     1741 actgcacaat aaaaccaagc ctctggaatc taaaaaaaaa aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&

    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH002674. Human biglycan (B...[gi:179432] Links  


LOCUS       HUMBGN1                 1141 bp    DNA     linear   PRI 07-NOV-1994
DEFINITION  Human biglycan (BGN) gene, exon 1.
ACCESSION   M65151
VERSION     M65151.1  GI:179428
KEYWORDS    CSPGI; DSPGI; PG1; PGI; biglycan.
SEGMENT     1 of 4
SOURCE      Homo sapiens adult DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1141)
  AUTHORS   Fisher,L.W., Heegaard,A.M., Vetter,U., Vogel,W., Just,W.,
            Termine,J.D. and Young,M.F.
  TITLE     Human biglycan gene. Putative promoter, intron-exon junctions, and
            chromosomal localization
  JOURNAL   J. Biol. Chem. 266 (22), 14371-14377 (1991)
  MEDLINE   91317791
   PUBMED   1860845
FEATURES             Location/Qualifiers
     source          1..1141
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="Xq13-qter"
                     /cell_line="W138 lung fibroblast"
                     /cell_type="fibroblast"
                     /dev_stage="adult"
     exon            505..637
                     /gene="BGN"
                     /product="biglycan"
                     /note="G00-119-727"
                     /number=1
BASE COUNT      141 a    457 c    295 g    248 t
ORIGIN      Map position:  Xq27-ter.
        1 gatcctggag aaaccacctc cttgcttagg cccaagcagg ttcctggcag gctcaggacc
       61 aaattccagg ggccactcat gggcctagca gcccaaggcc gcctccccct cgtctttctt
      121 ccatctctct ttcctcgtcc tggcgagatg ccagccagca cctcagtgtc cccatctggg
      181 cagtggaaag tttgactctc tgggtccttg tttgagtgag tgcgagtgtg tccgttcctt
      241 tgctgtctgc cccaggcggg ggaggggggg ggaggtggtg ggggcgaggg ggcgggggct
      301 cagctagtcc agccgtctac aagaaaattg ctccctttga agctgccagg ggggccggga
      361 agcctgcccc ctcctgctcg cccgccctct ccgccccacc agccccctcc ctcctttcct
      421 ccctccccgc cctctccccg ctgtcccctc cccgtcggcc cgcctgccca gcctttagcc
      481 tcccgcccgc cgcctctgtc tccctctctc cacaaactgc ccaggagtga gtagctgctt
      541 tcggtccgcc ggacacaccg gacagataga cgtgcggacg gcccaccacc ccagcccgcc
      601 aactagtcag cctgcgcctg gcgcctcccc tctccaggta gggctggctt caagctgcct
      661 cctcagcaac ccagagatgc ccctggctct gctgcctccg ctgtcccaag ccctggtcct
      721 gctgtcccca gtgccgcgag ggtgtccaca gatttccccg gtgctctctg taggctgctg
      781 atccacgccc cttcatcgcc accctgcggc ccccttggtc cctgtcaggc ttctgctcgt
      841 ctcgccgccc tccaggcacc tttccctcac cccttcctct cccttctgac cttgctctgc
      901 ttcatccacc tcttgtctct ctgcctccca ctcggggtcc gtcttcttgg ctaccaccct
      961 agagcgtggc tgggtgactg gtaccccagc tttgccaatg gccctgtttc atcattgcaa
     1021 gtcccaggcg catgctccac tccctcagcc tcgctctgcc caggcgcctc cttgctccag
     1081 gcttggcgcc tggcccgggt tgggtcggat cggggaggac cgcccagcgc ccaccgagct
     1141 c
//
LOCUS       HUMBGN2                 2631 bp    DNA     linear   PRI 07-NOV-1994
DEFINITION  Human biglycan (BGN) gene, exons 2-7.
ACCESSION   M65152
VERSION     M65152.1  GI:179429
KEYWORDS    CSPGI; DSPGI; PG1; PGI; biglycan.
SEGMENT     2 of 4
SOURCE      Homo sapiens adult DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2631)
  AUTHORS   Fisher,L.W., Heegaard,A.M., Vetter,U., Vogel,W., Just,W.,
            Termine,J.D. and Young,M.F.
  TITLE     Human biglycan gene. Putative promoter, intron-exon junctions, and
            chromosomal localization
  JOURNAL   J. Biol. Chem. 266 (22), 14371-14377 (1991)
  MEDLINE   91317791
   PUBMED   1860845
FEATURES             Location/Qualifiers
     source          1..2631
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="Xq13-qter"
                     /cell_line="W138 lung fibroblast"
                     /cell_type="fibroblast"
                     /dev_stage="adult"
     mRNA            join(M65151.1:505..637,46..294,584..774,1286..1499,
                     1950..2060,2251..2344,2467..2605)
                     /partial
                     /gene="BGN"
                     /note="G00-119-727"
     intron          order(M65151.1:638..>1141,<1..45)
                     /gene="BGN"
                     /note="Approx. .65 kb gap; G00-119-727"
                     /number=1
     exon            46..294
                     /gene="BGN"
                     /product="biglycan"
                     /note="G00-119-727"
                     /number=2
     intron          295..583
                     /gene="BGN"
                     /note="G00-119-727"
                     /number=2
     exon            584..774
                     /gene="BGN"
                     /product="biglycan"
                     /note="G00-119-727"
                     /number=3
     intron          775..1285
                     /gene="BGN"
                     /note="G00-119-727"
                     /number=3
     exon            1286..1499
                     /gene="BGN"
                     /product="biglycan"
                     /note="G00-119-727"
                     /number=4
     intron          1500..1949
                     /gene="BGN"
                     /note="G00-119-727"
                     /number=4
     exon            1950..2060
                     /gene="BGN"
                     /product="biglycan"
                     /note="G00-119-727"
                     /number=5
     intron          2061..2250
                     /gene="BGN"
                     /note="G00-119-727"
                     /number=5
     exon            2251..2344
                     /gene="BGN"
                     /product="biglycan"
                     /note="G00-119-727"
                     /number=6
     intron          2345..2466
                     /gene="BGN"
                     /note="G00-119-727"
                     /number=6
     exon            2467..2605
                     /gene="BGN"
                     /product="biglycan"
                     /note="G00-119-727"
                     /number=7
BASE COUNT      507 a    800 c    817 g    507 t
ORIGIN      Map position:  Xq27-ter.
        1 acaggtgggt gctggtgctg atgatcccct cgcctcttcc cccaggtcca tccgccatgt
       61 ggcccctgtg gcgcctcgtg tctctgctgg ccctgagcca ggccctgccc tttgagcaga
      121 gaggcttctg ggacttcacc ctggacgatg ggccattcat gatgaacgat gaggaagctt
      181 cgggcgctga cacctcgggc gtcctggacc cggactctgt cacacccacc tacagcgcca
      241 tgtgtccttt cggctgccac tgccacctgc gggtggttca gtgctccgac ctgggtttgt
      301 ccctgagtga tggggagcgg ggcatgcagg gaggctcagg tgcagcctga gagccccttc
      361 tgaagggggc acatgctggt cctgtggacg gtggcgagca tgatgtaagt gtaggagggg
      421 tccagccgtc tggctgtgag ctgtgcagtt tgtgcccact tgtggtggca tccccgtgtg
      481 cccgtcagtg tccctgtgtg tgtgtccccg gtcctcccta ccagtggggc tagtcggctg
      541 gatggctcca agttcatgct ggtgatggtg gtggggcccc taggtctcga gttcatgctg
      601 gtggtggggg tggggcccct aggtctcaag ttcatgctgg tgatgggggt ggggccccta
      661 ggtctgaagt ctgtgcccaa agagatctcc cctgacacca cgctgctgga cctgcagaac
      721 aacgacatct ccgagctccg caaggatgac ttcaagggtc tccagcacct ctacgtaagg
      781 agctgggagg aaccagcagg cctacagcag agggcagggg tccgggtggg tgcatgtgcg
      841 tggacgtgtg gggtatgaga ggggttcggg gactcgtggg acttcagggt gaagcctgga
      901 gccagccgtg atgggagctc ccgggtttgc ggctcactca tgtgggtttg agcaaccaca
      961 gctgcaggac cggatcgctc agttcggctc ccttcgtggc tgaaaacgtt tcatcacgtc
     1021 cactcctccc agcaacagag gagaacggat ttcattgtag ccagtgtgcg tgtgaggaaa
     1081 ctgaggctgg gagcggcaag gcagtggtgg cactgctggg gctcaggacc gggcctgggt
     1141 gctgcctcct gccctgcact ctgctcacaa gcatggactg acctcctcga gcgccagtgg
     1201 gctggggagg cacaggaagg caggagagag gggcgggtgg ggtggggagt ctgtgccttc
     1261 acctcctccg cccaccctgc ttcaggccct cgtcctggtg aacaacaaga tctccaagat
     1321 ccatgagaag gccttcagcc cactgcggaa cgtgcagaag ctctacatct ccaagaacca
     1381 cctggtggag atcccgccca acctacccag ctccctggtg gagctccgca tccacgacaa
     1441 ccgcatccgc aaggtgccca agggagtgtt cagtgggctc cggaacatga actgcatcgg
     1501 tgagctgagg gcctcccaga acattccaga gccttgtctc gaggcatggg gaagggagac
     1561 caaggaatac ctttagaggc tcagttcaag aaagagtatg gtgagaacgg tcaaaagaaa
     1621 atccatggat ttcttggcaa atcctccatg caggcgatca ccacggctaa agagaagact
     1681 ggccagaggg gccgggtggc ttccggagcc ccatcttcat ctctggcact cctccctttc
     1741 ctcttgctgc ccctggagct agcagtcctg gggctagcag tcctgaacag ctaggagttt
     1801 gcaattagcc cggtaaatta gcagaactgc tttcaggaga cgggagcagc cggcaggtag
     1861 cagggcccac cacactggcc cggaagtgac aggacccagg gctgtgcagg gaccaccagg
     1921 ctcccgggct aatgaggtct ctcccctaga gatgggcggg aacccactgg agaacagtgg
     1981 ctttgaacct ggagccttcg atggcctgaa gctcaactac ctgcgcatct cagaggccaa
     2041 gctgactggc atccccaaag gtaggaagcc cactcttcct gcacgcctgc ctgcctcacc
     2101 cccaacagca cagatggcca gggtgggggc tctggatggg cccgatctac tcagggaaag
     2161 gctcaacagt cccctcccgc cacctggggc agagctaggg cccctgccct cagcacctgc
     2221 attctcccct gtgccctctt ctcctggcag acctccctga gaccctgaat gaactccacc
     2281 tagaccacaa caaaatccag gccatcgaac tggaggacct gcttcgctac tccaagctgt
     2341 acaggtgagg ccagcagggc accgccaagg gtgatgccag agtccctcag tgctgtgtgg
     2401 cccctcgcgc ccagcccccc atccttacct ccagcctttg agtccgtgtc attctcccgc
     2461 tcacaggctg ggcctaggcc acaaccagat caggatgatc gagaacggga gcctgagctt
     2521 cctgcccacc ctccgggagc tccacttgga caacaacaag ttggccaggg tgccctcagg
     2581 gctcccagac ctcaagctcc tccaggtgag agctgggcat gcacagccag g
//
LOCUS       HUMBGN3                  941 bp    DNA     linear   PRI 07-NOV-1994
DEFINITION  Human biglycan (BGN) gene, exon 8.
ACCESSION   M65153
VERSION     M65153.1  GI:179430
KEYWORDS    CSPGI; DSPGI; PG1; PGI; biglycan.
SEGMENT     3 of 4
SOURCE      Homo sapiens adult DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 941)
  AUTHORS   Fisher,L.W., Heegaard,A.M., Vetter,U., Vogel,W., Just,W.,
            Termine,J.D. and Young,M.F.
  TITLE     Human biglycan gene. Putative promoter, intron-exon junctions, and
            chromosomal localization
  JOURNAL   J. Biol. Chem. 266 (22), 14371-14377 (1991)
  MEDLINE   91317791
   PUBMED   1860845
FEATURES             Location/Qualifiers
     source          1..941
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="Xq13-qter"
                     /cell_line="W138 lung fibroblast"
                     /cell_type="fibroblast"
                     /dev_stage="adult"
     CDS             join(M65152.1:57..294,M65152.1:584..774,
                     M65152.1:1286..1499,M65152.1:1950..2060,
                     M65152.1:2251..2344,M65152.1:2467..2605,286..483)
                     /gene="BGN"
                     /codon_start=1
                     /product="biglycan"
                     /protein_id="AAA52287.1"
                     /db_xref="GI:179433"
                     /db_xref="GDB:G00-119-727"
                     /translation="MWPLWRLVSLLALSQALPFEQRGFWDFTLDDGPFMMNDEEASGA
                     DTSGVLDPDSVTPTYSAMCPFGCHCHLRVVQCSDLGLEFMLVVGVGPLGLKFMLVMGV
                     GPLGLKSVPKEISPDTTLLDLQNNDISELRKDDFKGLQHLYALVLVNNKISKIHEKAF
                     SPLRNVQKLYISKNHLVEIPPNLPSSLVELRIHDNRIRKVPKGVFSGLRNMNCIEMGG
                     NPLENSGFEPGAFDGLKLNYLRISEAKLTGIPKDLPETLNELHLDHNKIQAIELEDLL
                     RYSKLYRLGLGHNQIRMIENGSLSFLPTLRELHLDNNKLARVPSGLPDLKLLQVVYLH
                     SNNITKVGVNDFCPMGFGVKRAYYNGISLFNNPVPYWEVQPATFRCVTDRLAIQFGNY
                     KK"
     intron          order(M65152.1:2606..>2631,<1..285)
                     /gene="BGN"
                     /note="Approx. 1.2 kb gap; G00-119-727"
BASE COUNT      194 a    340 c    235 g    172 t
ORIGIN      Map position:  Xq27-ter.
        1 acctcacacc accaaacaca cctctacccc agccccgccc ccacatgtcc tcaacctgac
       61 ccacctgaga ccctcatcct tgtccctggt cacatccagt gccttaatcc tggctgacac
      121 ccacacaaat aacacgccca tgccttggtt tgctcctccc aacaacgggg agcctctggt
      181 gtggcccttg aagtaggttg cagaggcaac agcaaaatgc ctcctggagg cagcgggctt
      241 ggcgtggagg gagggaggcc tgtgacccgg cctctctgcc ttcaggtggt ctatctgcac
      301 tccaacaaca tcaccaaagt gggtgtcaac gacttctgtc ccatgggctt cggggtgaag
      361 cgggcctact acaacggcat cagcctcttc aacaaccccg tgccctactg ggaggtgcag
      421 ccggccactt tccgctgcgt cactgaccgc ctggccatcc agtttggcaa ctacaaaaag
      481 tagaggcagc tgcagccacc gcggggcctc agtgggggtc tctggggaac acagccagac
      541 atcctgatgg ggaggcagag ccaggaagct aagccagggc ccagctgcgt ccaacccagc
      601 cccccacctc aggtccctga ccccagctcg atgccccatc accgcctctc cctggctccc
      661 aagggtgcag gtgggcgcaa ggcccggccc ccatcacatg ttcccttggc ctcagagctg
      721 cccctgctct cccaccacag ccacccagag gcaccccatg aagctttttt ctcgttcact
      781 cccaaaccca agtgtccaaa gctccagtcc taggagaaca gtccctgggt cagcagccag
      841 gaggcggtcc ataagaatgg ggacagtggg ctctgccagg gctgccgcac ctgtccagaa
      901 caacatgttc tgttcctcct cctcatgcat ttccagcctt g
//
LOCUS       HUMBGN4                  260 bp    DNA     linear   PRI 07-NOV-1994
DEFINITION  Human biglycan (BGN) gene, intron 8 region.
ACCESSION   M65154
VERSION     M65154.1  GI:179431
KEYWORDS    CSPGI; DSPGI; PG1; PGI; biglycan.
SEGMENT     4 of 4
SOURCE      Homo sapiens adult DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 260)
  AUTHORS   Fisher,L.W., Heegaard,A.M., Vetter,U., Vogel,W., Just,W.,
            Termine,J.D. and Young,M.F.
  TITLE     Human biglycan gene. Putative promoter, intron-exon junctions, and
            chromosomal localization
  JOURNAL   J. Biol. Chem. 266 (22), 14371-14377 (1991)
  MEDLINE   91317791
   PUBMED   1860845
FEATURES             Location/Qualifiers
     source          1..260
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="Xq13-qter"
                     /cell_line="W138 lung fibroblast"
                     /cell_type="fibroblast"
                     /dev_stage="adult"
     gene            join(M65151.1:505..1141,M65152.1:1..2631,M65153.1:1..941,
                     1..260)
                     /gene="BGN"
     intron          order(M65153.1:286..>914,<1..260)
                     /gene="BGN"
                     /note="Approx 1.3 kb gap; G00-119-727"
                     /number=8
BASE COUNT       47 a     95 c     68 g     50 t
ORIGIN      Map position:  Xq27-ter.
        1 ggacagcggt ctccccagcc tgccctgctc agccctgccc ccaaacctgt actgtcccgg
       61 aggaggttgg gaggtggagg cccagcatcc cgcgcagatg acaccatcaa ccgccagagt
      121 cccagacacc ggttttccta gaagcccctc acccccactg gcccactggt ggctaggtct
      181 ccccttactc ttctggtcca gcgcaaccag gggctgcttc tgaggtcggt ggctgtcttt
      241 ccattaaaga aacaccgtgc 
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC002416. Homo sapiens, big...[gi:12803216] Links  


LOCUS       BC002416                2361 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, biglycan, clone MGC:2298 IMAGE:3162633, mRNA,
            complete cds.
ACCESSION   BC002416
VERSION     BC002416.1  GI:12803216
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2361)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site:       http://www.nisc.nih.gov/
            Contact:        nisc_mgc@nhgri.nih.gov
            Shevchenko,Y., Wetherby,K.D., Beckstrom-Sternberg,S.M.,
            Benjamin,B., Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Guan,X., Gupta,J., Ho,S.-L., Karlins,E., Legaspi,R.,
            Lim,M., Maduro,Q.L., Masiello,C., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Snyder,B., Stantripop,S., Thomas,P.J.,
            Tiongson,E.E., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 5 Row: c Column: 11
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 6960456.
FEATURES             Location/Qualifiers
     source          1..2361
                     /organism="Homo sapiens"
                     /db_xref="LocusID:633"
                     /db_xref="taxon:9606"
                     /clone="MGC:2298 IMAGE:3162633"
                     /tissue_type="Brain, neuroblastoma"
                     /clone_lib="NIH_MGC_19"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             139..1245
                     /codon_start=1
                     /product="biglycan"
                     /protein_id="AAH02416.1"
                     /db_xref="GI:12803217"
                     /translation="MWPLWRLVSLLALSQALPFEQRGFWDFTLDDGPFMMNDEEASGA
                     DTSGVLDPDSVTPTYSAMCPFGCHCHLRVVQCSDLGLKSVPKEISPDTTLLDLQNNDI
                     SELRKDDFKGLQHLYALVLVNNKISKIHEKAFSPLRKLQKLYISKNHLVEIPPNLPSS
                     LVELRIHDNRIRKVPKGVFSGLRNMNCIEMGGNPLENSGFEPGAFDGLKLNYLRISEA
                     KLTGIPKDLPETLNELHLDHNKIQAIELEDLLRYSKLYRLGLGHNQIRMIENGSLSFL
                     PTLRELHLDNNKLARVPSGLPDLKLLQVVYLHSNNITKVGVNDFCPMGFGVKRAYYNG
                     ISLFNNPVPYWEVQPATFRCVTDRLAIQFGNYKK"
BASE COUNT      459 a    841 c    587 g    474 t
ORIGIN      
        1 ggcacgaggc ccaggagtga gtagctgctt tcggtccgcc ggacacaccg gacagataga
       61 cgtgcggacg gcccaccacc ccagccctcc aactagtcag cctgcgcctg gcgcctcccc
      121 tctccaggtc catccgccat gtggcccctg tggcgcctcg tgtctctgct ggccctgagc
      181 caggccctgc cctttgagca gagaggcttc tgggacttca ccctggacga tgggccattc
      241 atgatgaacg atgaggaagc ttcgggcgct gacacctcag gcgtcctgga cccggactct
      301 gtcacaccca cctacagcgc catgtgtcct ttcggctgcc actgccacct gcgggtggtt
      361 cagtgctccg acctgggtct gaagtctgtg cccaaagaga tctcccctga caccacgctg
      421 ctggacctgc agaacaacga catctccgag ctccgcaagg atgacttcaa gggtctccag
      481 cacctctacg ccctcgtcct ggtgaacaac aagatctcca agatccatga gaaggccttc
      541 agcccactgc ggaagctgca gaagctctac atctccaaga accacctggt ggagatcccg
      601 cccaacctac ccagctccct ggtggagctc cgcatccacg acaaccgcat ccgcaaggtg
      661 cccaagggag tgttcagcgg gctccggaac atgaactgca tcgagatggg cgggaaccca
      721 ctggagaaca gtggctttga acctggagcc ttcgatggcc tgaagctcaa ctacctgcgc
      781 atctcagagg ccaagctgac tggcatcccc aaagacctcc ctgagaccct gaatgaactc
      841 cacctagacc acaacaaaat ccaggccatc gaactggagg acctgcttcg ctactccaag
      901 ctgtacaggc tgggcctagg ccacaaccag atcaggatga tcgagaacgg gagcctgagc
      961 ttcctgccca ccctccggga gctccacttg gacaacaaca agttggccag ggtgccctca
     1021 gggctcccag acctcaagct cctccaggtg gtctatctgc actccaacaa catcaccaaa
     1081 gtgggtgtca acgacttctg tcccatgggc ttcggggtga agcgggccta ctacaacggc
     1141 atcagcctct tcaacaaccc cgtgccctac tgggaggtgc agccggccac tttccgctgc
     1201 gtcactgacc gcctggccat ccagtttggc aactacaaaa agtagaggca gctgcagcca
     1261 ccgcggggcc tcagtggggg tctctgggga acacagccag acatcctgat ggggaggcag
     1321 agccaggaag ctaagccagg gcccagctgc gtccaaccca gccccccacc tcgggtccct
     1381 gaccccagct cgatgcccca tcaccgcctc tccctggctc ccaagggtgc aggtgggcgc
     1441 aaggcccggc ccccatcaca tgttcccttg gcctcagagc tgcccctgct ctcccaccac
     1501 agccacccag aggcacccca tgaagctttt ttctcgttca ctcccaaacc caagtgtcca
     1561 aggctccagt cctaggagaa cagtccctgg gtcagcagcc aggaggcggt ccataagaat
     1621 ggggacagtg ggctctgcca gggctgccgc acctgtccag acacacatgt tctgttcctc
     1681 ctcctcatgc atttccagcc tttcaaccct ccccgactct gcggctcccc tcagccccct
     1741 tgcaagttca tggcctgtcc ctcccagacc cctgctccac tggcccttcg accagtcctc
     1801 ccttctgttc tctctttccc cgtccttcct ctctctctct ctctctctct ctctctcttt
     1861 ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt cttgtgcttc ctcagacctt
     1921 tctcgcttct gagcttggtg gcctgttccc tccatctctc cgaacctggc ttcgcctgtc
     1981 cctttcactc cacaccctct ggccttctgc cttgagctgg gactgctttc tgtctgtccg
     2041 gcctgcaccc agcccctgcc cacaaaaccc cagggacagc ggtctcccca gcctgccctg
     2101 ctcaggcctt gcccccaaac ctgtactgtc ccggaggagg ttgggaggtg gaggcccagc
     2161 atcccgcgca gatgacacca tcaaccgcca gagtcccaga caccggtttt cctagaagcc
     2221 cctcaccccc actggcccac tggtggctag gtctcccctt atccttctgg tccagcgcaa
     2281 ggaggggctg cttctgaggt cggtggctgt ctttccatta aagaaacacc gtgcaacgtg
     2341 aaaaaaaaaa aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: AI755052. cr34f05.x1 Jia bo...[gi:5133305] Links  


IDENTIFIERS

dbEST Id:       2664787
EST name:       cr34f05.x1
GenBank Acc:    AI755052
GenBank gi:     5133305

CLONE INFO
Clone Id:       HBMSC_cr34f05 (3')
Source:         Libin Jia
Plate:          34 Row: f Column: 05
DNA type:       cDNA

PRIMERS
Sequencing:     -21M13 forward primer (ABI)
PolyA Tail:     Unknown

SEQUENCE
                TTTTTTTTTTTTTTTTTTTGAATTTTAATATGATATTTTATTATGGGTGTCTGTAAGGAA
                AAAAAAGATCAACAACCACATACAAGCTTACAAAGTTAAATTTCAACACATTCTCTATGC
                TAGTGTGACAAAAGCAGCCCCATAATTTGGTTTTTATTGTTGACCTTTACAGGATGAAGG
                AGGAGAATCCCCTGTGGCATGCCAATGAATCTTTCTGATGGGAGACATGTACAGATTTTG
                TGCATTTATGTTCTGAATGCAAGTCAACAATTCTGATCTAGAGTTTAAAAGTGAAAGTAC
                ATTAGCACCATAACATGCGTCTTTAAAGCCTTCCCAAATATTAGTAATCTTGACCAGCAA
                TGACAAGAAAAAAGAGGAGCACCTTTACAAGCAGTTGATATCCAATATTAAAATAATTGT
                GGCTTTAAAAATATTTCTTTAAATTCTTGCATTACACTTTTCTTTTTAAACCAATCTTCC
                AGGAGATTAATCAATGAAATTTATAAGTTTTATCAACGTATAAAATTTTTTTCATCTTCT
                GGGACTCATAGAATACAATCTGTGTTTCTGACCAGTTGAGGTAGTTAAAATAGGGAGGGC
                TTTTCTAATTTCGT

Entry Created:  Jun 22 1999
Last Updated:   Jun 20 2002

COMMENTS
                DNA Sequencing and analyses by National Institutes of Health
                Intramural Sequencing Center (NISC).

LIBRARY
Lib Name:       Human bone marrow stromal cells
Organism:       Homo sapiens
Sex:            mixed
Tissue type:    bone marrow stroma
Develop. stage: mixed
Lab host:       XL1-Blue MRF'/SOLR
Vector:         pBluescript
R. Site 1:      EcoRI
R. Site 2:      XhoI
Description:    mRNA made from human bone marrow stroma, cDNA made by
                oligo-dT priming. Directionally cloned. Size-selected for
                average insert size >0.5 kb. Library constructed by Dr.
                Marian Young and Dr. Pamela Gehron Robey (NIDCR). Library
                supplied by Dr. Libin Jia (NHGRI)

SUBMITTER
Name:           Libin Jia
Lab:            Medical Genetics Branch
Institution:    National Human Genome Research Institute
Address:        10/10C101, 9000 Rockville Pike, Bethesda, MD 20892-1267, USA
Tel:            301-402-4877
Fax:            301-496-7157
E-mail:         libin@helix.nih.gov

CITATIONS
Medline UID:    21686149
Title:          Gene expression profile of human bone marrow stromal cells:
                high-throughput expressed sequence tag sequencing analysis
Authors:        Jia,L., Young,M.F., Powell,J., Yang,L., Ho,N.C., Hotchkiss
                ,R., Robey,P.G., Francomano,C.A.
Citation:       Genomics 79 (1): 7-17 2002


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M26939. Human collagen ty...[gi:180813] Links  


LOCUS       HUMCOL3A1A              2275 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagen type-III (COL3A1) gene, 5' end.
ACCESSION   M26939
VERSION     M26939.1  GI:180813
KEYWORDS    Alu repeat; collagen.
SOURCE      Human DNA, (libraries of A.Bank, T.Maniatis, and M.Baird).
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2275)
  AUTHORS   Benson-Chanda,V., Su,M.W., Weil,D., Chu,M.L. and Ramirez,F.
  TITLE     Cloning and analysis of the 5' portion of the human type-III
            procollagen gene (COL3A1)
  JOURNAL   Gene 78 (2), 255-265 (1989)
  MEDLINE   89378752
   PUBMED   2777083
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Ramirez, 12-OCT-1989.
            Though the sequence was obtained from genomic DNA, only the exon
            sequences in the coding regions are reported.
FEATURES             Location/Qualifiers
     source          1..2275
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..2275
                     /gene="COL3A1"
     repeat_region   <1..237
                     /note="Alu repeat"
     TATA_signal     1602..1607
                     /gene="COL3A1"
     mRNA            1629..>2275
                     /gene="COL3A1"
                     /product="COL3A1 mRNA"
     misc_signal     1727..1754
                     /gene="COL3A1"
                     /note="region of dyad symmetry"
     CDS             1748..>2275
                     /gene="COL3A1"
                     /note="alpha-1 preprocollagen type-III"
                     /codon_start=1
                     /protein_id="AAA52040.1"
                     /db_xref="GI:180814"
                     /translation="MMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDV
                     WKPEPCQICVCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQ
                     GPQGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDV
                     KSGVAVGGLAGYPGPA"
     sig_peptide     1748..1816
                     /gene="COL3A1"
                     /note="alpha-1 collagen type-III signal peptide"
     mat_peptide     2207..>2275
                     /gene="COL3A1"
                     /product="alpha-1 collagen type-III"
BASE COUNT      701 a    435 c    508 g    631 t
ORIGIN      1919 bp upstream of BamHI site.
        1 ggcgggcggg atcacgaggt caggacatca gatccatcct gactaactcg gtgaaacccg
       61 tctctactaa aaatacaaaa aattaaccag gcgtagtggc gggcctgtag tcccagctac
      121 tcgggaggct gaggcaggag aatggcgtga acccgggagc ggacttgccg tgagccagat
      181 tgcgcctgca gtctagcctg gagagagggc gaccccgtac gaaaaaaaaa aaaaaaagat
      241 gcaagaaaca agaaagcaag tttgccactg tccatgctta caggattcat gtacacctgg
      301 gatatacata tgacatgagt tgatgaaaat ttagtaacac aaattagaaa gagaaagatg
      361 aaaattagtt taaagaggta atttacttaa ggaaagtttt gcattaggca taaggaagtg
      421 acttccatta attggtgtga aatggggaga attttatgtg cacaggcata gagacggaca
      481 tgtttaggtg aaggtgaaga aaaggagttc aagtgacctt gattagaatt gaaatgtcca
      541 ccgtagatta tagagataga tgaactaact atgaactgat gattagggtg taaagaaata
      601 tctccattgc cagatatagc cttatactag tggacaagca tatttttcct gcagagattt
      661 atcaaagacc tataaatgta tttgcccctc tttcaaatat gacacaaaaa ttactgctta
      721 gatagatcca gaaatacatt ggaaagtttt ctggtatgag gatctggaaa gtatacacat
      781 aaaagtctaa attagaaata tttatactaa tctttagagt aatgcagaaa aaatgaccac
      841 tttctctggt actcttttgc tattctatgt gataaactct ttctcttcca gtttcctaaa
      901 aagttatgtc atgaaaaggc tgattctcta gaaacttgct tgcttcccag gcagcataaa
      961 atggaatctc acagaagact ctggcttgct gaagaaattg tgagaaatgg aaacagctat
     1021 agataagtat ctaactttta ggaagccatt caaacattgc tgaaatactt gtcttttgat
     1081 atttgcctga aacttaactt cctaggaccc aggtggtgat gaagtcgaag gggcataccg
     1141 gtgaggcatt tctttccgtg agtctcttac agtctcctat ttaaattgag ttaggattac
     1201 ttctggcaaa atcccaacat aaaaatcttc taggaagatc agttctgtaa attagacata
     1261 ctagataaat gggcatcaag cagtttttca aaattatgca gttgttaact tcataagggg
     1321 aaataaaaat gtatgcattt acattgtatg attaaaacaa ggcagagcat ttctatacgt
     1381 tcctaagtta tacaaacata tatgtaagag tgaaatatgt aaaaaaactt ttacataagc
     1441 agatgcatac aaactccaga tgtgctcttt ttcttactgt gggttgtgtc ttctataagg
     1501 gaaaaagaaa tatttatcat ttcttttact gctgagggga tgggtgcggc tctcatattt
     1561 cagaaagggg ctggaaagtg agggaagcca aactttttcc tatttaaggc caaagcaaag
     1621 gaatctcagt ggctgagttt tatgacgggc ccggtgcctg aagggcaggg aacaacttga
     1681 tggtgctact ttgaactgct tttcttttct ccttttgcac caagagtctc atgtctgata
     1741 tttagacatg atgagctttg tgcaaaaggg gagctggcta cttctcgctc tgcttcatcc
     1801 cactattatt ctggcacaac aggaagctgt tgaaggagga tgttcccatc ttggtcagtc
     1861 ctatgcggat agagatgtct ggaagccaga accatgccaa atatgtgtct gtgactcagg
     1921 atccgttctc tgcgatgaca taatatgtga cgatcaagaa ttagactgcc ccaacccaga
     1981 aattccattt ggagaatgtt gtgcagtttg cccacagcct ccaactgctc ctactcgccc
     2041 tcctaatggt caaggacctc aaggccccaa gggagatcca ggacctcctg gtattcctgg
     2101 gagaaatggt gaccctggta ttccaggaca accagggtcc cctggttctc ctggcccccc
     2161 tggaatctgt gaatcatgcc ctactggtcc tcagaactat tctccccagt atgattcata
     2221 tgatgtcaag tctggagtag cagtaggagg actcgcaggc tatcctggac cagct
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X14420. Human mRNA for pr...[gi:30057] Links  


LOCUS       HSCOL3AI                5460 bp    mRNA    linear   PRI 31-MAR-1995
DEFINITION  Human mRNA for pro-alpha-1 type 3 collagen.
ACCESSION   X14420
VERSION     X14420.1  GI:30057
KEYWORDS    COL3A1 gene; collagen; collagen alpha 1 type III; collagen type
            III.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3690; 4501 to 5460)
  AUTHORS   Ala-Kokko,L., Kontusaari,S., Baldwin,C.T., Kuivaniemi,H. and
            Prockop,D.J.
  TITLE     Structure of cDNA clones coding for the entire prepro alpha 1 (III)
            chain of human type III procollagen. Differences in protein
            structure from type I procollagen and conservation of codon
            preferences
  JOURNAL   Biochem. J. 260 (2), 509-516 (1989)
  MEDLINE   89350838
REFERENCE   2  (bases 1 to 5460)
  AUTHORS   Prockop,D.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-FEB-1989) Prockop D.J., Department of Biochemistry
            and Molecular Biology, Thomas Jefferson University, 1020 Locust
            Street, Room 490, Philadelphia, PA 19107
COMMENT     Data kindly reviewed (29-aug-1989) by Ala-Kokko L.
FEATURES             Location/Qualifiers
     source          1..5460
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="2q"
                     /clone="S413 and S31."
                     /cell_line="JIMM-69"
                     /cell_type="skin fibroblast"
                     /clone_lib="CB-JIMM-69"
                     /dev_stage="neonatal"
     CDS             103..4503
                     /codon_start=1
                     /product="prepro-alpha-1 type 3 collagen"
                     /protein_id="CAA32583.1"
                     /db_xref="GI:30058"
                     /db_xref="SWISS-PROT:P02461"
                     /translation="MMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDV
                     WKPEPCQICVCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQ
                     GPQGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDV
                     KSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPP
                     GAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKG
                     ETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGP
                     PGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSP
                     GGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERG
                     EAGIPGVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPAGE
                     RGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRPGPP
                     GPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPTG
                     PGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGE
                     RGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPK
                     GDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPG
                     ERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAGPPGP
                     QGVKGERGSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPGKDGPPGPAGNTGAP
                     GSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPG
                     PQGVKGESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGG
                     KGDRGENGSPGAPGAPGHPGPPGPVGPAGKSGDRGESGPAGPAGAPGPAGSRGAPGPQ
                     GPRGDKGETGERGAAGIKGHRGFPGNPGAPGSPGPAGQQGAIGSPGPAGPRGPVGPSG
                     PPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGVGAAA
                     IAGIGGEKAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIESLISPDGSRKNPARNC
                     RDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSS
                     AEKKHVWFGESMDGGFQFSYGNPELPEDVLDVQLAFLRLLSSRASQNITYHCKNSIAY
                     MDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKA
                     VRLPIVDIAPYDIGGPDQEFGVDVGPVCFL"
     sig_peptide     103..174
                     /note="signal peptide (AA 1-24)"
     mat_peptide     174..4500
                     /note="pro-alpha-1 type 3 collagen"
     variation       2740
                     /note="c is u in variant clone; changing cuu (Leu) to uuu
                     (Phe)"
BASE COUNT     1327 a   1314 c   1544 g   1275 t
ORIGIN      
        1 cgggcccggt gctgaagggc agggaacaac ttgatggtgc tactttgaac tgcttttctt
       61 ttctcctttt tgcacaaaga gtctcatgtc tgatatttag acatgatgag ctttgtgcaa
      121 aaggggagct ggctacttct cgctctgctt catcccacta ttattttggc acaacaggaa
      181 gctgttgaag gaggatgttc ccatcttggt cagtcctatg cggatagaga tgtctggaag
      241 ccagaaccat gccaaatatg tgtctgtgac tcaggatccg ttctctgcga tgacataata
      301 tgtgacgatc aagaattaga ctgccccaac ccagaaattc catttggaga atgttgtgca
      361 gtttgcccac agcctccaac tgctcctact cgccctccta atggtcaagg acctcaaggc
      421 cccaagggag atccaggccc tcctggtatt cctgggagaa atggtgaccc tggtattcca
      481 ggacaaccag ggtcccctgg ttctcctggc ccccctggaa tctgtgaatc atgccctact
      541 ggtcctcaga actattctcc ccagtatgat tcatatgatg tcaagtctgg agtagcagta
      601 ggaggactcg caggctatcc tggaccagct ggccccccag gccctcccgg tccccctggt
      661 acatctggtc atcctggttc ccctggatct ccaggatacc aaggaccccc tggtgaacct
      721 gggcaagctg gtccttcagg ccctccagga cctcctggtg ctataggtcc atctggtcct
      781 gctggaaaag atggagaatc aggtagaccc ggacgacctg gagagcgagg attgcctgga
      841 cctccaggta tcaaaggtcc agctgggata cctggattcc ctggtatgaa aggacacaga
      901 ggcttcgatg gacgaaatgg agaaaagggt gaaacaggtg ctcctggatt aaagggtgaa
      961 aatggtcttc caggcgaaaa tggagctcct ggacccatgg gtccaagagg ggctcctggt
     1021 gagcgaggac ggccaggact tcctggggct gcaggtgctc ggggtaatga cggtgctcga
     1081 ggcagtgatg gtcaaccagg ccctcctggt cctcctggaa ctgccggatt ccctggatcc
     1141 cctggtgcta agggtgaagt tggacctgca gggtctcctg gttcaaatgg tgcccctgga
     1201 caaagaggag aacctggacc tcagggacac gctggtgctc aaggtcctcc tggccctcct
     1261 gggattaatg gtagtcctgg tggtaaaggc gaaatgggtc ccgctggcat tcctggagct
     1321 cctggactga tgggagcccg gggtcctcca ggaccagccg gtgctaatgg tgctcctgga
     1381 ctgcgaggtg gtgcaggtga gcctggtaag aatggtgcca aaggagagcc cggaccacgt
     1441 ggtgaacgcg gtgaggctgg tattccaggt gttccaggag ctaaaggcga agatggcaag
     1501 gatggatcac ctggagaacc tggtgcaaat gggcttccag gagctgcagg agaaaggggt
     1561 gcccctgggt tccgaggacc tgctggacca aatggcatcc caggagaaaa gggtcctgct
     1621 ggagagcgtg gtgctccagg ccctgcaggg cccagaggag ctgctggaga acctggcaga
     1681 gatggcgtcc ctggaggtcc aggaatgagg ggcatgcccg gaagtccagg aggaccagga
     1741 agtgatggga aaccagggcc tcccggaagt caaggagaaa gtggtcgacc aggtcctcct
     1801 gggccatctg gtccccgagg tcagcctggt gtcatgggct tccccggtcc taaaggaaat
     1861 gatggtgctc ctggtaagaa tggagaacga ggtggccctg gaggacctgg ccctcagggt
     1921 cctcctggaa agaatggtga aactggacct caaggacccc cagggcctac tgggcctggt
     1981 ggtgacaaag gagacacagg accccctggt ccacaaggat tacaaggctt gcctggtaca
     2041 ggtggtcctc caggagaaaa tggaaaacct ggggaaccag gtccaaaggg tgatgccggt
     2101 gcacctggag ctccaggagg caagggtgat gctggtgccc ctggtgaacg tggacctcct
     2161 ggattggcag gggccccagg acttagaggt ggagctggtc cccctggtcc cgaaggagga
     2221 aagggtgctg ctggtcctcc tgggccacct ggtgctgctg gtactcctgg tctgcaagga
     2281 atgcctggag aaagaggagg tcttggaagt cctggtccaa agggtgacaa gggtgaacca
     2341 ggcggcccag gtgctgatgg tgtcccaggg aaagatggcc caaggggtcc tactggtcct
     2401 attggtcctc ctggcccagc tggccagcct ggagataagg gtgaaggtgg tgcccccgga
     2461 cttccaggta tagctggacc tcgtggtagc cctggtgaga gaggtgaaac tggccctcca
     2521 ggacctgctg gtttccctgg tgctcctgga cagaatggtg aacctggtgg taaaggagaa
     2581 agaggggctc cgggtgagaa aggtgaagga ggccctcctg gagttgcagg accccctgga
     2641 ggttctggac ctgctggtcc tcctggtccc caaggtgtca aaggtgaacg tggcagtcct
     2701 ggtggacctg gtgctgctgg cttccctggt gctcgtggtc ttcctggtcc tcctggtagt
     2761 aatggtaacc caggaccccc aggtcccagc ggttctccag gcaaggatgg gcccccaggt
     2821 cctgcgggta acactggtgc tcctggcagc cctggagtgt ctggaccaaa aggtgatgct
     2881 ggccaaccag gagagaaggg atcgcctggt gcccagggcc caccaggagc tccaggccca
     2941 cttgggattg ctgggatcac tggagcacgg ggtcttgcag gaccaccagg catgccaggt
     3001 cctaggggaa gccctggccc tcagggtgtc aagggtgaaa gtgggaaacc aggagctaac
     3061 ggtctcagtg gagaacgtgg tccccctgga ccccagggtc ttcctggtct ggctggtaca
     3121 gctggtgaac ctggaagaga tggaaaccct ggatcagatg gtcttccagg ccgagatgga
     3181 tctcctggtg gcaagggtga tcgtggtgaa aatggctctc ctggtgcccc tggcgctcct
     3241 ggtcatccag gcccacctgg tcctgtcggt ccagctggaa agagtggtga cagaggagaa
     3301 agtggccctg ctggccctgc tggtgctccc ggtcctgctg gttcccgagg tgctcctggt
     3361 cctcaaggcc cacgtggtga caaaggtgaa acaggtgaac gtggagctgc tggcatcaaa
     3421 ggacatcgag gattccctgg taatccaggt gccccaggtt ctccaggccc tgctggtcag
     3481 cagggtgcaa tcggcagtcc aggacctgca ggccccagag gacctgttgg acccagtgga
     3541 cctcctggca aagatggaac cagtggacat ccaggtccca ttggaccacc agggcctcga
     3601 ggtaacagag gtgaaagagg atctgagggc tccccaggcc acccagggca accaggccct
     3661 cctggacctc ctggtgcccc tggtccttgc tgtggtggtg ttggagccgc tgccattgct
     3721 gggattggag gtgaaaaagc tggcggtttt gccccgtatt atggagatga accaatggat
     3781 ttcaaaatca acaccgatga gattatgact tcactcaagt ctgttaatgg acaaatagaa
     3841 agcctcatta gtcctgatgg ttctcgtaaa aaccccgcta gaaactgcag agacctgaaa
     3901 ttctgccatc ctgaactcaa gagtggagaa tactgggttg accctaacca aggatgcaaa
     3961 ttggatgcta tcaaggtatt ctgtaatatg gaaactgggg aaacatgcat aagtgccaat
     4021 cctttgaatg ttccacggaa acactggtgg acagattcta gtgctgagaa gaaacacgtt
     4081 tggtttggag agtccatgga tggtggtttt cagtttagct acggcaatcc tgaacttcct
     4141 gaagatgtcc ttgatgtgca gctggcattc cttcgacttc tctccagccg agcttcccag
     4201 aacatcacat atcactgcaa aaatagcatt gcatacatgg atcaggccag tggaaatgta
     4261 aagaaggccc tgaagctgat ggggtcaaat gaaggtgaat tcaaggctga aggaaatagc
     4321 aaattcacct acacagttct ggaggatggt tgcacgaaac acactgggga atggagcaaa
     4381 acagtctttg aatatcgaac acgcaaggct gtgagactac ctattgtaga tattgcaccc
     4441 tatgacattg gtggtcctga tcaagaattt ggtgtggacg ttggccctgt ttgcttttta
     4501 taaaccaaac tctatctgaa atcccaacaa aaaaaattta actccatatg tgttcctctt
     4561 gttctaatct tgtcaaccag tgcaagtgac cgacaaaatt ccagttattt atttccaaaa
     4621 tgtttggaaa cagtataatt tgacaaagaa aaatgatact tctctttttt tgctgttcca
     4681 ccaaatacaa ttcaaatgct ttttgtttta tttttttacc aattccaatt tcaaaatgtc
     4741 tcaatggtgc tataataaat aaacttcaac actctttatg ataacaacac tgtgttatat
     4801 tctttgaatc ctagcccatc tgcagagcaa tgactgtgct caccagtaaa agataacctt
     4861 tctttctgaa atagtcaaat acgaaattag aaaagccctc cctattttaa ctacctcaac
     4921 tggtcagaaa cacagattgt attctatgag tcccagaaga tgaaaaaaat tttatacgtt
     4981 gataaaactt ataaatttca ttgattaatc tcctggaaga ttggtttaaa aagaaaagtg
     5041 taatgcaaga atttaaagaa atatttttaa agccacaatt attttaatat tggatatcaa
     5101 ctgcttgtaa aggtgctcct cttttttctt gtcattgctg gtcaagatta ctaatatttg
     5161 ggaaggcttt aaagacgcat gttatggtgc taatgtactt tcacttttaa actctagatc
     5221 agaattgttg acttgcattc agaacataaa tgcacaaaat ctgtacatgt ctcccatcag
     5281 aaagattcat tggcatgcca cagggattct cctccttcat cctgtaaagg tcaacaataa
     5341 aaaccaaatt atggggctgc ttttgtcaca ctagcataga gaatgtgttg aaatttaact
     5401 ttgtaagctt gtatgtggtt gttgatcttt tttttcctta cagacaccca taataaaata
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X04701. Human mRNA for co...[gi:29538] Links  


LOCUS       HSC1R                   2386 bp    mRNA    linear   PRI 30-MAR-1995
DEFINITION  Human mRNA for complement component C1r.
ACCESSION   X04701
VERSION     X04701.1  GI:29538
KEYWORDS    complement protein C1r.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2386)
  AUTHORS   Journet,A. and Tosi,M.
  TITLE     Cloning and sequencing of full-length cDNA encoding the precursor
            of human complement component C1r
  JOURNAL   Biochem. J. 240 (3), 783-787 (1986)
  MEDLINE   87156625
FEATURES             Location/Qualifiers
     source          1..2386
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             52..2169
                     /note="precursor of C1r (AA -17 to 688)"
                     /codon_start=1
                     /protein_id="CAA28407.1"
                     /db_xref="GI:29539"
                     /db_xref="SWISS-PROT:P00736"
                     /translation="MWLLYLLVPALFCRAGGSIPIPQKLFGEVTSPLFPKPYPNNFET
                     TTVITVPTGYRVKLVFQQFDLEPSEGCFYDYVKISADKKSLGRFCGQLGSPLGNPPGK
                     KEFMSQGNKMLLTFHTDFSNEENGTIMFYKGFLAYYQAVDLDECASRSKLGEEDPQPQ
                     CQHLCHNYVGGYFCSCRPGYELQEDRHSCQAECSSELYTEASGYISSLEYPRSYPPDL
                     RCNYSIRVERGLTLHLKFLEPFDIDDHQQVHCPYDQLQIYANGKNIGEFCGKQRPPDL
                     DTSSNAVDLLFFTDESGDSRGWKLRYTTEIIKCPQPKTLDEFTIIQNLQPQYQFRDYF
                     IATCKQGYQLIEGNQVLHSFTAVCQDDGTWHRAMPRCKIKDCGQPRNLPNGDFRYTTT
                     MGVNTYKARIQYYCHEPYYKMQTRAGSRESEQGVYTCTAQGIWKNEQKGEKIPRCLPV
                     CGKPVNPVEQRQRIIGGQKAKMGNFPWQVFTNIHGRGGGALLGDRWILTAAHTLYPKE
                     HEAQSNASLDVFLGHTNVEELMKLGNHPIRRVSVHPDYRQDESYNFEGDIALLELENS
                     VTLGPNLLPICLPDNDTFYDLGLMGYVSGFGVMEEKIAHDLRFVRLPVANPQACENWL
                     RGKNRMDVFSQNMFCAGHPSLKQDACQGDSGGVFAVRDPNTDRWVATGIVSWGIGCSR
                     GYGFYTKVLNYVDWIKKEMEEED"
     sig_peptide     52..102
                     /note="signal peptide (AA -17 to -1)"
     mat_peptide     103..2166
                     /product="C1r zymogenic form (AA 1-688)"
     misc_feature    424..426
                     /note="N-linked glycosylation site"
     misc_feature    550..552
                     /note="N-linked glycosylation site"
     misc_feature    1440..1441
                     /note="zymogen cleavage site"
     misc_feature    1591..1593
                     /note="N-linked glycosylation site"
     misc_feature    1792..1794
                     /note="N-linked glycosylation site"
     misc_feature    2264..2269
                     /note="polyadenylation signal"
     polyA_site      2282
                     /note="polyadenylation site"
BASE COUNT      587 a    661 c    648 g    490 t
ORIGIN      
        1 tgcacgaaga cgctgtcggg agagcccagg attcaacacg ggccttgaga aatgtggctc
       61 ttgtacctcc tggtgccggc cctgttctgc agggcaggag gctccattcc catccctcag
      121 aagttatttg gggaggtgac ttcccctctg ttccccaagc cttaccccaa caactttgaa
      181 acaaccactg tgatcacagt ccccacggga tacagggtga agctcgtctt ccagcagttt
      241 gacctggagc cttctgaagg ctgcttctat gattatgtca agatctctgc tgataagaaa
      301 agcctgggga ggttctgtgg gcaactgggt tctccactgg gcaacccccc gggaaagaag
      361 gaatttatgt cccaagggaa caagatgctg ctgaccttcc acacagactt ctccaacgag
      421 gagaatggga ccatcatgtt ctacaagggc ttcctggcct actaccaagc tgtggacctt
      481 gatgaatgtg cttcccggag caaattaggg gaggaggatc cccagcccca gtgccagcac
      541 ctgtgtcaca actacgttgg aggctacttc tgttcctgcc gtccaggcta tgagcttcag
      601 gaagacaggc attcctgcca ggctgagtgc agcagcgagc tgtacacgga ggcatcaggc
      661 tacatctcca gcctggagta ccctcggtcc tacccccctg acctgcgctg caactacagc
      721 atccgggtgg agcggggcct caccctgcac ctcaagttcc tggagccttt tgatattgat
      781 gaccaccagc aagtacactg cccctatgac cagctacaga tctatgccaa cgggaagaac
      841 attggcgagt tctgtgggaa gcaaaggccc cccgacctcg acaccagcag caatgctgtg
      901 gatctgctgt tcttcacaga tgagtcgggg gacagccggg gctggaagct gcgctacacc
      961 accgagatca tcaagtgccc ccagcccaag accctagacg agttcaccat catccagaac
     1021 ctgcagcctc agtaccagtt ccgtgactac ttcattgcta cctgcaagca aggctaccag
     1081 ctcatagagg ggaaccaggt gctgcattcc ttcacagctg tctgccagga tgatggcacg
     1141 tggcatcgtg ccatgcccag atgcaagatc aaggactgtg ggcagccccg aaacctgcct
     1201 aatggtgact tccgttacac caccacaatg ggagtgaaca cctacaaggc ccgtatccag
     1261 tactactgcc atgagccata ttacaagatg cagaccagag ctggcagcag ggagtctgag
     1321 caaggggtgt acacctgcac agcacagggc atttggaaga atgaacagaa gggagagaag
     1381 attcctcggt gcttgccagt gtgtgggaag cccgtgaacc ccgtggaaca gaggcagcgc
     1441 atcatcggag ggcaaaaagc caagatgggc aacttcccct ggcaggtgtt caccaacatc
     1501 cacgggcgcg ggggcggggc cctgctgggc gaccgctgga tcctcacagc tgcccacacc
     1561 ctgtatccca aggaacacga agcgcaaagc aacgcctctt tggatgtgtt cctgggccac
     1621 acaaatgtgg aagagctcat gaagctagga aatcacccca tccgcagggt cagcgtccac
     1681 ccggactacc gtcaggatga gtcctacaat tttgaggggg acatcgccct gctggagctg
     1741 gaaaatagtg tcaccctggg tcccaacctc ctccccatct gcctccctga caacgatacc
     1801 ttctacgacc tgggcttgat gggctatgtc agtggcttcg gggtcatgga ggagaagatt
     1861 gctcatgacc tcaggtttgt ccgtctgccc gtagctaatc cacaggcctg tgagaactgg
     1921 ctccggggaa agaataggat ggatgtgttc tctcaaaaca tgttctgtgc tggacaccca
     1981 tctctaaagc aggacgcctg ccagggggat agtgggggcg tttttgcagt aagggacccg
     2041 aacactgatc gctgggtggc cacgggcatc gtgtcctggg gcatcgggtg cagcaggggc
     2101 tatggcttct acaccaaagt gctcaactac gtggactgga tcaagaaaga gatggaggag
     2161 gaggactgag cccagaattc actaggttcg aatccagaga gcagtgtgga aaaaaaaaaa
     2221 caaaaaacaa ctgaccagtt gttgataacc actaagagtc tctattaaaa ttactgatgc
     2281 agaaagaccg tgtgtgaaat tctctttcct gtagtcccat tgatgtactt tacctgaaac
     2341 aaccaaaggg cccctttctt tcttctgagg attgcagagg atatag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF078035. Homo sapiens tran...[gi:4322303] Links  


LOCUS       AF078035                3900 bp    mRNA    linear   PRI 14-APR-1999
DEFINITION  Homo sapiens translation initiation factor IF2 mRNA, complete cds.
ACCESSION   AF078035
VERSION     AF078035.1  GI:4322303
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3900)
  AUTHORS   Lee,J.H., Choi,S.K., Roll-Mecak,A., Burley,S.K. and Dever,T.E.
  TITLE     Universal conservation in translation initiation revealed by human
            and archaeal homologs of bacterial translation initiation factor
            IF2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 96 (8), 4342-4347 (1999)
  MEDLINE   99218282
   PUBMED   10200264
REFERENCE   2  (bases 1 to 3900)
  AUTHORS   Lee,J.H. and Dever,T.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-JUL-1998) LEGR, NICHD, NIH, 6 Center Dr., Bethesda,
            MD 20892-2716, USA
FEATURES             Location/Qualifiers
     source          1..3900
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="testis"
     CDS             118..3780
                     /codon_start=1
                     /product="translation initiation factor IF2"
                     /protein_id="AAD16006.1"
                     /db_xref="GI:4322304"
                     /translation="MGKKQKNKSEDSTKDDIDLDALAAEIEGAGAAKEQEPQKSKGKK
                     KKEKKKQDFDEDDILKELEELSLEAQGIKADRETVAVKPTENNEEEFTSKDKKKKGQK
                     GKKQSFDDNDSEELEDKDSKSKKTAKPKVEMYSGSDDDDDFNKLPKKAKGKAQKSNKK
                     WDGSEEDEDNSKKIKERSRMNSSGESGDESDEFLQSRKGQKKNQKNKPGPNIESGNED
                     DDASFKIKTVAQKKAEKKERERKKRDEEKAKLRKLKEREELETGKKDQSKQKESQRKF
                     EEETVKSKVTVDTGVIPASEEKAETPTAAEDDNEGDKKKKDKKKKKGEKEEKEKEKKK
                     GPSKATVKAMQEALAKLKEEEERQKREEEERIKRLEELEAKRKEEERLEQEKRERKKQ
                     KEKERKERLKKEGKLLTKSQREARARAEATLKLLQAQGVEVPSKDSLPKKRPIYEDKK
                     RKKIPQQLESKEVSESMELCAAVEVMEQGVPEKEETPPPVEPEEEEDTEDAGLDDWEA
                     MASDEETEKVEGNTVHIEVKENPEEEEEEEEEEEEDEESEVEEEEEGESEGSEGDEED
                     EKVSDEKDSGKTLDKKPSKEMSSDSEYDSDDDRTKEERAYDKAKRRIEKRRLEHSKNV
                     NTEKLRAPIICVLGHVDTGKTKILDKLRHTHVQDGEAGGITQQIWATNVPLEAINEQT
                     KMIKNFDRENVRIPGMLIIDTPGHESFSNLRNRGSSLCDIAILVVDIMHGLEPQTIES
                     INLLKSKKCPFIVALNKIDRLYDWKKSPDSDVAATLKKQKKNTKDEFEERAKAIIVEF
                     AQQGLNAALFYENKDPRTFVSLVPTSAHTGDGMGSLIYLLVELTQTMLSKRLAHCEEL
                     RAQVMEVKALPGMGTTIDVILINGRLKEGDTIIVPGVEGPIVTQIRGLLLPPPMKELR
                     VKNQYEKHKEVEAAQGVKILGKDLEKTLAGLPLLVAYKEDEIPVLKDELIHELKQTLN
                     AIKLEEKGVYVQASTLGSLEALLEFLKTSEVPYAGINIGPVHKKDVMKASVMLEHDPQ
                     YAVILAFDVRIERDAQEMADSLGVRIFSAEIIYHLFDAFTKYRQDYKKQKQEEFKHIA
                     VFPCKIKILPQYIFNSRDPIVMGVTVEAGQVKQGTPMCVPSKNFVDIGIVTSIEINHK
                     QVDVAKKGQEVCVKIEPIPGESPKMFGRHFEATDILVSKISRQSIDALKDWFRDEMQK
                     SDWQLIVELKKVFEII"
BASE COUNT     1528 a    579 c    983 g    810 t
ORIGIN      
        1 gcgagcggcg gcacgacgag gggaaaagag ctgagcgaga ccaaagtcag ccgggagaca
       61 gtgggtctgt gagagaccga atagaggggc tggggccacg agcgccattg acaagcaatg
      121 gggaagaaac agaaaaacaa gagcgaagac agcaccaagg atgacattga tcttgatgcc
      181 ttggctgcag aaatagaagg agctggtgct gccaaagaac aggagcctca aaagtcaaaa
      241 gggaaaaaga aaaaagagaa aaaaaagcag gactttgatg aagatgatat cctgaaagaa
      301 ctggaagaat tgtctttgga agctcaaggc atcaaagctg acagagaaac tgttgcagtg
      361 aagccaacag aaaacaatga agaggaattc acctcaaaag ataaaaaaaa gaaaggacag
      421 aagggcaaaa aacagagttt tgatgataat gatagcgaag aattggaaga taaagattca
      481 aaatcaaaaa agactgcaaa accgaaagtg gaaatgtact ctgggagtga tgatgatgat
      541 gattttaaca aacttcctaa aaaagctaaa gggaaagctc aaaaatcaaa taagaagtgg
      601 gatgggtcag aggaggatga ggataacagt aaaaaaatta aagagcgttc aagaatgaat
      661 tcttctggtg aaagtggtga tgaatcagat gaatttttgc aatctagaaa aggacagaaa
      721 aaaaatcaga aaaacaagcc aggtcctaac atagaaagtg ggaatgaaga tgatgacgcc
      781 tccttcaaaa ttaagacagt ggcccaaaag aaggcagaaa agaaggagcg cgagagaaaa
      841 aagcgagatg aagaaaaagc gaaactgcgg aagctgaaag aaagagaaga gttagaaaca
      901 ggtaaaaagg atcagagtaa acaaaaggaa tctcaaagga aatttgaaga agaaactgta
      961 aaatccaaag tgactgttga tactggagta attcctgcct ctgaagagaa agcagagact
     1021 cccacagctg cagaagatga caatgaagga gacaaaaaga agaaagataa gaagaaaaag
     1081 aaaggagaaa aggaagaaaa agagaaagag aagaaaaaag gacctagcaa agccactgtt
     1141 aaagctatgc aagaagctct ggctaagctt aaagaggaag aagaaagaca gaagagagaa
     1201 gaggaagaac gtataaaacg gcttgaagaa ttagaagcca agcgtaaaga agaggaacga
     1261 ttggaacaag aaaaaagaga aaggaaaaag caaaaagaaa aagaaagaaa agaacgcttg
     1321 aaaaaagaag ggaaactttt aactaaatcc cagagagaag ccagagccag agccgaagct
     1381 actcttaaac tgctacaagc tcagggtgtt gaagtgccat caaaagactc tttgccaaag
     1441 aagaggccaa tttatgaaga taaaaagagg aaaaaaatac cacagcagct agaaagtaaa
     1501 gaagtgtctg aatcaatgga attatgtgct gctgtagaag ttatggaaca aggagtacca
     1561 gaaaaggaag agacaccacc tcctgttgaa ccagaagaag aagaagatac tgaggatgct
     1621 ggattggatg attgggaagc tatggccagt gatgaggaga cagaaaaagt agaaggaaac
     1681 acagttcata tagaagtaaa agaaaaccct gaagaggagg aggaggagga agaagaggaa
     1741 gaagaagatg aagaaagtga agtagaggag gaagaggagg gagaaagtga aggcagtgaa
     1801 ggtgatgagg aagatgaaaa ggtgtcagat gagaaggatt cagggaagac attagataaa
     1861 aagccaagta aagaaatgag ctcagattct gaatatgact ctgatgatga tcggactaaa
     1921 gaagaaaggg cttatgacaa agcaaaacgg aggattgaga aacggcgact tgaacatagt
     1981 aaaaatgtaa acaccgaaaa gctaagagcc cctattatct gcgtacttgg gcatgtggac
     2041 acagggaaga caaaaattct agataagctc cgtcacacac atgtacaaga cggtgaagca
     2101 ggtggtatca cacaacaaat ttgggccacc aatgttcctc ttgaagctat taatgaacag
     2161 actaagatga ttaaaaattt tgatagagag aatgtacgga ttccaggaat gctaattatt
     2221 gatactcctg ggcatgaatc tttcagtaat ctgagaaata gaggaagctc tctttgtgac
     2281 attgccattt tagttgttga tattatgcat ggtttggagc cccagacaat tgagtctatc
     2341 aaccttctca aatctaaaaa atgtcccttc attgttgcac tcaataagat tgataggtta
     2401 tatgattgga aaaagagtcc tgactctgat gtggctgcta ctttaaagaa gcagaaaaag
     2461 aatacaaaag atgaatttga ggagcgagca aaggctatta ttgtagaatt tgcacagcag
     2521 ggtttgaatg ctgctttgtt ttatgagaat aaagatcccc gcacttttgt gtctttggta
     2581 cctacctctg cacatactgg tgatggcatg ggaagtctga tctaccttct tgtagagtta
     2641 actcagacca tgttgagcaa gagacttgca cactgtgaag agctgagagc acaggtgatg
     2701 gaggttaaag ctctcccggg gatgggcacc actatagatg tcattttgat caatgggcgt
     2761 ttgaaggaag gagatacaat cattgttcct ggagtagaag ggcccattgt aactcagatt
     2821 cgaggcctcc tgttacctcc tcctatgaag gaattacgag tgaagaacca gtatgaaaag
     2881 cataaagaag tagaagcagc tcagggggta aagattcttg gaaaagacct ggagaaaaca
     2941 ttggctggtt tacccctcct tgtggcttat aaagaagatg aaatccctgt tcttaaagat
     3001 gaattgatcc atgagttaaa gcagacacta aatgctatca aattagaaga aaaaggagtc
     3061 tatgtccagg catctacact gggttctttg gaagctctac tggaatttct gaaaacatca
     3121 gaagtgccct atgcaggaat taacattggc ccagtgcata aaaaagatgt tatgaaggct
     3181 tcagtgatgt tggaacatga ccctcagtat gcagtaattt tggccttcga tgtgagaatt
     3241 gaacgagatg cacaagaaat ggctgatagt ttaggagtta gaatttttag tgcagaaatt
     3301 atttatcatt tatttgatgc ctttacaaaa tatagacaag actacaagaa acagaaacaa
     3361 gaagaattta agcacatagc agtatttccc tgcaagataa aaatcctccc tcagtacatt
     3421 tttaattctc gagatccgat agtgatgggg gtgacggtgg aagcaggtca ggtgaaacag
     3481 gggacaccca tgtgtgtccc aagcaaaaat tttgttgaca tcggaatagt aacaagtatt
     3541 gaaataaacc ataaacaagt ggatgttgca aaaaaaggac aagaagtttg tgtaaaaata
     3601 gaacctatcc ctggtgagtc acccaaaatg tttggaagac attttgaagc tacagatatt
     3661 cttgttagta agatcagccg gcagtccatt gatgcactca aagactggtt cagagatgaa
     3721 atgcagaaga gtgactggca gcttattgtg gagctgaaga aagtatttga aatcatctaa
     3781 ttttttcaca tggagcagga actggagtaa atgcaatact gtgttgtaat atcccaacaa
     3841 aaatcagaca aaaaatggaa cagacgtatt tggacactga tggacttaag tatggaagga
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AJ006776. Homo sapiens mRNA...[gi:5002644] Links  


LOCUS       HSA6776                 4169 bp    mRNA    linear   PRI 23-JUL-1999
DEFINITION  Homo sapiens mRNA for translation initiation factor 2 (IF2 gene).
ACCESSION   AJ006776
VERSION     AJ006776.1  GI:5002644
KEYWORDS    IF2 gene; IF2 protein; translation initiation factor 2.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Wilson,S.A., Sieiro-Vazquez,C., Edwards,N.J., Iourin,O.,
            Byles,E.D., Kotsopoulou,E., Adamson,C.S., Kingsman,S.M.,
            Kingsman,A.J. and Martin-Rendon,E.
  TITLE     Cloning and characterization of hIF2, a human homologue of
            bacterial translation initiation factor 2, and its interaction with
            HIV-1 matrix
  JOURNAL   Biochem. J. 342 (Pt 1), 97-103 (1999)
  MEDLINE   99362399
REFERENCE   2  (bases 1 to 4169)
  AUTHORS   Wilson,S.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JUN-1998) Wilson S.A., Biochemistry, Oxford
            University, South Parks Road, Oxford, Oxon OX1 3QU, U.K
FEATURES             Location/Qualifiers
     source          1..4169
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..4169
                     /gene="IF2"
     CDS             141..3803
                     /gene="IF2"
                     /function="human homologue of bacterial translation
                     initiation factor 2"
                     /codon_start=1
                     /product="IF2 protein"
                     /protein_id="CAB44357.1"
                     /db_xref="GI:5002645"
                     /translation="MGKKQKNKSEDSTKDDIDLDALAAEIEGAGAAKEQEPQKSKGKK
                     KKEKKKQDFDEDDILKELEELSLEAQGIKADRETVAVKPTENNEEEFISKDKKKKGQK
                     GKKQSFDDNDSEELEDKDSKSKKTAKPKVEMYSGSDDDDDFNKLPKKAKGKAQKSNKK
                     WDGSEEDEDNSKKIKERSRINSSGESGDESDEFLQSRKGQKKNQKNKPGPNIESGNED
                     DDASFKIKTVAQKKAEKKERERKKRDEEKAKLRKLKEKEELETGKKDQSKQKESQRKF
                     EEETVKSKVTVDTGVIPASEEKAETPTAAEDDNEGDKKKKDKKKKKGEKEEKEKEKKK
                     GPSKATVKAMQEALAKLKEEEERQKREEEERIKRLEELEAKRKEEERLEQEKRERKKQ
                     KEKERKERLKKEGKLLTKSQREARARAEATLKLLQAQGVEVPSKDSLPKKRPIYEDKK
                     RKKIPQQLESKEVSESMELCAAVEVMEQGVPEKEETPPPVEPEEEEDTEDAGLDDWEA
                     MASDEETEKVEGNTVHIEVKENPEEEEEEEEEEEEDEESEEEEEEEGESEGSEGDEED
                     EKVSDEKDSGKTLDKKPSKEMSSDSEYDSDDDRTKEERAYDKAKRRIEKRRLEHSKNV
                     NTEKLRAPIICVLGHVDTGKTKILDKLRHTHVQDGEAGGITQQIGATNVPLEAINEQT
                     KMIKNFDRENVRIPGMLIIDTPGHESFSNLRNRGSSLCDIAILVVDIMHGLEPQTIES
                     INLLKSKKCPFIVALNKIDRLYDWKKSPDSDVAATLKKQKKNTKDEFEERAKAIIVEF
                     AQQGLNAALFYENKDPRTFVSLVPTSAHTGDGMGSLIYLLVELTQTMLSKRLAHCEEL
                     RAQVMEVKALPGMGTTIDVILINGRLKEGDTIIVPGVKGPIVTQIRGLLLPPPMKELR
                     VKNQYEKHKEVEAAQGVKILGKDLEKTLAGLPLLVAYKEDEIPVLKDELIHELKQTLN
                     AIKLEEKGVYVQASTLGSLEALLEFLKTSEVPYAGINIGPVHKKDVMKASVMLEHDPQ
                     YAVILAFDVRIERDAQEMADSLGVRIFSAEIIYHLFDAFTKYRQDYKKQKQEEFKHIA
                     VFPCKIKILPQYIFNSRDPIVMGVTVEAGQVKQGTPMCVPSKNFVDIGIVTSIEINHK
                     QVDVAKKGQEVCVKIEPIPGESPKMFGRHFEATDILVSKISRQSIDALKDWFRDEMQK
                     SDWQLIVELKKVFEII"
BASE COUNT     1601 a    641 c   1031 g    896 t
ORIGIN      
        1 cgcgggtctg tggagagccg ggtgcgagcg gcggcagcac gaggggaaaa gagctgagcg
       61 gagaccaaag tcagccggga gacagtgggt ctgtgagaga ccgaatagag gggctggggc
      121 acgagcgcca ttgacaagca atggggaaga aacagaaaaa caagagcgaa gacagcacca
      181 aggatgacat tgatcttgat gccttggctg cagaaataga aggagctggt gctgccaaag
      241 aacaggagcc tcaaaagtca aaagggaaaa agaaaaaaga gaaaaaaaag caggactttg
      301 atgaagatga tatcctgaaa gaactggaag aattgtcttt ggaagctcaa ggcatcaaag
      361 ctgacagaga aactgttgca gtgaagccaa cagaaaacaa tgaagaggaa ttcatctcaa
      421 aagataaaaa aaagaaagga cagaagggca aaaaacagag ttttgatgat aatgatagcg
      481 aagaattgga agataaagat tcaaaatcaa aaaagactgc aaaaccgaaa gtggaaatgt
      541 actctgggag tgatgatgat gatgatttta acaaacttcc taaaaaagct aaagggaaag
      601 ctcaaaaatc aaataagaag tgggatgggt cagaggagga tgaggataac agtaaaaaaa
      661 ttaaagagcg ttcaagaata aattcttctg gtgaaagtgg tgatgaatca gatgaatttt
      721 tgcaatctag aaaaggacag aaaaaaaatc agaaaaacaa gccaggtcct aacatagaaa
      781 gtgggaatga agatgatgac gcctccttca aaattaagac agtggcccaa aagaaggcag
      841 aaaagaagga gcgcgagaga aaaaagcgag atgaagaaaa agcgaaactg cggaagctga
      901 aagaaaaaga agagttagaa acaggtaaaa aggatcagag taaacaaaag gaatctcaaa
      961 ggaaatttga agaagaaact gtaaaatcca aagtgactgt tgatactgga gtaattcctg
     1021 cctctgaaga gaaagcagag actcccacag ctgcagaaga tgacaatgaa ggagacaaaa
     1081 agaagaaaga taagaagaaa aagaaaggag aaaaggaaga aaaagagaaa gagaagaaaa
     1141 aaggacctag caaagccact gttaaagcta tgcaagaagc tctggctaag cttaaagagg
     1201 aagaagaaag acagaagaga gaagaggaag aacgtataaa acggcttgaa gaattagaag
     1261 ccaagcgtaa agaagaggaa cgattggaac aagaaaaaag agaaaggaaa aagcaaaaag
     1321 aaaaagaaag aaaagaacgc ttgaaaaaag aagggaaact tttaactaaa tcccagagag
     1381 aagccagagc cagagccgaa gctactctta aactgctaca agctcagggt gttgaagtgc
     1441 catcaaaaga ctctttgcca aagaagaggc caatttatga agataaaaag aggaaaaaaa
     1501 taccacagca gctagaaagt aaagaagtgt ctgaatcaat ggaattatgt gctgctgtag
     1561 aagttatgga acaaggagta ccagaaaagg aagagacacc acctcctgtt gaaccagaag
     1621 aagaagaaga tactgaggat gctggattgg atgattggga agctatggcc agtgatgagg
     1681 agacagaaaa agtagaagga aacacagttc atatagaagt aaaagaaaac cctgaagagg
     1741 aggaggagga ggaagaagag gaagaagaag atgaagaaag tgaagaagag gaggaagagg
     1801 agggagaaag tgaaggcagt gaaggtgatg aggaagatga aaaggtgtca gatgagaagg
     1861 attcagggaa gacattagat aaaaagccaa gtaaagaaat gagctcagat tctgaatatg
     1921 actctgatga tgatcggact aaagaagaaa gggcttatga caaagcaaaa cggaggattg
     1981 agaaacggcg acttgaacat agtaaaaatg taaacaccga aaagctaaga gcccctatta
     2041 tctgcgtact tgggcatgtg gacacaggga agacaaaaat tctagataag ctccgtcaca
     2101 cacatgtaca agacggtgaa gcaggtggta tcacacaaca aattggggcc accaatgttc
     2161 ctcttgaagc tattaatgaa cagactaaga tgattaaaaa ttttgataga gagaatgtac
     2221 ggattccagg aatgctaatt attgatactc ctgggcatga atctttcagt aatctgagaa
     2281 atagaggaag ctctctttgt gacattgcca ttttagttgt tgatattatg catggtttgg
     2341 agccccagac aattgagtct atcaaccttc tcaaatctaa aaaatgtccc ttcattgttg
     2401 cactcaataa gattgatagg ttatatgatt ggaaaaagag tcctgactct gatgtggctg
     2461 ctactttaaa gaagcagaaa aagaatacaa aagatgaatt tgaggagcga gcaaaggcta
     2521 ttattgtaga atttgcacag cagggtttga atgctgcttt gttttatgag aataaagatc
     2581 cccgcacttt tgtgtctttg gtacctacct ctgcacatac tggtgatggc atgggaagtc
     2641 tgatctacct tcttgtagag ttaactcaga ccatgttgag caagagactt gcacactgtg
     2701 aagagctgag agcacaggtg atggaggtta aagctctccc ggggatgggc accactatag
     2761 atgtcatctt gatcaatggg cgtttgaagg aaggagatac aatcattgtt cctggagtaa
     2821 aagggcccat tgtaactcag attcgaggcc tcctgttacc tcctcctatg aaggaattac
     2881 gagtgaagaa ccagtatgaa aagcataaag aagtagaagc agctcagggg gtaaagattc
     2941 ttggaaaaga cctggagaaa acattggctg gtttacccct ccttgtggct tataaagaag
     3001 atgaaatccc tgttcttaaa gatgaattga tccatgagtt aaagcagaca ctaaatgcta
     3061 tcaaattaga agaaaaagga gtctatgtcc aggcatctac actgggttct ttggaagctc
     3121 tactggaatt tctgaaaaca tcagaagtgc cctatgcagg aattaacatt ggcccagtgc
     3181 ataaaaaaga tgttatgaag gcttcagtga tgttggaaca tgaccctcag tatgcagtaa
     3241 ttttggcctt cgatgtgaga attgaacgag atgcacaaga aatggctgat agtttaggag
     3301 ttagaatttt tagtgcagaa attatttatc atttatttga tgcctttaca aaatatagac
     3361 aagactacaa gaaacagaaa caagaagaat ttaagcacat agcagtattt ccctgcaaga
     3421 taaaaatcct ccctcagtac atttttaatt ctcgagatcc gatagtgatg ggggtgacgg
     3481 tggaagcagg tcaggtgaaa caggggacac ccatgtgtgt cccaagcaaa aattttgttg
     3541 acatcggaat agtaacaagt attgaaataa accataaaca agtggatgtt gcaaaaaaag
     3601 gacaagaagt ttgtgtaaaa atagaaccta tccctggtga gtcacccaaa atgtttggaa
     3661 gacattttga agctacagat attcttgtta gtaagatcag ccggcagtcc attgatgcac
     3721 tcaaagactg gttcagagat gaaatgcaga agagtgactg gcagcttatt gtggagctga
     3781 agaaagtatt tgaaatcatc taattttttc acatggagca ggaactggag taaatgcaat
     3841 actgtgttgt aatatcccaa caaaaatcag acaaaaaatg gaacagacgt atttggacac
     3901 tgatggactt aagtatggaa ggaagaaaaa taggtgtata aaatgttttc catgagaaac
     3961 caagaaactt acactggttt gacagtggtc agttacatgt ccccacagtt ccaatgtgcc
     4021 tgttcactca cctctccctt ccccaaccct tctctacttg gctgctgttt taaagtttgc
     4081 ccttccccaa atttggattt ttattacaga gtctaaagct ctttcgattt tatactgatt
     4141 aaatcagtac tgcagtattt gattaacca
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC022323. Homo sapiens, pop...[gi:18490615] Links  


LOCUS       BC022323                1369 bp    mRNA    linear   PRI 04-FEB-2002
DEFINITION  Homo sapiens, popeye protein 3, clone MGC:22671 IMAGE:4293961,
            mRNA, complete cds.
ACCESSION   BC022323
VERSION     BC022323.1  GI:18490615
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1369)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-FEB-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CLONTECH
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 36 Row: c Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 11641280.
FEATURES             Location/Qualifiers
     source          1..1369
                     /organism="Homo sapiens"
                     /db_xref="LocusID:64208"
                     /db_xref="taxon:9606"
                     /clone="MGC:22671 IMAGE:4293961"
                     /tissue_type="Skeletal Muscle"
                     /clone_lib="NIH_MGC_81"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             280..1155
                     /codon_start=1
                     /product="popeye protein 3"
                     /protein_id="AAH22323.1"
                     /db_xref="GI:18490616"
                     /translation="MERNSSLWKNLIDEHPVCTTWKQEAEGAIYHLASILFVVGFMGG
                     SGFFGLLYVFSLLGLGFLCSAVWAWVDVCAADIFSWNFVLFVICFMQFVHIAYQVRSI
                     TFAREFQVLYSSLFQPLGISLPVFRTIALSSEVVTLEKEHCYAMQGKTSIDKLSLLVS
                     GRIRVTVDGEFLHYIFPLQFLDSPEWDSLRPTEEGIFQVTLTAETDCRYVSWRRKKLY
                     LLFAQHRYISRLFSVLIGSDIADKLYALNDRVYIGKRYHYDIRLPNFYQMSTPEIRRS
                     PLTQHFQNSRRYCDK"
BASE COUNT      409 a    280 c    276 g    404 t
ORIGIN      
        1 aagctccggg cagggctggg aaggaaagga aataccaaaa tatttgcaga ctggatccaa
       61 atcaggagcc cagatgaact taaagaagcg agcatgggat tcttagtttt tcaagatccg
      121 tacacacgaa gcctttaatc agcatcaact ccagtgtccg ttttctctgg ttttgtgaag
      181 actgcacaaa actctcatga tggagaacca aaggacttag ttacctgttt cagtgtcatc
      241 taaagtcaac tgaaaagtga agcaggcagt aatacagcca tggaaagaaa ttcaagttta
      301 tggaagaacc taatagatga acacccagtc tgcacaacct ggaagcaaga ggccgaagga
      361 gccatttatc atcttgccag tattttattt gtagtaggtt tcatgggtgg cagtggattc
      421 ttcgggctcc tttatgtctt cagtttgctg gggttgggtt ttctctgttc tgctgtctgg
      481 gcttgggtag atgtctgtgc agctgacata ttttcctgga attttgtact gtttgtcatc
      541 tgcttcatgc aatttgttca tattgcatat caagttcgca gcataacctt tgcccgagaa
      601 ttccaagtgt tgtacagctc ccttttccag cccctgggga tctctttgcc tgtcttcaga
      661 acgattgctt tgagctctga agtggttact ttggaaaagg aacactgtta tgccatgcag
      721 gggaaaactt ccattgataa actctccttg cttgtttcag gaaggatcag agtgacagtt
      781 gatggcgaat ttctgcatta cattttcccc cttcagttcc tggattctcc tgagtgggat
      841 tcactgagac ccacagagga aggcattttt caggtaaccc tcactgcaga aactgattgt
      901 cgatatgtgt cttggaggag aaagaaatta tatctgctct ttgctcagca tcgctacatc
      961 tcccgccttt tttcagtgct aattggcagt gacattgcag ataaactcta tgccttgaat
     1021 gacagggtat atataggaaa aagatatcac tatgatattc ggctacccaa cttctatcaa
     1081 atgtcaactc cagaaatacg cagatcaccc ctgacacaac attttcagaa ttccagacga
     1141 tactgtgata aatgacatca aagtctgaaa tttataagta taaaaaaaga ctctctcttc
     1201 atcattcccc agtgaaatag caaaatacaa aaaaagagct ccctaatgtt tttataaatc
     1261 aaattcagaa gcgagatgcc attgccaact gttttattcc tttcaacaac tgcattgtga
     1321 ataaacttta caaatttttc ttgtaaaaaa aaaaaaaaaa aaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF136273. Homo sapiens cath...[gi:6467379] Links  


LOCUS       AF136273                1500 bp    mRNA    linear   PRI 28-APR-2000
DEFINITION  Homo sapiens cathepsin Z precursor (CTSZ) mRNA, complete cds.
ACCESSION   AF136273
VERSION     AF136273.1  GI:6467379
KEYWORDS    .
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1500)
  AUTHORS   Deussing,J., von Olshausen,I. and Peters,C.
  TITLE     Murine and human cathepsin Z: cDNA-cloning, characterization of the
            genes and chromosomal localization
  JOURNAL   Biochim. Biophys. Acta 1491 (1-3), 93-106 (2000)
  MEDLINE   20225452
   PUBMED   10760573
REFERENCE   2  (bases 1 to 1500)
  AUTHORS   Deussing,J., von Olshausen,I. and Peters,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAR-1999) Institut fuer Molekulare Medizin und
            Zellforschung, Albert-Ludwigs-University of Freiburg, Hugstetter
            Str. 55, Freiburg 79106, Germany
FEATURES             Location/Qualifiers
     source          1..1500
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="20"
                     /map="D20S171"
                     /tissue_type="colon tumor"
     gene            1..1500
                     /gene="CTSZ"
     5'UTR           1..125
                     /gene="CTSZ"
     CDS             126..1037
                     /gene="CTSZ"
                     /note="cysteine protease of the papain family"
                     /codon_start=1
                     /product="cathepsin Z precursor"
                     /protein_id="AAF13145.1"
                     /db_xref="GI:6467380"
                     /translation="MARRGPGWRPLLLLVLLAGAAQGGLYFRRGQTCYRPLRGDGLAP
                     LGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMA
                     DRINIKRKGAWPSTLLSVQNVIDCGNAGSCEGGNDLSVWDYAHQHGIPDETCNNYQAK
                     DQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMA
                     TERLANYTGGIYAEYQDTTYINHVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIV
                     TSTYKDGKGARYNLAIEEHCTFGDPIV"
     sig_peptide     126..194
                     /gene="CTSZ"
     mat_peptide     309..1034
                     /gene="CTSZ"
                     /product="cathepsin Z"
     misc_feature    195..1034
                     /gene="CTSZ"
                     /note="encodes cathepsin Z preproprotein"
     3'UTR           1038..1500
                     /gene="CTSZ"
     polyA_signal    1476..1481
                     /gene="CTSZ"
                     /note="putative"
BASE COUNT      346 a    383 c    491 g    280 t
ORIGIN      
        1 ggggtcggcc gggtgctagg ccggggccga ggccgaggcc ggggcgggat ccagagcggg
       61 agccggcgcg ggatctggga ctcggagcgg gatccggagc gggacccagg agccggcgcg
      121 gggccatggc gaggcgcggg ccagggtggc ggccgcttct gctgctcgtg ctgctggcgg
      181 gcgcggcgca gggcggcctc tacttccgcc ggggacagac ctgctaccgg cctctgcggg
      241 gggacgggct ggctccgctg gggcgcagca catacccccg gcctcatgag tacctgtccc
      301 cagcggatct gcccaagagc tgggactggc gcaatgtgga tggtgtcaac tatgccagca
      361 tcacccggaa ccagcacatc ccccaatact gcggctcctg ctgggcccac gccagcacca
      421 gcgctatggc ggatcggatc aacatcaaga ggaagggagc gtggccctcc accctcctgt
      481 ccgtgcagaa cgtcatcgac tgcggtaacg ctggctcctg tgaagggggt aatgacctgt
      541 ccgtgtggga ctacgcccac cagcacggca tccctgacga gacctgcaac aactaccagg
      601 ccaaggacca ggagtgtgac aagtttaacc aatgtgggac atgcaatgaa ttcaaagagt
      661 gccacgccat ccggaactac accctctgga gagtgggaga ctacggctcc ctctctggga
      721 gggagaagat gatggcagaa atctacgcaa atggtcccat cagctgtgga ataatggcaa
      781 cagaaagact ggctaactac accggaggca tctatgccga ataccaggac accacatata
      841 taaaccatgt cgtttccgtg gctgggtggg gcatcagtga tgggactgag tactggattg
      901 tccggaattc atggggtgaa ccatggggcg agagaggctg gctgaggatc gtgaccagca
      961 cctataagga tgggaagggc gccagataca accttgccat cgaggagcac tgtacatttg
     1021 gggaccccat cgtttaaggc catgtcacta gaagcgcagt ttaagaaaag gcatggtgac
     1081 ccatgaccag aggggatcct atggttatgt gtgccaggct ggctggcagg aactggggtg
     1141 gctatcaata ttggatggcg aggacagcgt ggtactggct gcgagtgttc ctgagagttg
     1201 aaagtgggat gacttatgac acttgcacag catggctctg cctcacaatg atgcagtcag
     1261 ccacctggtg aagaagtgac ctgcaacaca ggaaacgatg ggacctcagt cttcttcagc
     1321 agaggacttg atattttgta tttggcaact gtgggcaata atatggcatt taagaggtga
     1381 aagagttcag acttatcacc attcttatgt cactttagaa tcaagggtgg gggagggagg
     1441 gagggagttg gcagtttcaa atcgcccaac tgataaataa agtatctggc tctgcacgag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X79535. H.sapiens mRNA fo...[gi:496886] Links  


LOCUS       HSBT278                 1594 bp    mRNA    linear   PRI 03-JUN-1994
DEFINITION  H.sapiens mRNA for beta tubulin, clone nuk_278.
ACCESSION   X79535
VERSION     X79535.1  GI:496886
KEYWORDS    beta tubulin.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Leffers,H., Wiemann,S. and Ansorge,W.
  TITLE     Cloning and vaccinia virus expression of a cDNA containing the
            complete coding seqeunce of human beta tubulin mRNA
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1594)
  AUTHORS   Leffers,H.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-JUN-1994) H. Leffers, Inst. of Medical Research
            Biochemistry & Danish Centre for Human Genome Research, Ole Worms
            Alle 170, Aarhus Univ., 8000 Aarhus C, DENMARK
FEATURES             Location/Qualifiers
     source          1..1594
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="nuk_278"
                     /cell_line="non fractionated non cultures normal
                     keratinocytes"
                     /cell_type="keratinocytes"
                     /tissue_type="skin"
                     /clone_lib="lambda ZapII"
     CDS             64..1401
                     /codon_start=1
                     /product="beta tubulin"
                     /protein_id="CAA56071.1"
                     /db_xref="GI:496887"
                     /db_xref="SPTREMBL:Q13885"
                     /translation="MREIVHIQAGQCGNQIGAKFWEVISDEHGIDPTGSYHGDSDLQL
                     ERINVYYNEAAGNKYVPRAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWA
                     KGHYTEGAELVDSVLDVVRKESESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYP
                     DRIMNTFSVMPSPKVSDTVVEPYNATLSVHQLVENTDETYSIDNEALYDICFRTLKLT
                     TPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSR
                     GSQQYRALTVPELTQQMFDSKNMMAACDPRHGRYLTVAAIFRGRMSMKEVDEQMLNVQ
                     NKNSSYFVEWIPNNVKTAVCDIPPRGLKMSATFIGNSTAIQELFKRISEQFTAMFRRK
                     AFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATADEQGEFEEEEGEDEA"
     polyA_signal    1576..1581
BASE COUNT      357 a    485 c    462 g    290 t
ORIGIN      
        1 gcccgccggt ccacgccgcg caccgctccg agggccagcg ccacccgctc cgcagccggc
       61 accatgcgcg agatcgtgca catccaggcg ggccagtgcg gcaaccagat cggcgccaag
      121 ttttgggagg tcatcagcga tgagcatggg atcgacccca caggcagtta ccatggagac
      181 agtgacttgc agctggagag aatcaacgtg tactacaatg aggctgctgg taacaaatat
      241 gtacctcggg ccatcctggt ggatctggag cctggcacca tggactctgt caggtctgga
      301 cccttcggcc agatcttcag accagacaac ttcgtgttcg gccagagtgg agccgggaat
      361 aactgggcca agggccacta cacagaggga gccgagctgg tcgactcggt cctggatgtg
      421 gtgaggaagg agtcagagag ctgtgactgt ctccagggct tccagctgac ccactctctg
      481 gggggcggca cggggtccgg gatgggcacc ctgctcatca gcaagatccg ggaagagtac
      541 ccagaccgca tcatgaacac cttcagcgtc atgccctcac ccaaggtgtc agacacggtg
      601 gtggagccct acaacgccac cctctcggtc caccagctgg tggaaaacac agatgaaacc
      661 tactccattg ataacgaggc cctgtatgac atctgcttcc gcaccctgaa gctgaccacc
      721 cccacctacg gggacctcaa ccacctggtg tcggccacca tgagcggggt caccacctgc
      781 ctgcgcttcc cgggccagct gaacgcagac ctgcgcaagc tggcggtgaa catggtgccc
      841 ttccctcgcc tgcacttctt catgcccggc ttcgcgcccc tgaccagccg gggcagccag
      901 cagtaccggg cgctcacggt gcccgagctc acccagcaga tgttcgactc caagaacatg
      961 atggccgcct gcgacccgcg ccacggccgc tacctgacgg tggctgccat cttccggggc
     1021 cgcatgtcca tgaaggaggt ggacgagcag atgctcaacg tgcagaacaa gaacagcagc
     1081 tacttcgtgg agtggatccc caacaacgtg aagacggccg tgtgcgacat cccgccccgc
     1141 ggcctgaaga tgtcggccac cttcatcggc aacagcacgg ccatccagga gctgttcaag
     1201 cgcatctccg agcagttcac ggccatgttc cggcgcaagg ccttcctgca ctggtacacg
     1261 ggcgagggca tggacgagat ggagttcacc gaggccgaga gcaacatgaa cgacctggtg
     1321 tccgagtacc agcagtacca ggacgccacg gccgacgaac aaggggagtt cgaggaggag
     1381 gagggcgagg acgaggctta aaaacttctc agatcaatcg tgcatcctta gtgaacttct
     1441 gttgtcctca agcatggtct ttctacttgt aaactatggt gctcagtttt gcctctgtta
     1501 gaaattcaca ctgttgatgt aatgatgtgg aactcctcta aaaattacag tattgtctgt
     1561 gaaggtatct atactaataa aaaagcatgt gtag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M22488. Human bone morpho...[gi:179499] Links  


LOCUS       HUMBMP1A                2487 bp    mRNA    linear   PRI 31-OCT-1994
DEFINITION  Human bone morphogenetic protein 1 (BMP-1) mRNA.
ACCESSION   M22488
VERSION     M22488.1  GI:179499
KEYWORDS    bone morphogenetic protein.
SOURCE      Human osteosarcoma cell line U-2 OS, cDNA to mRNA, clone
            lambda-U2OS1-1.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J.,
            Kriz,R.W., Hewick,R.M. and Wang,E.A.
  TITLE     Novel regulators of bone formation: molecular clones and activities
  JOURNAL   Science 242 (4885), 1528-1534 (1988)
  MEDLINE   89072730
   PUBMED   3201241
REFERENCE   2  (bases 1 to 2487)
  AUTHORS   Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J.,
            Kriz,R.W., Hewick,R.M. and Wang,E.A.
  JOURNAL   Unpublished (1989)
COMMENT     [1]  sites.
            Draft entry and computer readable copy of sequence [1] kindly
            submitted by R.W. Kriz 10-FEB-1989.
FEATURES             Location/Qualifiers
     source          1..2487
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="8"
     gene            1..2487
                     /gene="BMP1"
     CDS             30..2222
                     /gene="BMP1"
                     /note="bone morphogenetic protein 1"
                     /codon_start=1
                     /protein_id="AAA51833.1"
                     /db_xref="GI:179500"
                     /db_xref="GDB:G00-125-203"
                     /translation="MPGVARLPLLLGLLLLPRPGRPLDLADYTYDLAEEDDSEPLNYK
                     DPCKAAAFLGDIALDEEDLRAFQVQQAVDLRRHTARKSSIKAAVPGNTSTPSCQSTNG
                     QPQRGACGRWRGRSRSRRAATSRPERVWPDGVIPFVIGGNFTGSQRAVFRQAMRHWEK
                     HTCVTFLERTDEDSYIVFTYRPCGCCSYVGRRGGGPQAISIGKNCDKFGIVVHELGHV
                     VGFWHEHTRPDRDRHVSIVRENIQPGQEYNFLKMEPQEVESLGETYDFDSIMHYARNT
                     FSRGIFLDTIVPKYEVNGVKPPIGQRTRLSKGDIAQARKLYKCPACGETLQDSTGNFS
                     SPEYPNGYSAHMHCVWRISVTPGEKIILNFTSLDLYRSRLCWYDYVEVRDGFWRKAPL
                     RGRFCGSKLPEPIVSTDSRLWVEFRSSSNWVGKGFFAVYEAICGGDVKKDYGHIQSPN
                     YPDDYRPSKVCIWRIQVSEGFHVGLTFQSFEIERHDSCAYDYLEVRDGHSESSTLIGR
                     YCGYEKPDDIKSTSSRLWLKFVSDGSINKAGFAVNFFKEVDECSRPNRGGCEQRCLNT
                     LGSYKCSCDPGYELAPDKRRCEAACGGFLTKLNGSITSPGWPKEYPPNKNCIWQLVAP
                     TQYRISLQFDFFETEGNDVCKYDFVEVRSGLTADSKLHGKFCGSEKPEVITSQYNNMR
                     VEFKSDNTVSKKGFKAHFFSEKRPALQPPRGRPHQLKFRVQKRNRTPQ"
BASE COUNT      503 a    804 c    707 g    473 t
ORIGIN      
        1 gccgcttccc tcgccgccgc cccgccagca tgcccggcgt ggcccgcctg ccgctgctgc
       61 tcgggctgct gctgctcccg cgtcccggcc ggccgctgga cttggccgac tacacctatg
      121 acctggcgga ggaggacgac tcggagcccc tcaactacaa agacccctgc aaggcggctg
      181 cctttcttgg ggacattgcc ctggacgaag aggacctgag ggccttccag gtacagcagg
      241 ctgtggatct cagacggcac acagctcgta agtcctccat caaagctgca gttccaggaa
      301 acacttctac ccccagctgc cagagcacca acgggcagcc tcagagggga gcctgtggga
      361 gatggagagg tagatcccgt agccggcggg cggcgacgtc ccgaccagag cgtgtgtggc
      421 ccgatggggt catccccttt gtcattgggg gaaacttcac tggtagccag agggcagtct
      481 tccggcaggc catgaggcac tgggagaagc acacctgtgt caccttcctg gagcgcactg
      541 acgaggacag ctatattgtg ttcacctatc gaccttgcgg gtgctgctcc tacgtgggtc
      601 gccgcggcgg gggcccccag gccatctcca tcggcaagaa ctgtgacaag ttcggcattg
      661 tggtccacga gctgggccac gtcgtcggct tctggcacga acacactcgg ccagaccggg
      721 accgccacgt ttccatcgtt cgtgagaaca tccagccagg gcaggagtat aacttcctga
      781 agatggagcc tcaggaggtg gagtccctgg gggagaccta tgacttcgac agcatcatgc
      841 attacgctcg gaacacattc tccaggggca tcttcctgga taccattgtc cccaagtatg
      901 aggtgaacgg ggtgaaacct cccattggcc aaaggacacg gctcagcaag ggggacattg
      961 cccaagcccg caagctttac aagtgcccag cctgtggaga gaccctgcaa gacagcacag
     1021 gcaacttctc ctcccctgaa taccccaatg gctactctgc tcacatgcac tgcgtgtggc
     1081 gcatctctgt cacacccggg gagaagatca tcctgaactt cacgtccctg gacctgtacc
     1141 gcagccgcct gtgctggtac gactatgtgg aggtccgaga tggcttctgg aggaaggcgc
     1201 ccctccgagg ccgcttctgc gggtccaaac tccctgagcc tatcgtctcc actgacagcc
     1261 gcctctgggt tgaattccgc agcagcagca attgggttgg aaagggcttc tttgcagtct
     1321 acgaagccat ctgcgggggt gatgtgaaaa aggactatgg ccacattcaa tcgcccaact
     1381 acccagacga ttaccggccc agcaaagtct gcatctggcg gatccaggtg tctgagggct
     1441 tccacgtggg cctcacattc cagtcctttg agattgagcg ccacgacagc tgtgcctacg
     1501 actatctgga ggtgcgcgac gggcacagtg agagcagcac cctcatcggg cgctactgtg
     1561 gctatgagaa gcctgatgac atcaagagca cgtccagccg cctctggctc aagttcgtct
     1621 ctgacgggtc cattaacaaa gcgggctttg ccgtcaactt tttcaaagag gtggacgagt
     1681 gctctcggcc caaccgcggg ggctgtgagc agcggtgcct caacaccctg ggcagctaca
     1741 agtgcagctg tgaccccggg tacgagctgg ccccagacaa gcgccgctgt gaggctgctt
     1801 gtggcggatt cctcaccaag ctcaacggct ccatcaccag cccgggctgg cccaaggagt
     1861 acccccccaa caagaactgc atctggcagc tggtggcccc cacccagtac cgcatctccc
     1921 tgcagtttga cttctttgag acagagggca atgatgtgtg caagtacgac ttcgtggagg
     1981 tgcgcagtgg actcacagct gactccaagc tgcatggcaa gttctgtggt tctgagaagc
     2041 ccgaggtcat cacctcccag tacaacaaca tgcgcgtgga gttcaagtcc gacaacaccg
     2101 tgtccaaaaa gggcttcaag gcccacttct tctcagaaaa gaggccagct ctgcagcccc
     2161 ctcggggacg cccccaccag ctcaaattcc gagtgcagaa aagaaaccgg accccccagt
     2221 gaggcctgcc aggcctcccg gaccccttgt tactcaggaa cctcaccttg gacggaatgg
     2281 gatgggggct tcggtgccca ccaacccccc acctccactc tgccattccg gcccacctcc
     2341 ctctggccgg acagaactgg tgctctcttc tccccactgt gcccgtccgc ggaccgggga
     2401 cccttccccg tgccctaccc cctcccattt tgatggtgtc tgtgacattt cctgttgtga
     2461 agtaaaagag ggacccctgc gtcctgc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC002678. Homo sapiens, pep...[gi:12803684] Links  


LOCUS       BC002678                1015 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, peptidylprolyl isomerase C (cyclophilin C), clone
            MGC:3673 IMAGE:3610178, mRNA, complete cds.
ACCESSION   BC002678
VERSION     BC002678.1  GI:12803684
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1015)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site:       http://www.nisc.nih.gov/
            Contact:        nisc_mgc@nhgri.nih.gov
            Shevchenko,Y., Wetherby,K.D., Beckstrom-Sternberg,S.M.,
            Benjamin,B., Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Guan,X., Gupta,J., Ho,S.-L., Karlins,E., Legaspi,R.,
            Lim,M., Maduro,Q.L., Masiello,C., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Snyder,B., Stantripop,S., Thomas,P.J.,
            Tiongson,E.E., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 12 Row: k Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4505990.
FEATURES             Location/Qualifiers
     source          1..1015
                     /organism="Homo sapiens"
                     /db_xref="LocusID:5480"
                     /db_xref="taxon:9606"
                     /clone="MGC:3673 IMAGE:3610178"
                     /tissue_type="Uterus, endometrium adenocarcinoma"
                     /clone_lib="NIH_MGC_44"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             88..726
                     /codon_start=1
                     /product="peptidylprolyl isomerase C (cyclophilin C)"
                     /protein_id="AAH02678.1"
                     /db_xref="GI:12803685"
                     /translation="MGPGPRLLLPLVLCVGLGALVFSSGAEGFRKRGPSVTAKVFFDV
                     RIGDKDVGRIVIGLFGKVVPKTVENFVALATGEKGYGYKGSKFHRVIKDFMIQGGDIT
                     TGDGTGGVSIYGETFPDENFKLKHYGIGWVSMANAGPDTNGSQFFITLTKPTWLDGKH
                     VVFGKVIDGMTVVHSIELQATDGHDRPLTNCSIINSGKIDVKTPFVVEIADW"
BASE COUNT      250 a    220 c    271 g    274 t
ORIGIN      
        1 ggcacgaggc ccgtcagctg tcccagagcc tgtgtcgcgc ccgtgccggt agcgcccgtg
       61 ccggtagcgc cgctgccacc gctcaccatg ggcccgggtc ctcggctgct gctacctctc
      121 gtgctttgcg tggggctcgg cgcacttgtg ttttcttcgg gggccgaggg cttccgcaag
      181 cgaggcccct cggtgacggc caaggtcttc tttgatgtga ggattggaga caaagatgtt
      241 ggcagaattg tgattggcct ctttggaaaa gttgtgccca agacagtgga aaattttgtt
      301 gctctagcaa caggagagaa aggatatgga tataaaggaa gcaagtttca tcgtgtcatc
      361 aaggatttca tgattcaagg aggtgacatc accactggag atggcactgg gggtgtgagc
      421 atctatggtg agacatttcc agatgagaac ttcaagctga agcactatgg cattgggtgg
      481 gtcagcatgg ccaacgctgg gcctgacacc aatggctctc agttctttat caccttgacc
      541 aagcccacct ggttggacgg caaacatgtg gtgtttggaa aagtcattga tgggatgaca
      601 gtggtgcact ccatagagct ccaagcaact gatgggcatg accgtccact caccaactgc
      661 tcgatcatca acagtggcaa gatagacgtg aaaacgcctt ttgtggttga gatcgctgat
      721 tggtgacaca actggcagaa aacaaggata tgctttggca ggggtgtgtg tgtgtgtgtg
      781 tgtgtgtgtg tgttgtgttg tctttcaatt atttgctttt tttttttact ttctttttgt
      841 attctatccc agatcacagg aaagttataa aaatcaaacc gtcacccttt agtttgcttg
      901 aactttagta aaccacctgc ttagggactt tgaacttaaa tatatcccct tcctcaagtg
      961 gtgctatttt aaaactaaaa aaaactttga attggcaaaa aaaaaaaaaa aaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC011674. Homo sapiens, Sim...[gi:15079713] Links  


LOCUS       BC011674                2852 bp    mRNA    linear   PRI 02-AUG-2001
DEFINITION  Homo sapiens, Similar to procollagen-lysine, 2-oxoglutarate
            5-dioxygenase 3, clone MGC:15175 IMAGE:4300048, mRNA, complete cds.
ACCESSION   BC011674
VERSION     BC011674.1  GI:15079713
KEYWORDS    MGC.
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2852)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site:       http://www.nisc.nih.gov/
            Contact:        nisc_mgc@nhgri.nih.gov
            Shevchenko,Y., Wetherby,K.D., Beckstrom-Sternberg,S.M.,
            Benjamin,B., Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Guan,X., Gupta,J., Ho,S.-L., Karlins,E., Legaspi,R.,
            Lim,M., Maduro,Q.L., Masiello,C., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Snyder,B., Stantripop,S., Thomas,P.J.,
            Tiongson,E.E., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 26 Row: n Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 7546823.
FEATURES             Location/Qualifiers
     source          1..2852
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:15175 IMAGE:4300048"
                     /tissue_type="Pancreas, epithelioid carcinoma"
                     /clone_lib="NIH_MGC_42"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             323..2539
                     /codon_start=1
                     /product="Similar to procollagen-lysine, 2-oxoglutarate
                     5-dioxygenase 3"
                     /protein_id="AAH11674.1"
                     /db_xref="GI:15079714"
                     /translation="MTSSGPGPRFLLLLPLLLPPAASASDRPRGRDPVNPEKLLVITV
                     ATAETEGYLRFLRSAEFFNYTVRTLGLGEEWRGGDVARTVGGGQKVRWLKKEMEKYAD
                     REDMIIMFVDSYDVILAGSPTELLKKFVQSGSRLLFSAESFCWPEWGLAEQYPEVGTG
                     KRFLNSGGFIGFATTIHQIVRQWKYKDDDDDQLFYTRLYLDPGLREKLSLNLDHKSRI
                     FQNLNGALDEVVLKFDRNRVRIRNVAYDTLPIVVHGNGPTKLQLNYLGNYVPNGWTPE
                     GGCGFCNQDRRTLPGGQPPPRVFLAVFVEQPTPFLPRFLQRLLLLDYPPDRVTLFLHN
                     NEVFHEPHIADSWPQLQDHFSAVKLVGPEEALSPGEARDMAMDLCRQDPECEFYFSLD
                     ADAVLTNLQTLRILIEENRKVIAPMLSRHGKLWSNFWGALSPDEYYARSEDYVELVQR
                     KRVGVWNVPYISQAYVIRGDTLRMELPQRDVFSGSDTDPDMAFCKSFRDKGIFLHLSN
                     QHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHENYSRALEGEGIVEQPCPDV
                     YWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENVPTVDIHMKQVGYEDQWL
                     QLLRTYVGPMTESLFPGYHTKARAVMNFVVRYRPDEQPSLRPHHDSSTFTLNVALNHK
                     GLDYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTWGTRYIMVSFVDP"
BASE COUNT      545 a    903 c    842 g    562 t
ORIGIN      
        1 ggcacgaggg cttccacatg tgtcaagcgg ctggctcagc ccagagtccc tgtctcccgc
       61 ccgccggccc gagccgccgc ccctcccccg cctcccgtgc gcccgggaca atcctcgcct
      121 tgtctgtggc gccggcatct ggagctttct gtagcctccg gatacgcctt tttttcaggg
      181 cgtagcccca gccaagctgc tccccgcggc ggccgcacag cagcccgagc gccccctttc
      241 cggagctccc ctccggagct gggatccagg cgcgtagcgg agatcccagg atcctgggtg
      301 ctgtctgggc ccgctcccca ccatgacctc ctcggggcct ggaccccggt tcctgctgct
      361 gctgccgctg ctgctgcccc ctgcggcctc agcctccgac cggccccggg gccgagaccc
      421 ggtcaaccca gagaagctgc tggtgatcac tgtggccaca gctgaaaccg aggggtacct
      481 gcgtttcctg cgctctgcgg agttcttcaa ctacactgtg cggaccctgg gcctgggaga
      541 ggagtggcga gggggtgatg tggctcgaac agttggtgga ggacagaagg tccggtggtt
      601 aaagaaggaa atggagaaat acgctgaccg ggaggatatg atcatcatgt ttgtggatag
      661 ctacgacgtg attctggccg gcagccccac agagctgctg aagaagttcg tccagagtgg
      721 cagccgcctg ctcttctctg cagagagctt ctgctggccc gagtgggggc tggcggagca
      781 gtaccctgag gtgggcacgg ggaagcgctt cctcaattct ggtggattca tcggttttgc
      841 caccaccatc caccaaatcg tgcgccagtg gaagtacaag gatgatgacg acgaccagct
      901 gttctacaca cggctctacc tggacccagg actgagggag aaactcagcc ttaatctgga
      961 tcataagtct cggatctttc agaacctcaa cggggcttta gatgaagtgg ttttaaagtt
     1021 tgatcggaac cgtgtgcgta tccggaacgt ggcctacgac acgctcccca ttgtggtcca
     1081 tggaaacggt cccactaagc tgcagctcaa ctacctggga aactacgtcc ccaatggctg
     1141 gactcctgag ggaggctgtg gcttctgcaa ccaggaccgg aggacactcc cgggggggca
     1201 gcctcccccc cgggtgtttc tggccgtgtt tgtggaacag cctactccgt ttctgccccg
     1261 cttcctgcag cggctgctac tcctggacta tccccccgac agggtcaccc ttttcctgca
     1321 caacaacgag gtcttccatg aaccccacat cgctgactcc tggccgcagc tccaggacca
     1381 cttctcagct gtgaagctcg tggggccgga ggaggctctg agcccaggcg aggccaggga
     1441 catggccatg gacctgtgtc ggcaggaccc cgagtgtgag ttctacttca gcctggacgc
     1501 cgacgctgtc ctcaccaacc tgcagaccct gcgtatcctc attgaggaga acaggaaggt
     1561 gatcgccccc atgctgtccc gccacggcaa gctgtggtcc aacttctggg gcgccctgag
     1621 ccccgatgag tactacgccc gctccgagga ctacgtggag ctggtgcagc ggaagcgagt
     1681 gggtgtgtgg aatgtaccat acatctccca ggcctatgtg atccggggtg ataccctgcg
     1741 gatggagctg ccccagaggg atgtgttctc gggcagtgac acagacccgg acatggcctt
     1801 ctgtaagagc tttcgagaca agggcatctt cctccatctg agcaatcagc atgaatttgg
     1861 ccggctcctg gccacttcca gatacgacac ggagcacctg caccccgacc tctggcagat
     1921 cttcgacaac cccgtcgact ggaaggagca gtacatccac gagaactaca gccgggccct
     1981 ggaaggggaa ggaatcgtgg agcagccatg cccggacgtg tactggttcc cactgctgtc
     2041 agaacaaatg tgtgatgagc tggtggcaga gatggagcac tacggccagt ggtcaggcgg
     2101 ccggcatgag gattcaaggc tggctggagg ctacgagaat gtgcccaccg tggacatcca
     2161 catgaagcag gtggggtacg aggaccagtg gctgcagctg ctgcggacgt atgtgggccc
     2221 catgaccgag agcctgtttc ccggttacca caccaaggcg cgggcggtga tgaactttgt
     2281 ggttcgctac cggccagacg agcagccgtc tctgcggcca caccacgact catccacctt
     2341 caccctcaac gttgccctca accacaaggg cctggactat gagggaggtg gctgccgctt
     2401 cctgcgctac gactgtgtga tctcctcccc gaggaagggc tgggcactcc tgcaccccgg
     2461 ccgcctcacc cactaccacg aggggctgcc aacgacctgg ggcacacgct acatcatggt
     2521 gtcctttgtc gacccctgac actcaaccac tctgccaaac ctgccctgcc attgtgcctt
     2581 tttagggggc ctggcccccg tcctgggagt tgggggatgg gtctctctgt ctccccactt
     2641 cctgagttca tgttccgcgt gcctgaactg aatatgtcac cttgctccca agacacggcc
     2701 ctctcaggaa gctcccggag tccccgcctc tctcctccgc ccacaggggt tcgtgggcac
     2761 agggcttctg gggactcccc gcgtgataaa ttattaatgt tccgcagtct cactctgaat
     2821 aaaggacagt ttgtaaaaaa aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D90279. Human mRNA for co...[gi:219509] Links  


LOCUS       HUMCA1V                 5676 bp    mRNA    linear   PRI 01-FEB-2000
DEFINITION  Human mRNA for collagen alpha 1(V) chain, complete cds.
ACCESSION   D90279
VERSION     D90279.1  GI:219509
KEYWORDS    alpha 1(V) chain; collagen.
SOURCE      Human placenta, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5676)
  AUTHORS   Takahara,K., Sato,Y., Okazawa,K., Okamoto,N., Noda,A., Yaoi,Y. and
            Kato,I.
  TITLE     Complete primary structure of human collagen alpha 1 (V) chain
  JOURNAL   J. Biol. Chem. 266 (20), 13124-13129 (1991)
  MEDLINE   91302336
COMMENT     These data kindly submitted in computer readable form by: Kazuhiko
            Takahara
            Takara Shuzo Co., Ltd.
            Biotechnology Research Laboratories
            3-4-1 Seta, Otsu
            Shiga 520-21
            Japan
            Phone: 81-775-43-7200
            Fax:    81-775-43-2494.
FEATURES             Location/Qualifiers
     source          1..5676
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             127..5643
                     /note="collagen alpha 1(V) chain precursor"
                     /codon_start=1
                     /protein_id="BAA14323.1"
                     /db_xref="GI:219510"
                     /translation="MDVHTRWKARSALRPGAPLLPPLLLLLLWAPPPSRAAQPADLLK
                     VLDFHNLPDGITKTTGFCATRRSSKGPDVAYRVTKDAQLSAPTKQLYPASAFPEDFSI
                     LTTVKAKKGSQAFLVSIYNEQGIQQIGLELGRSPVFLYEDHTGKPGPEDYPLFRGINL
                     SDGKWHRIALSVHKKNVTLILDCKKKTTKFLDRSDHPMIDINGIIVFGTRILDEEVFE
                     GDIQQLLFVSDHRAAYDYCEHYSPDCDTAVPDTPQSQDPNPDEYYTEGDGEGETYYYE
                     YPYYEDPEDLGKEPTPSKKPVEAAKETTEVPEELTPTPTEAAPMPETSEGAGKEEDVG
                     IGDYDYVPSEDYYTPSPYDDLTYGEGEENPDQPTDPGAGAEIPTSTADTSNSSNPAPP
                     PGEGADDLEGEFTEETIRNLDENYYDPYYDPTSSPSEIGPGMPANQDTIYEGIGGPRG
                     EKGQKGEPAIIEPGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGRPGLPG
                     ADGLPGPPGTMLMLPFRFGGGGDAGSKGPMVSAQESQAQAILQQARLALRGPAGPMGL
                     TGRPGPVGPPGSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPGQT
                     GPKGDRGFDGLAGLPGEKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGKPGPRG
                     LLGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGP
                     PGEKGPLGKPGLPGMPGADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGAD
                     GIRGLKGTKGEKGEDGFPGFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPG
                     PLGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGP
                     TGPRGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKD
                     GLPGHPGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPG
                     LAGKEGTKGDPGPAGLPGKDGPPGLRGFPGDRGLPGPVGALGLKGNEGPPGPPGPAGS
                     PGERGPAGAAGPIGIPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGLQGPVGLPGPA
                     GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGPQGPIGQPGPSGADGEPGPRG
                     QQGLFGQKGDEGPRGFPGPPGPVGLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGA
                     PGADGPQGPPGGIGNPGAVGEKGEPGEAGEPGPSGRSGPPGPKGERGEKGESGPSGAA
                     GPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGEPGPAGQDGPPGDKGDDGEPGQTGSPG
                     PTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGEAGLEGPPGKTGPIGPQGAPGKPGP
                     DGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPP
                     GEQGEKGDRGLPGPQGSSGPKGEQGITGPSGPIGPPGPPGLPGPPGPKGAKGSSGPTG
                     PRGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDASQLLDDGNGENYVDYADGMEE
                     IFGSLNSLKLEIEQMKRPLGTQQNPARTCKDLQLCHPDFPDGEYWVDPNQGCSRDSFK
                     VYCNFTAGGSTCVFPDKKSEGARITSWPKENPGSWFSEFKRGKLLSYVDAEGNPVGVV
                     QMTFLRLLSASAHQNVTYHCYQSVAWQDAATGSYDKALRFLGSNDEEMSYDNNPYIRA
                     LVDGCATKKGYQKTVLEIDTPKVEQVPIVDIMFNDFGEASQKFGFEVGPACFMG"
     sig_peptide     127..237
                     /note="signal peptide of collagen alpha 1(V) chain"
     mat_peptide     238..5640
                     /product="mature peptide of collagen alpha 1(V) chain"
BASE COUNT     1181 a   1803 c   1872 g    820 t
ORIGIN      
        1 gtccccatga cctcctaaag tggtgcggtc cctgctgagt gcgctgcccg ggccgtgacc
       61 cgcgcccctg tgcgtccccg cgcgcctccg agcgcccctg tgcgccccgg cccgcgcccc
      121 gccggcatgg acgtccatac ccgctggaaa gcgcgcagcg cgctccgccc gggcgccccg
      181 ctgctgcccc cgctgctgct gctgctgctg tgggcgccgc ctccgagccg cgcagctcag
      241 ccagcagatc tcctgaaggt tctagatttt cacaacttgc ctgatggaat aacaaagaca
      301 acaggctttt gcgccacgcg gcgatcttcc aaaggcccgg atgtcgctta cagagtcacc
      361 aaagacgcgc agctcagcgc acccaccaag cagctgtacc ctgcgtctgc atttcccgag
      421 gacttctcca tcctaacaac tgtgaaagcc aagaaaggca gccaggcctt cctggtctcc
      481 atctacaacg agcagggtat ccagcagatt gggctggagc tgggccgctc tcccgtcttc
      541 ctctacgagg accacacggg gaagcctggc ccggaagact accccctctt ccggggcatc
      601 aacctgtcag atggcaagtg gcacagaatt gctctcagcg tccacaagaa aaatgtcacc
      661 ttgatcctcg actgtaaaaa gaagaccacc aaattcctcg accgcagcga ccaccccatg
      721 atcgacatca atggcatcat cgtgtttggc acccggatcc tggatgagga ggtgtttgag
      781 ggtgacatcc agcagctgct ctttgtctcg gaccaccggg cagcttatga ttactgtgag
      841 cactacagcc ctgactgtga caccgcagta cctgacaccc cacagtcgca ggaccccaat
      901 ccagatgaat attacacgga aggagacggc gagggtgaga cctattacta cgaatacccc
      961 tactacgaag accccgaaga cctagggaag gagcccaccc ccagcaagaa gcccgtggaa
     1021 gctgccaaag aaaccacaga ggtccccgag gagctgaccc cgacccccac ggaagctgct
     1081 cccatgcctg aaaccagtga aggggctggg aaggaagagg acgtcggcat cggggactat
     1141 gactacgtgc ccagtgagga ctactacacg ccctcaccgt atgatgacct cacctatggc
     1201 gagggggagg agaaccctga ccagcccaca gacccaggcg ctggggccga aattcccacc
     1261 agcaccgccg acacctccaa ctcctccaat ccagctccgc ctccagggga aggtgcggat
     1321 gacttggagg gggagttcac tgaggaaacg atccggaacc ttgacgagaa ctactacgac
     1381 ccctactacg accccaccag ctccccgtcg gagatcgggc cgggaatgcc ggcgaaccag
     1441 gataccatct atgaagggat tggaggacct cggggcgaga aaggccaaaa gggagaacca
     1501 gcgattatcg agccgggcat gctcatcgag ggcccgcctg gcccagaagg ccccgcgggt
     1561 cttcccggac ctccaggaac catgggtccc actggccaag tcggggaccc tggagaaagg
     1621 ggcccccctg gacgcccagg ccttcctggg gccgatggcc tgcccggtcc tccaggaacc
     1681 atgctcatgc tgcccttccg gtttggaggt ggcggcgatg cgggctccaa aggccccatg
     1741 gtctcagccc aggagtccca ggcgcaagcc attctccagc aggccaggtt ggcactgagg
     1801 ggaccagctg gcccgatggg tctcacaggg agacctggcc ctgtgggtcc ccctgggagc
     1861 ggaggtttga agggcgagcc gggagacgtg gggcctcagg gtcctcgagg tgtgcaaggc
     1921 ccgcctggtc cggccgggaa gcccggaaga cggggtcggg ctgggagtga tggagccaga
     1981 ggaatgcctg gacaaactgg ccccaagggt gaccggggtt tcgacggcct ggctgggttg
     2041 ccaggcgaga agggccacag gggtgaccct ggtccttccg gcccaccagg acctccggga
     2101 gacgatggag aaaggggtga cgacggagaa gttgggccca gggggctgcc tgggaagccc
     2161 gggccacgtg gtctgcttgg gccgaaaggg cccccaggtc ctcccggacc tcccggtgtc
     2221 acgggtatgg acggccagcc ggggccaaaa ggaaatgtgg gtccccaggg agagcctggc
     2281 cccccaggac agcagggtaa tccaggcgcc cagggtcttc caggccccca gggtgcaatt
     2341 ggtcctccag gagaaaaggg tcccttgggg aaaccaggcc ttccaggaat gcccggtgct
     2401 gacggacccc cgggacaccc tggcaaagaa ggccctccag gagagaaagg aggtcagggt
     2461 ccacctggcc cccagggtcc gattggctac ccaggtcctc gaggagtcaa gggggccgat
     2521 ggcatccgtg gtctgaaggg cacaaagggc gagaagggtg aagacggctt tcctgggttt
     2581 aaaggagaca tgggcatcaa gggtgatcgg ggggagatcg gcccacccgg tcccagggga
     2641 gaagatggcc ctgaaggccc aaagggtcgc ggaggtccca atggtgaccc cggtcctctg
     2701 ggaccccctg gggagaaggg aaaactcgga gtcccagggt taccagggta tccaggaaga
     2761 caaggaccaa agggctctat tggattccct ggatttcctg gcgccaatgg agagaagggc
     2821 ggcaggggga cccctggaaa gccaggaccg cgggggcagc gaggcccaac gggtccgagg
     2881 ggtgaaagag gcccccgggg catcactggg aagcctggcc ccaagggcaa ctccggaggt
     2941 gacggcccag ctggccctcc tggtgaacgg ggacccaatg gaccccaagg acccacagga
     3001 tttcctggac caaagggccc ccctggccct ccaggcaagg atggactccc aggacaccct
     3061 ggacagagag gcgagactgg tttccaaggc aagaccggcc ctccaggccc ccccggcgtg
     3121 gtcggccctc agggtcccac gggagaaacg ggcccaatgg gtgagcgtgg ccaccctggg
     3181 ccccctggac cccccggtga acaggggctt ccgggccttg ctggaaaaga agggacgaag
     3241 ggtgacccag gccctgcagg cctccctggg aaagatggcc ctccaggatt acgtggtttc
     3301 cctggggacc gagggcttcc tggtccagtg ggagctcttg gactgaaagg caatgaaggg
     3361 ccccctggcc caccaggccc tgcgggatct ccaggggaga gaggtccagc tggagccgct
     3421 gggcccatcg gaattccagg gagacctggg ccccagggac ccccagggcc ggcaggagag
     3481 aaaggggctc ctggcgagaa aggcccacaa ggcccagctg gccgagacgg tctccagggg
     3541 cctgtggggc tcccgggtcc agctggccct gtgggtcccc ctggagaaga cggagataag
     3601 ggagagatcg gggagccggg gcagaaagga agcaaggggg acaaaggaga acagggtcct
     3661 cctgggccta caggtcctca aggccccatc ggacagccag gcccctctgg agctgacggc
     3721 gagccggggc ctcggggcca gcagggcctt ttcgggcaga aaggtgatga aggtcccaga
     3781 ggctttcctg gaccccctgg gccagtgggg ctgcagggtt tgccaggacc tccaggcgag
     3841 aagggtgaga caggagacgt gggccagatg ggccccccgg gtccccctgg cccccgagga
     3901 ccctccggag ctccaggtgc tgatggccca caaggtcccc caggtggaat aggaaaccct
     3961 ggtgcagtgg gagagaaggg cgagcctggc gaagcaggtg agcctggccc ttccgggaga
     4021 agcggccccc cgggacccaa aggagaaagg ggagagaagg gcgagtcagg cccttcaggt
     4081 gctgccggac cccctggacc caaaggccct cccggagatg atggtcccaa aggcagccct
     4141 ggcccagtgg gttttcctgg agatcctggc ccccccggag agcctggccc cgcgggtcaa
     4201 gatggtcccc ctggtgacaa aggagatgat ggtgaacccg ggcagacggg atcccccggc
     4261 cctactggtg aaccaggtcc atcggggcct ccaggaaaaa ggggtccccc aggccccgca
     4321 ggccccgaag gcagacaggg agagaaaggg gccaagggag aagccggctt ggaaggccct
     4381 cctgggaaga ctggccccat cggcccccag ggggcccctg ggaagcccgg accggatggc
     4441 cttcgaggga tccctggccc tgtgggagaa caaggtctcc caggatcccc aggcccggac
     4501 ggtccccccg gccccatggg tcccccagga cttcccggcc tcaaaggaga ttctggtccc
     4561 aaaggtgaaa agggtcatcc aggcctgatc gggctcatcg gtcctccggg tgaacagggt
     4621 gagaagggcg accgtggtct ccctggcccc cagggctcct ccggtcctaa gggagaacag
     4681 ggtatcactg gtccttctgg cccgattggg cctcctgggc cccctggcct gccgggtccg
     4741 cctggtccaa aaggtgctaa gggctcctcg ggtccaactg gcccgagggg tgaggcaggc
     4801 cacccaggac ccccaggccc cccgggcccc ccgggagagg tcatccagcc cctgccaatc
     4861 caggcatcca ggacgcggcg gaacatcgac gccagccagc tgctggacga cgggaatggc
     4921 gagaactacg tggactacgc ggacggcatg gaagagatct tcggctctct caactctctg
     4981 aagctggaga ttgagcagat gaaacggccc ctgggcacgc agcagaaccc cgcccgcacc
     5041 tgcaaggacc tgcagctctg ccaccccgac ttcccagatg gtgaatactg ggtcgatcct
     5101 aaccaaggat gctccaggga ttccttcaag gtttactgca acttcacagc cggggggtcg
     5161 acatgcgtct tccctgacaa gaagtccgaa ggggccagaa tcacttcttg gcccaaagaa
     5221 aacccgggct cctggttcag tgaattcaag cgtgggaaac tgctctccta tgtggacgcc
     5281 gagggcaacc ctgtgggtgt ggtacagatg accttcctgc ggctgctgag cgcctctgcc
     5341 caccagaacg tcacctacca ctgctaccag tcagtggcct ggcaggacgc agccacgggc
     5401 agctacgaca aggccctccg cttcctgggc tccaacgacg aggagatgtc ctatgacaac
     5461 aacccctaca tccgcgccct ggtggacggc tgtgctacca agaaaggcta ccagaagacg
     5521 gttctggaga tcgacacccc caaagtggag caggtgccca tcgtggacat catgttcaat
     5581 gacttcggtg aagcgtcaca gaaatttgga tttgaagtgg ggccggcttg cttcatgggc
     5641 taggagccgc cgagcccggg ctcccgagcc gaattc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L38808. Homo sapiens alph...[gi:1020325] Links  


LOCUS       HUMCOL5A1A              3732 bp    DNA     linear   PRI 23-OCT-1995
DEFINITION  Homo sapiens alpha-1 type V collagen (COL5A1) gene, 5' flank and
            exon 1.
ACCESSION   L38808
VERSION     L38808.1  GI:1020325
KEYWORDS    COL5A1 gene; alpha-1 type V collagen; collagen.
SOURCE      Homo sapiens (clone: CW45) placenta DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Takahara,K., Sato,Y., Okazawa,K., Okamoto,N., Noda,A., Yaoi,Y. and
            Kato,I.
  TITLE     Complete primary structure of human collagen alpha 1 (V) chain
  JOURNAL   J. Biol. Chem. 266 (20), 13124-13129 (1991)
  MEDLINE   91302336
   PUBMED   2071595
REFERENCE   2  (sites)
  AUTHORS   Greenspan,D.S., Cheng,W. and Hoffman,G.G.
  TITLE     The pro-alpha 1(V) collagen chain. Complete primary structure,
            distribution of expression, and comparison with the pro-alpha 1(XI)
            collagen chain
  JOURNAL   J. Biol. Chem. 266 (36), 24727-24733 (1991)
  MEDLINE   92105142
   PUBMED   1722213
REFERENCE   3  (bases 1 to 3732)
  AUTHORS   Lee,S. and Greenspan,D.S.
  TITLE     Transcriptional promoter of the human alpha 1(V) collagen gene
            (COL5A1)
  JOURNAL   Biochem. J. 310 (Pt 1), 15-22 (1995)
  MEDLINE   95374437
   PUBMED   7646438
FEATURES             Location/Qualifiers
     source          1..3732
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="9q34.2-q34.3"
                     /clone="CW45"
                     /tissue_type="placenta"
                     /note="(vector lambda EMBL3)"
     protein_bind    723..728
                     /bound_moiety="Sp1"
     protein_bind    complement(1453..1460)
                     /bound_moiety="AP2"
     protein_bind    complement(1705..1712)
                     /bound_moiety="AP2"
     protein_bind    1817..1822
                     /bound_moiety="NF1"
     protein_bind    complement(2103..2108)
                     /bound_moiety="Sp1"
     protein_bind    2106..2113
                     /bound_moiety="AP2"
     protein_bind    complement(2114..2119)
                     /bound_moiety="Sp1"
     protein_bind    complement(2157..2162)
                     /bound_moiety="Sp1"
     protein_bind    complement(2173..2178)
                     /bound_moiety="Sp1"
     gene            2390..3732
                     /gene="COL5A1"
     prim_transcript 2390..>3732
                     /gene="COL5A1"
                     /note="G00-131-457"
     exon            2390..2880
                     /gene="COL5A1"
                     /note="G00-131-457"
                     /number=1
     protein_bind    2459..2464
                     /gene="COL5A1"
                     /bound_moiety="Sp1"
     protein_bind    complement(2572..2577)
                     /bound_moiety="Sp1"
     protein_bind    complement(2595..2600)
                     /bound_moiety="Sp1"
     CDS             2772..>2880
                     /gene="COL5A1"
                     /codon_start=1
                     /product="alpha-1 type V collagen"
                     /protein_id="AAA79853.1"
                     /db_xref="GI:1020326"
                     /db_xref="GDB:G00-131-457"
                     /translation="MDVHTRWKARSALRPGAPLLPPLLLLLLWAPPPSRA"
     protein_bind    complement(2810..2815)
                     /bound_moiety="Sp1"
     protein_bind    2896..2901
                     /gene="COL5A1"
                     /bound_moiety="Sp1"
     enhancer        2981..2988
                     /gene="COL5A1"
                     /note="viral enhancer core consensus sequence"
     protein_bind    complement(3071..3076)
                     /bound_moiety="Sp1"
     protein_bind    3088..3093
                     /gene="COL5A1"
                     /bound_moiety="Sp1"
     protein_bind    complement(3276..3281)
                     /bound_moiety="Sp1"
BASE COUNT      744 a   1117 c   1165 g    706 t
ORIGIN      
        1 tccggaccct gtcacggatg aactgggtaa gtcccctacc ctctgtggcc cctgatttcc
       61 tcatctgtgg atgggagaga tgccagtatt ccctaccggt gcatgagaac tgatgcacgt
      121 gtgggtgaag agtgtgaggc aaaagcctta agatgtggct gatattctta gcttggcaac
      181 acagttcgaa tcctgattgg tactgccggg ccattctgcc atagtccctg gctgacacat
      241 caggccgcca agtttgggga cttctctctg agtttcctgg tgggatagaa actcatctct
      301 ccacccaact caaggctcag atgagggtgg aaactcacat ttgacagctt tccaagccac
      361 cgcaccactt gctctccaag agaggataaa gtgcatctgt catggctgct tgagctgcaa
      421 gactgagcca tcttaccatt gctgagaagt caactttggg ggaccttggg gtggtcactt
      481 gagtgtcagc tcttaaggcg ccagttgtgt taattcaggg caaagtttag atgctggctg
      541 tgagttacag gagggcttgc tgactggcac ggatgtgcac atgggactgt cagaaaggaa
      601 ctccacttgt gcttggggaa aaagaagctg gaaccagaaa gcctccactc tcttctgata
      661 accaggccat ggcctgtgtc agcaccccta gtcttcaggg gtgaagctgg ccttgctggt
      721 aggggcgggg gcactccaag gaccaagcac agctggggct ggcgcggagt ctggggtttg
      781 atggaatttc tcactcattg aagtccacag ggaaagtaaa gggccagtgg ctgcaggtta
      841 accaacgcac agacacgcac acaggcacac acacaaacac acacatgcac acaactcaca
      901 cacatttttg cctgatcaca tatgccttat acaagttgta aaacaaaata aaatcaacaa
      961 ggaaacaggt cagaactaac atttgcttta ctgaatgtat ctcaagcacc tccccacgtc
     1021 tgtgtgtgaa gagcttctca ttcttttcct gcgaaatatt tgatatctat gcaagcgcat
     1081 gtggcagata tggagactaa aagcctcatc accaagttag caacagcgac ctatcaccca
     1141 gctccggagc ctgccatggc tatcctggca aacctaacct cggacgcctc tctcacctcc
     1201 tgtttccaga atggtgagaa ttaaagagca tttggggaat ttcacgccca gtctggagat
     1261 gtaactggag aggggagtcc cagccctgtc tggcatctct ggctaaattg tttgttcaaa
     1321 gggaatgggg atggcccagg gtttgcaaat cgttcttaat tcattctgat tccctccttt
     1381 taaaaaacta gccctgcagc ccttggggaa ggggtgtggg tcaggcttca ggaggctgtg
     1441 cagggaggcc aggcctgggg aagctggagg gagagacgcc cacctaccac ccagagactg
     1501 gcgggcctcc tgctggagga gaccccgagg ggcttaaaga agccaggcag gagaatcagg
     1561 gccccagctc agctcttctc cgagagcagc cgatcaggaa tgccttctcc ctgcgccagc
     1621 tgtcactccc aaggggatgc cttctttggt ttccttttta ttgcagaggg gccaccaggg
     1681 agtgggcagg agtgaggggt gagagcctgg gggccgcaga gccaaacttc gcctgattaa
     1741 taaccccaga tgttcccgtc caagcacatg gaggtcccgg gagctcttgg gaagtcagac
     1801 cccagctcgg tcccgtgcca atctgcagtt aggaggcagg tcctcggtcc gggttgacac
     1861 gtaggccaca gtcttgggcg ccctgatctg gctggatggg aagtttagtc tccacccaga
     1921 agaggagctc tgaactcctc cacgttaaag aaaaagttag gcggggtaaa ctgcaacttt
     1981 ctttttaacc cgaagtgatc aaacctcggg gcccctctgc actggttctt ggagcatggc
     2041 caggtgcggg cacctggggt gggaacagtg gggggaccag ggtgggcccc tgggctccga
     2101 ggccgcccca ggcccgcccg cctaccggct ctcagagaaa gaacaggggg ccgcgcccgc
     2161 cccacgtccg ctccgccccg ggccagcccc ttcctcgctg cgactcgccc gctgtcccca
     2221 ccccctcgcc cgcggcgccc agtgggaggc gggggctggc ctcgccgagc ccagcgccgg
     2281 gctctgattt gctgcgggcg ttggggatcg acagcctccg cggctgcctt ccaggagaga
     2341 gggagggagg aaaaggggga aaaaagtgct cggcgccgaa ggcgaggtcc gcactctccg
     2401 tccccgcggc tggcgcagga cctcactcga gcggagcgcc cacggggagc gggtcgcggg
     2461 gcggcggcgg cgaggaggag gcgagaagga gttggaggag gaggaggagg aggcgagggc
     2521 gagcgagccc agcggggtcc cggccgcccc gcgggccaaa gtcgagccct cccgcccgtg
     2581 ggcgagcgcg ccagccgccc cttccagaac agccgccgcc acaaagaaga acggggggtg
     2641 ccgaggtccc catgacctcc taaagtggtg cggtccctgc tgagtgcgct gcccgggccg
     2701 tgacccgcgc ccctgtgcgt ccccgcgcgc ctccgagcgc ccctgtgcgc cccggcccgc
     2761 gccccgccgg catggacgtc catacccgct ggaaagcgcg cagcgcgctc cgcccgggcg
     2821 ccccgctgct gcccccgctg ctgctgctgc tgctgtgggc gccgcctccg agccgcgcag
     2881 gtaagggcgc cccgggggcg gggctgcggg atggggcgca gcccgggcgc cgctgtcatc
     2941 cccgggcgcc ttcgcccgca gaacttttct tccttggcct gtggaatgca cgggccaaga
     3001 ccacgaatgc catttgctgg gggccccccc gagatgacga cacgcagaca caatgcccgc
     3061 gggcgcgccg ccgccccctc cccagacggg cgggtcgggt ggggatgtcg cgcgagccga
     3121 ggaaggaccg agggctggat cagcaggagg ggttgtccac gaggcgggca aacttttgtc
     3181 ccaaaaacct tccttctctg tcccacatca caggcgccca gcttgggccc tcacagccct
     3241 acagcagaac ttcctctgct ccaaaccggg cagggccgcc ccctagagtt acagcccttc
     3301 aaatcccggg actcctgggg atggggggaa tcccagaagc ctgggccccc agcactgaac
     3361 ttcccccagg cactgtcact ctaggctgag ctggcgccct gctttcccca gggacagcgt
     3421 ttcctgcagc cttggtcacc taagtgttcg gggggcactg gggaccctgc agggtggtag
     3481 acagccctgc tcctaatccc acccccaaac ttcttgatcc ctggaaggca gcttttcctg
     3541 gattcgctgc tgagagatgg gatgcgtgga gcaaagccag agatgggctt gcaggcgggg
     3601 gctgtgagtg gttgggatga tatgcagatg tgggattcct tctgccaagg gtcagccctt
     3661 agaaggaggc ttgcttcccg gggagggcgc ttcaaggcgg ggactgtctg cgtccctgcc
     3721 tgggttctcg ag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M76729. Human pro-alpha-1...[gi:189519] Links  


LOCUS       HUMPA1V                 7138 bp    mRNA    linear   PRI 07-JAN-1995
DEFINITION  Human pro-alpha-1 (V) collagen mRNA, complete cds.
ACCESSION   M76729
VERSION     M76729.1  GI:189519
KEYWORDS    alpha-1 type V collagen.
SOURCE      Homo sapiens cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 7138)
  AUTHORS   Greenspan,D.S., Cheng,W. and Hoffman,G.G.
  TITLE     The pro-alpha 1(V) collagen chain. Complete primary structure,
            distribution of expression, and comparison with the pro-alpha 1(XI)
            collagen chain
  JOURNAL   J. Biol. Chem. 266 (36), 24727-24733 (1991)
  MEDLINE   92105142
   PUBMED   1722213
FEATURES             Location/Qualifiers
     source          1..7138
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="9q34.2-q34.3"
     gene            1..7138
                     /gene="COL5A1"
     CDS             230..5746
                     /gene="COL5A1"
                     /codon_start=1
                     /product="pro-alpha-1 type V collagen"
                     /protein_id="AAA59993.1"
                     /db_xref="GI:189520"
                     /db_xref="GDB:G00-131-457"
                     /translation="MDVHTRWKARSALRPGAPLLPPLLLLLLWAPPPSRAAQPADLLK
                     VLDFHNLPDGITKTTGFCATRRSSKGPDVAYRVTKDAHVSAPTKQLYPASAFPEDFSI
                     LTTVKAKKGSQAFLVSIYNEQGIQQIGLELGRSPVFLYEDHTGKPGPEDYPLFRGINL
                     SDGKWHRIALSVHKKNVTLILDCKKKTTKFLDRSDHPMIDINGIIVFGTRILDEEVFE
                     GDIQQLLFVSDHRAAYDYCEHYSPDCDTAVPDTPQSQDPNPDEYYTEGDGEGETYYYE
                     YPYYEDPEDLGKEPTPSKKPVEAAKETTEVPEELTPTPTEAAPMPETSEGAGKEEDVG
                     IGDYDYVPSEDYYTPSPYDDLTYGEGEENPDQPTDPGAGAEIPTSTADTSNSSNPRPP
                     PGEGADDLEGEFTEETIRNLDENYYDPYYDPTSSPSEIGPGMPANQDTIYEGIGGPRG
                     EKGQKGEPAIIEPGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGRPGLPG
                     ADGLPGPPGTMLMLPFRFGGGGDAGSKGPMVSAQESQAQAILQQARLALRGPAGPMGL
                     TGRPGPVGPPGSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPGQT
                     GPKGDRGFDGLAGLPGEKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRG
                     LLGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGP
                     PGEKGPLGKPGLPGMPGADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGAD
                     GIRGLKGTKGEKGEDGFPGFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPG
                     PLGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGP
                     TGPRGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKD
                     GLPGHPGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPG
                     LAGKEGTKGDPGPAGLPGKDGPPGLRGFPGDRGLPGPVGALGLKGNEGPPGPPGPAGS
                     PGERGPAGAAGPIGIPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGLQGPVGLPGPA
                     GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGPQGPIGQPGPSGADGEPGPRG
                     QQGLFGQKGDEGPRGFPGPPGPVGLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGA
                     PGADGPQGPPGGIGNPGAVGEKGEPGEAGEPGLPGEGGPPGPKGERGEKGESGPSGAA
                     GPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGEPGPAGQDGPPGDKGDDGEPGQTGSPG
                     PTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGEAGLEGPPGKTGPIGPQGAPGKPGP
                     DGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPP
                     GEQGEKGDRGLPGPQGSSGPKGEQGITGPSGPIGPPGPPGLPGPPGPKGAKGSSGPTG
                     PKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDASQLLDDGNGENYVDYADGMEE
                     IFGSLNSLKLEIEQMKRPLGTQQNPARTCKDLQLCHPDFPDGEYWVDPNQGCSRDSFK
                     VYCNFTAGGSTCVFPDKKSEGARITSWPKENPGSWFSEFKRGKLLSYVDAEGNPVGVV
                     QMTFLRLLSASAHQNVTYHCYQSVAWQDAATGSYDKALRFLGSNDEEMSYDNNPYIRA
                     LVDGCATKKGYQKTVLEIDTPKVEQVPIADIMFNDFGEASQKFGFEVGPACFMG"
BASE COUNT     1546 a   2238 c   2278 g   1076 t
ORIGIN      
        1 gccgccccgc gggccaaagt cgagccctcc cgcccgtggg cgagcgcgcc agccgcccct
       61 tccagaacag ccgccgccac aaagaagaac ggggggtgcc gaggtcccca tgacctccta
      121 aagtggtgcg gtccctgctg agtgcgctgc ccgggccgtg acccgcgccc ctgtgcgtcc
      181 ccgcgcgcct ccgagcgccc ctgtgcgccc cggcccgcgc cccgccggca tggacgtcca
      241 tacccgctgg aaagcgcgca gcgcgctccg cccgggcgcc ccgctgctgc ccccgctgct
      301 gctgctgctg ctgtgggcgc cgcctccgag ccgcgcagct cagccagcag atctcctgaa
      361 ggttctagat tttcacaact tgcctgatgg aataacaaag acaacaggct tttgcgccac
      421 gcggcgatct tccaaaggcc cggatgtcgc ttacagagtc accaaagacg cgcacgtcag
      481 cgcacccacc aagcagctgt accctgcgtc tgcatttccc gaggacttct ccatcctaac
      541 aactgtgaaa gccaagaaag gcagccaggc cttcctggtc tccatctaca acgagcaggg
      601 tatccagcag attgggctgg agctgggccg ctctcccgtc ttcctctacg aggaccacac
      661 ggggaagcct ggcccggaag actaccccct cttccggggc atcaacctgt cagatggcaa
      721 gtggcacaga attgctctca gcgtccacaa gaaaaatgtc accttgatcc tcgactgtaa
      781 aaagaagacc accaaattcc tcgaccgcag cgaccacccc atgatcgaca tcaatggcat
      841 catcgtgttt ggcacccgga tcctggatga ggaggtgttt gagggtgaca tccagcagct
      901 gctctttgtc tcggaccacc gggcagctta tgattactgt gagcactaca gccctgactg
      961 tgacaccgca gtacctgaca ccccacagtc gcaggacccc aatccagatg aatattacac
     1021 ggaaggagac ggcgagggtg agacctatta ctacgaatac ccctactacg aagaccccga
     1081 agacctaggg aaggagccca cccccagcaa gaagcccgtg gaagctgcca aagaaaccac
     1141 agaggtcccc gaggagctga ccccgacccc cacggaagct gctcccatgc ctgaaaccag
     1201 tgaaggggct gggaaggaag aggacgtcgg catcggggac tatgactacg tgcccagtga
     1261 ggactactac acgccctcac cgtatgatga cctcacctat ggcgaggggg aggagaaccc
     1321 tgaccagccc acagacccag gcgctggggc cgaaattccc accagcaccg ccgacacctc
     1381 caactcctcc aatccacgtc cgcctccagg ggaaggtgcg gatgacttgg agggggagtt
     1441 cactgaggaa acgatccgga accttgacga gaactactac gacccctact acgaccccac
     1501 cagctccccg tcggagatcg ggccgggaat gccggcgaac caggatacca tctatgaagg
     1561 gattggagga cctcggggcg agaaaggcca aaagggagaa ccagcgatta tcgagccggg
     1621 catgctcatc gagggcccgc ctggcccaga aggccccgcg ggtcttcccg gacctccagg
     1681 aaccatgggt cccactggcc aagtcgggga ccctggagaa aggggccccc ctggacgccc
     1741 aggccttcct ggggccgatg gcctgcccgg tcctccagga accatgctca tgctgccctt
     1801 ccggtttgga ggtggcggcg atgcgggctc caaaggcccc atggtctcag cccaggagtc
     1861 ccaggcgcaa gccattctcc agcaggccag gttggcactg aggggaccag ctggcccgat
     1921 gggtctcaca gggagacctg gccctgtggg tccccctggg agcggaggtt tgaagggcga
     1981 gccgggagac gtggggcctc agggtcctcg aggtgtgcaa ggcccgcctg gtccggccgg
     2041 gaagcccgga agacggggtc gggctgggag tgatggagcc agaggaatgc ctggacaaac
     2101 tggccccaag ggtgaccggg gtttcgacgg cctggctggg ttgccaggcg agaagggcca
     2161 caggggtgac cctggtcctt ccggcccacc aggacctccg ggagacgatg gagaaagggg
     2221 tgacgacgga gaagttgggc ccagggggct gcctggggag cccgggccac gtggtctgct
     2281 tgggccgaag gggcccccag gccctcccgg acctcccggt gtcacgggta tggacggcca
     2341 gccggggcca aaaggaaatg tgggtcccca gggagagcct ggccccccag gacagcaggg
     2401 taatccaggc gcccagggtc ttccaggccc ccagggtgca attggtcctc caggagaaaa
     2461 gggtcccttg gggaaaccag gccttccagg aatgcccggt gctgacggac ccccgggaca
     2521 ccctggcaaa gaaggccctc caggagagaa aggaggtcag ggtccacctg gcccccaggg
     2581 tccgattggc tacccaggtc ctcgaggagt caagggggcc gatggcatcc gtggtctgaa
     2641 gggcacaaag ggcgagaagg gtgaagacgg ctttcctggg tttaaaggag acatgggcat
     2701 caagggtgat cggggggaga tcggcccacc cggtcccagg ggagaagatg gccctgaagg
     2761 cccaaagggt cgcggaggtc ccaatggtga ccccggtcct ctgggacccc ctggggagaa
     2821 gggaaaactc ggagtcccag ggttaccagg gtatccagga agacaaggac caaagggctc
     2881 tattggattc cctggatttc ctggcgccaa tggagagaag ggcggcaggg ggacccctgg
     2941 aaagccagga ccgcgggggc agcgaggccc aacgggtccg aggggtgaaa gaggcccccg
     3001 gggcatcact gggaagcctg gccccaaggg caactccgga ggtgacggcc cagctggccc
     3061 tcctggtgaa cggggaccca atggacccca aggacccaca ggatttcctg gaccaaaggg
     3121 cccccctggc cctccaggca aggatggact cccaggacac cctggacaga gaggcgagac
     3181 tggtttccaa ggcaagaccg gccctccagg cccccccggc gtggtcggcc ctcagggtcc
     3241 cacgggagaa acgggcccaa tgggtgagcg tggccaccct gggccccctg gaccccccgg
     3301 tgaacagggg cttccgggcc ttgctggaaa agaagggacg aagggtgacc caggccctgc
     3361 aggcctccct gggaaagacg gccctccagg attacgtggt ttccctgggg accgagggct
     3421 tcctggtcca gtgggagctc ttggactgaa aggcaatgaa gggccccctg gcccaccagg
     3481 ccctgcggga tctccagggg agagaggtcc agctggagcc gctgggccca tcggaattcc
     3541 agggagacct gggccccagg gacccccagg gccggcagga gagaaagggg ctcctggcga
     3601 gaaaggccca caaggcccag ctggccgaga cggtctccag gggcctgtgg ggctcccggg
     3661 tccagctggc cctgtgggtc cccctggaga agacggagat aagggagaga tcggggagcc
     3721 ggggcagaaa ggaagcaagg gggacaaagg agaacagggt cctcctgggc ctacaggtcc
     3781 tcaaggcccc atcggacagc caggcccctc tggagctgac ggcgagccgg ggcctcgggg
     3841 ccagcagggc cttttcgggc agaaaggtga tgaaggtccc agaggctttc ctggaccccc
     3901 tgggccagtg gggctgcagg gtttgccagg acctccaggc gagaagggtg agacaggaga
     3961 cgtgggccag atgggccccc cgggtccccc tggcccccga ggaccctccg gagctccagg
     4021 tgctgatggc ccacaaggtc ccccaggtgg aataggaaac cctggtgcag tgggagagaa
     4081 gggcgagcct ggcgaagcag gtgagcctgg ccttccggga gaaggcggcc ccccgggacc
     4141 caaaggagaa aggggagaga agggcgagtc aggcccttca ggtgctgccg gaccccctgg
     4201 acccaaaggc cctcccggag atgatggtcc caaaggcagc cctggcccag tgggttttcc
     4261 tggagatcct ggcccccccg gagagcctgg ccccgcgggt caagatggtc cccctggtga
     4321 caaaggagat gatggtgaac ccgggcagac gggatccccc ggccctactg gtgaaccagg
     4381 tccatcgggg cctccaggaa aaaggggtcc cccaggcccc gcaggccccg aaggcagaca
     4441 gggagagaaa ggggccaagg gagaagccgg cttggaaggc cctcctggga agactggccc
     4501 catcggcccc cagggggccc ctgggaagcc cggaccggat ggccttcgag ggatccctgg
     4561 ccctgtggga gaacaaggtc tcccaggatc cccaggcccg gacggtcccc ccggccccat
     4621 gggtccccca ggacttcccg gcctcaaagg agattctggt cccaaaggtg aaaagggtca
     4681 tccaggcctg atcgggctca tcggtcctcc cggtgaacag ggtgagaagg gcgaccgtgg
     4741 tctccctggc ccccagggct cctccggtcc taagggagaa cagggtatca ctggtccttc
     4801 tggcccgatt gggcctcctg ggccccctgg cctgccgggt ccgcctggtc caaaaggtgc
     4861 taagggctcc tcgggtccaa ctggcccgaa gggtgaggca ggccacccag gacccccagg
     4921 ccccccgggc cccccgggag aggtcatcca gcccctgcca atccaggcat ccaggacgcg
     4981 gcggaacatc gacgccagcc agctgctgga cgacgggaat ggcgagaact acgtggacta
     5041 cgcggacggc atggaagaga tcttcggctc tctcaactct ctgaagctgg agattgagca
     5101 gatgaaacgg cccctgggca cgcagcagaa ccccgcccgc acctgcaagg acctgcagct
     5161 ctgccacccc gacttcccag atggtgaata ctgggtcgat cctaaccaag gatgctccag
     5221 ggattccttc aaggtttact gcaacttcac agccgggggg tcgacatgcg tcttccctga
     5281 caagaagtcc gaaggggcca gaatcacttc ttggcccaaa gaaaacccgg gctcctggtt
     5341 cagtgaattc aagcgtggga aactgctctc ctatgtggac gccgagggca accctgtggg
     5401 tgtggtacag atgaccttcc tgcggctgct gagcgcctct gcccaccaga acgtcaccta
     5461 ccactgctac cagtcagtgg cctggcagga cgcagccacg ggcagctacg acaaggccct
     5521 ccgcttcctg ggctccaacg acgaggagat gtcctatgac aacaacccct acatccgcgc
     5581 cctggtggac ggctgtgcta ccaagaaagg ctaccagaag acggttctgg agatcgacac
     5641 ccccaaagtg gagcaggtgc ccatcgcgga catcatgttc aatgacttcg gtgaagcgtc
     5701 acagaaattt ggatttgaag tggggccggc ttgcttcatg ggctaggagc cgccgagccc
     5761 gggctcccga gagcaacctc gtacctcagc atgccattgc ttcgtgagtg tcccgtgcac
     5821 gtcctgatcc tggacagtga aggcttctcc ctcccctccc acctgacttc atctacgcct
     5881 cggcaccacg gggtgtggga ccccagcccg gagagaacag agggaaggag ccgcgccccc
     5941 acctggagct gaatcacatg acctagctgc accccagcgc ctgggcccgc cccacgctct
     6001 gtccacaccc atgcgccccg ggagcggggc catgcctcca gccccccagc tcgcccgacc
     6061 catcctgttc gtgaataggt ctcaggggtt gggggaggga ctgccagatt tggacactat
     6121 atttttttct aaattcaact tgaagatgtg tatttcccct gaccttcaaa aaatgttcca
     6181 aggtaagcct cgtaaaggtc atcccaccat caccaaagcc tccgttttta acaacctcca
     6241 acacgatcca tttagaggcc aaatgtcatt ctgcaggtgc cttcccgatg gattaaaggt
     6301 gcttatgttt ttgtgagttt taagtaaata tttgtggaat ttaagatgaa gacagtaata
     6361 taatattttg gtacaaaaac agctgcagga agaggtggag gggggcctgt cattatgttt
     6421 cccccccacc ccccaacgaa aggaaaacta agactcccaa cataaacagg gccttgaagg
     6481 gggggattac aggcacttgg gcatggagtc ttcggctgca ggaagcactc cgcttattct
     6541 tcaggaatgg gaaaggcgtg acccaacgag agcatctgtc tcagagctcc actcagggtc
     6601 acccctctcc agaggccggt atggggtggc ttcagacttc cactgcacga cctggagcac
     6661 caagaccaca caccacaata ccaaattcac ccaagaagag gtctaattgt agtgttcaag
     6721 gagatgatgg agttgcagag agacctggtg ccaaatcgaa ggatacaggc agacaccaag
     6781 accaggaaga cggctatagc tgagatggcc agtgcaatgc gcagccctat agcacctctg
     6841 tgggagtcct cgatgcagct gctgtagatc cagaagagca aaagcaggag gcagtagagg
     6901 gccaagaggc cagaggcccc agctacaaag tagcacaggg atggtgctga gggacgggat
     6961 aaggccaggg aggagccatt cagggtggcc acaccataca ggggacatct accactgaag
     7021 gagccctggg tccgagtcat cgccgcggcc gccacggccc cgcacaggaa ggcggcagca
     7081 aagagcgcaa gctcgacgcg ctgcagccag gacagcgcca tggcgccctg accaccag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH004914. Human annexin V (...[gi:758692] Links  


LOCUS       HSANX5S01               1732 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, 5'-untranslated region, exons 1 and 2.
ACCESSION   U01681
VERSION     U01681.1  GI:430954
KEYWORDS    .
SEGMENT     1 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1732)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 1732)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..1732
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-46"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     promoter        1..1291
                     /function="regulatory region for transcription"
     repeat_region   695..705
                     /note="putative glucocorticoid response element half-site"
                     /rpt_type=dispersed
     repeat_region   798..819
                     /rpt_family="11-bp A-rich segment"
                     /rpt_type=tandem
     repeat_region   823..833
                     /note="putative glucocorticoid response element half-site"
                     /rpt_type=dispersed
     misc_feature    1018..1636
                     /standard_name="CpG island"
                     /note="GC-rich segment of 5'-upstream region through
                     intron 1; G+C averages 75.5% and (G+C)/(A+T) = 3.1"
     GC_signal       1225..1230
     GC_signal       1264..1269
     misc_signal     1291..1298
                     /standard_name="initiator element for TATA-less promoter"
                     /note="proposed transcription start site at `A' based on
                     the only consensus sequence for an initiator element of
                     TATA-less promoters between upstream SP1 sites and a known
                     mRNA start 33 bp downstream"
                     /label=Inr
     5'UTR           join(1292..1448,1681..1715)
                     /gene="ANX5"
                     /evidence=experimental
     exon            1292..1448
                     /gene="ANX5"
                     /number=1
                     /evidence=experimental
     intron          1449..1680
                     /gene="ANX5"
                     /number=1
                     /evidence=experimental
     exon            1681..1724
                     /gene="ANX5"
                     /number=2
                     /evidence=experimental
     intron          1725..>1732
                     /gene="ANX5"
                     /note="est. size 3+ kb; phase 0 intron"
                     /number=2
                     /evidence=experimental
BASE COUNT      353 a    515 c    470 g    394 t
ORIGIN      
        1 tctagaacac tgcatctagt gtcgcaaact tgtcctcttg ccccctctgc cctggcacct
       61 tccctcccca caccagtaag tctgagaagg tccctgtgtc ttctactttt ccttttccag
      121 catatggaga caaaagtgat tatatcccgg atgctaatcc gccatgttga cctttaataa
      181 ccccagtccc atgaatacct cctgattcct aggatttttt ttttaaactg tccttagcat
      241 aagaacatgt caaccttgat gctattgccc acattgtagg ctatgaagca tacggcattc
      301 tcacctgttc cggaggctgc ctttaattgt cttgcacaga gcagtatact ctttccttac
      361 ggtatataag gccagggtct ggggagtaac agtgcagaaa tttatctgct tgccgccgcc
      421 caaggccacg cttctgtcta ccacatcctc caatagcacc cctattacct acagactgga
      481 tttgtctgtc tcgttctttg gtttcttgac tccttcgcgt ttgggggctg ctttgcatat
      541 aaagcccttt cacagaacac agcaccatgc tagtacaata cgctgtagat tctccctccc
      601 tccccctctc tctcatatac tcatatatct tatgttgaac caatatgagg cattgctcaa
      661 atttaagtca tattaaagtt ctaggctagt tttgaaaaca gaaactgatt ggaagcagag
      721 gttttcaaat agcccacata cgctactaga aggctgtaca tttaagagag ggccatctag
      781 gaagcaataa taggcattaa aacaacaata aaacaacaaa acaaaacaga aacaaaaaca
      841 acttgggaaa cggccctcct ttcacgtttt ttctatccca tcgacaaagg cgcgctgtcc
      901 ttagctgcga tgattttgtc tcgcctccaa aaagacgccc acgcactatg ttgagcaccc
      961 aagtgaggct acggttcctg cggtcacaga gggcagggag gctcaagcac ctccaaaacc
     1021 ccgagccctg gacagctccc caggcccttc ccgcggcgcg aggacaagag gtctccgggg
     1081 ccctcggggg agcggcgcct cctcctggtt ccagcagctc tgcgccgctc cccacccagg
     1141 cccgcgagac cagcgggaca gtccgcgccg ggagaccaac tgggacgagc cgcgacccac
     1201 gcaggcgcgc tgaggccggg gcaggggcgg gcccggctgg cgcggccggc tgcggttggg
     1261 gctggcgggg gtgggacggg ccaagccggg cagggccggg gtggggcgct ggcgtttccg
     1321 ttgcttggat cagtctaggt gcagctgcgg atccttcagc gtctgcatct cggcgtcgcc
     1381 ccgcgtaccg tcgcccggct ctccgccgct ctcccggggg ttcggggcac ttgggtccca
     1441 cagtctgggt gagtggtcgc agcccgggga gggggctcct tctggagagg agagcgtggt
     1501 cgcggggcac tggattcgcg cggacgctcg gccgagagct gtcccggtag ctgcgagagg
     1561 gcgggtcggc ccgtggcggc gtccgggctg tctgagcgcg ccggtccccg cggacctgcg
     1621 cttggggagg gcacgagttg caaatggcgc gctaagcccg aggtttcttc tcttttgcag
     1681 tcctgcttca ccttcccctg acctgagtag tcgccatggc acaggtaagg cc
//
LOCUS       HSANX5S02                129 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 3.
ACCESSION   U01682
VERSION     U01682.1  GI:430955
KEYWORDS    .
SEGMENT     2 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 129)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 129)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..129
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-47"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..29
                     /gene="ANX5"
                     /note="est. size 3+ kb, phase 0 intron"
                     /number=2
                     /evidence=experimental
     exon            30..114
                     /gene="ANX5"
                     /number=3
                     /evidence=experimental
     intron          115..>129
                     /gene="ANX5"
                     /note="full size 1.9 kb; phase 1 intron"
                     /number=3
                     /evidence=experimental
BASE COUNT       32 a     24 c     35 g     38 t
ORIGIN      
        1 gaataatacc actgtgtttc gtgaattagg ttctcagagg cactgtgact gacttccctg
       61 gatttgatga gcgggctgat gcagaaactc ttcggaaggc tatgaaaggc ttgggtaaat
      121 ttagcttcc
//
LOCUS       HSANX5S03                149 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 4.
ACCESSION   U01683
VERSION     U01683.1  GI:430956
KEYWORDS    .
SEGMENT     3 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 149)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 149)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..149
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-47"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..34
                     /gene="ANX5"
                     /note="full size 1.9 kb; phase 1 intron"
                     /number=3
                     /evidence=experimental
     exon            35..129
                     /gene="ANX5"
                     /number=4
                     /evidence=experimental
     intron          130..>149
                     /gene="ANX5"
                     /note="full size 1.3 kb; phase 0 intron"
                     /number=4
                     /evidence=experimental
BASE COUNT       44 a     36 c     34 g     35 t
ORIGIN      
        1 ggagccaaaa gtaattgtta accataaatt gcaggcacag atgaggagag catcctgact
       61 ctgttgacat cccgaagtaa tgctcagcgc caggaaatct ctgcagcttt taagactctg
      121 tttggcaggg taagaccaca cttcatccc
//
LOCUS       HSANX5S04                268 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 5.
ACCESSION   U01684
VERSION     U01684.1  GI:430957
KEYWORDS    .
SEGMENT     4 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 268)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 268)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..268
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-47"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..71
                     /gene="ANX5"
                     /note="full size 1.3 kb; phase 0 intron"
                     /number=4
                     /evidence=experimental
     exon            72..185
                     /gene="ANX5"
                     /number=5
                     /evidence=experimental
     intron          186..>268
                     /gene="ANX5"
                     /note="full size 1.9 kb; phase 0 intron"
                     /citation=[1]
                     /number=5
                     /evidence=experimental
BASE COUNT       83 a     35 c     49 g    101 t
ORIGIN      
        1 cattcatatt taagaagaat cttattacaa tttcttgtac ggttaacaaa tttgtatttt
       61 ctgtgggtta ggatcttctg gatgacctga aatcagaact aactggaaaa tttgaaaaat
      121 taattgtggc tctgatgaaa ccctctcggc tttatgatgc ttatgaactg aaacatgcct
      181 tgaaggtaaa aaggatgagt gaaaggaaca ttcttgcata gatagtatta ttttgttttg
      241 gtaattactt cctcttattt gttatatt
//
LOCUS       HSANX5S05                242 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 6.
ACCESSION   U01685
VERSION     U01685.1  GI:430958
KEYWORDS    .
SEGMENT     5 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 242)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 242)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..242
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-49"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..42
                     /gene="ANX5"
                     /note="full size 1.9 kb; phase 0 intron"
                     /number=5
                     /evidence=experimental
     exon            43..133
                     /gene="ANX5"
                     /number=6
                     /evidence=experimental
     intron          134..>242
                     /gene="ANX5"
                     /note="full size 3.2 kb; phase 1 intron"
                     /number=6
                     /evidence=experimental
BASE COUNT       78 a     41 c     48 g     75 t
ORIGIN      
        1 ttaatcttat tgacatgttt tctctataaa atctccaaac agggagctgg aacaaatgaa
       61 aaagtactga cagaaattat tgcttcaagg acacctgaag aactgagagc catcaaacaa
      121 gtttatgaag aaggtaaatg tttgtgagtt atattttgac ttcacctcca ctatcacttc
      181 cttcatttct gtgatttgaa tgcagtcctt tgtaggtatg ggaggcacgc agtaaatgat
      241 gg
//
LOCUS       HSANX5S06                123 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 7.
ACCESSION   U01686
VERSION     U01686.1  GI:430959
KEYWORDS    .
SEGMENT     6 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 123)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 123)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..123
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-49"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..18
                     /gene="ANX5"
                     /note="full size 3.2 kb; phase 1 intron"
                     /number=6
                     /evidence=experimental
     exon            19..98
                     /gene="ANX5"
                     /number=7
                     /evidence=experimental
     intron          99..>123
                     /gene="ANX5"
                     /note="full size 0.5 kb; phase 0 intron"
                     /number=7
                     /evidence=experimental
BASE COUNT       28 a     19 c     38 g     38 t
ORIGIN      
        1 gaaattattt tgttgcagaa tatggctcaa gcctggaaga tgacgtggtg ggggacactt
       61 cagggtacta ccagcggatg ttggtggttc tccttcaggt atgcgtggag attaattacg
      121 ttt
//
LOCUS       HSANX5S07                154 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 8.
ACCESSION   U01687
VERSION     U01687.1  GI:430960
KEYWORDS    .
SEGMENT     7 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 154)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 154)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..154
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-49"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..63
                     /gene="ANX5"
                     /note="full size 0.5 kb; phase 0"
                     /number=7
                     /evidence=experimental
     exon            64..120
                     /gene="ANX5"
                     /number=8
                     /evidence=experimental
     intron          121..>154
                     /gene="ANX5"
                     /note="full size 5.7 kb; phase 0"
                     /number=8
                     /evidence=experimental
BASE COUNT       47 a     27 c     31 g     49 t
ORIGIN      
        1 ccatttttcc ttttgaatga gaatgctcta tagttcagta atgtgacata ttacctctca
       61 taggctaaca gagaccctga tgctggaatt gatgaagctc aagttgaaca agatgctcag
      121 gtgagtgaca ccaggatata tatgacttta ttat
//
LOCUS       HSANX5S08                255 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 9.
ACCESSION   U01688
VERSION     U01688.1  GI:430961
KEYWORDS    .
SEGMENT     8 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 255)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 255)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..255
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-49"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..109
                     /gene="ANX5"
                     /note="full size 5.7 kb; phase 0 intron"
                     /number=8
                     /evidence=experimental
     exon            110..203
                     /gene="ANX5"
                     /number=9
                     /evidence=experimental
     intron          204..>255
                     /gene="ANX5"
                     /note="full size 0.8 kb; phase 1 intron"
                     /number=9
                     /evidence=experimental
BASE COUNT       89 a     25 c     68 g     73 t
ORIGIN      
        1 gatgtgaatc aagtgagtta aaattcaggg gagtaaaaaa caagtgtgtt taagaacctg
       61 gaaatgagat ggaggaacaa tttattaatt gagtttgaaa tttttgcagg ctttatttca
      121 ggctggagaa cttaaatggg ggacagatga agaaaagttt atcaccatct ttggaacacg
      181 aagtgtgtct catttgagaa agggtgagta aaaaatctgt atggggcaga tccttggtaa
      241 tttggaaagt gcata
//
LOCUS       HSANX5S09                153 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 10.
ACCESSION   U01689
VERSION     U01689.1  GI:430962
KEYWORDS    .
SEGMENT     9 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 153)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 153)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..153
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-49"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..37
                     /gene="ANX5"
                     /note="full size 0.8 kb; phase 1 intron"
                     /number=9
                     /evidence=experimental
     exon            38..133
                     /gene="ANX5"
                     /number=10
                     /evidence=experimental
     intron          134..>153
                     /gene="ANX5"
                     /note="full size 1.4 kb; phase 1 intron"
                     /number=10
BASE COUNT       42 a     27 c     31 g     53 t
ORIGIN      
        1 atccctactg attagggaat atttttatct tttctagtgt ttgacaagta catgactata
       61 tcaggatttc aaattgagga aaccattgac cgcgagactt ctggcaattt agagcaacta
      121 ctccttgctg ttggtaggtg attacagtct atg
//
LOCUS       HSANX5S10                656 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exons 11 and 12.
ACCESSION   U01690
VERSION     U01690.1  GI:430963
KEYWORDS    .
SEGMENT     10 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 656)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 656)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..656
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-49"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     intron          <1..106
                     /gene="ANX5"
                     /note="full size 1.4 kb; phase 1"
                     /number=10
                     /evidence=experimental
     exon            107..165
                     /gene="ANX5"
                     /number=11
                     /evidence=experimental
     intron          166..391
                     /gene="ANX5"
                     /note="full size 225 bp; phase 0 intron"
                     /number=11
                     /evidence=experimental
     exon            392..514
                     /gene="ANX5"
                     /number=12
                     /evidence=experimental
     intron          515..>656
                     /gene="ANX5"
                     /note="full size 0.8 kb; phase 0 intron"
                     /number=12
                     /evidence=experimental
BASE COUNT      186 a    108 c    118 g    244 t
ORIGIN      
        1 ttttatagaa gtagagctat aagcaggaag cagcacttga tttttaatta cttaatcctt
       61 tacatatcaa aatagcattt gaccattgta ttgtttattt tcttagtgaa atctattcga
      121 agtatacctg cctaccttgc agagaccctc tattatgcta tgaaggtaag tggagaccat
      181 gaaggttcta tctaatacat ttctctttct aattatattc ttcacttttg tttatgtttt
      241 agatgataga ccttgggaat ttttccctta taacattcaa cccagttcta caaagttgtt
      301 ataccttatt gattcacccg cacccgggag catttgatcc attgttgaag ttttaagcat
      361 ttgcttctct cttttttttt tttttttaaa gggagctggg acagatgatc ataccctcat
      421 cagagtcatg gtttccagga gtgagattga tctgtttaac atcaggaagg agtttaggaa
      481 gaattttgcc acctctcttt attccatgat taaggtagta gaacctttct ttaaattacg
      541 tttgggaatc ctttgcagac tggtatgttg catgaaaaga caaggataag agaacctgat
      601 tcttgtgtta agtttaaaat tttagtgaca cgttagagca atgagttaaa aagctt
//
LOCUS       HSANX5S11                565 bp    DNA     linear   PRI 10-JAN-1997
DEFINITION  Human annexin V (ANX5) gene, exon 13 and 3'-untranslated region.
ACCESSION   U01691
VERSION     U01691.1  GI:430964
KEYWORDS    .
SEGMENT     11 of 11
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 565)
  AUTHORS   Fernandez,M.P., Morgan,R.O., Fernandez,M.R. and Carcedo,M.T.
  TITLE     The gene encoding human annexin V has a TATA-less promoter with a
            high G+C content
  JOURNAL   Gene 149 (2), 253-260 (1994)
  MEDLINE   95047484
   PUBMED   7958998
REFERENCE   2  (bases 1 to 565)
  AUTHORS   Fernandez,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-1993) Fernandez M.P., Universidad de Oviedo,
            Departamento de Biologia Funcional, c/Julian Claveria, 33071
            Oviedo, Asturias, Spain
FEATURES             Location/Qualifiers
     source          1..565
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q26"
                     /clone="clone lambda A5H-49"
                     /cell_line="WI38"
                     /cell_type="fibroblast"
                     /tissue_type="lung"
     gene            join(U01681.1:1292..1732,U01682.1:1..129,U01683.1:1..149,
                     U01684.1:1..268,U01685.1:1..242,U01686.1:1..123,
                     U01687.1:1..154,U01688.1:1..255,U01689.1:1..153,
                     U01690.1:1..656,1..565)
                     /gene="ANX5"
     mRNA            join(U01681.1:1292..1724,U01682.1:30..114,
                     U01683.1:35..129,U01684.1:72..185,U01685.1:43..133,
                     U01686.1:19..98,U01687.1:64..120,U01688.1:110..203,
                     U01689.1:38..133,U01690.1:107..165,U01690.1:392..514,
                     31..565)
                     /gene="ANX5"
     CDS             join(U01681.1:1716..1724,U01682.1:30..114,
                     U01683.1:35..129,U01684.1:72..185,U01685.1:43..133,
                     U01686.1:19..98,U01687.1:64..120,U01688.1:110..203,
                     U01689.1:38..133,U01690.1:107..165,U01690.1:392..514,
                     31..90)
                     /gene="ANX5"
                     /function="calcium channel inhibits phospholipase A2 &
                     protein kinase C"
                     /note="lipocortin V, endonexin II, PAP-I, PP4, VAC-alpha"
                     /codon_start=1
                     /evidence=experimental
                     /product="annexin V"
                     /protein_id="AAB40047.1"
                     /db_xref="GI:430966"
                     /translation="MAQVLRGTVTDFPGFDERADAETLRKAMKGLGTDEESILTLLTS
                     RSNAQRQEISAAFKTLFGRDLLDDLKSELTGKFEKLIVALMKPSRLYDAYELKHALKG
                     AGTNEKVLTEIIASRTPEELRAIKQVYEEEYGSSLEDDVVGDTSGYYQRMLVVLLQAN
                     RDPDAGIDEAQVEQDAQALFQAGELKWGTDEEKFITIFGTRSVSHLRKVFDKYMTISG
                     FQIEETIDRETSGNLEQLLLAVVKSIRSIPAYLAETLYYAMKGAGTDDHTLIRVMVSR
                     SEIDLFNIRKEFRKNFATSLYSMIKGDTSGDYKKALLLLCGEDD"
     intron          <1..30
                     /gene="ANX5"
                     /note="full size 0.8 kb; phase 0 intron"
                     /number=12
                     /evidence=experimental
     exon            31..565
                     /gene="ANX5"
                     /number=13
                     /evidence=experimental
     3'UTR           91..565
                     /gene="ANX5"
                     /evidence=experimental
     polyA_signal    538..543
                     /gene="ANX5"
BASE COUNT      163 a    107 c    104 g    191 t
ORIGIN      
        1 tggaggttaa tggaatacat ttggtttcag ggagatacat ctggggacta taagaaagct
       61 cttctgctgc tctgtggaga agatgactaa cgtgtcacgg ggaagagctc cctgctgtgt
      121 gcctgcacca ccccactgcc ttccttcagc acctttagct gcatttgtat gccagtgctt
      181 aacacattgc cttattcata ctagcatgct catgaccaac acatacacgt catagaagaa
      241 aatagtggtg cttctttctg atctctagtg gagatctctt tgactgctgt agtactaaag
      301 tgtacttaat gttactaagt ttaatgcctg gccattttcc atttatatat attttttaag
      361 aggctagagt gcttttagcc ttttttaaaa actccattta tattacattt gtaaccatga
      421 tactttaatt agaagcttag ccttgaaatt gtgaactctt ggaaatgtta ttagtgaagt
      481 tcgcaactaa actaaacctg taaaattatg atgattgtat tcaaaagatt aatgaaaaat
      541 aaacatttct gtccccctga attat
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

ProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: J03745. Human endonexin I...[gi:182111] Links  


LOCUS       HUMENN                  1592 bp    mRNA    linear   PRI 07-NOV-1994
DEFINITION  Human endonexin II mRNA, complete cds.
ACCESSION   J03745
VERSION     J03745.1  GI:182111
KEYWORDS    Ca2+ -dependent phospholipid binding protein; endonexin.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1592)
  AUTHORS   Kaplan,R., Jaye,M., Burgess,W.H., Schlaepfer,D.D. and Haigler,H.T.
  TITLE     Cloning and expression of cDNA for human endonexin II, a Ca2+ and
            phospholipid binding protein
  JOURNAL   J. Biol. Chem. 263 (17), 8037-8043 (1988)
  MEDLINE   88228020
   PUBMED   2967291
COMMENT     Original source text: Human placenta, cDNA to mRNA, (library of
            Clonetech Laboratories Inc.).
            Draft entry and computer-readable sequence for [1] kindly provided
            by H.T.Haigler, 06-APR-1988.
FEATURES             Location/Qualifiers
     source          1..1592
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q28-q32"
     gene            1..1592
                     /gene="ANX5"
     mRNA            <1..1592
                     /gene="ANX5"
                     /product="endonexin II mRNA"
     CDS             160..1122
                     /gene="ANX5"
                     /note="endonexin II"
                     /codon_start=1
                     /protein_id="AAA52386.1"
                     /db_xref="GI:182112"
                     /db_xref="GDB:G00-120-555"
                     /translation="MAQVLRGTVTDFPGFDERADAETLRKAMKGLGTDEESILTLLTS
                     RSNAQRQEISAAFKTLFGRDLLDDLKSELTGKFEKLIVALMKPSRLYDAYELKHALKG
                     AGTNEKVLTEIIASRTPEELRAIKQVYEEEYGSSLEDDVVGDTSGYYQRMLVVLLQAN
                     RDPDAGIDEAQVEQDAQALFQAGELKWGTDEEKFITIFGTRSVSHLRKVFDKYMTISG
                     FQIEETIDRETSGNLEQLLLAVVKSIRSIPAYLAETLYYAMKGAGTDDHTLIRVMVSR
                     SEIDLFNIRKEFRKNFATSLYSMIKGDTSGDYKKALLLLCGEDD"
BASE COUNT      434 a    337 c    366 g    455 t
ORIGIN      284 bp upstream of HincII site.
        1 ttggatcagt ctaggtgcag ctgccggatc cttcagcgtc tgcatctcgg cgtcgcccgc
       61 gtaccgtcgc ccggctctcc gccgctctcc cggggtttcg gggcacttgg gtcccacagt
      121 ctggtcctgc ttcaccttcc cctgacctga gtagtcgcca tggcacaggt tctcagaggc
      181 actgtgactg acttccctgg atttgatgag cgggctgatg cagaaactct tcggaaggct
      241 atgaaaggct tgggcacaga tgaggagagc atcctgactc tgttgacatc ccgaagtaat
      301 gctcagcgcc aggaaatctc tgcagctttt aagactctgt ttggcaggga tcttctggat
      361 gacctgaaat cagaactaac tggaaaattt gaaaaattaa ttgtggctct gatgaaaccc
      421 tctcggcttt atgatgctta tgaactgaaa catgccttga agggagctgg aacaaatgaa
      481 aaagtactga cagaaattat tgcttcaagg acacctgaag aactgagagc catcaaacaa
      541 gtttatgaag aagaatatgg ctcaagcctg gaagatgacg tggtggggga cacttcaggg
      601 tactaccagc ggatgttggt ggttctcctt caggctaaca gagaccctga tgctggaatt
      661 gatgaagctc aagttgaaca agatgctcag gctttatttc aggctggaga acttaaatgg
      721 gggacagatg aagaaaagtt tatcaccatc tttggaacac gaagtgtgtc tcatttgaga
      781 aaggtgtttg acaagtacat gactatatca ggatttcaaa ttgaggaaac cattgaccgc
      841 gagacttctg gcaatttaga gcaactactc cttgctgttg tgaaatctat tcgaagtata
      901 cctgcctacc ttgcagagac cctctattat gctatgaagg gagctgggac agatgatcat
      961 accctcatca gagtcatggt ttccaggagt gagattgatc tgtttaacat caggaaggag
     1021 tttaggaaga attttgccac ctctctttat tccatgatta agggagatac atctggggac
     1081 tataagaaag ctcttctgct gctctgtgga gaagatgact aacgtgtcac ggggaagagc
     1141 tccctgctgt gtgcctgcac caccccactg ccttccttca gcacctttag ctgcatttgt
     1201 atgccagtgc ttaacacatt gccttattca tactagcatg ctcatgacca acacatacac
     1261 gtcatagaat gaaaatagtg gtgcttcttt ctgatctcta gtggagatct ctttgactgc
     1321 tgtagtacta aagtgtactt aatgttacta agtttaatgc ctggccattt tccatttata
     1381 tatatttttt aagaggctag agtgctttta gcctttttta aaaactccat ttatattaca
     1441 tttgtaacca tgatacttta atcagaagct tagccttgaa attgtgaact cttggaaatg
     1501 ttattagtga agttcgcaac taaactaaac ctgtaaaatt atgatgattg tattcaaaag
     1561 attaatgaaa aataaacatt tctgtccccc tg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AB017363. Homo sapiens mRNA...[gi:3927882] Links  


LOCUS       AB017363                4350 bp    mRNA    linear   PRI 06-FEB-1999
DEFINITION  Homo sapiens mRNA for frizzled-1, complete cds.
ACCESSION   AB017363
VERSION     AB017363.1  GI:3927882
KEYWORDS    frizzled-1.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Sagara,N., Toda,G., Hirai,M., Terada,M. and Katoh,M.
  TITLE     Molecular cloning, differential expression, and chromosomal
            localization of human frizzled-1, frizzled-2, and frizzled-7
  JOURNAL   Biochem. Biophys. Res. Commun. 252 (1), 117-122 (1998)
  MEDLINE   99032814
REFERENCE   2  (bases 1 to 4350)
  AUTHORS   Katoh,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-SEP-1998) Masaru Katoh, National Cancer Center
            Research Institute, Genetics Division; Tsukiji 5-chome, Chuo-ku,
            Tokyo 104-0045, Japan (E-mail:mkatoh@ncc.go.jp,
            Tel:+81-3-3542-2511, Fax:+81-3-3541-2685)
FEATURES             Location/Qualifiers
     source          1..4350
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="7"
                     /map="7q21"
                     /tissue_type="lung"
                     /dev_stage="fetus"
     gene            1..4350
                     /gene="FZD1"
     CDS             414..2357
                     /gene="FZD1"
                     /codon_start=1
                     /product="frizzled-1"
                     /protein_id="BAA34666.1"
                     /db_xref="GI:3927883"
                     /translation="MAEEEAPKKSRAAGGGASWELCAGALSARLAEEGSGDAGGRRRP
                     PVDPRRLARQLLLLLWLLEAPLLLGVRAQAAGQGPGQGPGPGQQPPPPPQQQQSGQQY
                     NGERGISVPDHGYCQPISIPLCTDIAYNQTIMPNLLGHTNQEDAGLEVHQFYPLVKVQ
                     CSAELKFFLCSMYAPVCTVLEQALPPCRSLCERARQGCEALMNKFGFQWPDTLKCEKF
                     PVHGAGELCVGQNTSDKGTPTPSLLPEFWTSNPQHGGGGHRGGFPGGAGASERGKFSC
                     PRALKVPSYLNYHFLGEKDCGAPCEPTKVYGLMYFGPEELRFSRTWIGIWSVLCCAST
                     LFTVLTYLVDMRRFSYPERPIIFLSGCYTAVAVAYIAGFLLEDRVVCNDKFAEDGART
                     VAQGTKKEGCTILFMMLYFFSMASSIWWVILSLTWFLAAGMKWGHEAIEANSQYFHLA
                     AWAVPAIKTITILALGQVDGDVLSGVCFVGLNNVDALRGFVLAPLFVYLFIGTSFLLA
                     GFVSLFRIRTIMKHDGTKTEKLEKLMVRIGVFSVLYTVPATIVIACYFYEQAFRDQWE
                     RSWVAQSCKSYAIPCPHLQAGGGAPPHPPMSPDFTVFMIKYLMTLIVGITSGFWIWSG
                     KTLNSWRKFYTRLTNSKQGETTV"
BASE COUNT      917 a   1218 c   1236 g    979 t
ORIGIN      
        1 agttgaggga ttgacacaaa tggtcaggcg gcggcggcgg agaaggaggc ggaggcgcag
       61 gggggagccg agcccgctgg gctgcggaga gttgcgctct ctacggggcc gcggccacta
      121 gcgcggcgcc gccagccggg agccagcgag ccgagggcca ggaaggcggg acacgacccc
      181 ggcgcgccct agccacccgg gttctccccg ccgcccgcgc ttcatgaatc gcaagtttcc
      241 gcggcggcgg cggctgcggt acgcagaaca ggagccgggg gagcgggccg aaagcggctt
      301 gggctcgacg gagggcaccc gcgcagaggt ctccctggcc gcagggggag ccgccgccgg
      361 ccgtgcccct ggcagcccca gcggagcggc gccaagagag gagccgagaa agtatggctg
      421 aggaggaggc gcctaagaag tcccgggccg ccggcggtgg cgcgagctgg gaactttgtg
      481 ccggggcgct ctcggcccgg ctggcggagg agggcagcgg ggacgccggt ggccgccgcc
      541 gcccgccagt tgacccccgg cgattggcgc gccagctgct gctgctgctt tggctgctgg
      601 aggctccgct gctgctgggg gtccgggccc aggcggcggg ccaggggcca ggccaggggc
      661 ccgggccggg gcagcaaccg ccgccgccgc ctcagcagca acagagcggg cagcagtaca
      721 acggcgagcg gggcatctcc gtcccggacc acggctattg ccagcccatc tccatcccgc
      781 tgtgcacgga catcgcgtac aaccagacca tcatgcccaa cctgctgggc cacacgaacc
      841 aggaggacgc gggcctggag gtgcaccagt tctaccctct agtgaaagtg cagtgttccg
      901 ctgagctcaa gttcttcctg tgctccatgt acgcgcccgt gtgcaccgtg ctagagcagg
      961 cgctgccgcc ctgccgctcc ctgtgcgagc gcgcgcgcca gggctgcgag gcgctcatga
     1021 acaagttcgg cttccagtgg ccagacacgc tcaagtgtga gaagttcccg gtgcacggcg
     1081 ccggcgagct gtgcgtgggc cagaacacgt ccgacaaggg caccccgacg ccctcgctgc
     1141 ttccagagtt ctggaccagc aaccctcagc acggcggcgg agggcaccgt ggcggcttcc
     1201 cggggggcgc cggcgcgtcg gagcgaggca agttctcctg cccgcgcgcc ctcaaggtgc
     1261 cctcctacct caactaccac ttcctggggg agaaggactg cggcgcacct tgtgagccga
     1321 ccaaggtgta tgggctcatg tacttcgggc ccgaggagct gcgcttctcg cgcacctgga
     1381 ttggcatttg gtcagtgctg tgctgcgcct ccacgctctt cacggtgctt acgtacctgg
     1441 tggacatgcg gcgcttcagc tacccggagc ggcccatcat cttcttgtcc ggctgttaca
     1501 cggccgtggc cgtggcctac atcgccggct tcctcctgga agaccgagtg gtgtgtaatg
     1561 acaagttcgc cgaggacggg gcacgcactg tggcgcaggg caccaagaag gagggctgca
     1621 ccatcctctt catgatgctc tacttcttca gcatggccag ctccatctgg tgggtgatcc
     1681 tgtcgctcac ctggttcctg gcggctggca tgaagtgggg ccacgaggcc atcgaagcca
     1741 actcacagta ttttcacctg gccgcctggg ctgtgccggc catcaagacc atcaccatcc
     1801 tggcgctggg ccaggtggac ggcgatgtgc tgagcggagt gtgcttcgtg gggcttaaca
     1861 acgtggacgc gctgcgtggc ttcgtgctgg cgcccctctt cgtgtacctg tttatcggca
     1921 cgtcctttct gctggccggc tttgtgtcgc tcttccgcat ccgcaccatc atgaagcacg
     1981 atggcaccaa gaccgagaag ctggagaagc tcatggtgcg cattggcgtc ttcagcgtgc
     2041 tgtacactgt gccagccacc atcgtcatcg cctgctactt ctacgagcag gccttccggg
     2101 accagtggga acgcagctgg gtggcccaga gctgcaagag ctacgctatc ccctgccctc
     2161 acctccaggc gggcggaggc gccccgccgc acccgcccat gagcccggac ttcacggtct
     2221 tcatgattaa gtaccttatg acgctgatcg tgggcatcac gtcgggcttc tggatctggt
     2281 ccggcaagac cctcaactcc tggaggaagt tctacacgag gctcaccaac agcaaacaag
     2341 gggagactac agtctgagac ccggggctca gcccatgccc aggcctcggc cggggcgcag
     2401 cgatccccca aagccagcgc cgtggagttc gtgccaatcc tgacatctcg aggtttcctc
     2461 actagacaac tctctttcgc aggctccttt gaacaactca gctcctgcaa aagcttccgt
     2521 ccctgaggca aaaggacacg agggcccgac tgccagaggg aggatggaca gacctcttgc
     2581 cctcacactc tggtaccagg actgttcgct tttatgattg taaatagcct gtgtaagatt
     2641 tttgtaagta tatttgtatt taaatgacga ccgatcacgc gtttttcttt ttcaaaagtt
     2701 tttaattatt tagggcggtt taaccatttg aggcttttcc ttcttgccct tttcggagta
     2761 ttgcaaagga gctaaaactg gtgtgcaacc gcacagcgct cctggtcgtc ctcgcgcgcc
     2821 tctccctacc acgggtgctc gggacggctg ggcgccagct ccggggcgag ttcagcactg
     2881 cggggtgcga ctagggctgc gctgccaggg tcacttcccg cctcctcctt ttgccccctc
     2941 cccctccttc tgtcccctcc ctttctttcc tggcttgagg taggggctct taaggtacag
     3001 aactccacaa accttccaaa tctggaggag ggcccccata cattacaatt cctcccttgc
     3061 tcggcggtgg attgcgaagg cccgtccctt cgacttcctg aagctggatt tttaactgtc
     3121 cagaactttc ctccaacttc atgggggccc acgggtgtgg gcgctggcag tctcagcctc
     3181 cctccacggt caccttcaac gcccagacac tcccttctcc caccttagtt ggttacaggg
     3241 tgagtgagat aaccaatgcc aaactttttg aagtctaatt tttgaggggt gagctcattt
     3301 cattctctag tgtctaaaac ctggtatggg tttggccagc gtcatggaaa gatgtggtta
     3361 ctgagatttg ggaagaagca tgaagctttg tgtgggttgg aagagactga agatatgggt
     3421 tataaaatgt taattctaat tgcatacgga tgcctggcaa ccttgccttt gagaatgaga
     3481 cagcctgcgc ttagatttta ccggtctgta aaatggaaat gttgaggtca cctggaaagc
     3541 tttgttaagg agttgatgtt tgctttcctt aacaagacag caaaacgtaa acagaaattg
     3601 aaaacttgaa ggatatttca gtgtcatgga cttcctcaaa atgaagtgct attttcttat
     3661 ttttaatcaa ataactagac atatatcaga aactttaaaa tgtaaaagtt gtacactttc
     3721 aacattttat tacgattatt attcagcagc acattctgag gggggaacaa ttcacaccac
     3781 caataataac ctggtaagat ttcaggaggt aaagaaggtg gaataattga cggggagata
     3841 gcgcctgaaa taaacaaaat atgggcatgc atgctaaagg gaaaatgtgt gcaggtctac
     3901 tgcattaaat cctgtgtgct cctcttttgg atttacagaa atgtgtcaaa tgtaaatctt
     3961 tcaaagccat ttaaaaatat tcactttagt tctctgtgaa gaagaggaga aaagcaatcc
     4021 tcctgattgt attgttttaa actttaagaa tttatcaaaa tgccggtact taggacctaa
     4081 atttatctat gtctgtcata cgctaaaatg atattggtct ttgaatttgg tatacattta
     4141 ttctgttcac tatcacaaaa tcatctatat ttatagagga atagaagttt atatatatat
     4201 aataccatat ttttaatttc acaaataaaa aattcaaagt tttgtacaaa attatatgga
     4261 ttttgtgcct gaaaataata gagcttgagc tgtctgaact attttacatt ttatggtgtc
     4321 tcatagccaa tcccacagtg taaaaattca
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF153612. Homo sapiens pero...[gi:4929830] Links  


LOCUS       AF153612                1348 bp    mRNA    linear   PRI 01-JUN-1999
DEFINITION  Homo sapiens peroxisomal D3,D2-enoyl-CoA isomerase (PECI) mRNA,
            complete cds.
ACCESSION   AF153612
VERSION     AF153612.1  GI:4929830
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1348)
  AUTHORS   Geisbrecht,B.V., Zhang,D., Schulz,H. and Gould,S.J.
  TITLE     Characterization of PECI, a Novel Monofunctional D3,D2-Enoyl-CoA
            Isomerase of Mammalian Peroxisomes
  JOURNAL   J. Biol. Chem. (1999) In press
REFERENCE   2  (bases 1 to 1348)
  AUTHORS   Geisbrecht,B.V.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-1999) Biological Chemistry, The Johns Hopkins
            University School of Medicine, 725 N. Wolfe St., Baltimore, MD
            21205, USA
FEATURES             Location/Qualifiers
     source          1..1348
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..1348
                     /gene="PECI"
     CDS             103..1182
                     /gene="PECI"
                     /function="catalyzes the isomerization of peroxisomal
                     3-cis- and 2-trans-enoyl-CoAs"
                     /codon_start=1
                     /product="peroxisomal D3,D2-enoyl-CoA isomerase"
                     /protein_id="AAD34173.1"
                     /db_xref="GI:4929831"
                     /translation="MRASQKDFENSMNQVKLLKKDPGNEVKLKLYALYKQATEGPCNM
                     PKPGVFDLINKAKWDAWNALGSLPKEAARQNYVDLVSSLSPSLESSSQVEPGTDRKST
                     GFETLVVTSEDGITKIMFNRPKKKNAINTEMYHEIMRALKAASKDDSIITVLTGNGDY
                     YSSGNDLTNFTDIPPGGVEEKAKNNAVLLREFVGCFIDFPKPLIAVVNGPAVGISVTL
                     LGLFDAVYASDRATFHTPFSHLGQSPEGCSSYTFPKIMSPAKATEMLIFGKKLTAGEA
                     CAQGLVTEVFPDSTFQKEVWTRLKAFAKLPPNALRISKEVIRKREREKLHAVNAEECN
                     VLQGRWLSDECTNAVVNFLSRKSKL"
BASE COUNT      411 a    275 c    322 g    340 t
ORIGIN      
        1 caggtggcgt acttggcttg gagactggcg cggcgttcgt gtccgagttc tctgcaggtc
       61 actagtttcc cggtagttca gctgcacatg aatagaacag caatgagagc cagtcagaag
      121 gactttgaaa attcaatgaa tcaagtgaaa ctcttgaaaa aggatccagg aaacgaagtg
      181 aagctaaaac tctacgcgct atataagcag gccactgaag gaccttgtaa catgcccaaa
      241 ccaggtgtat ttgacttgat caacaaggcc aaatgggacg catggaatgc ccttggcagc
      301 ctgcccaagg aagctgccag gcagaactat gtggatttgg tgtccagttt gagtccttca
      361 ttggaatcct ctagtcaggt ggagcctgga acagacagga aatcaactgg gtttgaaact
      421 ctggtggtga cctccgaaga tggcatcaca aagatcatgt tcaaccggcc caaaaagaaa
      481 aatgccataa acactgagat gtatcatgaa attatgcgtg cacttaaagc tgccagcaag
      541 gatgactcaa tcatcactgt tttaacagga aatggtgact attacagtag tgggaatgat
      601 ctgactaact tcactgatat tccccctggt ggagtagagg agaaagctaa aaataatgcc
      661 gttttactga gggaatttgt gggctgtttt atagattttc ctaagcctct gattgcagtg
      721 gtcaatggtc cagctgtggg catctccgtc accctccttg ggctattcga tgccgtgtat
      781 gcatctgaca gggcaacatt tcatacacca tttagtcacc taggccaaag tccggaagga
      841 tgctcctctt acacttttcc gaagataatg agcccagcca aggcaacaga gatgcttatt
      901 tttggaaaga agttaacagc gggagaggca tgtgctcaag gacttgttac tgaagttttc
      961 cctgatagca cttttcagaa agaagtctgg accaggctga aggcatttgc aaagcttccc
     1021 ccaaatgcct tgagaatttc aaaagaggta atcaggaaaa gagagagaga aaaactacac
     1081 gctgttaatg ctgaagaatg caatgtcctt cagggaagat ggctatcaga tgaatgcaca
     1141 aatgctgtgg tgaacttctt atccagaaaa tcaaaactgt gatgaccact acagcagagt
     1201 aaagcatgtc caaggaagga tgtgctgtta cctctgattt ccagtactgg aactaaataa
     1261 gcttcattgt gccttttgta gtgctagaat atcaattaca atgatgatat ttcactacag
     1321 ctctgatgaa taaaaagttt tgtaaaac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L12535. Human RSU-1/RSP-1...[gi:434050] Links  


LOCUS       HUMRSU1A                2194 bp    mRNA    linear   PRI 09-JAN-1995
DEFINITION  Human RSU-1/RSP-1 mRNA, complete cds.
ACCESSION   L12535
VERSION     L12535.1  GI:434050
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2194)
  AUTHORS   Tsuda,T. and Cutler,M.
  TITLE     Isolation of rsp-1, a novel cDNA capable of suppressing v-Ras
            transformant
  JOURNAL   Unpublished (1993)
REFERENCE   2  (bases 1 to 2194)
  AUTHORS   Cutler,M.L., Bassin,R.H., Zanoni,L. and Talbot,N.
  TITLE     Isolation of rsp-1, a novel cDNA cpable of suppressing v-Ras
            transformant
  JOURNAL   Mol. Cell. Biol. 12, 3752-3756 (1993)
COMMENT     Original source text: Homo sapiens adult skin cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..2194
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_type="fibroblast"
                     /tissue_type="skin"
                     /dev_stage="adult"
     gene            1..2194
                     /gene="RSU-1"
     CDS             828..1661
                     /gene="RSU-1"
                     /note="homologous to mouse Rsu-1; putative"
                     /codon_start=1
                     /protein_id="AAA60292.1"
                     /db_xref="GI:434051"
                     /translation="MSKSLKKLVEESREKNQPEVDMSDRGISNMLDVNGLFTLSHITQ
                     LVLSHNKLTMVPPNIAELKNLEVLNFFNNQIEELPTQISSLQKLKHLNLGMNRLNTLP
                     RGFGSLPALEVLDLTYNNLSENSLPGNFFYLTTLRALYLSDNDFEILPPDIGKLTKLQ
                     ILSLRDNDLISLPKEIGELTQLKELHIQGNRLTVLPPELGNLDLTGQKQVFKAENNPW
                     VTPIADQFQLGVSHVFEYIRSETYKYLYGRHMQANPEPPKKNNDKSKKISRKPLAAKN
                     R"
     polyA_site      2194
                     /gene="RSU-1"
BASE COUNT      517 a    588 c    524 g    565 t
ORIGIN      
        1 gaattccgtt tttttttttt gacgtgacat ctctttattg gttcagtcta tgcctggccc
       61 agcggcacgc ccaggtccag ggggggtctg gttggaggtc tctggacagt caggggcagc
      121 atcagcagaa accctgggtg tcggggggct gtgggagtag cagcactggt ccccgcgttg
      181 acaggtgccc tccttaaacc tcttgcaagg aaatgtcttg agccgggcag cccgctggca
      241 acatcttccg gcggctcctt ccgcttgttc tgaagcagcc tgcgtctctc ctccatagct
      301 tcgtcggccg ccttgcccct gtgcgagggg aggagtaggt gggaccccat ctgcccttta
      361 ccccttcttt ccctggctga gccccacccc caccttacca ccactcaccc tggccgggca
      421 cgaggccggg gaccccagtt ggcctccggg ttagcctgga agcgcttgag tcctcgctgt
      481 cgggagctgg ggttggcatc ttcccggatg gtgaagttgg cctgcagcct gggctccaca
      541 ctgagaatgc ggaacgctgg gctagtctct gatagtcgct cccgtttcac cggacattcc
      601 ttctcccagc aacgcatgat gtcccagagg gcagaggcag gggcatccgt cttcacagcg
      661 ttcttacagg cgtgggagag tgagacccgg aagtcagcgt ggaggagggc cgaccgcaac
      721 tgcaggaggc ttggtgtgtt gcagtggatg gtgctgctca gctggtgtgc gttctgccga
      781 agcttgtggt tgcacgccca tcgtcttagg ggctaccttc cgtgaccatg tccaagtctc
      841 tgaagaagtt ggtggaggag agccgggaga agaaccagcc cgaggtggac atgagtgacc
      901 ggggcatctc caacatgctg gatgtcaacg gcctctttac cttatcccat atcacacaac
      961 tggtcctcag ccataacaag ctaacaatgg tgccaccgaa catcgcagaa ctgaagaatt
     1021 tggaggtgct caactttttt aataaccaaa tcgaggagct gcccacacag atcagtagcc
     1081 ttcagaaact caaacacctg aaccttggca tgaacaggct gaacactttg ccacgaggct
     1141 tcggctccct gccagctctt gaggttctgg acttgacgta caacaacttg agcgaaaatt
     1201 ctcttcctgg aaacttcttc tacctgacca ccctgcgtgc actctatcta agtgacaacg
     1261 attttgaaat cctgccgcca gatattggga agctcacaaa gttgcagata ctcagcctta
     1321 gggataacga cctgatctcg ctgcctaagg aaatcgggga gcttacccag cttaaagagc
     1381 tccacattca ggggaaccgc ctcaccgttc tgcccccaga actaggaaac ttggatttaa
     1441 ctggccagaa gcaggtattc aaagcagaga acaatccctg ggtgaccccc attgcagacc
     1501 agttccagct tggcgtgtcc catgtttttg agtatatccg ttctgagaca tacaaatacc
     1561 tctacggcag acacatgcag gccaacccag aaccaccgaa gaagaataat gacaaatcga
     1621 aaaagatcag ccggaaaccc ctggcagcca agaacagata aggaagggat tggcatcggc
     1681 tggccttcca gcaccttctc tctccaacac ttcattctct cttgccctgt ctctcaaata
     1741 aacccaatgc tgcgtgtgag gcctttttta tttttctttt cactctcttt ctaatgcttc
     1801 ccaccttacc ttttagattc ttttgctagg tgggagattg ttataaggtc tttaaaccat
     1861 ttccatttgt ttctttaaca ttaccaaaag cagggaacaa agctcttatt caactgcgaa
     1921 ttccatagtg ggctctggct tttcttgaat agatatcaca aggttgctta ttatcaaaag
     1981 aataattaaa atcatgtaac catttaaatg tcactgttaa cacttttcac tctttctgtt
     2041 gattcaccta actcattatt ttgctttatt aaaagtcttc cttcaccacc gagatatgct
     2101 aatttaactt acaaatgatt ttaataaaat cttgagtttg tatcacatgt tacttattga
     2161 ctcagaataa aagaacagtc tgatcttggg gtat
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U60644. Human HU-K4 mRNA,...[gi:1575346] Links  


LOCUS       HSU60644                2131 bp    mRNA    linear   PRI 08-OCT-1996
DEFINITION  Human HU-K4 mRNA, complete cds.
ACCESSION   U60644
VERSION     U60644.1  GI:1575346
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2131)
  AUTHORS   Upton,C., Cao,J. and Koop,B.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JUN-1996) Biochemistry & Microbiology, University of
            Victoria, PO Box 3055, Victoria, BC V8W 3P6, Canada
FEATURES             Location/Qualifiers
     source          1..2131
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="I.M.A.G.E Consortium Clone ID 159455"
                     /sex="female"
                     /tissue_type="breast"
                     /dev_stage="adult"
     CDS             488..1801
                     /note="similar to Vaccinia virus HindIII K4L ORF, and to
                     Vaccinia virus p37 (HindIII F13L ORF)"
                     /codon_start=1
                     /product="HU-K4"
                     /protein_id="AAB16799.1"
                     /db_xref="GI:1575347"
                     /translation="MTQLFLWEYGDLHLFGPNQRPAPCYDPCEAVLVESIPEGLDFPN
                     ASTGNPSTSQAWLGLLAGAHSSLDIASFYWTLTNNDTHTQEPSAQQGEEVLRQLQTLA
                     PKGVNVRIAVSKPSGPQPQADLQALLQSGAQVRMVDMQKLTHGVLHTKFWVVDQTHFY
                     LGSANMDWRSLTQVKELGVVMYNCSCLARDLTKIFEAYWFLGQAGSSIPSTWPRFYDT
                     RYNQETPMEICLNGTPALAYLASAPPPLCPSGRTPDLKALLNVVDNARSFIYVAVMNY
                     LPTLEFSHPHRFWPAIDDGLRRATYERGVKVRLLISCWGHSEPSMRAFLLSLAALRDN
                     HTHSDIQVKLFVVPADEAQARIPYARVNHNKYMVTERATYIGTSNWSGNYFTETAGTS
                     LLVTQNGRGGLRSQLEAIFLRDWDSPYIHDLDTSADSVGNACRLL"
BASE COUNT      426 a    710 c    598 g    397 t
ORIGIN      
        1 ctctttataa tttagtttcc atagaagtta tatgtgcatt taaaaaaatt caatgctgga
       61 gcgaccgtgt ctggggagcc gagccccgct tctcgctgcg gtgagcccgg actggggcac
      121 gcactgcgca gactccccgc tgcagtgggc ggagtcccac aggccccgcc cctcctccca
      181 ccctcgttca gcctgtccag acagaagctg gggcccagcg gaggtagcag cagacgcctg
      241 agagcgaggc cgaggccctc agggtttgga gaccctgaca cacccacctt ctcacctggg
      301 ctctgcgtat cccccagcct tgagggaaga tgaagcctaa actgatgtac caggagctga
      361 aggtgcctgc agaggagccc gccaatgagc tgcccatgaa tgagattgag gcgtggaagg
      421 ctgcggaaaa gaaagcccgc tgggtcctgc tggtcctcat tctggcggtt gtgggcttcg
      481 gagcctgatg actcagctgt ttctatggga atacggcgac ttgcatctct ttgggcccaa
      541 ccagcgccca gccccctgct atgacccttg cgaagcagtg ctggtggaaa gcattcctga
      601 gggcctggac ttccccaatg cctccacggg gaacccttcc accagccagg cctggctggg
      661 cctgctcgcc ggtgcgcaca gcagcctgga catcgcctcc ttctactgga ccctcaccaa
      721 caatgacacc cacacgcagg agccctctgc ccagcagggt gaggaggtcc tccggcagct
      781 gcagaccctg gcaccaaagg gcgtgaacgt ccgcatcgct gtgagcaagc ccagcgggcc
      841 ccagccacag gcggacctgc aggctctgct gcagagcggt gcccaggtcc gcatggtgga
      901 catgcagaag ctgacccatg gcgtcctgca taccaagttc tgggtggtgg accagaccca
      961 cttctacctg ggcagtgcca acatggactg gcgttcactg acccaggtca aggagctggg
     1021 cgtggtcatg tacaactgca gctgcctggc tcgagacctg accaagatct ttgaggccta
     1081 ctggttcctg ggccaggcag gcagctccat cccatcaact tggccccggt tctatgacac
     1141 ccgctacaac caagagacac caatggagat ctgcctcaat ggaacccctg ctctggccta
     1201 cctggcgagt gcgcccccac ccctgtgtcc aagtggccgc actccagacc tgaaggctct
     1261 actcaacgtg gtggacaatg cccggagttt catctacgtc gctgtcatga actacctgcc
     1321 cactctggag ttctcccacc ctcacaggtt ctggcctgcc attgacgatg ggctgcggcg
     1381 ggccacctac gagcgtggcg tcaaggtgcg cctgctcatc agctgctggg gacactcgga
     1441 gccatccatg cgggccttcc tgctctctct ggctgccctg cgtgacaacc atacccactc
     1501 tgacatccag gtgaaactct ttgtggtccc cgcggatgag gcccaggctc gaatcccata
     1561 tgcccgtgtc aaccacaaca agtacatggt gactgaacgc gccacctaca tcggaacctc
     1621 caactggtct ggcaactact tcacggagac ggcgggcacc tcgctgctgg tgacgcagaa
     1681 tgggaggggc ggcctgcgga gccagctgga ggccattttc ctgagggact gggactcccc
     1741 ttacattcat gaccttgaca cctcagctga cagcgtgggc aacgcctgcc gcctgctctg
     1801 aggcccgatc cagtgggcag gccaaggcct gctgggcccc cgcggaccca ggtgctctgg
     1861 gtcacggtcc ctgtccccgc acccccgctt ctgtctgccc cattgtggct cctcaggctc
     1921 tctcccctgc tctcccacct ctacctccac ccccaccggc ctgacgctgt ggccccggga
     1981 cccagcagag ctgggggagg gatcagcccc caaagaaatg ggggtgcatg ctggcctgcc
     2041 ccctggccca cccccacttt ccagggcaaa aagggcccag ggttataata agtaaataac
     2101 ttgtctgtaa aaaaaaaaaa aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF013759. Homo sapiens calu...[gi:3153208] Links  


LOCUS       AF013759                3316 bp    mRNA    linear   PRI 27-MAY-1998
DEFINITION  Homo sapiens calumein (Calu) mRNA, complete cds.
ACCESSION   AF013759
VERSION     AF013759.1  GI:3153208
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3316)
  AUTHORS   Yabe,D., Taniwaki,M., Nakamura,T., Kanazawa,N., Tashiro,K. and
            Honjo,T.
  TITLE     Human calumenin gene (CALU): cDNA isolation and chromosomal mapping
            to 7q32
  JOURNAL   Genomics 49 (2), 331-333 (1998)
  MEDLINE   98260687
   PUBMED   9598325
REFERENCE   2  (bases 1 to 3316)
  AUTHORS   Yabe,D., Taniwaki,M., Nakamura,T., Kanazawa,N., Tashiro,K. and
            Honjo,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-JUL-1997) Department of Medical Chemistry, Kyoto
            University Faculty of Medicine, Yoshida Konoe-cho, Sakyo-ku, Kyoto
            606, Japan
FEATURES             Location/Qualifiers
     source          1..3316
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="7"
                     /map="7q32"
     source          1..2415
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /note="derived from EST clone 645121"
     gene            1..3316
                     /gene="Calu"
     CDS             59..1006
                     /gene="Calu"
                     /function="Ca2+-binding in the ER"
                     /note="member of one subset of EF-hand superfamily that
                     includes reticulocalbin, Erc-55, and Cab-45"
                     /codon_start=1
                     /product="calumein"
                     /protein_id="AAC17216.1"
                     /db_xref="GI:3153209"
                     /translation="MDLRQFLMCLSLCTAFALSKPTEKKDRVHHEPQLSDKVHNDAQS
                     FDYDHDAFLGAEEAKTFDQLTPEESKERLGKIVSKIDGDKDGFVTVDELKDWIKFAQK
                     RWIYEDVERQWKGHDLNEDGLVSWEEYKNATYGYVLDDPDPDDGFNYKQMMVRDERRF
                     KMADKDGDLIATKEEFTAFLHPEEYDYMKDIVVQETMEDIDKNADGFIDLEEYIGDMY
                     SHDGNTDEPEWVKTEREQFVEFRDKNRDGKMDKEETKDWILPSDYDHAEAEARHLVYE
                     SDQNKDGKLTKEEIVDKYDLFVGSQATDFGEALVRHDEF"
     sig_peptide     59..115
                     /gene="Calu"
     misc_feature    299..334
                     /gene="Calu"
                     /note="Region: EF-hand motif I"
     misc_feature    407..442
                     /gene="Calu"
                     /note="Region: EF-hand motif II"
     misc_feature    521..550
                     /gene="Calu"
                     /note="Region: EF-hand motif III"
     misc_feature    656..694
                     /gene="Calu"
                     /note="Region: EF-hand motif IV"
     misc_feature    782..817
                     /gene="Calu"
                     /note="Region: EF-hand motif V"
     misc_feature    860..925
                     /gene="Calu"
                     /note="Region: EF-hand motif VI"
     misc_feature    922..1003
                     /gene="Calu"
                     /note="encodes C-terminal ER retention signal"
BASE COUNT      969 a    656 c    718 g    973 t
ORIGIN      
        1 gtgagcggcg gccacggcat cctgtgctgt gggggctacg aggaaagatc taattatcat
       61 ggacctgcga cagtttctta tgtgcctgtc cctgtgcaca gcctttgcct tgagcaaacc
      121 cacagaaaag aaggaccgtg tacatcatga gcctcagctc agtgacaagg ttcacaatga
      181 tgctcagagt tttgattatg accatgatgc cttcttgggt gctgaagaag caaagacctt
      241 tgatcagctg acaccagaag agagcaagga aaggcttgga aagattgtaa gtaaaataga
      301 tggcgacaag gacgggtttg tcactgtgga tgagctcaaa gactggatta aatttgcaca
      361 aaagcgctgg atttacgagg atgtagagcg acagtggaag gggcatgacc tcaatgagga
      421 cggcctcgtt tcctgggagg agtataaaaa tgccacctac ggctacgttt tagatgatcc
      481 agatcctgat gatggattta actataaaca gatgatggtt agagatgagc ggaggtttaa
      541 aatggcagac aaggatggag acctcattgc caccaaggag gagttcacag ctttcctgca
      601 ccctgaggag tatgactaca tgaaagatat agtagtacag gaaacaatgg aagatataga
      661 taagaatgct gatggtttca ttgatctaga agagtatatt ggtgacatgt acagccatga
      721 tgggaatact gatgagccag aatgggtaaa gacagagcga gagcagtttg ttgagtttcg
      781 ggataagaac cgtgatggga agatggacaa ggaagagacc aaagactgga tccttccctc
      841 agactatgat catgcagagg cagaagccag gcacctggtc tatgaatcag accaaaacaa
      901 ggatggcaag cttaccaagg aggagatcgt tgacaagtat gacttatttg ttggcagcca
      961 ggccacagat tttggggagg ccttagtacg gcatgatgag ttctgagctg cggaggaacc
     1021 ctcatttcct caaaagtaat ttatttttac agcttctggt ttcacatgaa attgtttgcg
     1081 ctactgagac tgttactaca aactttttaa gacatgaaaa ggcgtaatga aaaccatccc
     1141 gtccccattc ctcctcctct ctgagggact ggagggaagc cgtgcttctg aggaacaact
     1201 ctaattagta cacttgtgtt tgtagattta cactttgtat tatgtattaa catggcgtgt
     1261 ttatttttgt atttttctct ggttgggagt atgatatgaa ggatcaagat cctccactca
     1321 cacatgtaga caaacattag ctctttactc tttctcaacc ccttatatga ttttaataat
     1381 tctcacttca ctaattttgt aagcctgaga tcaataagaa atgttcagga gagaggaaag
     1441 aaaaaatata tatgctccac aatttatatt tagagagaga acacttagtc ttgcctgtca
     1501 aaaagtccaa catttcatag gtagtagggg ccacatatta cattcagttg ctataggtcc
     1561 agcaactgaa cctgccatta cctgggcaag gaaagatccc tttgctctag gaaagcttgg
     1621 cccaaattga ttttcttctt tttccccctg taggactgac tgttggctaa ttttgtcaag
     1681 cacagctgtg gtgggaagag ttagggccag tgtcttgaaa atcaatcaag tagtgaatgt
     1741 gatctctttg cagagctata gatagaaaca gctggaaaac taaaggaaaa atacaaatgt
     1801 tttcggggca tacatttttt ttctgggtgt gcatctgttg aaatgctcaa gacttaatta
     1861 tttgcctttt gaaatcactg taaatgcccc catccggttc ctcttcttcc caggtgtgcc
     1921 aaggaattaa tcttggtttc actacaatta aaattcactc ctttccaatc atgtcattga
     1981 aagtgccttt aacgaaagaa atggtcactg aatgggaatt ctcttaagaa accctgagat
     2041 taaaaaaaga ctatttggat aacttatagg aaagcctaga acctcccagt agagtgggga
     2101 tttttttctt cttccctttc tcttttggac aatagttaaa ttagcagtat tagttatgag
     2161 tttggttgca gtgttcttat cttgtgggct gatttccaaa aaccacatgc tgctgaattt
     2221 accagggatc ctcatacctc acaatgcaaa ccacttacta ccaggccttt ttctgtgtcc
     2281 actggagagc ttgagctcac actcaaagat cagaggacct acagagaggg ctctttggtt
     2341 tgaggaccat ggcttacctt tcctgccttt gacccatcac accccatttc ctcctctttc
     2401 cctctccccg ctgccaaaaa aaaaaaaaag gaaacgttta tcatgaatca acagggtttc
     2461 agtccttatc aaagagagat gtggaaagag ctaaagaaac caccctttgt tcccaactcc
     2521 actttaccca tattttatgc aacacaaaca ctgtcctttt gggtcccttt cttacagatg
     2581 gacctcttga gaagaattat cgtattccac gtttttagcc ctcaggttac caagataaat
     2641 atatgtatat ataaccttta ttattgctat atctttgtgg ataatacatt caggtggtgc
     2701 tgggtgattt attataatct gaacctaggt atatcctttg gtcttccaca gtcatgttga
     2761 ggtgggctcc ctggtatggt aaaaagccag gtataatgta acttcacccc agcctttgta
     2821 ctaagctctt gatagtggat atactctttt aagtttagcc ccaatatagg gtaatggaaa
     2881 tttcctgccc tctgggttcc ccatttttac tattaagaag accagtgata atttaataat
     2941 gccaccaact ctggcttagt taagtgagag tgtgaactgt gtggcaagag agcctcacac
     3001 ctcactaggt gcagagagcc caggccttat gttaaaatca tgcacttgaa aagcaaacct
     3061 taatctgcaa agacagcagc aagcattata cggtcatctt gaatgatccc tttgaaattt
     3121 tttttttgtt tgtttgttta aatcaagcct gaggctggtg aacagtagct acacacccat
     3181 attgtgtgtt ctgtgaatgc tagctctctt gaatttggat attggttatt ttttatagag
     3241 tgtaaaccaa gttttatatt ctgcaatgcg aacaggtacc tatctgtttc taaataaaac
     3301 tgtttacatt caaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U67280. Homo sapiens calu...[gi:2809323] Links  


LOCUS       HSU67280                1035 bp    mRNA    linear   PRI 22-MAR-2000
DEFINITION  Homo sapiens calumenin mRNA, complete cds.
ACCESSION   U67280
VERSION     U67280.1  GI:2809323
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1035)
  AUTHORS   Vorum,H., Liu,X., Madsen,P., Rasmussen,H.H. and Honore,B.
  TITLE     Molecular cloning of a cDNA encoding human calumenin, expression in
            Escherichia coli and analysis of its Ca2+-binding activity
  JOURNAL   Biochim. Biophys. Acta 1386 (1), 121-131 (1998)
  MEDLINE   98342150
   PUBMED   9675259
REFERENCE   2  (bases 1 to 1035)
  AUTHORS   Liu,X., Rasmussen,H.H., Celis,J.E. and Honore,B.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1996) Dept. of Med. Biochem., University of
            Aarhus, Ole Worms Alle, Bldg. 170, DK-8000 Aarhus C, Denmark
FEATURES             Location/Qualifiers
     source          1..1035
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="9268.8B"
                     /cell_type="keratinocytes"
     CDS             63..1010
                     /note="multiple EF-hand protein"
                     /codon_start=1
                     /product="calumenin"
                     /protein_id="AAB97725.1"
                     /db_xref="GI:2809324"
                     /translation="MDLRQFLMCLSLCTAFALSKPTEKKDRVHHEPQLSDKVHNDAQS
                     FDYDHDAFLGAEEAKTFDQLTPEESKERLGKIVSKIDGDKDGFVTVDELKDWIKFAQK
                     RWIYEDVERQWKGHDLNEDGLVSWEEYKNATYGYVLDDPDPDDGFNYKQMMVRDERRF
                     KMADKDGDLIATKEEFTAFLHPEEYDYMKDIVVQETMEDIDKNADGLIDLEEYIGDMY
                     SHDGNTDEPEWVKTEREQFVEFRDKNRDGKMDKEETKDWILPSDYDHAEAEARHLVYE
                     SDQNKDGKLTKEEIVDKYDLFVGSQATDFGEALVRHDEF"
BASE COUNT      314 a    193 c    294 g    234 t
ORIGIN      
        1 gtgggtgagc ggcggccacg gcatcctgtg ctgtgggggc tacgaggaaa gatctaatta
       61 tcatggacct gcgacagttt cttatgtgcc tgtccctgtg cacagccttt gccttgagca
      121 aacccacaga aaagaaggac cgtgtacatc atgagcctca gctcagtgac aaggttcaca
      181 atgatgctca gagttttgat tatgaccatg atgccttctt gggtgctgaa gaagcaaaga
      241 cctttgatca gctgacacca gaagagagca aggaaaggct tggaaagatt gtaagtaaaa
      301 tagatggcga caaggacggg tttgtcactg tggatgagct caaagactgg attaaatttg
      361 cacaaaagcg ctggatttac gaggatgtag agcgacagtg gaaggggcat gacctcaatg
      421 aggacggcct cgtttcctgg gaggagtata aaaatgccac ctacggctac gttttagatg
      481 atccagatcc tgatgatgga tttaactata aacagatgat ggttagagat gagcggaggt
      541 ttaaaatggc agacaaggat ggagacctca ttgccaccaa ggaggagttc acagctttcc
      601 tgcaccctga ggagtatgac tacatgaaag atatagtagt acaggaaaca atggaagata
      661 tagataagaa tgctgatggt ctcattgatc tagaagagta tattggtgac atgtacagcc
      721 atgatgggaa tactgatgag ccagaatggg taaagacaga gcgagagcag tttgttgagt
      781 ttcgggataa gaaccgtgat gggaagatgg acaaggaaga gaccaaagac tggatccttc
      841 cctcagacta tgatcatgca gaggcagaag ccaggcacct ggtctatgaa tcagaccaaa
      901 acaaggatgg caagcttacc aaggaggaga tcgttgacaa gtatgactta tttgttggca
      961 gccaggccac agattttggg gaggccttag tacggcatga tgagttctga gctacggagg
     1021 aaccctcatt tcctc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH005295. Human amyloid-bet...[gi:178614] Links  


LOCUS       HUMAMYB01               1154 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 1.
ACCESSION   M34862
VERSION     M34862.1  GI:178595
KEYWORDS    amyloid-beta protein.
SEGMENT     1 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 1154)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
FEATURES             Location/Qualifiers
     source          1..1154
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     protein_bind    484..490
                     /gene="APP"
                     /note="G00-119-692"
                     /bound_moiety="AP-1"
     misc_feature    517..530
                     /gene="APP"
                     /note="heat shock element; G00-119-692"
     protein_bind    789..795
                     /gene="APP"
                     /note="G00-119-692"
                     /bound_moiety="AP-1"
     exon            834..1037
                     /gene="APP"
                     /note="G00-119-692"
                     /number=1
BASE COUNT      237 a    367 c    341 g    209 t
ORIGIN      
        1 gatctctcct aatttttcct tcttccccaa ctcagatgga tgttacatcc ctgcttaaca
       61 acaaaaaaag accccccgcc ccgcaaaatc cacactgacc acccccttta acaaaacaaa
      121 accaaaaaca aacaaaaata taagaaagaa acaaaaccca agcccagaac cctgctttca
      181 agaagaagta aatgggttgg ccgcttcttt gccaggtcct gcgccttgct cctttggttc
      241 gttctaaaga tagaaattcc aggttgctcg tgcctgcttt tgacgttggg ggttaaaaaa
      301 tgaggttttg ctgtctcaac aagcaaagaa aatcctattt cctttaagct tcactcgttc
      361 tcattctctt ccagaaacgc ctgccccacc tctccaaacc gagagaaaaa acgaaatgcg
      421 gataaaaacg caccctagca gcagtccttt atacgacacc cccgggaggc ctgcggggtc
      481 ggatgattca agctcacggg gacgagcagg agcgctctcg acttttctag agcctcagcg
      541 tcctaggact cacctttccc tgatcctgca ccgtccctct cctggcccca gactctccct
      601 cccactgttc acgaagccca ggtggccgtc ggccggggag cggagggggc gcgtggggtg
      661 caggcggcgc caaggcgctg cacctgtggg cgcggggcga gggcccctcc cggcgcgagc
      721 gggcgcagtt ccccggcggc gccgctaggg gtctctctcg ggtgccgagc ggggtgggcc
      781 ggatcagctg actcgcctgg ctctgagccc cgccgccgcg ctcgggctcc gtcagtttcc
      841 tcggcagcgg taggcgagag cacgcggagg agcgtgcgcg ggggccccgg gagacggcgg
      901 cggtggcggc gcgggcagag caaggacgcg gcggatccca ctcgcacagc agcgcactcg
      961 gtgccccgcg cagggtcgcg atgctgcccg gtttggcact gctcctgctg gccgcctgga
     1021 cggctcgggc gctggaggtg ggtgccgcgc ctcggaaggc ggggggaggc tgcacggtgg
     1081 ggacgcgata ccccccaaga ccttaaccca agtctttaat gcagagaagc cgggggtccg
     1141 tcaatgggac ccct
//
LOCUS       HUMAMYB02                771 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 2.
ACCESSION   M34863
VERSION     M34863.1  GI:178596
KEYWORDS    amyloid-beta protein.
SEGMENT     2 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 771)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Human DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..771
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34862.1:1038..1154,1..278)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=1
     exon            279..446
                     /gene="APP"
                     /note="G00-119-692"
                     /number=2
BASE COUNT      221 a    153 c    146 g    251 t
ORIGIN      
        1 tcatatggga aaacgtgtaa atttttcatg aataaattcc tttcggtatt ggtaatttcc
       61 ttctctgttt tagtataaaa gaaagcattg ccttataggt ggctaagaac atctgtagca
      121 tacataacac ttaatctaaa ggagtgttga agaccgggct gattcctaat tgagaatgag
      181 caagaataga actctttttg atagttatcc ctgttcttcc tccaagcctc tgccttggag
      241 ctatggatac tataactaac tgaagcttct tctttcaggt acccactgat ggtaatgctg
      301 gcctgctggc tgaaccccag attgccatgt tctgtggcag actgaacatg cacatgaatg
      361 tccagaatgg gaagtgggat tcagatccat cagggaccaa aacctgcatt gataccaagg
      421 aaggcatcct gcagtattgc caagaagtaa gtcctgtccg gtggctagca attcacgttg
      481 gatcacatgc atttgttttc aaaaaattta acttcttgta ttttgcatca gtattttaac
      541 cctacagtaa aaatcttggt tcctaatgat tcaccatacc attaatatat ttatttgcat
      601 taccctatga tatacatata aatgttttta aaattatgat gtcgtattat gaccatcact
      661 aaacagtagt ttaagatgtc acagcacttt tttttttcct cactctgtca cccaggttgg
      721 agtgcagtgg caagattatg gcttactgta gccttgacct actgggctca a
//
LOCUS       HUMAMYB03                604 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 3.
ACCESSION   M34864
VERSION     M34864.1  GI:178597
KEYWORDS    amyloid-beta protein.
SEGMENT     3 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 604)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..604
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34863.1:447..771,1..229)
                     /gene="APP"
                     /note="G00-119-692"
     exon            230..359
                     /gene="APP"
                     /note="G00-119-692"
                     /number=3
BASE COUNT      147 a    141 c    148 g    168 t
ORIGIN      
        1 ggagtatttt ggattttgga ctttccggat tccagatgct caaccagtaa gtatataatg
       61 cagatgttct aaaatcggaa aaagtcagag accttgaaag cacttctggt cccaagcatt
      121 ttggataagg gacactccac ctgtacctta cagtggaggc ttgttagatg cttgtaaatg
      181 ccagcccctg cctcaagtaa caattgattc tttttgtgtg ctctcccagg tctaccctga
      241 actgcagatc accaatgtgg tagaagccaa ccaaccagtg accatccaga actggtgcaa
      301 gcggggccgc aagcagtgca agacccatcc ccactttgtg attccctacc gctgcttagg
      361 tgagccggcc ggccgtgggg ctggtgttga ttgggggcct ggtcttgagg gaagaaaaag
      421 aggatgctcc tgttaggtca catacacaga cttgttcttc agcacattgc cactctgtgt
      481 tgtactgtgt tttggactct tgcagttaca ttctgtgcac tgaccctata ggagcagtat
      541 ttttgagttc cctgcctcag aatgaattta cccagggtgt atattgaaat tacaaattcc
      601 tggg
//
LOCUS       HUMAMYB04                249 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein gene, exon 4.
ACCESSION   M34865
VERSION     M34865.1  GI:178598
KEYWORDS    amyloid-beta protein.
SEGMENT     4 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 249)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..249
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34864.1:360..604,1..131)
                     /gene="APP"
                     /note="G00-119-692"
     exon            132..244
                     /gene="APP"
                     /note="G00-119-692"
                     /number=4
BASE COUNT       57 a     49 c     61 g     82 t
ORIGIN      
        1 tcttgattgg gttgcttagg cattaaaagg ctgtttaact tgtcttgaag tctatctttc
       61 cttgatgtct tctgcggtaa gaacactgtg atacagatgg aatgacggga agtggttttc
      121 ctttctttca gttggtgagt ttgtaagtga tgcccttctc gttcctgaca agtgcaaatt
      181 cttacaccag gagaggatgg atgtttgcga aactcatctt cactggcaca ccgtcgccaa
      241 agaggtacc
//
LOCUS       HUMAMYB05                910 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 5.
ACCESSION   M34866
VERSION     M34866.1  GI:178599
KEYWORDS    amyloid-beta protein.
SEGMENT     5 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 910)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..910
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34865.1:245..249,1..485)
                     /gene="APP"
                     /note="G00-119-692"
     exon            486..679
                     /gene="APP"
                     /note="G00-119-692"
                     /number=5
BASE COUNT      270 a    163 c    194 g    283 t
ORIGIN      
        1 aggattgtta ctctgaaagt gaaccaagag agtcagaatt tctaggataa ggcaggtcca
       61 tgtgagataa acttccatct gtaagggagg taggagtgag ctggggatct acctggaatg
      121 cgcacaattt ctggatacac gaattaagcc agaaggcatt aattctctgg tatctgttca
      181 ccagtggaga ctgctacatc ttatttccca attagtttcc attccagttg ttcgtataaa
      241 cctctattat ataacatcgt ggtcttatta aataaataat aaacttaaga ataaaaagag
      301 taccaaagtg taacccatgc taaaaaaaat cctgttgtaa attcactcaa aaagcaaata
      361 atcttttcct ttgtgaaatt ggttcctaat atattgggtc tgcatgttga ttattttatg
      421 tggagttttc ttaaaatgaa acacatctac tctaccactc actgttttct ccttacactt
      481 tgtagacatg cagtgagaag agtaccaact tgcatgacta cggcatgttg ctgccctgcg
      541 gaattgacaa gttccgaggg gtagagtttg tgtgttgccc actggctgaa gaaagtgaca
      601 atgtggattc tgctgatgcg gaggaggatg actcggatgt ctggtggggc ggagcagaca
      661 cagactatgc agatgggagg taaggtggcc tttgtgttca gcctcagaga tgctgaaaca
      721 tcttgtatgg agtatttgta tcctgtaaat taatctttct gtttatcact gaaaaggtct
      781 ctgcccactc ccatcagagt ctgctgttat gcaaaaatct gaactatgaa tttttatggc
      841 atcctgttga attaataata tcaatcaccc atcacagagt taattttaac tatttaatat
      901 taaacttggg
//
LOCUS       HUMAMYB06                745 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 6.
ACCESSION   M34867
VERSION     M34867.1  GI:178600
KEYWORDS    amyloid-beta protein.
SEGMENT     6 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 745)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..745
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34866.1:680..910,1..199)
                     /gene="APP"
                     /note="G00-119-692"
     exon            200..402
                     /gene="APP"
                     /note="G00-119-692"
                     /number=6
BASE COUNT      220 a    152 c    161 g    212 t
ORIGIN      
        1 ctgcagaaac taagaactgt ccccaaggat aaaaattttt gcagacccac tgagctggga
       61 tttatactta gacctgctga attgaagttt tgggtgtgtt tcttctcaaa ttgccaaaat
      121 tccatatgga cgacttttct tttccttccc tgaaatgtgt ttaattgact ttttctgtta
      181 tttgtgtttg ccttcacagt gaagacaaag tagtagaagt agcagaggag gaagaagtgg
      241 ctgaggtgga agaagaagaa gccgatgatg acgaggacga tgaggatggt gatgaggtag
      301 aggaagaggc tgaggaaccc tacgaagaag ccacagagag aaccaccagc attgccacca
      361 ccaccaccac caccacagag tctgtggaag aggtggttcg aggtaatcca ccatttgctt
      421 ggattccccc cacccccaag gaaaagaaag cgtaatacca gagttggaaa tatccaccct
      481 agcaccactg ccttccccaa tcaaaaacat gttttttttt ccaaaaggct tcttatgctt
      541 gtgaaatttt tttggtttaa caagcaaaca atttcaaata atgtgaaatc tttattatac
      601 agtttgtttt gtaccttgta tatgctgctt ggcaaatccc aagttaattc tacaagactc
      661 cgcccaacca ggtagtcatc taatttgaca aacactgagt taggtgactc cttggtattc
      721 ctttggcagt gagtgtttgt tatat
//
LOCUS       HUMAMYB07                669 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 7.
ACCESSION   M34868
VERSION     M34868.1  GI:178601
KEYWORDS    amyloid-beta protein.
SEGMENT     7 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 669)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..669
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34867.1:403..745,1..184)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=6
     exon            185..352
                     /gene="APP"
                     /note="G00-119-692"
                     /number=7
BASE COUNT      149 a    161 c    163 g    196 t
ORIGIN      
        1 tgagaggtgt aaatctctag tcctggtggc cagttaaatt cctcagtaaa tgtttggtag
       61 atgctgccta ataaaccagt ccaggttgcc actgggagga ttaaaagaag taaacgtgta
      121 tacatgaaca gagagacagt gccttttcat gctaaatgtg gttccccaca tctcctctga
      181 ttagaggtgt gctctgaaca agccgagacg gggccgtgcc gagcaatgat ctcccgctgg
      241 tactttgatg tgactgaagg gaagtgtgcc ccattctttt acggcggatg tggcggcaac
      301 cggaacaact ttgacacaga agagtactgc atggccgtgt gtggcagcgc cagtaagtgg
      361 acccttcttc gagcctggcc acctttcgtc tctctcgcca ctgactctgc tttttgtaac
      421 agattgattt tcctggttct tgggaatggg cctgttgcta ccactaacca catttctgtc
      481 cacttctcta attgctcaga gtctccgcag tatgttcaat catgagcaca cctctccgtc
      541 tttccctgat aaagcatggc catggatgtg ttctcttcct agctgtagca catatgtctt
      601 gcaatccaga gggacttttg agtgcttctc ttttaaacaa agctggagtg gctgttttgt
      661 cttctgcag
//
LOCUS       HUMAMYB08                413 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 8.
ACCESSION   M34869
VERSION     M34869.1  GI:178602
KEYWORDS    amyloid-beta protein.
SEGMENT     8 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 413)
  AUTHORS   Kale,L.C., Higgins,G.A., Yoshioka,K., Izumi,R., Oishi,N. and
            Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
FEATURES             Location/Qualifiers
     source          1..413
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34868.1:353..669,1..158)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=7
     exon            159..215
                     /gene="APP"
                     /note="G00-119-692"
                     /number=8
     variation       269
                     /gene="APP"
                     /note="g in one allele; gg in another allele"
                     /replace="gg"
BASE COUNT       88 a     80 c     86 g    159 t
ORIGIN      
        1 atgtttatat gttcattttg gttttgttgg agggaccaaa cctaagtgag tgattttgtt
       61 tgttaggttg tttttttgtc agtggactcg tgcatttcag ccatcattcc catgtttctc
      121 tttttgtttt tagttatgtt ctcttatttt ttccatagtg tcccaaagtt tactcaagac
      181 tacccaggaa cctcttgccc gagatcctgt taaacgtacg ttgtcattca cctgagggaa
      241 gggaagaggg gaggaggatg ctgcttggtt cacataactc cagcatcatc accttctttg
      301 catggttttg tgtttcttga acacctgtct tagtaaaatg tttcttccca ttaccttgct
      361 tgtaattaca tctgattttg ccagacagct tgagatgttg ggctaagaac atc
//
LOCUS       HUMAMYB09                639 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 9.
ACCESSION   M34870
VERSION     M34870.1  GI:178603
KEYWORDS    amyloid-beta protein.
SEGMENT     9 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 639)
  AUTHORS   Kale,L.C., Higgins,G.A., Yoshioka,K., Izumi,R., Oishi,N. and
            Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..639
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34869.1:216..413,1..309)
                     /gene="APP"
                     /note="G00-119-692"
     exon            310..443
                     /gene="APP"
                     /note="G00-119-692"
                     /number=9
BASE COUNT      157 a    144 c    129 g    209 t
ORIGIN      
        1 tcttcagcac caactgtttt tgctctttgc atgcttgttt cgtaaagaac tctaatgcta
       61 taattgcaaa atggaccatt ttaaagattt ttccttcatt ctgtacttgg gagtggtgaa
      121 agacatcctt actgtgctgc acagtgtctc atggtgttct cttaaacagc attaacgtct
      181 tgtatgcgct gctttactaa attctctgtt ctgagaaata actgaaaata cggctttcta
      241 ttaaacgagt ggattattct gttgttgttg gctttttttt ctcaaacctc cttctcttct
      301 actttatagt tcctacaaca gcagccagta cccctgatgc cgttgacaag tatctcgaga
      361 cacctgggga tgagaatgaa catgcccatt tccagaaagc caaagagagg cttgaggcca
      421 agcaccgaga gagaatgtcc caggtaagtc tggctcttcc atcattcagc cctacgatat
      481 tgggaaacct gagcttgcct ctgcctcagt ctccccacag gcctgctggc tttatcaaga
      541 tctttaaaga tgtaaagttc taattttaaa tgtttactgt gtgggcacag tttgtgggtt
      601 ttttgcatcc tccgtgaaca tgctgcctag gaaggatcc
//
LOCUS       HUMAMYB10                468 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 10.
ACCESSION   M34871
VERSION     M34871.1  GI:178604
KEYWORDS    amyloid-beta protein.
SEGMENT     10 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 468)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..468
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34870.1:444..639,1..263)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=9
     exon            264..338
                     /gene="APP"
                     /note="G00-119-692"
                     /number=10
BASE COUNT      164 a     80 c     83 g    141 t
ORIGIN      
        1 ggggtaaaaa aagaccaacc ttgactcaaa aagctatatt cctgcaataa atatgaacat
       61 ttatgtttaa tatactgctg tgattaatta tttatatcct agtgggaggt caaatattct
      121 tcagatagga aggggtatgt aataaatttt ttaaatatct aaaatgacaa aaatagagga
      181 aaaacataat catccatcct attaagtctg tattcaaagg atgaactgat gattttaaat
      241 tcaaatgttt ccttaattta taggtcatga gagaatggga agaggcagaa cgtcaagcaa
      301 agaacttgcc taaagctgat aagaaggcag ttatccaggt aaaacctgaa cccatttcct
      361 accaaacatc acatgccagc agtcttttta atgagtgtct gcaggtttac ctttatctgc
      421 cagcacctca cgtgtgattg ttcatagtcc catgaacttc agctgttc
//
LOCUS       HUMAMYB11                742 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 11.
ACCESSION   M34872
VERSION     M34872.1  GI:178605
KEYWORDS    amyloid-beta protein.
SEGMENT     11 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 742)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..742
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34871.1:339..468,1..316)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=10
     exon            317..475
                     /gene="APP"
                     /note="G00-119-692"
                     /number=11
BASE COUNT      221 a    160 c    179 g    182 t
ORIGIN      
        1 ttttttttta atatgtaata taagagtttc aagagtgcat atatttatcc aaagttatta
       61 gtcaaatcag ccttcagaga gcctatgaaa tactgatttt caatgagaaa gtaggtttga
      121 tgagggttgg agagtgcaag aaagtgaagt aagctataaa acgtagaatt agtcaatgtt
      181 ggaatgacta tgcagtcttt aggatacttt ttagcactag aagaaaatgg aataggatgt
      241 cttttttaga agacttgaaa ttgctgcttc atcctactta ttcagtcccc atggacatat
      301 gtgtttatga tggcagcatt tccaggagaa agtggaatct ttggaacagg aagcagccaa
      361 cgagagacag cagctggtgg agacacacat ggccagagtg gaagccatgc tcaatgaccg
      421 ccgccgcctg gccctggaga actacatcac cgctctgcag gctgttcctc ctcgggtagg
      481 tctcgctgca gccgagttca cacttcaggt cacagcacag acagtaaggg tggggcactg
      541 ggaactggaa gccatacaaa aagaatgagg agaaatgcct tgagcactgt tattcagagg
      601 ttcaacccct gtccattcca tcttgaaggt caaagggtca cagggcagct acctccacaa
      661 ggtcatctct acacagcagg tactcacact tccccacaga gcagccagac aggactccca
      721 gggactcaca ttgcaagccc tg
//
LOCUS       HUMAMYB12                488 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 12.
ACCESSION   M34873
VERSION     M34873.1  GI:178606
KEYWORDS    amyloid-beta protein.
SEGMENT     12 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 488)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..488
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34872.1:476..742,1..207)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=11
     exon            208..336
                     /gene="APP"
                     /note="G00-119-692"
                     /number=12
BASE COUNT      141 a    111 c    114 g    122 t
ORIGIN      
        1 ctcttgtgga agtgaaaagc agccgatcta agtaattacc cattaagaaa ggagtaggag
       61 aaatgtttag aaggcagacc tgcaaaaagg tgactcacag tgcgttcaca tgaccggatg
      121 cattagtgga acctctaacc catcgccaat ggaagaagca gtgttttgca caaacttgaa
      181 aaagagtttt tcattttcct cccacagcct cgtcacgtgt tcaatatgct aaagaagtat
      241 gtccgcgcag aacagaagga cagacagcac accctaaagc atttcgagca tgtgcgcatg
      301 gtggatccca agaaagccgc tcagatccgg tcccaggtaa gcgtggggta taatcatctt
      361 ctgcagcttt gacaatccag gattcttgct tcttgggttg gtgacaggat cccaggcatt
      421 ctccgagcac ctataggtgt tttccctatg gcaagagctc tttaacttct ggtaaattct
      481 aaaagctt
//
LOCUS       HUMAMYB13                413 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) exon 13.
ACCESSION   M34874
VERSION     M34874.1  GI:178607
KEYWORDS    amyloid-beta protein.
SEGMENT     13 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 413)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
FEATURES             Location/Qualifiers
     source          1..413
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34873.1:337..488,1..92)
                     /gene="APP"
                     /note="alternately spliced; G00-119-692"
                     /number=12
     exon            93..192
                     /gene="APP"
                     /note="alternately spliced; G00-119-692"
                     /number=13
BASE COUNT      119 a     84 c     94 g    116 t
ORIGIN      
        1 ctccgtctca aaaaaaaaag aaaaaaagaa agaaaaagaa ccattcctac ccccagacat
       61 tgttgacctg gagttgtcat cctttgatgc aggttatgac acacctccgt gtgatttatg
      121 agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc gaggagattc
      181 aggatgaagt tggtaagtaa gctgttcttt tgatgctgca catggacatg tatttttccc
      241 cagaggaaaa ttgaggaagt gagttatctg tttgaggaat atggaagtta atagaacagc
      301 ttgcctttta cgatgaaact gtcaggctgg aactagcttt ctagccttag caattgaatc
      361 agttcttacc tctccatact tgtaagactg gaggaatgtc gcctgactta ggg
//
LOCUS       HUMAMYB14               1037 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 13.
ACCESSION   M34875
VERSION     M34875.1  GI:178608
KEYWORDS    amyloid-beta protein.
SEGMENT     14 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 1037)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..1037
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     mRNA            join(M34862.1:834..1037,M34863.1:279..446,
                     M34864.1:230..359,M34865.1:132..244,M34866.1:486..679,
                     M34867.1:200..402,M34868.1:185..352,M34869.1:159..215,
                     M34870.1:310..443,M34871.1:264..338,M34872.1:317..475,
                     M34873.1:208..336,289..740)
                     /gene="APP"
                     /note="alternately transcribed; G00-119-692"
     prim_transcript join(M34862.1:834..1037,order(M34862.1:1038..1154,
                     M34863.1:1..278),M34863.1:279..446,
                     order(M34863.1:447..771,M34864.1:1..229),
                     M34864.1:230..359,order(M34864.1:360..604,
                     M34865.1:1..131),M34865.1:132..244,
                     order(M34865.1:245..249,M34866.1:1..485),
                     M34866.1:486..679,order(M34866.1:680..910,
                     M34867.1:1..199),M34867.1:200..402,
                     order(M34867.1:403..745,M34868.1:1..184),
                     M34868.1:185..352,order(M34868.1:353..669,
                     M34869.1:1..158),M34869.1:159..215,
                     order(M34869.1:216..413,M34870.1:1..309),
                     M34870.1:310..443,order(M34870.1:444..639,
                     M34871.1:1..263),M34871.1:264..338,
                     order(M34871.1:339..468,M34872.1:1..316),
                     M34872.1:317..475,order(M34872.1:476..742,
                     M34873.1:1..207),M34873.1:208..336,
                     order(M34873.1:337..488,M34874.1:1..413,1..288),289..740)
                     /gene="APP"
                     /note="G00-119-692"
     CDS             join(M34862.1:981..1037,M34863.1:279..446,
                     M34864.1:230..359,M34865.1:132..244,M34866.1:486..679,
                     M34867.1:200..402,M34868.1:185..352,M34869.1:159..215,
                     M34870.1:310..443,M34871.1:264..338,M34872.1:317..475,
                     M34873.1:208..336,289..345)
                     /gene="APP"
                     /note="alternate amyloid-beta protein"
                     /codon_start=1
                     /product="amyloid-beta protein"
                     /protein_id="AAB59501.1"
                     /db_xref="GI:178615"
                     /db_xref="GDB:G00-119-692"
                     /translation="MLPGLALLLLAAWTARALEVPTDGNAGLLAEPQIAMFCGRLNMH
                     MNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGR
                     KQCKTHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSE
                     KSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYA
                     DGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTT
                     TTTTESVEEVVREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTE
                     EYCMAVCGSAMSQSLLKTTQEPLARDPVKLPTTAASTPDAVDKYLETPGDENEHAHFQ
                     KAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANER
                     QQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHT
                     LKHFEHVRMVDPKKAAQIRSQVQWLMPVIPAFWEAKVGR"
     intron          order(M34873.1:337..488,M34874.1:1..413,1..288)
                     /gene="APP"
                     /note="alternately spliced; G00-119-692"
                     /number=12
     exon            289..740
                     /gene="APP"
                     /note="alternately spliced; G00-119-692"
                     /number=13
BASE COUNT      298 a    192 c    197 g    350 t
ORIGIN      
        1 ggatcctttt gacacacaaa aattttcaat ttagaagtct attttatcag ttttttcttt
       61 tgtggcagtt actatggtat ggtatctagg aaaccattgt ccgaaacaag gttacaacga
      121 tttatacctg ttaccttcta agagttttgt agttttagct cttacagtta aatctttgat
      181 tcattttaat tttcatatat ggtgtgaggt agaggtttga ctttattctt ttgcatataa
      241 atatccagtg tcccagcacc atttgttgaa aaagactatt cttgccaggt gcagtggctc
      301 atgcctgtaa ttccagcatt ttgggaggcc aaggtgggca gatgacttga gcccagaagt
      361 tcaagaccag attgggaaac atggcaagac cacatttcta caaaaaaatt atccaggcat
      421 gataacatct atttgtagtc ccagctactc aggaggctgt ggtgggagga tctcccgagc
      481 ctggggtggc tgaggctgca gtgagccttg atcacgccac ctgggcaata gagcaagacc
      541 ctgtctcaaa aaaaggaaga aaaagactat tatttccccc attgaatggt cttggcacta
      601 ttacacaaaa tcaattgtcc atagataata tgggtttatt tcttaattct tagttctttt
      661 ctttgatctg tgtgcctgtg cttactgtag taccacactg ttttgattat tgtagctttg
      721 tagtaaattt tgaaatcagc aagtgtgagt ccgttatctt tgttcttctt tttcaatatt
      781 actttgagta tccggagtct cttgcattac aacatgaatt ttaagatctt cctcaccccc
      841 tgcccccact accatttctg caaaaaaaaa aaaaaaaaaa aaaaaaaaag tgcggattgc
      901 attggatctg tagatcattt tggggaatat ttatatataa tgttaatgag gttttaaaca
      961 acacttttat ctactagaac tgatatttgc ctcgaaatga ttccaccttg agccccattc
     1021 aggaacgtta gtatttt
//
LOCUS       HUMAMYB15                577 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 14.
ACCESSION   M34876
VERSION     M34876.1  GI:178609
KEYWORDS    amyloid-beta protein.
SEGMENT     15 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 577)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
FEATURES             Location/Qualifiers
     source          1..577
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34874.1:193..413,M34875.1:1..1037,1..241)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=13
     exon            242..463
                     /gene="APP"
                     /note="G00-119-692"
                     /number=14
BASE COUNT      147 a    119 c    150 g    161 t
ORIGIN      
        1 ctgcagtcct gccagcagct cttgtgagct gcgactttgc ataagtagct tgaatgccat
       61 gtgcctcagt tttcacatct gtaaaaggga gatgataatg gtacctatgt catggctcta
      121 aacgcgatca tgcacgtgaa agcagttgaa gtcttgcctg gcagaagtaa atggtggctg
      181 ctgctgctgc tgctgttgtg attgttgtta ctcaccaaag agatggtttt gtttggttta
      241 gatgagctgc ttcagaaaga gcaaaactat tcagatgacg tcttggccaa catgattagt
      301 gaaccaagga tcagttacgg aaacgatgct ctcatgccat ctttgaccga aacgaaaacc
      361 accgtggagc tccttcccgt gaatggagag ttcagcctgg acgatctcca gccgtggcat
      421 tcttttgggg ctgactctgt gccagccaac acagaaaacg aaggtaagag tcccctgagc
      481 cagcaagggc gttctgggag gtatatatac acatacataa cgtgtgtgca agagagagag
      541 ttatcttttg tatgttcttg agtggtgaat ttttttt
//
LOCUS       HUMAMYB16                534 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 15.
ACCESSION   M34877
VERSION     M34877.1  GI:178610
KEYWORDS    amyloid-beta protein.
SEGMENT     16 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 534)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..534
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34876.1:464..577,1..258)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=14
     exon            259..312
                     /gene="APP"
                     /note="G00-119-692"
                     /number=15
BASE COUNT      103 a    137 c    109 g    185 t
ORIGIN      
        1 gcgtatcctg gtgagggctt ttattagaaa actaaggagc aggactggca catcaatagc
       61 gataagccat ttaaagtggt gcccctcctg gtcctcgatg ctcgggatta actcactgtg
      121 gttcttttct ggctgctcct tttttgtaac ttgtttattg catatgcttt tttccctgct
      181 cgactatgtt tgggagccac gacttaccat cttgatttgt cttgtttgct ttctgtgtcc
      241 cttgcttcct gtgcccagtt gagcctgttg atgcccgccc tgctgccgac cgaggactga
      301 ccactcgacc aggtatcaga accgcttgac ttgtgcctct ccgcatcttg ggcctaagct
      361 cgagcagacg ttgtctcctt tctgcatttt aagcctctct caattgtact tactgcagag
      421 tagaatttga agtgacctta gaaaccaaac ttttaaaaat tccccttgac tattccttac
      481 tcttcatttc ccaagttccg agtgaaatga ttttttttgt ttctgagctt tccc
//
LOCUS       HUMAMYB17                553 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 16.
ACCESSION   M34878
VERSION     M34878.1  GI:178611
KEYWORDS    amyloid-beta protein.
SEGMENT     17 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 553)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..553
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34877.1:313..534,1..173)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=15
     exon            174..274
                     /gene="APP"
                     /note="amyloid-beta protein"
                     /number=16
BASE COUNT      179 a     94 c    113 g    167 t
ORIGIN      
        1 tagtaattga agttttaaat atagggtatc atttttcttt aagagtcatt tatcaatttt
       61 cttctaactt caggcctaga aagaagtttt gggtaggctt tgtcttacag tgttattatt
      121 tatgagtaaa actaattggt tgtcctgcat actttaatta tgatgtaata caggttctgg
      181 gttgacaaat atcaagacgg aggagatctc tgaagtgaag atggatgcag aattccgaca
      241 tgactcagga tatgaagttc atcatcaaaa attggtacgt aaaataattt acctctttcc
      301 actactgttt gtcttgccaa atgacctatt aactctggtt catcctgtgc tagaaatcaa
      361 attaaggaaa agataaaaat acaatgcttg cctataggat taccatgaaa acatgaagaa
      421 aataaatagg ctaggctgag cgcagtgctc aagcctgtaa tcccagcact ttgggaggcc
      481 aaggcgggtg gatcacgagg tcagaaattc gagaccagcc tggccaatat ggtgaaaccc
      541 catctctact aaa
//
LOCUS       HUMAMYB18                567 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 17.
ACCESSION   M34879
VERSION     M34879.1  GI:178612
KEYWORDS    amyloid-beta protein.
SEGMENT     18 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 567)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
FEATURES             Location/Qualifiers
     source          1..567
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     intron          order(M34878.1:275..553,1..194)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=16
     exon            195..341
                     /gene="APP"
                     /note="G00-119-692"
                     /number=17
BASE COUNT      159 a     85 c    113 g    210 t
ORIGIN      
        1 tttttttttt ttttttaaag taagcatcaa atatttgacc aaccagttgg gcagagaata
       61 tactgaaact ttttatataa cctcatccaa atgtcccctg catttaagaa atgaaattct
      121 tctaattgcg tttataaatt gtaaattata ttgcatttag aaattaaaat tctttttctt
      181 aatttgtttt caaggtgttc tttgcagaag atgtgggttc aaacaaaggt gcaatcattg
      241 gactcatggt gggcggtgtt gtcatagcga cagtgatcgt catcaccttg gtgatgctga
      301 agaagaaaca gtacacatcc attcatcatg gtgtggtgga ggtaggtaaa cttgactgca
      361 tgtttccaag tgggaattaa gactatgaga gaattaggct tagctttttg ctaagaacta
      421 gctaagtatc tcttttaaaa aacgaatcag tgtgcttcca tgatgcttgg gttacagttg
      481 ttctttcttg ttttggtttt cattcattgc aacttaccgt gaatattctg ctcaaggtat
      541 tgagagtgtg tgttgttatc tcaactt
//
LOCUS       HUMAMYB19               1607 bp    DNA     linear   PRI 08-AUG-1995
DEFINITION  Human amyloid-beta protein (APP) gene, exon 18.
ACCESSION   M33112
VERSION     M33112.1  GI:178613
KEYWORDS    amyloid-beta protein.
SEGMENT     19 of 19
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yoshikai,S., Sasaki,H., Doh-ura,K., Furuya,H. and Sakaki,Y.
  TITLE     Genomic organization of the human amyloid beta-protein precursor
            gene
  JOURNAL   Gene 87 (2), 257-263 (1990)
  MEDLINE   90236318
   PUBMED   2110105
REFERENCE   2  (bases 1 to 1607)
  AUTHORS   Yoshioka,K., Izumi,R., Oishi,N. and Sakaki,Y.
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Homo sapiens DNA.
            [1]  sites; intron/exon boundaries.
            Computer-readable sequence for [1] kindly submitted by Y.Sakaki,
            01-MAY-1992.
            [1]  sites.
FEATURES             Location/Qualifiers
     source          1..1607
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="21q21.2"
     gene            join(M34862.1:484..1154,M34863.1:1..771,M34864.1:1..604,
                     M34865.1:1..249,M34866.1:1..910,M34867.1:1..745,
                     M34868.1:1..669,M34869.1:1..413,M34870.1:1..639,
                     M34871.1:1..468,M34872.1:1..742,M34873.1:1..488,
                     M34874.1:1..413,M34875.1:1..1037,M34876.1:1..577,
                     M34877.1:1..534,M34878.1:1..553,M34879.1:1..567,1..1380)
                     /gene="APP"
     mRNA            join(M34862.1:834..1037,M34863.1:279..446,
                     M34864.1:230..359,M34865.1:132..244,M34866.1:486..679,
                     M34867.1:200..402,M34868.1:185..352,M34869.1:159..215,
                     M34870.1:310..443,M34871.1:264..338,M34872.1:317..475,
                     M34873.1:208..336,M34874.1:93..192,M34876.1:242..463,
                     M34877.1:259..312,M34878.1:174..274,M34879.1:195..341,
                     160..1380)
                     /gene="APP"
                     /note="G00-119-692"
     prim_transcript order(M34862.1:834..1037,order(M34862.1:1038..1154,
                     M34863.1:1..278),M34863.1:279..446,
                     order(M34863.1:447..771,M34864.1:1..229),
                     M34864.1:230..359,order(M34864.1:360..604,
                     M34865.1:1..131),M34865.1:132..244,
                     order(M34865.1:245..249,M34866.1:1..485),
                     M34866.1:486..679,order(M34866.1:680..910,
                     M34867.1:1..199),M34867.1:200..402,
                     order(M34867.1:403..745,M34868.1:1..184),
                     M34868.1:185..352,order(M34868.1:353..669,
                     M34869.1:1..158),M34869.1:159..215,
                     order(M34869.1:216..413,M34870.1:1..309),
                     M34870.1:310..443,order(M34870.1:444..639,
                     M34871.1:1..263),M34871.1:264..338,
                     order(M34871.1:339..468,M34872.1:1..316),
                     M34872.1:317..475,order(M34872.1:476..742,
                     M34873.1:1..207),M34873.1:208..336,M34874.1:93..192,
                     order(M34874.1:193..413,M34875.1:1..1037,M34876.1:1..241),
                     M34876.1:242..463,order(M34876.1:464..577,
                     M34877.1:1..258),M34877.1:259..312,
                     order(M34877.1:313..534,M34878.1:1..173),
                     M34878.1:174..274,order(M34878.1:275..553,
                     M34879.1:1..194),M34879.1:195..341,
                     order(M34879.1:342..567,1..159),160..1380)
                     /gene="APP"
                     /note="G00-119-692"
     CDS             join(M34862.1:981..1037,M34863.1:279..446,
                     M34864.1:230..359,M34865.1:132..244,M34866.1:486..679,
                     M34867.1:200..402,M34868.1:185..352,M34869.1:159..215,
                     M34870.1:310..443,M34871.1:264..338,M34872.1:317..475,
                     M34873.1:208..336,M34874.1:93..192,M34876.1:242..463,
                     M34877.1:259..312,M34878.1:174..274,M34879.1:195..341,
                     160..261)
                     /gene="APP"
                     /codon_start=1
                     /product="amyloid-beta protein"
                     /protein_id="AAB59502.1"
                     /db_xref="GI:178616"
                     /db_xref="GDB:G00-119-692"
                     /translation="MLPGLALLLLAAWTARALEVPTDGNAGLLAEPQIAMFCGRLNMH
                     MNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGR
                     KQCKTHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSE
                     KSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYA
                     DGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTT
                     TTTTESVEEVVREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTE
                     EYCMAVCGSAMSQSLLKTTQEPLARDPVKLPTTAASTPDAVDKYLETPGDENEHAHFQ
                     KAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANER
                     QQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHT
                     LKHFEHVRMVDPKKAAQIRSQVMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEVDELL
                     QKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSF
                     GADSVPANTENEVEPVDARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYE
                     VHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDA
                     AVTPEERHLSKMQQNGYENPTYKFFEQMQN"
     intron          order(M34879.1:342..567,1..159)
                     /gene="APP"
                     /note="G00-119-692"
                     /number=17
     exon            160..1380
                     /gene="APP"
                     /note="G00-119-692"
                     /number=18
BASE COUNT      447 a    304 c    318 g    538 t
ORIGIN      
        1 gttctgctcc aagatgtcaa agtgggggag aaaattatgg gtgttctaca atcttggcaa
       61 aagaagagta ctgtactcct tatcttttta ctgcttctcc atgttcaccc ttaaaagatt
      121 gatttttatt ttttactcag ctctcctctt gtttttcagg ttgacgccgc tgtcacccca
      181 gaggagcgcc acctgtccaa gatgcagcag aacggctacg aaaatccaac ctacaagttc
      241 tttgagcaga tgcagaacta gacccccgcc acagcagcct ctgaagttgg acagcaaaac
      301 cattgcttca ctacccatcg gtgtccattt atagaataat gtgggaagaa acaaacccgt
      361 tttatgattt actcattatc gccttttgac agctgtgctg taacacaagt agatgcctga
      421 acttgaatta atccacacat cagtaatgta ttctatctct ctttacattt tggtctctat
      481 actacattat taatgggttt tgtgtactgt aaagaattta gctgtatcaa actagtgcat
      541 gaatagattc tctcctgatt atttatcaca tagcccctta gccagttgta tattattctt
      601 gtggtttgtg acccaattaa gtcctacttt acatatgctt taagaatcga tgggggatgc
      661 ttcatgtgaa cgtgggagtt cagctgcttc tcttgcctaa gtattccttt cctgatcact
      721 atgcatttta aagttaaaca tttttaagta tttcagatgc tttagagaga ttttttttcc
      781 atgactgcat tttactgtac agattgctgc ttctgctata tttgtgatat aggaattaag
      841 aggatacaca cgtttgtttc ttcgtgcctg ttttatgtgc acacattagg cattgagact
      901 tcaagctttt ctttttttgt ccacgtatct ttgggtcttt gataaagaaa agaatccctg
      961 ttcattgtaa gcacttttac ggggcgggtg gggaggggtg ctctgctggt cttcaattac
     1021 caagaattct ccaaaacaat tttctgcagg atgattgtac agaatcattg cttatgacat
     1081 gatcgctttc tacactgtat tacataaata aattaaataa aataaccccg ggcaagactt
     1141 ttctttgaag gatgactaca gacattaaat aatcgaagta attttgggtg gggagaagag
     1201 gcagattcaa ttttctttaa ccagtctgaa gtttcattta tgatacaaaa gaagatgaaa
     1261 atggaagtgg caatataagg ggatgaggaa ggcatgcctg gacaaaccct tcttttaaga
     1321 tgtgtcttca atttgtataa aatggtgttt tcatgtaaat aaatacattc ttggaggagc
     1381 accattgtgc tggtgtgaat gattccatag taacaatctt gaccatttac tgacgtacag
     1441 accagtgaga agtcttcgca tgaagggtac ccacacctgt tgtgtcttaa ttgcaagtct
     1501 gagtaggaag ttggggccaa catgtgtctc ccagtgctgg gaaaatattt catagaccta
     1561 atttacagtc tttacttgat ctaaaacatt ttgctgccat attttgg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X84700. H.sapiens mRNA fo...[gi:840770] Links  


LOCUS       HSCD97                  2921 bp    mRNA    linear   PRI 30-OCT-1995
DEFINITION  H.sapiens mRNA for leucocyte antigen CD97.
ACCESSION   X84700
VERSION     X84700.1  GI:840770
KEYWORDS    antigen CD97.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Hamann,J., Eichler,W., Hamann,D., Kerstens,H.M., Poddighe,P.J.,
            Hoovers,J.M., Hartmann,E., Strauss,M. and van Lier,R.A.
  TITLE     Expression cloning and chromosomal mapping of the leukocyte
            activation antigen CD97, a new seven-span transmembrane molecule of
            the secretion receptor superfamily with an unusual extracellular
            domain
  JOURNAL   J. Immunol. 155 (4), 1942-1950 (1995)
  MEDLINE   95363161
REFERENCE   2  (bases 1 to 2921)
  AUTHORS   Hamann,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-FEB-1995) J. Hamann, Central Lab. Netherlands Red
            Cross Blood Transfusion Service, CLB, Dept. KVI, Plesmanlaan 125,
            1066 CX Amsterdam, NETHERLANDS
FEATURES             Location/Qualifiers
     source          1..2921
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19p13.12-13.2"
                     /cell_type="PBMC"
     CDS             71..2299
                     /codon_start=1
                     /product="leucocyte antigen CD97"
                     /protein_id="CAA59173.1"
                     /db_xref="GI:840771"
                     /db_xref="SWISS-PROT:P48960"
                     /translation="MGGRVFLAFCVWLTLPGAETQDSRGCARWCPQNSSCVNATACRC
                     NPGFSSFSEIITTPTETCDDINECATPSKVSCGKFSDCWNTEGSYDCVCSPGYEPVSG
                     AKTFKNESENTCQDVDECSSGQHQCDSSTVCFNTVGSYSCRCRPGWKPRHGIPNNQKD
                     TVCEDMTFSTWTPPPGVHSQTLSRFFDKVQDLGRDSKTSSAEVTIQNVIKLVDELMEA
                     PGDVEALAPPVRHLIATQLLSNLEDIMRILAKSLPKGPFTYISPSNTELTLMIQERGD
                     KNVTMGQSSARMKLNWAVAAGAEDPGPAVAGILSIQNMTTLLANASLNLHSKKQAELE
                     EIYESSIRGVQLRRLSAVNSIFLSHNNTKELNSPILFAFSHLESSDGEAGRDPPAKDV
                     MPGPRQELLCAFWKSDSDRGGHWATEVCQVLGSKNGSTTCQCSHLSSFTILMAHYDVE
                     DWKLTLITRVGLALSLFCLLLCILTFLLVRPIQGSRTTIHLHLCICLFVGSTIFLAGI
                     ENEGGQVGLRCRLVAGLLHYCFLAAFCWMSLEGLELYFLVVRVFQGQGLSTRWLCLIG
                     YGVPLLIVGVSAAIYSKGYGRPRYCWLDFEQGFLWSFLGPVTFIILCNAVIFVTTVWK
                     LTQKFSEINPDMKKLKKARALTITAIAQLFLLGCTWVFGLFIFDDRSLVLTYVFTILN
                     CLQGAFLYLLHCLLNKKVREEYRKWACLVAGGSKYSEFTSTTSGTGHNQTRALRASES
                     GI"
BASE COUNT      609 a    891 c    799 g    622 t
ORIGIN      
        1 agcctgtgga gacgggacag ccctgtccca ctcactcttt cccctgccgc tcctgccggc
       61 agctccaacc atgggaggcc gcgtctttct cgcattctgt gtctggctga ctctgccggg
      121 agctgaaacc caggactcca ggggctgtgc ccggtggtgc cctcagaact cctcgtgtgt
      181 caatgccacc gcctgtcgct gcaatccagg gttcagctct ttttctgaga tcatcaccac
      241 cccgacggag acttgtgacg acatcaacga gtgtgcaaca ccgtcgaaag tgtcatgcgg
      301 aaaattctcg gactgctgga acacagaggg gagctacgac tgcgtgtgca gcccgggata
      361 tgagcctgtt tctggggcaa aaacattcaa gaatgagagc gagaacacct gtcaagatgt
      421 ggacgagtgc agctccgggc agcatcagtg tgacagctcc accgtctgct tcaacaccgt
      481 gggttcatac agctgccgct gccgcccagg ctggaagccc agacacggaa tcccgaataa
      541 ccaaaaggac actgtctgtg aagatatgac tttctccacc tggaccccgc cccctggagt
      601 ccacagccag acgctttccc gattcttcga caaagtccag gacctgggca gagactccaa
      661 gacaagctca gccgaggtca ccatccagaa tgtcatcaaa ttggtggatg aactgatgga
      721 agctcctgga gacgtagagg ccctggcgcc acctgtccgg cacctcatag ccacccagct
      781 gctctcaaac cttgaagata tcatgaggat cctggccaag agcctgccta aaggcccctt
      841 cacctacatt tccccttcga acacagagct gaccctgatg atccaggagc ggggggacaa
      901 gaacgtcact atgggtcaga gcagcgcacg catgaagctg aattgggctg tggcagctgg
      961 agccgaggat ccaggccccg ccgtggcggg catcctctcc atccagaaca tgacgacatt
     1021 gctggccaat gcctccttga acctgcattc caagaagcaa gccgaactgg aggagatata
     1081 tgaaagcagc atccgtggtg tccaactcag acgcctctct gccgtcaact ccatctttct
     1141 gagccacaac aacaccaagg aactcaactc ccccatcctt ttcgccttct cccaccttga
     1201 gtcctccgat ggggaggcgg gaagagaccc tcctgccaag gacgtgatgc ctgggccacg
     1261 gcaggagctg ctctgtgcct tctggaagag tgacagcgac aggggagggc actgggccac
     1321 cgaggtctgc caggtgctgg gcagcaagaa cggcagcacc acctgccaat gcagccacct
     1381 gagcagcttt acgatcctta tggctcatta tgacgtggag gactggaagc tgaccctgat
     1441 caccagggtg ggactggcgc tgtcactctt ctgcctgctg ctgtgcatcc tcactttcct
     1501 gctggtgcgg cccatccagg gctcgcgcac caccatacac ctgcacctct gcatctgcct
     1561 cttcgtgggc tccaccatct tcctggccgg catcgagaac gaaggcggcc aggtggggct
     1621 gcgctgccgc ctggtggccg ggctgctgca ctactgtttc ctggccgcct tctgctggat
     1681 gagcctcgaa ggcctggagc tctactttct tgtggtgcgc gtgttccaag gccagggcct
     1741 gagtacgcgc tggctctgcc tgatcggcta tggcgtgccc ctgctcatcg tgggcgtctc
     1801 ggctgccatc tacagcaagg gctacggccg ccccagatac tgctggttgg actttgagca
     1861 gggcttcctc tggagcttct tgggacctgt gaccttcatc attttgtgca atgctgtcat
     1921 tttcgtgact accgtctgga agctcactca gaagttttct gaaatcaatc cagacatgaa
     1981 gaaattaaag aaggcgaggg cgctgaccat cacggccatc gcgcagctct tcctgttggg
     2041 ctgcacctgg gtctttggcc tgttcatctt cgacgatcgg agcttggtgc tgacctatgt
     2101 gtttaccatc ctcaactgcc tgcagggcgc cttcctctac ctgctgcact gcctgctcaa
     2161 caagaaggtt cgggaagaat accggaagtg ggcctgccta gttgctgggg ggagcaagta
     2221 ctcagaattc acctccacca cgtctggcac tggccacaat cagacccggg ccctcagggc
     2281 atcagagtcc ggcatatgaa ggcgcatggt tctggacggc ccagcagctc ctgtggccac
     2341 agcagctttg tacacgaaga ccatccatcc tcccttcgtc caccactcta ctccctccac
     2401 cctccctccc tgatcccgtg tgccaccagg agggagtggc agctatagtc tggcaccaaa
     2461 gtccaggaca cccagtgggg tggagtcgga gccactggtc ctgctgctgg ctgcctctct
     2521 gctccacctt gtgacccagg gtggggacag gggctggccc agggctgcaa tgcagcatgt
     2581 tgccctggca cctgtggcca gtactcggga cagactaagg gcgcttgtcc catcctggac
     2641 ttttcctctc atgtctttgc tgcagaactg aagagactag gcgctggggc tcagcttccc
     2701 tcttaagcta agactgatgt cagaggcccc atggcgaggc cccttggggc cactgcctga
     2761 ggctcacggt acagaggcct gccctgcctg gccgggcagg aggttctcac tgttgtgaag
     2821 gttgtagacg ttgtgtaatg tgtttttatc tgttaaaatt tttcagtgtt gacacttaaa
     2881 attaaacaca tgcatacaga aaaaaaaaaa aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: J00129. Human fibrinogen ...[gi:182429] Links  


LOCUS       HUMFBRB                 1883 bp    mRNA    linear   PRI 08-NOV-1994
DEFINITION  Human fibrinogen beta-chain mRNA, partial cds.
ACCESSION   J00129
VERSION     J00129.1  GI:182429
KEYWORDS    beta-fibrinogen; fibrin; fibrinogen; glycoprotein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1883)
  AUTHORS   Chung,D.W., Que,B.G., Rixon,M.W., Mace,M. Jr. and Davie,E.W.
  TITLE     Characterization of complementary deoxyribonucleic acid and genomic
            deoxyribonucleic acid for the beta chain of human fibrinogen
  JOURNAL   Biochemistry 22 (13), 3244-3250 (1983)
  MEDLINE   83283433
   PUBMED   6688356
COMMENT     Original source text: Human liver, cDNA to mRNA, clone
            pHI-beta-[1-6].
            The authors identified three potential translation initiation
            codons.  Two of these codons were located upstream of position 1,
            while the third is located at positions 19-21.  The exact
            initiation codon was not confirmed.
FEATURES             Location/Qualifiers
     source          1..1883
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q28"
     gene            1..1883
                     /gene="FGB"
     mRNA            <1..1883
                     /gene="FGB"
                     /note="b-fibrinogen mRNA (alt.) [2]"
     mRNA            <1..1621
                     /gene="FGB"
                     /note="b-fibrinogen mRNA (alt.)"
     mRNA            <1..1618
                     /gene="FGB"
                     /note="b-fibrinogen mRNA (alt.)"
     mRNA            <1..1552
                     /gene="FGB"
                     /note="b-fibrinogen mRNA (alt.)"
     CDS             <1..1452
                     /gene="FGB"
                     /note="beta-fibrinogen precursor"
                     /codon_start=1
                     /protein_id="AAA52429.1"
                     /db_xref="GI:182430"
                     /db_xref="GDB:G00-119-130"
                     /translation="FHKLKTMKHLLLLLLCVFLVKSQGVNDNEEGFFSARGHRPLDKK
                     REEAPSLRPAPPPISGGGYRARPAKAAATQKKVERKAPDAGGCLHADPDLGVLCPTGC
                     QLQEALLQQERPIRNSVDELNNNVEAVSQTSSSSFQYMYLLKDLWQKRQKQVKDNENV
                     VNEYSSELEKHQLYIDETVNSNIATNLRVLRSILENLRSKIQKLESDVSAQMEYCRTP
                     CTVSCNIPVVSGKECEEIIRKGGETSEMYLIQPDSSVKPYRVYCDMNTENGGWTVIQN
                     RQDGSVDFGRKWDPYKQGFGNVATNTDGKNYCGLPGEYWLGNDKISQLTRMGPTELLI
                     EMEDWKGDKVKAHYGGFTVQNEANKYQISVNKYRGTAGNALMDGASQLMGENRTMTIH
                     NGMFFSTYDRDNDGWLTSDPRKQCSKEDGGGWWYNRCHAANPNGRYYWGGQYTWDMAK
                     HGTDDGVVWMNWKGSWYSMRKMSMKIRPFFPQQ"
     sig_peptide     <1..66
                     /gene="FGB"
                     /note="beta-fibrinogen signal peptide"
     mat_peptide     67..1449
                     /gene="FGB"
                     /product="beta-fibrinogen"
BASE COUNT      612 a    351 c    431 g    489 t
ORIGIN      114 bp upstream of TaqI site; chromosome 4q31.
        1 ttccacaaac ttaaaaccat gaaacatcta ttattgctac tattgtgtgt ttttctagtt
       61 aagtcccaag gtgtcaacga caatgaggag ggtttcttca gtgcccgtgg tcatcgaccc
      121 cttgacaaga agagagaaga ggctcccagc ctgaggcctg ccccaccgcc catcagtgga
      181 ggtggctatc gggctcgtcc agccaaagca gctgccactc aaaagaaagt agaaagaaaa
      241 gcccctgatg ctggaggctg tcttcacgct gacccagacc tgggggtgtt gtgtcctaca
      301 ggatgtcagt tgcaagaggc tttgctacaa caggaaaggc caatcagaaa tagtgttgat
      361 gagttaaata acaatgtgga agctgtttcc cagacctcct cttcttcctt tcagtacatg
      421 tatttgctga aagacctgtg gcaaaagagg cagaagcaag taaaagataa tgaaaatgta
      481 gtcaatgagt actcctcaga actggaaaag caccaattat atatagatga gactgtgaat
      541 agcaatatcg caactaacct tcgtgtgctt cgttcaatcc tagaaaacct gagaagcaaa
      601 atacaaaagt tagaatctga tgtctcagct caaatggaat attgtcgcac cccatgcact
      661 gtcagttgca atattcctgt ggtgtctggc aaagaatgtg aggaaattat caggaaagga
      721 ggtgaaacat ctgaaatgta tctcattcaa cctgacagtt ctgtcaaacc gtatagagta
      781 tactgtgaca tgaatacaga aaatggagga tggacagtga ttcagaaccg tcaagacggt
      841 agtgttgact ttggcaggaa atgggatcca tataaacagg gatttggaaa tgttgcaacc
      901 aacacagatg ggaagaatta ctgtggccta ccaggtgaat attggcttgg aaatgataaa
      961 attagccagc ttaccaggat gggacccaca gaacttttga tagaaatgga ggactggaaa
     1021 ggagacaaag taaaggctca ctatggagga ttcactgtac agaatgaagc caacaaatac
     1081 cagatctcag tgaacaaata cagaggaaca gccggtaatg ccctcatgga tggagcatct
     1141 cagctgatgg gagaaaacag gaccatgacc attcacaacg gcatgttctt cagcacgtat
     1201 gacagagaca atgacggctg gttaacatca gatcccagaa aacagtgttc taaagaagac
     1261 ggtggtggat ggtggtataa tagatgtcat gcagccaatc caaacggcag atactactgg
     1321 ggtggacagt acacctggga catggcaaag catggcacag atgatggtgt agtatggatg
     1381 aattggaagg ggtcatggta ctcaatgagg aagatgagta tgaagatcag gcccttcttc
     1441 ccacagcaat agtccccaat acgtagattt ttgctcttct gtatgtgaca acatttttgt
     1501 acattatgtt attggaattt tctttcatac attatattcc tctaaaactc tcaagcagac
     1561 gtgagtgtga ctttttgaaa aaagtatagg ataaattaca ttaaaatagc acatgatttt
     1621 cttttgtttt cttcatttct cttgctcacc aagaagtaac aaaagtatag ttttgacaga
     1681 gttggtgttc ataatttcag ttctagttga ttgcgagaat tttcaaataa ggaagagggg
     1741 tcttttatcc ttgtcgtagg aaaaccatga cggaaaggaa aaactgatgt ttaaaagtcc
     1801 acttttaaaa ctatatttat ttatgtagga tctgtcaaag aaaacttcca aaaagattta
     1861 ttaattaaac cagactctgt tgc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M64983. Homo sapiens fibr...[gi:182597] Links  


LOCUS       HUMFIBRB                8878 bp    DNA     linear   PRI 18-MAY-2000
DEFINITION  Homo sapiens fibrinogen beta chain (FGB), complete cds.
ACCESSION   M64983
VERSION     M64983.1  GI:182597
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 8878)
  AUTHORS   Chung,D.W., Harris,J.E. and Davie,E.W.
  TITLE     Nucleotide sequences of the three genes coding for human fibrinogen
  JOURNAL   (in) Liu,C.Y. and Chien,S. (Eds.);
            FIBRINOGEN, THROMBOSIS, COAGULATION AND FIBRINOLYSIS: 39-48;
            Plenum Press, New York (1991)
FEATURES             Location/Qualifiers
     source          1..8878
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            <470..8878
                     /gene="FGB"
     mRNA            join(<470..583,3258..3449,3939..4122,5043..5270,
                     5831..5944,6633..6758,6967..7252,7871..8537)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
     mRNA            join(<470..583,3258..3449,3939..4122,5043..5270,
                     5831..5944,6633..6758,6967..7252,7871..8534)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
                     /note="putative"
     mRNA            join(470..583,3258..3449,3939..4122,5043..5270,5831..5944,
                     6633..6758,6967..7252,7871..8273)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
                     /note="putative"
     mRNA            join(<470..583,3258..3449,3939..4122,5043..5270,
                     5831..5944,6633..6758,6967..7252,7871..8268)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
                     /note="putative"
     mRNA            join(<470..583,3258..3449,3939..4122,5043..5270,
                     5831..5944,6633..6758,6967..7252,7871..8202)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
                     /note="putative"
     CDS             join(470..583,3258..3449,3939..4122,5043..5270,5831..5944,
                     6633..6758,6967..7252,7871..8102)
                     /gene="FGB"
                     /note="start codon not confirmed"
                     /codon_start=1
                     /product="fibrinogen beta chain"
                     /protein_id="AAA18024.2"
                     /db_xref="GI:7924018"
                     /translation="MKRMVSWSFHKLKTMKHLLLLLLCVFLVKSQGVNDNEEGFFSAR
                     GHRPLDKKREEAPSLRPAPPPISGGGYRARPAKAAATQKKVERKAPDAGGCLHADPDL
                     GVLCPTGCQLQEALLQQERPIRNSVDELNNNVEAVSQTSSSSFQYMYLLKDLWQKRQK
                     QVKDNENVVNEYSSELEKHQLYIDETVNSNIPTNLRVLRSILENLRSKIQKLESDVSA
                     QMEYCRTPCTVSCNIPVVSGKECEEIIRKGGETSEMYLIQPDSSVKPYRVYCDMNTEN
                     GGWTVIQNRQDGSVDFGRKWDPYKQGFGNVATNTDGKNYCGLPGEYWLGNDKISQLTR
                     MGPTELLIEMEDWKGDKVKAHYGGFTVQNEANKYQISVNKYRGTAGNALMDGASQLMG
                     ENRTMTIHNGMFFSTYDRDNDGWLTSDPRKQCSKEDGGGWWYNRCHAANPNGRYYWGG
                     QYTWDMAKHGTDDGVVWMNWKGSWYSMRKMSMKIRPFFPQQ"
     3'UTR           8103..8537
                     /gene="FGB"
     misc_feature    8538..8878
                     /gene="FGB"
                     /note="3' flanking sequence"
BASE COUNT     2912 a   1478 c   1663 g   2825 t
ORIGIN      
        1 gaattcatgc cccttttgaa atagacttat gtcattgtca gaaaacataa gcatttatgg
       61 tatatcatta atgagtcacg attttagtgg ttgccttgtg agtaggtcaa atttactaag
      121 cttagatttg ttttctcaca tattctttcg gagcttgtgt agtttccaca ttaatttacc
      181 agaaacaaga tacacactct ctttgaggag tgccctaact tcccatcatt ttgtccaatt
      241 aaatgaattg aagaaattta atgtttctaa actagaccaa caaagaataa tagttgtatg
      301 acaagtaaat aagctttgct gggaagatgt tgcttaaatg ataaaatggt tcagccaaca
      361 agtgaaccaa aaattaaata ttaactaagg aaaggtaacc atttctgaag tcattcctag
      421 cagaggactc agatatatat aggattgaag atctctcagt taagtctaca tgaaaaggat
      481 ggtttcttgg agcttccaca aacttaaaac catgaaacat ctattattgc tactattgtg
      541 tgtttttcta gttaagtccc aaggtgtcaa cgacaatgag gaggtgaatt ttttaaagca
      601 ttattatatt attagtagta ttattaatat aagatgtaac ataatcatat tatgtgctta
      661 ttttaatgaa attagcattg cttatagtta tgaaatggaa ttgttaacct ctgacttatt
      721 gtatttaaag aatgtttcat agtatttctt atataaaaac aaagtaattt cttgttttct
      781 agtttatcac ctttgttttc ttaagatgag gatggcttag ctaatgtaag atgtgttttt
      841 ctcacttgct attctgagta ctgtgatttt catttacttc tagcaataca ggattacaat
      901 taagaggaca agatctgaaa atctcacaaa ctataaaata ataaaagagc agaattttaa
      961 gataaaagaa actggtggta ggtagattgt tctttggtga aggaaggtaa tatatattgt
     1021 tactgagatt actatttata aaaattataa ctaagcctaa aagcaaaata catcaagtgt
     1081 aatgatagaa aatgaaatat tgcttttttc agatgaaaag ttcaaattag agttagtgtg
     1141 tattgttatt attaatagtt atgaaacacg gttcagtcta atttatttat ttgtagaaca
     1201 gtttgtcctc aactattatt tttgctgact tattgctgtt aatttgcagt tactaaaaat
     1261 acagaaatgc atttaggaca atggatattt aagaaattta aattttatca tcaaacgtat
     1321 catggccaaa tttcttacat atagcatagt atcattaaac tagaaataag aatacacaat
     1381 aatatttaaa tgaagtgatt catttcggat cattattgag tttcaaggga acttgagtgt
     1441 tgtacttatc agactctaca tgtaagaaca tatagttaat ctggttgtgt gtgtaaaaac
     1501 atatggttaa tctggttaag tctggttaat catattaggt aagaaaaatg taaagaatgt
     1561 gtaagacgaa atttttgtaa agtactctgc aaagcacttt cacatttctg cttatcaact
     1621 aaacctcaca gagatagttt aatagtttag gctttaaaat ggattttgat tattcaacaa
     1681 gtggccttca taatttcttt aagtgttttt ctttaagtat atactttctt taaatatttt
     1741 ttaaaatttc cttttctcta gtaaagccag accatccatg ctacctctct agtggcactc
     1801 tgaaataaaa agaaaatagt tttctctgtt ataattgtat ttgtaataag cagatgaatc
     1861 acatttctta aaatttgttt tagagagggt aagctctgac taggaccatg acttcaatgt
     1921 gaaatatgta tatatcctcc gaatctttac atattaagaa tgtatatagt caactggtta
     1981 aacaggaaaa tctggaacag cctggctggg ttttaatctt agcaccatcc tactaaatgt
     2041 taaataatat tataatctaa tgaataaatg acaatgcaat tccaaataga gttcatctga
     2101 tgacttctag actcacaaaa ttgcaagaga gctcagttgt tgctcagttg ttccaaatca
     2161 tgtcgtttgt taatttgtaa ttaagctcca aaggatgtat agctactgac aaaaaaaaaa
     2221 atgagaatgt agttaatcca aatcaaaact ttcctattgc aatgcgtatt ttctgcttca
     2281 ttatccttta atataatatt ttaagttagc aagtaatttt aattacaatg cacaagcctt
     2341 gagaattatt ttaaatataa gaaaatcata atgtttgata aagaaatcat gtaagaaatt
     2401 tcaagataat ggtttaacaa ataattttgt tgatagaaga taagactaaa agtgaaattc
     2461 gaagtggaga ggacacttaa actgtagtac ttgttatgtg tgattccagt aaaaatagta
     2521 atgagcactt attattgcca agtactgttc tgagggtacc atatgcaata agttatttaa
     2581 tccttacaat aatcttgtaa ggcagattca aactatcatt acacttattt tacagatgag
     2641 aaaactgggg cacagataaa gcaacttgcc caaggtctca tagctgtaag tcaaccctac
     2701 ggtcaagacc tacaagtagc cgagctccag agtacattat gagggtcaaa gattgtctta
     2761 ttacaaataa attccaagta gaatcaacct ttaataagtc tttaatgtct cttaaatatg
     2821 tttatatagg agtctaatca ccaattcaca aaaatgaaag tagggaaatg attaacaata
     2881 atcataggaa tctaacaatc caagtggctt gagaatattc attcttcttg acagtataga
     2941 ttctttacaa tttcgtaagt tccaatgtat gttttaggaa tatgaggtca ttactattca
     3001 taatctgata cagctttatc ctaaggcctc tctttaaaaa ctacactgca tcatagcttt
     3061 tttgtgcagt tggtctttct actgttactg aacagtaagc aacctacaga ttcactatca
     3121 ccaaccagcc agttgatgga tcttaagcaa attatcaagc ttgtgataac ctaaattata
     3181 aaatgagggt gttggaatag ttacattcca aatcttctat aacactctgt attatatttc
     3241 tgcctcattc cttgtagggt ttcttcagtg cccgtggtca tcgacccctt gacaagaaga
     3301 gagaagaggc tcccagcctg aggcctgccc caccgcccat cagtggaggt ggctatcggg
     3361 ctcgtccagc caaagcagct gccactcaaa agaaagtaga aagaaaagcc cctgatgctg
     3421 gaggctgtct tcacgctgac ccagacctgg tgggtgcact gatgtttctt gcagtggtgg
     3481 ctctctcatg cagagaaagc ctgtagtcat ggcagtctgc taatgtttca ctgacccaca
     3541 ttaccatcac tgttattttg tttgtttatt ttggaaataa aattcaaaac ataaacatat
     3601 tgggcctttg gtttaggctt tctttcttgt tttctttggt ctgggcccaa aatttcaaat
     3661 taggatatgt gggtgccacc tttccatttg tattttgcca ctgcctttgt ttagttggta
     3721 aaattttcat agcccaatta tattttttct ggggtaagta atattttaaa tctctatgag
     3781 agtatgatga tgactttcga atttctggtc ttacagaaaa ccaaataata aatttttatg
     3841 ttggctaatc gtatcgctga attttcctat gtgctatttt aacaaatgtc catgacccaa
     3901 atccttcatc taatgcctgc tattttcttt gtttttaggg ggtgttgtgt cctacaggat
     3961 gtcagttgca agaggctttg ctacaacagg aaaggccaat cagaaatagt gttgatgagt
     4021 taaataacaa tgtggaagct gtttcccaga cctcctcttc ttcctttcag tacatgtatt
     4081 tgctgaaaga cctgtggcaa aagaggcaga agcaagtaaa aggtagatat ccttgtgctt
     4141 tccattcgat tttcagctat aaaattggaa ccgttagact gccacgagaa tgcatggttg
     4201 tgagaagatt aacatttctg ggttagtgaa tagcattcat acgcttttgg gcaccttccc
     4261 ctgcaacttg ccagataagc actattcagc tcttattccc agtctgacat cagcaagtgt
     4321 gattttctat gaaaaattct actatgactc cttattttaa gtatacaaga aacttgtgac
     4381 tcagaagata atatttacag agtggaaaaa aacccctagc atttatagtt ttaacatttg
     4441 aggttttgaa tgagagagtt atccataata tattcaattg tgttgtggat aatgacacct
     4501 aacctgtgaa tcttgaggtc agaatgttga gtgctgttga cttggtggtc aggaaacagc
     4561 tagtgcgtga gcctggcaca ggcatctcag tgagtagcat acccacagtt ggaaattttt
     4621 caaagaaatc aaaggaatca tgacatctta taaatttcaa ggttctgcta tacttatgtg
     4681 aaatggataa ataaatcaag catatccact ctgtaagatt gaacttctca gatggaagac
     4741 cccaatactg ctttctcctc ttttccctca ccaaagaaat aaacaaccta tttcatttat
     4801 tactggacac aatctttagc gtatacctat ggtaaattac tagtatggtg gttaggattt
     4861 atgttaattt gtatatgtca tgcgccaaat catttccact aaatatgact atatatcata
     4921 actgcttggt gatagctcag tgtttaatag tttattctca gaaaatcaaa attgtatagt
     4981 taaatacatt agttttatga ggcaaaaatg ctaactattt ctacataatt tcatttttcc
     5041 agataatgaa aatgtagtca atgagtactc ctcagaactg gaaaagcacc aattatatat
     5101 agatgagact gtgaatagca atatcccaac taaccttcgt gtgcttcgtt caatcctgga
     5161 aaacctgaga agcaaaatac aaaagttaga atctgatgtc tcagctcaaa tggaatattg
     5221 tcgcacccca tgcactgtca gttgcaatat tcctgtggtg tctggcaaag gtaactgatt
     5281 cataaacata tttttagaga gttccagaag aactcacaca ccaaaaataa gagaacaaca
     5341 acaacaacaa aaatgctaag tggattttcc caacagatca taatgacatt acagtacatc
     5401 ataaaaatat ccttagccag ttgtgttttg gactggcctg gtgcatttgc tggttttgat
     5461 gagcaggatg gggcacaggt agtcccaggg gtggctgatg tgtgcatctg cgtactggct
     5521 tgaacagatg gcagaaccac agatagatgt agaagtttct ccattttgtg tgttctggga
     5581 gctcatggat attccaggac acaaaaggtg gagaagagct ttgttcatcc tcttagcaga
     5641 taaacgtcct caaaactggg ttggacttac taaagtaaaa tgaaaatcta atatttgtta
     5701 tattattttc aaaggtctat aataacacac tccttagtaa cttatgtaat gttattttaa
     5761 agaattggtg actaaataca aagtaattat gtcataaacc cctgaacata atgttgtctt
     5821 acatttgcag aatgtgagga aattatcagg aaaggaggtg aaacatctga aatgtatctc
     5881 attcaacctg acagttctgt caaaccgtat agagtatact gtgacatgaa tacagaaaat
     5941 ggaggtaagc tttcgacagt tgttgacctg ttgatctgta attatttgga taccgtaaaa
     6001 tgccaggaaa caaggccagg tgtggtggct catacctgta attccagcac cttgggaggc
     6061 caaagtgggc tgatagcttg agcctaggag tttgaaacta gcctgggcaa cataatgaga
     6121 ccctaactct acaaaaaaaa aaaaaatacc aaaaaaaaaa aaaaaatcag ctgtgttggt
     6181 agtatgtgcc tgtagtccca gctatccagg aggctgagat gggagatcac ctgagcccac
     6241 aacctggagt cttgatcatg ctactgaact gtagcctggg caacagagga tagtgagatc
     6301 ctgtctcaaa aaaaaaaatt aattaaaaag ccaggaaaca agacttagct ctaacatcta
     6361 acatagctga caaaggagta atttgatgtg gaattcaacc tgatatttaa aagttataaa
     6421 atatctataa ttcacaattt ggggtaagat aaagcacttg cagtttccaa agattttaca
     6481 agtttacctc tcatatttat ttccttattg tgtctatttt agagcaccaa atatatacta
     6541 aatggaatgg acaggggatt cagatattat tttcaaagtg acattatttg ctgttggtta
     6601 atatatgctc tttttgtttc tgtcaaccaa aggatggaca gtgattcaga accgtcaaga
     6661 cggtagtgtt gactttggca ggaaatggga tccatataaa cagggatttg gaaatgttgc
     6721 aaccaacaca gatgggaaga attactgtgg cctaccaggt aacgaacagg catgcaaaat
     6781 aaaatcattc tatttgaaat gggatttttt ttaattaaaa aacattcatt gttggaagcc
     6841 tgttttaggc agttaagagg agtttcctga caaaaatgtg gaagctaaag ataagggaag
     6901 aaaggcagtt tttagtttcc caaaatttta tttttggtga gagattttat tttgtttttc
     6961 ttttaggtga atattggctt ggaaatgata aaattagcca gcttaccagg atgggaccca
     7021 cagaactttt gatagaaatg gaggactgga aaggagacaa agtaaaggct cactatggag
     7081 gattcactgt acagaatgaa gccaacaaat accagatctc agtgaacaaa tacagaggaa
     7141 cagccggtaa tgccctcatg gatggagcat ctcagctgat gggagaaaac aggaccatga
     7201 ccattcacaa cggcatgttc ttcagcacgt atgacagaga caatgacggc tggtatgtgt
     7261 ggcactcttt gctcctgctt taaaaatcac actaatatca ttactcagaa tcattaacaa
     7321 tatttttaat agctaccact tcctgggcac ttactgtcag ccactgtcct aagctcttta
     7381 tgcatcactc gaaagcattt caactataag gtagacattc ttattctcat tttacagatg
     7441 agatttagag agattacgtg atttgtccaa tgtcacacaa ctacccagag ataaaactag
     7501 aatttgagca cagttacttt ctgaataatg agcatttaga taaataccta tatctctata
     7561 ttctaaagtg tgtgtgaaaa ctttcatttt catttccagg gttctctgat actaagggtt
     7621 gtaaaagcta ttattccagt ataaagtaac aaacacagtc cctagatgga ttgccacaaa
     7681 ggcccagtta tctctctttc ttgctatagg gcacaggagg tctttggtgt attagtgtga
     7741 ctctatgtat agcacccaaa ggaaagacta ctgtgcacac gagtgtagca gtcttttatg
     7801 ggtaatctgc aaaacgtaac ttgaccaccg tagttctgtt tctaataacg ccaaacacat
     7861 tttctttcag gttaacatca gatcccagaa aacagtgttc taaagaagac ggtggtggat
     7921 ggtggtataa tagatgtcat gcagccaatc caaacggcag atactactgg ggtggacagt
     7981 acacctggga catggcaaag catggcacag atgatggtgt agtatggatg aattggaagg
     8041 ggtcatggta ctcaatgagg aagatgagta tgaagatcag gcccttcttc ccacagcaat
     8101 agtccccaat acgtagattt ttgctcttct gtatgtgaca acatttttgt acattatgtt
     8161 attggaattt tctttcatac attatattcc tctaaaactc tcaagcagac gtgagtgtga
     8221 ctttttgaaa aaagtatagg ataaattaca ttaaaatagc acatgatttt cttttgtttt
     8281 cttcatttct cttgctcacc caagaagtaa caaaagtata gttttgacag agttggtgtt
     8341 cataatttca gttctagttg attgcgagaa ttttcaaata aggaagaggg gtcttttatc
     8401 cttgtcgtag gaaaaccatg acggaaagga aaaactgatg tttaaaagtc cacttttaaa
     8461 actatattta tttatgtagg atctgtcaaa gaaaacttcc aaaaagattt attaattaaa
     8521 ccagactctg ttgcaataag ttaatgtttt cttgttttgt aatccacaca ttcaatgagt
     8581 taggctttgc acttgtaagg aaggagaagc gttcacaacc tcaaatagct aataaaccgg
     8641 tcttgaatat ttgaagattt aaaatctgac tctaggacgg gcacggtggc tcacgactat
     8701 aatcccaaca ctttgggagg ctgaggcggg cggtcacaag gtcaggagtt caagaccagc
     8761 ctgaccaata tggtgaaacc ccatctctac taaaaataca aaaattagcc aggcgtggtg
     8821 gcaggtgcct gtaggtccca gctagcctgt gaggtggaga ttgcattgag ccaagatc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC000055. Homo sapiens, fol...[gi:12652618] Links  


LOCUS       BC000055                3714 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, follistatin-like 1, clone MGC:1993 IMAGE:3505833,
            mRNA, complete cds.
ACCESSION   BC000055
VERSION     BC000055.1  GI:12652618
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3714)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-NOV-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Rachel Dickhoff, Jessica Fahey, Stephanie Ford, Julia
            Greene, Mark Ketteman and Anuradha Madan
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 7 Row: n Column: 19
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 3184392.
FEATURES             Location/Qualifiers
     source          1..3714
                     /organism="Homo sapiens"
                     /db_xref="LocusID:11167"
                     /db_xref="taxon:9606"
                     /clone="MGC:1993 IMAGE:3505833"
                     /tissue_type="Kidney, renal cell adenocarcinoma"
                     /clone_lib="NIH_MGC_14"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             92..1018
                     /codon_start=1
                     /product="follistatin-like 1"
                     /protein_id="AAH00055.1"
                     /db_xref="GI:12652619"
                     /translation="MWKRWLALALALVAVAWVRAEEELRSKSKICANVFCGAGRECAV
                     TEKGEPTCLCIEQCKPHKRPVCGSNGKTYLNHCELHRDACLTGSKIQVDYDGHCKEKK
                     SVSPSASPVVCYQSNRDELRRRIIQWLEAEIIPDGWFSKGSNYSEILDKYFKNFDNGD
                     SRLDSSEFLKFVEQNETAINITTYPDQENNKLLRGLCVDALIELSDENADWKLSFQEF
                     LKCLNPSFNPPEKKCALEDETYADGAETEVDCNRCVCACGNWVCTAMTCDGKNQKGAQ
                     TQTEEEMTRYVQELQKHQETAEKTKRVSTKEI"
BASE COUNT     1042 a    824 c    837 g   1011 t
ORIGIN      
        1 ggcacgaggg atcggcggag ctcccacctc cgcttacagc tcgctgccgc cgtcctgccc
       61 cgcgccccca ggagacctgg accagaccac gatgtggaaa cgctggctcg cgctcgcgct
      121 cgcgctggtg gcggtcgcct gggtccgcgc cgaggaagag ctaaggagca aatccaagat
      181 ctgtgccaat gtgttttgtg gagccggccg ggaatgtgca gtcacagaga aaggggaacc
      241 cacctgtctc tgcattgagc aatgcaaacc tcacaagagg cctgtgtgtg gcagtaatgg
      301 caagacctac ctcaaccact gtgaactgca tcgagatgcc tgcctcactg gatccaaaat
      361 ccaggttgat tacgatggac actgcaaaga gaagaaatcc gtaagtccat ctgccagccc
      421 agttgtttgc tatcagtcca accgtgatga gctccgacgt cgcatcatcc agtggctgga
      481 agctgagatc attccagatg gctggttctc taaaggcagc aactacagtg aaatcctaga
      541 caagtatttt aagaactttg ataatggtga ttctcgcctg gactccagtg aattcctgaa
      601 gtttgtggaa cagaatgaaa ctgccatcaa tattacaacg tatccagacc aggagaacaa
      661 caagttgctt aggggactct gtgttgatgc tctcattgaa ctgtctgatg aaaatgctga
      721 ttggaaactc agcttccaag agtttctcaa gtgcctcaac ccatctttca accctcctga
      781 gaagaagtgt gccctggagg atgaaacgta tgcagatgga gctgagaccg aggtggactg
      841 taaccgctgt gtctgtgcct gtggaaattg ggtctgtaca gccatgacct gtgacggaaa
      901 gaatcagaag ggggcccaga cccagacaga ggaggagatg accagatatg tccaggagct
      961 ccaaaagcat caggaaacag ctgaaaagac caagagagtg agcaccaaag agatctaatg
     1021 aggaggcaca gaccagtgtc tggatcccag catcttctcc acttcagcgc tgagttcagt
     1081 atacacaagt gtctgctaca gtcgccaaat caccagtatt tgcttatata gcaatgagtt
     1141 ttattttgtt tatttgtttt gcaataaagg atatgaaggt ggctggctag gaagggaagg
     1201 gccacagcct tcatttctag gagtgcttta agagaaactg taaatggtgc tctggggctg
     1261 gaggctagta aggaaactgc atcacgattg aaagaggaac agacccaaat ctgaacctct
     1321 tttgagttta ctgcatctgt cagcaggctg cagggagtgc acacgatgcc agagagaact
     1381 tagcagggtg tccccggagg agaggtttgg gaagctccac ggagaggaac gctctctgct
     1441 tccagcctct ttccattgcc gtcagcatga cagacctcca gcatccacgc atctcttggt
     1501 cccaataact gcctctagat acatagccat actgctagtt aacccagtgt ccctcagact
     1561 tggatggagt ttctgggagg gtacacccaa atgatgcaga tacttgtata ctttgagccc
     1621 cttagcgacc taaccaaatt ttaaaaatac tttttaccaa aggtgctatt tctctgtaaa
     1681 acactttttt tttggcaagt tgactttatt cttcaattat tatcattata ttattgtttt
     1741 ttaatatttt attttcttga ctaggtatta agcttttgta attatttttc agtagtccca
     1801 ccacttcata ggtggaagga gtttggggtt cttcctggtg caggggctga aataacccag
     1861 atgcccccac cctgccacat actagatgca gcccatagtt ggccccccta gcttccagca
     1921 gtccactatc tgccagagga gcaagggtgc cttagaccga agccagggga agaagcatct
     1981 tcataaaaaa ctttcaagat ccaaacatta atttgttttt atttattctg agaagttgag
     2041 gcaaatcagt attcccaagg atggcgacaa gggcagccaa gcagggctta ggatatccca
     2101 gcctaccaat atgctcattc gactaactag gagggtgagt tggccctgtc tcttcttttt
     2161 tctggacctc agtttcctca gtgagctggt aagaatgcac taaccttttg atttgataag
     2221 ttataaattc tgtggttctg atcattggtc cagaggggag ataggttcct gtgatttttc
     2281 cttcttctct atagaataaa tgaaatcttg ttactagaac aagaaatgtc agatggccaa
     2341 aaacaagatg accagatttg atctcagcct gatgacccta caggtcgtgc tatgatatgg
     2401 agtcctcatg ggtaaagcag gaagagagtg ggaaagagaa ccaccccact ctgtcttcat
     2461 atttgcattt catgtttaac ctccggctgg aaatagaaag cattccctta gagatgagga
     2521 taaaagaaag tttcagattc aacaggggga agaaaatgga gatttaatcc taaaactgtg
     2581 acttggggag gtcagtcatt tacagttagt cctgtgtctt tcgacttctg tgattattaa
     2641 ccccactcac taccctgttt cagatgcatt tggaatacca aagattaaat ccttgacata
     2701 agatctcatt tgcagaaagc agattaaaga ccatcagaag gaaattattt aggttgtaat
     2761 gcacaggcaa ctgtgagaaa ctgttgtgcc aaaaatagaa ttccttctag tttttcttgt
     2821 tctcatttga aaggagaaaa ttccactttg tttagcattt caagctttta tgtatccatc
     2881 ccatctaaaa actcttcaaa ctccacttgt tcagtctgaa atgcagctcc ctgtccaagt
     2941 gccttggaga actcacagca gcacgcctta atcaaaggtt ttaccagccc ttggacacta
     3001 tgggaggagg gcaagagtac accaatttgt taaaagcaag aaaccacagt gtctcttcac
     3061 tagtcattta gaacatggtt atcatccaag actactctac cctgcaacat tgaactccca
     3121 agagcaaatc cacattcctc ttgagttctg cagcttctgt gtaaataggg cagctgtcgt
     3181 ctatgccgta gaatcacatg atctgaggac cattcatgga agctgctaaa tagcctagtc
     3241 tggggagtct tccataaagt tttgcatgga gcaaacaaac aggattaaac taggtttggt
     3301 tccttcagcc ctctaaaagc atagggctta gcctgcaggc ttccttgggc tttctctgtg
     3361 tgtgtagttt tgtaaacact atagcatctg ttaagatcca gtgtccatgg aaacattccc
     3421 acatgccgtg actctggact atatcagttt ttggaaagca gggttcctct gcctgctaac
     3481 aagcccacgt ggaccagtct gaatgtcttt cctttacacc tatgttttta agtagtcaaa
     3541 cttcaagaaa caatctaaac aagtttctgt tgcatatgtg tttgtgaact tgtatttgta
     3601 tttagtaggc ttctatattg catttaactt gtttttgtaa ctcctgattc ttccttttcg
     3661 gatactattg atgaataaag aaattaaagt gaaaaaaaaa aaaaaaaaaa aaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D78152. Human mRNA for an...[gi:1060889] Links  


LOCUS       HUMAIV                  1971 bp    mRNA    linear   PRI 05-FEB-1999
DEFINITION  Human mRNA for annexin IV (carbohydrate-binding protein p33/41),
            complete cds.
ACCESSION   D78152
VERSION     D78152.1  GI:1060889
KEYWORDS    annexin IV (carbohydrtate-binding protein p33/41).
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Satoh,A., Takayama,E., Kojima,K., Ogawa,H., Yamori,T., Sato,S.,
            Kawaguchi,T., Tsuruo,T., Katsura,Y., Kina,T. and Matsumoto,I.
  TITLE     Expression of carbohydrate-binding protein p33/41 in human tumor
            cell lines
  JOURNAL   J. Biochem. 119 (2), 346-353 (1996)
  MEDLINE   97037082
REFERENCE   2  (bases 1 to 1971)
  AUTHORS   Satoh,A., Takayama,E., Kojima,K., Ogawa,H., Katsura,Y., Kina,T. and
            Matsumoto,I.
  TITLE     Expression of carbohydrate-binding protein p33/41 in human tumor
            cell lines
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 1971)
  AUTHORS   Satoh,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-OCT-1995) Ayano Satoh, Ochanomizu University,
            Chemistry; 2-1-1 Ohtsuka, Bunkyo-ku, Tokyo 112, Japan
            (E-mail:ayano@fs.cc.ocha.ac.jp, Tel:03-5978-5345, Fax:03-5978-5344)
FEATURES             Location/Qualifiers
     source          1..1971
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             63..1028
                     /codon_start=1
                     /evidence=experimental
                     /product="annexin IV (carbohydrtate-binding protein
                     p33/41)"
                     /protein_id="BAA11227.1"
                     /db_xref="GI:1060890"
                     /translation="MAMATKGGTVKAASGFNAMEDAQTLRKAMKGLGTDEDAIISVLA
                     YRNTAQRQEIRTAYKSTIGRDLIDDLKSELSGNFEQVIVGMMTPTVLYDVQELRRAMK
                     GAGTDEGCLIEILASRTPEEIRRISQTYQQQYGRSLEDDIRSDTSFMFQRVLVSLSAG
                     GRDEGNYLDDALVRQDAQDLYEAGEKKWGTDEVKFLTVLCSRNRNHLLHVFDEYKRIS
                     QKDIEQSIKSETSGSFEDALLAIVKCMRNKSAYFAEKLYKSMKGLGTDDNTLIRVMVS
                     RAEIDMLDIRAHFKRLYGKSLYSFIKGDTSGDYRKVLLVLCGGDD"
     polyA_signal    1953..1958
BASE COUNT      593 a    379 c    415 g    584 t
ORIGIN      
        1 gcgcacgccg gcctcgaaga acttctgctt gggtggctga actctgatct tgacctagag
       61 tcatggccat ggcaaccaaa ggaggtactg tcaaagctgc ttcaggattc aatgccatgg
      121 aagatgccca gaccctgagg aaggccatga aagggctcgg caccgatgaa gacgccatta
      181 ttagcgtcct tgcctaccgc aacaccgccc agcgccagga gatcaggaca gcctacaaga
      241 gcaccatcgg cagggacttg atagacgacc tgaagtcaga actgagtggc aacttcgagc
      301 aggtgattgt ggggatgatg acgcccacgg tgctgtatga cgtgcaagag ctgcgaaggg
      361 ccatgaaggg agccggcact gatgagggct gcctaattga gatcctggcc tcccggaccc
      421 ctgaggagat ccggcgcata agccaaacct accagcagca atatggacgg agccttgaag
      481 atgacattcg ctctgacaca tcgttcatgt tccagcgagt gctggtgtct ctgtcagctg
      541 gtgggaggga tgaaggaaat tatctggacg atgctctcgt gagacaggat gcccaggacc
      601 tgtatgaggc tggagagaag aaatggggga cagatgaggt gaaatttcta actgttctct
      661 gttcccggaa ccgaaatcac ctgttgcatg tgtttgatga atacaaaagg atatcacaga
      721 aggatattga acagagtatt aaatctgaaa catctggtag ctttgaagat gctctgctgg
      781 ctatagtaaa gtgcatgagg aacaaatctg catattttgc tgaaaagctc tataaatcga
      841 tgaagggctt gggcaccgat gataacaccc tcatcagagt gatggtttct cgagcagaaa
      901 ttgacatgtt ggatatccgg gcacacttca agagactcta tggaaagtct ctgtactcgt
      961 tcatcaaggg tgacacatct ggagactaca ggaaagtact gcttgttctc tgtggaggag
     1021 atgattaaaa taaaaatccc agaaggacag gaggattctc aacactttga atttttttaa
     1081 cttcattttt ctacactgct attatcatta tctcagaatg cttatttcca attaaaacgc
     1141 ctacagctgc ctcctagaat atagactgtc tgtattatta ttcacctata attagtcatt
     1201 atgatgcttt aaagctgtac ttgcatttca aagcttataa gatataaatg gagattttaa
     1261 agtagaaata aatatgtatt ccatgttttt aaaagattac tttctacttt gtgtttcaca
     1321 gacattgaat atattaaatt attccatatt ttcttttcag tgaaaaattt tttaaatgga
     1381 agactgttct aaaatcactt ttttccctaa tccaattttt agagtggcta gtagtttctt
     1441 catttgaaat tgtaagcatc cggtcagtaa gaatgcccat ccagttttct atatttcata
     1501 gtcaaagcct tgaaagcatc tacaaatctc tttttttagg ttttgtccat agcatcagtt
     1561 gatccttact aagtttttca tgggagactt ccttcatcac atcttatgtt gaaatcactt
     1621 tctgtagtca aagtatacca aaaccaattt atctgaacta aattctaaag tatggttata
     1681 caaaccatat acatctggtt accaaacata aatgctgaac attccatatt attatagtta
     1741 atgtcttaat ccagcttgca agtgaatgga aaaaaaaata agcttcaaac taggtattct
     1801 gggaatgatg taatgctctg aatttagtat gatataaaga aaactttttt gtgctaaaaa
     1861 tactttttaa aatcaatttt gttgattgta gtaatttcta tttgcactgt gcctttcaac
     1921 tccagaaaca ttctaagatg tacttggatt taattaaaaa gttcactttg t
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M82809. Human annexin IV ...[gi:178698] Links  


LOCUS       HUMANX4A                1976 bp    mRNA    linear   PRI 31-OCT-1994
DEFINITION  Human annexin IV (ANX4) mRNA, complete cds.
ACCESSION   M82809
VERSION     M82809.1  GI:178698
KEYWORDS    annexin IV; chromobindin 4; placental anticoagulant protein II.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1976)
  AUTHORS   Tait,J.F., Smith,C., Frankenberry,D.A., Miao,C.H., Adler,D.A. and
            Disteche,C.M.
  TITLE     Chromosomal mapping of the human annexin IV (ANX4) gene
  JOURNAL   Genomics 12 (2), 313-318 (1992)
  MEDLINE   92155721
   PUBMED   1346776
COMMENT     Original source text: Homo sapiens placenta cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..1976
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="Unassigned"
                     /tissue_type="placenta"
     gene            1..1976
                     /gene="ANX4"
     CDS             74..1039
                     /gene="ANX4"
                     /function="calcium-dependent phospholipid binding protein"
                     /standard_name="annexin IV"
                     /codon_start=1
                     /evidence=experimental
                     /product="annexin IV (placental anticoagulant protein II)"
                     /protein_id="AAA51740.1"
                     /db_xref="GI:178699"
                     /db_xref="GDB:G00-131-395"
                     /translation="MAMATKGGTVKAASGFNAMEDAQTLRKAMKGLGTDEDAIISVLA
                     YRNTAQRQEIRTAYKSTIGRDLIDDLKSELSGNFEQVIVGMMTPTVLYDVQELRRAMK
                     GAGTDEGCLIEILASRTPEEIRRISQTYQQQYGRSLEDDIRSDTSFMFQRVLVSLSAG
                     GRDEGNYLDDALVRQDAQDLYEAGEKKWGTDEVKFLTVLCSRNRNHLLHVFDEYKRIS
                     QKDIEQSIKSETSGSFEDALLAIVKCMRNKSAYFAEKLYKSMKGLGTDDNTLIRVMVS
                     RAEIDMLDIRAHFKRLYGKSLYSFIKGDTSGDYRKVLLVLCGGDD"
BASE COUNT      597 a    379 c    420 g    580 t
ORIGIN      
        1 gcagaggagg agcgcacgcc ggcctcgaag aacttctgct tgggtggctg aactctgatc
       61 ttgacctaga gtcatggcca tggcaaccaa aggaggtact gtcaaagctg cttcaggatt
      121 caatgccatg gaagatgccc agaccctgag gaaggccatg aaagggctcg gcaccgatga
      181 agacgccatt attagcgtcc ttgcctaccg caacaccgcc cagcgccagg agatcaggac
      241 agcctacaag agcaccatcg gcagggactt gatagacgac ctgaagtcag aactgagtgg
      301 caacttcgag caggtgattg tggggatgat gacgcccacg gtgctgtatg acgtgcaaga
      361 gctgcgaagg gccatgaagg gagccggcac tgatgagggc tgcctaattg agatcctggc
      421 ctcccggacc cctgaggaga tccggcgcat aagccaaacc taccagcagc aatatggacg
      481 gagccttgaa gatgacattc gctctgacac atcgttcatg ttccagcgag tgctggtgtc
      541 tctgtcagct ggtgggaggg atgaaggaaa ttatctggac gatgctctcg tgagacagga
      601 tgcccaggac ctgtatgagg ctggagagaa gaaatggggg acagatgagg tgaaatttct
      661 aactgttctc tgttcccgga accgaaatca cctgttgcat gtgtttgatg aatacaaaag
      721 gatatcacag aaggatattg aacagagtat taaatctgaa acatctggta gctttgaaga
      781 tgctctgctg gctatagtaa agtgcatgag gaacaaatct gcatattttg ctgaaaagct
      841 ctataaatcg atgaagggct tgggcaccga tgataacacc ctcatcagag tgatggtttc
      901 tcgagcagaa attgacatgt tggatatccg ggcacacttc aagagactct atggaaagtc
      961 tctgtactcg ttcatcaagg gtgacacatc tggagactac aggaaagtac tgcttgttct
     1021 ctgtggagga gatgattaaa ataaaaatcc cagaaggaca ggaggattct caacactttg
     1081 aattttttta acttcatttt tctacactgc tattatcatt atctcagaat gcttatttcc
     1141 aattaaaacg cctacagctg cctcctagaa tatagactgt ctgtattatt attcacctat
     1201 aattagtcat tatgatgctt taaagctgta cttgcatttc aaagcttata agatataaat
     1261 ggagatttta aagtagaaat aaatatgtat tccatgtttt taaaagatta ctttctactt
     1321 tgtgtttcac agacattgaa tatattaaat tattccatat tttcttttca gtgaaaaatt
     1381 ttttaaatgg aagactgttc taaaatcact tttttcccta atccaatttt tagagtggct
     1441 agtagtttct tcatttgaaa ttgtaagcat ccggtcagta agaatgccca tccagttttc
     1501 tatatttcat agtcaaagcc ttgaaagcat ctacaaatct ctttttttag gttttgtcca
     1561 tagcatcagt tgatccttac taagtttttc atgggagact tccttcatca catcttatgt
     1621 tgaaatcact ttctgtagtc aaagtatacc aaaaccaatt tatctgaact aaattctaaa
     1681 gtatggttat acaaaccata tacatctggt taccaaacat aaatgctgaa cattccatat
     1741 tattatagtt aatgtcttaa tccagcttgc aagtgaatgg aaaaaaaaat aagcttcaaa
     1801 ctaggtattc tgggaatgat gtaatgctct gaatttagta tgatataaag aaaacttttt
     1861 tgtgctaaaa atacttttta aaatcaattt tgttgattgt agtaatttct atttgcactg
     1921 tgcctttcaa ctccagaaac attctaagat gtacttggat ttaattaaaa agttca
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC013144. Homo sapiens, clo...[gi:15341915] Links  


LOCUS       BC013144                3149 bp    mRNA    linear   PRI 29-AUG-2001
DEFINITION  Homo sapiens, clone MGC:21388 IMAGE:4475866, mRNA, complete cds.
ACCESSION   BC013144
VERSION     BC013144.1  GI:15341915
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3149)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: villalon@bcm.tmc.edu.
            Villalon, D.K., Luna, R.A., Hale, S.M., Hulyk, S., Lu, X., Garcia,
            A.M., Holloway, M., Telford, B, Hodgson, A., Bouck, J., Yu, W.,
            Muzny,D.M., Gibbs,R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 28 Row: d Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: GenomeScan gene
            prediction, Similarity but not identity to protein.
FEATURES             Location/Qualifiers
     source          1..3149
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:21388 IMAGE:4475866"
                     /tissue_type="Prostate, adenocarcinoma."
                     /clone_lib="NIH_MGC_91"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     CDS             4..2397
                     /codon_start=1
                     /product="Unknown (protein for MGC:21388)"
                     /protein_id="AAH13144.1"
                     /db_xref="GI:15341916"
                     /translation="MAAAPVAAGSGAGRGRRSAATVAAWGGWGGRPRPGNILLQLRQG
                     QLTGRGLVRAVQFTETFLTERDKQSKWSGIPQLLLKLHTTSHLHSDFVECQNILKEIS
                     PLLSMEAMAFVTEERKLTQETTYPNTYIFDLFGGVDLLVEILMRPTISIRGQKLKISD
                     EMSKDCLSILYNTCVCTEGVTKRLAEKNDFVIFLFTLMTSKKTFLQTATLIEDILGVK
                     KEMIRLDEVPNLSSLVSNFDQQQLANFCRILAVTISEMDTGNDDKHTLLAKNAQQKKS
                     LSLGPSAAEINQAALLSIPGFVERLCKLATRKVSESTGTASFLQELEEWYTWLDNALV
                     LDALMRVANEESEHNQASIVFPPPGASEENGLPHTSARTQLPQSMKIMHEIMYKLEVL
                     YVLCVLLMGRQRNQVHRMIAEFKLIPGLNNLFDKLIWRKHSASALVLHGHNQNCDCSP
                     DITLKIQFLRLLQSFSDHHENKYLLLNNQELNELSAISLKANIPEVEAVLNTDRSLVC
                     DGKRGLLTRLLQVMKKEPAESSFRFWQARAVESFLRGTTSYADQMFLLKRGLLEHILY
                     CIVDSECKSRDVLQSYFDLLGELMKFNVDAFKRFNKYINTDAKFQVFLKQINSSLVDS
                     NMLVRCVTLSLDRFENQVDMKVAEVLSECRLLAYISQVPTQMSFLFRLINIIHVQTLT
                     QENVSCLNTSLVILMLARRKERLPLYLRLLQRMEHSKKYPGFLLNNFHNLLRFWQQHY
                     LHKDKDSTCLENSSCISFSYWKETVSILLNPDRQSPSALVSYIEEPYMDIDRDFTEE"
BASE COUNT      737 a    834 c    833 g    745 t
ORIGIN      
        1 gacatggcgg cggcgccggt agcggctggg tctggagccg gccgagggag acggtcggca
       61 gccacagtgg cggcttgggg cggatggggc ggccggccgc ggcctggtaa cattctgctg
      121 cagctgcggc agggccagct gaccggccgg ggcctggtcc gggcggtgca gttcactgag
      181 acttttttga cggagaggga caaacaatcc aagtggagtg gaattcctca gctgctcctc
      241 aagctgcaca ccaccagcca cctccacagt gactttgttg agtgtcaaaa catcctcaag
      301 gaaatttctc ctcttctctc catggaggct atggcatttg ttactgaaga gaggaaactt
      361 acccaagaaa ccacttatcc aaatacttat atttttgact tgtttggagg tgttgatctt
      421 cttgtagaaa ttcttatgag gcctacgatc tctatccggg gacagaaact gaaaataagt
      481 gatgaaatgt ccaaggactg cttgagtatc ctgtataata cctgtgtctg tacagaggga
      541 gttacaaagc gtttggcaga aaagaatgac tttgtgatct tcctgtttac attgatgaca
      601 agtaagaaga cattcttaca aacagcaacc ctcattgaag atattttggg tgttaaaaag
      661 gaaatgatcc gactagatga agtccccaat ctgagttcct tagtatccaa tttcgatcag
      721 cagcagctcg ctaatttctg ccggattctg gctgtcacca tttcagagat ggatacaggg
      781 aatgatgaca agcacacgct tcttgccaaa aatgctcaac agaagaagag cttgagtttg
      841 gggccttctg cagctgaaat caatcaagcg gcccttctca gcattcctgg ctttgttgag
      901 cggctttgca aactggcgac tcgaaaggtg tcagagtcaa cgggcacagc cagcttcctt
      961 caggagttgg aagagtggta cacatggcta gacaatgctt tggtgctaga tgccctgatg
     1021 cgagtggcca atgaggagtc agagcacaat caagcctcca ttgtgttccc tcctccaggg
     1081 gcttctgagg agaatggcct gcctcacacg tcagccagaa cccagctgcc ccagtcaatg
     1141 aagattatgc atgagatcat gtacaaactg gaagtgctct atgtcctctg cgtgctgctg
     1201 atggggcgtc agcgaaacca ggttcacaga atgattgcag agttcaagct gatccctgga
     1261 cttaataatt tgtttgacaa actgatttgg aggaagcatt cagcatctgc ccttgtcctc
     1321 catggtcaca accagaactg tgactgtagc ccggacatca ccttgaagat acagtttttg
     1381 aggcttcttc agagcttcag tgaccaccac gagaacaagt acttgttact caacaaccag
     1441 gagctgaatg aactcagtgc catctctctc aaggccaaca tccctgaggt ggaagctgtc
     1501 ctcaacaccg acaggagttt ggtgtgtgat gggaagaggg gcttattaac tcgtctgctg
     1561 caggtcatga agaaggagcc agcagagtcg tctttcaggt tttggcaagc tcgggctgtg
     1621 gagagtttcc tccgagggac cacctcctat gcagaccaga tgttcctgct gaagcgaggc
     1681 ctcttggagc acatccttta ctgcattgtg gacagcgagt gtaagtcaag ggatgtgctc
     1741 cagagttact ttgacctcct gggggagctg atgaagttca acgttgatgc attcaagaga
     1801 ttcaataaat atatcaacac cgatgcaaag ttccaggtat tcctgaagca gatcaacagc
     1861 tccctggtgg actccaacat gctggtgcgc tgtgtcactc tgtccctgga ccgatttgaa
     1921 aaccaggtgg atatgaaagt tgccgaggta ctgtctgaat gccgcctgct cgcctacata
     1981 tcccaggtgc ccacgcagat gtccttcctc ttccgcctca tcaacatcat ccacgtgcag
     2041 acgctgaccc aggagaacgt cagctgcctc aacaccagcc tggtgatcct gatgctggcc
     2101 cgacggaaag agcggctgcc cctgtacctg cggctgctgc agcggatgga gcacagcaag
     2161 aagtaccccg gcttcctgct caacaacttc cacaacctgc tgcgcttctg gcagcagcac
     2221 tacctgcaca aggacaagga cagcacctgc ctagagaaca gctcctgcat cagcttctca
     2281 tactggaagg agacagtgtc catcctgttg aacccggacc ggcagtcacc ctctgctctc
     2341 gttagctaca ttgaggagcc ctacatggac atagacaggg acttcactga ggagtgacct
     2401 tgggccaggc ctcgggaggc tgctgggcca gtgtgggtga gcgtgggtac gatgccacac
     2461 gccctgccct gttcccgttc ctccctgctg ctctctgcct gccccaggtc tttgggtaca
     2521 ggcttggtgg gagggaagtc ctagaagccc ttggtccccc tgggtctgag ggccctaggt
     2581 catggagagc ctcagtcccc ataatgagga cagggtacca tgcccacctt tccttcagaa
     2641 ccctggggcc cagggccacc cagaggtaag aggacattta gcattagctc tgtgtgagct
     2701 cctgccggtt tcttggctgt cagtcagtcc cagagtgggg aggaagatat gggtgacccc
     2761 caccccccat ctgtgagcca agcctccctt gtccctggcc tttggaccca ggcaaaggct
     2821 tctgagccct gggcaggggt ggtgggtacc agagaatgct gccttccccc aagcctgccc
     2881 ctctgcctca ttttcctgta gctcctctgg ttctgtttgc tcattggctg ctgtgttcat
     2941 ccaagggggt tctcccagaa gtgaggggcc tttccctcca tcccttgagg cacggggcag
     3001 ctgtgcctgc cctgcctctg cctgaggcag ccgctcctgc ctgagcctgg acatggggcc
     3061 cttccttgtg ttgccaattt attaacagca aataaaccaa ttaaatggag actattaaat
     3121 aactttattt taaaaaaaaa aaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC010273. Homo sapiens, mul...[gi:16307449] Links  


LOCUS       BC010273                3322 bp    mRNA    linear   PRI 22-OCT-2001
DEFINITION  Homo sapiens, multifunctional polypeptide similar to SAICAR
            synthetase and AIR carboxylase, clone MGC:5024 IMAGE:2900848, mRNA,
            complete cds.
ACCESSION   BC010273
VERSION     BC010273.1  GI:16307449
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3322)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: villalon@bcm.tmc.edu.
            Villalon, D.K., Luna, R.A., Hale, S.M., Hulyk, S., Lu, X., Garcia,
            A.M., Holloway, M., Telford, B, Hodgson, A., Bouck, J., Yu, W.,
            Muzny,D.M., Gibbs,R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 3 Row: k Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 5453538.
FEATURES             Location/Qualifiers
     source          1..3322
                     /organism="Homo sapiens"
                     /db_xref="LocusID:10606"
                     /db_xref="taxon:9606"
                     /clone="MGC:5024 IMAGE:2900848"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_10"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     CDS             206..1483
                     /codon_start=1
                     /product="multifunctional polypeptide similar to SAICAR
                     synthetase and AIR carboxylase"
                     /protein_id="AAH10273.1"
                     /db_xref="GI:16307450"
                     /translation="MATAEVLNIGKKLYEGKTKEVYELLDSPGKVLLQSKDQITAGNA
                     ARKNHLEGKAAISNKITSCIFQLLQEAGIKTAFTRKCGETAFIAPQCEMIPIEWVCRR
                     IATGSFLKRNPGVKEGYKFYPPKVELFFKDDANNDPQWSEEQLIAAKFCFAGLLIGQT
                     EVDIMSHATQAIFEILEKSWLPQNCTLVDMKIEFGVDVTTKEIVLADVIDNDSWRLWP
                     SGDRSQQKDKQSYRDLKEVTPEGLQMVKKNFEWVAERVELLLKSESQCRVVVLMGSTS
                     DLGHCEKIKKACGNFGIPCELRVTSAHKGPDETLRIKAEYEGDGIPTVFVAVAGRSNG
                     LGPVMSGNTAYPVISCPPLTPDWGVQDVWSSLRLPSGLGCSTVLSPEGSAQFAAQIFG
                     LSNHLVWSKLRASILNTWISLKQADKKIRECNL"
BASE COUNT      960 a    721 c    683 g    958 t
ORIGIN      
        1 gaaagctgta tttgctgcac gtggaaatct ccgttatttt ccagcaccca acagtagcgt
       61 aatgggagta acggacttaa cctcatttct ctttcagagc atttagcctt catatgccct
      121 tccctgcatg cttcccccag gccgtcaaga cttgagttct gcctcgcttc ccggcgcggt
      181 cgcagccctc agcccactta ggataatggc gacagctgag gtactgaaca ttggtaaaaa
      241 attatatgag ggtaaaacaa aagaagtcta cgaattgtta gacagtccag gaaaagtcct
      301 cctgcagtcc aaggaccaga ttacagcagg aaatgcagct agaaaaaacc acctggaagg
      361 aaaagctgca atctcaaata aaatcaccag ttgtattttt cagttattac aggaagcagg
      421 tattaaaact gccttcacca gaaaatgtgg ggagacagct ttcattgcac cgcagtgtga
      481 aatgattcca attgaatggg tttgcagaag aatagcaact ggttcttttc tcaaaagaaa
      541 tcctggtgtc aaggaaggat ataagtttta cccacctaaa gtggagttgt ttttcaagga
      601 tgatgccaat aatgacccac agtggtctga ggaacagctg attgctgcaa aattttgctt
      661 tgctggactt cttataggcc agactgaagt ggatatcatg agtcatgcta cacaggctat
      721 atttgaaata ctggagaaat cctggttgcc ccagaattgt acactggttg atatgaagat
      781 tgaatttggt gttgatgtaa ccaccaaaga aattgttctt gctgatgtta ttgacaatga
      841 ttcctggaga ctctggccat caggagatcg aagccaacag aaagacaaac agtcttatcg
      901 ggacctcaaa gaagtaactc ctgaagggct ccaaatggta aagaaaaact ttgagtgggt
      961 tgcagagaga gtagagttgc ttttgaaatc agaaagtcag tgcagggttg tagtgttgat
     1021 gggctctact tctgatcttg gtcactgtga aaaaatcaag aaggcctgtg gaaattttgg
     1081 cattccatgt gaacttcgag taacatctgc gcataaagga ccagatgaaa ctctgaggat
     1141 taaagctgag tatgaagggg atggcattcc tactgtattt gtggcagtgg caggcagaag
     1201 taatggtttg ggaccagtga tgtctgggaa cactgcatat ccagttatca gctgtcctcc
     1261 cctcacacca gactggggag ttcaggatgt gtggtcttct cttcgactac ccagtggtct
     1321 tggctgttca accgtacttt ctccagaagg atcagctcaa tttgctgctc agatatttgg
     1381 gttaagcaac catttggtat ggagcaaact gcgagcaagc attttgaaca catggatttc
     1441 cttgaagcag gctgacaaga aaatcagaga atgtaattta taagaaagaa tgccattgaa
     1501 ttttttaggg gaaaaactac aaatttctaa tttagctgaa ggaaaatcaa gcaagatgaa
     1561 aaggtaattt taaattagag aacacaaata aaatgtatta gtgaataaat gcttctctag
     1621 atccatatta ataaacatga gcatctaacc cctcctttct taggctagac accaagatat
     1681 ttcagccagc ctttatcatt cctcttactt tatccttttt ccttaagtat tggtggtcac
     1741 tactattgag tttcttcctt aacactgatt aaatgatctt aactccctca gctaaaactg
     1801 gcattactga ctcccagcta tatttctcca gacttgcatt tttttttttt tttttgagac
     1861 agggtctcac tgtcgcccag gctggagtgc agtggcgtga tctcagttca ctgctgcttt
     1921 ccctcctggg ctcaagcagt tctcccacct cagcctctcg actaacaggg actataatct
     1981 tgcagcacca tgccgagcta attttatttt ttgtagagat gagctctcac tatgtcaccc
     2041 aggttcgtct caaactcctg aaccctagta attctcctat ctcagcctcc caaagtgcta
     2101 gggttacaga catgagccac tgtgcctgtc tagacttgta ctttcaactg tccatttctc
     2161 cctgtctgtc ccatgggcac tcatgaaaaa acagaatgct cccaacttta ttcatcttcc
     2221 aagcctgtag ctcttggtat actcactgtt gcaagtcaga agcttgattt catcattgat
     2281 gtttttctca cgtttcacat ctcactcatc accaagtcat gttggtgtta atttctgatt
     2341 aacccttgaa tttaccgtct tctcatcctc tgtacaaaag cctcaagtga gggtcaaatt
     2401 caacattatc ctgatctaga cagcccccat tctcaatcca cccttttcca agttgattgc
     2461 ccaaggactt ctaacaataa actctctttt gcaccacaga cttctttgaa aatatacatg
     2521 ctgttgaccc tctctgtaga aaaccgcaca cataaaactt accaacagat ttcattggtt
     2581 cttgggttct cccgaagcct atccatggtt tatagattaa gaattgatga ggtagctggg
     2641 cacagtggct cacacctacg atcacagcac ttcgggaggc tgaagcaagc agatcacttg
     2701 aggtcaggag tttgagacca gcctggccaa catggtgaaa ccctgtctct actaaaaata
     2761 caaaaagtag ccagccgtga tgacaggcac ctgtaatccc agctactcgg gaggctgagg
     2821 catgagaatt gcttgaaccc gggaggcgga ggttgcagtg agcctggatc atgccactgc
     2881 actccaacct gggcagcaga gcaagactct gtctcaaaag gggaaaaaaa aaattgctga
     2941 tgtgacccat gaagggaact cattttcctc gtaattttgg actgccacac attggtacct
     3001 ttagttctct gaaggcccac gtttttatca ttaagaccta tttgttagct agtagagctt
     3061 tatgttcgct gtccatgaaa ccttctgtaa ccacagtgac tacaagtagt tctttctcta
     3121 ttgaattatt aggtccagaa tagaagatgt cattgtacac tttatttccc tcacactgtg
     3181 ttatgctctg atgtgctatg cttagctatc tgtcagagat tagtaaatta taaaactcat
     3241 gtgtactact taagtttata tcttatgcta gtttataaga acaattaaaa ggacttagaa
     3301 gattaaaaaa aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D86962. Human mRNA for KI...[gi:1503997] Links  


LOCUS       D86962                  5431 bp    mRNA    linear   PRI 06-OCT-2001
DEFINITION  Human mRNA for KIAA0207 gene, complete cds.
ACCESSION   D86962
VERSION     D86962.1  GI:1503997
KEYWORDS    KIAA0207.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y.,
            Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N.
  TITLE     Prediction of the coding sequences of unidentified human genes. VI.
            The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by
            analysis of cDNA clones from cell line KG-1 and brain
  JOURNAL   DNA Res. 3 (5), 321-329 (1996)
  MEDLINE   97191544
REFERENCE   2  (bases 1 to 5431)
  AUTHORS   Ohara,O., Nagase,T., Kikuno,R. and Nomura,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-AUG-1996) Osamu Ohara, Kazusa DNA Research Institute;
            1532-3, Yana, Kisarazu, Chiba 292-0812, Japan
            (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913)
FEATURES             Location/Qualifiers
     source          1..5431
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="7"
                     /sex="male"
                     /cell_line="KG-1"
                     /cell_type="myeloblast"
     gene            1..5431
                     /gene="KIAA0207"
     5'UTR           1..781
                     /gene="KIAA0207"
     CDS             782..2548
                     /gene="KIAA0207"
                     /note="similarto mouse growth factor receptor-binding
                     protein Grb10."
                     /codon_start=1
                     /protein_id="BAA13198.1"
                     /db_xref="GI:1503998"
                     /translation="MQAAGPLFRSKDKVEQTPRSQQDPAGPGLPAQSDRLANHQEDDV
                     DLEALVNDMNASLESLYSACSMQSDTVPLLQNGQHARSQPRASGPPRSIQPQVSPRQR
                     VQRSQPVHILAVRRLQEEDQQFRTSSLPAIPNPFPELCGPGSPAVLTPGSLPPSQAAA
                     KQDVKVFSEDGTSKVVEILADMTARDLCQLLVYKSHCVDDNSWTLVEHHPHLGLERCL
                     EDHELVVQVESTMASESKFLFRKNYAKYEFFKNPMNFFPEQMVTWCQQSNGSQTQLLQ
                     NFLNSSSCPEIQGFLHVKELGKKSWKKLYVCLRRSGLYCSTKGTSKEPRHLQLLADLE
                     DSNIFSLIAGRKQYNAPTDHGLCIKPNKVRNETKELRLLCAEDEQTRTCWMTAFRLLK
                     YGMLLYQNYRIPQQRKALLSPFSTPVRSVSENSLVAMDFSGQTGRVIENPAEAQSAAL
                     EEGHAWRKRSTRMNILGSQSPLHPSTLSTVIHRTQHWFHGRISREESHRIIKQQGLVD
                     GLFLLRDSQSNPKAFVLTLCHHQKIKNFQILPCEDDGQTFFSLDDGNTKFSDLIQLVD
                     FYQLNKGVLPCKLKHHCIRVAL"
     3'UTR           2549..5431
                     /gene="KIAA0207"
BASE COUNT     1306 a   1385 c   1395 g   1345 t
ORIGIN      
        1 ggcagccggg cgccccgcgg ggctctccgc gctgcgttcc cgacccctgg ggggaggtgt
       61 ggagtccaag cggtgcattc ttgaaccatc ttgtcagacg ccggcggctc gcgggctgtg
      121 gcgggggctg cggtcaaggc cgcgctcctg ggggccgccg cctgggaggg tgggcgccca
      181 ggcgtccctg cagccccggg tgctccgact gcgcggcggg gccgcggcgc gcgcgcccgg
      241 gcgtccgggc gtccgggaca gtggtgccag acactcccaa atcccgagcc ggcccagcct
      301 cgtacggagg accttttttt tggttctgtt ggtgacccgt tagccgccgc tggggcctaa
      361 caccaagttg agggctcgcg gattagccgc ccgccagccg tggaaatgtg ataagagcgg
      421 taccgtttgc agaaggaaat ttctgatgca actcttcgcc tttgctgatt gcctctccaa
      481 acgcctgcct gacgactgcc ttggagcatg tgcgttatgg aaattaggct ttggcgctga
      541 ccacaatgct gagcaggaag cagcagctgc aggcccagtg actggtagct cagtgaccag
      601 cagcccagtg accggcagcc aggtcctcac ctgggtcctc tcagtgaagc cagggtggcc
      661 gccccagcag acagtgctac agagccaact cctgacaggt tctgaaaata ttgtgcacag
      721 ggcaggctga ggacacagcc acgtgatacc cactgtagag agagggagag agagacctcc
      781 tatgcaagct gccggccctc tgttccgtag taaggacaag gtggagcaga cacctcgcag
      841 tcaacaagac ccggcaggac caggactccc cgcacagtct gaccgacttg cgaatcacca
      901 ggaggatgat gtggacctgg aagccctggt gaacgatatg aatgcatccc tggagagcct
      961 gtactcggcc tgcagcatgc agtcagacac ggtgcccctc ctgcagaatg gccagcatgc
     1021 ccgcagccag cctcgggctt caggccctcc tcggtccatc cagccacagg tgtccccgag
     1081 gcagagggtg cagcgctccc agcctgtgca catcctcgct gtcaggcgcc ttcaggagga
     1141 agaccagcag tttagaacct catctctgcc ggccatcccc aatccttttc ctgaactctg
     1201 tggccctggg agccccgctg tgctcacgcc gggttcttta cctccgagcc aggccgccgc
     1261 aaagcaggat gttaaagtct ttagtgaaga tgggacaagc aaagtggtgg agattctagc
     1321 agacatgaca gccagagacc tgtgccaatt gctggtttac aaaagtcact gtgtggatga
     1381 caacagctgg acactagtgg agcaccaccc gcacctagga ttagagaggt gcttggaaga
     1441 ccatgagctg gtggtccagg tggagagtac catggccagt gagagtaaat ttctattcag
     1501 gaagaattac gcaaaatacg agttctttaa aaatcccatg aatttcttcc cagaacagat
     1561 ggttacttgg tgccagcagt caaatggcag tcaaacccag cttttgcaga attttctgaa
     1621 ctccagtagt tgtcctgaaa ttcaagggtt tttgcatgtg aaagagctgg gaaagaaatc
     1681 atggaaaaag ctgtatgtgt gtttgcggag atctggcctt tattgctcca ccaagggaac
     1741 ttcaaaggaa cccagacacc tgcagctgct ggccgacctg gaggacagca acatcttctc
     1801 cctgatcgct ggcaggaagc agtacaacgc ccctacagac cacgggctct gcataaagcc
     1861 aaacaaagtc aggaatgaaa ctaaagagct gaggttgctc tgtgcagagg acgagcaaac
     1921 caggacgtgc tggatgacag cgttcagact cctcaagtat ggaatgctcc tttaccagaa
     1981 ttaccgaatc cctcagcaga ggaaggcctt gctgtccccg ttctcgacgc cagtgcgcag
     2041 tgtctccgag aactccctcg tggcaatgga tttttctggg caaacaggac gcgtgataga
     2101 gaatccggcg gaggcccaga gcgcagccct ggaggagggc cacgcctgga ggaagcgaag
     2161 cacacggatg aacatcctag gtagccaaag tcccctccac ccttctaccc taagtacagt
     2221 gattcacagg acacagcact ggtttcacgg gaggatctcc agggaggaat cccacaggat
     2281 cattaaacag caagggctcg tggatgggct ttttctcctc cgtgacagcc agagtaatcc
     2341 aaaggcattt gtactcacac tgtgtcatca ccagaaaatt aaaaatttcc agatcttacc
     2401 ttgcgaggac gacgggcaga cgttcttcag cctagatgac gggaacacca aattctctga
     2461 cctgatccag ctggttgact tttaccagct gaacaaagga gtcctgcctt gcaaactcaa
     2521 gcaccactgc atccgagtgg ccttatgacc gcagatgtcc tctcggctga agactggagg
     2581 aagtgaacac tggagtgaag aagcggtctg tgcgttggtg aagaacacac atcgattctg
     2641 cacctgggga cccagagcga gatgggtttg ttcggtgcca gccgaccaag attgactagt
     2701 ttgttggact taaacgacga tttgctgctg tgaacccagc agggtcgcct ccctctgcgt
     2761 cagccaaatt ggggagggca tggaagatcc agcggaaagt tgaaaataaa ctggaatgat
     2821 catcttggct tgggccgctt aggaacaaga accggagaga agtgattgga aatgaactct
     2881 tgccctggaa taatcttgac aattaaaact gatatgttta ctttttttgt attgatcact
     2941 tttttgcact ccttctttgt tttcaatatt gtattcagcc tattgtagga gggggatgtg
     3001 gcgtttcaac tcatataata cagaaagagt tttgaatggg cagatttcaa actgaatatg
     3061 ggtccccaaa tgttcccaga gggtcctcca caccctctgc cgactaccac ggtgtggatt
     3121 cagctcccaa atgacaaacc cagcccttcc cagtatactt gaaaagcttt cttgttaaaa
     3181 taaaaggtgt cactgtggta ggcatttggc atattttgtg gactcagtca agcaaccaca
     3241 gtctgttaat catttctcta tgctcagatg tcagatcctc ttgttattag tgtgtcttgt
     3301 tctgcacagt gcaggagact ttattccttt ggaaaattca ctgttccaca aacagcaggc
     3361 tgaatggcct cgcctctaga ttgacgtggg ccagcctcct tgagacacac ctggcacccg
     3421 tcatcggcca gcggtggatg ctgcataatc cacctgggta cttcagcctt gcgtttccac
     3481 agccttcagc ctgttctaga acgatcactg ccttacccct gctgctgcag tggtgtgagt
     3541 cgtttcacgg ctgatgtccc tcgggggatt aaaggatcta aagagaaaat ggcacctggt
     3601 tgtcttcgtg ctgtgtctca tgggtttcca tagtgataaa gacaaggaaa cgctgcaggg
     3661 gccacaggca caggctgata tttaaagatc tttgcttgca gccctccgtc ctgctgaaaa
     3721 cccccataag ccagtgaaca cagagcagct agaggctcct cctctgctgg cttagggtca
     3781 gaagtacctc acagtggttg tggacatgga agagttttgt caacacaaca ctttgtcccc
     3841 gctccgggag atgagtcaga tggtggcttg agttgtcact tggtcccctc cgcccctcgg
     3901 gtggccccct ttgccacgtc cccttagctt agtgatcagg tgtgagagtg gccatttcct
     3961 tacctttgat ccctgtaaag cagaaaggac tcctttgaca ggcgacaaac tactgtggtg
     4021 agcagaatga tttccttttt caagacaaca cctgcctggc ttctattaat gtgtgctggc
     4081 catgatattg ccccaaatcc gccccactga agtgttccct aaggaacagc atttctctgc
     4141 tcctcagtca acccccgtag cctagagcag tgtcacaagc ttcagtaagg ccagtcagct
     4201 ggaagtcagt ctaccgtata gtaacactgt atttcagtct acagaccaca ctctagttgt
     4261 tttccatgaa aggtatacaa atgaagaatt ttctagcaaa acatgttttt aaccatcagt
     4321 gctcaattgc attttcttcc tttcgcagcc agtcagtctt tcaaactatt gacagtaaga
     4381 taattctcac gttcacacct ggtggcaggc ttcactgtag ggacggacat tgcagttaca
     4441 ccacgattcc ttcctcttca ctggctcgag gtaaaccctt ttcaaggaaa aacaactcta
     4501 ggatttcttt tttctgtgta cgtagaccag tcccatcagt gtataatctc tctctcacac
     4561 gcctctctcc aatagacagc ttgtatttgc agtatttcat atttataaat atgcgtttat
     4621 ttaaaaggag aacaaaagct tgactctgat tcacagtttt gtatgtagct ggtttgacgt
     4681 agtcttttgt attttccctg ccgaagtgaa ttgttggaga atgtaaaccg cctccacgtg
     4741 gcggcagact tcctaaggcc ccagctcgct ggcctcgcgc tgggcggctg ggaattccac
     4801 ctgagaacaa gtcccgcaaa ccggggacgg aaggacattt gacttttatt tttgtattta
     4861 attgacatga atgtaaaggg gacagctcag ggttgttttg gagcctgttg actttgtatc
     4921 tctgcctgtg attttctttt ctaaatgaaa ctccatgtag caaccaggac gaagttgaga
     4981 aggaaaacgc caaatgcttt ggttattaga gtttaatagg taagctctgt tacactaggt
     5041 gttagagttc cagaatgttc ttttgtttgc taaaccttga agaaacatgt gcctcagcct
     5101 agatgttttg tcttctcttt tctgcactta atacctgaca gtatgaccga tctctgcgcc
     5161 tttctggggg cgggcaagct ggcggtagat ttgtgatgtc acagtgcaaa ctgcagtgac
     5221 tgtaaattgg cctggcgtgt ataaacgttt tcagggaatg cagaaggtat taatgaagag
     5281 acaaaacctt tattccatgt gctttgcttc attctgtaca tagctctttg gctcgtgaac
     5341 ctaattgtaa actttcaggt atttttgtac aaataaggga ctgatgttct gtttcttgta
     5401 attagaaata aacattaata cagtgttctt c
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: BG829769. 602764248F1 NIH_M...[gi:14177356] Links  


IDENTIFIERS

dbEST Id:       8569742
EST name:       602764248F1
GenBank Acc:    BG829769
GenBank gi:     14177356

CLONE INFO
Clone Id:       IMAGE:4899683 (5')
Plate:          LLCM1791 Row: o Column: 12
DNA type:       cDNA

PRIMERS
PolyA Tail:     Unknown

SEQUENCE
                GGCGAGAATGAAGACTATTCTCAGCAATCAGACTGTCGACATTCCAGAAAATGTCGACAT
                TACTCTGAAGGGACGCACAGTTATCGTGAAGGGCCCCAGAGGAACCCTGCGGAGGGACTT
                CAATCACATCAATGTAGAACTCAGCCTTCTTGGAAAGAAAAAAAAGAGGCTCCGGGTTGA
                CAAATGGTGGGGTAACAGAAAGGAACTGGCTACCGTTCGGACTATTTGTAGTCATGTACA
                GAACATGATCAAGGGTGTTACACTGGGCTTCCGTTACAAGATGAGGTCTGTGTATGCTCA
                CTTCCCCATCAACGTTGTTATCCAGGAGAATGGGTCTCTTGTTGAAATCCGAAATTTCTT
                GGGTGAAAAATACATCCGCAGGGTTCGGATGAGACCAGGTGTTGCTTGTTCAGTATCTCA
                AGCCCAGAAAGATGAATTAATCCTTGAAGGAAATGACATTGAGCTTGTTTCAAATTCAGC
                GGCTTTGATTCAGCAAGCCACAACAGTTAAAAACAAGGATATCAGGAAATTTTGGATGGT
                ATCTATGTCTCTGAAAAAGGAACTGTTCAGCAGGCTGATGAATAAGATCTAAGAGTTACC
                TGGCTACAGAAAGAAGATGCCAGATGACACTTAAGACCTACTTGTGATATTTAAATGATG
                CAATAAAAGACCTATTGATTTGGACCTTCTTCTTAAAAAAAAAAAAAAAAAACCCGGGGC
                TATATGGGGGGGGGGAA
Quality:        High quality sequence stops at base: 702

Entry Created:  May 21 2001
Last Updated:   May 22 2001

COMMENTS
                Tissue Procurement: ATCC
                cDNA Library Preparation: Ling Hong/Rubin Laboratory
                cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
                DNA Sequencing by: Incyte Genomics, Inc.
                Clone distribution: MGC clone distribution information can
                be found through the I.M.A.G.E. Consortium/LLNL at:
                http://image.llnl.gov

LIBRARY
Lib Name:       NIH_MGC_42
Organism:       Homo sapiens
Organ:          pancreas
Tissue type:    epithelioid carcinoma cell line
Lab host:       DH10B (phage-resistant)
Vector:         pOTB7
R. Site 1:      XhoI
R. Site 2:      EcoRI
Description:    cDNA made by oligo-dT priming. Directionally cloned into
                EcoRI/XhoI sites using the following 5' adaptor: GGCACGAG(G
                ). Size-selected >500bp for average insert size 1.8kb.
                Library constructed by Ling Hong in the laboratory of Gerald
                M. Rubin (University of California, Berkeley) using ZAP-cDNA
                synthesis kit (Stratagene) and Superscript II RT (Life
                Technologies). Note: this is a NIH_MGC Library. |

SUBMITTER
Name:           Robert Strausberg, Ph.D.
E-mail:         cgapbs-r@mail.nih.gov

CITATIONS
Title:          National Institutes of Health, Mammalian Gene Collection
                (MGC)
Authors:        NIH-MGC http://mgc.nci.nih.gov/
Year:           1999
Status:         Unpublished


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMTaxonomyTaxonomyTracesTracesUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U09953. Human ribosomal p...[gi:1323732] Links  


LOCUS       HSU09953                 712 bp    mRNA    linear   PRI 30-MAY-1996
DEFINITION  Human ribosomal protein L9 mRNA, complete cds.
ACCESSION   U09953
VERSION     U09953.1  GI:1323732
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 712)
  AUTHORS   Mazuruk,K., Schoen,T.J., Chader,G.J., Iwata,T. and Rodriguez,I.R.
  TITLE     Structural organization and chromosomal localization of the human
            ribosomal protein L9 gene
  JOURNAL   Biochim. Biophys. Acta 1305 (3), 151-162 (1996)
  MEDLINE   96180319
   PUBMED   8597601
REFERENCE   2  (bases 1 to 712)
  AUTHORS   Rodriguez,I.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-MAY-1994) Ignacio R. Rodriguez, LRCMB, National Eye
            Institute, National Institutes of Health, 9000 Rockville Pike,
            Bldg. 6 Rm. 304, Bethesda, MD 20892, USA
COMMENT     On May 21, 1996 this sequence version replaced gi:607790.
FEATURES             Location/Qualifiers
     source          1..712
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="4"
                     /map="4p13"
                     /clone="pL9KM1"
                     /tissue_type="retina"
                     /clone_lib="fetal cDNA library (Stratagene) and 5'RACE"
     CDS             30..608
                     /codon_start=1
                     /evidence=experimental
                     /product="ribosomal protein L9"
                     /protein_id="AAB01040.1"
                     /db_xref="GI:1323733"
                     /translation="MKTILSNQTVEIPENVDITLKGRTVIVKGPRGTLRRDFNHINVE
                     LSLLGKKKKRLRVDKWWGNRKELATVRTICSHVQNMIKGVTLGFRYKMRSVYAHFPIN
                     VVIQENGSLVEIRNFLGEKYIRRVRMRPGVACSVSQAQKDELILEGNDIELVSNSAAL
                     IQQATTVKNKDIRKFLDGIYVSEKGTVQQADE"
     polyA_signal    685..690
                     /evidence=experimental
BASE COUNT      219 a    136 c    168 g    189 t
ORIGIN      
        1 ttctttcttt gctgcgtcta cctgcgagaa tgaagactat tctcagcaat cagactgtcg
       61 agattccaga aaatgtcgac attactctga agggacgcac agttatcgtg aagggcccca
      121 gaggaaccct gcggagggac ttcaatcaca tcaatgtaga actcagcctt cttggaaaga
      181 aaaaaaagag gctccgggtt gacaaatggt ggggtaacag aaaggaactg gctaccgttc
      241 ggactatttg tagtcatgta cagaacatga tcaagggtgt tacactgggc ttccgttaca
      301 agatgaggtc tgtgtatgct cacttcccca tcaacgttgt tatccaggag aatgggtctc
      361 ttgttgaaat ccgaaatttc ttgggtgaaa aatacatccg cagggttcgg atgagaccag
      421 gtgttgcttg ttcagtatct caagcccaga aagatgaatt aatccttgaa ggaaatgaca
      481 ttgagcttgt ttcaaattca gcggctttga ttcagcaagc cacaacagtt aaaaacaagg
      541 atatcaggaa atttttggat ggtatctatg tctctgaaaa aggaactgtt cagcaggctg
      601 atgaataaga tctaagagtt acctggctac agaaagaaga tgccagatga cacttaagac
      661 ctacttgtga tatttaaatg atgcaataaa agacctattg atttggacct tc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC015520. Homo sapiens, rib...[gi:15930171] Links  


LOCUS       BC015520                1414 bp    mRNA    linear   PRI 29-OCT-2001
DEFINITION  Homo sapiens, ribonuclease, RNase A family, 4, clone MGC:9306
            IMAGE:3905439, mRNA, complete cds.
ACCESSION   BC015520
VERSION     BC015520.1  GI:15930171
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1414)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-OCT-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 22 Row: i Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4506556.
FEATURES             Location/Qualifiers
     source          1..1414
                     /organism="Homo sapiens"
                     /db_xref="LocusID:6038"
                     /db_xref="taxon:9606"
                     /clone="MGC:9306 IMAGE:3905439"
                     /tissue_type="Pancreas, epithelioid carcinoma"
                     /clone_lib="NIH_MGC_70"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     CDS             173..616
                     /codon_start=1
                     /product="ribonuclease, RNase A family, 4"
                     /protein_id="AAH15520.1"
                     /db_xref="GI:15930172"
                     /translation="MALQRTHSLLLLLLLTLLGLGLVQPSYGQDGMYQRFLRQHVHPE
                     ETGGSDRYCNLMMQRRKMTLYHCKRFNTFIHEDIWNIRSICSTTNIQCKNGKMNCHEG
                     VVKVTDCRDTGSSRAPNCRYRAIASTRRVVIACEGNPQVPVHFDG"
BASE COUNT      402 a    297 c    298 g    417 t
ORIGIN      
        1 cgccaacccc acctagatgc aaagcaggat tcaaaagaac atctttgcgt tttctaccgg
       61 ctccccatca tcgtactagg gaggaagaag cgggtgagaa acaaaacttc tttccattgt
      121 cctgcccgtt tctgcggact tgttctgagg ccgaggcacc tctaagatac tgatggctct
      181 gcagaggacc cattcattgc ttctgctttt gctgctgacc ctgctggggc tggggctggt
      241 ccagccctcc tatggccagg atggcatgta ccagcgattc ctgcggcaac acgtgcaccc
      301 tgaggagaca ggtggcagtg atcgctactg caacttgatg atgcaaagac ggaagatgac
      361 tttgtatcac tgcaagcgct tcaacacctt catccatgaa gatatctgga acattcgtag
      421 tatctgcagc accaccaata tccaatgcaa gaacggcaag atgaactgcc atgagggtgt
      481 agtgaaggtc acagattgca gggacacagg aagttccagg gcacccaact gcagatatcg
      541 ggccatagcg agcactagac gtgttgtcat tgcctgtgag ggtaacccac aggtgcctgt
      601 gcactttgac ggttagatgc caccatgtag ggattatcgc gagtggttga ccttacactt
      661 actccttaaa tagcagtgag taatgcattt gagctgcccc aggctctgtc tcctcagctc
      721 atttcttact ctttttctct atataactca ttctattaaa tacattgcac caaagagata
      781 tggagacata aacctgtaat gaatgaggct gggcttttct gtaataagct tccttttata
      841 atactggtca gcttagctct ctcagatcct atcctgtgga atttagttat tatgtgtatt
      901 tatgtagtat ttcaaacatt tcaaaatgct ttcatctatg tttatcacat tttaatacca
      961 cagcacttat aatgatgtca ctacatatag aagctcaaag ttaagggatt tgctgaagac
     1021 tgtaaagtta atggaagaat tgagacaaaa atccagtgta gctggccact tatccagggc
     1081 tttttctact tcatcacaag gaatgttttg aaagtgtctg ctttttttat ccttaaaatt
     1141 cacctgtcag ggaggcatta aaaatttgga aatgtatgcc agcaaaatgt gagctctgta
     1201 ttttttggca ttcttatgtt tgggtttaat aagattaaga aaatgatact gggaattttc
     1261 tttttcctga aactttgaat caccctagta agtcaaagta ctaaaaaatg tactagatca
     1321 ttaagactta tgtgctctta ctgattgaaa gattttttat gttttccttg taataaagga
     1381 cctaaaccga aggtacctga aaaaaaaaaa aaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X53961. Human mRNA for la...[gi:34415] Links  


LOCUS       HSLTFRG                 2619 bp    mRNA    linear   PRI 31-MAR-1995
DEFINITION  Human mRNA for lactoferrin.
ACCESSION   X53961
VERSION     X53961.1  GI:34415
KEYWORDS    lactoferrin; lactotransferrin; secreted protein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2619)
  AUTHORS   Rey,M.W., Woloshuk,S.L., deBoer,H.A. and Pieper,F.R.
  TITLE     Complete nucleotide sequence of human mammary gland lactoferrin
  JOURNAL   Nucleic Acids Res. 18 (17), 5288 (1990)
  MEDLINE   90384839
REFERENCE   2  (bases 1 to 2619)
  AUTHORS   Rey,M.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JUL-1990) Rey M.W., Genencor International, 180
            Kimball Way, South San Francisco, CA 94080, USA
COMMENT     sequence is both genomic and cDNA
            see  for conflicting sequence.
FEATURES             Location/Qualifiers
     source          1..2619
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="HLF-1"
                     /cell_type="epithelial"
                     /tissue_type="mammary gland"
                     /clone_lib="lambda gt11"
     promoter        227..232
                     /note="pot. TATA box"
     CDS             295..2430
                     /codon_start=1
                     /product="precursor (AA -19 to 692)"
                     /protein_id="CAA37914.1"
                     /db_xref="GI:34416"
                     /db_xref="SWISS-PROT:P02788"
                     /translation="MKLVFLVLLFLGALGLCLAGRRRRSVQWCAVSQPEATKCFQWQR
                     NMRKVRGPPVSCIKRDSPIQCIQAIAENRADAVTLDGGFIYEAGLAPYKLRPVAAEVY
                     GTERQPRTHYYAVAVVKKGGSFQLNELQGLKSCHTGLRRTAGWNVPTGTLRPFLNWTG
                     PPEPIEAAVARFFSASCVPGADKGQFPNLCRLCAGTGENKCAFSSQEPYFSYSGAFKC
                     LRDGAGDVAFIRESTVFEDLSDEAERDEYELLCPDNTRKPVDKFKDCHLARVPSHAVV
                     ARSVNGKEDAIWNLLRQAQEKFGKDKSPKFQLFGSPSGQKDLLFKDSAIGFSRVPPRI
                     DSGLYLGSGYFTAIQNLRKSEEEVAARRARVVWCAVGEQELRKCNQWSGLSEGSVTCS
                     SASTTEDCIALVLKGEADAMSLDGGYVYTACKCGLVPVLAENYKSQQSSDPDPNCVDR
                     PVEGYLAVAVVRRSDTSLTWNSVKGKKSCHTAVDRTAGWNIPMGLLFNQTGSCKFDEY
                     FSQSCAPGSDPRSNLCALCIGDEQGENKCVPNSNERYYGYTGAFRCLAENAGDVAFVK
                     DVTVLQNTDGNNNEAWAKDLKLADFALLCLDGKRKPVTEARSCHLAMAPNHAVVSRMD
                     KVERLKQVLLHQQAKFGRNGSDCPDKFCLFQSETKNLLFNDNTECLARLHGKTTYEKY
                     LGPQYVAGITNLKKCSTSPLLEACEFLRK"
     sig_peptide     295..351
                     /note="signal peptide (AA -19 to -1)"
     mat_peptide     352..2427
                     /product="lactoferrin (AA 1-692)"
BASE COUNT      626 a    661 c    767 g    565 t
ORIGIN      
        1 gactcctagg ggcttgcaga cctagtggga gagaaagaac atcgcagcag ccaggcagaa
       61 ccaggacagg tgaggtgcag gctggctttc ctctcgcagc gcggtgtgga gtcctgtcct
      121 gcctcagggc ttttcggagc ctggatcctc aaggaacaag tagacctggc cgcggggagt
      181 ggggagggaa ggggtgtcta ttgggcaaca gggcggcaaa gccctgaata aaggggcgca
      241 gggcaggcgc aagtgcagag ccttcgtttg ccaagtcgcc tccagaccgc agacatgaaa
      301 cttgtcttcc tcgtcctgct gttcctcggg gccctcggac tgtgtctggc tggccgtagg
      361 agaaggagtg ttcagtggtg cgccgtatcc caacccgagg ccacaaaatg cttccaatgg
      421 caaaggaata tgagaaaagt gcgtggccct cctgtcagct gcataaagag agactccccc
      481 atccagtgta tccaggccat tgcggaaaac agggccgatg ctgtgaccct tgatggtggt
      541 ttcatatacg aggcaggcct ggccccctac aaactgcgac ctgtagcggc ggaagtctac
      601 gggaccgaaa gacagccacg aactcactat tatgccgtgg ctgtggtgaa gaagggcggc
      661 agctttcagc tgaacgaact gcaaggtctg aagtcctgcc acacaggcct tcgcaggacc
      721 gctggatgga atgtccctac agggacactt cgtccattct tgaattggac gggtccacct
      781 gagcccattg aggcagctgt ggccaggttc ttctcagcca gctgtgttcc cggtgcagat
      841 aaaggacagt tccccaacct gtgtcgcctg tgtgcgggga caggggaaaa caaatgtgcc
      901 ttctcctccc aggaaccgta cttcagctac tctggtgcct tcaagtgtct gagagacggg
      961 gctggagacg tggcttttat cagagagagc acagtgtttg aggacctgtc agacgaggct
     1021 gaaagggacg agtatgagtt actctgccca gacaacactc ggaagccagt ggacaagttc
     1081 aaagactgcc atctggcccg ggtcccttct catgccgttg tggcacgaag tgtgaatggc
     1141 aaggaggatg ccatctggaa tcttctccgc caggcacagg aaaagtttgg aaaggacaag
     1201 tcaccgaaat tccagctctt tggctcccct agtgggcaga aagatctgct gttcaaggac
     1261 tctgccattg ggttttcgag ggtgcccccg aggatagatt ctgggctgta ccttggctcc
     1321 ggctacttca ctgccatcca gaacttgagg aaaagtgagg aggaagtggc tgcccggcgt
     1381 gcgcgggtcg tgtggtgtgc ggtgggcgag caggagctgc gcaagtgtaa ccagtggagt
     1441 ggcttgagcg aaggcagcgt gacctgctcc tcggcctcca ccacagagga ctgcatcgcc
     1501 ctggtgctga aaggagaagc tgatgccatg agtttggatg gaggatatgt gtacactgca
     1561 tgcaaatgtg gtttggtgcc tgtcctggca gagaactaca aatcccaaca aagcagtgac
     1621 cctgatccta actgtgtgga tagacctgtg gaaggatatc ttgctgtggc ggtggttagg
     1681 agatcagaca ctagccttac ctggaactct gtgaaaggca agaagtcctg ccacaccgcc
     1741 gtggacagga ctgcaggctg gaatatcccc atgggcctgc tcttcaacca gacgggctcc
     1801 tgcaaatttg atgaatattt cagtcaaagc tgtgcccctg ggtctgaccc gagatctaat
     1861 ctctgtgctc tgtgtattgg cgacgagcag ggtgagaata agtgcgtgcc caacagcaac
     1921 gagagatact acggctacac tggggctttc cggtgcctgg ctgagaatgc tggagacgtt
     1981 gcatttgtga aagatgtcac tgtcttgcag aacactgatg gaaataacaa tgaggcatgg
     2041 gctaaggatt tgaagctggc agactttgcg ctgctgtgcc tcgatggcaa acggaagcct
     2101 gtgactgagg ctagaagctg ccatcttgcc atggccccga atcatgccgt ggtgtctcgg
     2161 atggataagg tggaacgcct gaaacaggtg ctgctccacc aacaggctaa atttgggaga
     2221 aatggatctg actgcccgga caagttttgc ttattccagt ctgaaaccaa aaaccttctg
     2281 ttcaatgaca acactgagtg tctggccaga ctccatggca aaacaacata tgaaaaatat
     2341 ttgggaccac agtatgtcgc aggcattact aatctgaaaa agtgctcaac ctcccccctc
     2401 ctggaagcct gtgaattcct caggaagtaa aaccgaagaa gatggcccag ctccccaaga
     2461 aagcctcagc cattcactgc ccccagctct tctccccagg tgtgttgggg ccttggctcc
     2521 cctgctgaag gtggggattg cccatccatc tgcttacaat tccctgctgt cgtcttagca
     2581 agaagtaaaa tgagaaattt tgttgatatt caaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: J03578. Human calelectrin...[gi:179975] Links  


LOCUS       HUMCBPE                 2414 bp    mRNA    linear   PRI 27-APR-1993
DEFINITION  Human calelectrin mRNA, complete cds.
ACCESSION   J03578
VERSION     J03578.1  GI:179975
KEYWORDS    calcium-binding protein; calelectrin.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2414)
  AUTHORS   Sudhof,T.C., Slaughter,C.A., Leznicki,I., Barjon,P. and
            Reynolds,G.A.
  TITLE     Human 67-kDa calelectrin contains a duplication of four repeats
            found in 35-kDa lipocortins
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85 (3), 664-668 (1988)
  MEDLINE   88124902
   PUBMED   2963335
COMMENT     Original source text: Human retina, cDNA to mRNA (library of
            J.Nathans), clones lambda-CE[1,9].
            Draft entry and printed copy of sequence for [1] kindly provided by
            T.C.Suedhof, 03-DEC-1987.
FEATURES             Location/Qualifiers
     source          1..2414
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             57..2078
                     /note="calelectrin"
                     /codon_start=1
                     /protein_id="AAA35656.1"
                     /db_xref="GI:179976"
                     /translation="MAKPAQGAKYRGSIHDFPGFDPNQDAEALYTAMKGFGSDKEAIL
                     DIITSRSNRQRQEVCQSYKSLYGKDLIADLKYELTGKFERLIVGLMRPPAYCDAKEIK
                     DAISGIGTDEKCLIEILASRTNEQMHQLVAAYKDAYERDLEADIIGDTSGHFQKMLVV
                     LLQGTREEDDVVSEDLVQQDVQDLYEAGELKWGTDEAQFIYILGNRSKQHLRLVFDEY
                     LKTTGKPMKASIRGELSGDFEKLMLAVVKCIRSTPEYFAERLFKAMKGLGTRDNTLIR
                     IMVSRSELDMLDIREIFRTKYEKSLYSMIKNDTSGEYKKTLLKLSGGDDDAAGQFFPE
                     AAQVAYQMWELSAVARVELKGTVRPANDFNPDADAKALRKAMKGLGTDEDTIIDIITH
                     RSNVQRQQIRQTFKSHFGRDLMTDLKSEISGDLARLILGLMMPPAHYDAKQLKKAMEG
                     AGTDEKALIEILATRTNAEIRAINEAYKEDYHKSLEDALSSDTSGHFRRILISLATGH
                     REEGGENLDQAREDAQVAAEILEIADTPSGDKTSLETRFMTILCTRTYPHLRRVFQEF
                     IKMTNYDVEHTIKKEMSGDVRDAFVAIVQSVKNKPLFFADKLYKSMKGAGTDEKTLTR
                     IMVSRSEIDLLNIRREFIEKYDKSLHQAIEGDTSGDFLKALLALCGGED"
BASE COUNT      603 a    623 c    699 g    489 t
ORIGIN      Unreported.
        1 gggcgcaggg acccgaggcc tccgctgcgg ctggattctg ctgcgaaccg gagaccatgg
       61 ccaaaccagc acagggtgcc aagtaccggg gctccatcca tgacttccca ggctttgacc
      121 ccaaccagga tgccgaggct ctgtacactg ccatgaaggg ctttggcagt gacaaggagg
      181 ccatactgga cataatcacc tcacggagca acaggcagag gcaggaggtc tgccagagct
      241 acaagtccct ctacggcaag gacctcattg ctgatttaaa gtatgaattg acgggcaagt
      301 ttgaacggtt gattgtgggc ctgatgaggc cacctgccta ttgtgatgcc aaagaaatta
      361 aagatgccat ctcgggcatt ggcactgatg agaagtgcct cattgagatc ttggcttccc
      421 ggaccaatga gcagatgcac cagctggtgg cagcatacaa agatgcctac gagcgggacc
      481 tggaggctga catcatcggc gacacctctg gccacttcca gaagatgctt gtggtcctgc
      541 tccagggaac cagggaggag gatgacgtag tgagcgagga cctggtacaa caggatgtcc
      601 aggacctata cgaggcaggg gaactgaaat ggggaacaga tgaagcccag ttcatttaca
      661 tcttgggaaa tcgcagcaag cagcatcttc ggttggtgtt cgatgagtat ctgaagacca
      721 cagggaagcc gatgaaggcc agcatccgag gggagctgtc tggggacttt gagaagctaa
      781 tgctggccgt agtgaagtgt atccggagca ccccggaata ttttgctgaa aggctcttca
      841 aggctatgaa gggcctgggg actcgggaca acaccctgat ccgcatcatg gtctcccgta
      901 gtgagttgga catgctcgac attcgggaga tcttccggac caagtatgag aagtccctct
      961 acagcatgat caagaatgac acctctggcg agtacaagaa gactctgctg aagctgtctg
     1021 ggggagatga tgatgctgct ggccagttct tcccggaggc agcgcaggtg gcctatcaga
     1081 tgtgggaact tagtgcagtg gcccgagtag agctgaaggg aactgtgcgc ccagccaatg
     1141 acttcaaccc tgacgcagat gccaaagcgc tgcggaaagc catgaaggga ctcgggactg
     1201 acgaagacac aatcatcgat atcatcacgc accgcagcaa tgtccagcgg cagcagatcc
     1261 ggcagacctt caagtctcac tttggccggg acttaatgac tgacctgaag tctgagatct
     1321 ctggagacct ggcaaggctg attctggggc tcatgatgcc accggcccat tacgatgcca
     1381 agcagttgaa gaaggccatg gagggagccg gcacagatga aaaggctctt attgaaatcc
     1441 tggccactcg gaccaatgct gaaatccggg ccatcaatga ggcctataag gaggactatc
     1501 acaagtccct ggaggatgct ctgagctcag acacatctgg ccacttcagg aggatcctca
     1561 tttctctggc cacggggcat cgtgaggagg gaggagaaaa cctggaccag gcacgggaag
     1621 atgcccaggt ggctgctgag atcttggaaa tagcagacac acctagtgga gacaaaactt
     1681 ccttggagac acgtttcatg acgatcctgt gtacccggac gtatccgcac ctccggagag
     1741 tcttccagga gttcatcaag atgaccaact atgacgtgga gcacaccatc aagaaggaga
     1801 tgtctgggga tgtcagggat gcatttgtgg ccattgttca aagtgtcaag aacaagcctc
     1861 tcttctttgc cgacaaactt tacaaatcca tgaagggtgc tggcacagat gagaagactc
     1921 tgaccaggat catggtatcc cgcagtgaga ttgacctgct caacatccgg agggaattca
     1981 ttgagaaata tgacaagtct ctccaccaag ccattgaggg tgacacctcc ggagacttcc
     2041 tgaaggcctt gctggctctc tgtggtggtg aggactaggg ccacagcttt ggcgggcact
     2101 tctgccaaga aatggttatc agcaccagcc gccatggcca agcctgattg ttccagctcc
     2161 agagactaag gaaggggcag gggtgggggg aggggttggg ttgggctctt atcttcatgg
     2221 agcttaggaa acgctcccac tcccacgggc catcgagggc cagcacggct gagcggtgaa
     2281 aaaccgtagc catagatcct gtccacctcc actcccctct gaccctcagg ctttcccagc
     2341 ttcctcccct tgctacagcc tctgccctgg tttggctatg tcagatccaa taaacatcct
     2401 gaacctctgt ctgt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X77673. H.sapiens Annexin...[gi:474823] Links  


LOCUS       HSANNVI                  752 bp    DNA     linear   PRI 14-JUN-1994
DEFINITION  H.sapiens Annexin VI gene, exon 1.
ACCESSION   X77673
VERSION     X77673.1  GI:474823
KEYWORDS    annexin VI.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Smith,P.D., Davies,A., Crumpton,M.J. and Moss,S.E.
  TITLE     Structure of the human annexin VI gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 91 (7), 2713-2717 (1994)
  MEDLINE   94195813
REFERENCE   2  (bases 1 to 752)
  AUTHORS   Smith,P.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1994) P.D. Smith, Institute of Cancer Research,
            Haddow Laboratories, 15 Cotswold Road, Sutton, Surrey SM2 5NG, UK
FEATURES             Location/Qualifiers
     source          1..752
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="5"
                     /map="Q32"
     gene            230..650
                     /gene="annexin VI"
     CAAT_signal     230..234
                     /gene="annexin VI"
     TATA_signal     273..277
                     /gene="annexin VI"
     exon            506..650
                     /gene="annexin VI"
                     /product="annexin VI"
                     /number=1
BASE COUNT      143 a    217 c    230 g    162 t
ORIGIN      
        1 tctcagttat caatttcctg tctacgcgtg attacaactc gtggtctccc gtccgttccc
       61 tcgttcccag agcacgaaga agcttgcctc cgcaacccac tatttgtgtg accttggtcc
      121 aggaactgga ctgggctctc ctcctgaaag tgggtaaata atgtcaccga cctaatgagg
      181 ctgttgaggc atggaatctg atcaggcaga taaagtcccc tggcaaacct caattttctg
      241 tttctcaaca aagcttcgaa gctttggcga cttataagga tttgcgggaa agtgtccaga
      301 acccggagtc gacttctagc cctgaagggc gcgtgacctt ggccaaagcc ctaatccttc
      361 tcagcctcag atccgggggc tggaaactga ggaggttctc ccggcaaccc ctcgggctag
      421 tgacattcta agattccaag agctaaggag ctagtggggt ggggtggagg agggaggcgg
      481 gggccggatt gattggcctc tgcgcgccac gtgtccggct cggagcccac ggctgtcctc
      541 ccggtccgcg ccgctgcggt tgctgctggg ctaacgggct ccgatccagc gagcgctgcg
      601 tcctcgagtc cgcccgtgcg tccgtctgcg acccgaggcc tccgctgcgc ggtgagtggg
      661 ctgaggctgg agaggggagt cgctctggga accccttccc gggggctgca gggacacaag
      721 aaataagcgc ccagtcatcg gctgatggta cc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M34181. Human testis-spec...[gi:189982] Links  


LOCUS       HUMPRKACB               2945 bp    mRNA    linear   PRI 08-JAN-1995
DEFINITION  Human testis-specific cAMP-dependent protein kinase catalytic
            subunit (C-beta isoform) mRNA, complete cds.
ACCESSION   M34181
VERSION     M34181.1  GI:189982
KEYWORDS    cAMP dependent; cAMP-dependent protein kinase; cAMP-dependent
            protein kinase catalytic subunit; cAMP-dependent protein kinase
            catalytic subunit-beta; protein kinase; subunit.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2945)
  AUTHORS   Beebe,S.J., Oyen,O., Sandberg,M., Froysa,A., Hansson,V. and
            Jahnsen,T.
  TITLE     Molecular cloning of a tissue-specific protein kinase (C gamma)
            from human testis--representing a third isoform for the catalytic
            subunit of cAMP-dependent protein kinase
  JOURNAL   Mol. Endocrinol. 4 (3), 465-475 (1990)
  MEDLINE   90258940
   PUBMED   2342480
COMMENT     Original source text: clones T124, T175, T31, C-beta-10.
FEATURES             Location/Qualifiers
     source          1..2945
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p36.1"
                     /tissue_type="testis"
     gene            1..2945
                     /gene="PRKACB"
     CDS             48..1103
                     /gene="PRKACB"
                     /EC_number="2.7.1.37"
                     /note="C-beta isoform"
                     /codon_start=1
                     /product="cAMP-dependent protein kinase catalytic subunit"
                     /protein_id="AAA60170.1"
                     /db_xref="GI:189983"
                     /db_xref="GDB:G00-120-718"
                     /translation="MGNAATAKKGSEVESVKEFLAKAKEDFLKKWENPTQNNAGLEDF
                     ERKKTLGTGSFGRVMLVKHKATEQYYAMKILDKQKVVKLKQIEHTLNEKRILQAVNFP
                     FLVRLEYAFKDNSNLYMVMEYVPGGEMFSHLRRIGRFSEPHARFYAAQIVLTFEYLHS
                     LDLIYRDLKPENLLIDHQGYIQVTDFGFAKRVKGRTWTLCGTPEYLAPEIILSKGYNK
                     AVDWWALGVLIYEMAAGYPPFFADQPIQIYEKIVSGKVRFPSHFSSDLKDLLRNLLQV
                     DLTKRFGNLKNGVSDIKTHKWFATTDWIAIYQRKVEAPFIPKFRGSGDTSNFDDYEEE
                     DIRVSITEKCAKEFGEF"
BASE COUNT      872 a    551 c    568 g    954 t
ORIGIN      
        1 ccagcccccc ttcccttccc tgaccccttc ttgccatcgc cccagacatg gggaacgcgg
       61 cgaccgccaa gaaaggcagc gaggtggaga gcgtgaaaga gtttctagcc aaagccaaag
      121 aagacttttt gaaaaaatgg gagaatccaa ctcagaataa tgccggactt gaagattttg
      181 aaaggaaaaa aacccttgga acaggttcat ttggaagagt catgttggta aaacacaaag
      241 ccactgaaca gtattatgcc atgaagatct tagataagca gaaggttgtt aaactgaagc
      301 aaatagagca tactttgaat gagaaaagaa tattacaggc agtgaatttt cctttccttg
      361 ttcgactgga gtatgctttt aaggataatt ctaatttata catggttatg gaatatgtcc
      421 ctgggggtga aatgttttca catctaagaa gaattggaag gttcagtgag ccccatgcac
      481 ggttctatgc agctcagata gtgctaacat tcgagtacct ccattcacta gacctcatct
      541 acagagatct aaaacctgaa aatctcttaa ttgaccatca aggctatatc caggtcacag
      601 actttgggtt tgccaaaaga gttaaaggca gaacttggac attatgtgga actccagagt
      661 atttggctcc agaaataatt ctcagcaagg gctacaataa ggcagtggat tggtgggcat
      721 taggagtgct aatctatgaa atggcagctg gctatccccc attctttgca gaccaaccaa
      781 ttcagattta tgaaaagatt gtttctggaa aggtccgatt cccatcccac ttcagttcag
      841 atctcaagga ccttctacgg aacctgctgc aggtggattt gaccaagaga tttggaaatc
      901 taaagaatgg tgtcagtgat ataaaaactc acaagtggtt tgccacgaca gattggattg
      961 ctatttacca gaggaaggtt gaagctccat tcataccaaa gtttagaggc tctggagata
     1021 ccagcaactt tgatgactat gaagaagaag atatccgtgt ctctataaca gaaaaatgtg
     1081 caaaagaatt tggtgaattt taaagaggaa caagatgaca tctgagctca cactcagtgt
     1141 ttgcactctg ttgagagata aggtagagct gagaccgtcc ttgttgaagc agttacctag
     1201 ttccttcatt ccaacgactg agtgaggtct ttattgccat catccgtgtg cgcactctgc
     1261 atccacctat gtaacaaggc accgctaagc aagcattgtc tgtgccataa cacagtacta
     1321 gaccactttc ttacttctct ttgggttgtc tttctcctct cctacatcca tttcttcctt
     1381 ttcaatttca ttggttttct ctaaacagtg ctccatttta ttttgttggt gtttcagatg
     1441 ggcagtgtta tggctacgtg atatttgaag ggaaggataa gtgttgcttt cagtagttat
     1501 tgccaatatt gttgttggtc aatggcttga agataaactt tctaataatt attatttctt
     1561 tgagtagctc agacttggtt ttgccaaaac tcttggtaat ttttgaagat agactgtctt
     1621 atcaccaagg aaatttatac aaattaagac taactttctt ggaattcact attctggcaa
     1681 taaattttgg tagactaata cagtacagct agacccagaa atttggaagg ctgtagatca
     1741 gaggttctag ttccctttcc ctccttttat atcctcctct ccttgagtaa tgaagtgacc
     1801 agcctgtgta gtgtgacaaa cgtgtctcat tcagcaggaa aaactaatga tatggatcat
     1861 cacccagatt ctctcacttg gtaccagcat ttctgtaggt attagagaag agttctaagt
     1921 tttctaaacc ttaactgttc cttaaggatt ttagccagta ttttaataga acatgattaa
     1981 tgaaagtgac aaattttaaa ttttctctaa tagtcctcat cataaacttt ttaaaggaaa
     2041 ataagcaaac taaaaagaac attggtttag ataaatactt atactttgca aagtcaaaaa
     2101 tggcttgatt tttggaaaca atatagaggt attcatattt aaatgagggt ttacatttgt
     2161 tttgttttgt aaccgttaaa aagaagttgt ttccagctaa ttattgtggt gtactatatt
     2221 tgtgagccta gggtaggggc actgctgcaa cttctgcttt catcccatgc ctcatcaatg
     2281 aggaaaggga acaaagtgta taaaacctgc cacaattgta ttttaatttt gaggtatgat
     2341 attttcagat atttcataat ttctaacctc tgttctctca gtaaacagaa tgtctgatcg
     2401 atcatgcaga tacaatgttg gtatttgaga ggttagtttt tttcctacac ttttttttgc
     2461 caactgactt aacaacattg ctgtcaggtg gaaatttcaa gcacttttgc acatttagtt
     2521 cagtgtttgt tgagaatcca tggcttaacc cacttgtttt gctatttttt tctttgcttt
     2581 taattttccc catctgattt tatctctgcg tttcagtgac ctaccttaaa acaacacacg
     2641 agaagagtta aactgggttc attttaatga tcaatttacc tgcatataaa atttattttt
     2701 aatcaagctg atcttaatgt atataatcat tctatttgct ttattatcgg tgcaggtagg
     2761 tcattaacac cacttctttt catctgtacc acaccctggt gaaacctttg aagacataaa
     2821 aaaaacctgt ctgagatgtt ctttctacca atctatatgt ctttcggtta tcaagtgttt
     2881 ctgcatggta atgtcatgta aatgctgata ttgatttcac tggtccatct atatttaaaa
     2941 cgtgc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D78014. Homo sapiens mRNA...[gi:1330241] Links  


LOCUS       D78014                  5047 bp    mRNA    linear   PRI 05-FEB-1999
DEFINITION  Homo sapiens mRNA for dihydropyrimidinase related protein-3,
            complete cds.
ACCESSION   D78014
VERSION     D78014.1  GI:1330241
KEYWORDS    dihydropyrimidinase related protein-3; unc-33.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Hamajima,N., Matsuda,K., Sakata,S., Tamaki,N., Sasaki,M. and
            Nonaka,M.
  TITLE     A novel gene family defined by human dihydropyrimidinase and three
            related proteins with differential tissue distribution
  JOURNAL   Gene 180 (1-2), 157-163 (1996)
  MEDLINE   97128821
REFERENCE   2  (bases 1 to 5047)
  AUTHORS   Hamajima,N., Matsuda,K., Sakata,S., Tamaki,N., Sasaki,M. and
            Nonaka,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-OCT-1995) Naoki Hamajima, Nagoya City University
            Medical School, Department of Pediatrics; 1 Kawasumi, Mizuho-cho,
            Mizuho-ku, Nagoya, Aichi 467-8601, Japan
            (E-mail:hamajima@med.nagoya-cu.ac.jp, Tel:+81-52-853-8246,
            Fax:+81-52-842-3449)
FEATURES             Location/Qualifiers
     source          1..5047
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="brain"
                     /dev_stage="fetus"
     CDS             111..1823
                     /codon_start=1
                     /product="dihydropyrimidinase related protein-3"
                     /protein_id="BAA11192.1"
                     /db_xref="GI:1330242"
                     /translation="MSYQGKKNIPRITSDRLLIKGGRIVNDDQSFYADIYMEDGLIKQ
                     IGDNLIVPGGVKTIEANGKMVIPGGIDVHTHFQMPYKGMTTVDDFFQGTKAALAGGTT
                     MIIDHVVPEPESSLTEAYEKWREWADGKSCCDYALHVDITHWNDSVKQEVQNLIKDKG
                     VNSFMVYMAYKDLYQVSNTELYEIFTCLGELGAIAQVHAENGDIIAQEQTRMLEMGIT
                     GPEGHVLSRPEELEAEAVFRAITIASQTNCPLYVTKVMSKSAADLISQARKKGNVVFG
                     EPITASLGIDGTHYWSKNWAKAAAFVTSPPLSPDPTTPDYINSLLASGDLQLSGSAHC
                     TFSTAQKAIGKDNFTAIPEGTNGVEERMSVIWDKAVATGKMDENQFVAVTSTNAAKIF
                     NLYPRKGRISVGSDSDLVIWDPDAVKIVSAKNHQSAAEYNIFEGMELRGAPLVVICQG
                     KIMLEDGNLHVTQGAGRFIPCSPFSDYVYKRIKARRKMADLHAVPRGMYDGPVFDLTT
                     TPKGGTPAGSARGSPTRPNPPVRNLHQSGFSLSGTQVDEGVRSASKRIVAPPGGRSNI
                     TSLS"
BASE COUNT     1288 a   1215 c   1269 g   1275 t
ORIGIN      
        1 ccgttgctgt cgccgttgct gtcgggggcg ctgtgcgctg aggaaggcgc gggcgagccg
       61 gagcagaaga aggagggagg gagccagccg ctgcagccac caccgccacc atgtcctacc
      121 aaggcaagaa gaacatcccg cggatcacga gtgaccgtct ccttatcaag ggaggcagaa
      181 tcgtcaatga tgatcagtcc ttttatgctg atatttacat ggaagatggc ttaataaaac
      241 aaattggaga caatctgatt gttcctggag gagtgaagac cattgaagcc aatgggaaga
      301 tggtgatccc tggaggcatc gatgtccata ctcacttcca gatgccatat aagggaatga
      361 ccacagtaga tgacttcttc caagggacaa aggcggcctt agcaggtggc accaccatga
      421 tcattgacca tgtggtgcct gagcctgagt ccagcctgac tgaggcctat gagaaatgga
      481 gagagtgggc tgatgggaag agttgctgtg actatgccct gcatgtggac atcacccact
      541 ggaatgacag cgtcaagcag gaagtgcaga acctcatcaa ggacaaaggg gttaactcct
      601 tcatggttta tatggcttat aaggatttgt atcaagtatc taacacagag ctctatgaga
      661 tcttcacctg cctgggagag ctgggggcca ttgctcaagt tcatgctgag aatggggata
      721 tcattgccca ggagcaaacc cgcatgttgg aaatggggat aactggccca gaaggccatg
      781 tactgagcag gccagaagag ctggaagctg aggctgtgtt ccgtgccatc accattgcca
      841 gccaaaccaa ttgccctctc tacgtcacaa aggtcatgag caagagtgca gctgacctca
      901 tctcacaagc caggaaaaaa ggaaatgtag tctttggtga gcccatcact gccagcctcg
      961 gcatagatgg aacccattat tggagcaaga actgggccaa ggcggctgca tttgtgacat
     1021 ccccacccct gagccctgac ccaactactc cggactacat caactccttg ctggccagcg
     1081 gggatctgca gctatctggg agtgcccact gcaccttcag cactgcccag aaagcaattg
     1141 ggaaggacaa cttcacagcc attcctgagg gcaccaatgg tgtggaggag cggatgtctg
     1201 tcatctggga caaggctgtg gccacaggga aaatggacga aaaccagttc gtggctgtga
     1261 caagcacaaa cgctgccaag atcttcaacc tgtatccccg caagggaaga atatctgtgg
     1321 gttctgacag cgacctcgtc atctgggatc cagatgctgt gaagatcgtc tctgccaaga
     1381 accaccagtc tgcggcagag tacaacatct ttgaagggat ggagctgcgc ggggctcctc
     1441 tggttgtcat ctgccagggc aagatcatgc tggaagatgg caacctgcac gtgacccagg
     1501 gggctggccg cttcataccc tgcagcccgt tctccgacta tgtctacaag cgcattaaag
     1561 cacggaggaa gatggcagac ctgcatgccg tcccaagggg catgtacgat gggcctgtgt
     1621 ttgacctgac caccaccccc aaaggtggca cccccgcagg ctctgctcgg ggctctccta
     1681 ctcggccgaa cccacctgtg aggaatcttc atcagtcggg atttagcctg tcaggcaccc
     1741 aagtggatga gggggttcgc tcagccagca agcgcatcgt ggccccccca ggcggccgtt
     1801 ctaatatcac atctctgagt taagcaagcc ttcctcaaag agaggggcag aagcaagaag
     1861 agattgtttt gaagccaaaa tggtacaccg atatttaaga aggaaagcga atccaaacgg
     1921 ttgtgatcta aagaatcaat aagcctcaag ccttatgttt ctccaatgtt acgctcgctt
     1981 gcctagcttt acgaatattg ctttgttttc tgtttatgca tagccttgat ttgtttgact
     2041 cccctccccc catttacatg catgcaatca gacaggccac taaggtaaaa gagtctgctc
     2101 tatcatagtg ttgagagcgt gtgtagtgct gcatcttatg acaaggggac agacaagctg
     2161 ggacgtcagg gaaatgaaca aaagggacgc aggttatttg gggtgagtgg gtggtgggag
     2221 cctggagcaa ggtggagggt gcagaggggc tggggtaggg catgtaggag ggaggtgggt
     2281 gggtcaggtg agtggaaggg gtgttgtata ttgtgttgat gacgtacgtt atttccatgg
     2341 aagatagccg ctgtggcagc tgtcacatca ccacagctcc ctagggtctg ccgagaaggc
     2401 aggcagtctt tgggttctgt tctttgtcac gtcccctaca agtaaatttt gtttctttga
     2461 acgtttatta aaatgccaag acccaaccat ttcttccacc tgcttgattg tgccagtgtt
     2521 tgctcaggcc tctttcttag tgttgctttc aaatccttct ctttcctggg ttgggaaggc
     2581 caggcaggga cagagcaaat gacacttctc ttcctcttgc cctccctgcc tctttggtgc
     2641 tcttaaaagc cagcagctga gaacatagca caggcccacg tggtgagggc acccacagct
     2701 taaagacgct tccttctaaa cacggcgagg tcacctctca ctcttctgtc tttgcaaacc
     2761 gagaagagtg gcatgcttct ggcatcccaa gtcaggattt tagctcagat gaggcagaat
     2821 gaagggcctc tcttacaggc agtttgtgtt tgattctctc gatcctggca catccatgat
     2881 aaataggagt ttttgaaagt tggttttatt aggtgttccc taatttttac cgtaataggt
     2941 catctcagct tatatgaaag tcaagtgggg aactgggaaa gccaaagtca gtcttgagca
     3001 gagggagcac attttgtgga cctggttcca cctttccatt ccaaaccacc tgtttcccct
     3061 tccattagca gaaactctgg gggaactttg tgtctcagtc ctagaatctc cccaagtgag
     3121 tggaagtgac atgatgcagt cttcctcatg gggcacctga aagaaattag tgtgggtgct
     3181 tcgatctacc ttgtctgtca gagttgaata tctctttccc tatcatgctg cttctgaaaa
     3241 ttcagttttg gagcaagtcc tgtgagcaag ataagaatct atagaaccaa gatgctcatt
     3301 ttcagaagaa atatgttcaa cctgggatca gacttccatg ctctggggaa tccaagtggt
     3361 agcacctgta accctgtgta ctaagtgctt tgaagagaag agcaggcctc agacaccttt
     3421 taattgctta ggagaaacca ttgtctctga ctgcaggttt gaataagttg aagaccagag
     3481 aaaagtacac actgggctac aaaggaattt ggagatagcc aaggaacagg atttccccta
     3541 gcaagctacc ttctgttcaa atcatgaaaa aagactattt ccccttagaa tagggaagct
     3601 tgctatttta aagctcttgt agtgcttttc ttttaaggga gatgtagtaa aagggaaaat
     3661 gtagctctta gtttacactt caaagatgtg ggggtctttc agagaactaa gaataacagt
     3721 tttatgtgca gagagagttt gccagatctg aagcatatac ctcattgact aggctgttac
     3781 tttgggatag gttgcagtac cagccacagc cagcagatag aggaaaagac acacataaac
     3841 tcgcttctga gcgtccactt ctgcactctc tgctctgctg ttactcagcc cctgagtctg
     3901 actcatctct gcacaacctc tctgtgccat gaagataagt cttccatggc caaatcggtc
     3961 atccgcactg cccttgggac ttccgaagtg aaccattcca ccagaacctt tgattctgca
     4021 caagatttcc ttgctctggg aacaaccccc aaatgccctt gggaggaaca acatgagctc
     4081 aggaagcctc tctttcttca cttaccatta ctaactctcc aagcatagaa atccctggga
     4141 attgcgagaa taactcccac tattttaaaa tttatattca gatttgtttc gtttcataag
     4201 acacatcaaa caggcctata caaaaggttt aggaaaagaa aacaatggtg agtcccggcc
     4261 ctcttcgaat tcactggcac ctcatgcaag tgtaggaagg cacgctggat cgtctatctg
     4321 attccaaagc tgtcctttgc catctcatcc cttggcctgc cccccaaccc tgaggatgcc
     4381 cctgccatcc ccccaacctc ctcatattgc ctctgaaccc agatggcaat ccatcccggt
     4441 tctctctgag ggccacgggc ttgggtagtg gaaagggtgt ttgggaaatt gttaaatcag
     4501 ttacccgtag tagagctatt tcttgtactt ctaagttttc tagaagtgga aggattgtag
     4561 tcatcctgaa aatgggttta cttcaaaatc cctcagcctt gttcttcacg actgtctata
     4621 ctgagagtgt catgtttcca caaagggctg acacctgagc ctggattttc actcatccct
     4681 gagaagccct ttccagtagg gtgggcaatt cccaacttcc ttgccacaag cttcccaggc
     4741 tttctcccct ggaaaactcc agcttgagtc ccagatacac tcatgggctg ccctgggcag
     4801 ccagcattca ttgtaagttc cctctttgaa aactggtgtg tgggtgttca gttctgtgtc
     4861 tggtgggtat ggacagacag taatctcctg tgatctgtgc tagctgtgag gcagctctgg
     4921 aacgtgaaga gctgtttggt ttgaaccgtg aacaaaactg tgttttgagt ttagctgaca
     4981 ttaaagaaaa aagttcatca cgtgactgtt aatgtaaacc tggttattaa aataactatg
     5041 aaattac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U50078. Human guanine nuc...[gi:4220427] Links  


LOCUS       HSU50078               15164 bp    mRNA    linear   PRI 24-OCT-2001
DEFINITION  Human guanine nucleotide exchange factor p532 mRNA, complete cds.
ACCESSION   U50078
VERSION     U50078.1  GI:4220427
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 15164)
  AUTHORS   Rosa,J.L., Casaroli-Marano,R.P., Buckler,A.J., Vilaro,S. and
            Barbacid,M.
  TITLE     p619, a giant protein related to the chromosome condensation
            regulator RCC1, stimulates guanine nucleotide exchange on ARF1 and
            Rab proteins
  JOURNAL   EMBO J. 15 (16), 4262-4273 (1996)
  MEDLINE   97015127
   PUBMED   8861955
REFERENCE   2  (bases 1 to 15164)
  AUTHORS   Rosa,J.L. and Barbacid,M.
  TITLE     A giant protein that stimulates guanine nucleotide exchange on ARF1
            and Rab proteins forms a cytosolic ternary complex with clathrin
            and Hsp70
  JOURNAL   Oncogene 15 (1), 1-6 (1997)
  MEDLINE   97377001
   PUBMED   9233772
REFERENCE   3  (bases 1 to 15164)
  AUTHORS   Cruz,C., Paladugu,A., Ventura,F., Bartrons,R., Aldaz,M. and
            Rosa,J.L.
  TITLE     Assignment of the human P532 gene (HERC1) to chromosome 15q22 by
            fluorescence in situ hybridization
  JOURNAL   Cytogenet. Cell Genet. 86 (1), 68-69 (1999)
  MEDLINE   99447611
   PUBMED   10516438
REFERENCE   4  (bases 1 to 15164)
  AUTHORS   Rosa,J.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-FEB-1996) Jose L. Rosa, Unitat Bioquimica, Campus de
            Bellvitge, Universitat de Barcelona, Feixa Llarga s/n (Pavello
            Central), Hospitalet, Barcelona 08907, Spain
COMMENT     On Feb 4, 1999 this sequence version replaced gi:1477564.
FEATURES             Location/Qualifiers
     source          1..15164
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             97..14682
                     /codon_start=1
                     /product="p532"
                     /protein_id="AAD12586.1"
                     /db_xref="GI:1477565"
                     /translation="MATMIPPVKLKWLEHLNSSWITEDSESIATREGVAVLYSKLVSN
                     KEVVPLPQQVLCLKGPQLPDFERESLSSDEQDHYLDALLSSQLALAKMVCSDSPFAGA
                     LRKRLLVLQRVFYALSNKYHDKGKVKQQQHSPESSSGSADVHSVSERPRSSTDALIEM
                     GVRTGLSLLFALLRQSWMMPVSGPGLSLCNDVIHTAIEVVSSLPPLSLANESKIPPMG
                     LDCLSQVTTFLKGVTIPNSGADTLGRRLASELLLGLAAQRGSLRYLLEWIEMALGASA
                     VVHTMEKGKLLSSQEGMISFDCFMTILMQMRRSLGSSADRSQWREPTRTSDGLCSLYE
                     AALCLFEEVCRMASDYSRTCASPDSIQTGDAPIVSETCEVYVWGSNSSHQLVEGTQEK
                     ILQPKLAPSFSDAQTIEAGQYCTFVISTDGSVRACGKGSYGRLGLGDSNNQSTLKKLT
                     FEPHRSIKKVSSSKGSDGHTLAFTTEGEVFSWGDGDYGKLGHGNSSTQKYPKLIQGPL
                     QGKVVVCVSAGYRHSAAVTEDGELYTWGEGDFGRLGHGDSNSRNIPTLVKDISNVGEV
                     SCGSSHTIALSKDGRTVWSFGGGDNGKLGHGDTNRVYKPKVIEALQGMFIRKVCAGSQ
                     SSLALTSTGQVYAWGCGACLGCGSSEATALRPKLIEELAATRIVDVSIGDSHCLALSH
                     DNEVYAWGNNSMGQCGQGNSTGPITKPKKVSGLDGIAIQQISAGTSHSLAWTALPRDR
                     QVVAWHRPYCVDLEESTFSHLRSFLERYCDKINSEIPPLPFPSSREHHSFLKLCLKLL
                     SNHLALALAGGVATSILGRQAGPLRNLLFRLMDSTVPDEIQEVVIETLSVGATMLLPP
                     LRERMELLHSLLPQGPDRWESLSKGQRMQLDIILTSLQDHTHVASLLGYSSPSDAADL
                     SSVCTGYGNLSDQPYGTQSCHPDTHLAEILMKTLLRNLGFYTDQAFGELEKNSDKFLL
                     GTSSSENSQPAHLHELLCSLQKQLLAFCHINNISENSSSVALLHKHLQLLLPHATDIY
                     SRSANLLKESPWNGSVGEKLRDVIYVSAAGSMLCQIVNSLLLLPVSVARPLLSYLLDL
                     LPPLDCLNRLLPAADLLEDQELQWPLHGGPELIDPAGLPLPQPAQSWVWLVDLERTIA
                     LLIGRCLGGMLQGSPVSPEEQDTAYWMKTPLFSDGVEMDTPQLDKCMSCLLEVALSGN
                     EEQKPFDYKLRPEIAVYVDLALGCSKEPARSLWISMQDYAVSKDWDSATLSNESLLDT
                     VSRFVLAALLKHTNLLSQACGESRYQPGKHLSEVYRCVYKVRSRLLACKNLELIQTRS
                     SSRDRWISENQDSADVDPQEHSFTRTIDEEAEMEEQAERDREEGHPEPEDEEEEREHE
                     VMTAGKIFQCFLSAREVARSRDRDRMNSGAGSGARADDPPPQSQQERRVSTDLPEGQD
                     VYTAACNSVIHRCALLILGVSPVIDELQKRREEGQLQQPSTSASEGGGLMTRSESLTA
                     ESRLVHTSPNYRLIKSRSESDLSQPESDEEGYALSGRQNVDLDLAASHRKRGPMHSQL
                     ESLSDSWARLKHSRDWLCNSSYSFESDFDLTKSLGVHTLIENVVSFVSGDVGNAPGFK
                     EPEESMSTSPQASIIAMEQQQLRAELRLEALHQILVLLSGMEEKGSISLAGSRLSSGF
                     QSSTLLTSVRLQFLAGCFGLGTVGHTGAKGESGRLHHYQDGIRAAKRNIQIEIQVAVH
                     KIYQQLSATLERALQANKHHIEAQQRLLLVTVFALSVHYQPVDVSLAISTGLLNVLSQ
                     LCGTDTMLGQPLQLLPKTGVSQLSTALKVASTRLLQILAITTGTYADKLSPKVVQSLL
                     DLLCSQLKNLLSQTGVLHMASFGEGEQEDGEEEEKKVDSSGETEKKDFRAALRKQHAA
                     ELHLGDFLVFLRRVVSSKAIQSKMASPKWTEVLLNIASQKCSSGIPLVGNLRTRLLAL
                     HVLEAVLPACESGVEDDQMAQIVERLFSLLSDCMWETPIAQAKHAIQIKEKEQEIKLQ
                     KQGELEEEDENLPIQEVSFDPEKAQCCLVENGQILTHGSGGKGYGLASTGVTSGCYQW
                     KFYIVKENRGNEGTCVGVSRWPVHDFNHRTTSDMWLYRAYSGNLYHNGEQTLTLSSFT
                     QGDFITCVLDMEARTISFGKNGEEPKLAFEDVDAAELYPCVMFYSSNPGEKVKICDMQ
                     MRGTPRDLLPGDPICSPVAAVLAEATIQLVRILHRTDRWTYCINKKMMERLHKIKICI
                     KESGQKLKKSRSVQSREENEMREEKESKEEEKGKHTRHGLADLSELQLRTLCIEVWPV
                     LAVIGGVDAGLRVGGRCVHKQTGRHATLLGVVKEGSTSAKVQWDEAEITISFPTFWSP
                     SDTPLYNLEPCEPLPFDVARFRGLTASVLLDLTYLTGVHEDMGKQSTKRHEKKHRHES
                     EEKGDVEQKPESESALDMRTGLTSDDVKSQSTTSSKSENEIASFSLDPTLPSVESQHQ
                     ITEGKRKNHEHMSKNHDVAQSEIRAVQLSYLYLGAMKSLSALLGCSKYAELLLIPKVL
                     AENGHNSDCASSPVVHEDVEMRAALQFLMRHMVKRAVMRSPIKRALGLADLERAQAMI
                     YKLVVHGLLEDQFGGKIKQEIDQQAEESDPAQQAQTPVTTSPSASSTTSFMSSSLEDT
                     TTATTPVTDTETVPASESPGVMPLSLLRQMFSSYPTTTVLPTRRAQTPPISSLPTSPS
                     DEVGRRQSLTSPDSQSARPANRTALSDPSSRLSTSPPPPAIAVPLLEMGFSLRQIAKA
                     MEATGARGEADAQNITVLAMWMIEHPGHEDEEEPQSGSTADSRPGAAVLGSGGKSNDP
                     CYLQSPGDIPSADAAEMEEGFSESPDNLDHTENAASGSGPSARGRSAVTRRHKFDLAA
                     RTLLARAAGLYRSVQAHRNQSRREGISLQQDPGALYDFNLDEELEIDLDDEAMEAMFG
                     QDLTSDNDILGMWIPEVLDWPTWHVCESEDREEVVVCELCECSVVSFNQHMKRNHPGC
                     GRSANRQGYRSNGSYVDGWFGGECGSGNPYYLLCGTCREKYLAMKTKSKSTSSERYKG
                     QAPDLIGKQDSVYEEDWDMLDVDEDEKLTGEEEFELLAGPLGLNDRRIVPEPVQFPDS
                     DPLGASVAMVTATNSMEETLMQIGCHGSVEKSSSGRITLGEQAAALANPHDRVVALRR
                     VTAAAQVLLARTMVMRALSLLSVSGSSCSLAAGLESLGLTDIRTLVRLMCLAAAGRAG
                     LSTSPSAMASTSERSRGGHSKANKPISCLAYLSTAVGCLASNAPSAAKLLVQLCTQNL
                     ISAATGVNLTTVDDSIQRKFLPSFLRGIAEENKLVTSPNFVVTQALVALLADKGAKLR
                     PNYDKSEVEKKGPLELANALAACCLSSRLSSQHRQWAAQQLVRTLAAHDRDNQTTLQT
                     LADMGGDLRKCSFIKLEAHQNRVMTCVWCNKKGLLATSGNDGTIRVWNVTKKQYSLQQ
                     TCVFNRLEGDAEESLGSPSDPSFSPVSWSISGKYLAGALEKMVNIWQVNGGKGLVDIQ
                     PHWVSALAWPEEGPATAWSGESPELLLVGRMDGSLGLIEVVDVSTMHRRELEHCYRKD
                     VSVTCIAWFSEDRPFAVGYFDGKLLLGTKEPLEKGGIVLIDAHKDTLISMKWDPTGHI
                     LMTCAKEDSVKLWGSISGCWCCLHSLCHPSIVNGIAWCRLPGKGSKLQLLMATGCQSG
                     LVCVWRIPQDTTQTNVTSAEGWWDQESNCQDGYRKSSGAKCVYQLRGHITPVRTVAFS
                     SDGLALVSGGLGGLMNIWSLRDGSVLQTVVIGSGAIQTTVWIPEVGVAACSNRSKDVL
                     VVNCTAEWAAANHVLATCRTALKQQGVLGLNMAPCMRAFLERLPMMLQEQYAYEKPHV
                     VCGDQLVHSPYMQCLASLAVGLHLDQLLCNPPVPPHHQNCLPDPASWNPNEWAWLECF
                     STTIKAAEALTNGAQFPESFTVPDLEPVPEDELVFLMDNSKWINGMDEQIMSWATSRP
                     EDWHLGGKCDVYLWGAGRHGQLAEAGRNVMVPAAAPSFSQAQQVICGQNCTFVIQANG
                     TVLACGEGSYGRLGQGNSDDLHVLTVISALQGFVVTQLVTSCGSDGHSMALTESGEVF
                     SWGDGDYGKLGHGNSDRQRRPRQIEALQGEEVVQMSCGFKHSAVVTSDGKLFTFGNGD
                     YGRLGLGNTSNKKLPERVTALEGYQIGQVACGLNHTLAVSADGSMVWAFGDGDYGKLG
                     LGNSTAKSSPQKIDVLCGIGIKKVACGTQFSVALTKDGHVYTFGQDRLIGLPEGRARN
                     HNRPQQIPVLAGVIIEDVAVGAEHTLALASNGDVYAWGSNSEGQLGLGHTNHVREPTL
                     VTGLQGKNVRQISAGRCHSAAWTAPPVPPRAPGVSVPLQLGLPDTVPPQYGALREVSI
                     HTVRARLRLLYHFSDLMYSSWRLLNLSPNNQNSTSHYNAGTWGIVQGQLRPLLAPRVY
                     TLPMVRSIGKTMVQGKNYGPQITVKRISTRGRKCKPIFVQIARQVVKLNASDLRLPSR
                     AWKVKLVGEGADDAGGVFDDTITEMCQELETGIVDLLIPSPNATAEVGYNRDRFLFNP
                     SACLDEHLMQFKFLGILMGVAIRTKKPLDLHLAPLVWKQLCCVPLTLEDLEEVDLLYV
                     QTLNSILHIEDSGITEESFHEMIPLDSFVGQSADGKMVPIIPGGNSIPLTFSNRKEYV
                     ERAIEYRLHEMDRQVAAVREGMSWIVPVPLLSLLTAKQLEQMVCGMPEISVEVLKKVV
                     RYREVDEQHQLVQWFWHTLEEFSNEERVLFMRFVSGRSRLPANTADISQRFQIMKVDR
                     PYDSLPTSQTCFFQLRLPPYSSQLVMAERLRYAINNCRSIDMDNYMLSRNVDNAEGSD
                     TDY"
BASE COUNT     4130 a   3277 c   3847 g   3910 t
ORIGIN      
        1 gaattccgcc tctgcggagc cgggctcggg tcgccggagc cgcgccccac cccgccagct
       61 ccagagccac gactaatggc tgaaggataa atcaacatgg caactatgat tccaccagtg
      121 aagctgaaat ggcttgaaca cttgaacagc tcctggatta cagaggacag tgaatctatt
      181 gctacaagag agggagttgc tgttctgtat tctaaactgg ttagcaataa ggaagtagta
      241 cctttgcccc aacaagtttt atgcctcaaa ggaccacagt tgccagactt tgaacgtgag
      301 tctctttcaa gtgatgagca ggaccactat ttggatgccc ttcttagcag ccagctagca
      361 ttggcaaaga tggtatgttc agattcccca tttgccgggg cacttagaaa acgactgctt
      421 gtactccagc gtgtctttta tgcactttct aataaatacc atgacaaagg caaggtgaag
      481 cagcagcagc attctccgga gagcagttct ggttcagcag atgtccattc tgttagtgaa
      541 cgcccccggt caagcactga tgcacttata gaaatgggtg ttcgaactgg tctaagttta
      601 ttatttgcgc ttctaagaca aagttggatg atgcctgtgt caggacctgg tctcagtctt
      661 tgcaacgatg tcattcatac tgcaattgaa gttgtgagct ctttgccacc attatcatta
      721 gcaaatgaaa gcaagattcc tcctatgggc ttggactgct tatcgcaagt aacaacattt
      781 cttaaaggag tcactattcc taattctggg gcagacactt taggtcgtag attagcttct
      841 gagttgctgc ttggtttggc agctcaacga ggctcattgc gatatcttct tgaatggata
      901 gaaatggctt tgggggcttc ggcagttgta cacaccatgg agaaaggcaa actactctca
      961 agccaggaag gaatgatcag ctttgactgc tttatgacca tattaatgca gatgaggcgt
     1021 tctttgggtt catctgctga tcggagtcag tggagagaac caaccagaac atcggatggc
     1081 ttgtgctccc tttacgaggc agcattatgt ctctttgaag aggtttgcag aatggcttct
     1141 gattattcga gaacatgtgc tagcccagat agcattcaga ctggtgatgc tcccattgtc
     1201 tccgaaacct gtgaggttta tgtttggggg agcaatagca gccatcagtt ggtagaaggt
     1261 acacaggaga aaatactgca acccaaactg gctcctagtt tctctgatgc acagaccatt
     1321 gaagctggac agtactgcac ttttgtcatt tctacggatg gctcagttag agcttgcggg
     1381 aaaggcagct atgggagact gggccttgga gactccaata atcagtcaac tttaaaaaag
     1441 ttaacattcg agcctcacag atccattaaa aaggtttcat cttctaaagg atctgatggt
     1501 cacactttag cctttacgac agaaggagaa gtcttcagtt ggggagatgg tgattatggg
     1561 aaactggggc atggaaatag ttcaacacag aaatatccca agcttattca gggacctcta
     1621 caaggaaagg tagttgtttg tgtgtcagct ggatacagac atagtgctgc tgtcacagag
     1681 gatggggaat tatacacatg gggtgaagga gactttggaa gattaggtca tggtgacagc
     1741 aatagtcgta acattccaac attagtaaaa gacatcagca atgtaggaga ggtttcttgt
     1801 ggcagttcac atactattgc tctgtctaaa gatgggagaa ctgtatggtc ttttggagga
     1861 ggagacaatg gtaaacttgg tcatggtgat accaacagag tgtataaacc taaagttatt
     1921 gaagctttac aaggaatgtt cattcgcaaa gtttgtgctg ggagccagtc ttcacttgct
     1981 ttgacatcaa cagggcaggt ctatgcttgg ggctgtggag cttgtctagg ttgtggttct
     2041 tcagaagcta ctgctttgag acccaagctt attgaagaac tggctgccac aagaatagtt
     2101 gatgtttcta ttggagacag tcattgtttg gctctttctc atgataatga agtttatgcc
     2161 tggggcaata actcaatggg gcaatgtggt cagggaaatt ccacaggtcc tattactaaa
     2221 ccaaagaaag tgagtggctt agatggcata gctattcagc agatttcggc tggaacatca
     2281 catagtctgg catggactgc tcttcctagg gacagacaag ttgttgcatg gcaccgacct
     2341 tattgtgtag atcttgaaga gagtaccttc tcacacctgc gttcttttct tgagagatac
     2401 tgtgataaaa taaacagtga gattccccca ctccctttcc cttcatcaag agaacaccac
     2461 agttttctca agctgtgcct gaagctactt tcaaatcacc ttgctcttgc acttgcggga
     2521 ggggtagcta ccagcattct cgggaggcag gcaggtccac ttcgaaattt gctcttcaga
     2581 ctgatggact caactgtccc agatgaaatc caagaggtgg taattgaaac tttatcagtg
     2641 ggagcaacca tgctgttacc tccattacga gaacggatgg aattacttca ttctctttta
     2701 cctcaaggac ctgatagatg ggaaagctta tctaaaggac agagaatgca actggatatc
     2761 atcctgacaa gtttgcaaga tcatacccac gtagcctccc tacttggcta tagttcaccc
     2821 tctgatgctg ctgacctatc ttctgtgtgt actggctacg gaaatctgtc agatcaacct
     2881 tacggcactc agagctgcca tccagatacc cacctggctg aaattttgat gaagaccctc
     2941 ttaagaaatt taggatttta tacagatcaa gcatttggag agctagaaaa gaatagtgat
     3001 aaatttctac ttggaacatc atcatcagaa aacagtcagc ctgctcatct tcatgaactg
     3061 ctatgttcac tacagaaaca gctgctggca ttttgccata tcaataacat tagtgagaac
     3121 tcaagcagtg tggcattgct tcataaacat cttcagcttt tgttgcctca tgccacagat
     3181 atttattcac gttctgcaaa tttgctcaaa gaaagtcctt ggaatggcag tgttggagaa
     3241 aaattaagag atgtgatata cgtctcagct gctggcagta tgctctgcca gattgttaac
     3301 tccctgctgt tactccctgt gtcagtggct cggcctttat tgagttacct cctcgacttg
     3361 ttgccacctc ttgattgcct taatagactc ctgccagctg ctgatctttt agaagaccag
     3421 gagttacagt ggcctcttca tggagggcca gaactaattg atcctgctgg tctgccatta
     3481 cctcagccag ctcagtcctg ggtatggctt gtggatctag aaagaacaat tgctctcctt
     3541 attgggcggt gtcttggtgg catgcttcag ggctcccctg tgtctccaga ggaacaggac
     3601 actgcatatt ggatgaaaac gccactgttc agtgacggtg tagaaatgga cactcctcaa
     3661 ttggataaat gtatgagttg cctgttagaa gtagcacttt ctggaaatga agaacagaag
     3721 ccttttgatt ataaattgcg gcctgaaatt gctgtctatg tagacttggc attgggttgt
     3781 tctaaagagc ctgcccgaag cctttggatc agcatgcagg actatgctgt tagtaaagat
     3841 tgggacagtg caactttaag taatgagtca ctcttggaca ctgtgtctag atttgttctt
     3901 gcagctcttc tgaaacacac aaatttactt agtcaagcat gtggagaaag ccgatatcaa
     3961 cctggtaaac acttatcaga agtgtaccgt tgtgtataca aagttcgaag tcgtttactt
     4021 gcttgcaaga accttgaact tattcaaaca aggtcatcat cacgggacag atggatatca
     4081 gaaaaccagg actctgcaga tgttgatcct caggagcatt catttactcg aactattgat
     4141 gaagaagctg aaatggaaga acaggctgag agagaccggg aagaggggca tccggagcca
     4201 gaggatgaag aggaggaacg ggaacatgaa gtgatgacag ctggcaaaat ctttcagtgt
     4261 ttcctctcag cccgtgaagt agctcgtagc cgagaccgag atagaatgaa cagtggggca
     4321 gggtctgggg ctcgagctga tgatccacct cctcagtctc agcaagagcg aagggtcagc
     4381 acagaccttc ctgagggtca ggatgtgtac actgctgcat gcaactccgt gatccatcgg
     4441 tgtgccctgt taatattagg agtaagtcct gtgatagatg agcttcagaa gcgaagagaa
     4501 gaaggacagt tgcagcaacc ttcaacaagt gcctctgaag ggggtggact tatgaccagg
     4561 agtgaaagtc ttactgcaga gagccggcta gtccacacaa gcccaaatta tagactgatc
     4621 aaatcgagga gtgaatctga tttgtctcag cctgaatcag atgaagaggg ttacgcactg
     4681 agtggcagac aaaatgttga tttggatttg gcagcatctc acagaaagag aggtcctatg
     4741 cacagtcaat tggaatccct gagtgactct tgggctcgcc tgaaacatag cagagactgg
     4801 ttatgcaact cctcctattc ctttgagtca gattttgatc ttaccaagtc tttgggagtt
     4861 cacactttga ttgaaaatgt tgtaagcttt gtgagtggag atgtggggaa tgccccaggt
     4921 tttaaagagc cagaggaaag tatgtctaca agtccccagg cctccatcat tgcaatggaa
     4981 cagcagcagt taagggcaga acttcgttta gaggcacttc atcagatcct cgttctattg
     5041 tctgggatgg aagaaaaagg tagcatctca ctggcaggaa gcagattgag ttcaggcttc
     5101 cagtcctcca cactactcac gtctgtgagg ctgcagttcc tagcagggtg ttttggttta
     5161 ggcactgttg gacacacagg agccaaggga gagagtggcc gattgcatca ctatcaggat
     5221 gggatcagag cagctaagag aaatattcag attgaaatcc aggtagctgt gcataaaatt
     5281 tatcaacagt tgtctgctac cctggaaaga gccctgcaag caaacaagca tcacattgaa
     5341 gcccagcaac gtctgcttct ggttacagtt tttgccctaa gtgttcatta tcaaccagta
     5401 gatgtttctt tggcaatttc cactggtctg ctaaacgtat tgtcacagtt gtgtggtaca
     5461 gacaccatgc taggacagcc cctgcagttg ttgccaaaga cgggtgtttc ccagcttagc
     5521 acagctttga aagtggccag tacaaggttg ctccagattc tagccatcac tactgggacc
     5581 tatgctgata aactgagtcc caaagtagtt caatccttgt tggatctact ctgtagtcag
     5641 ttgaagaatt tattgtccca aactggtgta ctacatatgg cctctttcgg agaaggggag
     5701 caagaagacg gtgaagaaga agaaaaaaaa gttgactcca gtggagaaac tgagaagaaa
     5761 gatttcagag ctgctcttag gaaacaacat gcagccgaac tccatctagg ggatttttta
     5821 gtttttcttc gcagagttgt atcttcaaaa gcaattcaat caaaaatggc ttccccaaag
     5881 tggaccgaag tgcttctaaa tatagcatct cagaaatgtt cttcaggtat ccctctggtt
     5941 ggtaacttaa gaacaaggct ccttgcactt catgtccttg aagctgtgct gccagcttgt
     6001 gaatctggtg tagaagatga tcaaatggcc cagattgttg agcgcttatt ttcccttctc
     6061 tctgattgta tgtgggagac acccattgct caggccaaac atgctattca gataaaggaa
     6121 aaagaacaag aaataaaact acagaagcag ggcgagttgg aagaagaaga tgagaatctt
     6181 cctatccaag aagtatcctt tgacccggag aaagctcagt gttgcctagt ggagaatgga
     6241 cagattttaa ctcacggcag tggagggaaa ggatatggat tggcatctac aggagtaact
     6301 tctgggtgct atcagtggaa gttttatatt gtgaaggaaa acagaggtaa tgaaggcacg
     6361 tgtgttggag tttctcgctg gccagtacat gactttaatc accgcactac ctcggatatg
     6421 tggctctata gggcctacag tggtaacctc tatcacaatg gagaacagac tctcacattg
     6481 tccagcttta ctcaaggaga tttcattacc tgtgtgttag acatggaagc caggaccatt
     6541 tcttttggga aaaatggaga ggaacccaaa ttagcttttg aagatgtgga tgcagcagag
     6601 ttgtacccat gtgtgatgtt ctatagtagc aatccagggg aaaaggtgaa aatttgtgat
     6661 atgcagatgc gtggcacacc ccgagactta cttccaggag accctatttg tagtccagta
     6721 gcagcagtgc tggctgaggc cactattcag ctcgtccgta tccttcaccg aacagaccgt
     6781 tggacttact gcattaacaa aaaaatgatg gaaaggcttc acaaaattaa gatatgtatt
     6841 aaagagtcag gtcagaagct aaagaaaagc cgctcggttc agagccgaga ggaaaatgaa
     6901 atgagagagg agaaggagag caaagaggaa gagaaaggta aacatactag gcatggcctc
     6961 gctgacctct cagagctgca gctgaggact ctttgcatag aggtgtggcc cgtgctggct
     7021 gtgataggag gagttgatgc tggtcttaga gttggaggtc ggtgtgttca caagcaaact
     7081 gggcgccatg ccacgctgct gggagtggtc aaagagggca gcacgtctgc caaggtccaa
     7141 tgggatgaag cagaaattac tatcagcttc ccaacttttt ggtcgcctag tgatactcca
     7201 ttgtataatc tggaaccctg tgaaccattg ccgtttgatg tggcgcgatt ccgaggcctg
     7261 acggcttctg tgctgctgga cctaacatat ctcactggcg ttcatgaaga catgggcaaa
     7321 cagagcacca aacgacatga aaagaaacac cgacatgaat ccgaggagaa aggggatgtt
     7381 gagcagaaac ctgagagtga atccgcttta gatatgcgaa caggcctaac atctgatgac
     7441 gtcaaaagtc agagtaccac aagctccaaa tcagaaaatg aaatcgcttc attttcttta
     7501 gatccaacac tgccaagtgt ggaatcccaa catcaaataa cagaagggaa aagaaaaaat
     7561 catgaacaca tgtccaaaaa ccatgatgta gcccagtcag aaatcagagc agtccagctg
     7621 tcctatcttt acctcggtgc tatgaagtca cttagtgccc ttcttggctg tagtaaatat
     7681 gctgagctgt tgctgatacc aaaagttctg gctgaaaatg gccacaactc agactgtgca
     7741 agttctccag ttgttcatga agacgtggag atgcgagcag ccctgcagtt cttgatgcga
     7801 cacatggtga agcgagcagt catgcggtca cccataaaga gagcattggg attagctgat
     7861 ctggaacgag cgcaagccat gatctataaa ttagtggttc atgggctttt ggaagaccag
     7921 tttgggggca aaattaagca agagattgat caacaagctg aagaaagtga ccctgcccag
     7981 caggcacaga caccagttac tactagccca tcagcctcaa gcacgacctc ctttatgagc
     8041 agctctctgg aggacaccac aactgccacc actccagtca ctgacacaga aacagtgcct
     8101 gcatccgagt ccccgggagt gatgcctctt agtcttctca ggcaaatgtt ctctagttac
     8161 ccaactacca ctgtacttcc cacacgtcgg gcacagactc ctccaatatc ttcgttacca
     8221 acctctcctt ctgatgaagt aggaaggagg caaagtttaa cttctcctga ttcccagtca
     8281 gcaaggccag ctaaccgcac agccttgtca gacccaagca gtagactttc aacttctcct
     8341 cctcctccag caattgcagt tcccttgctg gaaatggggt tctctcttcg gcagattgcc
     8401 aaagccatgg aagctacagg tgctagggga gaggctgatg cccagaatat cactgtcctt
     8461 gccatgtgga tgatagagca ccctgggcat gaggatgaag aggagcccca gtcgggcagc
     8521 acagcagact ctaggcctgg agcagccgtt ctaggcagtg gcgggaagtc aaatgatccc
     8581 tgttatttgc agtcacctgg agacatacca tcagctgatg ctgctgaaat ggaggaaggt
     8641 tttagtgaaa gccctgataa tttggatcat acagagaatg cagcttctgg aagtggacca
     8701 tcagctagag gtcgctcagc ggtaacaaga agacacaagt ttgacttagc tgctcgcaca
     8761 ctgctagcaa gagcagcggg attataccgc tctgtgcagg cccacaggaa tcaaagtcgg
     8821 agagaaggaa tatctttgca gcaagaccca ggggcgttgt atgactttaa tttagatgag
     8881 gaattggaaa ttgatcttga tgatgaggcg atggaagcta tgtttggaca agacctgacc
     8941 agtgacaatg atattctggg aatgtggatc ccagaggtac tggattggcc tacctggcat
     9001 gtttgtgagt ctgaagacag ggaagaagtg gtggtgtgtg aactgtgtga atgcagcgtc
     9061 gtcagcttca atcagcacat gaagagaaac catccaggct gtgggcgcag tgcaaaccgc
     9121 cagggctatc gcagcaatgg ttcctatgtg gatggctggt ttggcggtga atgtgggagt
     9181 ggaaatccat actacctgtt atgtggcacc tgcagggaga agtacttagc catgaagacc
     9241 aaatctaagt caacaagttc tgaaaggtac aagggacaag ctccagatct aattggcaag
     9301 caagacagtg tgtatgaaga agactgggac atgttggatg ttgatgaaga tgaaaagcta
     9361 actggtgaag aagaatttga attacttgct ggaccgcttg gtttaaatga ccggcgcatt
     9421 gtaccagaac cagttcagtt ccctgacagc gatccactgg gagcatcagt agcaatggtc
     9481 acagccacca acagtatgga agagactctg atgcaaatag gttgccatgg ctccgtagaa
     9541 aagagctcct ctgggagaat aacgttagga gagcaggcag ctgccctagc aaaccctcat
     9601 gaccgtgtgg tggctttaag gagagtgact gctgctgctc aggttcttct ggccagaacc
     9661 atggtcatga gagcgctgtc tcttctctca gtcagtggtt ccagttgtag cctggctgct
     9721 ggtcttgagt ctctggggct aacagatatc cgaacgctag ttcgattaat gtgcttggca
     9781 gcagcaggga gagctggcct ctccaccagc ccttctgcca tggctagcac ctcagaacga
     9841 tcacgaggtg ggcatagcaa ggctaacaag cctatctctt gcctggccta tttgagcaca
     9901 gcagtgggat gtctggcatc aaatgctcct agtgctgcca aactgcttgt acagttgtgt
     9961 acacagaact tgatttctgc tgcaacaggt gtaaatctaa ccacagttga tgactcaatt
    10021 cagcgaaagt ttctacccag ctttctccga ggaattgctg aagagaacaa gcttgtgacc
    10081 tccccaaact ttgttgtaac acaggccctt gtggcattgc tagcagacaa aggggccaaa
    10141 ctaagaccta actatgataa gtcagaagtt gaaaagaaag gccctctgga gttggctaat
    10201 gccctggcag cctgctgcct ctcctccagg ctgtcctcac agcatcggca atgggcagct
    10261 cagcaactcg tgcgcactct tgctgcacac gaccgtgaca accaaactac tctgcagaca
    10321 cttgctgata tgggaggaga tcttagaaaa tgctccttta tcaaattgga ggctcatcag
    10381 aacagagtaa tgacatgtgt ttggtgtaat aaaaaaggtc ttttggctac aagtggcaat
    10441 gatggcacca tccgcgtatg gaatgttacc aagaagcaat attcactgca acagacctgt
    10501 gtgttcaaca gattggaagg ggatgctgag gaaagcctgg gatcacccag tgatccaagt
    10561 ttctcaccag tttcctggag tatcagtggc aaatatctag caggcgcttt ggaaaagatg
    10621 gtgaatatct ggcaagttaa tggaggaaaa ggattagtag atattcagcc tcattgggta
    10681 tctgccctgg cttggccaga agagggtccg gctacagcct ggtcaggaga gtctccagaa
    10741 ttgttgttgg tgggacggat ggatggatct ctgggactga ttgaagttgt tgatgtgtcc
    10801 accatgcacc gtcgagaatt ggagcattgc tatcgaaagg atgtgtctgt tacttgcatt
    10861 gcatggttca gtgaagacag accatttgca gtgggatatt ttgatggaaa actgttactg
    10921 ggaacaaagg aaccacttga gaaaggaggc attgttctaa ttgatgcaca taaggatact
    10981 cttattagca tgaagtggga ccctacaggt catattctta tgacatgtgc caaagaagac
    11041 agtgtgaaac tctggggctc tatttcggga tgctggtgct gtctacattc actctgccat
    11101 ccatctattg taaatggcat tgcttggtgc cgccttccag ggaaaggatc caagttgcag
    11161 ttactgatgg ctactggctg tcagagtggc ttagtatgtg tttggcgcat tcctcaagat
    11221 actacacaga ccaatgtgac tagtgcagaa ggatggtggg accaggaatc aaattgccag
    11281 gatggatata ggaaatcatc aggagccaag tgtgtttatc agctgcgggg acacatcact
    11341 cctgttcgga ctgttgcctt tagttctgat gggttggccc tggtgtctgg tggactaggt
    11401 gggctcatga acatttggtc tttaagggat ggctctgtct tgcaaactgt tgtgataggc
    11461 tctggagcta ttcagaccac agtatggatt ccagaagttg gagtagctgc ttgctcaaat
    11521 agatcaaagg atgttttggt cgtgaattgt acagcagaat gggcagctgc caatcatgtt
    11581 ttggcaacct gtaggacagc attgaaacag cagggtgttc tgggattgaa catggctccc
    11641 tgcatgagag catttttgga gcggctcccc atgatgcttc aggagcagta tgcctatgaa
    11701 aagcctcatg tggtttgtgg tgaccaactt gttcatagcc cctatatgca atgcttggct
    11761 tcccttgctg tgggacttca tctggatcag ctgttgtgta accctccagt gccaccacac
    11821 caccagaact gtctgcctga ccctgcatcc tggaatccaa atgaatgggc ctggttagaa
    11881 tgtttctcaa ccactataaa agctgccgaa gccctgacca atggagccca gtttccagaa
    11941 tcttttaccg ttccagatct agaacctgtt ccagaggatg aacttgtatt tctaatggat
    12001 aacagtaaat ggattaacgg catggatgaa caaattatgt cttgggcaac ttccagacct
    12061 gaggactggc acctgggagg taaatgtgat gtctacttat ggggtgctgg taggcatgga
    12121 cagctggcag aagctggaag aaatgtaatg gtacctgcag cagctccctc attctcacag
    12181 gcccaacagg tcatttgtgg tcagaattgt acctttgtca tccaggccaa tggcacagtg
    12241 ttggcttgtg gggaaggaag ttatggcaga ttaggacaag gaaattcaga tgaccttcat
    12301 gtgctgacag ttatttcagc cttacaaggc tttgtggtga cccagctggt gacttcctgt
    12361 ggttctgatg ggcactctat ggccctaact gaaagtggtg aggtctttag ctggggagat
    12421 ggtgactatg gtaaacttgg ccatgggaac agcgacaggc agcggcggcc caggcagatc
    12481 gaggccttac aaggagaaga agtggtgcag atgtcttgtg gcttcaagca ctcagcagtg
    12541 gtcacttcag atggcaaact gttcaccttt gggaatggtg actatggtcg tctgggtctt
    12601 ggaaatacct ctaacaaaaa acttccagag agagtgactg cactggaggg atatcagatt
    12661 ggacaggtgg cctgtggatt aaaccacact ttggcagtgt cagcagatgg ttccatggtg
    12721 tgggcttttg gagatggaga ctatggaaaa ctaggcttag gaaattccac tgcaaaatct
    12781 tcacctcaga aaattgacgt cctttgtgga attggaataa aaaaggttgc ttgtggaact
    12841 cagttttctg ttgctttgac caaagatggt catgtgtata cctttggtca agatcgcctg
    12901 ataggcttgc cagaggggcg tgctcgcaat cacaatcgac cgcaacaaat ccctgtcctg
    12961 gctggagtaa tcattgaaga tgtggcagtt ggagctgaac acacacttgc tttggcatca
    13021 aatggagatg tgtatgcctg ggggagcaat tcagaagggc agctcggctt aggccatacc
    13081 aaccatgttc gagaaccaac cctggtaaca ggtctgcaag ggaaaaatgt tcggcagatc
    13141 tcggctggcc gctgccacag tgctgcatgg acagcaccac ctgtcccacc aagagcacca
    13201 ggtgtgtcag tacctctgca gctgggcctg cctgacacag tgccccccca gtatggggcg
    13261 ctgagagaag tcagcattca cacggtgcgg gccaggctcc ggctgctcta ccacttctct
    13321 gacctcatgt actcatcctg gagactgctg aaccttagcc ccaacaacca gaacagcaca
    13381 tcccattata atgctggaac ttggggcatt gtacagggac aacttcggcc tttgttagcc
    13441 ccaagagtct acactctgcc aatggtgcgc tccataggaa aaaccatggt tcaaggcaaa
    13501 aactatggac ctcagataac tgtaaagagg atatcaacca gaggacggaa gtgtaagcct
    13561 atttttgtcc aaatagcgag acaagtagtt aagctgaatg cttcagacct ccgcctgcct
    13621 tcccgagcgt ggaaggttaa gctggttgga gaaggggctg atgatgctgg aggagtgttt
    13681 gatgacacca tcacagagat gtgccaggaa cttgaaactg gtattgttga ccttcttata
    13741 ccctctccca atgccaccgc agaagtgggt tacaataggg acaggttcct ttttaaccct
    13801 tctgcctgcc tcgatgaaca cttaatgcag tttaagtttt taggaatttt aatgggggtt
    13861 gccattcgca caaagaagcc tctggacctc cacttggccc ctctggtgtg gaagcagctg
    13921 tgctgtgtcc cactcaccct agaggacctg gaggaggtgg atctgctcta cgtgcagact
    13981 ctcaacagca ttcttcacat tgaagacagt gggattaccg aggagagttt ccatgagatg
    14041 attcctcttg attcttttgt tggccagagt gctgatggca aaatggttcc tataatccct
    14101 ggtggaaata gtatcccact cacattttcc aacaggaagg aatatgtgga gagggccatt
    14161 gaatatcgac ttcatgagat ggacagacag gtggctgcag tccgagaagg gatgtcctgg
    14221 attgttcctg tgccgctgct gtccctcctc acagcaaaac aactggagca gatggtgtgt
    14281 gggatgcccg agatctctgt ggaagtcttg aagaaagtgg tgcggtaccg tgaggtggat
    14341 gagcagcatc agctggtgca gtggttctgg cacacgctgg aagagttctc caatgaggag
    14401 cgggtgcttt tcatgaggtt tgtgtcagga agatctcgac taccagccaa cactgctgac
    14461 atttctcaga gatttcaaat catgaaggtt gataggcctt acgacagtct gcctacctca
    14521 cagacctgct tcttccagct gaggctgccc ccgtactcca gccagctggt catggccgag
    14581 cgcctgcgct atgccatcaa caactgccgc tcaatcgaca tggacaacta catgctctcg
    14641 agaaacgtgg acaacgccga gggctccgac actgactact gaccgtgcgg gtgctctcac
    14701 cctcccttct ctccctcaat aatgctcact tctgatttga tgttgatata cttttatggt
    14761 aactacatag atgttataag aacataaacc aacattataa acaatggcca catttagtta
    14821 ctctaaatgt aacaaagaaa ttagatgttt ttatttttct gtgattgtac aaaaacaaca
    14881 aaaacgaagt gctctcagtc aggtttttcc ctccatattt ttggtcactt ttgataagtt
    14941 tgcatgaaac cattttggtg catttttagt tgggaatggt acatttttgt aaatccaccc
    15001 agtgaacatg aaattgtaca ttgtgtataa ttgttcatta gaaaggacag ttttacatga
    15061 atattcatat atttattttg ttttaatttg aattgcctgt tcagggttcc ttatgcagag
    15121 aaataaagca gattcaggaa ttggaaaaaa aaaaaaaaaa aaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC003061. Homo sapiens, pro...[gi:13111749] Links  


LOCUS       BC003061                1981 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, protease, cysteine, 1 (legumain), clone MGC:1395
            IMAGE:3504506, mRNA, complete cds.
ACCESSION   BC003061
VERSION     BC003061.1  GI:13111749
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1981)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Rachel Dickhoff, Jessica Fahey, Stephanie Ford, Julia
            Greene, Mark Ketteman and Anuradha Madan
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 7 Row: o Column: 5
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 5031990.
FEATURES             Location/Qualifiers
     source          1..1981
                     /organism="Homo sapiens"
                     /db_xref="LocusID:5641"
                     /db_xref="taxon:9606"
                     /clone="MGC:1395 IMAGE:3504506"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             142..1443
                     /codon_start=1
                     /product="protease, cysteine, 1 (legumain)"
                     /protein_id="AAH03061.1"
                     /db_xref="GI:13111750"
                     /translation="MVWKVAVFLSVALGIGAIPIDDPEDGGKHWVVIVAGSNGWYNYR
                     HQADACHAYQIIHRNGIPDEQIVVMMYDDIAYSEDNPTPGIVINRPNGTDVYQGVPKD
                     YTGEDVTPQNFLAVLRGDAEAVKGIGSGKVLKSGPQDHVFIYFTDHGSTGILVFPNED
                     LHVKDLNETIHYMYKHKMYRKMVFYIEACESGSMMNHLPDNINVYATTAANPRESSYA
                     CYYDEKRSTYLGDWYSVNWMEDSDVEDLTKETLHKQYHLVKSHTNTSHVMQYGNKTIS
                     TMKVMQFQGMKRKASSPVPLPPVTHLDLTPSPDVPLTIMKRKLMNTNDLEESRQLTEE
                     IQRHLDARHLIEKSVRKIVSLLAASEAEVEQLLSERAPLTGHSCYPEALLHFRTHCFN
                     WHSPTYEYALRHLYVLVNLCEKPYPLHRIKLSMDHVCLGHY"
BASE COUNT      527 a    505 c    493 g    456 t
ORIGIN      
        1 ggcacgaggg aggctgcgag ccgccgcgag ttctcacggt cccgccggcg ccaccaccgc
       61 ggtcactcac cgccgccgcc gccaccactg ccaccacggt cgcctgccac aggtgtctgc
      121 aattgaactc caaggtgcag aatggtttgg aaagtagctg tattcctcag tgtggccctg
      181 ggcattggtg ccattcctat agatgatcct gaagatggag gcaagcactg ggtggtgatc
      241 gtggcaggtt caaatggctg gtataattat aggcaccagg cagacgcgtg ccatgcctac
      301 cagatcattc accgcaatgg gattcctgac gaacagatcg ttgtgatgat gtacgatgac
      361 attgcttact ctgaagacaa tcccactcca ggaattgtga tcaacaggcc caatggcaca
      421 gatgtctatc agggagtccc gaaggactac actggagagg atgttacccc acaaaatttc
      481 cttgctgtgt tgagaggcga tgcagaagca gtgaagggca taggatccgg caaagtcctg
      541 aagagtggcc cccaggatca cgtgttcatt tacttcactg accatggatc tactggaata
      601 ctggtttttc ccaatgaaga tcttcatgta aaggacctga atgagaccat ccattacatg
      661 tacaaacaca aaatgtaccg aaagatggtg ttctacattg aagcctgtga gtctgggtcc
      721 atgatgaacc acctgccgga taacatcaat gtttatgcaa ctactgctgc caaccccaga
      781 gagtcgtcct acgcctgtta ctatgatgag aagaggtcca cgtacctggg ggactggtac
      841 agcgtcaact ggatggaaga ctcggacgtg gaagatctga ctaaagagac cctgcacaag
      901 cagtaccacc tggtaaaatc gcacaccaac accagccacg tcatgcagta tggaaacaaa
      961 acaatctcca ccatgaaagt gatgcagttt cagggtatga aacgcaaagc cagttctccc
     1021 gtccccctac ctccagtcac acaccttgac ctcaccccca gccctgatgt gcctctcacc
     1081 atcatgaaaa ggaaactgat gaacaccaat gatctggagg agtccaggca gctcacggag
     1141 gagatccagc ggcatctgga tgccaggcac ctcattgaga agtcagtgcg taagatcgtc
     1201 tccttgctgg cagcgtccga ggctgaggtg gagcagctcc tgtccgagag agccccgctc
     1261 acggggcaca gctgctaccc agaggccctg ctgcacttcc ggacccactg cttcaactgg
     1321 cactccccca cgtacgagta tgcgttgaga catttgtacg tgctggtcaa cctttgtgag
     1381 aagccgtatc cacttcacag gataaaattg tccatggacc acgtgtgcct tggtcactac
     1441 tgaagagctg cctcctggaa gcttttccaa gtgtgagcgc cccaccgact gtgtgctgat
     1501 cagagactgg agaggtggag tgagaagtct ccgctgctcg ggccctcctg gggagccccc
     1561 gctccagggc tcgctccagg accttcttca caagatgact tgctcgctgt tacctgcttc
     1621 cccagtcttt tctgaaaaac tacaaattag ggtgggaaaa gctctgtatt gagaagggtc
     1681 atatttgctt tctaggaggt ttgttgtttt gcctgttagt tttgaggagc aggaagctca
     1741 tgggggcttc tgtagcccct ctcaaaagga gtctttattc tgagaatttg aagctgaaac
     1801 ctctttaaat cttcagaatg attttattga agagggccgc aagccccaaa tggaaaactg
     1861 tttttagaaa atatgatgat ttttgattgc ttttgtattt aattctgcag gtgttcaagt
     1921 cttaaaaaat aaagatttat aacagaaccc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1981 a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF007551. Homo sapiens Bet1...[gi:2253425] Links  


LOCUS       AF007551                 617 bp    mRNA    linear   PRI 30-OCT-2001
DEFINITION  Homo sapiens Bet1p homolog (hbet1) mRNA, complete cds.
ACCESSION   AF007551
VERSION     AF007551.1  GI:2253425
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 617)
  AUTHORS   Zhang,T., Wong,S.H., Tang,B.L., Xu,Y., Peter,F., Subramaniam,V.N.
            and Hong,W.
  TITLE     The mammalian protein (rbet1) homologous to yeast Bet1p is
            primarily associated with the pre-Golgi intermediate compartment
            and is involved in vesicular transport from the endoplasmic
            reticulum to the Golgi apparatus
  JOURNAL   J. Cell Biol. 139 (5), 1157-1168 (1997)
  MEDLINE   98044220
   PUBMED   9382863
REFERENCE   2  (bases 1 to 617)
  AUTHORS   Zhang,T., Wong,S.H., Xu,Y. and Hong,W.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JUN-1997) Membrane Biology Laboratory, Institute of
            Molecular and Cell Biology, 15 Lower Kent Ridge Road, Singapore
            119076, Singapore
FEATURES             Location/Qualifiers
     source          1..617
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..617
                     /gene="hbet1"
     CDS             120..476
                     /gene="hbet1"
                     /note="similar to yeast Bet1p; integral membrane protein"
                     /codon_start=1
                     /product="Bet1p homolog"
                     /protein_id="AAB62941.1"
                     /db_xref="GI:2253426"
                     /translation="MRRAGLGEGVPPGNYGNYGYANSGYSACEEENERLTESLRSKVT
                     AIKSLSIEIGHEVKTQNKLLAEMDSQFDSTTGFLGKTMGKLKILSRGSQTKLLCYMML
                     FSLFVFFIIYWIIKLR"
BASE COUNT      174 a    120 c    139 g    184 t
ORIGIN      
        1 ggggaagaag ttggtgtttc gctgggccct ggtactgaag acgcggtccg ggtcgcccct
       61 agctgtttcc tactcaccca aagccccgca cccgcctttt ctctctctcc tctggcagga
      121 tgaggcgtgc aggcctgggt gaaggagtac ctcctggcaa ctatgggaac tatggctatg
      181 ctaatagtgg gtatagtgcc tgtgaagaag aaaatgagag gctcactgaa agtctgagaa
      241 gcaaagtaac tgctataaaa tctctttcca ttgaaatagg ccatgaagtt aaaacccaga
      301 ataaattatt agctgaaatg gattcacaat ttgattccac aactggattt ctaggtaaaa
      361 ctatgggcaa actgaagatt ttatccagag ggagccaaac aaagctgctg tgctatatga
      421 tgctgttttc tttatttgtc ttttttatca tttattggat tattaaactg aggtgatgca
      481 tgtaattgtg aatttggaat ttgttccaac ttaatggctt gcagtgcagt accactttga
      541 taaaaatcag catcaaaaca ttcccagtgt tcaaatacgt ggcattttcc attgaaaatt
      601 gctgaatttt agactta
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH002654. Human collagenase...[gi:180615] Links  


LOCUS       HUMCLG4Q01               868 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 1.
ACCESSION   M58552 J05471
VERSION     M58552.1  GI:180602
KEYWORDS    type IV collagenase.
SEGMENT     1 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 868)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..868
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     exon            417..858
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=1
     exon            427..858
                     /gene="CLG4A"
                     /note="G00-120-592"
     sig_peptide     706..792
                     /gene="CLG4A"
                     /note="G00-120-592"
BASE COUNT      155 a    278 c    315 g    120 t
ORIGIN      
        1 caggtcaacg gatcatctgt ttctgaccat tccttcccgt tcctgacccc agggagtgca
       61 gggtgtccta gccaagccgg cgtccctcct agtagtaccg ctgctctcta acctcaggac
      121 gtcaagggcc tagagcgaca gatgtttccc agcagggggt tctgaggctg tgcgcccaga
      181 tcgcgagaga ggcaagtggg gtgacgaggt cgtgcactga gggtggacgt agaggccagg
      241 agtagcaggc ggccggggaa aagaggtgga gaaaggaaaa aagaggagaa aagtggagga
      301 gggcgagtag gggggtgggg cagagagggg cgggcccgag tgcgcccccc gcccccagcc
      361 ccgctctgcc agctccctcc cagcccagcc ggctacatct ggcggctgcc ctcccttgtt
      421 tccgctgcat ccagacttcc tcaggcggtg gctggaggct gcgcatctgg ggctttaaac
      481 atacaaaggg attgccagga cctgcggcgg cggcggcggc ggcgggggct ggggcgcggg
      541 ggccggacca tgagccgctg agccgggcaa accccaggcc accgagccag cggaccctcg
      601 gagcgcagcc ctgcgccgcg gaccaggctc caaccaggcg gcgaggcggc cacacgcacc
      661 gagccagcga cccccgggcg acgcgcgggg ccagggagcg ctacgatgga ggcgctaatg
      721 gcccggggcg cgctcacggg tcccctgagg gcgctctgtc tcctgggctg cctgctgagc
      781 cacgccgccg ccgcgccgtc gcccatcatc aagttccccg gcgatgtcgc ccccaaaacg
      841 gacaaagagt tggcagtggt gagttgct
//
LOCUS       HUMCLG4Q02               247 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 2.
ACCESSION   M55582 J05471
VERSION     M55582.1  GI:180603
KEYWORDS    type IV collagenase.
SEGMENT     2 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 247)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..247
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M58552.1:859..>868,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=1
     exon            11..237
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=2
BASE COUNT       69 a     71 c     59 g     48 t
ORIGIN      
        1 tttctttcag caatacctga acaccttcta tggctgcccc aaggagagct gcaacctgtt
       61 tgtgctgaag gacacactaa agaagatgca gaagttcttt ggactgcccc agacaggtga
      121 tcttgaccag aataccatcg agaccatgcg gaagccacgc tgcggcaacc cagatgtggc
      181 caactacaac ttcttccctc gcaagcccaa gtgggacaag aaccagatca catacaggtg
      241 ccggggc
//
LOCUS       HUMCLG4Q03               169 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 3.
ACCESSION   M55583 J05471
VERSION     M55583.1  GI:180604
KEYWORDS    type IV collagenase.
SEGMENT     3 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 169)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..169
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55582.1:238..247,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=2
     exon            11..159
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=3
BASE COUNT       38 a     46 c     46 g     39 t
ORIGIN      
        1 ccacctccag gatcattggc tacacacctg atctggaccc agagacagtg gatgatgcct
       61 ttgctcgtgc cttccaagtc tggagcgatg tgaccccact gcggttttct cgaatccatg
      121 atggagaggc agacatcatg atcaactttg gccgctgggg taggcagaa
//
LOCUS       HUMCLG4Q04               149 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 4.
ACCESSION   M55584 J05471
VERSION     M55584.1  GI:180605
KEYWORDS    type IV collagenase.
SEGMENT     4 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 149)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..149
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55583.1:160..>169,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
     exon            11..139
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=4
BASE COUNT       32 a     33 c     51 g     33 t
ORIGIN      
        1 gtgtgttcag agcatggcga tggatacccc tttgacggta aggacggact cctggctcat
       61 gccttcgccc caggcactgg tgttggggga gactcccatt ttgatgacga tgagctatgg
      121 accttgggag aaggccaagg tgagaaagg
//
LOCUS       HUMCLG4Q05               194 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 5.
ACCESSION   M55585 J05471
VERSION     M55585.1  GI:180606
KEYWORDS    type IV collagenase.
SEGMENT     5 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 194)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..194
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55584.1:140..>149,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=4
     exon            11..184
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=5
BASE COUNT       43 a     52 c     52 g     47 t
ORIGIN      
        1 cactctttag tggtccgtgt gaagtatggc aacgccgatg gggagtactg caagttcccc
       61 ttcttgttca atggcaagga gtacaacagc tgcactgata ctggccgcag cgatggcttc
      121 ctctggtgct ccaccaccta caactttgag aaggatggca agtacggctt ctgtccccat
      181 gaaggtgagc atcc
//
LOCUS       HUMCLG4Q06               194 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 6.
ACCESSION   M55586 J05471
VERSION     M55586.1  GI:180607
KEYWORDS    type IV collagenase.
SEGMENT     6 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 194)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..194
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55585.1:185..>194,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=5
     exon            11..184
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=6
BASE COUNT       40 a     65 c     55 g     34 t
ORIGIN      
        1 ccacccttag ccctgttcac catgggcggc aacgctgaag gacagccctg caagtttcca
       61 ttccgcttcc agggcacatc ctatgacagc tgcaccactg agggccgcac ggatggctac
      121 cgctggtgcg gcaccactga ggactacgac cgcgacaaga agtatggctt ctgccctgag
      181 accggtgggt gcca
//
LOCUS       HUMCLG4Q07               194 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 7.
ACCESSION   M55587 J05471
VERSION     M55587.1  GI:180608
KEYWORDS    type IV collagenase.
SEGMENT     7 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 194)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..194
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55586.1:185..>194,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=6
     exon            11..184
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=7
BASE COUNT       44 a     60 c     54 g     36 t
ORIGIN      
        1 taacccacag ccatgtccac tgttggtggg aactcagaag gtgccccctg tgtcttcccc
       61 ttcactttcc tgggcaacaa atatgagagc tgcaccagcg ccggccgcag tgacggaaag
      121 atgtggtgtg cgaccacagc caactacgat gacgaccgca agtggggctt ctgccctgac
      181 caaggtacga ggcc
//
LOCUS       HUMCLG4Q08               176 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 8.
ACCESSION   M55588 J05471
VERSION     M55588.1  GI:180609
KEYWORDS    type IV collagenase.
SEGMENT     8 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 176)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..176
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55587.1:185..>194,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=7
     exon            11..166
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=8
BASE COUNT       40 a     59 c     43 g     34 t
ORIGIN      
        1 aaaccctcag ggtacagcct gttcctcgtg gcagcccacg agtttggcca cgccatgggg
       61 ctggagcact cccaagaccc tggggccctg atggcaccca tttacaccta caccaagaac
      121 ttccgtctgt cccaggatga catcaagggc attcaggagc tctatggtaa acctcc
//
LOCUS       HUMCLG4Q09               156 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 9.
ACCESSION   M55589 J05471
VERSION     M55589.1  GI:180610
KEYWORDS    type IV collagenase.
SEGMENT     9 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 156)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..156
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55588.1:167..>176,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=8
     exon            11..146
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=9
BASE COUNT       29 a     51 c     38 g     38 t
ORIGIN      
        1 ttctccccag gggcctctcc tgacattgac cttggcaccg gccccacccc cacactgggc
       61 cctgtcactc ctgagatctg caaacaggac attgtatttg atggcatcgc tcagatccgt
      121 ggtgagatct tcttcttcaa ggaccggtga gtgcag
//
LOCUS       HUMCLG4Q10               157 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 10.
ACCESSION   M55590 J05471
VERSION     M55590.1  GI:180611
KEYWORDS    type IV collagenase.
SEGMENT     10 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 157)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..157
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55589.1:147..>156,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=9
     exon            11..147
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=10
BASE COUNT       29 a     41 c     53 g     34 t
ORIGIN      
        1 ctcctgccag gttcatttgg cggactgtga cgccacgtga caagcccatg gggcccctgc
       61 tggtggccac attctggcct gagctcccgg aaaagattga tgcggtatac gaggccccac
      121 aggaggagaa ggctgtgttc tttgcaggtg tgtggga
//
LOCUS       HUMCLG4Q11               180 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 11.
ACCESSION   M55591 J05471
VERSION     M55591.1  GI:180612
KEYWORDS    type IV collagenase.
SEGMENT     11 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 180)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..180
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55590.1:137..>157,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=10
     exon            11..170
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=11
BASE COUNT       50 a     50 c     49 g     31 t
ORIGIN      
        1 tccaccccag ggaatgaata ctggatctac tcagccagca ccctggagcg agggtacccc
       61 aagccactga ccagcctggg actgccccct gatgtccagc gagtggatgc cgcctttaac
      121 tggagcaaaa acaagaagac atacatcttt gctggagaca aattctggag gtaagggagg
//
LOCUS       HUMCLG4Q12               130 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 12.
ACCESSION   M55592 J05471
VERSION     M55592.1  GI:180613
KEYWORDS    type IV collagenase.
SEGMENT     12 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 130)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..130
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     intron          order(M55591.1:171..>180,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
     exon            11..120
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=12
BASE COUNT       32 a     35 c     36 g     27 t
ORIGIN      
        1 ttctttacag atacaatgag gtgaagaaga aaatggatcc tggctttccc aagctcatcg
       61 cagatgcctg gaatgccatc cccgataacc tggatgccgt cgtggacctg cagggcggcg
      121 gtgagcaccc
//
LOCUS       HUMCLG4Q13               911 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagenase type IV (CLG4) gene, exon 13.
ACCESSION   M55593 J05471
VERSION     M55593.1  GI:180614
KEYWORDS    type IV collagenase.
SEGMENT     13 of 13
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Huhtala,P., Eddy,R.L., Fan,Y.S., Byers,M.G., Shows,T.B. and
            Tryggvason,K.
  TITLE     Completion of the primary structure of the human type IV
            collagenase preproenzyme and assignment of the gene (CLG4) to the
            q21 region of chromosome 16
  JOURNAL   Genomics 6 (3), 554-559 (1990)
  MEDLINE   90228972
   PUBMED   2158484
REFERENCE   2  (bases 1 to 911)
  AUTHORS   Huhtala,P., Chow,L.T. and Tryggvason,K.
  TITLE     Structure of the human type IV collagenase gene
  JOURNAL   J. Biol. Chem. 265 (19), 11077-11082 (1990)
  MEDLINE   90293047
   PUBMED   2162831
COMMENT     Original source text: Human DNA.
FEATURES             Location/Qualifiers
     source          1..911
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="16q13-q21"
     gene            join(M58552.1:417..868,M55582.1:1..247,M55583.1:1..169,
                     M55584.1:1..149,M55585.1:1..194,M55586.1:1..194,
                     M55587.1:1..194,M55588.1:1..176,M55589.1:1..156,
                     M55590.1:1..157,M55591.1:1..180,M55592.1:1..130,1..911)
                     /gene="CLG4A"
     mRNA            join(M58552.1:417..858,M55582.1:11..237,M55583.1:11..159,
                     M55584.1:11..139,M55585.1:11..184,M55586.1:11..184,
                     M55587.1:11..184,M55588.1:11..166,M55589.1:11..146,
                     M55590.1:11..147,M55591.1:11..170,M55592.1:11..120,
                     11..911)
                     /gene="CLG4A"
                     /note="G00-120-592"
     mRNA            join(M58552.1:427..858,M55582.1:11..237,M55583.1:11..159,
                     M55584.1:11..139,M55585.1:11..184,M55586.1:11..184,
                     M55587.1:11..184,M55588.1:11..166,M55589.1:11..146,
                     M55590.1:11..147,M55591.1:11..170,M55592.1:11..120,
                     11..911)
                     /gene="CLG4A"
                     /note="alternate mRNA; G00-120-592"
     CDS             join(M58552.1:706..858,M55582.1:11..237,M55583.1:11..159,
                     M55584.1:11..139,M55585.1:11..184,M55586.1:11..184,
                     M55587.1:11..184,M55588.1:11..166,M55589.1:11..146,
                     M55590.1:11..147,M55591.1:11..170,M55592.1:11..120,
                     11..114)
                     /gene="CLG4A"
                     /codon_start=1
                     /product="type IV collagenase"
                     /protein_id="AAA52028.1"
                     /db_xref="GI:180616"
                     /db_xref="GDB:G00-120-592"
                     /translation="MEALMARGALTGPLRALCLLGCLLSHAAAAPSPIIKFPGDVAPK
                     TDKELAVQYLNTFYGCPKESCNLFVLKDTLKKMQKFFGLPQTGDLDQNTIETMRKPRC
                     GNPDVANYNFFPRKPKWDKNQITYRIIGYTPDLDPETVDDAFARAFQVWSDVTPLRFS
                     RIHDGEADIMINFGRWEHGDGYPFDGKDGLLAHAFAPGTGVGGDSHFDDDELWTLGEG
                     QVVRVKYGNADGEYCKFPFLFNGKEYNSCTDTGRSDGFLWCSTTYNFEKDGKYGFCPH
                     EALFTMGGNAEGQPCKFPFRFQGTSYDSCTTEGRTDGYRWCGTTEDYDRDKKYGFCPE
                     TAMSTVGGNSEGAPCVFPFTFLGNKYESCTSAGRSDGKMWCATTANYDDDRKWGFCPD
                     QGYSLFLVAAHEFGHAMGLEHSQDPGALMAPIYTYTKNFRLSQDDIKGIQELYGASPD
                     IDLGTGPTPTLGPVTPEICKQDIVFDGIAQIRGEIFFFKDRFIWRTVTPRDKPMGPLL
                     VATFWPELPEKIDAVYEAPQEEKAVFFAGNEYWIYSASTLERGYPKPLTSLGLPPDVQ
                     RVDAAFNWSKNKKTYIFAGDKFWRYNEVKKKMDPGFPKLIADAWNAIPDNLDAVVDLQ
                     GGGHSYFFKGAYYLKLENQSLKSVKFGSIKSDWLGC"
     mat_peptide     join(M58552.1:793..858,M55582.1:11..237,M55583.1:11..159,
                     M55584.1:11..139,M55585.1:11..12,M55585.1:184,
                     M55586.1:11..184,M55587.1:11..184,M55588.1:11..166,
                     M55589.1:11..146,M55590.1:11..147,M55591.1:11..170,
                     M55592.1:11,M55592.1:120,11..111)
                     /gene="CLG4A"
                     /product="type IV collagenase"
                     /note="G00-120-592"
     intron          order(M55592.1:121..>130,<1..10)
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=12
     exon            11..911
                     /gene="CLG4A"
                     /note="G00-120-592"
                     /number=13
BASE COUNT      192 a    283 c    204 g    232 t
ORIGIN      
        1 tctatcccag gtcacagcta cttcttcaag ggtgcctatt acctgaagct ggagaaccaa
       61 agtctgaaga gcgtgaagtt tggaagcatc aaatccgact ggctaggctg ctgagctggc
      121 cctggctccc acaggccctt cctctccact gccttcgata caccgggcct ggagaactag
      181 agaaggaccc ggaggggcct ggcagccgtg ccttcagctc tacagctaat cagcattctc
      241 actcctacct ggtaatttaa gattccagag agtggctcct cccggtgccc aagaatagat
      301 gctgactgta ctcctcccag gcgccccttc cccctccaat cccaccaacc ctcagagcca
      361 cccctaaaga gatcctttga tattttcaac gcagccctgc tttgggctgc cctggtgctg
      421 ccacacttca ggctcttctc ctttcacaac cttctgtggc tcacagaacc cttggagcca
      481 atggagactg tctcaagagg gcactggtgg cccgacagcc tggcacaggg cagtgggaca
      541 gggcatggcc aggtggccac tccagacccc tggcttttca ctgctggctg ccttagaacc
      601 tttcttacat tagcagtttg ctttgtatgc actttgtttt tttctttggg tcttgttttt
      661 tttttccact tagaaattgc atttcctgac agaaggactc aggttgtctg aagtcactgc
      721 acagtgcatc tcagcccaca tagtgatggt tcccctgttc actctactta gcatgtccct
      781 accgagtctc ttctccactg gatggaggaa aaccaagccg tggcttcccg ctcagccctc
      841 cctgcccctc ccttcaacca ttccccatgg gaaatgtcaa caagtatgaa taaagacacc
      901 tactgagtgg c
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Z19585. H.sapiens mRNA fo...[gi:311625] Links  


LOCUS       HSTHROMB4               3074 bp    mRNA    linear   PRI 05-MAY-1995
DEFINITION  H.sapiens mRNA for thrombospondin-4.
ACCESSION   Z19585
VERSION     Z19585.1  GI:311625
KEYWORDS    thrombospondin-4.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1588 to 3074)
  AUTHORS   Lawler,J., McHenry,K., Duquette,M. and Derick,L.
  TITLE     Characterization of human thrombospondin-4
  JOURNAL   J. Biol. Chem. 270 (6), 2809-2814 (1995)
  MEDLINE   95155352
REFERENCE   2  (bases 1 to 3074)
  AUTHORS   Lawler,J.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-JAN-1993) Lawler J. W., Brigham and Women's Hospital,
            Pathology, 221 Longwood Ave, Boston, Massachusetts, U.S.A., 02115
FEATURES             Location/Qualifiers
     source          1..3074
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="heart"
                     /clone_lib="human heart"
                     /dev_stage="adult"
     CDS             28..2913
                     /function="unknown"
                     /standard_name="thrombospondin-4"
                     /codon_start=1
                     /product="thrombospondin-4"
                     /protein_id="CAA79635.1"
                     /db_xref="GI:311626"
                     /db_xref="SWISS-PROT:P35443"
                     /translation="MLAPRGAAVLLLHLVLQRWLAAGAQATPQVFDLLPSSSQRLNPG
                     ALLPVLTDPALNDLYVISTFKLQTKSSATIFGLYSSTDNSKYFEFTVMGRLSKAILRY
                     LKNDGKVHLVVFNNLQLADGRRHRILLRLSNLQRGAGSLELYLDCIQVDSVHNLPRAF
                     AGPSQKPETIELRTFQRKPQDFLEELKLVVRGSLFQVASLQDCFLQQSEPLAATGTGD
                     FNRQFLGQMTQLNQLLGEVKDLLRQQVKETSFLRNTIAECQACGPLKFQSPTPSTVVA
                     PAPPAPPTRPPRRCDSNPCFRGVQCTDSRDGFQCGPCPEGYTGNGITCIDVDECKYHP
                     CYPGVHCINLSPGFRCDACPVGFTGPMVQGVGISFAKSNKQVCTDIDECRNGACVPNS
                     ICVNTLGSYRCGPCKPGYTGDQIRGCKVERNCRNPELNPCSVNAQCIEERQGDVTCVC
                     GVGWAGDGYICGKDVDIDSYPDEELPCSARNCKKDNCKYVPNSGQEDADRDGIGDACD
                     EDADGDGILNEQDNCVLIHNVDQRNSDKDIFGDACDNCLSVLNNDQKDTDGDGRGDAC
                     DDDMDGDGIKNILDNCPKFPNRDQRDKDGDGVGDACDSCPDVSNPNQSDVDNDLVGDS
                     CDTNQDSDGDGHQDSTDNCPTVINSAQLDTDKDGIGDECDDDDDNDGIPDLVPPGPDN
                     CRLVPNPAQEDSNSDGVGDICESDFDQDQVIDRIDVCPENAEVTLTDFRAYQTVGLDP
                     EGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTQTDDDYAGF
                     IFGYQDSSSFYVVMWKQTEQTYWQATPFRAVAEPGIQLKAVKSKTGPGEHLRNSLWHT
                     GDTSDQVRLLWKDSRNVGWKDKVSYRWFLQHRPQVGYIRVRFYEGSELVADSGVTIDT
                     TMRGGRLGVFCFSQENIIWSNLKYRCNDTIPEDFQEFQTQNFDRFDN"
     sig_peptide     28..90
     mat_peptide     91..2910
                     /product="thrombospondin-4"
                     /function="unknown"
                     /standard_name="thrombospondin-4"
                     /citation=[1]
     repeat_unit     889..1008
                     /rpt_family="thrombospondin type 2"
                     /rpt_type=TANDEM
     repeat_unit     1009..1167
                     /rpt_family="thrombospondin type 2"
                     /rpt_type=TANDEM
     repeat_unit     1168..1290
                     /rpt_family="thrombospondin type 2"
                     /rpt_type=TANDEM
     repeat_unit     1291..1416
                     /rpt_family="thrombospondin type 2"
                     /rpt_type=TANDEM
     repeat_unit     1480..2175
                     /function="calcium binding"
                     /rpt_family="thrombospondin type 3"
                     /rpt_type=TANDEM
     misc_binding    1711..1722
                     /bound_moiety="integrins"
                     /function="cell binding site"
     polyA_site      3031..3036
BASE COUNT      796 a    768 c    842 g    668 t
ORIGIN      
        1 gaattccggg gagcaggaag agccaacatg ctggccccgc gcggagccgc cgtcctcctg
       61 ctgcacctgg tcctgcagcg gtggctagcg gcaggcgccc aggccacccc ccaggtcttt
      121 gaccttctcc catcttccag tcagaggcta aacccaggcg ctctgctgcc agtcctgaca
      181 gaccccgccc tgaatgatct ctatgtgatt tccaccttca agctgcagac taaaagttca
      241 gccaccatct tcggtcttta ctcttcaact gacaacagta aatattttga atttactgtg
      301 atgggacgct taagcaaagc catcctccgt tacctgaaga acgatgggaa ggtgcatttg
      361 gtggttttca acaacctgca gctggcagac ggaaggcggc acaggatcct cctgaggctg
      421 agcaatttgc agcgaggggc cggctcccta gagctctacc tggactgcat ccaggtggat
      481 tccgttcaca atctccccag ggcctttgct ggcccctccc agaaacctga gaccattgaa
      541 ttgaggactt tccagaggaa gccacaggac ttcttggaag agctgaagct ggtggtgaga
      601 ggctcactgt tccaggtggc cagcctgcaa gactgcttcc tgcagcagag tgagccactg
      661 gctgccacag gcacagggga ctttaaccgg cagttcttgg gtcaaatgac acaattaaac
      721 caactcctgg gagaggtgaa ggaccttctg agacagcagg ttaaggaaac atcatttttg
      781 cgaaacacca tagctgaatg ccaggcttgc ggtcctctca agtttcagtc tccgacccca
      841 agcacggtgg tcgccccggc tccccctgca ccgccaacac gcccacctcg tcggtgtgac
      901 tccaacccat gtttccgagg tgtccaatgt accgacagta gagatggctt ccagtgtggg
      961 ccctgccccg agggctacac aggaaacggg atcacctgta ttgatgttga tgagtgcaaa
     1021 taccatccct gctacccggg cgtgcactgc ataaatttgt ctcctggctt cagatgtgac
     1081 gcctgcccag tgggcttcac agggcccatg gtgcagggtg ttgggatcag ttttgccaag
     1141 tcaaacaagc aggtctgcac tgacattgat gagtgtcgaa atggagcgtg cgttcccaac
     1201 tcgatctgcg ttaatacttt gggatcttac cgctgtgggc cttgtaagcc ggggtatact
     1261 ggtgatcaga taaggggatg caaagtggaa agaaactgca gaaacccaga gctgaaccct
     1321 tgcagtgtga atgcccagtg cattgaagag aggcaggggg atgtgacatg tgtgtgtgga
     1381 gtcggttggg ctggagatgg ctatatctgt ggaaaggatg tggacatcga cagttacccc
     1441 gacgaagaac tgccatgctc tgccaggaac tgtaaaaagg acaactgcaa atatgtgcca
     1501 aattctggcc aagaagatgc agacagagat ggcattggcg acgcttgtga cgaggatgct
     1561 gacggagatg ggatcctgaa tgagcaggat aactgtgtcc tgattcataa tgtggaccaa
     1621 aggaacagcg ataaagatat ctttggggat gcctgtgata actgcctgag tgtcttaaat
     1681 aacgaccaga aagacaccga tggggatgga agaggagatg cctgtgatga tgacatggat
     1741 ggagatggaa taaaaaacat tctggacaac tgcccaaaat ttcccaatcg tgaccaacgg
     1801 gacaaggatg gtgatggtgt gggggatgcc tgtgacagtt gtcctgatgt cagcaaccct
     1861 aaccagtctg atgtggataa tgatctggtt ggggactcct gtgacaccaa tcaggacagt
     1921 gatggagatg ggcaccagga cagcacagac aactgcccca ccgtcattaa cagtgcccag
     1981 ctggacaccg ataaggatgg aattggtgac gagtgtgatg atgatgatga caatgatggt
     2041 atcccagacc tggtgccccc tggaccagac aactgccggc tggtccccaa cccagcccag
     2101 gaggatagca acagcgacgg agtgggagac atctgtgagt ctgactttga ccaggaccag
     2161 gtcatcgatc ggatcgacgt ctgcccagag aacgcagagg tcaccctgac cgacttcagg
     2221 gcttaccaga ccgtgggcct ggatcctgaa ggggatgccc agatcgatcc caactgggtg
     2281 gtcctgaacc agggcatgga gattgtacag accatgaaca gtgatcctgg cctggcagtg
     2341 gggtacacag cttttaatgg agttgacttc gaagggacct tccatgtgaa tacccagaca
     2401 gatgatgact atgcaggctt tatctttggc taccaagata gctccagctt ctacgtggtc
     2461 atgtggaagc agacggagca gacatattgg caagccaccc cattccgagc agttgcagaa
     2521 cctggcattc agctcaaggc tgtgaagtct aagacaggtc caggggagca tctccggaac
     2581 tccctgtggc acacggggga caccagtgac caggtcaggc tgctgtggaa ggactccagg
     2641 aatgtgggct ggaaggacaa ggtgtcctac cgctggttcc tacagcacag gccccaggtg
     2701 ggctacatca gggtacgatt ttatgaaggc tctgagttgg tggctgactc tggcgtcacc
     2761 atagacacca caatgcgtgg aggccgactt ggcgttttct gcttctctca agaaaacatc
     2821 atctggtcca acctcaagta tcgctgcaat gacaccatcc ctgaggactt ccaagagttt
     2881 caaacccaga atttcgaccg cttcgataat taaaccaagg aagcaatctg taactgcttt
     2941 tcggaacact aaaaccatat atattttaac ttcaattttc tttagctttt accaacccaa
     3001 atatatcaaa acgttttatg tgaatgtggc aataaaggag aagagatcat ttttaaaaaa
     3061 aaaaaaaaaa aaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Z74615. H.sapiens mRNA fo...[gi:1418927] Links  


LOCUS       HSPPA1ICO               6728 bp    mRNA    linear   PRI 07-MAR-1997
DEFINITION  H.sapiens mRNA for prepro-alpha1(I) collagen.
ACCESSION   Z74615
VERSION     Z74615.1  GI:1418927
KEYWORDS    alpha1(I)-collagen.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1537 to 3803)
  AUTHORS   Bernard,M.P., Chu,M.L., Myers,J.C., Ramirez,F., Eikenberry,E.F. and
            Prockop,D.J.
  TITLE     Nucleotide sequences of complementary deoxyribonucleic acids for
            the pro alpha 1 chain of human type I procollagen. Statistical
            evaluation of structures that are conserved during evolution
  JOURNAL   Biochemistry 22 (22), 5213-5223 (1983)
  MEDLINE   84080385
REFERENCE   2  (bases 1 to 36)
  AUTHORS   Chu,M.L., de Wet,W., Bernard,M. and Ramirez,F.
  TITLE     Fine structural analysis of the human pro-alpha 1 (I) collagen
            gene. Promoter structure, AluI repeats, and polymorphic transcripts
  JOURNAL   J. Biol. Chem. 260 (4), 2315-2320 (1985)
  MEDLINE   85130970
REFERENCE   3  (bases 3804 to 4481)
  AUTHORS   Makela,J.K., Raassina,M., Virta,A. and Vuorio,E.
  TITLE     Human pro alpha 1(I) collagen: cDNA sequence for the C-propeptide
            domain
  JOURNAL   Nucleic Acids Res. 16 (1), 349 (1988)
  MEDLINE   88124208
REFERENCE   4  (bases 37 to 1536)
  AUTHORS   Tromp,G., Kuivaniemi,H., Stacey,A., Shikata,H., Baldwin,C.T.,
            Jaenisch,R. and Prockop,D.J.
  TITLE     Structure of a full-length cDNA clone for the prepro alpha 1(I)
            chain of human type I procollagen
  JOURNAL   Biochem. J. 253 (3), 919-922 (1988)
  MEDLINE   89025644
REFERENCE   5  (bases 4482 to 6728)
  AUTHORS   Maatta,A., Bornstein,P. and Penttinen,R.P.
  TITLE     Highly conserved sequences in the 3'-untranslated region of the
            COL1A1 gene bind cell-specific nuclear proteins
  JOURNAL   FEBS Lett. 279 (1), 9-13 (1991)
  MEDLINE   91138770
REFERENCE   6  (bases 1 to 6728)
  AUTHORS   Dalgleish,R.
  TITLE     The human type I collagen mutation database
  JOURNAL   Nucleic Acids Res. 25 (1), 181-187 (1997)
  MEDLINE   97169389
REFERENCE   7  (bases 1 to 6728)
  AUTHORS   Dalgleish,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-JUL-1996) Raymond Dalgleish, Department of Genetics,
            University of Leicester, University Road, Leicester, LE1 7RH,
            United Kingdom
FEATURES             Location/Qualifiers
     source          1..6728
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            1..222
                     /number=1
     5'UTR           1..119
     CDS             120..4514
                     /codon_start=1
                     /product="prepro-alpha1(I) collagen"
                     /protein_id="CAA98968.1"
                     /db_xref="GI:1418928"
                     /db_xref="SWISS-PROT:P02452"
                     /translation="MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNG
                     LRYHDRDVWKPEPCRICVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESP
                     TDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFA
                     PQLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMG
                     PRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGA
                     KGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGATGAAGPP
                     GPTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGNPGADG
                     QPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKGDTGAKGE
                     PGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPA
                     GERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPG
                     PPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGP
                     AGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFP
                     GERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPG
                     PKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAGPTGARGA
                     PGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNV
                     GAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETG
                     PAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGF
                     PGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPAAEGSPGRDGSP
                     GAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAG
                     PQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGS
                     AGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSAGFDFSFLPQP
                     PQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDL
                     KMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKD
                     KRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMD
                     QQTGNLKKALLLKGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKSSR
                     LPIIDVAPLDVGAPDQEFGFDVGPVCFL"
     sig_peptide     120..185
     misc_feature    186..602
                     /note="N_propeptide"
     exon            223..417
                     /number=2
     exon            418..452
                     /number=3
     exon            453..488
                     /number=4
     exon            489..590
                     /number=5
     exon            591..662
                     /number=6
     misc_feature    603..653
                     /note="N_telopeptide"
     misc_feature    654..3695
                     /note="triple_helix"
     exon            663..707
                     /number=7
     exon            708..761
                     /number=8
     exon            762..815
                     /number=9
     exon            816..869
                     /number=10
     exon            870..923
                     /number=11
     exon            924..977
                     /number=12
     exon            978..1022
                     /number=13
     exon            1023..1076
                     /number=14
     exon            1077..1121
                     /number=15
     exon            1122..1175
                     /number=16
     exon            1176..1274
                     /number=17
     exon            1275..1319
                     /number=18
     exon            1320..1418
                     /number=19
     exon            1419..1472
                     /number=20
     exon            1473..1580
                     /number=21
     exon            1581..1634
                     /number=22
     exon            1635..1733
                     /number=23
     exon            1734..1787
                     /number=24
     exon            1788..1886
                     /number=25
     exon            1887..1940
                     /number=26
     exon            1941..1994
                     /number=27
     exon            1995..2048
                     /number=28
     exon            2049..2102
                     /number=29
     exon            2103..2147
                     /number=30
     exon            2148..2246
                     /number=31
     exon            2247..2354
                     /number=32
     exon            2355..2462
                     /note="exons 33-34"
     exon            2463..2516
                     /number=35
     exon            2517..2570
                     /number=36
     exon            2571..2678
                     /number=37
     exon            2679..2732
                     /number=38
     exon            2733..2786
                     /number=39
     exon            2787..2948
                     /number=40
     exon            2949..3056
                     /number=41
     exon            3057..3164
                     /number=42
     exon            3165..3218
                     /number=43
     exon            3219..3326
                     /number=44
     exon            3327..3380
                     /number=45
     exon            3381..3488
                     /number=46
     exon            3489..3542
                     /number=47
     exon            3543..3650
                     /number=48
     old_sequence    3569..3571
                     /citation=[3]
                     /replace="cnc"
     exon            3651..3933
                     /number=49
     misc_feature    3696..3773
                     /note="C_telopeptide"
     misc_feature    3774..4511
                     /note="C_propeptide"
     exon            3934..4124
                     /number=50
     exon            4125..4367
                     /number=51
     misc_feature    4212..4220
                     /note="carbohydrate attachment site"
     exon            4368..5921
                     /number=52
     polyA_signal    4763..4775
     polyA_site      4798
     polyA_signal    5891..5896
     polyA_site      5921
BASE COUNT     1213 a   2144 c   1971 g   1400 t
ORIGIN      
        1 agcagacggg agtttctcct cggggtcgga gcaggaggca cgcggagtgt gaggccacgc
       61 atgagcggac gctaaccccc tccccagcca caaagagtct acatgtctag ggtctagaca
      121 tgttcagctt tgtggacctc cggctcctgc tcctcttagc ggccaccgcc ctcctgacgc
      181 acggccaaga ggaaggccaa gtcgagggcc aagacgaaga catcccacca atcacctgcg
      241 tacagaacgg cctcaggtac catgaccgag acgtgtggaa acccgagccc tgccggatct
      301 gcgtctgcga caacggcaag gtgttgtgcg atgacgtgat ctgtgacgag accaagaact
      361 gccccggcgc cgaagtcccc gagggcgagt gctgtcccgt ctgccccgac ggctcagagt
      421 cacccaccga ccaagaaacc accggcgtcg agggacccaa gggagacact ggcccccgag
      481 gcccaagggg acccgcaggc ccccctggcc gagatggcat ccctggacag cctggacttc
      541 ccggaccccc cggacccccc ggacctcccg gaccccctgg cctcggagga aactttgctc
      601 cccagctgtc ttatggctat gatgagaaat caaccggagg aatttccgtg cctggcccca
      661 tgggtccctc tggtcctcgt ggtctccctg gcccccctgg tgcacctggt ccccaaggct
      721 tccaaggtcc ccctggtgag cctggcgagc ctggagcttc aggtcccatg ggtccccgag
      781 gtcccccagg tccccctgga aagaatggag atgatgggga agctggaaaa cctggtcgtc
      841 ctggtgagcg tgggcctcct gggcctcagg gtgctcgagg attgcccgga acagctggcc
      901 tccctggaat gaagggacac agaggtttca gtggtttgga tggtgccaag ggagatgctg
      961 gtcctgctgg tcctaagggt gagcctggca gccctggtga aaatggagct cctggtcaga
     1021 tgggcccccg tggcctgcct ggtgagagag gtcgccctgg agcccctggc cctgctggtg
     1081 ctcgtggaaa tgatggtgct actggtgctg ccgggccccc tggtcccacc ggccccgctg
     1141 gtcctcctgg cttccctggt gctgttggtg ctaagggtga agctggtccc caagggcccc
     1201 gaggctctga aggtccccag ggtgtgcgtg gtgagcctgg cccccctggc cctgctggtg
     1261 ctgctggccc tgctggaaac cctggtgctg atggacagcc tggtgctaaa ggtgccaatg
     1321 gtgctcctgg tattgctggt gctcctggct tccctggtgc ccgaggcccc tctggacccc
     1381 agggccccgg cggccctcct ggtcccaagg gtaacagcgg tgaacctggt gctcctggca
     1441 gcaaaggaga cactggtgct aagggagagc ctggccctgt tggtgttcaa ggaccccctg
     1501 gccctgctgg agaggaagga aagcgaggag ctcgaggtga acccggaccc actggcctgc
     1561 ccggaccccc tggcgagcgt ggtggacctg gtagccgtgg tttccctggc gcagatggtg
     1621 ttgctggtcc caagggtccc gctggtgaac gtggttctcc tggccccgct ggccccaaag
     1681 gatctcctgg tgaagctggt cgtcccggtg aagctggtct gcctggtgcc aagggtctga
     1741 ctggaagccc tggcagccct ggtcctgatg gcaaaactgg cccccctggt cccgccggtc
     1801 aagatggtcg ccccggaccc ccaggcccac ctggtgcccg tggtcaggct ggtgtgatgg
     1861 gattccctgg acctaaaggt gctgctggag agcccggcaa ggctggagag cgaggtgttc
     1921 ccggaccccc tggcgctgtc ggtcctgctg gcaaagatgg agaggctgga gctcagggac
     1981 cccctggccc tgctggtccc gctggcgaga gaggtgaaca aggccctgct ggctcccccg
     2041 gattccaggg tctccctggt cctgctggtc ctccaggtga agcaggcaaa cctggtgaac
     2101 agggtgttcc tggagacctt ggcgcccctg gcccctctgg agcaagaggc gagagaggtt
     2161 tccctggcga gcgtggtgtg caaggtcccc ctggtcctgc tggaccccga ggggccaacg
     2221 gtgctcccgg caacgatggt gctaagggtg atgctggtgc ccctggagct cccggtagcc
     2281 agggcgcccc tggccttcag ggaatgcctg gtgaacgtgg tgcagctggt cttccagggc
     2341 ctaagggtga cagaggtgat gctggtccca aaggtgctga tggctctcct ggcaaagatg
     2401 gcgtccgtgg tctgaccggc cccattggtc ctcctggccc tgctggtgcc cctggtgaca
     2461 agggtgaaag tggtcccagc ggccctgctg gtcccactgg agctcgtggt gcccccggag
     2521 accgtggtga gcctggtccc cccggccctg ctggctttgc tggcccccct ggtgctgacg
     2581 gccaacctgg tgctaaaggc gaacctggtg atgctggtgc caaaggcgat gctggtcccc
     2641 ctgggcctgc cggacccgct ggaccccctg gccccattgg taatgttggt gctcctggag
     2701 ccaaaggtgc tcgcggcagc gctggtcccc ctggtgctac tggtttccct ggtgctgctg
     2761 gccgagtcgg tcctcctggc ccctctggaa atgctggacc ccctggccct cctggtcctg
     2821 ctggcaaaga aggcggcaaa ggtccccgtg gtgagactgg ccctgctgga cgtcctggtg
     2881 aagttggtcc ccctggtccc cctggccctg ctggcgagaa aggatcccct ggtgctgatg
     2941 gtcctgctgg tgctcctggt actcccgggc ctcaaggtat tgctggacag cgtggtgtgg
     3001 tcggcctgcc tggtcagaga ggagagagag gcttccctgg tcttcctggc ccctctggtg
     3061 aacctggcaa acaaggtccc tctggagcaa gtggtgaacg tggtcccccc ggtcccatgg
     3121 gcccccctgg attggctgga ccccctggtg aatctggacg tgagggggct cctgctgccg
     3181 aaggttcccc tggacgagac ggttctcctg gcgccaaggg tgaccgtggt gagaccggcc
     3241 ccgctggacc ccctggtgct cctggtgctc ctggtgcccc tggccccgtt ggccctgctg
     3301 gcaagagtgg tgatcgtggt gagactggtc ctgctggtcc cgccggtccc gtcggccccg
     3361 tcggcgcccg tggccccgcc ggaccccaag gcccccgtgg tgacaagggt gagacaggcg
     3421 aacagggcga cagaggcata aagggtcacc gtggcttctc tggcctccag ggtccccctg
     3481 gccctcctgg ctctcctggt gaacaaggtc cctctggagc ctctggtcct gctggtcccc
     3541 gaggtccccc tggctctgct ggtgctcctg gcaaagatgg actcaacggt ctccctggcc
     3601 ccattgggcc ccctggtcct cgcggtcgca ctggtgatgc tggtcctgtt ggtccccccg
     3661 gccctcctgg acctcctggt ccccctggtc ctcccagcgc tggtttcgac ttcagcttcc
     3721 tgccccagcc acctcaagag aaggctcacg atggtggccg ctactaccgg gctgatgatg
     3781 ccaatgtggt tcgtgaccgt gacctcgagg tggacaccac cctcaagagc ctgagccagc
     3841 agatcgagaa catccggagc ccagagggaa gccgcaagaa ccccgcccgc acctgccgtg
     3901 acctcaagat gtgccactct gactggaaga gtggagagta ctggattgac cccaaccaag
     3961 gctgcaacct ggatgccatc aaagtcttct gcaacatgga gactggtgag acctgcgtgt
     4021 accccactca gcccagtgtg gcccagaaga actggtacat cagcaagaac cccaaggaca
     4081 agaggcatgt ctggttcggc gagagcatga ccgatggatt ccagttcgag tatggcggcc
     4141 agggctccga ccctgccgat gtggccatcc agctgacctt cctgcgcctg atgtccaccg
     4201 aggcctccca gaacatcacc taccactgca agaacagcgt ggcctacatg gaccagcaga
     4261 ctggcaacct caagaaggcc ctgctcctca agggctccaa cgagatcgag atccgcgccg
     4321 agggcaacag ccgcttcacc tacagcgtca ctgtcgatgg ctgcacgagt cacaccggag
     4381 cctggggcaa gacagtgatt gaatacaaaa ccaccaagtc ctcccgcctg cccatcatcg
     4441 atgtggcccc cttggacgtt ggtgccccag accaggaatt cggcttcgac gttggccctg
     4501 tctgcttcct gtaaactccc tccatcccaa cctggctccc tcccacccaa ccaactttcc
     4561 ccccaacccg gaaacagaca agcaacccaa actgaacccc cccaaaagcc aaaaaatggg
     4621 agacaatttc acatggactt tggaaaatat ttttttcctt tgcattcatc tctcaaactt
     4681 agtttttatc tttgaccaac cgaacatgac caaaaaccaa aagtgcattc aaccttacca
     4741 aaaaaaaaaa aaaaaaaaaa agaataaata aataagtttt taaaaaagga agcttggtcc
     4801 acttgcttga agacccatgc gggggtaagt ccctttctgc ccgttgggtt atgaaacccc
     4861 aatgctgccc tttctgctcc tttctccaca ccccccttgg cctcccctcc actccttccc
     4921 aaatctgtct ccccagaaga cacaggaaac aatgtattgt ctgcccagca atcaaaggca
     4981 atgctcaaac acccaagtgg cccccaccct cagcccgctc ctgcccgccc agcaccccca
     5041 ggccctgggg acctggggtt ctcagactgc caaagaagcc ttgccatctg gcgctcccat
     5101 ggctcttgca acatctcccc ttcgtttttg agggggtcat gccgggggag ccaccagccc
     5161 ctcactgggt tcggaggaga gtcaggaagg gccacgacaa agcagaaaca tcggatttgg
     5221 ggaacgcgtg tcatcccttg tgccgcaggc tgggcgggag agactgttct gttctgttcc
     5281 ttgtgtaact gtgttgctga aagactacct cgttcttgtc ttgatgtgtc accggggcaa
     5341 ctgcctgggg gcggggatgg gggcagggtg gaagcggctc cccattttta taccaaaggt
     5401 gctacatcta tgtgatgggt ggggtgggga gggaatcact ggtgctatag aaattgagat
     5461 gcccccccag gccagcaaat gttccttttt gttcaaagtc tatttttatt ccttgatatt
     5521 ttttctttct tttttttttt ttttgtggat ggggacttgt gaatttttct aaaggtgcta
     5581 tttaacatgg gaggagagcg tgtgcgctcc agcccagccc gctgctcact ttccaccctc
     5641 tctccacctg cctctggctt ctcaggcctc tgctctccga cctctctcct ctgaaaccct
     5701 cctccacagc tgcagcccat cctcccggct ccctcctagt ctgtcctgcg tcctctgtcc
     5761 ccgggtttca gagacaactt cccaaagcac aaagcagttt ttccctaggg gtgggaggaa
     5821 gcaaaagact ctgtacctat tttgtatgtg tataataatt tgagatgttt ttaattattt
     5881 tgattgctgg aataaagcat gtggaaatga cccaaacata atccgcagtg gcctcctaat
     5941 ttccttcttt ggagttgggg gaggggtaga catggggaag gggccttggg gtgatgggct
     6001 tgccttccat tcctgccctt tccctcccca ctattctctt ctagatccct ccataacccc
     6061 actccccttt ctctcaccct tcttataccg caaacctttc tacttcctct ttcattttct
     6121 attcttgcaa tttccttgca ccttttccaa atcctcttct cccctgcaat accatacagg
     6181 caatccacgt gcacaacaca cacacacact cttcacatct ggggttgtcc aaacctcata
     6241 cccactcccc ttcaagccca tccactctcc accccctgga tgccctgcac ttggtggcgg
     6301 tgggatgctc atggatactg ggagggtgag gggagtggaa cccgtgagga ggacctgggg
     6361 gcctctcctt gaactgacat gaagggtcat ctggcctctg ctcccttctc acccacgctg
     6421 acctcctgcc gaaggagcaa cgcaacagga gaggggtctg ctgagcctgg cgagggtctg
     6481 ggagggacca ggaggaaggc gtgctccctg ctcgctgtcc tggccctggg ggagtgaggg
     6541 agacagacac ctgggagagc tgtggggaag gcactcgcac cgtgctcttg ggaaggaagg
     6601 agacctggcc ctgctcacca cggactgggt gcctcgacct cctgaatccc cagaacacaa
     6661 cccccctggg ctggggtggt ctggggaacc atcgtgcccc cgcctcccgc ctactccttt
     6721 ttaagctt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC007038. Homo sapiens, lum...[gi:13937864] Links  


LOCUS       BC007038                1804 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, lumican, clone MGC:12410 IMAGE:3950745, mRNA,
            complete cds.
ACCESSION   BC007038
VERSION     BC007038.1  GI:13937864
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1804)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-APR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CLONTECH
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 16 Row: g Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4505046.
FEATURES             Location/Qualifiers
     source          1..1804
                     /organism="Homo sapiens"
                     /db_xref="LocusID:4060"
                     /db_xref="taxon:9606"
                     /clone="MGC:12410 IMAGE:3950745"
                     /tissue_type="Prostate"
                     /clone_lib="NIH_MGC_83"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             142..1158
                     /codon_start=1
                     /product="lumican"
                     /protein_id="AAH07038.1"
                     /db_xref="GI:13937865"
                     /translation="MSLSAFTLFLALIGGTSGQYYDYDFPLSIYGQSSPNCAPECNCP
                     ESYPSAMYCDELKLKSVPMVPPGIKYLYLRNNQIDHIDEKAFENVTDLQWLILDHNLL
                     ENSKIKGRVFSKLKQLKKLHINHNNLTESVGPLPKSLEDLQLTHNKITKLGSFEGLVN
                     LTFIHLQHNRLKEDAVSAAFKGLKSLEYLDLSFNQIARLPSGLPVSLLTLYLDNNKIS
                     NIPDEYFKRFNALQYLRLSHNELADSGIPGNSFNVSSLVELDLSYNKLKNIPTVNENL
                     ENYYLEVNQLEKFDIKSFCKILGPLSYSKIKHLRLDGNRISETSLPPDMYECLRVANE
                     VTLN"
BASE COUNT      583 a    348 c    310 g    563 t
ORIGIN      
        1 gtatcactca gaatctggca gccagttccg tcctgacaga gttcacagca tatattggtg
       61 gattcttgtc catagtgcat ctgctttaag aattaacgaa agcagtgtca agacagtaag
      121 gattcaaacc atttgccaaa aatgagtcta agtgcattta ctctcttcct ggcattgatt
      181 ggtggtacca gtggccagta ctatgattat gattttcccc tatcaattta tgggcaatca
      241 tcaccaaact gtgcaccaga atgtaactgc cctgaaagct acccaagtgc catgtactgt
      301 gatgagctga aattgaaaag tgtaccaatg gtgcctcctg gaatcaagta tctttacctt
      361 aggaataacc agattgacca tattgatgaa aaggcctttg agaatgtaac tgatctgcag
      421 tggctcattc tagatcacaa ccttctagaa aactccaaga taaaagggag agttttctct
      481 aaattgaaac aactgaagaa gctgcatata aaccacaaca acctgacaga gtctgtgggc
      541 ccacttccca aatctctgga ggatctgcag cttactcata acaagatcac aaagctgggc
      601 tcttttgaag gattggtaaa cctgaccttc atccatctcc agcacaatcg gctgaaagag
      661 gatgctgttt cagctgcttt taaaggtctt aaatcactcg aataccttga cttgagcttc
      721 aatcagatag ccagactgcc ttctggtctc cctgtctctc ttctaactct ctacttagac
      781 aacaataaga tcagcaacat ccctgatgag tatttcaagc gttttaatgc attgcagtat
      841 ctgcgtttat ctcacaacga actggctgat agtggaatac ctggaaattc tttcaatgtg
      901 tcatccctgg ttgagctgga tctgtcctat aacaagctta aaaacatacc aactgtcaat
      961 gaaaaccttg aaaactatta cctggaggtc aatcaacttg agaagtttga cataaagagc
     1021 ttctgcaaga tcctggggcc attatcctac tccaagatca agcatttgcg tttggatggc
     1081 aatcgcatct cagaaaccag tcttccaccg gatatgtatg aatgtctacg tgttgctaac
     1141 gaagtcactc ttaattaata tctgtatcct ggaacaatat tttatggtta tgtttttctg
     1201 tgtgtcagtt ttcatagtat ccatatttta ttactgttta ttacttccat gaattttaaa
     1261 atctgaggga aatgttttgt aaacatttat tttttttaaa gaaaagatga aaggcaggcc
     1321 tatttcatca caagaacaca cacatataca cgaatagaca tcaaactcaa tgctttattt
     1381 gtaaatttag tgttttttta tttctactgt caaatgatgt gcaaaacctt ttactggttg
     1441 catggaaatc agccaagttt tataatcctt aaatcttaat gttcctcaaa gcttggatta
     1501 aatacatatg gatgttactc tcttgcacca aattatcttg atacattcaa atttgtctgg
     1561 ttaaaaaata ggtggtagat attgaggcca agaatattgc aaaatacatg aagcttcatg
     1621 cacttaaaga agtattttta gaataagaat ttgcatactt acctagtgaa acttttctag
     1681 aattattttt cactctaagt catgtatgtt tctctttgat tatttgcatg ttatgtttaa
     1741 taagctacta gcaaaataaa acatagcaaa tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1801 aaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: AI755052. cr34f05.x1 Jia bo...[gi:5133305] Links  


IDENTIFIERS

dbEST Id:       2664787
EST name:       cr34f05.x1
GenBank Acc:    AI755052
GenBank gi:     5133305

CLONE INFO
Clone Id:       HBMSC_cr34f05 (3')
Source:         Libin Jia
Plate:          34 Row: f Column: 05
DNA type:       cDNA

PRIMERS
Sequencing:     -21M13 forward primer (ABI)
PolyA Tail:     Unknown

SEQUENCE
                TTTTTTTTTTTTTTTTTTTGAATTTTAATATGATATTTTATTATGGGTGTCTGTAAGGAA
                AAAAAAGATCAACAACCACATACAAGCTTACAAAGTTAAATTTCAACACATTCTCTATGC
                TAGTGTGACAAAAGCAGCCCCATAATTTGGTTTTTATTGTTGACCTTTACAGGATGAAGG
                AGGAGAATCCCCTGTGGCATGCCAATGAATCTTTCTGATGGGAGACATGTACAGATTTTG
                TGCATTTATGTTCTGAATGCAAGTCAACAATTCTGATCTAGAGTTTAAAAGTGAAAGTAC
                ATTAGCACCATAACATGCGTCTTTAAAGCCTTCCCAAATATTAGTAATCTTGACCAGCAA
                TGACAAGAAAAAAGAGGAGCACCTTTACAAGCAGTTGATATCCAATATTAAAATAATTGT
                GGCTTTAAAAATATTTCTTTAAATTCTTGCATTACACTTTTCTTTTTAAACCAATCTTCC
                AGGAGATTAATCAATGAAATTTATAAGTTTTATCAACGTATAAAATTTTTTTCATCTTCT
                GGGACTCATAGAATACAATCTGTGTTTCTGACCAGTTGAGGTAGTTAAAATAGGGAGGGC
                TTTTCTAATTTCGT

Entry Created:  Jun 22 1999
Last Updated:   Jun 20 2002

COMMENTS
                DNA Sequencing and analyses by National Institutes of Health
                Intramural Sequencing Center (NISC).

LIBRARY
Lib Name:       Human bone marrow stromal cells
Organism:       Homo sapiens
Sex:            mixed
Tissue type:    bone marrow stroma
Develop. stage: mixed
Lab host:       XL1-Blue MRF'/SOLR
Vector:         pBluescript
R. Site 1:      EcoRI
R. Site 2:      XhoI
Description:    mRNA made from human bone marrow stroma, cDNA made by
                oligo-dT priming. Directionally cloned. Size-selected for
                average insert size >0.5 kb. Library constructed by Dr.
                Marian Young and Dr. Pamela Gehron Robey (NIDCR). Library
                supplied by Dr. Libin Jia (NHGRI)

SUBMITTER
Name:           Libin Jia
Lab:            Medical Genetics Branch
Institution:    National Human Genome Research Institute
Address:        10/10C101, 9000 Rockville Pike, Bethesda, MD 20892-1267, USA
Tel:            301-402-4877
Fax:            301-496-7157
E-mail:         libin@helix.nih.gov

CITATIONS
Medline UID:    21686149
Title:          Gene expression profile of human bone marrow stromal cells:
                high-throughput expressed sequence tag sequencing analysis
Authors:        Jia,L., Young,M.F., Powell,J., Yang,L., Ho,N.C., Hotchkiss
                ,R., Robey,P.G., Francomano,C.A.
Citation:       Genomics 79 (1): 7-17 2002


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M26939. Human collagen ty...[gi:180813] Links  


LOCUS       HUMCOL3A1A              2275 bp    DNA     linear   PRI 01-NOV-1994
DEFINITION  Human collagen type-III (COL3A1) gene, 5' end.
ACCESSION   M26939
VERSION     M26939.1  GI:180813
KEYWORDS    Alu repeat; collagen.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2275)
  AUTHORS   Benson-Chanda,V., Su,M.W., Weil,D., Chu,M.L. and Ramirez,F.
  TITLE     Cloning and analysis of the 5' portion of the human type-III
            procollagen gene (COL3A1)
  JOURNAL   Gene 78 (2), 255-265 (1989)
  MEDLINE   89378752
   PUBMED   2777083
COMMENT     Original source text: Human DNA, (libraries of A.Bank, T.Maniatis,
            and M.Baird).
            Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Ramirez, 12-OCT-1989.
            Though the sequence was obtained from genomic DNA, only the exon
            sequences in the coding regions are reported.
FEATURES             Location/Qualifiers
     source          1..2275
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..2275
                     /gene="COL3A1"
     repeat_region   <1..237
                     /note="Alu repeat"
     TATA_signal     1602..1607
                     /gene="COL3A1"
     mRNA            1629..>2275
                     /gene="COL3A1"
                     /product="COL3A1 mRNA"
     misc_signal     1727..1754
                     /gene="COL3A1"
                     /note="region of dyad symmetry"
     CDS             1748..>2275
                     /gene="COL3A1"
                     /note="alpha-1 preprocollagen type-III"
                     /codon_start=1
                     /protein_id="AAA52040.1"
                     /db_xref="GI:180814"
                     /translation="MMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDV
                     WKPEPCQICVCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQ
                     GPQGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDV
                     KSGVAVGGLAGYPGPA"
     sig_peptide     1748..1816
                     /gene="COL3A1"
                     /note="alpha-1 collagen type-III signal peptide"
     mat_peptide     2207..>2275
                     /gene="COL3A1"
                     /product="alpha-1 collagen type-III"
BASE COUNT      701 a    435 c    508 g    631 t
ORIGIN      1919 bp upstream of BamHI site.
        1 ggcgggcggg atcacgaggt caggacatca gatccatcct gactaactcg gtgaaacccg
       61 tctctactaa aaatacaaaa aattaaccag gcgtagtggc gggcctgtag tcccagctac
      121 tcgggaggct gaggcaggag aatggcgtga acccgggagc ggacttgccg tgagccagat
      181 tgcgcctgca gtctagcctg gagagagggc gaccccgtac gaaaaaaaaa aaaaaaagat
      241 gcaagaaaca agaaagcaag tttgccactg tccatgctta caggattcat gtacacctgg
      301 gatatacata tgacatgagt tgatgaaaat ttagtaacac aaattagaaa gagaaagatg
      361 aaaattagtt taaagaggta atttacttaa ggaaagtttt gcattaggca taaggaagtg
      421 acttccatta attggtgtga aatggggaga attttatgtg cacaggcata gagacggaca
      481 tgtttaggtg aaggtgaaga aaaggagttc aagtgacctt gattagaatt gaaatgtcca
      541 ccgtagatta tagagataga tgaactaact atgaactgat gattagggtg taaagaaata
      601 tctccattgc cagatatagc cttatactag tggacaagca tatttttcct gcagagattt
      661 atcaaagacc tataaatgta tttgcccctc tttcaaatat gacacaaaaa ttactgctta
      721 gatagatcca gaaatacatt ggaaagtttt ctggtatgag gatctggaaa gtatacacat
      781 aaaagtctaa attagaaata tttatactaa tctttagagt aatgcagaaa aaatgaccac
      841 tttctctggt actcttttgc tattctatgt gataaactct ttctcttcca gtttcctaaa
      901 aagttatgtc atgaaaaggc tgattctcta gaaacttgct tgcttcccag gcagcataaa
      961 atggaatctc acagaagact ctggcttgct gaagaaattg tgagaaatgg aaacagctat
     1021 agataagtat ctaactttta ggaagccatt caaacattgc tgaaatactt gtcttttgat
     1081 atttgcctga aacttaactt cctaggaccc aggtggtgat gaagtcgaag gggcataccg
     1141 gtgaggcatt tctttccgtg agtctcttac agtctcctat ttaaattgag ttaggattac
     1201 ttctggcaaa atcccaacat aaaaatcttc taggaagatc agttctgtaa attagacata
     1261 ctagataaat gggcatcaag cagtttttca aaattatgca gttgttaact tcataagggg
     1321 aaataaaaat gtatgcattt acattgtatg attaaaacaa ggcagagcat ttctatacgt
     1381 tcctaagtta tacaaacata tatgtaagag tgaaatatgt aaaaaaactt ttacataagc
     1441 agatgcatac aaactccaga tgtgctcttt ttcttactgt gggttgtgtc ttctataagg
     1501 gaaaaagaaa tatttatcat ttcttttact gctgagggga tgggtgcggc tctcatattt
     1561 cagaaagggg ctggaaagtg agggaagcca aactttttcc tatttaaggc caaagcaaag
     1621 gaatctcagt ggctgagttt tatgacgggc ccggtgcctg aagggcaggg aacaacttga
     1681 tggtgctact ttgaactgct tttcttttct ccttttgcac caagagtctc atgtctgata
     1741 tttagacatg atgagctttg tgcaaaaggg gagctggcta cttctcgctc tgcttcatcc
     1801 cactattatt ctggcacaac aggaagctgt tgaaggagga tgttcccatc ttggtcagtc
     1861 ctatgcggat agagatgtct ggaagccaga accatgccaa atatgtgtct gtgactcagg
     1921 atccgttctc tgcgatgaca taatatgtga cgatcaagaa ttagactgcc ccaacccaga
     1981 aattccattt ggagaatgtt gtgcagtttg cccacagcct ccaactgctc ctactcgccc
     2041 tcctaatggt caaggacctc aaggccccaa gggagatcca ggacctcctg gtattcctgg
     2101 gagaaatggt gaccctggta ttccaggaca accagggtcc cctggttctc ctggcccccc
     2161 tggaatctgt gaatcatgcc ctactggtcc tcagaactat tctccccagt atgattcata
     2221 tgatgtcaag tctggagtag cagtaggagg actcgcaggc tatcctggac cagct
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X14420. Human mRNA for pr...[gi:30057] Links  


LOCUS       HSCOL3AI                5460 bp    mRNA    linear   PRI 31-MAR-1995
DEFINITION  Human mRNA for pro-alpha-1 type 3 collagen.
ACCESSION   X14420
VERSION     X14420.1  GI:30057
KEYWORDS    COL3A1 gene; collagen; collagen alpha 1 type III; collagen type
            III.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3690; 4501 to 5460)
  AUTHORS   Ala-Kokko,L., Kontusaari,S., Baldwin,C.T., Kuivaniemi,H. and
            Prockop,D.J.
  TITLE     Structure of cDNA clones coding for the entire prepro alpha 1 (III)
            chain of human type III procollagen. Differences in protein
            structure from type I procollagen and conservation of codon
            preferences
  JOURNAL   Biochem. J. 260 (2), 509-516 (1989)
  MEDLINE   89350838
REFERENCE   2  (bases 1 to 5460)
  AUTHORS   Prockop,D.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-FEB-1989) Prockop D.J., Department of Biochemistry
            and Molecular Biology, Thomas Jefferson University, 1020 Locust
            Street, Room 490, Philadelphia, PA 19107
COMMENT     Data kindly reviewed (29-aug-1989) by Ala-Kokko L.
FEATURES             Location/Qualifiers
     source          1..5460
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="2q"
                     /clone="S413 and S31."
                     /cell_line="JIMM-69"
                     /cell_type="skin fibroblast"
                     /clone_lib="CB-JIMM-69"
                     /dev_stage="neonatal"
     CDS             103..4503
                     /codon_start=1
                     /product="prepro-alpha-1 type 3 collagen"
                     /protein_id="CAA32583.1"
                     /db_xref="GI:30058"
                     /db_xref="SWISS-PROT:P02461"
                     /translation="MMSFVQKGSWLLLALLHPTIILAQQEAVEGGCSHLGQSYADRDV
                     WKPEPCQICVCDSGSVLCDDIICDDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQ
                     GPQGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDV
                     KSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPP
                     GAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKG
                     ETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGP
                     PGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSP
                     GGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERG
                     EAGIPGVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPAGE
                     RGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRPGPP
                     GPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPTG
                     PGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGE
                     RGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPK
                     GDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPG
                     ERGETGPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAGPPGP
                     QGVKGERGSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPGKDGPPGPAGNTGAP
                     GSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPG
                     PQGVKGESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGG
                     KGDRGENGSPGAPGAPGHPGPPGPVGPAGKSGDRGESGPAGPAGAPGPAGSRGAPGPQ
                     GPRGDKGETGERGAAGIKGHRGFPGNPGAPGSPGPAGQQGAIGSPGPAGPRGPVGPSG
                     PPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAPGPCCGGVGAAA
                     IAGIGGEKAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIESLISPDGSRKNPARNC
                     RDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSS
                     AEKKHVWFGESMDGGFQFSYGNPELPEDVLDVQLAFLRLLSSRASQNITYHCKNSIAY
                     MDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKA
                     VRLPIVDIAPYDIGGPDQEFGVDVGPVCFL"
     sig_peptide     103..174
                     /note="signal peptide (AA 1-24)"
     mat_peptide     174..4500
                     /note="pro-alpha-1 type 3 collagen"
     variation       2740
                     /note="c is u in variant clone; changing cuu (Leu) to uuu
                     (Phe)"
BASE COUNT     1327 a   1314 c   1544 g   1275 t
ORIGIN      
        1 cgggcccggt gctgaagggc agggaacaac ttgatggtgc tactttgaac tgcttttctt
       61 ttctcctttt tgcacaaaga gtctcatgtc tgatatttag acatgatgag ctttgtgcaa
      121 aaggggagct ggctacttct cgctctgctt catcccacta ttattttggc acaacaggaa
      181 gctgttgaag gaggatgttc ccatcttggt cagtcctatg cggatagaga tgtctggaag
      241 ccagaaccat gccaaatatg tgtctgtgac tcaggatccg ttctctgcga tgacataata
      301 tgtgacgatc aagaattaga ctgccccaac ccagaaattc catttggaga atgttgtgca
      361 gtttgcccac agcctccaac tgctcctact cgccctccta atggtcaagg acctcaaggc
      421 cccaagggag atccaggccc tcctggtatt cctgggagaa atggtgaccc tggtattcca
      481 ggacaaccag ggtcccctgg ttctcctggc ccccctggaa tctgtgaatc atgccctact
      541 ggtcctcaga actattctcc ccagtatgat tcatatgatg tcaagtctgg agtagcagta
      601 ggaggactcg caggctatcc tggaccagct ggccccccag gccctcccgg tccccctggt
      661 acatctggtc atcctggttc ccctggatct ccaggatacc aaggaccccc tggtgaacct
      721 gggcaagctg gtccttcagg ccctccagga cctcctggtg ctataggtcc atctggtcct
      781 gctggaaaag atggagaatc aggtagaccc ggacgacctg gagagcgagg attgcctgga
      841 cctccaggta tcaaaggtcc agctgggata cctggattcc ctggtatgaa aggacacaga
      901 ggcttcgatg gacgaaatgg agaaaagggt gaaacaggtg ctcctggatt aaagggtgaa
      961 aatggtcttc caggcgaaaa tggagctcct ggacccatgg gtccaagagg ggctcctggt
     1021 gagcgaggac ggccaggact tcctggggct gcaggtgctc ggggtaatga cggtgctcga
     1081 ggcagtgatg gtcaaccagg ccctcctggt cctcctggaa ctgccggatt ccctggatcc
     1141 cctggtgcta agggtgaagt tggacctgca gggtctcctg gttcaaatgg tgcccctgga
     1201 caaagaggag aacctggacc tcagggacac gctggtgctc aaggtcctcc tggccctcct
     1261 gggattaatg gtagtcctgg tggtaaaggc gaaatgggtc ccgctggcat tcctggagct
     1321 cctggactga tgggagcccg gggtcctcca ggaccagccg gtgctaatgg tgctcctgga
     1381 ctgcgaggtg gtgcaggtga gcctggtaag aatggtgcca aaggagagcc cggaccacgt
     1441 ggtgaacgcg gtgaggctgg tattccaggt gttccaggag ctaaaggcga agatggcaag
     1501 gatggatcac ctggagaacc tggtgcaaat gggcttccag gagctgcagg agaaaggggt
     1561 gcccctgggt tccgaggacc tgctggacca aatggcatcc caggagaaaa gggtcctgct
     1621 ggagagcgtg gtgctccagg ccctgcaggg cccagaggag ctgctggaga acctggcaga
     1681 gatggcgtcc ctggaggtcc aggaatgagg ggcatgcccg gaagtccagg aggaccagga
     1741 agtgatggga aaccagggcc tcccggaagt caaggagaaa gtggtcgacc aggtcctcct
     1801 gggccatctg gtccccgagg tcagcctggt gtcatgggct tccccggtcc taaaggaaat
     1861 gatggtgctc ctggtaagaa tggagaacga ggtggccctg gaggacctgg ccctcagggt
     1921 cctcctggaa agaatggtga aactggacct caaggacccc cagggcctac tgggcctggt
     1981 ggtgacaaag gagacacagg accccctggt ccacaaggat tacaaggctt gcctggtaca
     2041 ggtggtcctc caggagaaaa tggaaaacct ggggaaccag gtccaaaggg tgatgccggt
     2101 gcacctggag ctccaggagg caagggtgat gctggtgccc ctggtgaacg tggacctcct
     2161 ggattggcag gggccccagg acttagaggt ggagctggtc cccctggtcc cgaaggagga
     2221 aagggtgctg ctggtcctcc tgggccacct ggtgctgctg gtactcctgg tctgcaagga
     2281 atgcctggag aaagaggagg tcttggaagt cctggtccaa agggtgacaa gggtgaacca
     2341 ggcggcccag gtgctgatgg tgtcccaggg aaagatggcc caaggggtcc tactggtcct
     2401 attggtcctc ctggcccagc tggccagcct ggagataagg gtgaaggtgg tgcccccgga
     2461 cttccaggta tagctggacc tcgtggtagc cctggtgaga gaggtgaaac tggccctcca
     2521 ggacctgctg gtttccctgg tgctcctgga cagaatggtg aacctggtgg taaaggagaa
     2581 agaggggctc cgggtgagaa aggtgaagga ggccctcctg gagttgcagg accccctgga
     2641 ggttctggac ctgctggtcc tcctggtccc caaggtgtca aaggtgaacg tggcagtcct
     2701 ggtggacctg gtgctgctgg cttccctggt gctcgtggtc ttcctggtcc tcctggtagt
     2761 aatggtaacc caggaccccc aggtcccagc ggttctccag gcaaggatgg gcccccaggt
     2821 cctgcgggta acactggtgc tcctggcagc cctggagtgt ctggaccaaa aggtgatgct
     2881 ggccaaccag gagagaaggg atcgcctggt gcccagggcc caccaggagc tccaggccca
     2941 cttgggattg ctgggatcac tggagcacgg ggtcttgcag gaccaccagg catgccaggt
     3001 cctaggggaa gccctggccc tcagggtgtc aagggtgaaa gtgggaaacc aggagctaac
     3061 ggtctcagtg gagaacgtgg tccccctgga ccccagggtc ttcctggtct ggctggtaca
     3121 gctggtgaac ctggaagaga tggaaaccct ggatcagatg gtcttccagg ccgagatgga
     3181 tctcctggtg gcaagggtga tcgtggtgaa aatggctctc ctggtgcccc tggcgctcct
     3241 ggtcatccag gcccacctgg tcctgtcggt ccagctggaa agagtggtga cagaggagaa
     3301 agtggccctg ctggccctgc tggtgctccc ggtcctgctg gttcccgagg tgctcctggt
     3361 cctcaaggcc cacgtggtga caaaggtgaa acaggtgaac gtggagctgc tggcatcaaa
     3421 ggacatcgag gattccctgg taatccaggt gccccaggtt ctccaggccc tgctggtcag
     3481 cagggtgcaa tcggcagtcc aggacctgca ggccccagag gacctgttgg acccagtgga
     3541 cctcctggca aagatggaac cagtggacat ccaggtccca ttggaccacc agggcctcga
     3601 ggtaacagag gtgaaagagg atctgagggc tccccaggcc acccagggca accaggccct
     3661 cctggacctc ctggtgcccc tggtccttgc tgtggtggtg ttggagccgc tgccattgct
     3721 gggattggag gtgaaaaagc tggcggtttt gccccgtatt atggagatga accaatggat
     3781 ttcaaaatca acaccgatga gattatgact tcactcaagt ctgttaatgg acaaatagaa
     3841 agcctcatta gtcctgatgg ttctcgtaaa aaccccgcta gaaactgcag agacctgaaa
     3901 ttctgccatc ctgaactcaa gagtggagaa tactgggttg accctaacca aggatgcaaa
     3961 ttggatgcta tcaaggtatt ctgtaatatg gaaactgggg aaacatgcat aagtgccaat
     4021 cctttgaatg ttccacggaa acactggtgg acagattcta gtgctgagaa gaaacacgtt
     4081 tggtttggag agtccatgga tggtggtttt cagtttagct acggcaatcc tgaacttcct
     4141 gaagatgtcc ttgatgtgca gctggcattc cttcgacttc tctccagccg agcttcccag
     4201 aacatcacat atcactgcaa aaatagcatt gcatacatgg atcaggccag tggaaatgta
     4261 aagaaggccc tgaagctgat ggggtcaaat gaaggtgaat tcaaggctga aggaaatagc
     4321 aaattcacct acacagttct ggaggatggt tgcacgaaac acactgggga atggagcaaa
     4381 acagtctttg aatatcgaac acgcaaggct gtgagactac ctattgtaga tattgcaccc
     4441 tatgacattg gtggtcctga tcaagaattt ggtgtggacg ttggccctgt ttgcttttta
     4501 taaaccaaac tctatctgaa atcccaacaa aaaaaattta actccatatg tgttcctctt
     4561 gttctaatct tgtcaaccag tgcaagtgac cgacaaaatt ccagttattt atttccaaaa
     4621 tgtttggaaa cagtataatt tgacaaagaa aaatgatact tctctttttt tgctgttcca
     4681 ccaaatacaa ttcaaatgct ttttgtttta tttttttacc aattccaatt tcaaaatgtc
     4741 tcaatggtgc tataataaat aaacttcaac actctttatg ataacaacac tgtgttatat
     4801 tctttgaatc ctagcccatc tgcagagcaa tgactgtgct caccagtaaa agataacctt
     4861 tctttctgaa atagtcaaat acgaaattag aaaagccctc cctattttaa ctacctcaac
     4921 tggtcagaaa cacagattgt attctatgag tcccagaaga tgaaaaaaat tttatacgtt
     4981 gataaaactt ataaatttca ttgattaatc tcctggaaga ttggtttaaa aagaaaagtg
     5041 taatgcaaga atttaaagaa atatttttaa agccacaatt attttaatat tggatatcaa
     5101 ctgcttgtaa aggtgctcct cttttttctt gtcattgctg gtcaagatta ctaatatttg
     5161 ggaaggcttt aaagacgcat gttatggtgc taatgtactt tcacttttaa actctagatc
     5221 agaattgttg acttgcattc agaacataaa tgcacaaaat ctgtacatgt ctcccatcag
     5281 aaagattcat tggcatgcca cagggattct cctccttcat cctgtaaagg tcaacaataa
     5341 aaaccaaatt atggggctgc ttttgtcaca ctagcataga gaatgtgttg aaatttaact
     5401 ttgtaagctt gtatgtggtt gttgatcttt tttttcctta cagacaccca taataaaata
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC011872. Homo sapiens, clo...[gi:15080200] Links  


LOCUS       BC011872                2517 bp    mRNA    linear   PRI 02-AUG-2001
DEFINITION  Homo sapiens, clone MGC:20531 IMAGE:3028515, mRNA, complete cds.
ACCESSION   BC011872
VERSION     BC011872.1  GI:15080200
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2517)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site:       http://www.nisc.nih.gov/
            Contact:        nisc_mgc@nhgri.nih.gov
            Shevchenko,Y., Wetherby,K.D., Beckstrom-Sternberg,S.M.,
            Benjamin,B., Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Guan,X., Gupta,J., Ho,S.-L., Karlins,E., Legaspi,R.,
            Lim,M., Maduro,Q.L., Masiello,C., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Snyder,B., Stantripop,S., Thomas,P.J.,
            Tiongson,E.E., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Zhang,L.-H. and Green,E.D.

            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 28 Row: a Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Hexamer frequency ORF
            analysis.
FEATURES             Location/Qualifiers
     source          1..2517
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:20531 IMAGE:3028515"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             457..885
                     /codon_start=1
                     /product="Unknown (protein for MGC:20531)"
                     /protein_id="AAH11872.1"
                     /db_xref="GI:15080201"
                     /translation="MDMGLREMCVGEKRTVIIPPHLGYGEAGVDGEVPGSAVLVFDIE
                     LLELVAGLPEGYMFIWNGEVSPNLFEEIDKDGNGEVLLEEFSEYIHAQVASGKGKLAP
                     GFDAELIVKNMFTNQDRNGDGKVTAEEFKLKDQEAKQDEL"
BASE COUNT      685 a    538 c    650 g    644 t
ORIGIN      
        1 ggcacgaggg tgagccggag agggtgaagc aggcaggaag ctgcaggccg gcggtggatg
       61 gggccaggaa gcatgcaggg ggagtgagga ccgtgtatga gagaaacact gcccaagaga
      121 gaagaaagaa aatcacagga cccgaggaaa aagaatgagt ggtgtgttca gagatgacac
      181 atgacaacaa attgtttgct tggaggattt tcttacgcgt gcgtgtttaa actggcctca
      241 aggaggaaga agaaacatgc gttcctggat gactgagttc tggaatattc tgtcataaga
      301 ctttgcaact ggagtagccc cagaaaaagg caggcagtgg gagttggctc cctccccaaa
      361 gccatccctg agttggcttt gttttgggtt cctcccaggt ggaatttagg caaaacttac
      421 agtattgttc tgggatctgg gcaagttgtg ttggggatgg acatgggtct cagagagatg
      481 tgcgttggcg agaaacggac agtgatcatt ccgcctcacc tgggctatgg ggaagctggc
      541 gtggatggag aagtgcccgg cagtgccgta ttagtgtttg acattgagct gctggagctg
      601 gtggctggcc ttcccgaggg gtacatgttc atatggaatg gtgaggtgtc acccaacctt
      661 tttgaagaaa ttgacaagga tggcaacgga gaagtcctcc tggaagagtt ctcagagtac
      721 attcacgccc aggtggcatc tggcaaaggg aaactcgctc ctggctttga tgctgagctg
      781 attgtgaaga atatgttcac caaccaggac cggaatggag atgggaaggt cacagctgag
      841 gaatttaaac tcaaagacca ggaagccaaa caggatgaac tctaaacctg gcacgaacca
      901 gatggtgcca gggagtacgt gacaccaagc caactgtgtg gcagaacgtg cagtgagggt
      961 gcaagggtct ttcagaagtt gcatcattag ccagtagtag gtgggtcaca tagtacctgg
     1021 tgtacacatc ggggtgggtt gatatatggg gtgagaagtt taggctgatc gccagtgata
     1081 gtaaacaaaa tctgtgcaga gggccttagc atgggatgtg tccagtattg aaaaggctgc
     1141 actgccaacc atgatttgtg aaccttctgg gaaattttgt tattaaaaga atatatagtg
     1201 tcagacggaa gttataatca tcttggagga accataagaa aaagtgtcca gggtatctat
     1261 ataaagaggg ttaaattttt ttttaacttg ctggttaaaa cattttagaa atattcttga
     1321 gatgggcagg agagtcaaag ggcttgcttg ccccagcaga gttcccagca gacagccatg
     1381 gctcttccca gcagcctgtg caaattctga tgatgacccc acccccgcac acgcacacgc
     1441 acatcatgct tttccagctc atcacacccc gccccactat gggcctacca ttaatagtgt
     1501 atatcttgga ggttaaaaga gccttttgga cagaaaactg ggccaggaaa aggcatctca
     1561 gaccacaaat agagaatttg attcctcatt tgccacataa gtcatctgct tagcttttcc
     1621 tttctttttt tttttttttt tttttttgga ggcagagtct ccgtttgtcg ccaggctgga
     1681 gtgcagtggt gccatctcgg ctcactgcag cactgtctcg gctcactgca gcctccgcct
     1741 cccgtattca agcgattctc ctgtctcagc ctcctgagta gctaggacta caggtgtgca
     1801 ccaccacgcc ccgctaattt ttgtattttt ggtagagacg gggtttcacc gtgttggcca
     1861 ggatggtctc aatctcgacc tcgtgatccg cccacctcgg cctcccaaag tgttgggatt
     1921 acaggcatga atcaccatgc ctagccactt agttttttgt cattcccacc tttctatccc
     1981 atagaacact cttttttatc ttccctgaac catattgatg agataaatag ggctgggggc
     2041 tgggccccgc tggtcactca acagagtatt tcccttggcc gagatggaag ttttgtccca
     2101 atagatgagc tgctgagcat caacaaggtg acatttttct gctgcccatt tgtgtcctgg
     2161 agacggtggt accctgaagg cagaggccag ctgctgcaag acagcaatga cagtccacct
     2221 gccggcctga ttcctgcatc atggaataac cacatggcta ccttctatcc tctgttccca
     2281 aatggtggtg gcacttatcc tgaagtcatc aatgatttcc ctttgaaact actttatttt
     2341 actaatttaa actattttgt actgatgtag ccctgaggta gttcatgaaa atgctgtgca
     2401 ctcattccat ggaataaatg ttggaaagct gatcttttct gatataaaat gttgaattat
     2461 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U80998. Human basic helix...[gi:1769549] Links  


LOCUS       HSU80998                 797 bp    DNA     linear   PRI 09-JAN-1997
DEFINITION  Human basic helix-loop-helix DNA binding protein (TWIST) gene,
            complete cds.
ACCESSION   U80998
VERSION     U80998.1  GI:1769549
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 797)
  AUTHORS   Howard,T.D., Paznekas,W.A., Green,E.D., Chiang,L.C., Ma,N., Ortiz
            de Luna,R.I., Garcia Delgado,C., Gonzalez-Ramos,M., Kline,A.D. and
            Jabs,E.W.
  TITLE     Mutations in TWIST, a basic helix-loop-helix transcription factor,
            in Saethre-Chotzen syndrome
  JOURNAL   Nat. Genet. 15 (1), 36-41 (1997)
  MEDLINE   97141916
   PUBMED   8988166
REFERENCE   2  (bases 1 to 797)
  AUTHORS   Howard,T.D. and Jabs,E.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-DEC-1996) Pediatrics/Center for Medical Genetics,
            CMSC 1004, Johns Hopkins University, 600 N. Wolfe St., Baltimore,
            MD 21287-3914, USA
FEATURES             Location/Qualifiers
     source          1..797
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="7"
                     /map="7p22-21"
     gene            62..670
                     /gene="TWIST"
     CDS             62..670
                     /gene="TWIST"
                     /function="transcription factor"
                     /codon_start=1
                     /product="basic helix-loop-helix DNA binding protein"
                     /protein_id="AAC50930.1"
                     /db_xref="GI:1769550"
                     /translation="MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRSSRR
                     SAGGGAGPGGAAGGGVGGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGSSSGGGSPQS
                     YEELQTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLY
                     QVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH"
BASE COUNT      133 a    277 c    298 g     89 t
ORIGIN      
        1 gaggcgcccc gctcttctcc tctgccccgg gcccgcgagg ccacgcgtcg ccgctcgaga
       61 gatgatgcag gacgtgtcca gctcgccagt ctcgccggcc gacgacagcc tgagcaacag
      121 cgaggaagag ccagaccggc agcagccgcc gagcggcaag cgcgggggac gcaagcggcg
      181 cagcagcagg cgcagcgcgg gcggcggcgc ggggcccggc ggagccgcgg gtgggggcgt
      241 cggaggcggc gacgagccgg gcagcccggc ccagggcaag cgcggcaaga agtctgcggg
      301 ctgtggcggc ggcggcggcg cgggcggcgg cggcggcagc agcagcggcg gcgggagtcc
      361 gcagtcttac gaggagctgc agacgcagcg ggtcatggcc aacgtgcggg agcgccagcg
      421 cacccagtcg ctgaacgagg cgttcgccgc gctgcggaag atcatcccca cgctgccctc
      481 ggacaagctg agcaagattc agaccctcaa gctggcggcc aggtacatcg acttcctcta
      541 ccaggtcctc cagagcgacg agctggactc caagatggca agctgcagct atgtggctca
      601 cgagcggctc agctacgcct tctcggtctg gaggatggag ggggcctggt ccatgtccgc
      661 gtcccactag caggcggagc cccccacccc ctcagcaggg ccggagacct aggtaaggac
      721 cgcgccgctg caccccttcg cctctcaggt ggcagacggc aggccggcca ggccgcggtt
      781 cccagtccac ctcgatt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X99268. H.sapiens mRNA fo...[gi:1495422] Links  


LOCUS       HSBHLH                  1457 bp    mRNA    linear   PRI 11-SEP-1996
DEFINITION  H.sapiens mRNA for B-HLH DNA binding protein.
ACCESSION   X99268
VERSION     X99268.1  GI:1495422
KEYWORDS    B-HLH protein; H-twist gene.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Bourgeois,P., Stoetzel,C., Bolcato-Bellemin,A.L., Mattei,M.G. and
            Perrin-Schmitt,F.
  TITLE     The human H-twist gene is located at 7p21 and encodes a B-HLH
            protein which is 96% similar to its murine M-twist counterpart
  JOURNAL   Mamm. Genome
REFERENCE   2  (bases 1 to 1457)
  AUTHORS   Bourgeois,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JUL-1996) P. Bourgeois, LGME du CNRS / U184 de
            INSERM, Institut de Chimie Biologique, 11 rue Human, 67085
            STRASBOURG Cedex, France
FEATURES             Location/Qualifiers
     source          1..1457
                     /organism="Homo sapiens"
                     /strain="caucasian"
                     /db_xref="taxon:9606"
                     /chromosome="7"
                     /map="p21"
                     /clone="pSK+"
                     /sex="female"
                     /tissue_type="placenta"
                     /clone_lib="lambda EXlox"
                     /dev_stage="adult"
     gene            1..1457
                     /gene="H-twist"
     CDS             111..731
                     /gene="H-twist"
                     /codon_start=1
                     /product="B-HLH DNA binding protein"
                     /protein_id="CAA67664.1"
                     /db_xref="GI:1495423"
                     /db_xref="SWISS-PROT:Q15672"
                     /translation="MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSAKRGARKRRSSRR
                     SAGGGAGPGGAAGGAVGGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGGGGGSSSGGG
                     SPQSYEELQTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYI
                     DFLYQVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH"
BASE COUNT      378 a    370 c    420 g    289 t
ORIGIN      
        1 tccgttgctg tcggcgcgcg gcggcccggg cgggggaagc tggcgggctg aggcgccccg
       61 ctcttctcct ctgccccggg cccgcgaggc cacgcgtcgc cgcacgagag atgatgcagg
      121 acgtgtccag ctcgccagtc tcgccggccg acgacagcct gagcaacagc gaggaagagc
      181 cagaccggca gcagccgccg agcgcgaagc gcggggcacg caagcggcgc agcagcaggc
      241 gcagcgcggg cggcggcgcg gggcccggcg gagccgcggg tggggccgtc ggaggcggcg
      301 acgagccggg cagcccggcc cagggcaagc gcggcaagaa gtctgcgggc tgtggcggcg
      361 gcggcggcgc gggcggcggc ggcggcggcg gcggcggcag cagcagcggc ggcgggagtc
      421 cgcagtctta cgaggagctg cagacgcagc gggtcatggc caacgtgcgg gagcgccagc
      481 gcacccagtc gctgaacgag gcgttcgccg cgctgcggaa gatcatcccc acgctgccct
      541 cggacaagct gagcaagatt cagaccctca agctggcggc caggtacatc gacttcctct
      601 accaggtcct ccagagcgac gagctggact ccaagatggc aagctgcagc tatgtggctc
      661 acgagcggct cagctacgcc ttctcggtct ggaggatgga gggggcctgg tccatgtccg
      721 cgtcccacta gcagcggagc cccccacccc ctcagcaggg ccggagacct agatgtcatt
      781 gtttccagag aaggagaaaa tggacagtct agagactctg gagctggata actaaaaata
      841 aaaatatatg ccaaagattt tcttggaaat tagaagagca aaatccaaat tcaaagaaac
      901 agggcgtggg gcgcactttt aaaagagaaa gcgagacagg cccgtggaca gtgattccca
      961 gacgggcagc gcaccatcct cacatcctct gcattctgat agaagtctga acagttgttt
     1021 gtgttttttt tttttttttt ttgacgaaga atgtttttat ttttattttt ttcatgcatg
     1081 cattctcaag aggtcgtgcc aatcatcagc cactgaaagg aaaggcatca ctatggactt
     1141 tctctatttt aaaatggtaa caatcagagg aactataaga acacctttag aaataaaaat
     1201 actgggatca aactggcctg caaaaccata gtcagttaat tctttttttc atccttcctc
     1261 tgaggggaaa aacaaaaaaa aacttaaaat acaaaaaata acattctatt tatttattga
     1321 ggacccatgg taaatgcaat agtccggtgt ctaaatgcat tcatattttt atgattgttt
     1381 tgtaaatatc tttgtatatt tttctgcaat aaataaatat aaaaaattta gagaaaaaaa
     1441 aaaaaaaaaa aaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: AU118365. AU118365 HEMBA1 H...[gi:10933406] Links  


IDENTIFIERS

dbEST Id:       6509838
EST name:       AU118365
GenBank Acc:    AU118365
GenBank gi:     10933406

CLONE INFO
Clone Id:       HEMBA1003446 (5')
DNA type:       cDNA

PRIMERS
PolyA Tail:     Unknown

SEQUENCE
                TTGGTGATCAGAATCAGAAGTTCGGATTTGAAGTTGGTCCTGTTTGTTTTCTTGGCTAAG
                ATTAAGACAAAGAACATATCAAATCAACAGAAAATATACCTTGGTGCCACCAACCCATTT
                TGTGCCACATGCAAGTTTTGAATAAGGATGGTATAGAAAACAACGCTGCATATACAGGTA
                CCATTTAGGAAATACCGATGCCTTTGTGGGGGCAGAATCACATGGCAAAAGCTTTGAAAA
                TCATAAAGATATAAGTTGGTGTGGCTAAGATGGAAACAGGGCTGATTCTTGATTCCCAAT
                TCTCAACTCTCCTTTTCCTATTTGAATTTCTTTGGTGCTGTAGAAAACAAAAAAAGAAAA
                ATATATATTCATAAAAAATATGGTGCTCATTCTCATCCATCCAGGATGTACTAAAACAGT
                GTGTTTAATAAATTGTAATTATTTTGTGTACAGTTCTATACTGTTATCTGTGTCCATTTC
                CAAAACTTGCACGTGTCCCTGAATTCCATCTGACTCTAATTTTATGAGAATTGCAGAACT
                CTGATGGCAATAAATATATGTATTATGAAAAAATAAAGTTGTAATTTCTGATGACTCTAA
                GTCCCTTTCTTTGGTTAATAATAAAATGCCTTTGTATATATTGATGTTGAAGAGTTCAAT
                TATTTGATGTCGCCAACAAAATTCTCAGAGGGCAAAAATCTGGAAGACTTTTGGAAGCAC
                ACTCTGATCAACTCTTCTCTGNCGACAGTCATTTTGCTGAATTCANCCCAAAATATTATG
                CATTTTGATGCTTTATTCANGCTTTCCTCAACTTTTCTT

Entry Created:  Oct 19 2000
Last Updated:   Aug 1 2002

COMMENTS
                HRI human cDNA project; 5'- & 3'-end one pass sequencing:
                Helix Research Institute; cDNA library construction:
                Department of Virology, Institute of Medical Science,
                University of Tokyo, and Helix Research Institute.

LIBRARY
Lib Name:       HEMBA1
Organism:       Homo sapiens
Tissue type:    whole embryo, mainly head
Develop. stage: embryo, 10 weeks
Vector:         pME18SFL3

SUBMITTER
Name:           Takao Isogai
Lab:            Genomics Laboratory
Institution:    Helix Research Institute
Address:        1532-3 Yana, Kisarazu, Chiba 292-0812, Japan
Tel:            81-438-52-3975
Fax:            81-438-52-3986
E-mail:         genomics@hri.co.jp

CITATIONS
Title:          HRI human cDNA project
Authors:        Ota,T., Nishikawa,T., Suzuki,Y., Ishii,S., Saito,K., Kawai
                ,Y., Yamamoto,J., Wakamatsu,A., Nakamura,Y., Nagai,T.,
                Sugano,S., Isogai,T.
Year:           2000
Status:         Unpublished


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: J04177. Human alpha-1 typ...[gi:179729] Links  


LOCUS       HUMCA1XIA               6158 bp    mRNA    linear   PRI 31-OCT-1994
DEFINITION  Human alpha-1 type XI collagen (COL11A1) mRNA, complete cds.
ACCESSION   J04177 J05407
VERSION     J04177.1  GI:179729
KEYWORDS    alpha-1 type XI collagen; collagen; type XI collagen.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1773 to 6158)
  AUTHORS   Bernard,M., Yoshioka,H., Rodriguez,E., Van der Rest,M., Kimura,T.,
            Ninomiya,Y., Olsen,B.R. and Ramirez,F.
  TITLE     Cloning and sequencing of pro-alpha 1 (XI) collagen cDNA
            demonstrates that type XI belongs to the fibrillar class of
            collagens and reveals that the expression of the gene is not
            restricted to cartilagenous tissue
  JOURNAL   J. Biol. Chem. 263 (32), 17159-17166 (1988)
  MEDLINE   89034222
   PUBMED   3182841
REFERENCE   2  (bases 1 to 1835)
  AUTHORS   Yoshioka,H. and Ramirez,F.
  TITLE     Pro-alpha 1(XI) collagen. Structure of the amino-terminal
            propeptide and expression of the gene in tumor cell lines
  JOURNAL   J. Biol. Chem. 265 (11), 6423-6426 (1990)
  MEDLINE   90202924
   PUBMED   1690726
COMMENT     Original source text: Human placenta fibroblast, cDNA to mRNA,
            clone OK4 [1].
            Draft entry and computer-readable sequence for [1] kindly submitted
            by F.Ramirez, 02-FEB-1990; for [2] by M.Bernard, 21-SEP-1988.
FEATURES             Location/Qualifiers
     source          1..6158
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p21"
     gene            1..6158
                     /gene="COL11A1"
     CDS             162..5582
                     /gene="COL11A1"
                     /note="alpha-1 (type XI) collagen precursor"
                     /codon_start=1
                     /protein_id="AAA51891.1"
                     /db_xref="GI:179730"
                     /db_xref="GDB:G00-120-595"
                     /translation="MEPWSSRWKTKRWLWDFTVTTLALTFLFQAREVRGAAPVDVLKA
                     LDFHNSPEGISKTTGFCTNRKNSKGSDTAYRVSKQAQLSAPTKQLFPGGTFPEDFSIL
                     FTVKPKKGIQSFLLSIYNEHGIQQIGVEVGRSPVFLFEDHTGKPAPEDYPLFRTVNIA
                     DGKWHRVAISVEKKTVTMIVDCKKKTTKPLDRSERAIVDTNGITVFGTRILDEEVFEG
                     DIQQFLITGDPKAAYDYCEHYSPDCDSSAPKAAQAQEPQIDEYAPEDIIEYDYEYGEA
                     EYKEAESVTEGPTVTEETIAQTEANIVDDFQEYNYGTMESYQTEAPRHVSGTNEPNPV
                     EEIFTEEYLTGEDYDSQRKNSEDTLYENKEIDGRDSDLLVDGDLGEYDFYEYKEYEDK
                     PTSPPNEEFGPGVPAETDITETSINGHGAYGEKGQKGEPAVVEPGMLVEGPPGPAGPA
                     GIMGPPGLQGPTGPPGDPGDRGPPGRPGLPGADGLPGPPGTMLMLPFRYGGDGSKGPT
                     ISAQEAQAQAILQQARIALRGPPGPMGLTGRPGPVGGPGSSGAKGESGDPGPQGPRGV
                     QGPPGPTGKPGKRGRPGADGGRGMPGEPGAKGDRGFDGLPGLPGDKGHRGERGPQGPP
                     GPPGDDGMRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNMG
                     PQGEPGPPGQQGNPGPQGLPGPQGPIGPPGEKGPQGKPGLAGLPGADGPPGHPGKEGQ
                     SGEKGALGPPGPQGPIGXPGPRGVKGADGVRGLKGSKGEKGEDGFPGFKGDMGLKGDR
                     GEVGQIGPRGXDGPEGPKGRAGPTGDPGPSGQAGEKGKLGVPGLPGYPGRQGPKGSTG
                     FPGFPGANGEKGARGVAGKPGPRGQRGPTGPRGSRGARGPTGKPGPKGTSGGDGPPGP
                     PGERGPQGPQGPVGFPGPKGPPGPPGRMGCPGHPGQRGETGFQGKTGPPGPGGVVGPQ
                     GPTGETGPIGERGYPGPPGPPGEQGLPGAAGKEGAKGDPGPQGISGKDGPAGLRGFPG
                     ERGLPGAQGAPGLKGGEGPQGPPGPVGSPGERGSAGTAGPIGLRGRPGPQGPPGPAGE
                     KGAPGEKGPQGPAGRDGVQGPVGLPGPAGPAGSPGEDGDKGEIGEPGQKGSKGGKGEN
                     GPPGPPGLQGPVGAPGIAGGDGEPGPRGQQGMFGQKGDEGARGFPGPPGPIGLQGLPG
                     PPGEKGENGDVGPWGPPGPPGPRGPQGPNGADGPQGPPGSVGSVGGVGEKGEPGEAGN
                     PGPPGEAGVGGPKGERGEKGEAGPPGAAGPPGAKGPPGDDGPKGNPGPVGFPGDPGPP
                     GELGPAGQDGVGGDKGEDGDPGQPGPPGPSGEAGPPGPPGKRGPPGAAGAEGRQGEKG
                     AKGEAGAEGPPGKTGPVGPQGPAGKPGPEGLRGIPGPVGEQGLPGAAGQDGPPGPMGP
                     PGLPGLKGDPGSKGEKGHPGLIGLIGPPGEQGEKGDRGLPGTQGSPGAKGDGGIPGPA
                     GPLGPPGPPGLPGPQGPKGNKGSTGPAGQKGDSGLPGPPGPPGPPGEVIQPLPILSSK
                     KTRRHTEGMQADADDNILDYSDGMEEIFGSLNSLKQDIEHMKFPMGTQTNPARTCKDL
                     QLSHPDFPDGEYWIDPNQGCSGDSFKVYCNFTSGGETCIYPDKKSEGVRISSWPKEKP
                     GSWFSEFKRGKLLSYLDVEGNSINMVQMTFLKLLTASARQNFTYHCHQSAAWYDVSSG
                     SYDKALRFLGSNDEEMSYDNNPFIKTLYDGCTSRKGYEKTVIEINTPKIDQVPIVDVM
                     ISDFGDQNQKFGFEVGPVCFLG"
     sig_peptide     162..269
                     /gene="COL11A1"
                     /note="alpha-1 (type XI) collagen signal peptide"
     mat_peptide     270..1745
                     /gene="COL11A1"
                     /product="alpha-1 (type XI) collagen"
     mat_peptide     5013..5579
                     /gene="COL11A1"
                     /product="alpha-1 (type XI) collagen"
BASE COUNT     1693 a   1355 c   1746 g   1361 t      3 others
ORIGIN      
        1 aaccatcaaa tttagaagaa aaagcccttt gactttttcc ccctctccct ccccaatggc
       61 tgtgtagcaa acatccctgg cgataccttg gaaaggacga agttggtctg cagtcgcaat
      121 ttcgtgggtt gagttcacag ttgtgagtgc ggggctcgga gatggagccg tggtcctcta
      181 ggtggaaaac gaaacggtgg ctctgggatt tcaccgtaac aaccctcgca ttgaccttcc
      241 tcttccaagc tagagaggtc agaggagctg ctccagttga tgtactaaaa gcactagatt
      301 ttcacaattc tccagaggga atatcaaaaa caacgggatt ttgcacaaac agaaagaatt
      361 ctaaaggctc agatactgct tacagagttt caaagcaagc acaactcagt gccccaacaa
      421 aacagttatt tccaggtgga actttcccag aagacttttc aatactattt acagtaaaac
      481 caaaaaaagg aattcagtct ttccttttat ctatatataa tgagcatggt attcagcaaa
      541 ttggtgttga ggttgggaga tcacctgttt ttctgtttga agaccacact ggaaaacctg
      601 ccccagaaga ctatcccctc ttcagaactg ttaacatcgc tgacgggaag tggcatcggg
      661 tagcaatcag cgtggagaag aaaactgtga caatgattgt tgattgtaag aagaaaacca
      721 cgaaaccact tgatagaagt gagagagcaa ttgttgatac caatggaatc acggtttttg
      781 gaacaaggat tttggatgaa gaagtttttg agggggacat tcagcagttt ttgatcacag
      841 gtgatcccaa ggcagcatat gactactgtg agcattatag tccagactgt gactcttcag
      901 cacccaaggc tgctcaagct caggaacctc agatagatga gtatgcacca gaggatataa
      961 tcgaatatga ctatgagtat ggggaagcag agtataaaga ggctgaaagt gtaacagagg
     1021 gacccactgt aactgaggag acaatagcac agacggaggc aaacatcgtt gatgattttc
     1081 aagaatacaa ctatggaaca atggaaagtt accagacaga agctcctagg catgtttctg
     1141 ggacaaatga gccaaatcca gttgaagaaa tatttactga agaatatcta acgggagagg
     1201 attatgattc ccagaggaaa aattctgagg atacactata tgaaaacaaa gaaatagacg
     1261 gcagggattc tgatcttctg gtagatggag atttaggcga atatgatttt tatgaatata
     1321 aagaatatga agataaacca acaagccccc ctaatgaaga atttggtcca ggtgtaccag
     1381 cagaaactga tattacagaa acaagcataa atggccatgg tgcatatgga gagaaaggac
     1441 agaaaggaga accagcagtg gttgagcctg gtatgcttgt cgaaggacca ccaggaccag
     1501 caggacctgc aggtattatg ggtcctccag gtctacaagg ccccactgga ccccctggtg
     1561 accctggcga taggggcccc ccaggacgtc ctggcttacc aggggctgat ggtctacctg
     1621 gtcctcctgg tactatgttg atgttaccgt tccgttatgg tggtgatggt tccaaaggac
     1681 caaccatctc tgctcaggaa gctcaggctc aagctattct tcagcaggct cggattgctc
     1741 tgagaggccc acctggccca atgggtctaa ctggaagacc aggtcctgtg ggggggcctg
     1801 gttcatctgg ggccaaaggt gagagtggtg atccaggtcc tcagggccct cgaggcgtcc
     1861 agggtccccc tggtccaacg ggaaaacctg gaaaaagggg tcgtccaggt gcagatggag
     1921 gaagaggaat gccaggagaa cctggggcaa agggagatcg agggtttgat ggacttccgg
     1981 gtctgccagg tgacaaaggt cacaggggtg aacgaggtcc tcaaggtcct ccaggtcctc
     2041 ctggtgatga tggaatgagg ggagaagatg gagaaattgg accaagaggt cttccaggtg
     2101 aagctggccc acgaggtttg ctgggtccaa ggggaactcc aggagctcca gggcagcctg
     2161 gtatggcagg tgtagatggc cccccaggac caaaagggaa catgggtccc caaggggagc
     2221 ctgggcctcc aggtcaacaa gggaatccag gacctcaggg tcttcctggt ccacaaggtc
     2281 caattggtcc tcctggtgaa aaaggaccac aaggaaaacc aggacttgct ggacttcctg
     2341 gtgctgatgg gcctcctggt catcctggga aagaaggcca gtctggagaa aagggggctc
     2401 tgggtccccc tggtccacaa ggtcctattg gatnnccggg cccccgggga gtaaagggag
     2461 cagatggtgt cagaggtctc aagggatcta aaggtgaaaa gggtgaagat ggttttccag
     2521 gattcaaagg tgacatgggt ctaaaaggtg acagaggaga agttggtcaa attggcccaa
     2581 gagggnaaga tggccctgaa ggacccaaag gtcgagcagg cccaactgga gacccaggtc
     2641 cttcaggtca agcaggagaa aagggaaaac ttggagttcc aggattacca ggatatccag
     2701 gaagacaagg tccaaagggt tccactggat tccctgggtt tccaggtgcc aatggagaga
     2761 aaggtgcacg gggagtagct ggcaaaccag gccctcgggg tcagcgtggt ccaacgggtc
     2821 ctcgaggttc aagaggtgca agaggtccca ctgggaaacc tgggccaaag ggcacttcag
     2881 gtggcgatgg ccctcctggc cctccaggtg aaagaggtcc tcaaggacct cagggtccag
     2941 ttggattccc tggaccaaaa ggccctcctg gaccaccagg aaggatgggc tgcccaggac
     3001 accctgggca acgtggggag actggatttc aaggcaagac cggccctcct gggccagggg
     3061 gagtggttgg accacaggga ccaaccggtg agactggtcc aataggggaa cgtgggtatc
     3121 ctggtcctcc tggccctcct ggtgagcaag gtcttcctgg tgctgcagga aaagaaggtg
     3181 caaagggtga tccaggtcct caaggtatct cagggaaaga tggaccagca ggattacgtg
     3241 gtttcccagg ggaaagaggt cttcctggag ctcagggtgc acctggactg aaaggagggg
     3301 aaggtcccca gggcccacca ggtccagttg gctcaccagg agaacgtggg tcagcaggta
     3361 cagctggccc aattggttta cgagggcgcc cgggacctca gggtcctcct ggtccagctg
     3421 gagagaaagg tgctcctgga gaaaaaggtc cccaagggcc tgcagggaga gatggagttc
     3481 aaggtcctgt tggtctccca gggccagctg gtcctgccgg ctcccctggg gaagacggag
     3541 acaagggtga aattggtgag ccgggacaaa aaggcagcaa gggtggcaag ggagaaaatg
     3601 gccctcccgg tcccccaggt cttcaaggac cagttggtgc ccctggaatt gctggaggtg
     3661 atggtgaacc aggtcctaga ggacagcagg ggatgtttgg gcaaaaaggt gatgagggtg
     3721 ccagaggctt ccctggacct cctggtccaa taggtcttca gggtctgcca ggcccacctg
     3781 gtgaaaaagg tgaaaatggg gatgttggtc catgggggcc acctggtcct ccaggcccaa
     3841 gaggccctca aggtcccaat ggagctgatg gaccacaagg acccccaggt tctgttggtt
     3901 cagttggtgg tgttggagaa aagggtgaac ctggagaagc aggaaaccca gggcctcctg
     3961 gggaagcagg tgtaggcggt cccaaaggag aaagaggaga gaaaggggaa gctggtccac
     4021 ctggagctgc tggacctcca ggtgccaagg ggccgccagg tgatgatggc cctaagggta
     4081 acccgggtcc tgttggtttt cctggagatc ctggtcctcc tggggaactt ggccctgcag
     4141 gtcaagatgg tgttggtggt gacaagggtg aagatggaga tcctggtcaa ccgggtcctc
     4201 ctggcccatc tggtgaggct ggcccaccag gtcctcctgg aaaacgaggt cctcctggag
     4261 ctgcaggtgc agagggaaga caaggtgaaa aaggtgctaa gggggaagca ggtgcagaag
     4321 gtcctcctgg aaaaaccggc ccagtcggtc ctcagggacc tgcaggaaag cctggtccag
     4381 aaggtcttcg gggcatccct ggtcctgtgg gagaacaagg tctccctgga gctgcaggcc
     4441 aagatggacc acctggtcct atgggacctc ctggcttacc tggtctcaaa ggtgaccctg
     4501 gctccaaggg tgaaaaggga catcctggtt taattggcct gattggtcct ccaggagaac
     4561 aaggggaaaa aggtgaccga gggctccctg gaactcaagg atctccagga gcaaaagggg
     4621 atgggggaat tcctggtcct gctggtccct taggtccacc tggtcctcca ggcttaccag
     4681 gtcctcaagg cccaaagggt aacaaaggct ctactggacc cgctggccag aaaggtgaca
     4741 gtggtcttcc agggcctcct gggcctccag gtccacctgg tgaagtcatt cagcctttac
     4801 caatcttgtc ctccaaaaaa acgagaagac atactgaagg catgcaagca gatgcagatg
     4861 ataatattct tgattactcg gatggaatgg aagaaatatt tggttccctc aattccctga
     4921 aacaagacat cgagcatatg aaatttccaa tgggtactca gaccaatcca gcccgaactt
     4981 gtaaagacct gcaactcagc catcctgact tcccagatgg tgaatattgg attgatccta
     5041 accaaggttg ctcaggagat tccttcaaag tttactgtaa tttcacatct ggtggtgaga
     5101 cttgcattta tccagacaaa aaatctgagg gagtaagaat ttcatcatgg ccaaaggaga
     5161 aaccaggaag ttggtttagt gaatttaaga ggggaaaact gctttcatac ttagatgttg
     5221 aaggaaattc catcaatatg gtgcaaatga cattcctgaa acttctgact gcctctgctc
     5281 ggcaaaattt cacctaccac tgtcatcagt cagcagcctg gtatgatgtg tcatcaggaa
     5341 gttatgacaa agcacttcgc ttcctgggat caaatgatga ggagatgtcc tatgacaata
     5401 atccttttat caaaacactg tatgatggtt gtacgtccag aaaaggctat gaaaaaactg
     5461 tcattgaaat caatacacca aaaattgatc aagtacctat tgttgatgtc atgatcagtg
     5521 actttggtga tcagaatcag aagttcggat ttgaagttgg tcctgtttgt tttcttggct
     5581 aagattaaga caaagaacat atcaaatcaa cagaaaatgt accttggtgc caccaaccca
     5641 ttttgtgcca catgcaagtt ttgaataagg atgtatggaa aacaacgctg catatacagg
     5701 taccatttag gaaataccga tgcctttgtg ggggcagaat cacagacaaa agctttgaaa
     5761 atcataaaga tataagttgg tgtggctaag atggaaacag ggctgattct tgattcccaa
     5821 ttctcaactc tccttttcct atttgaattt ctttggtgct gtagaaaaca aaaaaagaaa
     5881 aatatatatt cataaaaaat atggtgctca ttctcatcca tccaggatgt actaaaacag
     5941 tgtgtttaat aaattgtaat tattttgtgt acagttctat actgttatct gtgtccattt
     6001 ccaaaacttg cacgtgtccc tgaattccgc tgactctaat ttatgaggat gccgaactct
     6061 gatggcaata atatatgtat tatgaaaatg aagttatgat ttccgatgac cctaagtccc
     6121 tttctttggt taatgatgaa attcctttgt gtgtgttt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U12139. Human alpha1(XI) ...[gi:639437] Links  


LOCUS       HSU12139                1775 bp    DNA     linear   PRI 26-JAN-1995
DEFINITION  Human alpha1(XI) collagen (COL11A1) gene, 5' region and exon 1.
ACCESSION   U12139
VERSION     U12139.1  GI:639437
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1775)
  AUTHORS   Yoshioka,H., Greenwel,P., Inoguchi,K., Truter,S., Inagaki,Y.,
            Ninomiya,Y. and Ramirez,F.
  TITLE     Structural and functional analysis of the promoter of the human
            alpha 1(XI) collagen gene
  JOURNAL   J. Biol. Chem. 270 (1), 418-424 (1995)
  MEDLINE   95113862
   PUBMED   7814404
REFERENCE   2  (bases 1 to 1775)
  AUTHORS   Ramirez,F.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-JUL-1994) Francesco Ramirez, Brookdale Center for
            Molecular Biology, Mount Sinai School of Medicine, One Gustave L.
            Levy Place, New York, NY 10029, USA
FEATURES             Location/Qualifiers
     source          1..1775
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="rhabdomyosarcoma (CCL 136)"
     gene            1..1775
                     /gene="COL11A1"
     promoter        1..1454
                     /gene="COL11A1"
     exon            1455..>1775
                     /gene="COL11A1"
                     /number=1
     CDS             1773..>1775
                     /gene="COL11A1"
                     /note="translation start codon"
                     /codon_start=1
                     /product="alpha1(XI) collagen"
BASE COUNT      438 a    418 c    478 g    441 t
ORIGIN      
        1 ctgcaggtca acggatcatg aggcatattt tgatgactaa tagaaaactg agttcctaaa
       61 gattagtggc agctgtagaa ttgaacatat gtacccagta aacttagttg agggccttta
      121 agcatttaaa tgacttaaat actcaaagta ggccctggcc aatgcttcta gtctttgtca
      181 acagaaaaat gtgttcattc atccaaccca tctctattga gtggctaatg tatgctggca
      241 ccctgatggg cactgttcca gaaaatcttt gagagaaaca tcttaggaat tattaacgaa
      301 caaaaaggag tcaacagaat ggaatgattt taacgtggaa aaagtgactt gggaatctat
      361 attccaggtg taagaggatt actttgagga tgggttaagt agagagggaa tttagaaacc
      421 tgccctcttc attactctca ctgtggagtg accacgtttt tggctcccga ggctgagggc
      481 cagggccagg cccttaatga ggagctgtgg gccgaggctc ttggagattt ctgctagttg
      541 ctttcctcgc cttttctaca cactcctcag taccccctca tccaggatca gaacagaatc
      601 agctggaaaa ttaaattact caccctctta atgtcagctc ccataaattt tttcctcctt
      661 ctccctcaga gtacagttca actcttttaa gaggaaagcc actgaatgaa cctagtgctg
      721 aatttaaacg tttaagagat aaacctgtca gtttcagatc tccaaagagg acttccctac
      781 atgctctagg ttttgacttc tagggtccct ccattctatc ttcattcctt tcttctaatt
      841 attgggttaa aaaaataaaa ataaaaacac ttttcagagt aggcagccga atgagtccac
      901 tggttccacg tctgccatac gaagcagatt tttgcacaag gataaggtta ccgctgccag
      961 cccaggactc cagcggtgtg gccgatcagg cgctggcctc ctcccctctc ccgccgacca
     1021 gcgagaggct aacagacggg tgcaagtaga caagtagctg tttttattga atacagaagc
     1081 tccttgaaaa ctccgtgtgc tccggggctc tcctgcaatt cctttcattc ccaaagtgct
     1141 ggagccaagc acgcgtcccc cgtaactccc tcccattttt tctcggggat ttggtaggcg
     1201 aggagggagg agaggagtgg ggagatgggg ggtggttggt gggctgggcc tgctcggagt
     1261 cctcattctt gggctgaggg aggcgggggc gggcttgggg ccgccccaga gtcgtgtgat
     1321 tgggtctgac cctcagcctg cttgtcagtt tcgccctggg agggggagct gggagcaggg
     1381 aggggagtgg gcggaggagg gggctgcccg gagccactcg tccagcccac tgacggcatg
     1441 aagcctttag gggcacacag tactctcagc ttgttggtgg aagcccctca tctgccttca
     1501 ttctgaaggc agggcccggc agaggaagga tcagagggtc gcggccggag ggtcccggcc
     1561 ggtggggcca actcagaggg agaggaaagg gctagagaca cgaagaacgc aaaccatcaa
     1621 atttagaaga aaaagccctt tgactttttc cccctctccc tccccaatgg ctgtgtagca
     1681 aacatccctg gcgatacctt ggaaaggacg aagttggtct gcagtcgcaa tttcgtgggt
     1741 tgagttcaca gttgtgagtg cggggctcgg agatg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  




&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC006541. Homo sapiens, int...[gi:16306830] Links  


LOCUS       BC006541                3401 bp    mRNA    linear   PRI 22-OCT-2001
DEFINITION  Homo sapiens, integrin, beta 5, clone MGC:2338 IMAGE:2958666, mRNA,
            complete cds.
ACCESSION   BC006541
VERSION     BC006541.1  GI:16306830
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3401)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-APR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Rachel Dickhoff, Jessica Fahey, Stephanie Ford, Julia
            Greene, Mark Ketteman and Anuradha Madan
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 2 Row: m Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 33952.
FEATURES             Location/Qualifiers
     source          1..3401
                     /organism="Homo sapiens"
                     /db_xref="LocusID:3693"
                     /db_xref="taxon:9606"
                     /clone="MGC:2338 IMAGE:2958666"
                     /tissue_type="Kidney, renal cell adenocarcinoma"
                     /clone_lib="NIH_MGC_14"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             307..2706
                     /codon_start=1
                     /product="integrin, beta 5"
                     /protein_id="AAH06541.1"
                     /db_xref="GI:16306831"
                     /translation="MPRAPAPLYACLLGLCALLPRLAGLNICTSGSATSCEECLLIHP
                     KCAWCSKEDFGSPRSITSRCDLRANLVKNGCGGEIESPASSFHVLRSLPLSSKGSGSA
                     GWDVIQMTPQEIAVNLRPGDKTTFQLQVRQVEDYPVDLYYLMDLSLSMKDDLDNIRSL
                     GTKLAEEMRKLTSNFRLGFGSFVDKDISPFSYTAPRYQTNPCIGYKLFPNCVPSFGFR
                     HLLPLTDRVDSFNEEVRKQRVSRNRDAPEGGFDAVLQAAVCKEKIGWRKDALHLLVFT
                     TDDVPHIALDGKLGGLVQPHDGQCHLNEANEYTASNQMDYPSLALLGEKLAENNINLI
                     FAVTKNHYMLYKNFTALIPGTTVEILDGDSKNIIQLIINAYNSIRSKVELSVWDQPED
                     LNLFFTATCQDGVSYPGQRKCEGLKIGDTASFEVSLEARSCPSRHTEHVFALRPVGFR
                     DSLEVGVTYNCTCGCSVGLEPNSARCNGSGTYVCGLCECSPGYLGTRCECQDGENQSV
                     YQNLCREAEGKPLCSGRGDCSCNQCSCFESEFGKIYGPFCECDNFSCARNKGVLCSGH
                     GECHCGECKCHAGYIGDNCNCSTDISTCRGRDGQICSERGHCLCGQCQCTEPGAFGEM
                     CEKCPTCPDACSTKRDCVECLLLHSGKPDNQTCHSLCRDEVITWVDTIVKDDQEAVLC
                     FYKTAKDCVMMFTYVELPSGKSNLTVLREPECGNTPNAMTILLAVVGSILLVGLALLA
                     IWKLLVTIHDRREFAKFQSERSRARYEMASNPLYRKPISTHTVDFTFNKFNKSYNGTV
                     D"
BASE COUNT      764 a    928 c   1000 g    709 t
ORIGIN      
        1 ggcacgaggg cggagccagc ccctccccta cccggagcag cccgctgggg ccgtcccgag
       61 cggcgacaca ctaggagtcc cggccggcca gccagggcag ccgcggtccc gggactcggc
      121 cgtgagtgct gcgggacgga tggtggcggc ggggcgcggg ccagcgcggg cgccgtgagc
      181 cggagctgcg cgcggggcat gcggctgcgg cccccggccc tcggcccccg cgctccggcc
      241 ccagccccgg ccgccggccc ccgcggagtg cagcgaccgc gccgccgctg agggaggcgc
      301 cccaccatgc cgcgggcccc ggcgccgctg tacgcctgcc tcctggggct ctgcgcgctc
      361 ctgccccggc tcgcaggtct caacatatgc actagtggaa gtgccacctc atgtgaagaa
      421 tgtctgctaa tccacccaaa atgtgcctgg tgctccaaag aggacttcgg aagcccacgg
      481 tccatcacct ctcggtgtga tctgagggca aaccttgtca aaaatggctg tggaggtgag
      541 atagagagcc cagccagcag cttccatgtc ctgaggagcc tgcccctcag cagcaagggt
      601 tcgggctctg caggctggga cgtcattcag atgacaccac aggagattgc cgtgaacctc
      661 cggcccggtg acaagaccac cttccagcta caggttcgcc aggtggagga ctatcctgtg
      721 gacctgtact acctgatgga cctctccctg tccatgaagg atgacttgga caatatccgg
      781 agcctgggca ccaaactcgc ggaggagatg aggaagctca ccagcaactt ccggttggga
      841 tttgggtctt ttgttgataa ggacatctct cctttctcct acacggcacc gaggtaccag
      901 accaatccgt gcattggtta caagttgttt ccaaattgcg tcccctcctt tgggttccgc
      961 catctgctgc ctctcacaga cagagtggac agcttcaatg aggaagttcg gaaacagagg
     1021 gtgtcccgga accgagatgc ccctgagggg ggctttgatg cagtactcca ggcagccgtc
     1081 tgcaaggaga agattggctg gcgaaaggat gcactgcatt tgctggtgtt cacaacagat
     1141 gatgtgcccc acatcgcatt ggatggaaaa ttgggaggcc tggtgcagcc acacgatggc
     1201 cagtgccacc tgaacgaggc caacgagtac actgcatcca accagatgga ctatccatcc
     1261 cttgccttgc ttggagagaa attggcagag aacaacatca acctcatctt tgcagtgaca
     1321 aaaaaccatt atatgctgta caagaatttt acagccctga tacctggaac aacggtggag
     1381 attttagatg gagactccaa aaatattatt caactgatta ttaatgcata caatagtatc
     1441 cggtctaaag tggagttgtc agtctgggat cagcctgagg atcttaatct cttctttact
     1501 gctacctgcc aagatggggt atcctatcct ggtcagagga agtgtgaggg tctgaagatt
     1561 ggggacacgg catcttttga agtatcattg gaggcccgaa gctgtcccag cagacacacg
     1621 gagcatgtgt ttgccctgcg gccggtggga ttccgggaca gcctggaggt gggggtcacc
     1681 tacaactgca cgtgcggctg cagcgtgggg ctggaaccca acagtgccag gtgcaacggg
     1741 agcgggacct atgtctgcgg cctgtgtgag tgcagccccg gctacctggg caccaggtgc
     1801 gagtgccagg atggggagaa ccagagcgtg taccagaacc tgtgccggga ggcagagggc
     1861 aagccactgt gcagcgggcg tggggactgc agctgcaacc agtgctcctg cttcgagagc
     1921 gagttcggca agatctatgg gcctttctgt gagtgcgaca acttctcctg tgccaggaac
     1981 aagggagtcc tctgctcagg ccatggcgag tgtcactgcg gggaatgcaa gtgccatgca
     2041 ggttacatcg gggacaactg taactgctcg acagacatca gcacatgccg gggcagagat
     2101 ggccagatct gcagcgagcg tgggcactgt ctctgtgggc agtgccaatg cacggagccg
     2161 ggggcctttg gggagatgtg tgagaagtgc cccacctgcc cggatgcatg cagcaccaag
     2221 agagattgcg tcgagtgcct gctgctccac tctgggaaac ctgacaacca gacctgccac
     2281 agcctatgca gggatgaggt gatcacatgg gtggacacca tcgtgaaaga tgaccaggag
     2341 gctgtgctat gtttctacaa aaccgccaag gactgcgtca tgatgttcac ctatgtggag
     2401 ctccccagtg ggaagtccaa cctgaccgtc ctcagggagc cagagtgtgg aaacaccccc
     2461 aacgccatga ccatcctcct ggctgtggtc ggtagcatcc tccttgttgg gcttgcactc
     2521 ctggctatct ggaagctgct tgtcaccatc cacgaccgga gggagtttgc aaagtttcag
     2581 agcgagcgat ccagggcccg ctatgaaatg gcttcaaatc cattatacag aaagcctatc
     2641 tccacgcaca ctgtggactt caccttcaac aagttcaaca aatcctacaa tggcactgtg
     2701 gactgatgtt tccttctccg aggggctgga gcggggatct gatgaaaagg tcagactgaa
     2761 acgccttgca cggctgctcg gcttgatcac agctccctag gtaggcacca cagagaagac
     2821 cttctagtga gcctgggcca ggagcccaca gtgcctgtac aggaaggtgc ctggccatgt
     2881 cacctggctg ctaggccaga gccatgccag gctgcgtccc tccgagcttg ggataaagca
     2941 aggggacctt ggcactctca gctttccctg ccacatccag cttgttgtcc caatgaaata
     3001 ctgagatgct gggctgtctc tcccttccag gaatgctggg cccccagcct ggccagacaa
     3061 gacgactgtc aggaagggtc ggagtctgta aaaccagcat acagtttggc ttttttcaca
     3121 ttgatcattt ttatatgaaa taaaaagatc ctgcatttat ggtgtagttc tgagtcctga
     3181 gacttttccg cgtgatggct atgccttgca cacaggtgtt ggtgatgggg ctgttgagat
     3241 gcctgttgaa ggtacatcgt ttgcaaatgt cagtttcctc tcctgtccgt gtttgtttag
     3301 tacttttata atgaaaagaa acaagattgt ttgggattgg aagtaaagat taaaaccaaa
     3361 agaatttgtg tttgtctgat aaaaaaaaaa aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC015705. Homo sapiens, Sim...[gi:16041676] Links  


LOCUS       BC015705                2001 bp    mRNA    linear   HTC 11-OCT-2001
DEFINITION  Homo sapiens, Similar to collagen, type V, alpha 2, clone
            IMAGE:3909137, mRNA.
ACCESSION   BC015705
VERSION     BC015705.1  GI:16041676
KEYWORDS    HTC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2001)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-OCT-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
            Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
            Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
            Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
            Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
            George Yang, Scott Zuyderduyn, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 15 Row: e Column: 10
            This clone has the following problem: incomplete processing.
FEATURES             Location/Qualifiers
     source          1..2001
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3909137"
                     /tissue_type="Uterus, leiomyosarcoma"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
BASE COUNT      645 a    356 c    341 g    659 t
ORIGIN      
        1 aataaacctg tttggtatgg tcttgatatg aacagagggt ctcagttcgc ttatggagac
       61 caccaatcac ctaatacagc cattactcag atgacttttt tgcgcctttt atcaaaagaa
      121 gcctcccaga acatcactta catctgtaaa aacagtgtag gatacatgga cgatcaagct
      181 aagaacctca aaaaagctgt ggttctcaaa ggggcaaatg acttagatat caaagcagag
      241 ggaaatatta gattccggta tatcgttctt caagacactt gctctaagcg gaatggaaat
      301 gtgggcaaga ctgtctttga atatagaaca cagaatgtgg cacgcttgcc catcatagat
      361 cttgctcctg tggatgttgg cggctcagac caggaattcg gcgttgaaat tgggccagtt
      421 tgttttgtgt aaagtaagcc aagacacatc gacaatgagc accaccatca atgaccaccg
      481 ccattcacaa gaactttgac tgtttgaagt tgatcctgag actcttgaag taatggctga
      541 tcctgcatca gcattgtata tatggtctta agtgcctggc ctccttatcc ttcagaatat
      601 ttattttact tacaatcctc aagttttaat tgattttaaa tatttttcaa tacaacagtt
      661 taggtttaag atgaccaatg acaatgacca cctttgcaga aagtaaactg attgaataaa
      721 taaatctccg ttttcttcaa tttatttcag tgtaatgaaa aagttgctta gtatttatga
      781 ggaaattctt cttcctggca ggtagcttaa agagtggggt atatagagcc acaacacatg
      841 tttattttgc ttggctgcag ttgaaaaata gaaattagtg cccttttgtg acctctcatt
      901 ccaagattgt caattaaaaa tgagtttaaa atgtttaact tgtgatcgag acctacatgc
      961 atgtcttgat attgtgtaac tataatagag actctttaag gagaatctta aaaaaaaaaa
     1021 acgtttctca ctgtcttaaa tagaattttt aaatagtata tattcagtgg cattttggag
     1081 aacaaagtga atttacttcg acttcttaaa tttttgtaaa agactataag tttagacatc
     1141 tttctcattc aaatttaaag atatctttct cctcttgatc aatctatcaa tattgataga
     1201 agtcacacta gtatatacca tttaatacat ttacactttc ttatttaaga agatattgaa
     1261 tgcaaaataa ttgacatata gaactttaca aacatatgtc caaggactct aaattgagac
     1321 tcttccacat gtacaatctc atcatcctga agcctataat gaagaaaaag atctagaaac
     1381 tgagttgtgg agctgactct aatcaaatgt gatgattgga attagaccat ttggcctttg
     1441 aactttcata ggaaaaatga cccaacattt cttagcatga gctacctcat ctctagaagc
     1501 tgggatggac ttactattct tgtttatatt ttagatactg aaaggtgcta tgcttctgtt
     1561 attattccaa gactggagat aggcagggct aaaaaggtat tattattttt cctttaatga
     1621 tggtgctaaa attcttccta taaaattcct taaaaataaa gatggtttaa tcactaccat
     1681 tgtgaaaaca taactgttag acttcccgtt tctgaaagaa agagcatcgt tccaatgctt
     1741 gttcactgtt cctctgtcat actgtatctg gaatgctttg taatacttgc atgcttctta
     1801 gaccagaaca tgtaggtccc cttgtgtctc aatacttttt ttttcttaat tgcatttgtt
     1861 ggctctattt taattttttt cttttaaaat aaacagctgg gaccatccca aaagacaagc
     1921 catgcataca actttggtca tgtatctctg caaagcatca aattaaatgc acgcttttgt
     1981 catgtcaaaa aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMTaxonomyTaxonomyLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M58529. Human alpha 2(V) ...[gi:180834] Links  


LOCUS       HUMCOL5A2A              1671 bp    DNA     linear   PRI 09-NOV-1994
DEFINITION  Human alpha 2(V) type I collagen (COL5A2) gene, 5' end.
ACCESSION   M58529
VERSION     M58529.1  GI:180834
KEYWORDS    extracellular matrix protein; type I collagen.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1671)
  AUTHORS   Greenspan,D.S., Lee,S.T., Lee,B.S. and Hoffman,G.G.
  TITLE     Homology between alpha 2(V) and alpha 1(III) collagen promoters and
            evidence for negatively acting elements in the alpha 2(V) first
            intron and 5' flanking sequences
  JOURNAL   Gene Expr. 1 (1), 29-39 (1991)
  MEDLINE   92314691
   PUBMED   1820205
COMMENT     Original source text: Homo sapiens (tissue library: EMBL-3) DNA.
FEATURES             Location/Qualifiers
     source          1..1671
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="2q32-q33"
                     /cell_type="leukocyte"
                     /tissue_lib="EMBL-3"
     gene            1050..1671
                     /gene="COL5A2"
     TATA_signal     1050..1055
                     /gene="COL5A2"
                     /note="G00-119-064"
     exon            1082..1336
                     /gene="COL5A2"
                     /note="G00-119-064"
                     /number=1
     misc_feature    1172..1186
                     /gene="COL5A2"
                     /note="small upstream ORF"
     misc_feature    1223..1237
                     /gene="COL5A2"
                     /note="small upstream ORF"
     CDS             1240..>1336
                     /gene="COL5A2"
                     /codon_start=1
                     /product="pro-alpha-2 type V collagen"
                     /protein_id="AAC41699.1"
                     /db_xref="GI:553235"
                     /db_xref="GDB:G00-119-064"
                     /translation="MMANWAEARPLLILIVLLGQFVSIKAQEEDED"
     intron          1337..1671
                     /partial
                     /gene="COL5A2"
                     /note="G00-119-064"
                     /number=1
     enhancer        1391..1398
                     /gene="COL5A2"
                     /note="G00-119-064"
BASE COUNT      525 a    282 c    397 g    467 t
ORIGIN      
        1 agatcttggc aagtgtgagg gtggcacaaa agttcatcga taattttcct taaaatgcat
       61 gttcaactgg ggctatcata ttccctgcaa agctggctta aagctaataa taccaccatc
      121 tgcttttatg tagctctttg caatttggag cttcattaat aatacattac tcattttagt
      181 gtcactaaac cctgtacact ggtcagggta aagccctttc ttttctcaaa tgagaaaatt
      241 ggagtgcaca gagataaagg actatatgca catttacaaa attattgagt ggcagaacca
      301 ctcacaaaag gcagaagcaa gccttttttc atccaaaggt agagtacttc ccattcctct
      361 gcactgctta gtagactatt tttcaacatg gtaaaactct ctgggacaag aggaacctga
      421 agatgtggcc ccttaacaag agatgtggag aaattaggaa aggacatgct ttcttaattc
      481 aaattacaat gtatcacaaa ttcatttatc caaaatggtt ttctaacctg gactttacct
      541 gggcaacatt ctaataatct ccttccagag tatgggaaaa gaaaccgaat ctgagaaggg
      601 gataagaata ctccctattc agtcaaaaga aagggttaaa atgtgattgt ttaaagattt
      661 aaggtggaga agagagtctt gaattatcta aatgtaaaaa aactagggta aaaggctgga
      721 tttccttcgg aatgtctttt tcagcttata ggggaaaagt taaagggtgt gtgtctgggg
      781 ggaagggtta gggaggggaa aataaaaccc ttttcttcta acagcctttc ttaaaaaata
      841 aaagaaaaga gaaacaaatg tttgtttctg ttattaagcc ccatctatgc ttaaaagtta
      901 aatgaaatag ggaaagttca ggcacagaga cgcgtgttct gatttggttg tttaccatca
      961 atcagaccgt tgcttggcag acactggatg gttatgagcc tgaacaagct gaaaaggggc
     1021 aggaaaagaa gtggaggcag cattcttcct atttaaagct gcatcgcttg aaaaaagttt
     1081 tcgcagactg tgctggagct ggtgctgaaa aagggggttt gcagaggctg ccctggggct
     1141 ggtgctgaaa gaagagccca cagctgactt catggtgcta caataacctc agaatctact
     1201 tttcactctc aggagaaccc acatgtctaa tatttagaca tgatggcaaa ctgggcggaa
     1261 gcaagacctc tcctcattct tattgtttta ttagggcaat ttgtctcaat aaaagcccag
     1321 gaagaagacg aggatggtga gtttgccatt tgctttgctt gtagtgttta ttgcacggtt
     1381 tgggaaatag gtggaatgtt aagcctcagg aagagcgtga agaggaggtt ctggtttgaa
     1441 gtttgagaag ggggctgacg gatttgcacc tggaaaacga cttgaaagac aattgggatg
     1501 tttgtaaatc cggagcttct agatccaccg ttggagagag tacactacta aatttttctg
     1561 ggaaagccta caataagtcc ttttggctga gtgtgagagt gggacgttta aaataagaaa
     1621 agagtgaatg cttaaaacgg agcactaaag gcattttagt tagaggagct c
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Y14690. Homo sapiens mRNA...[gi:2370201] Links  


LOCUS       HSY14690                4629 bp    mRNA    linear   PRI 04-SEP-1997
DEFINITION  Homo sapiens mRNA for procollagen alpha 2(V).
ACCESSION   Y14690
VERSION     Y14690.1  GI:2370201
KEYWORDS    COL5A2 gene; procollagen type V alpha 2.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Weil,D., Bernard,M., Gargano,S. and Ramirez,F.
  TITLE     The pro alpha 2(V) collagen gene is evolutionarily related to the
            major fibrillar-forming collagens
  JOURNAL   Nucleic Acids Res. 15 (1), 181-198 (1987)
  MEDLINE   87146331
   PUBMED   3029669
REFERENCE   2
  AUTHORS   Woodbury,D., Bensonchanda,V. and Ramirez,F.
  TITLE     Amino-terminal propeptide of human pro-alpha-2(V) collagen conforms
            to the structural criteria of a fibrillar procollagen molecule
  JOURNAL   J. Biol. Chem. 5, 2735-2738 (1989)
REFERENCE   3  (bases 1 to 4629)
  AUTHORS   Richards,A.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-SEP-1997) A.J. Richards, University of Cambridge,
            Department of Pathology, Tennis Court Road, Cambridge, UK
COMMENT     Related sequence: X04758.
FEATURES             Location/Qualifiers
     source          1..4629
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="2"
     gene            1..4629
                     /gene="COL5A2"
     CDS             139..4629
                     /gene="COL5A2"
                     /codon_start=1
                     /product="procollagen alpha 2(V)"
                     /protein_id="CAA75002.1"
                     /db_xref="GI:2370202"
                     /translation="MMANWAEARPLLILIVLLGQFVSIKAQEEDEDEGYGEEIACTQN
                     GQMYLNRDIWKPAPCQICVCDNGAILCDKIECQDVLDCADPVTPPGECCPVCSQTPGG
                     GNTNFGRGRKGQKGEPGLVPVVTGIRGRPGPAGPPGSQGPRGERGPKGRPGPRGPQGI
                     DGEPGVPGQPGAPGPPGHPSHPGPDGLSRPFSAQMAGLDEKSGLGSQVGLMPGSVGPV
                     GPRGPQGLQGQQGGAGPTGPPGEPGDPGPMGPIGSRGPEGPPGKPGEDGEPGRNGNPG
                     EVGFAGSPGARGFPGAPGLPGLKGHRGHKGLEGPKGEVGAPGSKGEAGPTGPMGAMGP
                     LGPRGMPGERGRLGPQGAPGQRGAHGMPGKPGPMGPLGIPGSSGFPGNPGMKGEAGPT
                     GARGPEGPQGQRGETGPPGPVGSPGLPGAIGTDGTPGPKGPTGSPGTSGPPGSAGPPG
                     SPGPQGSTGPQGNSGLPGDPGFKGEAGPKGEPGPHGIQGPIGPPGEEGKRGPRGDPGT
                     LGPPGPVGERGAPGNRGFPGSDGLPGPKGAQGERGPVGSSGPKGSQGDPGRPGEPGLP
                     GARGLTGNPGVQGPEGKLGPLGAPGEDGRPGPPGSIGIKGQPGTMGLPGPKGSNGDPG
                     KPGEAGNPGVPGQRGAPGKDGKVGPYGPPGPPGLRGERGEQGPPGPTGFQGHPGPPGP
                     PGEGGKPGDQGVPGGPGAVGPLGPRGERGNPGERGEPGITGLPGEKGMAGGHGPDGPK
                     GSPGPSGTPGDTGPPGLQGMPGERGIAGTPGPKGDRGGIGEKGAEGTAGNDGAGGLPG
                     PLGPPGPAGLLGEKGEPGPRGLVGPPGSRGNPGSRGENGPTGAVGFAGPQGSDGQPGV
                     KGEPGEPGQKGDAGSPGPQGLAGSPGPHGPNGVPGLKGGRGTQGPPGATGFPGSAGRV
                     GPPGPAGAPGPAGPLGEPGKEGPPGPRGDPGSHGRVGVRGPAGPPGGPGDKGDPGEDG
                     QPGPDGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGTPGKVGPTGATGDKGPPGP
                     VGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGERGDRGDPGPAGLPGSQGAPGTPGPV
                     GAPGDAGQRGDPGSRGPIGHLGRAGKRGLPGPQGPRGDKGDHGDRGDRGQKGHRGFTG
                     LQGLPGPPGPNGEQGSAGIPGPFGPRGPPGPVGPSGKEGNPGPLGPLGPPGVRGSVGE
                     AGPEGPPGEPGPPGPPGPPGHLTAALGDIMGHYDESMPDPLPEFTEDQAAPDDKNKTD
                     PGVHATLKSLSSQIETMRSPDGSKKHPARTCDDLKLCHSAKQSGEYWIDPNQGSVEDA
                     IKVYCNMETGETCISANPSSVPRKTWWASKSPDNKPVWYGLDMNRGSQFAYGDHQSPN
                     TAITQMTFLRLLSKEASQNITYICKNSVGYMDDQAKNLKKAVVLKGANDLDIKAEGNI
                     RFRYIVLQDTCSKRNGNVGKTVFEYRTQNVARLPIIDLAPVDVGGTDQEFGVEIGPVC
                     FV"
BASE COUNT     1130 a   1127 c   1454 g    918 t
ORIGIN      
        1 gggggtgaaa aagggggttt gcagaggctg ccctggggct ggtgctgaaa gaagagccca
       61 cagctgactt catggtgcta caataacctc agaatctact tttcactctc aggagaaccc
      121 acagtctaat atttagacat gatggcaaac tgggcggaag caagacctct cctcattctt
      181 attgttttat tagggcaatt tgtctcaata aaagcccagg aagaagacga ggatgaagga
      241 tatggtgaag aaatagcctg cactcagaat ggccagatgt acttaaacag ggacatttgg
      301 aaacctgccc cttgtcagat ctgtgtctgt gacaatggag ccattctctg tgacaagata
      361 gaatgccagg atgtgctgga ctgtgccgac cctgtaacgc cccctgggga atgctgtcct
      421 gtctgttcac aaacacctgg aggtggcaat acaaattttg gtagaggaag aaagggacaa
      481 aagggagaac caggattagt gcctgttgta acaggcatac gtggtcgtcc aggaccggca
      541 ggacctccag gatcacaggg accaagagga gagcgagggc caaaaggaag acctggccct
      601 cgtggacctc agggaattga tggagaacca ggtgttcctg gtcaacctgg tgctccagga
      661 cctcctggac atccgtccca cccaggaccc gatggcttga gcaggccgtt ttcagctcaa
      721 atggctgggt tggatgaaaa atctggactt gggagtcaag taggactaat gcctggctct
      781 gtgggtcctg ttggcccaag gggaccacag ggtttacaag gacagcaagg tggtgcagga
      841 cctacaggac ctcctggtga acctggtgat cctggaccaa tgggtccgat tggttcacgt
      901 ggaccagagg gccctcctgg taaacctggg gaagatggtg aacctggcag aaatggaaat
      961 cctggtgaag tgggatttgc aggatctccg ggagctcgtg gatttcctgg ggctcctggt
     1021 cttccaggtc tgaagggtca ccgaggacac aaaggtcttg aaggccctaa aggtgaagtt
     1081 ggagcacctg gttccaaggg tgaagctggc cccactggtc caatgggtgc catgggtcct
     1141 ctgggtccga ggggaatgcc aggagagaga gggagacttg ggccacaggg tgctcctgga
     1201 caacgaggtg cacatggtat gcctggaaaa cctggaccaa tgggtcctct tgggatacca
     1261 ggctcttctg gttttccagg aaatcctgga atgaagggag aagcaggtcc tacaggggcg
     1321 cgaggccctg aaggtcctca ggggcagaga ggtgaaactg ggcccccagg tccagttggc
     1381 tctccaggtc ttcctggtgc aataggaact gatggtactc ctggtcccaa aggcccaacg
     1441 ggctctccgg gtacctctgg tcctcctggc tcagcagggc ctcctggatc tccaggacct
     1501 cagggtagca ctggtcctca ggggaattcg ggccttccgg gtgatccagg tttcaaagga
     1561 gaagctggcc caaaagggga accagggcca catggtattc agggtccgat aggcccaccc
     1621 ggtgaagaag gcaaaagagg tcccagaggt gacccaggaa cacttggtcc tccagggcca
     1681 gtgggagaaa ggggtgctcc tggcaatcgt ggttttccag gctctgatgg tttacctggg
     1741 ccaaagggtg ctcaaggaga acggggtcct gtaggttctt caggacccaa aggaagccag
     1801 ggggatccag gacgtccagg ggaacctggg cttccaggtg ctcggggttt gacaggaaat
     1861 cctggtgttc aaggtcctga aggaaaactt ggacctttgg gtgcgccagg ggaagatggc
     1921 cgtccaggtc ctccaggctc cataggaatc aaagggcagc ccgggaccat gggccttcca
     1981 ggccccaaag gtagcaatgg tgaccctggg aaacctggag aagcaggaaa tcctggagtt
     2041 cctgggcaaa ggggagctcc tggaaaagat ggtaaagttg gtccttatgg tcctcctggg
     2101 ccgccgggtc tacgtggtga aagaggagaa caaggacctc cagggcccac aggttttcag
     2161 gggcatcctg gtcctccagg tcctcctgga gaaggtggaa aaccaggtga tcaaggtgtt
     2221 cctggaggtc ccggagcagt tggcccgtta ggacctagag gagaacgagg aaatcctggg
     2281 gaaagaggag aacctgggat aactggactc cctggtgaga agggaatggc tggaggacat
     2341 ggtcctgatg gcccaaaagg cagtccaggt ccatctggga cccctggaga tacaggccca
     2401 ccaggtcttc aaggtatgcc gggagaaaga ggaattgcag gaactcctgg ccccaagggt
     2461 gacagaggtg gcataggaga aaaaggtgct gaaggcacag ctggaaatga tggtgcagga
     2521 ggtcttccag gtcctttggg ccctccaggt ccggcaggcc tactgggaga aaagggtgaa
     2581 cctggtcctc gaggtttagt tggtcctcct ggctcccggg gcaatcctgg ttctcgaggt
     2641 gaaaatgggc caactggagc tgttggtttt gccggacccc aggggtctga cggacagcct
     2701 ggagtaaaag gtgaacctgg agagccagga cagaagggag atgctggttc tcctggacca
     2761 caaggtttag caggatcccc tggccctcat ggtcctaatg gtgttcctgg actaaaaggt
     2821 ggtcgaggaa cccaaggtcc gcctggtgct acaggatttc ctggttctgc gggcagagtt
     2881 ggacctccag gccctgctgg agctccagga cctgcgggac ccctagggga acccgggaag
     2941 gagggacctc caggtcctcg tggggaccct ggctctcatg ggcgtgtggg agtccgagga
     3001 ccagctggcc cccctggtgg cccaggagac aaaggggacc caggagaaga tgggcaacct
     3061 ggtccagatg gcccccctgg tccagctgga acgaccgggc agagaggaat tgttggcatg
     3121 cctgggcaac gtggagagag aggcatgccc ggcctaccag gcccagcggg aacaccagga
     3181 aaagtaggac caactggtgc aacaggagat aaaggtccac ctggacctgt ggggccccca
     3241 ggctccaatg gtcctgtagg ggaacctgga ccagaaggtc cagctggcaa tgatggtacc
     3301 ccaggacggg atggtgctgt tggagaacgt ggtgatcgtg gagaccctgg gcctgcaggt
     3361 ctgccaggct ctcagggtgc ccctggaact cctggccctg tgggtgctcc aggagatgca
     3421 ggacaaagag gagatccggg ttctcggggt cctataggac acctgggtcg agctggaaaa
     3481 cgtggattac ctggacccca aggacctcgt ggtgacaaag gtgatcatgg agaccgaggc
     3541 gacagaggtc agaagggcca cagaggcttt actggtcttc agggtcttcc tggccctcct
     3601 ggtccaaatg gtgaacaagg aagtgctgga atccctggac catttggccc aagaggtcct
     3661 ccaggcccag ttggtccttc aggtaaagaa ggaaaccctg ggccacttgg gccattggga
     3721 cctccaggtg tacgaggcag tgtaggagaa gcaggacctg agggccctcc tggtgagcct
     3781 ggcccacctg gccctccggg tccccctggc caccttacag ctgctcttgg ggatatcatg
     3841 gggcactatg atgaaagcat gccagatcca cttcctgagt ttactgaaga tcaggcggct
     3901 cctgatgaca aaaacaaaac ggacccaggg gttcatgcta ccctgaagtc actcagtagt
     3961 cagattgaaa ccatgcgcag ccccgatggc tcgaaaaagc acccagcccg cacgtgtgat
     4021 gacctaaagc tttgccattc cgcaaagcag agtggtgaat actggattga tcctaaccaa
     4081 ggatctgttg aagatgccat caaagtttac tgcaacatgg aaacaggaga aacatgtatt
     4141 tcagcaaacc catccagtgt accacgtaaa acctggtggg ccagtaaatc tcctgacaat
     4201 aaacctgttt ggtatggtct tgatatgaac agagggtctc agttcgctta tggagaccac
     4261 caatcaccta atacagccat tactcagatg acttttttgc gccttttatc aaaagaagcc
     4321 tcccagaaca tcacttacat ctgtaaaaac agtgtaggat acatggacga tcaagctaag
     4381 aacctcaaaa aagctgtggt tctcaaaggg gcaaatgact tagatatcaa agcagaggga
     4441 aatattagat tccggtatat cgttcttcaa gacacttgct ctaagcggaa tggaaatgtg
     4501 ggcaagactg tctttgaata tagaacacag aatgtggcac gcttgcccat catagatctt
     4561 gctcctgtgg atgttggcgg cacagaccag gaattcggcg ttgaaattgg gccagtttgt
     4621 tttgtgtaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH007047. Homo sapiens micr...[gi:3983462] Links  


LOCUS       HSMAGP01                1240 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            putative promoter and exon 1.
ACCESSION   AF084918 AF071547
VERSION     AF084918.1  GI:3983452
KEYWORDS    .
SEGMENT     1 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1240)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 1240)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..1240
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     promoter        1..976
                     /gene="MAGP2"
                     /note="putative"
     exon            977..1187
                     /gene="MAGP2"
                     /number=1
BASE COUNT      445 a    232 c    290 g    273 t
ORIGIN      
        1 gatcatgcca ctgtactcca gcctgggtga cagagcaaga ccctatctta aaaaaaagaa
       61 aaaatcctgg atcagagaaa gtgttatcta cacattatat gatctaagga aaaatattgg
      121 tagaaaatga ggcaagcata ataaccttgc ctttcaattt tctttgggca tctgattgca
      181 ttttatcctt aagagcccag aaacccagat tcttagttaa gctcaaacaa agccaaaaga
      241 caaagtaagg ctggcacccc tttcacagag cttctctaca tttgaaaatg atttggagag
      301 ttctgaaaag ttcctcaact ttttataacc ctaaggtagc cagccttcct cttggacaaa
      361 actcataagg tatctgtgtt cgcttggttg tgagaataca taggatattc caaggggaaa
      421 aaaaacaaga agagtctaaa tgtgtaggtc aagaaagagg cagagatgaa aaagaaatac
      481 aggaataaaa agaaatttgt tagtgctgaa gagacgaata ttgaaaaaga aagtgagaaa
      541 gaaggatgaa ggaatgattc aaatatggaa tataaggagg ctgaggcagg ataattgctt
      601 gaactcagga ggcggaggtt gcagtgagcc aagatcacat cattgcactc cagcctgggt
      661 gacaggagca aaactctgtc ttaaaaaaaa aaaacaaaaa acaaagaaga agaagatagc
      721 cagagagaga gaaacagcat ctataatgtg cattaaaaca caccaaggag gagacggaac
      781 ttttcggtaa gaaagaaccc aaggaacaaa ggggagctgg gaccagatcg tgagaggagg
      841 gaaaataggc taatggaaga aaggcaatac atgagctggg gccaacactt ccagggagca
      901 gcaagagctc tcaccagtgt agtcataaca tttggaatga gggtgtgagc aactgcaaat
      961 tcccatctcc cttctcattc cagcctcatt gtaacacaca ttctacgcct agcctggctt
     1021 tcttgctctc cctcatctca ttgtttcagc ggaggccaaa tctgaagtcc tttccaggga
     1081 gtggctctgt tcatcttatt cgccagccaa agtaggaaca gcgtaagagg agagagacac
     1141 attcagcagc caaaggactc ggtggaaaga gcagaacacc atagacagtg agttatttga
     1201 ttacctgaaa ccctaaagag acagagggaa tgtgtgtatg
//
LOCUS       HSMAGP02                 134 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 2.
ACCESSION   AF084919
VERSION     AF084919.1  GI:3983453
KEYWORDS    .
SEGMENT     2 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 134)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 134)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..134
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     exon            36..95
                     /gene="MAGP2"
                     /number=2
BASE COUNT       23 a     46 c     21 g     44 t
ORIGIN      
        1 catcacattc tctgctatgc ccaccacccc tatagatatg tcgctcttgg gacccaaggt
       61 gctgctgttt cttgctgcat tcatcatcac ctctggtgag tcctttattc tttgctccct
      121 aaaccctcag ctcc
//
LOCUS       HSMAGP03                 111 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 3.
ACCESSION   AF084920
VERSION     AF084920.1  GI:3983454
KEYWORDS    .
SEGMENT     3 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 111)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 111)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..111
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     exon            36..71
                     /gene="MAGP2"
                     /number=3
BASE COUNT       21 a     26 c     28 g     36 t
ORIGIN      
        1 gtcactcaca tctcttttct cttcctttgc tatagactgg atacccctgg gggtcaatag
       61 tcaacgagga ggtaggtgaa cccttggata cggctctgtt ggggttctta t
//
LOCUS       HSMAGP04                 103 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 4.
ACCESSION   AF084921
VERSION     AF084921.1  GI:3983455
KEYWORDS    .
SEGMENT     4 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 103)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 103)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..103
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     exon            26..70
                     /gene="MAGP2"
                     /number=4
BASE COUNT       27 a     18 c     17 g     41 t
ORIGIN      
        1 aaatgttttc taattttctt ttcagacgat gtgactcaag cgactccaga aacattcaca
       61 gaagatccta gtaagctaca ttttgtttgt ttgtttgttt gct
//
LOCUS       HSMAGP05                 102 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 5.
ACCESSION   AF084922
VERSION     AF084922.1  GI:3983456
KEYWORDS    .
SEGMENT     5 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 102)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 102)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..102
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     exon            40..72
                     /gene="MAGP2"
                     /number=5
BASE COUNT       31 a     17 c     19 g     35 t
ORIGIN      
        1 tgactgcatt ttttcattcc ttgttatgtg tctttgcaga tctggtgaat gatcccgcta
       61 cagatgaaac aggtaaattt taccgcagtt aaaaaaaaaa tg
//
LOCUS       HSMAGP06                 109 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 6.
ACCESSION   AF084923
VERSION     AF084923.1  GI:3983457
KEYWORDS    .
SEGMENT     6 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 109)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 109)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..109
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     exon            44..88
                     /gene="MAGP2"
                     /number=6
BASE COUNT       18 a     27 c     24 g     40 t
ORIGIN      
        1 aatttgaagc cctcctcata cctgctgttt tgtgcctttt cagttttggc tgttttggct
       61 gatattgcac cttccacaga tgacttgggt gagttcagat ctggacccc
//
LOCUS       HSMAGP07                 109 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 7.
ACCESSION   AF084924
VERSION     AF084924.1  GI:3983458
KEYWORDS    .
SEGMENT     7 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 109)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 109)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..109
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     exon            37..66
                     /gene="MAGP2"
                     /number=7
BASE COUNT       28 a     24 c     18 g     39 t
ORIGIN      
        1 aattggtaac agtggaataa aatttctgcc tttcagcctc cctcagtgaa aaaaatacca
       61 ctgcaggtat gattgttgct tgcatgcctt cttaagcctt tttttcttc
//
LOCUS       HSMAGP08                 175 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 8.
ACCESSION   AF084925
VERSION     AF084925.1  GI:3983459
KEYWORDS    .
SEGMENT     8 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 175)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 175)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..175
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     exon            43..130
                     /gene="MAGP2"
                     /number=8
BASE COUNT       42 a     40 c     38 g     55 t
ORIGIN      
        1 tctatttgga cctccttatt tttttgtatt gggcttcctc agagtgctgg gatgagaaat
       61 ttacctgcac aaggctctac tctgtgcatc ggccggttaa acaatgcatt catcagttat
      121 gcttcaccag gtaagggatc ccagaatgcc caaaaagcat cttccatggt agtgt
//
LOCUS       HSMAGP09                 164 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 9.
ACCESSION   AF084926
VERSION     AF084926.1  GI:3983460
KEYWORDS    .
SEGMENT     9 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 164)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 164)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..164
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     exon            43..116
                     /gene="MAGP2"
                     /number=9
BASE COUNT       44 a     32 c     42 g     46 t
ORIGIN      
        1 cttccatctc cttccttgcc tatgacactg ggacgtctgc agtttacgac gtatgtacat
       61 cgtcaacaag gagatctgct ctcgtcttgt ctgtaaggaa cacgaagcta tgaaaggtaa
      121 gatgatgtct ggatgtttaa tggagagaag cagttatgag aagg
//
LOCUS       HSMAGP10                 537 bp    DNA     linear   PRI 09-DEC-1998
DEFINITION  Homo sapiens microfibril-associated glycoprotein 2 (MAGP2) gene,
            exon 10 and complete cds.
ACCESSION   AF084927
VERSION     AF084927.1  GI:3983461
KEYWORDS    .
SEGMENT     10 of 10
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 537)
  AUTHORS   Hatzinikolas,G. and Gibson,M.A.
  TITLE     The exon structure of the human MAGP-2 gene. Similarity with the
            MAGP-1 gene is confined to two exons encoding a cysteine-rich
            region
  JOURNAL   J. Biol. Chem. 273 (45), 29309-29314 (1998)
  MEDLINE   99009031
   PUBMED   9792630
REFERENCE   2  (bases 1 to 537)
  AUTHORS   Gibson,M.A. and Hatzinikolas,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-1998) Pathology, University of Adelaide,
            Adelaide, SA 5005, Australia
FEATURES             Location/Qualifiers
     source          1..537
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12p12.3-p13.1"
                     /cell_type="leukocyte"
     gene            order(AF084918.1:1..1240,AF084919.1:1..134,
                     AF084920.1:1..111,AF084921.1:1..103,AF084922.1:1..102,
                     AF084923.1:1..109,AF084924.1:1..109,AF084925.1:1..175,
                     AF084926.1:1..164,1..537)
                     /gene="MAGP2"
     mRNA            join(AF084918.1:977..1187,AF084919.1:36..95,
                     AF084920.1:36..71,AF084921.1:26..70,AF084922.1:40..72,
                     AF084923.1:44..88,AF084924.1:37..66,AF084925.1:43..130,
                     AF084926.1:43..116,41..537)
                     /gene="MAGP2"
                     /product="microfibril-associated glycoprotein 2"
     CDS             join(AF084919.1:38..95,AF084920.1:36..71,
                     AF084921.1:26..70,AF084922.1:40..72,AF084923.1:44..88,
                     AF084924.1:37..66,AF084925.1:43..130,AF084926.1:43..116,
                     41..153)
                     /gene="MAGP2"
                     /note="fibrillin-containing microfibrils"
                     /codon_start=1
                     /product="microfibril-associated glycoprotein 2"
                     /protein_id="AAC83942.1"
                     /db_xref="GI:3983463"
                     /translation="MSLLGPKVLLFLAAFIITSDWIPLGVNSQRGDDVTQATPETFTE
                     DPNLVNDPATDETVLAVLADIAPSTDDLASLSEKNTTAECWDEKFTCTRLYSVHRPVK
                     QCIHQLCFTSLRRMYIVNKEICSRLVCKEHEAMKDELCRQMAGLPPRRLRRSNYFRLP
                     PCENVDLQRPNGL"
     exon            41..>537
                     /gene="MAGP2"
                     /number=10
BASE COUNT      152 a    116 c     90 g    179 t
ORIGIN      
        1 ttctgcattc tcttcccatg atgttctgct tctcttctag atgagctttg ccgtcagatg
       61 gctggtctgc cccctaggag actccgtcgc tccaattact tccgacttcc tccctgtgaa
      121 aatgtggatt tgcagagacc caatggtctg tgatcattga aaaagaggaa agaagaaaaa
      181 atgtatgggt gagaggaagg aggatctcct tcttctccaa ccattgacag ctaaccctta
      241 gacagtattt cttaaaccaa tccttttgca atgtccagct tttaccccta ctctctactt
      301 tttcacccaa actgataaca tttatctcat tttctagcac ttaaaataca aagtctatat
      361 tattgcataa ttttgctgct tctcaatatc atagacacag tgaatagatg atgactatat
      421 ggcttatata caaacattct atgtacaatt tcaagggaga ctaaacttta ggctaataat
      481 ctttactatt gaatctgtct gatatagatc ttagggttga agaagctatc tttgtct
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

ProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X52022. H.sapiens RNA for...[gi:3127925] Links  


LOCUS       HSCOLLVI3              10558 bp    mRNA    linear   PRI 09-MAY-1998
DEFINITION  H.sapiens RNA for type VI collagen alpha3 chain.
ACCESSION   X52022
VERSION     X52022.1  GI:3127925
KEYWORDS    alternate splicing; COL6A3 gene; collagen alpha 3 type VI.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 9930)
  AUTHORS   Chu,M.L., Zhang,R.Z., Pan,T.C., Stokes,D., Conway,D., Kuo,H.J.,
            Glanville,R., Mayer,U., Mann,K., Deutzmann,R. and Timpl,R.
  TITLE     Mosaic structure of globular domains in the human type VI collagen
            alpha 3 chain: similarity to von Willebrand factor, fibronectin,
            actin, salivary proteins and aprotinin type protease inhibitors
  JOURNAL   EMBO J. 9 (2), 385-393 (1990)
  MEDLINE   90151612
REFERENCE   2
  AUTHORS   Chu,M.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-SEP-1997) Chu, M.L. Thomas Jefferson University, Dept
            of Biochemistry & Molec Biology, 233 South 10th Street,
            Philadelphia, PA 19107, USA
  REMARK    revised by author 30-SEP-97 and [3]
REFERENCE   3  (bases 1 to 10558)
  AUTHORS   Chu,M.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-MAY-1998) Chu, M.L. Thomas Jefferson University, Dept
            of Biochemistry & Molec Biology, 233 South 10th Street,
            Philadelphia, PA 19107, USA
COMMENT     On May 12, 1998 this sequence version replaced gi:2462471.
FEATURES             Location/Qualifiers
     source          1..10558
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..10558
                     /gene="COL6A3"
     CDS             256..9786
                     /gene="COL6A3"
                     /codon_start=1
                     /product="collagen type VI, alpha 3 chain"
                     /protein_id="CAA36267.1"
                     /db_xref="GI:3127926"
                     /translation="MRKHRHLPLVAVFCLFLSGFPTTHAQQQQADVKNGAAADIIFLV
                     DSSWTIGEEHFQLVREFLYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEV
                     LSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLAL
                     PSAELKSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVH
                     SSVSPERAGDTETLKDITAQDSADIIFLIDGSNNTGSVNFAVILDFLVNLLEKLPIGT
                     QQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHF
                     TRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQASVFSFGLGAQAASRAELQHI
                     ATDDNLVFTVPEFRSFGDLQEKLLPYIVGVAQRHIVLKPPTIVTQVIEVNKRDIVFLV
                     DGSSALGLANFNAIRDFIAKVIQRLEIGQDLIQVAVAQYADTVRPEFYFNTHPTKREV
                     ITAVRKMKPLDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDEIS
                     QPAQELKRSSIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPL
                     RTLSGTPEVHSNKRDIIFLLDGSANVGKTNFPYVRDFVMNLVNSLDIGNDNIRVGLVQ
                     FSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIR
                     EHVPQLLLLLTAGQSEDSYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSLVYL
                     MDDFSSLPALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILFLFDGSANLVGQFPVVR
                     DFLYKIIDELNVKPEGTRIAVAQYSDDVKVESRFDEHQSKPEILNLVKRMKIKTGKAL
                     NLGYALDYAQRYIFVKSAGSRIEDGVLQFLVLLVAGRSSDRVDGPASNLKQSGVVPFI
                     FQAKNADPAELEQIVLSPAFILAAESLPKIGDLHPQIVNLLKSVHNGAPAPVSGEKDV
                     VFLLDGSEGVRSGFPLLKEFVQRVVESLDVGQDRVRVAVVQYSDRTRPEFYLNSYMNK
                     QDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEGVPQLLIVLTADRSGD
                     DVRNPSVVVKRGGAVPIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQVISER
                     VTQLTREELSRLQPVLQPLPSPGVGGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL
                     DVGFDTTRVAVIQFSDDPKAEFLLNAHSSKDEVQNAVQRLRPKGGRQINVGNALEYVS
                     RNIFKRPLGSRIEEGVPQFLVLISSGKSDDEVVVPAVELKQFGVAPFTIARNADQEEL
                     VKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAVESDAAD
                     IVFLIDSSEGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYR
                     SQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKS
                     QDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEERIM
                     NSFGPSAATPAPPGVDTPPPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFVSEIVDT
                     VYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHL
                     RVNHFVPEAGSRLDQRVPQIAFVITGGKSVEDAQDVSLALTQRGVKVFAVGVRNIDSE
                     EVGKIASNSATAFRVGNVQELSELSEQVLETLHDAMHETLCPGVTDAAKACNLDVILG
                     FDGSRDQNVFVAQKGFESKVDAILNRISQMHRVSCSGGRSPTVRVSVVANTPSGPVEA
                     FDFDEYQPEMLEKFRNMRSQHPYVLTEDTLKVYLNKFRQSSPDSVKVVIHFTDGADGD
                     LADLHRASENLRQEGVRALILVGLERVVNLERLMHLEFGRGFMYDRPLRLNLLDLDYE
                     LAEQLDNIAEKACCGVPCKCSGQRGDRGPIGSIGPKGIPGEDGYRGYPGDEGGPGERG
                     PPGVNGTQGFQGCPGQRGVKGSRGFPGEKGEVGEIGLDGLDGEDGDKGLPGSSGEKGN
                     PGRRGDKGPRGEKGERGDVGIRGDPGNPGQDSQERGPKGETGDLGPMGVPGRDGVPGG
                     PGETGKNGGFGRRGPPGAKGNKGGPGQPGFEGEQGTRGAQGPAGPAGPPGLIGEQGIS
                     GPRGSGGARGAPGERGRTGPLGRKGEPGEPGPKGGIGNPGPRGETGDDGRDGVGSEGR
                     RGKKGERGFPGYPGPKGNPGEPGLNGTTGPKGIRGRRGNSGPPGIVGQKGRPGYPGPA
                     GPRGNRGDSIDQCALIQSIKDKCPCCYGPLECPVFPTELAFALDTSEGVNQDTFGRMR
                     DVVLSIVNVLTIAESNCPTGARVAVVTYNNEVTTEIRFADSKRKSVLLDKIKNLQVAL
                     TSKQQSLETAMSFVARNTFKRVRNGFLMRKVAVFFSNTPTRASPQLREAVLKLSDAGI
                     TPLFLTRQEDRQLINALQINNTAVGHALVLPAGRDLTDFLENVLTCHVCLDICNIDPS
                     CGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQFNEMKKYIAYLVRQLDMSPD
                     PKASQHFARVAVVQHAPSESVDNASMPPVKVEFSLTDYGSKEKLVDFLSRGMTQLQGT
                     RALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVPEQQLEEAQRVILQAKCKGYFFV
                     VLGIGRKVNIKEVYTFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFVSSENAFYL
                     SPDIRKQCDWFQGDQPTKNLVKFGHKQVNVPNNVTSSPTSNPVTTTKPVTTTKPVTTT
                     TKPVTTTTKPVTIINQPSVKPAAAKPAPAKPVAAKPVATKTATVRPPVAVKPATAAKP
                     VAAKPAAVRPPAAAAKPVATKPEVPRPQAAKPAATKPATTKPVVKMLREVQVFEITEN
                     SAKLHWERPEPPGPYFYDLTVTSAHDQSLVLKQNLTVTDRVIGGLLAGQTYHVAVVCY
                     LRSQVRATYHGSFSTKKSQPPPPQPARSASSSTINLMVSTEPLALTETDICKLPKDEG
                     TCRDFILKWYYDPNTKSCARFWYGGCGGNENKFGSQKECEKVCAPVLAKPGVISVMGT
                     "
     misc_feature    347..964
                     /gene="COL6A3"
                     /note="alternatively spliced domain
                     domain N10"
     misc_feature    965..1567
                     /gene="COL6A3"
                     /note="alternatively spliced domain
                     domain N9"
     misc_feature    2153..2752
                     /gene="COL6A3"
                     /note="alternatively spliced domain
                     domain N7"
     misc_feature    4541..5041
                     /gene="COL6A3"
                     /note="alternatively spliced domain
                     domain N3"
BASE COUNT     2621 a   2645 c   2950 g   2342 t
ORIGIN      
        1 cagtttggag ctcagtcttc caccaaaggc cgttcagttc tcctgggctc cagcctcctg
       61 caaggactgc aagagttttc ctccgcagct ctgagtctcc acttttttgg tggagaaagg
      121 ctgcaaaaag aaaaagagac gcagtgagtg ggaaaagtat gcatcctatt caaacctaat
      181 tgaatcgagg agcccaggga cacacgcctt caggtttgct caggggttca tatttggtgc
      241 ttagacaaat tcaaaatgag gaaacatcgg cacttgccct tagtggccgt cttttgcctc
      301 tttctctcag gctttcctac aactcatgcc cagcagcagc aagcagatgt caaaaatggt
      361 gcggctgctg atataatatt tctagtggat tcctcttgga ccattggaga ggaacatttc
      421 caacttgttc gagagtttct atatgatgtt gtaaaatcct tagctgtggg agaaaatgat
      481 ttccattttg ctctggtcca gttcaacgga aacccacata ccgagttcct gttaaatacg
      541 tatcgtacta aacaagaagt cctttctcat atttccaaca tgtcttatat tgggggaacc
      601 aatcagactg gaaaaggatt agaatacata atgcaaagcc acctcaccaa ggctgctgga
      661 agccgggccg gtgacggagt ccctcaggtt atcgtagtgt taactgatgg acactcgaag
      721 gatggccttg ctctgccctc agcggaactt aagtctgctg atgttaacgt gtttgcaatt
      781 ggagttgagg atgcagatga aggagcgtta aaagaaatag caagtgaacc gctcaatatg
      841 catatgttca acctagagaa ttttacctca cttcatgaca tagtaggaaa cttagtgtcc
      901 tgtgtgcatt catccgtgag tccagaaagg gctggggaca cggaaaccct taaagacatc
      961 acagcacaag actctgctga cattattttc cttattgatg gatcaaacaa caccggaagt
     1021 gtcaatttcg cagtcattct cgacttcctt gtaaatctcc ttgagaaact cccaattgga
     1081 actcagcaga tccgagtggg ggtggtccag tttagcgatg agcccagaac catgttttcc
     1141 ttggacacct actccaccaa ggcccaggtt ctgggtgcag tgaaagccct cgggtttgct
     1201 ggtggggagt tggccaatat cggcctcgcc cttgatttcg tggtggagaa ccacttcacc
     1261 cgggcagggg gcagccgcgt ggaggaaggg gttccccagg tgctggtcct cataagtgcc
     1321 gggccttcta gtgacgagat tcgctacggg gtggtagcac tgaagcaggc tagcgtgttc
     1381 tcattcggcc ttggagccca ggccgcctcc agggcagagc ttcagcacat agctaccgat
     1441 gacaacttgg tgtttactgt cccggaattc cgtagctttg gggacctcca ggagaaatta
     1501 ctgccgtaca ttgttggcgt ggcccaaagg cacattgtct tgaaaccgcc aaccattgtc
     1561 acacaagtca ttgaagtcaa caagagagac atagtcttcc tggtggatgg ctcatctgca
     1621 ctgggactgg ccaacttcaa tgccatccga gacttcattg ctaaagtcat ccagaggctg
     1681 gaaatcggac aggatcttat ccaggtggca gtggcccagt atgcagacac tgtgaggcct
     1741 gaattttatt tcaataccca tccaacaaaa agggaagtca taaccgctgt gcggaaaatg
     1801 aagcccctgg acggctcggc cctgtacacg ggctctgctc tagactttgt tcgtaacaac
     1861 ctattcacga gttcagccgg ctaccgggct gccgagggga ttcctaagct tttggtgctg
     1921 atcacaggtg gtaagtccct agatgaaatc agccagcctg cccaggagct gaagagaagc
     1981 agcataatgg cctttgccat tgggaacaag ggtgccgatc aggctgagct ggaagagatc
     2041 gctttcgact cctccctggt gttcatccca gctgagttcc gagccgcccc attgcaaggc
     2101 atgctgcctg gcttgctggc acctctcagg accctctctg gaacccctga agttcactca
     2161 aacaaaagag atatcatctt tcttttggat ggatcagcca acgttggaaa aaccaatttc
     2221 ccttatgtgc gcgactttgt aatgaaccta gttaacagcc ttgatattgg aaatgacaat
     2281 attcgtgttg gtttagtgca atttagtgac actcctgtaa cggagttctc tttaaacaca
     2341 taccagacca agtcagatat ccttggtcat ctgaggcagc tgcagctcca gggaggttcg
     2401 ggcctgaaca caggctcagc cctaagctat gtctatgcca accacttcac ggaagctggc
     2461 ggcagcagga tccgtgaaca cgtgccgcag ctcctgcttc tgctcacagc tgggcagtct
     2521 gaggactcct atttgcaagc tgccaacgcc ttgacacgcg cgggcatcct gactttttgt
     2581 gtgggagcta gccaggcgaa taaggcagag cttgagcaga ttgcttttaa cccaagcctg
     2641 gtgtatctca tggatgattt cagctccctg ccagctttgc ctcagcagct gattcagccc
     2701 ctaaccacat atgttagtgg aggtgtggag gaagtaccac tcgctcagcc agagagcaag
     2761 cgagacattc tgttcctctt tgacggctca gccaatcttg tgggccagtt ccctgttgtc
     2821 cgtgactttc tctacaagat tatcgatgag ctcaatgtga agccagaggg gacccgaatt
     2881 gcggtggctc agtacagcga tgatgtcaag gtggagtccc gttttgatga gcaccagagt
     2941 aagcctgaga tcctgaatct tgtgaagaga atgaagatca agacgggcaa agccctcaac
     3001 ctgggctacg cgctggacta tgcacagagg tacatttttg tgaagtctgc tggcagccgg
     3061 atcgaggatg gagtgcttca gttcctggtg ctgctggtcg caggaaggtc atctgaccgt
     3121 gtggatgggc cagcaagtaa cctgaagcag agtggggttg tgcctttcat cttccaagcc
     3181 aagaacgcag accctgctga gttagagcag atcgtgctgt ctccagcgtt tatcctggct
     3241 gcagagtcgc ttcccaagat tggagatctt catccacaga tagtgaatct cttaaaatca
     3301 gtgcacaacg gagcaccagc accagtttca ggtgaaaagg acgtggtgtt tctgcttgat
     3361 ggctctgagg gcgtcaggag cggcttccct ctgttgaaag agtttgtcca gagagtggtg
     3421 gaaagcctgg atgtgggcca ggaccgggtc cgcgtggccg tggtgcagta cagcgaccgg
     3481 accaggcccg agttctacct gaattcatac atgaacaagc aggacgtcgt caacgctgtc
     3541 cgccagctga ccctgctggg agggccgacc cccaacaccg gggccgccct ggagtttgtc
     3601 ctgaggaaca tcctggtcag ctctgcggga agcaggataa cagaaggtgt gccccagctg
     3661 ctgatcgtcc tcacggccga caggtctggg gatgatgtgc ggaacccctc cgtggtcgtg
     3721 aagaggggtg gggctgtgcc cattggcatt ggcatcggga acgctgacat cacagagatg
     3781 cagaccatct ccttcatccc ggactttgcc gtggccattc ccacctttcg ccagctgggg
     3841 accgtccaac aggtcatctc tgagagggtg acccagctca cccgcgagga gctgagcagg
     3901 ctgcagccgg tgttgcagcc tctaccgagc ccaggtgttg gtggcaagag ggacgtggtc
     3961 tttctcatcg atgggtccca aagtgccggg cctgagttcc agtacgttcg caccctcata
     4021 gagaggctgg ttgactacct ggacgtgggc tttgacacca cccgggtggc tgtcatccag
     4081 ttcagcgatg accccaaggc ggagttcctg ctgaacgccc attccagcaa ggatgaagtg
     4141 cagaacgcgg tgcagcggct gaggcccaag ggagggcggc agatcaacgt gggcaatgcc
     4201 ctggagtacg tgtccaggaa catcttcaag aggcccctgg ggagccgcat tgaagagggc
     4261 gtcccacagt tcctggtcct catctcgtct ggaaagtctg acgatgaggt ggtcgtcccg
     4321 gcggtggagc tcaagcagtt tggcgtggcc cctttcacga tcgccaggaa cgcagaccag
     4381 gaggagctgg tgaagatctc gctgagcccc gaatatgtgt tctcggtgag caccttccgg
     4441 gagctgccca gcctggagca gaaactgctg acgcccatca cgaccctgac ctcagagcag
     4501 atccagaagc tcttagccag cactcgctat ccacctccag cagttgagag tgatgctgca
     4561 gacattgtct ttctgatcga cagctctgag ggagttaggc cagatggctt tgcacatatt
     4621 cgagattttg ttagcaggat tgttcgaaga ctcaacatcg gccccagtaa agtgagagtt
     4681 ggggtcgtgc agttcagcaa tgatgtcttc ccagaattct atctgaaaac ctacagatcc
     4741 caggccccgg tgctggacgc catacggcgc ctgaggctca gaggggggtc cccactgaac
     4801 actggcaagg ctctcgaatt tgtggcaaga aacctctttg ttaagtctgc ggggagtcgc
     4861 atagaagacg gggtgcccca acacctggtc ctggtcctgg gtggaaaatc ccaggacgat
     4921 gtgtccaggt tcgcccaggt gatccgttcc tcgggcattg tgagtttagg ggtaggagac
     4981 cggaacatcg acagaacaga gctgcagacc atcaccaatg accccagact ggtcttcaca
     5041 gtgcgagagt tcagagagct tcccaacata gaagaaagaa tcatgaactc gtttggaccc
     5101 tccgcagcca ctcctgcacc tccaggggtg gacacccctc ctccttcacg gccagagaag
     5161 aagaaagcag acattgtgtt cctgttggat ggttccatca acttcaggag ggacagtttc
     5221 caggaagtgc ttcgttttgt gtctgaaata gtggacacag tttatgaaga tggcgactcc
     5281 atccaagtgg ggcttgtcca gtacaactct gaccccactg acgaattctt cctgaaggac
     5341 ttctctacca agaggcagat tattgacgcc atcaacaaag tggtctacaa agggggaaga
     5401 cacgccaaca ctaaggtggg ccttgagcac ctgcgggtaa accactttgt gcctgaggca
     5461 ggcagccgcc tggaccagcg ggtccctcag attgcctttg tgatcacggg aggaaagtcg
     5521 gtggaagatg cacaggatgt gagcctggcc ctcacccaga ggggggtcaa agtgtttgct
     5581 gttggagtga ggaatatcga ctcggaggag gttggaaaga tagcgtccaa cagcgccaca
     5641 gcgttccgcg tgggcaacgt ccaggagctg tccgaactga gcgagcaagt tttggaaact
     5701 ttgcatgatg cgatgcatga aaccctttgc cctggtgtaa ctgatgctgc caaagcttgt
     5761 aatctggatg tgattctggg gtttgatggt tctagagacc agaatgtttt tgtggcccag
     5821 aagggcttcg agtccaaggt ggacgccatc ttgaacagaa tcagccagat gcacagggtc
     5881 agctgcagcg gtggccgctc gcccaccgtg cgtgtgtcag tggtggccaa cacgccctcg
     5941 ggcccggtgg aggcctttga ctttgacgag taccagccag agatgctcga gaagttccgg
     6001 aacatgcgca gccagcaccc ctacgtcctc acggaggaca ccctgaaggt ctacctgaac
     6061 aagttcagac agtcctcgcc ggacagcgtg aaggtggtca ttcattttac tgatggagca
     6121 gacggagatc tggctgattt acacagagca tctgagaacc tccgccaaga aggagtccgt
     6181 gccttgatcc tggtgggcct tgaacgagtg gtcaacttgg agcggctaat gcatctggag
     6241 tttgggcgag ggtttatgta tgacaggccc ctgaggctta acttgctgga cttggattat
     6301 gaactagcgg agcagcttga caacattgcc gagaaagctt gctgtggggt tccctgcaag
     6361 tgctctgggc agaggggaga ccgcgggccc atcggcagca tcgggccaaa gggtattcct
     6421 ggagaagacg gctaccgagg ctatcctggt gatgagggtg gacccggtga gcgtggtccg
     6481 cctggtgtga acggcactca aggtttccag ggctgcccgg gccagagagg agtaaagggc
     6541 tctcggggat tcccaggaga gaagggcgaa gtaggagaaa ttggactgga tggtctggat
     6601 ggtgaagatg gagacaaagg attgcctggt tcttctggag agaaagggaa tcctggaaga
     6661 aggggtgata aaggacctcg aggagagaaa ggagaaagag gagatgttgg gattcgaggg
     6721 gacccgggta acccaggaca agacagccag gagagaggac ccaaaggaga aaccggtgac
     6781 ctcggcccca tgggtgtccc agggagagat ggagtacctg gaggacctgg agaaactggg
     6841 aagaatggtg gctttggccg aaggggaccc cccggagcta agggcaacaa gggcggtcct
     6901 ggccagccgg gctttgaggg agagcagggg accagaggtg cacagggccc agctggtcct
     6961 gctggtcctc cagggctgat aggagaacaa ggcatttctg gacctagggg aagcggaggt
     7021 gcccgtggcg ctcctggaga acgaggcaga accggtccac tgggaagaaa gggtgagccc
     7081 ggagagccag gaccaaaagg aggaatcggg aacccgggcc ctcgtgggga gacgggagat
     7141 gacgggagag acggagttgg cagtgaagga cgcagaggca aaaaaggaga aagaggattt
     7201 cctggatacc caggaccaaa gggtaaccca ggtgaacctg ggctaaatgg aacaacagga
     7261 cccaaaggca tcagaggccg aaggggaaat tcgggacctc cagggatagt tggacagaag
     7321 gggagacctg gctacccagg accagctggt ccaaggggca acaggggcga ctccatcgat
     7381 caatgtgccc tcatccaaag catcaaagat aaatgccctt gctgttacgg gcccctggag
     7441 tgccccgtct tcccaacaga actagccttt gctttagaca cctctgaggg agtcaaccaa
     7501 gacactttcg gccggatgcg agatgtggtc ttgagtattg tgaatgtcct gaccattgct
     7561 gagagcaact gcccgacggg ggcccgggtg gctgtggtca cctacaacaa cgaggtgacc
     7621 acggagatcc ggtttgctga ctccaagagg aagtcggtcc tcctggacaa gattaagaac
     7681 cttcaggtgg ctctgacatc caaacagcag agtctggaga ctgccatgtc gtttgtggcc
     7741 aggaacacat ttaagcgtgt gaggaacgga ttcctaatga ggaaagtggc tgttttcttc
     7801 agcaacacac ccacaagagc atccccacag ctcagagagg ctgtgctcaa actctcagat
     7861 gcggggatca cccccttgtt ccttacaagg caggaagacc ggcagctcat caacgctttg
     7921 cagatcaata acacagcagt ggggcatgcg cttgtcctgc ctgcagggag agacctcaca
     7981 gacttcctgg agaatgtcct cacgtgtcat gtttgcttgg acatctgcaa catcgaccca
     8041 tcctgtggat ttggcagttg gaggccttcc ttcagggaca ggagagcggc agggagtgat
     8101 gtggacatcg acatggcttt catcttagac agcgctgaga ccaccaccct gttccagttc
     8161 aatgagatga agaagtacat agcgtacctg gtcagacaac tggacatgag cccagatccc
     8221 aaggcctccc agcacttcgc cagagtggca gttgtgcagc acgcgccctc tgagtccgtg
     8281 gacaatgcca gcatgccacc tgtgaaggtg gaattctccc tgactgacta tggctccaag
     8341 gagaagctgg tggacttcct cagcagggga atgacacagt tgcagggaac cagggcctta
     8401 ggcagtgcca ttgaatacac catagagaat gtctttgaaa gtgccccaaa cccacgggac
     8461 ctgaaaattg tggtcctgat gctgacgggc gaggtgccgg agcagcagct ggaggaggcc
     8521 cagagagtca tcctgcaggc caaatgcaag ggctacttct tcgtggtcct gggcattggc
     8581 aggaaggtga acatcaagga ggtatacacc ttcgccagtg agccaaacga cgtcttcttc
     8641 aaattagtgg acaagtccac cgagctcaac gaggagcctt tgatgcgctt cgggaggctg
     8701 ttgccgtcct tcgtcagcag tgaaaatgct ttttacttgt ccccagatat caggaaacag
     8761 tgtgattggt tccaagggga ccaacccaca aagaaccttg tgaagtttgg tcacaaacaa
     8821 gtaaatgttc cgaataacgt tacttcaagt cctacatcca acccagtgac gacaacgaag
     8881 ccggtgacta cgacgaagcc ggtgaccacc acaacaaagc ctgtaaccac cacaacaaag
     8941 cctgtgacta ttataaatca gccatctgtg aagccagccg ctgcaaagcc ggcccctgcg
     9001 aaacctgtgg ctgccaagcc tgtggccaca aagacggcca ctgttagacc cccagtggcg
     9061 gtgaagccag caacagcagc gaagcctgta gcagcaaagc cagcagctgt aagacccccc
     9121 gctgctgctg caaaaccagt ggcgaccaag cctgaggtcc ctaggccaca ggcagccaaa
     9181 ccagctgcca ccaagccagc caccactaag cccgtggtta agatgctccg tgaagtccag
     9241 gtgtttgaga taacagagaa cagcgccaaa ctccactggg agaggcctga gccccccggt
     9301 ccttattttt atgacctcac cgtcacctca gcccatgatc agtccctggt tctgaagcag
     9361 aacctcacgg tcacggaccg cgtcattgga ggcctgctcg ctgggcagac ataccatgtg
     9421 gctgtggtct gctacctgag gtctcaggtc agagccacct accacggaag tttcagtaca
     9481 aagaaatctc agcccccacc tccacagcca gcaaggtcag cttctagttc aaccatcaat
     9541 ctaatggtga gcacagaacc attggctctc actgaaacag atatatgcaa gttgccgaaa
     9601 gacgaaggaa cttgcaggga tttcatatta aaatggtact atgatccaaa caccaaaagc
     9661 tgtgcaagat tctggtatgg aggttgtggt ggaaacgaaa acaaatttgg atcacagaaa
     9721 gaatgtgaaa aggtttgcgc tcctgtgctc gccaaacccg gagtcatcag tgtgatggga
     9781 acctaagcgt gggtggccaa catcatatac ctcttgaaga agaaggagtc agccatcgcc
     9841 aacttgtctc tgtagaagct ccgggtgtag attcccttgc actgtatcat ttcatgcttt
     9901 gatttacact cgaactcggg agggaacatc ctgctgcatg acctatcagt atggtgctaa
     9961 tgtgtctgtg gaccctcgct ctctgtctcc agcagttctc tcgaatactt tgaatgttgt
    10021 gtaacagtta gccactgctg gtgtttatgt gaacattcct atcaatccaa attccctctg
    10081 gagtttcatg ttatgcctgt tgcaggcaaa tgtaaagtct agaaaataat gcaaatgtca
    10141 cggctactct atatactttt gcttggttca ttttttttcc cttttagtta agcatgactt
    10201 tagatgggaa gcctgtgtat cgtggagaaa caagagacca actttttcat tccctgcccc
    10261 caatttccca gactagattt caagctaatt ttctttttct gaagcctcta acaaatgatc
    10321 tagttcagaa ggaagcaaaa tcccttaatc tatgtgcacc gttgggacca atgccttaat
    10381 taaagaattt aaaaaagttg taatagagaa tatttttggc attcctctca atgttgtgtg
    10441 tttttttttt ttgtgtgctg gagggagggg atttaatttt aattttaaaa tgtttaggaa
    10501 atttatacaa agaaactttt taataaagta tattgaaagt ttaaaaaaaa aaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF038953. Homo sapiens E25 ...[gi:3329375] Links  


LOCUS       AF038953                1082 bp    mRNA    linear   HTC 22-MAY-2001
DEFINITION  Homo sapiens E25 protein mRNA, complete cds.
ACCESSION   AF038953
VERSION     AF038953.1  GI:3329375
KEYWORDS    HTC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1082)
  AUTHORS   Mao,M., Fu,G., Wu,J.-S., Zhang,Q.-H., Zhou,J., Kan,L.-X.,
            Huang,Q.-H., He,K.-L., Gu,B.-W., Han,Z.-G., Shen,Y., Gu,J.,
            Yu,Y.-P., Xu,S.-H., Wang,Y., Chen,S.-J. and Chen,Z.
  TITLE     Identification of genes expressed in human CD34(+) hematopoietic
            stem/progenitor cells by expressed sequence tags and efficient
            full-length cDNA cloning
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 95 (14), 8175-8180 (1998)
  MEDLINE   98318631
   PUBMED   9653160
REFERENCE   2  (bases 1 to 1082)
  AUTHORS   Zhang,Q.H., Ye,M., Wu,X.Y., Ren,S.X., Zhao,M., Zhao,C.J., Fu,G.,
            Shen,Y., Fan,H.Y., Lu,G., Zhong,M., Xu,X.R., Han,Z.G., Zhang,J.W.,
            Tao,J., Huang,Q.H., Zhou,J., Hu,G.X., Gu,J., Chen,S.J. and Chen,Z.
  TITLE     Cloning and functional analysis of cDNAs with open reading frames
            for 300 previously undefined genes expressed in CD34+ hematopoietic
            stem/progenitor cells
  JOURNAL   Genome Res. 10 (10), 1546-1560 (2000)
  MEDLINE   20499367
   PUBMED   11042152
REFERENCE   3  (bases 1 to 1082)
  AUTHORS   Wang,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-DEC-1997) Rui-Jin Hospital, Shanghai Second Medical
            University, Shanghai Institute of Hematology, 197 Rui-Jin Road II,
            Shanghai 200025, P. R. China
FEATURES             Location/Qualifiers
     source          1..1082
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_type="CD34+ cell"
                     /tissue_type="cord blood"
     CDS             140..931
                     /codon_start=1
                     /product="E25 protein"
                     /protein_id="AAC39867.1"
                     /db_xref="GI:3329376"
                     /translation="MVKIAFNTPTAVQKEEARQDVEALLSRTVRTQILTGKELRVATQ
                     EKEGSSGRCMLTLLGLSFILAGLIVGGACIYKYFMPKSTIYRGEMCFFDSEDPANSLR
                     GGEPNFLPVTEEADIREDDNIAIIDVPVPSFSDSDPAAIIHDFEKGMTAYLDLLLGNC
                     YLMPLNTSIVMPPKNLVELFGKLASGRYLPQTYVVREDLVAVEEIRDVSNLGIFIYQL
                     CNNRKSFRLRRRDLLLGFNKRAIDKCWKIRHFPNEFIVETKICQE"
BASE COUNT      299 a    242 c    252 g    289 t
ORIGIN      
        1 gatcccagac ctcggcttgc agtagtgtta gactgaagat aaagtaagtg ctgtttgggc
       61 taacaggatc tcctcttgca gtctgcagcc caggacgctg attccagcag cgccttaccg
      121 cgcagcccga agattcacta tggtgaaaat cgccttcaat acccctaccg ccgtgcaaaa
      181 ggaggaggcg cggcaagacg tggaggccct cctgagccgc acggtcagaa ctcagatact
      241 gaccggcaag gagctccgag ttgccaccca ggaaaaagag ggctcctctg ggagatgtat
      301 gcttactctc ttaggccttt cattcatctt ggcaggactt attgttggtg gagcctgcat
      361 ttacaagtac ttcatgccca agagcaccat ttaccgtgga gagatgtgct tttttgattc
      421 tgaggatcct gcaaattccc ttcgtggagg agagcctaac ttcctgcctg tgactgagga
      481 ggctgacatt cgtgaggatg acaacattgc aatcattgat gtgcctgtcc ccagtttctc
      541 tgatagtgac cctgcagcaa ttattcatga ctttgaaaag ggaatgactg cttacctgga
      601 cttgttgctg gggaactgct atctgatgcc cctcaatact tctattgtta tgcctccaaa
      661 aaatctggta gagctctttg gcaaactggc gagtggcaga tatctgcctc aaacttatgt
      721 ggttcgagaa gacctagttg ctgtggagga aattcgtgat gttagtaacc ttggcatctt
      781 tatttaccaa ctttgcaata acagaaagtc cttccgcctt cgtcgcagag acctcttgct
      841 gggtttcaac aaacgtgcca ttgataaatg ctggaagatt agacacttcc ccaacgaatt
      901 tattgttgag accaagatct gtcaagagta agaggcaaca gatagagtgt ccttggtaat
      961 aagaagtcag agatttacaa tatgacttta acattaaggt ttatgggata ctcaagatat
     1021 ttactcatgc atttactcta ttgcttatgc cgtaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1081 aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC008492. Homo sapiens, rib...[gi:14250147] Links  


LOCUS       BC008492                1322 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, ribosomal protein L3, clone MGC:14821 IMAGE:4251511,
            mRNA, complete cds.
ACCESSION   BC008492
VERSION     BC008492.1  GI:14250147
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1322)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-MAY-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CLONTECH
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 21 Row: o Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Hexamer frequency ORF
            analysis, GenomeScan gene prediction, Similarity but not identity
            to protein.
FEATURES             Location/Qualifiers
     source          1..1322
                     /organism="Homo sapiens"
                     /db_xref="LocusID:6122"
                     /db_xref="taxon:9606"
                     /clone="MGC:14821 IMAGE:4251511"
                     /tissue_type="Prostate"
                     /clone_lib="NIH_MGC_83"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             27..1238
                     /codon_start=1
                     /product="ribosomal protein L3"
                     /protein_id="AAH08492.1"
                     /db_xref="GI:14250148"
                     /translation="MSHRKFSAPRHGSLGFLPRKRSSRHRGKVKSFPKDDPSKPVHLT
                     AFLGYKAGMTHIVREVDRPGSKVNKKEVVEAVTIVETPPMVVVGIVGYVETPRGLRTF
                     KTVFAEHISDECKRRFYKNWHKSKKKAFTKYCKKWQDEDGKKQLEKDFSSMKKYCQVI
                     RVIAHTQMRLLPLRQKKAHLMEIQVNGGTVAEKLDWARERLEQQVPVNQVFGQDEMID
                     VIGVTKGKGYKGVTSRWHTKKLPRKTHRGLRKVACIGAWHPARVAFSVARAGQKGYHH
                     RTEINKKIYKIGQGYLIKDGKLIKNNASTDYDLPDKSINPLGGFVHYGEVTNDFVMLK
                     GCVVGTKKRVLTLRKSLLVQTKRRALEKIDLKFIDTTSKFGHGRFQTMEEKKAFMGPL
                     KEDRIAKEEGA"
BASE COUNT      362 a    323 c    383 g    254 t
ORIGIN      
        1 ctctaccggc gggatttgat ggcgtgatgt ctcacagaaa gttctccgct cccagacatg
       61 ggtccctcgg cttcctgcct cggaagcgca gcagcaggca tcgtgggaag gtgaagagct
      121 tccctaagga tgacccgtcc aagccggtcc acctcacagc cttcctggga tacaaggctg
      181 gcatgactca catcgtgcgg gaagtcgaca ggccgggatc caaggtgaac aagaaggagg
      241 tggtggaggc tgtgaccatt gtagagacac cacccatggt ggttgtgggc attgtgggct
      301 acgtggaaac ccctcgaggc ctccggacct tcaagactgt ctttgctgag cacatcagtg
      361 atgaatgcaa gaggcgtttc tataagaatt ggcataaatc taagaagaag gcctttacca
      421 agtactgcaa gaaatggcag gatgaggatg gcaagaagca gctggagaag gacttcagca
      481 gcatgaagaa gtactgccaa gtcatccgtg tcattgccca cacccagatg cgcctgcttc
      541 ctctgcgcca gaagaaggcc cacctgatgg agatccaggt gaacggaggc actgtggccg
      601 agaagctgga ctgggcccgc gagaggcttg agcagcaggt acctgtgaac caagtgtttg
      661 ggcaggatga gatgatcgac gtcatcgggg tgaccaaggg caaaggctac aaaggggtca
      721 ccagtcgttg gcacaccaag aagctgcccc gcaagaccca ccgaggcctg cgcaaggtgg
      781 cctgtattgg ggcatggcat cctgctcgtg tagccttctc tgtggcacgc gctgggcaga
      841 aaggctacca tcaccgcact gagatcaaca agaagattta taagattggc cagggctacc
      901 ttatcaagga cggcaagctg atcaagaaca atgcctccac tgactatgac ctacctgaca
      961 agagcatcaa ccctctgggt ggctttgtcc actatggtga agtgaccaat gactttgtca
     1021 tgctgaaagg ctgtgtggtg ggaaccaaga agcgggtgct caccctccgc aagtccttgc
     1081 tggtgcagac gaagcggcgg gctctggaga agattgacct taagttcatt gacaccacct
     1141 ccaagtttgg ccatggccgc ttccagacca tggaggagaa gaaagcattc atgggaccac
     1201 tgaaggaaga ccgaattgca aaggaagaag gagcttaatg ccaggaacag attttgcagt
     1261 tggtggggtc tcaataaaag ttattttcca ccaaaaaaaa aaaaaaaaaa aaaaaaaaga
     1321 aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC012146. Homo sapiens, Sim...[gi:15082460] Links  


LOCUS       BC012146                1298 bp    mRNA    linear   PRI 06-AUG-2001
DEFINITION  Homo sapiens, Similar to ribosomal protein L3, clone MGC:20359
            IMAGE:4549682, mRNA, complete cds.
ACCESSION   BC012146
VERSION     BC012146.1  GI:15082460
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1298)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
            Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
            Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
            Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
            Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
            George Yang, Scott Zuyderduyn, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 29 Row: b Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Similarity but not
            identity to protein.
FEATURES             Location/Qualifiers
     source          1..1298
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:20359 IMAGE:4549682"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             14..1225
                     /codon_start=1
                     /product="Similar to ribosomal protein L3"
                     /protein_id="AAH12146.1"
                     /db_xref="GI:15082461"
                     /translation="MSHRKFSAPRHGSLGFLPRKRSSRHRGKVKSFPKDDPSKPVHLT
                     AFLGYKAGMTHIVREVDRPGSKVNKKEVVEAVTIVETPPMVVVGIVGYVETPRGLRTF
                     KTVFAEHISDECKRRFYKNWHKSKKKAFTKYCKKWQDEDGKKQLEKDFSSMKKYCQVI
                     RVIAHTQMRLLPLRQKKAHLMEIQVNGGTVAEKLDWARERLEQQVPVNQVFGQDEMID
                     VIGVTKGKGYKGVTSRWHTKKLPRKTHRGLRKVACIGAWHPARVAFSVARAGQKGYHH
                     RTEINKKIYKIGQGYLIKDGKLIKNNASTDYDLSDKSINPLGGFVHYGEVTNDFVMLK
                     GCVVGTKKRVLTLRKSLLVQTKRRALEKIDLKFIDTTSKFGHGRFQTMEEKKAFMGPL
                     KKDRIAKEEGA"
BASE COUNT      351 a    316 c    377 g    254 t
ORIGIN      
        1 atttgatggc gtgatgtctc acagaaagtt ctccgctccc agacatgggt ccctcggctt
       61 cctgcctcgg aagcgcagca gcaggcatcg tgggaaggtg aagagcttcc ctaaggatga
      121 cccgtccaag ccggtccacc tcacagcctt cctgggatac aaggctggca tgactcacat
      181 cgtgcgggaa gtcgacaggc cgggatccaa ggtgaacaag aaggaggtgg tggaggctgt
      241 gaccattgta gagacaccac ccatggtggt tgtgggcatt gtgggctacg tggaaacccc
      301 tcgaggcctc cggaccttca agactgtctt tgctgagcac atcagtgatg aatgcaagag
      361 gcgtttctat aagaattggc ataaatctaa gaagaaggcc tttaccaagt actgcaagaa
      421 atggcaggat gaggatggca agaagcagct ggagaaggac ttcagcagca tgaagaagta
      481 ctgccaagtc atccgtgtca ttgcccacac ccagatgcgc ctgcttcctc tgcgccagaa
      541 gaaggcccac ctgatggaga tccaggtgaa cggaggcact gtggccgaga agctggactg
      601 ggcccgcgag aggcttgagc agcaggtacc tgtgaaccaa gtgtttgggc aggatgagat
      661 gatcgacgtc atcggggtga ccaagggcaa aggctacaaa ggggtcacca gtcgttggca
      721 caccaagaag ctgccccgca agacccaccg aggcctgcgc aaggtggcct gtattggggc
      781 atggcatcct gctcgtgtag ccttctctgt ggcacgcgct gggcagaaag gctaccatca
      841 ccgcactgag atcaacaaga agatttataa gattggccag ggctacctta tcaaggacgg
      901 caagctgatc aagaacaatg cctccactga ctatgaccta tctgacaaga gcatcaaccc
      961 tctgggtggc tttgtccact atggtgaagt gaccaatgac tttgtcatgc tgaaaggctg
     1021 tgtggtggga accaagaagc gggtgctcac cctccgcaag tccttgctgg tgcagacgaa
     1081 gcggcgggct ctggagaaga ttgaccttaa gttcattgac accacctcca agtttggcca
     1141 tggccgcttc cagaccatgg aggagaagaa agcattcatg ggaccactga agaaagaccg
     1201 aattgcaaag gaagaaggag cttaatgcca ggaacagatt ttgcagttgg tggggtctca
     1261 ataaaagtta ttttccactg aaaaaaaaaa aaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF052124. Homo sapiens clon...[gi:3360431] Links  


LOCUS       AF052124                1524 bp    mRNA    linear   PRI 05-AUG-1998
DEFINITION  Homo sapiens clone 23810 osteopontin mRNA, complete cds.
ACCESSION   AF052124
VERSION     AF052124.1  GI:3360431
KEYWORDS    FLI_CDNA.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1524)
  AUTHORS   Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A.
  TITLE     A 'double adaptor' method for improved shotgun library construction
  JOURNAL   Anal. Biochem. 236 (1), 107-113 (1996)
  MEDLINE   96207227
   PUBMED   8619474
REFERENCE   2  (bases 1 to 1524)
  AUTHORS   Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W.,
            Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A.
  TITLE     Large-scale concatenation cDNA sequencing
  JOURNAL   Genome Res. 7 (4), 353-358 (1997)
  MEDLINE   97264341
   PUBMED   9110174
REFERENCE   3  (bases 1 to 1524)
  AUTHORS   Yu,W., Sarginson,J. and Gibbs,R.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-MAR-1998) Molecular and Human Genetics, Baylor
            College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA
FEATURES             Location/Qualifiers
     source          1..1524
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="I.M.A.G.E. Consortium clone ID 23810"
                     /sex="female"
                     /tissue_type="brain"
                     /clone_lib="1NIB"
                     /dev_stage="infant"
     gene            1..1524
                     /gene="SPP1"
     CDS             88..990
                     /gene="SPP1"
                     /note="similar to Homo sapiens osteopontin encoded by
                     GenBank Accession Number J04765"
                     /codon_start=1
                     /product="osteopontin"
                     /protein_id="AAC28619.1"
                     /db_xref="GI:3360432"
                     /translation="MRIAVICFCLLGITCAIPVKQADSGSSEEKQLYNKYPDAVATWL
                     NPDPSQKQNLLAPQTLPSKSNESHDHMDDMDDEDDDDHVDSQDSIDSNDSDDVDDTDD
                     SHQSDESHHSDESDELVTDFPTDLPATEVFTPVVPTVDTYDGRGDSVVYGLRSKSKKF
                     RRPDIQYPDATDEDITSHMESEELNGAYKAIPVAQDLNAPSDWDSRGKDSYETSQLDD
                     QSAETHSHKQSRLYKRKANDESNEHSDVIDSQELSKVSREFHSHEFHSHEDMLVVDPK
                     SKEEDKHLKFRISHELDSASSEVN"
BASE COUNT      485 a    308 c    309 g    422 t
ORIGIN      
        1 gcagagcaca gcatcgtcgg gaccagactc gtctcaggcc agttgcagcc ttctcagcca
       61 aacgccgacc aaggaaaact cactaccatg agaattgcag tgatttgctt ttgcctccta
      121 ggcatcacct gtgccatacc agttaaacag gctgattctg gaagttctga ggaaaagcag
      181 ctttacaaca aatacccaga tgctgtggcc acatggctaa accctgaccc atctcagaag
      241 cagaatctcc tagccccaca gacccttcca agtaagtcca acgaaagcca tgaccacatg
      301 gatgatatgg atgatgaaga tgatgatgac catgtggaca gccaggactc cattgactcg
      361 aacgactctg atgatgtaga tgacactgat gattctcacc agtctgatga gtctcaccat
      421 tctgatgaat ctgatgaact ggtcactgat tttcccacgg acctgccagc aaccgaagtt
      481 ttcactccag ttgtccccac agtagacaca tatgatggcc gaggtgatag tgtggtttat
      541 ggactgaggt caaaatctaa gaagtttcgc agacctgaca tccagtaccc tgatgctaca
      601 gacgaggaca tcacctcaca catggaaagc gaggagttga atggtgcata caaggccatc
      661 cccgttgccc aggacctgaa cgcgccttct gattgggaca gccgtgggaa ggacagttat
      721 gaaacgagtc agctggatga ccagagtgct gaaacccaca gccacaagca gtccagatta
      781 tataagcgga aagccaatga tgagagcaat gagcattccg atgtgattga tagtcaggaa
      841 ctttccaaag tcagccgtga attccacagc catgaatttc acagccatga agatatgctg
      901 gttgtagacc ccaaaagtaa ggaagaagat aaacacctga aatttcgtat ttctcatgaa
      961 ttagatagtg catcttctga ggtcaattaa aaggagaaaa aatacaattt ctcactttgc
     1021 atttagtcaa aagaaaaaat gctttatagc aaaatgaaag agaacatgaa atgcttcttt
     1081 ctcagtttat tggttgaatg tgtatctatt tgagtctgga aataactaat gtgtttgata
     1141 attagtttag tttgtggctt catggaaact ccctgtaaac taaaagcttc agggttatgt
     1201 ctatgttcat tctatagaag aaatgcaaac tatcactgta ttttaatatt tgttattctc
     1261 tcatgaatag aaatttatgt agaagcaaac aaaatacttt tacccactta aaaagagaat
     1321 ataacatttt atgtcactat aatcttttgt tttttaagtt agtgtatatt ttgttgtgat
     1381 tatctttttg tggtgtgaat aaatctttta tcttgaatgt aataagaatt tggtggtgtc
     1441 aattgcttat ttgttttccc acggttgtcc agcaattaat aaaacataac cttttttact
     1501 gcctaaaaaa aaaaaaaaaa aaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF047576. Homo sapiens tran...[gi:2952290] Links  


LOCUS       AF047576                1074 bp    DNA     linear   PRI 12-MAR-1998
DEFINITION  Homo sapiens transcobalamin II (TCII) gene, 5' flanking region,
            exon 1, and partial cds.
ACCESSION   AF047576
VERSION     AF047576.1  GI:2952290
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1074)
  AUTHORS   Platica,O., Janeczko,R., Quadros,E.V., Regec,A., Romain,R. and
            Rothenberg,S.P.
  TITLE     The cDNA sequence and the deduced amino acid sequence of human
            transcobalamin II show homology with rat intrinsic factor and human
            transcobalamin I
  JOURNAL   J. Biol. Chem. 266 (12), 7860-7863 (1991)
  MEDLINE   91210312
   PUBMED   1708393
REFERENCE   2  (bases 1 to 1074)
  AUTHORS   Regec,A., Quadros,E.V., Platica,O. and Rothenberg,S.P.
  TITLE     The cloning and characterization of the human transcobalamin II
            gene
  JOURNAL   Blood 85 (10), 2711-2719 (1995)
  MEDLINE   95261033
   PUBMED   7742531
REFERENCE   3  (bases 1 to 1074)
  AUTHORS   Regec,A., Quadros,E.V., Platica,O. and Rothenberg,S.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-FEB-1998) Medicine, SUNY-HSCB, 450 Clarkson Avenue,
            Brooklyn, NY 11203, USA
FEATURES             Location/Qualifiers
     source          1..1074
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="22"
     gene            <691..>1074
                     /gene="TCII"
     mRNA            691..>913
                     /gene="TCII"
                     /product="transcobalamin II"
     exon            691..913
                     /gene="TCII"
                     /number=1
     5'UTR           691..849
                     /gene="TCII"
     CDS             850..>913
                     /gene="TCII"
                     /codon_start=1
                     /product="transcobalamin II"
                     /protein_id="AAC05491.1"
                     /db_xref="GI:2952291"
                     /translation="MRHLGAFLFLLGVLGALTEMC"
     intron          914..>1074
                     /gene="TCII"
                     /number=1
BASE COUNT      196 a    352 c    271 g    255 t
ORIGIN      
        1 atgctcaagt gatctgcccg cctcagcctt tcaaagtgct aggattacag gtgtgagcca
       61 ccgtgcccgg acttaatccc attctttaaa cttgttttgt tttgttcctc tccaggaggc
      121 tcccagccct ttcggattgg ttgagaaaag tggcctggct ggtctggggc cagcggcacc
      181 caccctcccc tcaatttgcc caactacccc cccccacaca gaactgccca acgtacccgc
      241 ccgaactgcc aaccccaccc acaatccctc ccgccacaac tgagggaggc ggtgctgtaa
      301 acagctgact ccagcaatgc tgccacgtga ccactgcagc tgcagctcac tgttccactc
      361 cttgtcctgg gctaggtggg cactaccagg ggctcctttg gtaaggagta cgggtaggca
      421 cccggtcctg ccaatccacc actggaacag ctggggggac agcagacagg cacggtcgga
      481 cagacttgac agatcaggca tcaggccctc tgagctggtc ccgggctctt taagcaggaa
      541 cgtgaatggc ctcaagatgt ctcacatggt cccactagcc ctcctcctcc ctttgttcct
      601 tacctccagg agggctgtct gcccttcctt cctctgttct ttggccttat gttccccgcc
      661 accacagacc ttcccccgcc ccacccctct gcagacttag ccgtgcattg caggcatgga
      721 ggattaatca gtgacaggaa ggtgcgtctc tcggagcggt gaccagctgt ggtcaggaga
      781 gcctcagcag gggccagccc caggagtctt tcccgattct tgctcactgc tcacccacct
      841 gctgctgcca tgaggcacct tggggccttc ctcttccttc tgggggtcct gggggccctc
      901 actgagatgt gtggtgagta actcgcctct atcctgtgcc tctttcctcc tgggtcctta
      961 gtggggtggc tagggcatag gatcagggaa cttacctgcc cttctaagct cccatagcag
     1021 tttgggctta gctggacctc agcatttaca catcctattg tgattgatta tatg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M60396. Human transcobala...[gi:339195] Links  


LOCUS       HUMTCII                 1866 bp    mRNA    linear   PRI 13-JAN-1995
DEFINITION  Human transcobalamin II (TCII) mRNA, complete cds.
ACCESSION   M60396
VERSION     M60396.1  GI:339195
KEYWORDS    transcobalamin II.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1866)
  AUTHORS   Platica,O., Janeczko,R., Quadros,E.V., Regec,A., Romain,R. and
            Rothenberg,S.P.
  TITLE     The cDNA sequence and the deduced amino acid sequence of human
            transcobalamin II show homology with rat intrinsic factor and human
            transcobalamin I
  JOURNAL   J. Biol. Chem. 266 (12), 7860-7863 (1991)
  MEDLINE   91210312
   PUBMED   1708393
COMMENT     Original source text: Human umbilical vein endothelial cell, cDNA
            to mRNA.
FEATURES             Location/Qualifiers
     source          1..1866
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="22q"
                     /cell_type="endothelial cell"
                     /tissue_type="umbilical vein"
     gene            1..1866
                     /gene="TCN2"
     CDS             38..1321
                     /gene="TCN2"
                     /codon_start=1
                     /product="transcobalamin II"
                     /protein_id="AAA61054.1"
                     /db_xref="GI:339196"
                     /db_xref="GDB:G00-119-608"
                     /translation="MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDR
                     LSLEHLNPSIYVGLRLSSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSM
                     GQLALYLLALRANCEFVRGHKGDRLVSQLKWFLEDEKRAIGHDHKGHPHTSYYQYGLG
                     ILALCLHQKRVHDSVVDKLLYAVEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQR
                     ITMAIRTVREEILKAQTPEGHFGNVYSTPLALQFLMTSPMPGAELGTACLKARVALLA
                     SLQDGAFQNALMISQLLPVLNHKTYIDLIFPDCLAPRVMLEPAAETIPQTQEIISVTL
                     QVLSLLPPYRQSISVLAGSTVEDVLKKAHELGGFTYETQASSSGPYLTSVMGKAAGER
                     EFWQLLRDPNTPLLQGIADYRPKDGETIELRLVSW"
     sig_peptide     38..91
                     /gene="TCN2"
                     /note="G00-119-608"
     mat_peptide     92..1318
                     /gene="TCN2"
                     /product="transcobalamin II"
                     /note="G00-119-608"
                     /evidence=experimental
BASE COUNT      395 a    575 c    500 g    396 t
ORIGIN      
        1 ccgattcttg ctcactgctc acccacctgc tgctgccatg aggcaccttg gggccttcct
       61 cttccttctg ggggtcctgg gggccctcac tgagatgtgt gaaataccag agatggacag
      121 ccatctggta gagaagttgg gccagcacct cttaccttgg atggaccggc tttccctgga
      181 gcacttgaac cccagcatct atgtgggcct acgcctctcc agtctgcagg ctgggaccaa
      241 ggaagacctc tacctgcaca gcctcaagct tggttaccag cagtgcctcc tagggtctgc
      301 cttcagcgag gatgacggtg actgccaggg caagccttcc atgggccagc tggccctcta
      361 cctgctcgct ctcagagcca actgtgagtt tgtcaggggc cacaaggggg acaggctggt
      421 ctcacagctc aaatggttcc tggaggatga gaagagagcc attgggcatg atcacaaggg
      481 ccacccccac actagctact accagtatgg cctgggcatt ctggccctgt gtctccacca
      541 gaagcgggtc catgacagcg tggtggacaa acttctgtat gctgtggaac ctttccacca
      601 gggccaccat tctgtggaca cagcagccat ggcaggcttg gcattcacct gtctgaagcg
      661 ctcaaacttc aaccctggtc ggagacaacg gatcaccatg gccatcagaa cagtgcgaga
      721 ggagatcttg aaggcccaga cccccgaggg ccactttggg aatgtctaca gcaccccatt
      781 ggcattacag ttcctcatga cttcccccat gcctggggca gaactgggaa cagcatgtct
      841 caaggcgagg gttgctttgc tggccagtct gcaggatgga gccttccaga atgctctcat
      901 gatttcccag ctgctgcccg ttctgaacca caagacctac attgatctga tcttcccaga
      961 ctgtctggca ccacgagtca tgttggaacc agctgctgag accattcctc agacccaaga
     1021 gatcatcagt gtcacgctgc aggtgcttag tctcttgccg ccgtacagac agtccatctc
     1081 tgttctggcc gggtccaccg tggaagatgt cctgaagaag gcccatgagt taggaggatt
     1141 cacatatgaa acacaggcct cctcgtcagg cccctactta acctccgtga tggggaaagc
     1201 ggccggagaa agggagttct ggcagcttct ccgagacccc aacaccccac tgttgcaagg
     1261 tattgctgac tacagaccca aggatggaga aaccattgag ctgaggctgg ttagctggta
     1321 gcccctgagc tccctcatcc cagcagcctc gcacactccc taggcttcta ccctccctcc
     1381 tgatgtccct ggaacaggaa ctcgcctgac cctgctgcca cctcctgtgc actttgagca
     1441 atgccccctg ggatcacccc agccacaagc ccttcgaggg ccctatacca tggcccacct
     1501 tggagcagag agccaagcat cttccctggg aagtctttct ggccaagtct ggccagcctg
     1561 gccctgcagg tctcccatga aggccacccc atggtctgat gggcatgaag catctcagac
     1621 tccttggcaa aaaacggagt ccgcaggccg caggtgttgt gaagaccact cgttctgtgg
     1681 ttggggtcct gcaagaaggc ctcctcagcc cgggggctat ggccctgacc ccagctctcc
     1741 actctgctgt tagagtggca gctctgagct ggttgtggca cagtagctgg ggagacctca
     1801 gcagggctgc tcagtgcctg cctctgacaa aattaaagca ttgatggcct gtggacctgc
     1861 aaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M34641. Human fibroblast ...[gi:182529] Links  


LOCUS       HUMFGF1A                3343 bp    mRNA    linear   PRI 14-DEC-2001
DEFINITION  Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds.
ACCESSION   M34641
VERSION     M34641.1  GI:182529
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3343)
  AUTHORS   Wennstrom,S., Sandstrom,C. and Claesson-Welsh,L.
  TITLE     cDNA cloning and expression of a human FGF receptor which binds
            acidic and basic FGF
  JOURNAL   Growth Factors 4 (3), 197-208 (1991)
  MEDLINE   92118394
   PUBMED   1722683
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Claesson-Welsh, 25-MAY-1990.
              Author address: L.Claesson-Welsh
              Ludwig Institute for Cancer Research
              Biomedical Center
              Box 595
              S-751 24 Uppsala
              SWEDEN.
FEATURES             Location/Qualifiers
     source          1..3343
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             10..2472
                     /note="FGF receptor-1 precursor"
                     /codon_start=1
                     /protein_id="AAA35835.1"
                     /db_xref="GI:182530"
                     /translation="MWSWKCLLFWAVLVTATLCTARPSPTLPEQAQPWGAPVEVESFL
                     VHPGDLLQLRCRLRDDVQSINWLRDGVQLAESNRTRITGEEVEVQDSVPADSGLYACV
                     TSSPSGSDTTYFSVNVSDALPSSEDDDDDDDSSSEEKETDNTKPNPVAPYWTSPEKME
                     KKLHAVPAAKTVKFKCPSSGTPNPTLRWLKNSKEFKPDHRIGGYKVRYATWSIIMDSV
                     VPSDKGNYTCIVENEYGSINHTYQLDVVERSPHRPILQAGLPANKTVALGSNVEFMCK
                     VYSDPQPHIQWLKHIEVNGSKIGPDNLPYVQILKTAGVNTTDKEMEVLHLRNVSFEDA
                     GEYTCLAGNSIGLSHHSAWLTVLEALEERPAVMTSPLYLEIIIYCTGAFLISCMVGSV
                     IVYKMKSGTKKSDFHSQMAVHKLAKSIPLRRQVTVSADSSASMNSGVLLVRPSRLSSS
                     GTPMLAGVSEYELPEDPRWELPRDRLVLGKPLGEGCFGQVVLAEAIGLDKDKPNRVTK
                     VAVKMLKSDATEKDLSDLISEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGN
                     LREYLQARRPPGLEYCYNPSHNPEEQLSSKDLVSCAYQVARGMEYLASKKCIHRDLAA
                     RNVLVTEDNVMKIADFGLARDIHHIDYYKKTTNGRLPVKWMAPEALFDRIYTHQSDVW
                     SFGVLLWEIFTLGGSPYPGVPVEELFKLLKEGHRMDKPSNCTNELYMMMRDCWHAVPS
                     QRPTFKQLVEDLDRIVALTSNQEYLDLSMPLDQYSPSFPDTRSSTCSSGEDSVFSHEP
                     LPEEPCLPRHPAQLANGGLKRR"
     sig_peptide     10..72
                     /note="FGF receptor-1 signal peptide"
     mat_peptide     73..2469
                     /product="FGF receptor-1"
BASE COUNT      766 a    957 c    911 g    709 t
ORIGIN      
        1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca
       61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct
      121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg
      181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac
      241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc
      301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat
      361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag
      421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa
      481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct
      541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct
      601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct
      661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc
      721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca
      781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg
      841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag
      901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc
      961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat
     1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg
     1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc
     1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg
     1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag
     1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct
     1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg
     1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc
     1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg
     1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac
     1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg
     1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc
     1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca
     1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac
     1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc
     1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata
     1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac
     1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac
     2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc
     2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg
     2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca
     2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc
     2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc
     2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag
     2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc
     2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc
     2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg
     2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct
     2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc
     2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg
     2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc
     2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc
     2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc
     2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac
     3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc
     3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat
     3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc
     3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga
     3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt
     3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X66945. H.sapiens N-sam m...[gi:35109] Links  


LOCUS       HSNSAMTK                3981 bp    mRNA    linear   PRI 24-NOV-1993
DEFINITION  H.sapiens N-sam mRNA for fibroblast growth factor receptor.
ACCESSION   X66945 S37352
VERSION     X66945.1  GI:35109
KEYWORDS    FGF receptor related gene; fibroblast growth factor; Tyrosine
            kinase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3981)
  AUTHORS   Hattori,Y., Odagiri,H., Katoh,O., Sakamoto,H., Morita,T.,
            Shimotohno,K., Tobinai,K., Sugimura,T. and Terada,M.
  TITLE     K-sam-related gene, N-sam, encodes fibroblast growth factor
            receptor and is expressed in T-lymphocytic tumors
  JOURNAL   Cancer Res. 52 (12), 3367-3371 (1992)
  MEDLINE   92282615
   PUBMED   1317750
REFERENCE   2  (bases 1 to 3981)
  AUTHORS   Hattori,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-1992) Y. Hattori, National Cancer Center Research
            Insitute, Genetics Division, 5-1-1, Tsukiji, Chuo-ku, Tokyo 104,
            JAPAN
FEATURES             Location/Qualifiers
     source          1..3981
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="8"
                     /clone="N-sam4, 5N-sam"
                     /cell_type="immature teratoma"
     gene            727..3195
                     /gene="N-sam"
     CDS             727..3195
                     /gene="N-sam"
                     /function="tyrosine kinase"
                     /codon_start=1
                     /product="fibroblast growth factor receptor"
                     /protein_id="CAA47375.1"
                     /db_xref="GI:35110"
                     /db_xref="SWISS-PROT:P11362"
                     /translation="MWSWKCLLFWAVLVTATLCTARPSPTLPEQAQPWGAPVEVESFL
                     VHPGDLLQLRCRLRDDVQSINWLRDGVQLAESNRTRITGEEVEVQDSVPADSGLYACV
                     TSSPSGSDTTYFSVNVSDALPSSEDDDDDDDSSSEEKETDNTKPNRMPVAPYWTSPEK
                     MEKKLHAVPAAKTVKFKCPSSGTPNPTLRWLKNGKEFKPDHRIGGYKVRYATWSIIMD
                     SVVPSDKGNYTCIVENEYGSINHTYQLDVVERSPHRPILQAGLPANKTVALGSNVEFM
                     CKVYSDPQPHIQWLKHIEVNGSKIGPDNLPYVQILKTAGVNTTDKEMEVLHLRNVSFE
                     DAGEYTCLAGNSIGLSHHSAWLTVLEALEERPAVMTSPLYLEIIIYCTGAFLISCMVG
                     SVIVYKMKSGTKKSDFHSQMAVHKLAKSIPLRRQVTVSADSSASMNSGVLLVRPSRLS
                     SSGTPMLAGVSEYELPEDPRWELPRDRLVLGKPLGEGCFGQVVLAEAIGLDKDKPNRV
                     TKVAVKMLKSDATEKDLSDLISEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASK
                     GNLREYLQARRPPGLEYCYNPSHNPEEQLSSKDLVSCAYQVARGMEYLASKKCIHRDL
                     AARNVLVTEDNVMKIADFGLARDIHHIDYYKKTTNGRLPVKWMAPEALFDRIYTHQSD
                     VWSFGVLLWEIFTLGGSPYPGVPVEELFKLLKEGHRMDKPSNCTNELYMMMRDCWHAV
                     PSQRPTFKQLVEDLDRIVALTSNQEYLDLSMPLDQYSPSFPDTRSSTCSSGEDSVFSH
                     EPLPEEPCLPRHPAQLANGGLKRR"
     sig_peptide     727..783
                     /gene="N-sam"
BASE COUNT      856 a   1205 c   1174 g    746 t
ORIGIN      
        1 cctcttgcgg ccacaggcgc ggcgtcctcg gcggcgggcg gcagctagcg ggagccggga
       61 cgccggtgca gccgcagcgc gcggaggaac ccgggtgtgc cgggagctgg gcggccacgt
      121 ccggacggga ccgagacccc tcgtagcgca ttgcggcgac ctcgccttcc ccggccgcga
      181 gcgcgccgct gcttgaaaag ccgcggaacc caaggacttt tctccggtcc gagctcgggg
      241 cgccccgcag gcgcacggta cccgtgctgc agtcgggcac gccgcggcgc cgggggcctc
      301 cgcagggcga tggagccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga
      361 ggggggcaca aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct
      421 ccggggcacc agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc
      481 gccgccggga gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga
      541 gcccgggggc ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg
      601 gcggactctc ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc
      661 gagctcactg tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga
      721 actgggatgt ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc
      781 tgcaccgcta ggccgtcccc gaccttgcct gaacaagccc agccctgggg agcccctgtg
      841 gaagtggagt ccttcctggt ccaccccggt gacctgctgc agcttcgctg tcggctgcgg
      901 gacgatgtgc agagcatcaa ctggctgcgg gacggggtgc agctggcgga aagcaaccgc
      961 acccgcatca caggggagga ggtggaggtg caggactccg tgcccgcaga ctccggcctc
     1021 tatgcttgcg taaccagcag cccctcgggc agtgacacca cctacttctc cgtcaatgtt
     1081 tcagatgctc tcccctcctc ggaggatgat gatgatgatg atgactcctc ttcagaggag
     1141 aaagaaacag ataacaccaa accaaaccgt atgcccgtag ctccatattg gacatcccca
     1201 gaaaagatgg aaaagaaatt gcatgcagtg ccggctgcca agacagtgaa gttcaaatgc
     1261 ccttccagtg ggaccccaaa ccccacactg cgctggttga aaaatggcaa agaattcaaa
     1321 cctgaccaca gaattggagg ctacaaggtc cgttatgcca cctggagcat cataatggac
     1381 tctgtggtgc cctctgacaa gggcaactac acctgcattg tggagaatga gtacggcagc
     1441 atcaaccaca cataccagct ggatgtcgtg gagcggtccc ctcaccggcc catcctgcaa
     1501 gcagggttgc ccgccaacaa aacagtggcc ctgggtagca acgtggagtt catgtgtaag
     1561 gtgtacagtg acccgcagcc gcacatccag tggctaaagc acatcgaggt gaatgggagc
     1621 aagattggcc cagacaacct gccttatgtc cagatcttga agactgctgg agttaatacc
     1681 accgacaaag agatggaggt gcttcactta agaaatgtct cctttgagga cgcaggggag
     1741 tatacgtgct tggcgggtaa ctctatcgga ctctcccatc actctgcatg gttgaccgtt
     1801 ctggaagccc tggaagagag gccggcagtg atgacctcgc ccctgtacct ggagatcatc
     1861 atctattgca caggggcctt cctcatctcc tgcatggtgg ggtcggtcat cgtctacaag
     1921 atgaagagtg gtaccaagaa gagtgacttc cacagccaga tggctgtgca caagctggcc
     1981 aagagcatcc ctctgcgcag acaggtaaca gtgtctgctg actccagtgc atccatgaac
     2041 tctggggttc ttctggttcg gccatcacgg ctctcctcca gtgggactcc catgctagca
     2101 ggggtctctg agtatgagct tcccgaagac cctcgctggg agctgcctcg ggacagactg
     2161 gtcttaggca aacccctggg agagggctgc tttgggcagg tggtgttggc agaggctatc
     2221 gggctggaca aggacaaacc caaccgtgtg accaaagtgg ctgtgaagat gttgaagtcg
     2281 gacgcaacag agaaagactt gtcagacctg atctcagaaa tggagatgat gaagatgatc
     2341 gggaagcata agaatatcat caacctgctg ggggcctgca cgcaggatgg tcccttgtat
     2401 gtcatcgtgg agtatgcctc caagggcaac ctgcgggagt acctgcaggc ccggaggccc
     2461 ccagggctgg aatactgcta caaccccagc cacaacccag aggagcagct ctcctccaag
     2521 gacctggtgt cctgcgccta ccaggtggcc cgaggcatgg agtatctggc ctccaagaag
     2581 tgcatacacc gagacctggc agccaggaat gtcctggtga cagaggacaa tgtgatgaag
     2641 atagcagact ttggcctcgc acgggacatt caccacatcg actactataa aaagacaacc
     2701 aacggccgac tgcctgtgaa gtggatggca cccgaggcat tatttgaccg gatctacacc
     2761 caccagagtg atgtgtggtc tttcggggtg ctcctgtggg agatcttcac tctgggcggc
     2821 tccccatacc ccggtgtgcc tgtggaggaa cttttcaagc tgctgaagga gggtcaccgc
     2881 atggacaagc ccagtaactg caccaacgag ctgtacatga tgatgcggga ctgctggcat
     2941 gcagtgccct cacagagacc caccttcaag cagctggtgg aagacctgga ccgcatcgtg
     3001 gccttgacct ccaaccagga gtacctggac ctgtccatgc ccctggacca gtactccccc
     3061 agctttcccg acacccggag ctctacgtgc tcctcagggg aggattccgt cttctctcat
     3121 gagccgctgc ccgaggagcc ctgcctgccc cgacacccag cccagcttgc caatggcgga
     3181 ctcaaacgcc gctgactgcc acccacacgc cctccccaga ctccaccgtc agctgtaacc
     3241 ctcacccaca gcccctgctg ggcccaccac ctgtccgtcc ctgtcccctt tcctgctggc
     3301 aggagccggc tgcctaccag gggccttcct gtgtggcctg ccttcacccc actcagctca
     3361 cctctccctc cacctcctct ccacctgctg gtgagaggtg caaagaggca gatctttgct
     3421 gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc accaggcact
     3481 gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc ttcctgagct
     3541 ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac tctggtggca
     3601 ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc tcgattgaag
     3661 gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac tagtttgctt
     3721 tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg ggggcagtgt
     3781 tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga agaaaacaca
     3841 aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt accagagatt
     3901 tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata tattaaaaac
     3961 aaaaaagaaa aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M34641. Human fibroblast ...[gi:182529] Links  


LOCUS       HUMFGF1A                3343 bp    mRNA    linear   PRI 14-DEC-2001
DEFINITION  Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds.
ACCESSION   M34641
VERSION     M34641.1  GI:182529
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3343)
  AUTHORS   Wennstrom,S., Sandstrom,C. and Claesson-Welsh,L.
  TITLE     cDNA cloning and expression of a human FGF receptor which binds
            acidic and basic FGF
  JOURNAL   Growth Factors 4 (3), 197-208 (1991)
  MEDLINE   92118394
   PUBMED   1722683
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by L.Claesson-Welsh, 25-MAY-1990.
              Author address: L.Claesson-Welsh
              Ludwig Institute for Cancer Research
              Biomedical Center
              Box 595
              S-751 24 Uppsala
              SWEDEN.
FEATURES             Location/Qualifiers
     source          1..3343
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             10..2472
                     /note="FGF receptor-1 precursor"
                     /codon_start=1
                     /protein_id="AAA35835.1"
                     /db_xref="GI:182530"
                     /translation="MWSWKCLLFWAVLVTATLCTARPSPTLPEQAQPWGAPVEVESFL
                     VHPGDLLQLRCRLRDDVQSINWLRDGVQLAESNRTRITGEEVEVQDSVPADSGLYACV
                     TSSPSGSDTTYFSVNVSDALPSSEDDDDDDDSSSEEKETDNTKPNPVAPYWTSPEKME
                     KKLHAVPAAKTVKFKCPSSGTPNPTLRWLKNSKEFKPDHRIGGYKVRYATWSIIMDSV
                     VPSDKGNYTCIVENEYGSINHTYQLDVVERSPHRPILQAGLPANKTVALGSNVEFMCK
                     VYSDPQPHIQWLKHIEVNGSKIGPDNLPYVQILKTAGVNTTDKEMEVLHLRNVSFEDA
                     GEYTCLAGNSIGLSHHSAWLTVLEALEERPAVMTSPLYLEIIIYCTGAFLISCMVGSV
                     IVYKMKSGTKKSDFHSQMAVHKLAKSIPLRRQVTVSADSSASMNSGVLLVRPSRLSSS
                     GTPMLAGVSEYELPEDPRWELPRDRLVLGKPLGEGCFGQVVLAEAIGLDKDKPNRVTK
                     VAVKMLKSDATEKDLSDLISEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGN
                     LREYLQARRPPGLEYCYNPSHNPEEQLSSKDLVSCAYQVARGMEYLASKKCIHRDLAA
                     RNVLVTEDNVMKIADFGLARDIHHIDYYKKTTNGRLPVKWMAPEALFDRIYTHQSDVW
                     SFGVLLWEIFTLGGSPYPGVPVEELFKLLKEGHRMDKPSNCTNELYMMMRDCWHAVPS
                     QRPTFKQLVEDLDRIVALTSNQEYLDLSMPLDQYSPSFPDTRSSTCSSGEDSVFSHEP
                     LPEEPCLPRHPAQLANGGLKRR"
     sig_peptide     10..72
                     /note="FGF receptor-1 signal peptide"
     mat_peptide     73..2469
                     /product="FGF receptor-1"
BASE COUNT      766 a    957 c    911 g    709 t
ORIGIN      
        1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca
       61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct
      121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg
      181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac
      241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc
      301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat
      361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag
      421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa
      481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct
      541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct
      601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct
      661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc
      721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca
      781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg
      841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag
      901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc
      961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat
     1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg
     1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc
     1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg
     1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag
     1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct
     1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg
     1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc
     1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg
     1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac
     1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg
     1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc
     1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca
     1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac
     1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc
     1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata
     1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac
     1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac
     2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc
     2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg
     2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca
     2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc
     2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc
     2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag
     2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc
     2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc
     2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg
     2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct
     2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc
     2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg
     2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc
     2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc
     2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc
     2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac
     3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc
     3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat
     3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc
     3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga
     3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt
     3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X66945. H.sapiens N-sam m...[gi:35109] Links  


LOCUS       HSNSAMTK                3981 bp    mRNA    linear   PRI 24-NOV-1993
DEFINITION  H.sapiens N-sam mRNA for fibroblast growth factor receptor.
ACCESSION   X66945 S37352
VERSION     X66945.1  GI:35109
KEYWORDS    FGF receptor related gene; fibroblast growth factor; Tyrosine
            kinase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3981)
  AUTHORS   Hattori,Y., Odagiri,H., Katoh,O., Sakamoto,H., Morita,T.,
            Shimotohno,K., Tobinai,K., Sugimura,T. and Terada,M.
  TITLE     K-sam-related gene, N-sam, encodes fibroblast growth factor
            receptor and is expressed in T-lymphocytic tumors
  JOURNAL   Cancer Res. 52 (12), 3367-3371 (1992)
  MEDLINE   92282615
   PUBMED   1317750
REFERENCE   2  (bases 1 to 3981)
  AUTHORS   Hattori,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-1992) Y. Hattori, National Cancer Center Research
            Insitute, Genetics Division, 5-1-1, Tsukiji, Chuo-ku, Tokyo 104,
            JAPAN
FEATURES             Location/Qualifiers
     source          1..3981
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="8"
                     /clone="N-sam4, 5N-sam"
                     /cell_type="immature teratoma"
     gene            727..3195
                     /gene="N-sam"
     CDS             727..3195
                     /gene="N-sam"
                     /function="tyrosine kinase"
                     /codon_start=1
                     /product="fibroblast growth factor receptor"
                     /protein_id="CAA47375.1"
                     /db_xref="GI:35110"
                     /db_xref="SWISS-PROT:P11362"
                     /translation="MWSWKCLLFWAVLVTATLCTARPSPTLPEQAQPWGAPVEVESFL
                     VHPGDLLQLRCRLRDDVQSINWLRDGVQLAESNRTRITGEEVEVQDSVPADSGLYACV
                     TSSPSGSDTTYFSVNVSDALPSSEDDDDDDDSSSEEKETDNTKPNRMPVAPYWTSPEK
                     MEKKLHAVPAAKTVKFKCPSSGTPNPTLRWLKNGKEFKPDHRIGGYKVRYATWSIIMD
                     SVVPSDKGNYTCIVENEYGSINHTYQLDVVERSPHRPILQAGLPANKTVALGSNVEFM
                     CKVYSDPQPHIQWLKHIEVNGSKIGPDNLPYVQILKTAGVNTTDKEMEVLHLRNVSFE
                     DAGEYTCLAGNSIGLSHHSAWLTVLEALEERPAVMTSPLYLEIIIYCTGAFLISCMVG
                     SVIVYKMKSGTKKSDFHSQMAVHKLAKSIPLRRQVTVSADSSASMNSGVLLVRPSRLS
                     SSGTPMLAGVSEYELPEDPRWELPRDRLVLGKPLGEGCFGQVVLAEAIGLDKDKPNRV
                     TKVAVKMLKSDATEKDLSDLISEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASK
                     GNLREYLQARRPPGLEYCYNPSHNPEEQLSSKDLVSCAYQVARGMEYLASKKCIHRDL
                     AARNVLVTEDNVMKIADFGLARDIHHIDYYKKTTNGRLPVKWMAPEALFDRIYTHQSD
                     VWSFGVLLWEIFTLGGSPYPGVPVEELFKLLKEGHRMDKPSNCTNELYMMMRDCWHAV
                     PSQRPTFKQLVEDLDRIVALTSNQEYLDLSMPLDQYSPSFPDTRSSTCSSGEDSVFSH
                     EPLPEEPCLPRHPAQLANGGLKRR"
     sig_peptide     727..783
                     /gene="N-sam"
BASE COUNT      856 a   1205 c   1174 g    746 t
ORIGIN      
        1 cctcttgcgg ccacaggcgc ggcgtcctcg gcggcgggcg gcagctagcg ggagccggga
       61 cgccggtgca gccgcagcgc gcggaggaac ccgggtgtgc cgggagctgg gcggccacgt
      121 ccggacggga ccgagacccc tcgtagcgca ttgcggcgac ctcgccttcc ccggccgcga
      181 gcgcgccgct gcttgaaaag ccgcggaacc caaggacttt tctccggtcc gagctcgggg
      241 cgccccgcag gcgcacggta cccgtgctgc agtcgggcac gccgcggcgc cgggggcctc
      301 cgcagggcga tggagccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga
      361 ggggggcaca aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct
      421 ccggggcacc agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc
      481 gccgccggga gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga
      541 gcccgggggc ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg
      601 gcggactctc ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc
      661 gagctcactg tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga
      721 actgggatgt ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc
      781 tgcaccgcta ggccgtcccc gaccttgcct gaacaagccc agccctgggg agcccctgtg
      841 gaagtggagt ccttcctggt ccaccccggt gacctgctgc agcttcgctg tcggctgcgg
      901 gacgatgtgc agagcatcaa ctggctgcgg gacggggtgc agctggcgga aagcaaccgc
      961 acccgcatca caggggagga ggtggaggtg caggactccg tgcccgcaga ctccggcctc
     1021 tatgcttgcg taaccagcag cccctcgggc agtgacacca cctacttctc cgtcaatgtt
     1081 tcagatgctc tcccctcctc ggaggatgat gatgatgatg atgactcctc ttcagaggag
     1141 aaagaaacag ataacaccaa accaaaccgt atgcccgtag ctccatattg gacatcccca
     1201 gaaaagatgg aaaagaaatt gcatgcagtg ccggctgcca agacagtgaa gttcaaatgc
     1261 ccttccagtg ggaccccaaa ccccacactg cgctggttga aaaatggcaa agaattcaaa
     1321 cctgaccaca gaattggagg ctacaaggtc cgttatgcca cctggagcat cataatggac
     1381 tctgtggtgc cctctgacaa gggcaactac acctgcattg tggagaatga gtacggcagc
     1441 atcaaccaca cataccagct ggatgtcgtg gagcggtccc ctcaccggcc catcctgcaa
     1501 gcagggttgc ccgccaacaa aacagtggcc ctgggtagca acgtggagtt catgtgtaag
     1561 gtgtacagtg acccgcagcc gcacatccag tggctaaagc acatcgaggt gaatgggagc
     1621 aagattggcc cagacaacct gccttatgtc cagatcttga agactgctgg agttaatacc
     1681 accgacaaag agatggaggt gcttcactta agaaatgtct cctttgagga cgcaggggag
     1741 tatacgtgct tggcgggtaa ctctatcgga ctctcccatc actctgcatg gttgaccgtt
     1801 ctggaagccc tggaagagag gccggcagtg atgacctcgc ccctgtacct ggagatcatc
     1861 atctattgca caggggcctt cctcatctcc tgcatggtgg ggtcggtcat cgtctacaag
     1921 atgaagagtg gtaccaagaa gagtgacttc cacagccaga tggctgtgca caagctggcc
     1981 aagagcatcc ctctgcgcag acaggtaaca gtgtctgctg actccagtgc atccatgaac
     2041 tctggggttc ttctggttcg gccatcacgg ctctcctcca gtgggactcc catgctagca
     2101 ggggtctctg agtatgagct tcccgaagac cctcgctggg agctgcctcg ggacagactg
     2161 gtcttaggca aacccctggg agagggctgc tttgggcagg tggtgttggc agaggctatc
     2221 gggctggaca aggacaaacc caaccgtgtg accaaagtgg ctgtgaagat gttgaagtcg
     2281 gacgcaacag agaaagactt gtcagacctg atctcagaaa tggagatgat gaagatgatc
     2341 gggaagcata agaatatcat caacctgctg ggggcctgca cgcaggatgg tcccttgtat
     2401 gtcatcgtgg agtatgcctc caagggcaac ctgcgggagt acctgcaggc ccggaggccc
     2461 ccagggctgg aatactgcta caaccccagc cacaacccag aggagcagct ctcctccaag
     2521 gacctggtgt cctgcgccta ccaggtggcc cgaggcatgg agtatctggc ctccaagaag
     2581 tgcatacacc gagacctggc agccaggaat gtcctggtga cagaggacaa tgtgatgaag
     2641 atagcagact ttggcctcgc acgggacatt caccacatcg actactataa aaagacaacc
     2701 aacggccgac tgcctgtgaa gtggatggca cccgaggcat tatttgaccg gatctacacc
     2761 caccagagtg atgtgtggtc tttcggggtg ctcctgtggg agatcttcac tctgggcggc
     2821 tccccatacc ccggtgtgcc tgtggaggaa cttttcaagc tgctgaagga gggtcaccgc
     2881 atggacaagc ccagtaactg caccaacgag ctgtacatga tgatgcggga ctgctggcat
     2941 gcagtgccct cacagagacc caccttcaag cagctggtgg aagacctgga ccgcatcgtg
     3001 gccttgacct ccaaccagga gtacctggac ctgtccatgc ccctggacca gtactccccc
     3061 agctttcccg acacccggag ctctacgtgc tcctcagggg aggattccgt cttctctcat
     3121 gagccgctgc ccgaggagcc ctgcctgccc cgacacccag cccagcttgc caatggcgga
     3181 ctcaaacgcc gctgactgcc acccacacgc cctccccaga ctccaccgtc agctgtaacc
     3241 ctcacccaca gcccctgctg ggcccaccac ctgtccgtcc ctgtcccctt tcctgctggc
     3301 aggagccggc tgcctaccag gggccttcct gtgtggcctg ccttcacccc actcagctca
     3361 cctctccctc cacctcctct ccacctgctg gtgagaggtg caaagaggca gatctttgct
     3421 gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc accaggcact
     3481 gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc ttcctgagct
     3541 ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac tctggtggca
     3601 ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc tcgattgaag
     3661 gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac tagtttgctt
     3721 tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg ggggcagtgt
     3781 tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga agaaaacaca
     3841 aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt accagagatt
     3901 tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata tattaaaaac
     3961 aaaaaagaaa aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC005354. Homo sapiens, rib...[gi:13529169] Links  


LOCUS       BC005354                 490 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, ribosomal protein, large P2, clone MGC:12453
            IMAGE:4052568, mRNA, complete cds.
ACCESSION   BC005354
VERSION     BC005354.1  GI:13529169
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 490)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 16 Row: h Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 190235.
FEATURES             Location/Qualifiers
     source          1..490
                     /organism="Homo sapiens"
                     /db_xref="LocusID:6181"
                     /db_xref="taxon:9606"
                     /clone="MGC:12453 IMAGE:4052568"
                     /tissue_type="Kidney, hypernephroma"
                     /clone_lib="NIH_MGC_58"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             77..424
                     /codon_start=1
                     /product="ribosomal protein, large P2"
                     /protein_id="AAH05354.1"
                     /db_xref="GI:13529170"
                     /translation="MRYVASYLLAALGGNSSPSAKDIKKILDSVGIEADDDRLNKVIS
                     ELNGKNIEDVIAQGIGKLASVPAGGAVAVSAAPGSAAPAAGSAPAAAEEKKDEKKEES
                     EESDDDMGFGLFD"
BASE COUNT      125 a    136 c    128 g    101 t
ORIGIN      
        1 ttccttttcc tccctgtcgc caccgaggtc gcacgcgtga gacttctccg ccgcctccgc
       61 cgcagacgcc gccgcgatgc gctacgtcgc ctcctacctg ctggctgccc tagggggcaa
      121 ctcctccccc agcgccaagg acatcaagaa gatcttggac agcgtgggta tcgaggcgga
      181 cgacgaccgg ctcaacaagg ttatcagtga gctgaatgga aaaaacattg aagacgtcat
      241 tgcccagggt attggcaagc ttgccagtgt acctgctggt ggggctgtag ccgtctctgc
      301 tgccccaggc tctgcagccc ctgctgctgg ttctgcccct gctgcagcag aggagaagaa
      361 agatgagaag aaggaggagt ctgaagagtc agatgatgac atgggatttg gcctttttga
      421 ttaaattcct gctcccctgc aaataaagcc tttttacaca tctaaaaaaa aaaaaaaaaa
      481 aaaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC005920. Homo sapiens, rib...[gi:13543523] Links  


LOCUS       BC005920                 491 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, ribosomal protein, large P2, clone MGC:14517
            IMAGE:4274135, mRNA, complete cds.
ACCESSION   BC005920
VERSION     BC005920.1  GI:13543523
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 491)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-APR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CLONTECH
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 21 Row: i Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 190235.
FEATURES             Location/Qualifiers
     source          1..491
                     /organism="Homo sapiens"
                     /db_xref="LocusID:6181"
                     /db_xref="taxon:9606"
                     /clone="MGC:14517 IMAGE:4274135"
                     /tissue_type="Prostate"
                     /clone_lib="NIH_MGC_83"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             75..422
                     /codon_start=1
                     /product="ribosomal protein, large P2"
                     /protein_id="AAH05920.1"
                     /db_xref="GI:13543524"
                     /translation="MRYVASYLLAALGGNSSPSAKDIKKILDSVGIEADDDRLNKVIS
                     ELNGKNIEDVIAQGIGKLASVPAGGAVAVSAAPGSAAPAAGSAPAAAEEKKDEKKEES
                     EESDDDMGFGLFD"
BASE COUNT      128 a    137 c    128 g     98 t
ORIGIN      
        1 ccttttcctc cctgtcgcca ccgaggtcgc acgcgtgaga cttctccgcc gcctccgccg
       61 cagacgccgc cgcgatgcgc tacgtcgcct cctacctgct ggctgcccta gggggcaact
      121 cctcccccag cgccaaggac atcaagaaga tcttggacag cgtgggtatc gaggcggacg
      181 acgaccggct caacaaggtt atcagtgagc tgaatggaaa aaacattgaa gacgtcattg
      241 cccagggtat tggcaagctt gccagtgtac ctgctggtgg ggctgtagcc gtctctgctg
      301 ccccaggctc tgcagcccct gctgctggtt ctgcccctgc tgcagcagag gagaagaaag
      361 atgagaagaa ggaggagtct gaagagtcag atgatgacat gggatttggc ctttttgatt
      421 aaattcctgc tcccctgcaa ataaagcctt tttacacatc caaaaaaaaa aaaaaaaaaa
      481 aaaaaaaaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC007573. Homo sapiens, rib...[gi:14043170] Links  


LOCUS       BC007573                 456 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, ribosomal protein, large P2, clone MGC:15530
            IMAGE:3049240, mRNA, complete cds.
ACCESSION   BC007573
VERSION     BC007573.1  GI:14043170
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 456)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-MAY-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
            Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
            Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
            Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
            Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
            George Yang, Scott Zuyderduyn, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 22 Row: f Column: 5.
FEATURES             Location/Qualifiers
     source          1..456
                     /organism="Homo sapiens"
                     /db_xref="LocusID:6181"
                     /db_xref="taxon:9606"
                     /clone="MGC:15530 IMAGE:3049240"
                     /tissue_type="Ovary, adenocarcinoma"
                     /clone_lib="NIH_MGC_9"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             51..398
                     /codon_start=1
                     /product="ribosomal protein, large P2"
                     /protein_id="AAH07573.1"
                     /db_xref="GI:14043171"
                     /translation="MRYVASYLLAALGGNSSPSAKDIKKILDSVGIEADDDRLNKVIS
                     ELNGKNIEDVIAQGIGKLASVPAGGAVAVSAAPGSAAPAAGSAPAAAEEKKDEKKEES
                     EESDDDMGFGLFD"
BASE COUNT      114 a    125 c    125 g     92 t
ORIGIN      
        1 ggtcgcacgc gtgagacttc tccgccgcct ccgccgcaga cgccgccgcg atgcgctacg
       61 tcgcctccta cctgctggct gccctagggg gcaactcctc ccccagcgcc aaggacatca
      121 agaagatctt ggacagcgtg ggtatcgagg cggacgacga ccggctcaac aaggttatca
      181 gtgagctgaa tggaaaaaac attgaagacg tcattgccca gggtattggc aagcttgcca
      241 gtgtacctgc tggtggggct gtagccgtct ctgctgcccc aggctctgca gcccctgctg
      301 ctggttctgc ccctgctgca gcagaggaga agaaagatga gaagaaggag gagtctgaag
      361 agtcagatga tgacatggga tttggccttt ttgattaaat tcctgctccc ctgcaaataa
      421 agccttttta cacatctcaa aaaaaaaaaa aaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Z38026. H.sapiens mRNA fo...[gi:558378] Links  


LOCUS       HSFALL39                 615 bp    mRNA    linear   PRI 18-APR-1995
DEFINITION  H.sapiens mRNA for FALL-39 peptide antibiotic.
ACCESSION   Z38026
VERSION     Z38026.1  GI:558378
KEYWORDS    FALL-39; peptide antibiotic.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 615)
  AUTHORS   Agerberth,B., Gunne,H., Odeberg,J., Kogner,P., Boman,H.G. and
            Gudmundsson,G.H.
  TITLE     FALL-39, a putative human peptide antibiotic, is cysteine-free and
            expressed in bone marrow and testis
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 92 (1), 195-199 (1995)
  MEDLINE   95116523
REFERENCE   2  (bases 1 to 615)
  AUTHORS   Gudmundsson,G.G.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-SEP-1994) Gudmundur H. Gudmundsson, Microbiology,
            University of Stockholm, Svante, Arrheniusvag 16, Stockholm, S-106
            91, Sweden
FEATURES             Location/Qualifiers
     source          1..615
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="Hu7"
                     /tissue_type="Bone marrow"
                     /clone_lib="human bone marrow lambdagt11"
     CDS             12..524
                     /codon_start=1
                     /product="FALL-39 peptide antibiotic"
                     /protein_id="CAA86115.1"
                     /db_xref="GI:558379"
                     /db_xref="SWISS-PROT:P49913"
                     /translation="MKTQRNGHSLGRWSLVLLLLGLVMPLAIIAQVLSYKEAVLRAID
                     GINQRSSDANLYRLLDLDPRPTMDGDPDTPKPVSFTVKETVCPRTTQQSPEDCDFKKD
                     GLVKRCMGTVTLNQARGSFDISCDKDNKRFALLGDFFRKSKEKIGKEFKRIVQRIKDF
                     LRNLVPRTES"
     mat_peptide     405..521
                     /product="FALL-39 peptide antibiotic"
BASE COUNT      167 a    152 c    168 g    128 t
ORIGIN      
        1 gaattccggc catgaagacc caaaggaatg gccactccct ggggcggtgg tcactggtgc
       61 tcctgctgct gggcctggtg atgcctctgg ccatcattgc ccaggtcctc agctacaagg
      121 aagctgtcct tcgtgctata gatggcatca accagcggtc ctcggatgct aacctctacc
      181 gcctcctgga cctggacccc aggcccacga tggatgggga cccagacacg ccaaagcctg
      241 tgagcttcac agtgaaggag acagtgtgcc ccaggacgac acagcagtca ccagaggatt
      301 gtgacttcaa gaaggacggg ctggtgaagc ggtgtatggg gacagtgacc ctcaaccagg
      361 ccaggggctc ctttgacatc agttgtgata aggataacaa gagatttgcc ctgctgggtg
      421 atttcttccg gaaatctaaa gagaagattg gcaaagagtt taaaagaatt gtccagagaa
      481 tcaaggattt tttgcggaat cttgtaccca ggacagagtc ctagtgtgtg ccctaccctg
      541 gctcaggctt ctgggctctg agaaataaac tatgagagca atttcaaaaa aaaaaaaaaa
      601 aaaaaaccgg aattc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D10522. Homo sapiens mRNA...[gi:219893] Links  


LOCUS       HUMKCS                  2589 bp    mRNA    linear   PRI 02-FEB-1999
DEFINITION  Homo sapiens mRNA for 80K-L protein, complete cds.
ACCESSION   D10522 D90498
VERSION     D10522.1  GI:219893
KEYWORDS    80K-L protein; calmodulin binding protein; cytoplasm; plasma
            membrane; protein kinase C substrate.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Sakai,K., Hirai,M., Kudoh,J., Minoshima,S. and Shimizu,N.
  TITLE     Molecular cloning and chromosomal mapping of a cDNA encoding human
            80K-L protein: major substrate for protein kinase C
  JOURNAL   Genomics 14 (1), 175-178 (1992)
  MEDLINE   93052291
REFERENCE   2  (bases 1 to 2589)
  AUTHORS   Sakai,K.
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 2589)
  AUTHORS   Shimizu,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-SEP-1991) Nobuyoshi Shimizu, Keio University School
            of Medicine, Department of Molecular Biology; 35 Shinanomachi,
            Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp,
            Tel:03-3353-1211(ex.2721), Fax:03-3351-2370)
FEATURES             Location/Qualifiers
     source          1..2589
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /clone="lambda80L-[1,2]"
                     /cell_line="A431"
                     /clone_lib="lamda gt10 A431 cDNA library"
     gene            1..2589
                     /gene="80K-L"
     CDS             370..1368
                     /gene="80K-L"
                     /note="tentative"
                     /codon_start=1
                     /product="80K-L protein"
                     /protein_id="BAA01392.1"
                     /db_xref="GI:219894"
                     /translation="MGAQFSKTAAKGEAAAERPGEAAVASSPSKANGQENGHVKVNGD
                     ASPAAAESGAKEELQANGSAPAADKEEPAAAGSGAASPSSAEKGEPAAAAAPEAGASP
                     VEKEAPAEGEAAEPGSATAAEGEAASAASSTSSPKAEDGATPSPSNETPKKKKKRFSF
                     KKSFKLSGFSFKKNKKEAGEGGEAEAPAAEGGKDEAAGGAAAAAAEAGAASGEQAAAP
                     GEEAAAGEEGAAGGDPQEAKPQEAAVAPEKPPASDETKAAEEPSKVEEKKAEEAGASA
                     AACEAPSAAGPGAPPEQEAAPAEEPAAAAASSACAAPSQEAQPECSPEAPPAEAAE"
     polyA_site      2589
                     /gene="80K-L"
BASE COUNT      608 a    659 c    682 g    640 t
ORIGIN      
        1 caaccaggga gatttctcca ttttcctctt gtctacagtg cggctacaaa tctgggattt
       61 ttttattact tctttttttt tcgaactaca cttgggctcc tttttttgtg ctcgactttt
      121 ccaccctttt tccctccctc ctgtgctgct gctttttgat ctcttcgact aaaatttttt
      181 tatccggagt gtatttaatc ggttctgttc tgtcctctcc accaccccca cccccctccc
      241 tccggtgtgt gtgccgctgc cgctgttgcc gccgccgctg ctgctgctgc tcgccccgtc
      301 gttacaccaa cccgaggctc tttgtttccc ctcttggatc tgttgagttt ctttgttgaa
      361 gaagccagca tgggtgccca gttctccaag accgcagcga agggagaagc cgccgcggag
      421 aggcctgggg aggcggctgt ggcctcgtcg ccttccaaag cgaacggaca ggagaatggc
      481 cacgtgaagg taaacggcga cgcttcgccc gcggccgccg agtcgggcgc caaggaggag
      541 ctgcaggcca acggcagcgc cccggccgcc gacaaggagg agcccgcggc cgccgggagc
      601 ggggcggcgt cgccctcctc ggccgagaaa ggtgagccgg ccgccgccgc tgcccccgag
      661 gccggggcca gcccggtaga gaaggaggcc cccgcggaag gcgaggctgc cgagcccggc
      721 tcggccacgg ccgcggaggg agaggccgcg tcggccgcct cctcgacttc ttcgcccaag
      781 gccgaggacg gggccacgcc ctcgcccagc aacgagaccc cgaaaaaaaa aaagaagcgc
      841 ttttccttca agaagtcttt caagctgagc ggcttctcct tcaagaagaa caagaaggag
      901 gctggagaag gcggtgaggc tgaggcgccc gctgccgaag gcggcaagga cgaggccgcc
      961 gggggcgcag ctgcggccgc cgccgaggcg ggcgcggcct ccggggagca ggcagcggcg
     1021 ccgggcgagg aggcggcagc gggcgaggag ggggcggcgg gtggcgaccc gcaggaggcc
     1081 aagccccagg aggccgctgt cgcgccagag aagccgcccg ccagcgacga gaccaaggcc
     1141 gccgaggagc ccagcaaggt ggaggagaaa aaggccgagg aggccggggc cagcgccgcc
     1201 gcctgcgagg ccccctccgc cgccgggccc ggcgcgcccc cggagcagga ggcagccccc
     1261 gcggaggagc ccgcggccgc cgcagcctcg tcagcctgcg cagccccctc acaggaggcc
     1321 cagcccgagt gcagtccaga agccccccca gcggaggcgg cagagtaaaa gagcaagctt
     1381 ttgtgagata atcgaagaac ttttctcccc cgtttgtttg ttggagtggt gccaggtact
     1441 gttttggaga acttgtctac aaccagggat tgattttaaa gatgtctttt tttattttac
     1501 ttttttttaa gcaccaaatt ttgttgtttt tttttttctc ccctccccac agatcccatc
     1561 tcaaatcatt ctgttaacca ccattccaac aggtcgagga gagcttaaac accttcttcc
     1621 tctgccttgt ttctctttta ttttttattt tttcgcatca gtattaatgt ttttgcatac
     1681 tttgcatctt tattcaaaag tgtaaacttt ctttgtcaat ctatggacat gcccatatat
     1741 gaaggagatg ggtgggtcaa aaagggatat caaatgaagt gataggggtc acaatgggga
     1801 aattgaagtg gtgcataaca ttgccaaaat agtgtgccac tagaaatggt gtaaaggctg
     1861 tctttttttt tttttttaaa gaaaagttat taccatgtat tttgtgaggc aggtttacaa
     1921 cactacaagt cttgagttaa gaaggaaaga ggaaaaaaga aaaaacacca atacccagat
     1981 ttaaaaaaaa aaaaacgatc atagtcttag gagttcattt aaaccatagg aacttttcac
     2041 ttatctcatg ttagctgtac cagtcagtga ttaagtagaa ctacaagttg tataggcttt
     2101 attgtttatt gctggtttat gaccttaata aagtgtaatt atgtattacc agcagggtgt
     2161 ttttaactgt gactattgta taaaaacaaa tcttgatatc cagaagcaca tgaagtttgc
     2221 aactttccac cctgcccatt tttgtaaaac tgcagtcatc ttggaccttt taaaacacaa
     2281 attttaaact caaccaagct gtgataagtg gaatggttac tgtttatact gtggtatgtt
     2341 tttgattaca gcagataatg ctttcttttc cagtcgtctt tgagaataaa ggaaaaaaaa
     2401 tcttcagatg caatggtttt gtgtagcatc ttgtctatca tgttttgtaa atactggaga
     2461 agctttgacc aatttgactt agagatggaa tgtaactttg cttacaaaaa ttgctattaa
     2521 actcctgctt aaggtgttct aattttctgt gagcacacta aaagcgaaaa ataaatgtga
     2581 ataaaatgt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC007459. Homo sapiens, clo...[gi:13938612] Links  


LOCUS       BC007459                1229 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, clone MGC:12230 IMAGE:4052054, mRNA, complete cds.
ACCESSION   BC007459
VERSION     BC007459.1  GI:13938612
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1229)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAY-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 16 Row: d Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 6912563.
FEATURES             Location/Qualifiers
     source          1..1229
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:12230 IMAGE:4052054"
                     /tissue_type="Kidney, hypernephroma"
                     /clone_lib="NIH_MGC_58"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     CDS             109..762
                     /codon_start=1
                     /product="Unknown (protein for MGC:12230)"
                     /protein_id="AAH07459.1"
                     /db_xref="GI:13938613"
                     /translation="MSKPPPKPVKPGEGGQVKVFRALYTFEPRTPDELYFEEGDIIYI
                     TDMSDTNWWKGTSKGRTGLIPSNYVAEQAESIDNPLHEAAKRGNLSWLRECLDNRVGV
                     NGLDKAGSTALYWACHGGHKDIVEMLFTQPNIELNQQNKLGDTALHAAAWKGYADIVQ
                     LFLAKGARTDLRNIEKKLAFDMATNAACASLLKKKQGTDAVRTLSNAEDYLDDEDSD"
BASE COUNT      400 a    213 c    283 g    333 t
ORIGIN      
        1 acggttgtaa gccagacaaa aagaactggg gtgcccggag tgccaggtgg cgggcaagcg
       61 gtgggctttt cggcggggtc tttaggattt gcagctccag gaagcgagat gtcgaagccg
      121 ccacccaaac cagtcaaacc aggtgaggga gggcaagtta aagtcttcag agccctgtat
      181 acgtttgaac ccagaactcc agatgaatta tactttgagg aaggtgatat tatctacatt
      241 actgacatga gcgataccaa ttggtggaaa ggcacctcca aaggcaggac tggactaatt
      301 ccaagcaact atgtggctga gcaggcagaa tccattgaca atccattgca tgaagcagca
      361 aaaagaggca acttgagctg gttgagagag tgtttggaca acagagtggg tgttaatggc
      421 ttagacaaag ctggaagcac tgccttatac tgggcttgcc acgggggcca caaagatata
      481 gtggaaatgc tatttactca accaaatatt gaactgaacc agcagaacaa gttgggagat
      541 acagctttgc atgctgctgc ctggaagggt tatgcagata tcgtccagtt gtttctggca
      601 aaaggtgcta gaacagactt aagaaacatt gagaagaagc tggccttcga catggctacc
      661 aatgctgcct gtgcatctct cctgaaaaag aaacagggaa cagatgcagt tcgaacatta
      721 agcaatgccg aggactatct cgatgatgaa gactcagatt aattcctttc tggagctttg
      781 agatctaaaa cttctgttgc ttttgccatt ccaaaacttt gtctttgcca gaaaagtgtt
      841 ggtaactata aagaaaatta tatatgaaca cggcagtgtt gcactgtgtt tgagtagaac
      901 gtgtaaatga attgttccca cctttggttt gccagtaagt gactggattc ttggcacatt
      961 tgtgttcacc aaagtagaac aagaagatat tatttctatt tatcaagcaa aaggaatttt
     1021 aagatttttt tttctttaaa aacaaattag gatttttttt tttttttttt ttttttagtt
     1081 aaaatgcttt acctcaatgg ttgagatatt ttgaatggat ttttcaaggg ggggaaatgc
     1141 ttattataat aataaaccaa aatacttaac agaaaattgt cagctattct gacaaaaaca
     1201 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  






&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC004143. Homo sapiens, B-f...[gi:13278731] Links  


LOCUS       BC004143                2512 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, B-factor, properdin, clone MGC:1795 IMAGE:2959705,
            mRNA, complete cds.
ACCESSION   BC004143
VERSION     BC004143.1  GI:13278731
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2512)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Rachel Dickhoff, Jessica Fahey, Stephanie Ford, Julia
            Greene, Mark Ketteman and Anuradha Madan
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 2 Row: k Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 452937.
FEATURES             Location/Qualifiers
     source          1..2512
                     /organism="Homo sapiens"
                     /db_xref="LocusID:629"
                     /db_xref="taxon:9606"
                     /clone="MGC:1795 IMAGE:2959705"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_15"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             145..2439
                     /codon_start=1
                     /product="B-factor, properdin"
                     /protein_id="AAH04143.1"
                     /db_xref="GI:13278732"
                     /translation="MGSNLSPQLCLMPFILGLLSGGVTTTPWSLAWPQGSCSLEGVEI
                     KGGSFRLLQEGQALEYVCPSGFYPYPVQTRTCRSTGSWSTLKTQDQKTVRKAECRAIH
                     CPRPHDFENGEYWPRSPYYNVSDEISFHCYDGYTLRGSANRTCQVNGRWSGQTAICDN
                     GAGYCSNPGIPIGTRKVGSQYRLEDSVTYHCSRGLTLRGSQRRTCQEGGSWSGTEPSC
                     QDSFMYDTPQEVAEAFLSSLTETIEGVDAEDGHGPGEQQKRKIVLDPSGSMNIYLVLD
                     GSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADW
                     VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWNRTRHVIILMTDGLH
                     NMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQ
                     HVFKVKDMENLEDVFYQMIDESQSLSLCGMVWEHRKGTDYHKQPWQAKISVIRPSKGH
                     ESCMGAVVSEYFVLTAAHCFTVDDKEHSIKVSVGGEKRDLEIEVVLFHPNYNINGKKE
                     AGIPEFYDYDVALIKLKNKLKYGQTIRPICLPCTEGTTRALRLPPTTTCQQQKEELLP
                     AQDIKALFVSEEEKKLTRKEVYIKNGDKKGSCERDAQYAPGYDKVKDISEVVTPRFLC
                     TGGVSPYADPNTCRGDSGGPLIVHKRSRFIQVGVISWGVVDVCKNQKRQKQVPAHARD
                     FHINLFQVLPWLKEKLQDEDLGFL"
BASE COUNT      681 a    623 c    687 g    521 t
ORIGIN      
        1 ggcacgaggg ggagcagggg aagggaatgt gaccaggtct aggtctggag tttcagcttg
       61 gacactgagc caagcagaca agcaaagcaa gccaggacac accatcctgc cccaggccca
      121 gcttctctcc tgccttccaa cgccatgggg agcaatctca gcccccaact ctgcctgatg
      181 ccctttatct tgggcctctt gtctggaggt gtgaccacca ctccatggtc tttggcctgg
      241 ccccagggat cctgctctct ggagggggta gagatcaaag gcggctcctt ccgacttctc
      301 caagagggcc aggcactgga gtacgtgtgt ccttctggct tctacccgta ccctgtgcag
      361 acacgtacct gcagatctac ggggtcctgg agcaccctga agactcaaga ccaaaagact
      421 gtcaggaagg cagagtgcag agcaatccac tgtccaagac cacacgactt cgagaacggg
      481 gaatactggc cccggtctcc ctactacaat gtgagtgatg agatctcttt ccactgctat
      541 gacggttaca ctctccgggg ctctgccaat cgcacctgcc aagtgaatgg ccggtggagt
      601 gggcagacag cgatctgtga caacggagcg gggtactgct ccaacccggg catccccatt
      661 ggcacaagga aggtgggcag ccagtaccgc cttgaagaca gcgtcaccta ccactgcagc
      721 cgggggctta ccctgcgtgg ctcccagcgg cgaacgtgtc aggaaggtgg ctcttggagc
      781 gggacggagc cttcctgcca agactccttc atgtacgaca cccctcaaga ggtggccgaa
      841 gctttcctgt cttccctgac agagaccata gaaggagtcg atgctgagga tgggcacggc
      901 ccaggggaac aacagaagcg gaagatcgtc ctggaccctt caggctccat gaacatctac
      961 ctggtgctag atggatcaga cagcattggg gccagcaact tcacaggagc caaaaagtgt
     1021 ctagtcaact taattgagaa ggtggcaagt tatggtgtga agccaagata tggtctagtg
     1081 acatatgcca cataccccaa aatttgggtc aaagtgtctg aagcagacag cagtaatgca
     1141 gactgggtca cgaagcagct caatgaaatc aattatgaag accacaagtt gaagtcaggg
     1201 actaacacca agaaggccct ccaggcagtg tacagcatga tgagctggcc agatgacgtc
     1261 cctcctgaag gctggaaccg cacccgccat gtcatcatcc tcatgactga tggattgcac
     1321 aacatgggcg gggacccaat tactgtcatt gatgagatcc gggacttgct atacattggc
     1381 aaggatcgca aaaacccaag ggaggattat ctggatgtct atgtgtttgg ggtcgggcct
     1441 ttggtgaacc aagtgaacat caatgctttg gcttccaaga aagacaatga gcaacatgtg
     1501 ttcaaagtca aggatatgga aaacctggaa gatgttttct accaaatgat cgatgaaagc
     1561 cagtctctga gtctctgtgg catggtttgg gaacacagga agggtaccga ttaccacaag
     1621 caaccatggc aggccaagat ctcagtcatt cgcccttcaa agggacacga gagctgtatg
     1681 ggggctgtgg tgtctgagta ctttgtgctg acagcagcac attgtttcac tgtggatgac
     1741 aaggaacact caatcaaggt cagcgtagga ggggagaagc gggacctgga gatagaagta
     1801 gtcctatttc accccaacta caacattaat gggaaaaaag aagcaggaat tcctgaattt
     1861 tatgactatg acgttgccct gatcaagctc aagaataagc tgaaatatgg ccagactatc
     1921 aggcccattt gtctcccctg caccgaggga acaactcgag ctttgaggct tcctccaact
     1981 accacttgcc agcaacaaaa ggaagagctg ctccctgcac aggatatcaa agctctgttt
     2041 gtgtctgagg aggagaaaaa gctgactcgg aaggaggtct acatcaagaa tggggataag
     2101 aaaggcagct gtgagagaga tgctcaatat gccccaggct atgacaaagt caaggacatc
     2161 tcagaggtgg tcacccctcg gttcctttgt actggaggag tgagtcccta tgctgacccc
     2221 aatacttgca gaggtgattc tggcggcccc ttgatagttc acaagagaag tcgtttcatt
     2281 caagttggtg taatcagctg gggagtagtg gatgtctgca aaaaccagaa gcggcaaaag
     2341 caggtacctg ctcacgcccg agactttcac atcaacctct ttcaagtgct gccctggctg
     2401 aaggagaaac tccaagatga ggatttgggt tttctataag gggtttcctg ctggacaggg
     2461 gcgtgggatt gaattaaaac agctgcgaca acaaaaaaaa aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L15702. Human complement ...[gi:291921] Links  


LOCUS       HUMCOMFACB              2388 bp    mRNA    linear   PRI 16-MAR-1994
DEFINITION  Human complement factor B mRNA, complete cds.
ACCESSION   L15702
VERSION     L15702.1  GI:291921
KEYWORDS    complement factor; complement factor B.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2388)
  AUTHORS   Horiuchi,T., Kim,S., Matsumoto,M., Watanabe,I., Fujita,S. and
            Volanakis,J.E.
  TITLE     Human complement factor B: cDNA cloning, nucleotide sequencing,
            phenotypic conversion by site-directed mutagenesis and expression
  JOURNAL   Mol. Immunol. 30 (17), 1587-1592 (1993)
  MEDLINE   94067177
   PUBMED   8247029
COMMENT     Original source text: Homo sapiens (human).
FEATURES             Location/Qualifiers
     source          1..2388
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     5'UTR           1..40
     CDS             41..2335
                     /codon_start=1
                     /product="complement factor B"
                     /protein_id="AAA16820.1"
                     /db_xref="GI:291922"
                     /translation="MGSNLSPQLCLMPFILGLLSGGVTTTPWSLAQPQGSCSLEGVEI
                     KGGSFRLLQEGQALEYVCPSGFYPYPVQTRTCRSTGSWSTLKTQDQKTVRKAECRAIH
                     CPRPHDFENGEYWPRSPYYNVSDEISFHCYDGYTLRGSANRTCQVNGRWSGQTAICDN
                     GAGYCSNPGIPIGTRKVGSQYRLEDSVTYHCSRGLTLRGSQRRTCQEGGSWSGTEPSC
                     QDSFMYDTPQEVAEAFLSSLTETIEGVDAEDGHGPGEQQKRKIVLDPSGSMNIYLVLD
                     GSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADW
                     VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWNRTRHVIILMTDGLH
                     NMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQ
                     HVFKVKDMENLEDVFYQMIDESQSLSLCGMVWEHRKGTDYHKQPWQAKISVIRPSKGH
                     ESCMGAVVSEYFVLTAAHCFTVDDKEHSIKVSVGGEKRDLEIEVVLFHPNYNINGKKE
                     AGIPEFYDYDVALIKLKNKLKYGQTIRPICLPCTEGTTRALRLPPTTTCQQQKEELLP
                     AQDIKALFVSEEEKKLTRKEVYIKNGDKKGSCERDAQYAPGYDKVKDISEVVTPRFLC
                     TGGVSPYADPNTCRGDSGGPLIVHKRSRFIQVGVISWGVVDVCKNQKRQKQVPAHARD
                     FHINLFQVLPWLKEKLQDEDLGFL"
     sig_peptide     41..115
     3'UTR           2336..2388
     polyA_signal    2369..2374
BASE COUNT      630 a    601 c    649 g    508 t
ORIGIN      
        1 tcctgcccca ggcccagctt ctctcctgcc ttccaacgcc atggggagca atctcagccc
       61 ccaactctgc ctgatgccct ttatcttggg cctcttgtct ggaggtgtga ccaccactcc
      121 atggtctttg gcccagcccc agggatcctg ctctctggag ggggtagaga tcaaaggcgg
      181 ctccttccga cttctccaag agggccaggc actggagtac gtgtgtcctt ctggcttcta
      241 cccgtaccct gtgcagacac gtacctgcag atctacgggg tcctggagca ccctgaagac
      301 tcaagaccaa aagactgtca ggaaggcaga gtgcagagca atccactgtc caagaccaca
      361 cgacttcgag aacggggaat actggccccg gtctccctac tacaatgtga gtgatgagat
      421 ctctttccac tgctatgacg gttacactct ccggggctct gccaatcgca cctgccaagt
      481 gaatggccgg tggagtgggc agacagcgat ctgtgacaac ggagcggggt actgctccaa
      541 cccgggcatc cccattggca caaggaaggt gggcagccag taccgccttg aagacagcgt
      601 cacctaccac tgcagccggg ggcttaccct gcgtggctcc cagcggcgaa cgtgtcagga
      661 aggtggctct tggagcggga cggagccttc ctgccaagac tccttcatgt acgacacccc
      721 tcaagaggtg gccgaagctt tcctgtcttc cctgacagag accatagaag gagtcgatgc
      781 tgaggatggg cacggcccag gggaacaaca gaagcggaag atcgtcctgg acccttcagg
      841 ctccatgaac atctacctgg tgctagatgg atcagacagc attggggcca gcaacttcac
      901 aggagccaaa aagtgtctag tcaacttaat tgagaaggtg gcaagttatg gtgtgaagcc
      961 aagatatggt ctagtgacat atgccacata ccccaaaatt tgggtcaaag tgtctgaagc
     1021 agacagcagt aatgcagact gggtcacgaa gcagctcaat gaaatcaatt atgaagacca
     1081 caagttgaag tcagggacta acaccaagaa ggccctccag gcagtgtaca gcatgatgag
     1141 ctggccagat gacgtccctc ctgaaggctg gaaccgcacc cgccatgtca tcatcctcat
     1201 gactgatgga ttgcacaaca tgggcgggga cccaattact gtcattgatg agatccggga
     1261 cttgctatac attggcaagg atcgcaaaaa cccaagggag gattatctgg atgtctatgt
     1321 gtttggggtc gggcctttgg tgaaccaagt gaacatcaat gctttggctt ccaagaaaga
     1381 caatgagcaa catgtgttca aagtcaagga tatggaaaac ctggaagatg ttttctacca
     1441 aatgatcgat gaaagccagt ctctgagtct ctgtggcatg gtttgggaac acaggaaggg
     1501 taccgattac cacaagcaac catggcaggc caagatctca gtcattcgcc cttcaaaggg
     1561 acacgagagc tgtatggggg ctgtggtgtc tgagtacttt gtgctgacag cagcacattg
     1621 tttcactgtg gatgacaagg aacactcaat caaggtcagc gtaggagggg agaagcggga
     1681 cctggagata gaagtagtcc tatttcaccc caactacaac attaatggga aaaaagaagc
     1741 aggaattcct gaattttatg actatgacgt tgccctgatc aagctcaaga ataagctgaa
     1801 atatggccag actatcaggc ccatttgtct cccctgcacc gagggaacaa ctcgagcttt
     1861 gaggcttcct ccaactacca cttgccagca acaaaaggaa gagctgctcc ctgcacagga
     1921 tatcaaagct ctgtttgtgt ctgaggagga gaaaaagctg actcggaagg aggtctacat
     1981 caagaatggg gataagaaag gcagctgtga gagagatgct caatatgccc caggctatga
     2041 caaagtcaag gacatctcag aggtggtcac ccctcggttc ctttgtactg gaggagtgag
     2101 tccctatgct gaccccaata cttgcagagg tgattctggc ggccccttga tagttcacaa
     2161 gagaagtcgt ttcattcaag ttggtgtaat cagctgggga gtagtggatg tctgcaaaaa
     2221 ccagaagcgg caaaagcagg tacctgctca cgcccgagac tttcacatca acctctttca
     2281 agtgctgccc tggctgaagg agaaactcca agatgaggat ttgggttttc tataaggggt
     2341 ttcctgctgg acaggggcgt gggattgaat taaaacagct gcgacaac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AL121735. Isoform of human ...[gi:6012990] Links  


LOCUS       HS224A62B               1213 bp    mRNA    linear   PRI 04-OCT-1999
DEFINITION  Isoform of human GTP-binding protein G25K.
ACCESSION   AL121735
VERSION     AL121735.1  GI:6012990
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1213)
  AUTHORS   Rhodes,S. and Huckle,E.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-OCT-1999) E-mail contact: humquery@sanger.ac.uk
COMMENT     This cDNA sequence was assembled from public domain ESTs and single
            pass sequencing reads from expressed DNA templates, aligned to the
            genomic DNA sequence from the bacterial clone 224A6 (AL031281). The
            EST sequences listed match this sequence with an identity of at
            least 95% between the coordinates shown.
            Further information can be found at
            http://www.sanger.ac.uk/HGP/Chr1/ Non-experimentally determined
            gene with isoforms dJ224A6.C2.1(HS224A62A), dJ224A6.C2.3(HS224A62C)
            and  dJ224A6.C2.4(HS224A62D) Sanger Centre name : dJ224A6.C1.2.2.
FEATURES             Location/Qualifiers
     source          1..1213
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="1"
                     /map="1p35.1-36.23"
     exon            1..54
                     /number=1
     misc_feature    12..252
                     /note="matches EST D54120"
     misc_feature    16..569
                     /note="matches EST AA424269 from clone 760101"
     misc_feature    17..272
                     /note="matches EST AA380865"
     misc_feature    28..306
                     /note="matches EST AA134643 from clone 532058"
     misc_feature    join(38..502,502..556)
                     /note="matches EST AI280348 from clone IMAGE:1879967"
     misc_feature    39..351
                     /note="matches EST AA362118"
     misc_feature    40..593
                     /note="matches EST AL037750 from clone DKFZp564C087"
     misc_feature    46..433
                     /note="matches EST T23834 from clone
                     b4HB3MA-Cot8-HAP-Ft-286"
     misc_feature    47..401
                     /note="matches EST D53086"
     misc_feature    49..391
                     /note="matches EST H73472 from clone 214563"
     misc_feature    50..446
                     /note="matches EST H16069 from clone 48569"
     misc_feature    50..306
                     /note="matches EST W79849 from clone 346498"
     misc_feature    51..391
                     /note="matches EST H05221 from clone 45208"
     misc_feature    52..512
                     /note="matches EST AA101993 from clone 489759"
     misc_feature    52..341
                     /note="matches EST AA043002 from clone 486843"
     misc_feature    join(54..279,418..558)
                     /note="matches EST AA054501 from clone 489364"
     misc_feature    54..454
                     /note="matches EST AA405745 from clone 742989"
     misc_feature    complement(join(54..85,146..243,242..390))
                     /note="matches EST AA405996 from clone 742991"
     misc_feature    55..446
                     /note="matches EST R15868 from clone 53015"
     exon            55..209
                     /number=2
     misc_feature    join(57..535,533..575)
                     /note="matches EST AA053878 from clone 380569"
     misc_feature    join(59..569,569..589)
                     /note="matches EST AL048769 from clone DKFZp566N013"
     misc_feature    59..535
                     /note="matches EST AA434195 from clone 770280"
     misc_feature    66..580
                     /note="matches EST AA041660 from clone 475213"
     misc_feature    66..472
                     /note="matches EST C89016 from clone 01B00056NC10"
     misc_feature    70..555
                     /note="matches EST AA452681 from clone 788465"
     misc_feature    join(100..562,561..595)
                     /note="matches EST T27915"
     CDS             105..680
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="CAB57326.1"
                     /db_xref="GI:6012991"
                     /translation="MQTIKCVVVGDGAVGKTCLLISYTTNKFPSEYVPTVFDNYAVTV
                     MIGGEPYTLGLFDTAGQEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENVKEKWVPEIT
                     HHCPKTPFLLVGTQIDLRDDPSTIEKLAKNKQKPITPETAEKLARDLKAVKYVECSAL
                     TQKGLKNVFDEAILAALEPPEPKKSRRCVLL"
     misc_feature    141..561
                     /note="matches EST AA272519 from clone 737098"
     exon            210..282
                     /number=3
     misc_feature    complement(211..930)
                     /note="matches EST AI937197 from clone IMAGE:2467500"
     misc_feature    225..584
                     /note="matches EST W16245 from clone 334178"
     misc_feature    complement(248..779)
                     /note="matches EST AA777840 from clone 448712"
     misc_feature    complement(join(251..271,269..296,310..930))
                     /note="matches EST AI421848 from clone IMAGE:2103192"
     misc_feature    complement(join(259..378,384..409,407..572))
                     /note="matches EST AA699668 from clone 446871"
     misc_feature    complement(join(268..590,642..929))
                     /note="matches EST AI829093 from clone IMAGE:2405113"
     misc_feature    complement(268..928)
                     /note="matches EST AI692933 from clone IMAGE:2338788"
     misc_feature    complement(280..783)
                     /note="matches EST AI952237 from clone IMAGE:2547056"
     exon            283..392
                     /number=4
     misc_feature    complement(295..779)
                     /note="matches EST AI148395 from clone IMAGE:1709613"
     misc_feature    complement(304..561)
                     /note="matches EST AI581884 from clone IMAGE:2173957"
     misc_feature    join(329..664,781..806)
                     /note="matches EST AI789078 from clone IMAGE:1972466"
     misc_feature    329..665
                     /note="matches EST AI315838 from clone IMAGE:1923065"
     misc_feature    329..664
                     /note="matches EST AI956373 from clone IMAGE:2136309"
     misc_feature    329..660
                     /note="matches EST AI788304 from clone IMAGE:1973005"
     misc_feature    complement(334..535)
                     /note="matches EST AU016167 from clone J0721E05"
     misc_feature    complement(join(356..416,413..474,470..930))
                     /note="matches EST AI203516 from clone IMAGE:1732397"
     misc_feature    join(372..627,624..656)
                     /note="matches EST AA033571 from clone 471162"
     misc_feature    complement(join(384..414,442..611,608..930))
                     /note="matches EST H97495 from clone 251935"
     exon            393..590
                     /number=5
     misc_feature    complement(396..930)
                     /note="matches EST AI122642 from clone IMAGE:1693852"
     misc_feature    complement(418..930)
                     /note="matches EST AI127477 from clone IMAGE:1708267"
     misc_feature    complement(421..931)
                     /note="matches EST AI076212 from clone IMAGE:1670672"
     misc_feature    join(422..664,665..777)
                     /note="matches EST W07693 from clone 300925"
     misc_feature    complement(440..930)
                     /note="matches EST AI283633 from clone IMAGE:1864505"
     misc_feature    450..583
                     /note="matches EST W58841 from clone 371707"
     misc_feature    450..534
                     /note="matches EST AA612466 from clone 1064595"
     misc_feature    complement(452..940)
                     /note="matches EST AI138687 from clone IMAGE:1710422"
     misc_feature    complement(458..936)
                     /note="matches EST AI092444 from clone IMAGE:1683429"
     misc_feature    complement(502..930)
                     /note="matches EST AI122672 from clone IMAGE:1693875"
     misc_feature    complement(508..930)
                     /note="matches EST AI184314 from clone IMAGE:1731855"
     misc_feature    complement(532..940)
                     /note="matches EST AI299273 from clone IMAGE:1901019"
     misc_feature    join(533..664,905..955)
                     /note="matches EST N31551 from clone 266055"
     misc_feature    join(560..584,583..611)
                     /note="matches EST AA028191 from clone 469877"
     misc_feature    complement(572..935)
                     /note="matches EST AI309937 from clone IMAGE:1913986"
     misc_feature    complement(582..940)
                     /note="matches EST AI823949 from clone IMAGE:2403462"
     exon            591..1213
                     /number=6
     misc_feature    complement(616..933)
                     /note="matches EST AI672641 from clone IMAGE:2345033"
     misc_feature    complement(632..932)
                     /note="matches EST T84051 from clone 114179"
     misc_feature    complement(756..930)
                     /note="matches EST AI913161 from clone IMAGE:2297671"
BASE COUNT      322 a    240 c    270 g    381 t
ORIGIN      
        1 ggcagccgag gagaccccgc gcagtgctgc caacgccccg gtggagaagc tgaggtcatc
       61 atcagatttg aaatatttaa agtggataca aaactatttc agcaatgcag acaattaagt
      121 gtgttgttgt gggcgatggt gctgttggta aaacatgtct cctgatatcc tacacaacaa
      181 acaaatttcc atcggaatat gtaccgactg tttttgacaa ctatgcagtc acagttatga
      241 ttggtggaga accatatact cttggacttt ttgatactgc agggcaagag gattatgaca
      301 gattacgacc gctgagttat ccacaaacag atgtatttct agtctgtttt tcagtggtct
      361 ctccatcttc atttgaaaac gtgaaagaaa agtgggtgcc tgagataact caccactgtc
      421 caaagactcc tttcttgctt gttgggactc aaattgatct cagagatgac ccctctacta
      481 ttgagaaact tgccaagaac aaacagaagc ctatcactcc agagactgct gaaaagctgg
      541 cccgtgacct gaaggctgtc aagtatgtgg agtgttctgc acttacacag aaaggcctaa
      601 agaatgtatt tgacgaagca atattggctg ccctggagcc tccagaaccg aagaagagcc
      661 gcaggtgtgt gctgctatga acatctctcc agagcccttt ctgcacagct ggtgtcggca
      721 tcatactaaa agcaatgttt aaatcaaact aaagattaaa aattaaaatt cgtttttgca
      781 ataatgacaa atgccctgca cctacccaca tgcactcgtg tgagacaagg cccataggta
      841 tggccccccc cttccccctc ccagtactag ttaattttga gtaattgtat tgtcagaaaa
      901 gtgattagta ctattttttt ttgttgtttc aaaaaaaaaa tttttgtgtg tgtgtgtttt
      961 tttttttttt ttttttgttg tttaaaagca aggcatgctt gtggatgact ctgtaacaga
     1021 ctaattggaa ttgttgaagc tgctccctgg ttccactctg gagagtaatc tgggacatct
     1081 tagtgttttg ttttgttttt ttccctcctc ttttttttgg gggggagtgt gtgtggggtt
     1141 tgttttttag tcttgttttt ttaattcatt aaccagtggt tagcccttaa ggggaggagg
     1201 acggattgat tcc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC003682. Homo sapiens, cel...[gi:13277547] Links  


LOCUS       BC003682                2136 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, cell division cycle 42 (GTP-binding protein, 25kD),
            clone MGC:5044 IMAGE:3457085, mRNA, complete cds.
ACCESSION   BC003682
VERSION     BC003682.1  GI:13277547
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2136)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 4 Row: p Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4826460.
FEATURES             Location/Qualifiers
     source          1..2136
                     /organism="Homo sapiens"
                     /db_xref="LocusID:998"
                     /db_xref="taxon:9606"
                     /clone="MGC:5044 IMAGE:3457085"
                     /tissue_type="Cervix, carcinoma"
                     /clone_lib="NIH_MGC_12"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     CDS             54..629
                     /codon_start=1
                     /product="cell division cycle 42 (GTP-binding protein,
                     25kD)"
                     /protein_id="AAH03682.1"
                     /db_xref="GI:13277548"
                     /translation="MQTIKCVVVGDGAVGKTCLLISYTTNKFPSEYVPTVFDNYAVTV
                     MIGGEPYTLGLFDTAGQEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENVKEKWVPEIT
                     HHCPKTPFLLVGTQIDLRDDPSTIEKLAKNKQKPITPETAEKLARDLKAVKYVECSAL
                     TQKGLKNVFDEAILAALEPPEPKKSRRCVLL"
BASE COUNT      637 a    387 c    431 g    681 t
ORIGIN      
        1 gaggtcatca tcagatttga aatatttaaa gtggatacaa aactatttca gcaatgcaga
       61 caattaagtg tgttgttgtg ggcgatggtg ctgttggtaa aacatgtctc ctgatatcct
      121 acacaacaaa caaatttcca tcggaatatg taccgactgt ttttgacaac tatgcagtca
      181 cagttatgat tggtggagaa ccatatactc ttggactttt tgatactgca gggcaagagg
      241 attatgacag attacgaccg ctgagttatc cacaaacaga tgtatttcta gtctgttttt
      301 cagtggtctc tccatcttca tttgaaaacg tgaaagaaaa gtgggtgcct gagataactc
      361 accactgtcc aaagactcct ttcttgcttg ttgggactca aattgatctc agagatgacc
      421 cctctactat tgagaaactt gccaagaaca aacagaagcc tatcactcca gagactgctg
      481 aaaagctggc ccgtgacctg aaggctgtca agtatgtgga gtgttctgca cttacacaga
      541 aaggcctaaa gaatgtattt gacgaagcaa tattggctgc cctggagcct ccagaaccga
      601 agaagagccg caggtgtgtg ctgctatgaa catctctcca gagccctttc tgcacagctg
      661 gtgtcggcat catactaaaa gcaatgttta aatcaaacta aagattaaaa attaaaattc
      721 gtttttgcaa taatgacaaa tgccctgcac ctacccacat gcactcgtgt gagacaaggc
      781 ccataggtat ggcccccccc ttccccctcc cagtactagt taattttgag taattgtatt
      841 gtcagaaaag tgattagtac tatttttttt tgttgtttca aaaaaaaaat ttttgtgtgt
      901 gtgttttttt tttttttttt ttttttttgt tgtttaaaag caaggcatgc ttgtggatga
      961 ctctgtaaca gactaattgg aattgttgaa gctgctccct ggttccactc tggagagtaa
     1021 tctgggacat cttagtgttt tgttttgttt ttttccctcc tctttttttt gggggggagt
     1081 gtgtgtgggg tttgtttttt agtcttgttt ttttaattca ttaaccagtg gttagccctt
     1141 aaggggagga ggacggattg attccacatt ccacttccta gatctagttt agaaaacatg
     1201 ttccccatct ggtgctctta ggaaggagta tagtaaatgc ctcatttaat aacatactcc
     1261 tttttgaaag ttgccttttc tctccaccct tgagtagatc cagtatttga tgaaactcat
     1321 gaaagtgggt ggagcccatc ttgcccctcc tcttttctag gacgcactat atgtgactgt
     1381 gactttcaag gacatttgtt tgccatttgc tgattttttt gggaagttaa tttctaactt
     1441 ctttcactga taaatgaaga aaagtattgc acctttgaaa tgcaccaaat gaattgagtt
     1501 tgtaattaaa aaaatttttt tccctttcag tcattgtctt atatgcttag catagatttg
     1561 cagctcagta gtatatggtg ttcctagaat gcagctgaag acctgttatg tagaggaaat
     1621 acgaggggtg gtgctagaag acagacatct gtggaatgat tcacatcctc tcaagttagg
     1681 aggatggagg cctgcttcat taagaagctg ggggtagggt gggggtgggg agaacactta
     1741 acaacatggg gaccagtcag gggaatcccc ttatttctgt tttgcatatg aggaacccta
     1801 gagcagccag gtgaggctct ctagtttaat aaaaatcatg gaaagactct taatgcagac
     1861 tcttcttaag tgttaatagg gattttttca gcttattttg gttgcagttt ccaattttta
     1921 aaaatgttga ggtaatcttt cccaccttcc caaacctaat tcttgtagat gcattagtgt
     1981 tgaaccaatg ctttctcatg tctcaattct ttgtatatgc attcttttca gatgtattaa
     2041 acaaacaaaa acccttcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     2101 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M57298. Human GTP-binding...[gi:183489] Links  


LOCUS       HUMGPG25K               1175 bp    mRNA    linear   PRI 08-NOV-1994
DEFINITION  Human GTP-binding protein G25K mRNA, complete cds.
ACCESSION   M57298
VERSION     M57298.1  GI:183489
KEYWORDS    GTP-binding protein G25K.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1175)
  AUTHORS   Shinjo,K., Koland,J.G., Hart,M.J., Narasimhan,V., Johnson,D.I.,
            Evans,T. and Cerione,R.A.
  TITLE     Molecular cloning of the gene for the human placental GTP-binding
            protein Gp (G25K): identification of this GTP-binding protein as
            the human homolog of the yeast cell-division-cycle protein CDC42
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87 (24), 9853-9857 (1990)
  MEDLINE   91088610
   PUBMED   2124704
COMMENT     Original source text: Human placenta, cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..1175
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="Unassigned"
                     /tissue_type="placenta"
     gene            1..1175
                     /gene="CDC42"
     CDS             70..645
                     /gene="CDC42"
                     /codon_start=1
                     /product="GTP-binding protein G25K"
                     /protein_id="AAA52592.1"
                     /db_xref="GI:183490"
                     /db_xref="GDB:G00-127-540"
                     /translation="MQTIKCVVVGDGAVGKTCLLISYTTNKFPSEYVPTVFDNYAVTV
                     MIGGEPYTLGLFDTAGQEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENVKEKWVPEIT
                     HHCPKTPFLLVGTQIDLRDDPSTIEKLAKNKQKPITPETAEKLARDLKAVKYVECSAL
                     TQKGLKNVFDEAILAALEPPEPKKSRRCVLL"
BASE COUNT      316 a    224 c    256 g    379 t
ORIGIN      
        1 ccccggtgga gaagctgagg tcatcatcag atttgaaata tttaaagtgg atacaaaatt
       61 atttcagcaa tgcagacaat taagtgtgtt gttgtgggcg atggtgctgt tggtaaaaca
      121 tgtctcctga tatcctacac aacaaacaaa tttccatcgg aatatgtacc gactgttttt
      181 gacaactatg cagtcacagt tatgattggt ggagaaccat atactcttgg actttttgat
      241 actgcagggc aagaggatta tgacagatta cgaccgctga gttatccaca aacagatgta
      301 tttctagtct gtttttcagt ggtctctcca tcttcatttg aaaacgtgaa agaaaagtgg
      361 gtgcctgaga taactcacca ctgtccaaag actcctttct tgcttgttgg gactcaaatt
      421 gatctcagag atgacccctc tactattgag aaacttgcca agaacaaaca gaagcctatc
      481 actccagaga ctgctgaaaa gctggcccgt gacctgaagg ctgtcaagta tgtggagtgt
      541 tctgcactta cacagaaagg cctaaagaat gtatttgacg aagcaatatt ggctgccctg
      601 gagcctccag aaccgaagaa gagccgcagg tgtgtgctgc tatgaacatc tctccagagc
      661 cctttctgca cagctggtgt cggcatcata ctaaaagcaa tgtttaaatc aaactaaaga
      721 ttaaaaatta aaattcgttt ttgcaataat gacaaatgcc ctgcacctac ccacatgcac
      781 tcgtgtgaga caaggcccat aggtatggcc ccccccttcc ccctcccagt actagttaat
      841 tttgagtaat tgtattgtca gaaaagtgat tagtactatt tttttttgtt gtttcaaaaa
      901 aaaaattttt gtgtgtctgt tttttttttt tttttttttt gttgtttaaa aggaaggcat
      961 gcttgtggat gactctgtaa cagactaatt ggaattgttg aagctgctcc ctggttccac
     1021 tctggagagt aatctgggac atcttagtgt tttgttttgt ttttttccct cctctttttt
     1081 ttggggggga gtgtgtgggg ggtttgtttt ttagtcttgt ttttttaatt cattaaccag
     1141 tggttaagcc cttaagggag gaggacggat tgatt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X91249. H.sapiens mRNA fo...[gi:1160185] Links  


LOCUS       HSWHITE                 2930 bp    mRNA    linear   PRI 21-AUG-1996
DEFINITION  H.sapiens mRNA for white gene protein.
ACCESSION   X91249
VERSION     X91249.1  GI:1160185
KEYWORDS    white gene.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Chen,H., Rossier,C., Lalioti,M.D., Lynn,A., Chakravarti,A.,
            Perrin,G. and Antonarakis,S.E.
  TITLE     Cloning of the cDNA for a human homologue of the Drosophila white
            gene and mapping to chromosome 21q22.3
  JOURNAL   Am. J. Hum. Genet. 59 (1), 66-75 (1996)
  MEDLINE   96256850
REFERENCE   2  (bases 1 to 2930)
  AUTHORS   Antonarakis,S.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-SEP-1995) S.E. Antonarakis, Div.of Medical Genetics,
            Univ. and Cantonal Hospital of Geneva, CMU, 1 Rue Michel-Servet,
            1211 Geneva, SWITZERLAND
FEATURES             Location/Qualifiers
     source          1..2930
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="21"
                     /map="21q22.3"
                     /tissue_type="retina"
                     /clone_lib="retina cDNA"
     gene            1..2930
                     /gene="white"
     CDS             31..2055
                     /gene="white"
                     /codon_start=1
                     /protein_id="CAA62631.1"
                     /db_xref="GI:1160186"
                     /db_xref="SWISS-PROT:P45844"
                     /translation="MAAFSVGTAMNASSYSAEMTEPKSVCVSVDEVVSSNMEATETDL
                     LNGHLKKVDNNLTEAQRFSSLPRRAAVNIEFRDLSYSVPEGPWWRKKGYKTLLKGISG
                     KFNSGELVAIMGPSGAGKSTLMNILAGYRETGMKGAVLINGLPRDLRCFRKVSCYIMQ
                     DDMLLPHLTVQEAMMVSAHLKLQEKDEGRREMVKEILTALGLLSCANTRTGSLSGGQR
                     KRLAIALELVNNPPVMFFDEPTSGLDSASCFQVVSLMKGLAQGGRSIICTIHQPSAKL
                     FELFDQLYVLSQGQCVYRGKVCNLVPYLRDLGLNCPTYHNPADFVMEVASGEYGDQNS
                     RLVRAVREGMCDSDHKRDLGGDAEVNPFLWHRPSEEVKQTKRLKGLRKDSSSMEGCHS
                     FSASCLTQFCILFKRTFLSIMRDSVLTHLRITSHIGIGLLIGLLYLGIGNETKKVLSN
                     SGFLFFSMLFLMFAALMPTVLTFPLEMGVFLREHLNYWYSLKAYYLAKTMADVPFQIM
                     FPVAYCSIVYWMTSQPSDAVRFVLFAALGTMTSLVAQSLGLLIGAASTSLQVATFVGP
                     VTAIPVLLFSGFFVSFDTIPTYLQWMSYISYVRYGFEGVILSIYGLDREDLHCDIDET
                     CHFQKSEAILRELDVENAKLYLDFIVLGIFFISLRLIAYLVLRYKIRAER"
     misc_feature    1141..1176
                     /gene="white"
                     /note="36bp are missing from some cDNA clones"
     repeat_region   2288..2303
                     /note="poly(T) polymorphism"
     polyA_signal    2910..>2930
                     /gene="white"
BASE COUNT      665 a    770 c    780 g    715 t
ORIGIN      
        1 gaattccggt ttcttcctaa aaaatgtctg atggccgctt tctcggtcgg caccgccatg
       61 aatgccagca gttactctgc agagatgacg gagcccaagt cggtgtgtgt ctcggtggat
      121 gaggtggtgt ccagcaacat ggaggccact gagacggacc tgctgaatgg acatctgaaa
      181 aaagtagata ataacctcac ggaagcccag cgcttctcct ccttgcctcg gagggcagct
      241 gtgaacattg aattcaggga cctttcctat tcggttcctg aaggaccctg gtggaggaag
      301 aaaggataca agaccctcct gaaaggaatt tccgggaagt tcaatagtgg tgagttggtg
      361 gccattatgg gtccttccgg ggccgggaag tccacgctga tgaacatcct ggctggatac
      421 agggagacgg gcatgaaggg ggccgtcctc atcaacggcc tgccccggga cctgcgctgc
      481 ttccggaagg tgtcctgcta catcatgcag gatgacatgc tgctgccgca tctcactgtg
      541 caggaggcca tgatggtgtc ggcacatctg aagcttcagg agaaggatga aggcagaagg
      601 gaaatggtca aggagatact gacagcgctg ggcttgctgt cttgcgccaa cacgcggacc
      661 gggagcctgt caggtggtca gcgcaagcgc ctggccatcg cgctggagct ggtgaacaac
      721 cctccagtca tgttcttcga tgagcccacc agcggcctgg acagcgcctc ctgcttccag
      781 gtggtctcgc tgatgaaagg gctcgctcaa gggggtcgct ccatcatttg caccatccac
      841 cagcccagcg ccaaactctt cgagctgttc gaccagcttt acgtcctgag tcaaggacaa
      901 tgtgtgtacc ggggaaaagt ctgcaatctt gtgccatatt tgagggattt gggtctgaac
      961 tgcccaacct accacaaccc agcagatttt gtcatggagg ttgcatccgg cgagtacggt
     1021 gatcagaaca gtcggctggt gagagcggtt cgggagggca tgtgtgactc agaccacaag
     1081 agagacctcg ggggtgatgc cgaggtgaac ccttttcttt ggcaccgccc ctctgaagag
     1141 gtaaagcaga caaaacgatt aaaggggttg agaaaggact cctcgtccat ggaaggctgc
     1201 cacagcttct ctgccagctg cctcacgcag ttctgcatcc tcttcaagag gaccttcctc
     1261 agcatcatga gggactcggt cctgacacac ctgcgcatca cctcgcacat tgggatcggc
     1321 ctcctcattg gcctgctgta cttggggatc gggaacgaaa ccaagaaggt cttgagcaac
     1381 tccggcttcc tcttcttctc catgctgttc ctcatgttcg cggccctcat gcctactgtt
     1441 ctgacatttc ccctggagat gggagtcttt cttcgggaac acctgaacta ctggtacagc
     1501 ctgaaggcct actacctggc caagaccatg gcagacgtgc cctttcagat catgttccca
     1561 gtggcctact gcagcatcgt gtactggatg acgtcgcagc cgtccgacgc cgtgcgcttt
     1621 gtgctgtttg ccgcgctggg caccatgacc tccctggtgg cacagtccct gggcctgctg
     1681 atcggagccg cctccacgtc cctgcaggtg gccactttcg tgggcccagt gacagccatc
     1741 ccggtgctcc tgttctcggg gttcttcgtc agcttcgaca ccatccccac gtacctacag
     1801 tggatgtcct acatctccta tgtcaggtat gggttcgaag gggtcatcct ctccatctat
     1861 ggcttagacc gggaagatct gcactgtgac atcgacgaga cgtgccactt ccagaagtcg
     1921 gaggccatcc tgcgggagct ggacgtggaa aatgccaagc tgtacctgga cttcatcgta
     1981 ctcgggattt tcttcatctc cctccgcctc attgcctatt tggtcctcag gtacaaaatc
     2041 cgggcagaga ggtaaaacac ctgaatgcca ggaaacagga agattagaca ctgtggccga
     2101 gggcacgtct agaatcgagg aggcaagcct gtgcccgacc gacgacacag agactcttct
     2161 gatccaaccc ctagaaccgc gttgggtttg tgggtgtctc gtgctcagcc actctgccca
     2221 gctgggttgg atcttctctc cattcccctt tctagcttta actaggaaga tgtaggcaga
     2281 ttggtggttt tttttttttt tttaacatac agaattttaa ataccacaac tggggcagaa
     2341 tttaaagctg caacacagct ggtgatgaga ggcttcctca gtccagtcgc tccttagcac
     2401 caggcaccgt gggtcctgga tggggaactg caagcagcct ctcagctgat ggctgcacag
     2461 tcagatgtct ggtggcagag agtccgagca tggagcgatt ccattttatg actgttgttt
     2521 ttcacatttt catctttcta aggtgtgtct cttttccaat gagaagtcat ttttgcaagc
     2581 caaaagtcga tcaatcgcat tcattttaag aaattatacc tttttagtac ttgctgaaga
     2641 atgattcagg gtaaatcaca tactttgttt agagaggcga ggggtttaac ccgagtcacc
     2701 cagctggtct catacataga cagcacttgt gaaggattga atgcaggttc caggtggagg
     2761 gaagacgtgg acaccatctc cactgagcca tgcagacatt tttaaaagct atacacaaaa
     2821 ttgtgagaag acattggcca actctttcaa agtctttctt tttccacgtg cttcttattt
     2881 taagcgaaat atattgtttg tttcttccta aaaaaaaaaa aaaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U28015. Human cysteine pr...[gi:975300] Links  


LOCUS       HSU28015                1400 bp    mRNA    linear   PRI 06-SEP-1995
DEFINITION  Human cysteine protease (ICErel-III) mRNA, complete cds.
ACCESSION   U28015
VERSION     U28015.1  GI:975300
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1400)
  AUTHORS   Munday,N.A., Vaillancourt,J.P., Ali,A., Casano,F.J., Miller,D.K.,
            Molineaux,S.M., Yamin,T.T., Yu,V.L. and Nicholson,D.W.
  TITLE     Molecular cloning and pro-apoptotic activity of ICErelII and
            ICErelIII, members of the ICE/CED-3 family of cysteine proteases
  JOURNAL   J. Biol. Chem. 270 (26), 15870-15876 (1995)
  MEDLINE   95318183
   PUBMED   7797592
REFERENCE   2  (bases 1 to 1400)
  AUTHORS   Munday,N.A., Vaillancourt,J.P., Ali,A., Casano,F.J., Miller,D.K.,
            Molineaux,S.M., Yamin,T., Yu,V.L. and Nicholson,D.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-MAY-1995) Jeff Aaronson, Bioinformatics, Merck & Co.,
            Inc., 126 E. Lincoln Ave., Rahway, NJ 07065, USA
COMMENT     On Sep 6, 1995 this sequence version replaced gi:903935.
FEATURES             Location/Qualifiers
     source          1..1400
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="THP-1, an acute monocitic human cell-line"
     gene            1..1400
                     /gene="ICErel-III"
     CDS             35..1291
                     /gene="ICErel-III"
                     /note="similar to interleukin-1 beta converting enzyme,
                     Swiss-Prot Accession Number P29466"
                     /codon_start=1
                     /product="cysteine protease"
                     /protein_id="AAA75172.1"
                     /db_xref="GI:903936"
                     /translation="MFKGILQSGLDNFVINHMLKNNVAGQTSIQTLVPNTDQKSTSVK
                     KDNHKKKTVKMLEYLGKDVLHGVFNYLAKHDVLTLKEEEKKKYYDAKIEDKALILVDS
                     LRKNRVAHQMFTQTLLNMDQKITSVKPLLQIEAGPPESAESTNILKLCPREEFLRLCK
                     KNHDEIYPIKKREDRRRLALIICNTKFDHLPARNGAHYDIVGMKRLLQGLGYTVVDEK
                     NLTARDMESVLRAFAARPEHKSSDSTFLVLMSHGILEGICGTAHKKKKPDVLLYDTIF
                     QIFNNRNCLSLKDKPKVIIVQACRGEKHGELWVRDSPASLAVISSQSSENLEADSVCK
                     IHEEKDFIAFCSSTPHNVSWRDRTRGSIFITELITCFQKYSCCCHLMEIFRKVQKSFE
                     VPQAKAQMPTIERATLTRDFYLFPGN"
     polyA_site      1400
                     /gene="ICErel-III"
                     /note="14 A nucleotides"
BASE COUNT      462 a    315 c    295 g    328 t
ORIGIN      
        1 cggcaaaaaa aaaaggcgta agaattttga agctatgttc aaaggtatcc ttcagagtgg
       61 attggataac ttcgtgataa accacatgct aaagaacaac gtggctggac aaacatctat
      121 ccagacccta gtacctaata cggatcaaaa gtcgaccagt gtaaaaaaag acaaccacaa
      181 aaaaaaaaca gttaagatgt tggaatacct gggcaaagat gttcttcatg gtgtttttaa
      241 ttatttggca aaacacgatg ttctgacatt gaaggaagag gaaaagaaaa aatattatga
      301 tgccaaaatt gaagacaagg ccctgatctt ggtagactct ttgcgaaaga atcgcgtggc
      361 tcatcaaatg tttacccaaa cacttctcaa tatggaccaa aagatcacca gtgtaaaacc
      421 tcttctgcaa atcgaggctg gaccacctga gtcagcagaa tctacaaata tactcaaact
      481 ttgtcctcgt gaagaattcc tgagactgtg taaaaaaaat catgatgaga tctatccaat
      541 aaaaaagaga gaggaccgca gacgcctggc tctcatcata tgcaatacaa agtttgatca
      601 cctgcctgca aggaatgggg ctcactatga catcgtgggg atgaaaaggc tgcttcaagg
      661 cctgggctac actgtggttg acgaaaagaa tctcacagcc agggatatgg agtcagtgct
      721 gagggcattt gctgccagac cagagcacaa gtcctctgac agcacgttct tggtactcat
      781 gtctcatggc atcctagagg gaatctgcgg aactgcgcat aaaaagaaaa aaccggatgt
      841 gctgctttat gacaccatct tccagatatt caacaaccgc aactgcctca gtctaaagga
      901 caaacccaag gtcatcattg tccaggcctg cagaggtgaa aaacatgggg aactctgggt
      961 cagagactct ccagcatcct tggcagtcat ctcttcacag tcatctgaga acctggaggc
     1021 agattctgtt tgcaagatcc acgaggagaa ggacttcatt gctttctgtt cttcaacacc
     1081 acataacgtg tcctggagag accgcacaag gggctccatc ttcattacgg aactcatcac
     1141 atgcttccag aaatattctt gctgctgcca cctaatggaa atatttcgga aggtacagaa
     1201 atcatttgaa gttccacagg ctaaagccca gatgcccacc atagaacgag caaccttgac
     1261 aagagatttc tacctctttc ctggcaattg aaaatgaaac cacaggcagc ccagccctcc
     1321 tctgtcaaca tcaaagagca catttaccag tatagcttgc atagtcaata tttggtattt
     1381 caataaaagt aaagactgta
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AL031117. Human DNA sequenc...[gi:4007557] Links  


LOCUS       HS914P14              155375 bp    DNA     linear   PRI 23-NOV-1999
DEFINITION  Human DNA sequence from clone 914P14 on chromosome Xq23 Contains
            calpain-like protease gene, DCX (doublecortin) ESTs, CA repeat,
            GSS, complete sequence.
ACCESSION   AL031117
VERSION     AL031117.1  GI:4007557
KEYWORDS    HTG; calpain-like protease; DCX.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 155375)
  AUTHORS   Bird,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-JAN-1999) Sanger Centre, Hinxton, Cambridgeshire,
            CB10 1SA, UK. E-mail enquiries: humquery@sanger.ac.uk Clone
            requests: clonerequest@sanger.ac.uk
COMMENT     On Dec 13, 1998 this sequence version replaced gi:3980459.
            During sequence assembly data is compared from overlapping clones.
            Where differences are found these are annotated as variations
            together with a note of the overlapping clone name. Note that the
            variation annotation may not be found in the sequence submission
            corresponding to the overlapping clone, as we submit sequences with
            only a small overlap as described above.
            This sequence is the entire insert of clone 914P14. This sequence
            has been finished according to sequence map criteria as follows. An
            attempt is made to resolve all sequencing problems, such as
            compressions and repeats, but not necessarily within known
            annotated human repeat sequence elements (e.g. Alu). Where the
            sequence is ambiguous, there is an annotation using the 'unsure'
            feature key.
            This sequence was generated from part of bacterial clone contigs of
            human chromosome X, constructed by the Sanger Centre Chromosome X
            Mapping Group. Further information can be found at
            http://www.sanger.ac.uk/HGP/ChrX
            914P14 is from the library RPCI5 constructed at the Roswell Park
            Cancer Institute by the group of Pieter de Jong. For further
            details see http://bacpac.med.buffalo.edu/ VECTOR: pCYPAC2.
FEATURES             Location/Qualifiers
     source          1..155375
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="q23"
                     /clone="RP5-914P14"
                     /clone_lib="RPCI-5"
     repeat_region   700..869
                     /note="MER5B repeat: matches 8..178 of consensus"
     repeat_region   915..1093
                     /note="MER5A repeat: matches 2..188 of consensus"
     repeat_region   1358..1638
                     /note="L2 repeat: matches 2426..2698 of consensus"
     repeat_region   1686..1957
                     /note="AluJb repeat: matches 1..292 of consensus"
     prim_transcript 2767..6839
                     /note="match: multiple ESTs
                     match: R26396 AA604464 AA388261"
     repeat_region   3506..3660
                     /note="MIR repeat: matches 37..241 of consensus"
     repeat_region   4650..5294
                     /note="L2 repeat: matches 2022..2700 of consensus"
     repeat_region   5309..5601
                     /note="AluSx repeat: matches 1..285 of consensus"
     repeat_region   5967..6413
                     /note="L2 repeat: matches 1338..1786 of consensus"
     prim_transcript complement(6913..7325)
                     /note="match: multiple ESTs
                     match: AI298548 AA963981 T80123"
     repeat_region   7326..7426
                     /note="L1M4 repeat: matches 3079..3185 of consensus"
     prim_transcript 9209..10150
                     /note="match: multiple ESTs
                     match: 5' EST AA442619 clone 757964
                     Paired with EST AA436862 matching this clone
                     match: 3' EST AA436862 clone 757964
                     Paired with EST AA442619 matching this clone"
     prim_transcript 10556..>10773
                     /note="match: 5' EST AA346948"
     prim_transcript 11912..12460
                     /note="match: multiple ESTs
                     match: 5' EST AA350070 AA366676"
     repeat_region   12028..12123
                     /note="MIR repeat: matches 47..144 of consensus"
     repeat_region   12304..12597
                     /note="AluSp repeat: matches 1..295 of consensus"
     prim_transcript <12598..13769
                     /note="match: multiple ESTs
                     match: W04409 H89195 D81239 T78123 D80511 W74278 R37857
                     AA350069 AA346947 H95608 AA416745 N75991 AA393381 H89094
                     AA398592"
     repeat_region   14233..14283
                     /note="L2 repeat: matches 2658..2708 of consensus"
     repeat_region   15734..15823
                     /note="L1MD1 repeat: matches 6113..6206 of consensus"
     misc_feature    complement(17157..17622)
                     /note="match: GSS AQ013740 clone R-23D15"
     repeat_region   18241..18374
                     /note="MIR repeat: matches 109..262 of consensus"
     repeat_region   19018..19718
                     /note="L2 repeat: matches 1976..2750 of consensus"
     repeat_region   19719..20428
                     /note="L2 repeat: matches 1419..2198 of consensus"
     repeat_region   21359..21603
                     /note="MIR repeat: matches 11..250 of consensus"
     repeat_region   21907..22808
                     /note="L1MB8 repeat: matches 4540..5439 of consensus"
     repeat_region   22814..22997
                     /note="L1MA9 repeat: matches 5866..6053 of consensus"
     repeat_region   22996..23195
                     /note="L1MA9 repeat: matches 6047..6253 of consensus"
     repeat_region   23231..23976
                     /note="L1MB8 repeat: matches 5423..6175 of consensus"
     repeat_region   24077..24206
                     /note="L1PA13 repeat: matches 6024..6156 of consensus"
     repeat_region   24207..24412
                     /note="AluY repeat: matches 98..300 of consensus"
     repeat_region   24413..24885
                     /note="L1PA13 repeat: matches 5570..6024 of consensus"
     repeat_region   24881..24930
                     /note="L1 repeat: matches 4669..4718 of consensus"
     repeat_region   24911..25130
                     /note="L1M1 repeat: matches 5362..5579 of consensus"
     repeat_region   25129..26339
                     /note="L1M1 repeat: matches 4117..5345 of consensus"
     repeat_region   26342..26628
                     /note="AluJo repeat: matches 1..309 of consensus"
     repeat_region   26716..26979
                     /note="L2 repeat: matches 2244..2516 of consensus"
     repeat_region   27082..27781
                     /note="L1MEc repeat: matches 1447..2139 of consensus"
     repeat_region   27783..27834
                     /note="26 copies 2 mer at 75% conserved"
     repeat_region   27905..28210
                     /note="L1ME1 repeat: matches 5830..6116 of consensus"
     repeat_region   28495..28662
                     /note="MIR repeat: matches 89..256 of consensus"
     repeat_region   28697..30768
                     /note="L1PA2 repeat: matches 4078..6142 of consensus"
     gene            complement(31512..56932)
                     /gene="dJ914P14.1"
     mRNA            complement(join(31512..33168,33777..33913,34280..34401,
                     34978..35180,35344..35466,37326..37512,37618..37695,
                     37958..38151,38716..38908,39417..39625,40681..40812,
                     50181..50360,56780..56932))
                     /gene="dJ914P14.1"
                     /note="match: AJ000388
                     match: multiple ESTs
                     match: AA457330 AA426163 AA776442 AA368247 H00674 R80385
                     AA659839 T49054 H04141 AA169169 AA424869 AA653469 AI033040
                     AA242811 AA242946 T40580 R93331 R31135 H94028 AI039360
                     R65675 AA457238 AA994519 H04140 R80995 AA330217 W70887
                     AA000749 W89762 R80491 T39424 AA048453 R97145 AA073358
                     H47262 H94116 H13962 R65674 C16980 AA332140 AA050181
                     AA043152 R81046 W59548 C18799 R28157 H00765 AA260476
                     H13963 W89762 AA332770 D78717 C17331 AA169715 AA050030
                     R30717 AA170896 R81313 W27397 Z78325 W30734 T30449
                     AA482901 AA314178 AA111051 AA065249 T51846 AI299726
                     AA374851 W29545 AI256810 H63862 Z78311 AA789977 AA908636
                     AJ006264 AA617839 AJ006248 AA065323 AA016129 U80740
                     AI115975 AJ006259 AA416901 T61453 AI110245 AA726125
                     AA304196 AA314177 AA104873 C06490 AA145456 AA065329 W23942
                     U80755 AA106210 AI086488 AA065259 W91417 U80748 D25698
                     U80757 AA802685 AI316137"
                     /evidence=not_experimental
     repeat_region   31712..31819
                     /note="L2 repeat: matches 2397..2502 of consensus"
     CDS             complement(join(32986..33168,33777..33913,34280..34401,
                     34978..35180,35344..35466,37326..37512,37618..37695,
                     37958..38151,38716..38908,39417..39625,40681..40812,
                     50181..50345))
                     /gene="dJ914P14.1"
                     /note="match: O88501 (RAT)"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="dJ914P14.1 (calpain-like protease CANPX)"
                     /protein_id="CAA19965.1"
                     /db_xref="GI:4158167"
                     /translation="MGPPLKLFKNQKYQELKQECIKDSRLFCDPTFLPENDSLFYNRL
                     LPGKVVWKRPQDICDDPHLIVGNISNHQLTQGRLGHKPMVSAFSCLAVQESHWTKTIP
                     NHKEQEWDPQKTEKYAGIFHFRFWHFGEWTEVVIDDLLPTINGDLVFSFSTSMNEFWN
                     ALLEKAYAKLLGCYEALDGLTITDIIVDFTGTLAETVDMQKGRYTELVEEKYKLFGEL
                     YKTFTKGGLICCSIESPNQEEQEVETDWGLLKGHTYTMTDIRKIRLGERLVEVFSAEK
                     VYMVRLRNPLGRQEWSGPWSEISEEWQQLTASDRKNLGLVMSDDGEFWMSLEDFCRNF
                     HKLNVCRNVNNPIFGRKELESVLGCWTVDDDPLMNRSGGCYNNRDTFLQNPQYIFTVP
                     EDGHKVIMSLQQKDLRTYRRMGRPDNYIIGFELFKVEMNRKFRLHHLYIQERAGTSTY
                     IDTRTVFLSKYLKKGNYVLVPTMFQHGRTSEFLLRIFSEVPVQLRELTLDMPKMSCWN
                     LARGYPKVVTQITVHSAEDLEKKYANETVNPYLVIKCGKEEVRSPVQKNTVHAIFDTQ
                     AIFYRRTTDIPIIVQVWNSRKFCDQFLGQVTLDADPSDCRDLKSLYLRKKGGPTAKVK
                     QGHISFKVISSDDLTEL"
     misc_feature    complement(<35889..>36109)
                     /gene="dJ914P14.1"
                     /note="match: STS AL032604"
     repeat_region   36110..36242
                     /note="AluJo/FRAM repeat: matches 167..299 of consensus"
     repeat_region   40428..40522
                     /note="L1MC4 repeat: matches 7858..7960 of consensus"
     repeat_region   41647..41795
                     /note="L2 repeat: matches 2370..2525 of consensus"
     repeat_region   42526..42818
                     /note="AluSq repeat: matches 1..293 of consensus"
     repeat_region   43423..43507
                     /note="MIR repeat: matches 60..144 of consensus"
     repeat_region   45243..45622
                     /note="L1MB7 repeat: matches 5782..6171 of consensus"
     repeat_region   47250..47408
                     /note="L2 repeat: matches 2554..2744 of consensus"
     repeat_region   47594..47746
                     /note="MIR repeat: matches 1..162 of consensus"
     repeat_region   47824..47908
                     /note="MIR repeat: matches 178..262 of consensus"
     repeat_region   48193..48424
                     /note="L2 repeat: matches 1006..1230 of consensus"
     repeat_region   49575..49796
                     /note="L1ME1 repeat: matches 5685..5904 of consensus"
     repeat_region   49912..50050
                     /note="L1MA9 repeat: matches 6169..6308 of consensus"
     repeat_region   50524..50609
                     /note="MIR repeat: matches 59..145 of consensus"
     repeat_region   52573..52668
                     /note="AluSg/x repeat: matches 211..306 of consensus"
     repeat_region   52699..52767
                     /note="MIR repeat: matches 67..148 of consensus"
     repeat_region   52867..52944
                     /note="L2 repeat: matches 2636..2710 of consensus"
     repeat_region   54762..54871
                     /note="MIR repeat: matches 101..206 of consensus"
     repeat_region   55228..55271
                     /note="MIR repeat: matches 63..110 of consensus"
     repeat_region   55668..55719
                     /note="26 copies 2 mer ca 77% conserved"
     repeat_region   59803..59911
                     /note="MIR repeat: matches 6..113 of consensus"
     repeat_region   59964..60055
                     /note="L2 repeat: matches 2433..2497 of consensus"
     repeat_region   60285..60381
                     /note="MIR repeat: matches 64..164 of consensus"
     repeat_region   60381..60479
                     /note="L2 repeat: matches 2606..2701 of consensus"
     repeat_region   61766..62113
                     /note="L2 repeat: matches 2359..2696 of consensus"
     repeat_region   62184..62231
                     /note="24 copies 2 mer ca 79% conserved"
     repeat_region   62892..63051
                     /note="MIR repeat: matches 63..217 of consensus"
     repeat_region   63242..63282
                     /note="L1MD2 repeat: matches 5711..5751 of consensus"
     repeat_region   63288..63674
                     /note="L1MD2 repeat: matches 5951..6321 of consensus"
     repeat_region   63929..64048
                     /note="L1PA2 repeat: matches 6025..6144 of consensus"
     repeat_region   64336..64489
                     /note="MER45 repeat: matches 2..155 of consensus"
     repeat_region   64871..65008
                     /note="MIR repeat: matches 75..211 of consensus"
     repeat_region   65253..65587
                     /note="L1MD repeat: matches 36..370 of consensus"
     repeat_region   65795..66288
                     /note="L1MD repeat: matches 1183..1697 of consensus"
     repeat_region   66464..66599
                     /note="L2 repeat: matches 2570..2710 of consensus"
     repeat_region   66692..66802
                     /note="L2 repeat: matches 2615..2745 of consensus"
     repeat_region   66803..67029
                     /note="MER20 repeat: matches 1..218 of consensus"
     repeat_region   67030..67096
                     /note="L2 repeat: matches 2554..2615 of consensus"
     repeat_region   67111..67282
                     /note="L2 repeat: matches 2229..2411 of consensus"
     repeat_region   67349..67405
                     /note="L2 repeat: matches 2606..2666 of consensus"
     repeat_region   70222..70376
                     /note="L2 repeat: matches 2540..2723 of consensus"
     repeat_region   71754..72455
                     /note="L1MB2 repeat: matches 5478..6167 of consensus"
     repeat_region   72456..72674
                     /note="L1MA2 repeat: matches 6081..6308 of consensus"
     repeat_region   72675..72979
                     /note="L1MB2 repeat: matches 5171..5478 of consensus"
     repeat_region   73001..73411
                     /note="L1MA8 repeat: matches 5909..6286 of consensus"
     repeat_region   74115..74734
                     /note="L2 repeat: matches 2053..2710 of consensus"
     repeat_region   74742..75000
                     /note="L2 repeat: matches 2262..2491 of consensus"
     repeat_region   75358..75383
                     /note="13 copies 2 mer tt 92% conserved"
     repeat_region   75537..75649
                     /note="MIR repeat: matches 36..146 of consensus"
     repeat_region   75672..75825
                     /note="MER5A repeat: matches 8..189 of consensus"
     repeat_region   75843..75961
                     /note="MER5A repeat: matches 44..161 of consensus"
     misc_feature    complement(75878..>76292)
                     /note="match: GSS AQ043347 clone 2327L24"
     repeat_region   77344..77530
                     /note="MER5A repeat: matches 4..185 of consensus"
     repeat_region   78553..78824
                     /note="L1MB3 repeat: matches 5891..6180 of consensus"
     misc_feature    complement(78725..79138)
                     /note="match: GSS AQ277826 clone 2519012"
     repeat_region   79285..79594
                     /note="AluSx repeat: matches 1..310 of consensus"
     prim_transcript <80188..>81880
                     /note="match: multiple ESTs
                     match: D80418 AA447880 D81781 AA621430 AA456396 AA626676
                     AA133236 D59885 AA330682 AA129714 AA491415 C14177 D81162
                     D60841 AI051851 AA122224 D61049 D81185 D59318 AA620421
                     H05398 AA829724 D80600 AA455927 D81758 AI091637 R43291
                     AA460623 AA976682 AA629867 D80599 AA447721 T04865 AA932391
                     D81043 D81184 D80066 AA461549"
     repeat_region   82880..83194
                     /note="AluSg repeat: matches 1..300 of consensus"
     repeat_region   84920..85061
                     /note="MIR repeat: matches 2..146 of consensus"
     misc_feature    complement(85210..85530)
                     /note="match: Z67083 STS containing (CA) repeat"
     repeat_region   85360..85385
                     /note="13 copies 2 mer tg 100% conserved
                     differs from Z67083"
     repeat_region   86399..86454
                     /note="28 copies 2 mer ca 73% conserved"
     repeat_region   87014..87065
                     /note="26 copies 2 mer tt 71% conserved"
     gene            complement(87068..119562)
                     /gene="DCX"
     mRNA            complement(join(87068..88152,99051..99145,117328..117450,
                     119460..>119562))
                     /gene="DCX"
                     /note="match: AF040255
                     match: multiple ESTs
                     match:D80418 AA447880 D81781 AA621430 AA456396 AA626676
                     AA133236 D59885 AA330682 AA129714 AA491415 C14177 D81162
                     D60841 AI051851 AA122224 D61049 D81185 D59318 AA620421
                     H05398 AA829724 D80600 AA455927 D81758 AI091637 R43291
                     AA460623 AA976682 AA629867 D80599 AA447721 T04865 AA932391
                     D81043 D81184 D80066 AA461549"
                     /evidence=not_experimental
     CDS             complement(join(88096..88152,99051..99145,117328..117450,
                     119460..>119562))
                     /gene="DCX"
                     /codon_start=1
                     /evidence=not_experimental
                     /product="DCX (doublecortin)"
                     /protein_id="CAA19966.1"
                     /db_xref="GI:4158168"
                     /db_xref="SPTREMBL:O43911"
                     /translation="VTCLHDFFGDDDVFIACGPEKFRYAQDDFSLDENECRVMKGNPS
                     ATAGPKASPTPQKTSAKSPGPMRRSKSPADSANGTSSSQLSTPKSKQSPISTPTSPGS
                     LRKHKDLYLPLSLDDSDSLGDSM"
     repeat_region   88257..88355
                     /note="L1MD1 repeat: matches 6107..6198 of consensus"
     repeat_region   88393..89452
                     /note="L1MD2 repeat: matches 5078..6134 of consensus"
     repeat_region   89502..89668
                     /note="AluSg/x repeat: matches 127..295 of consensus"
     repeat_region   90227..90367
                     /note="MER58C repeat: matches 79..89 of consensus"
     misc_feature    complement(90289..90725)
                     /gene="DCX"
                     /note="match: GSS AQ024419"
     repeat_region   90651..91467
                     /note="L1M4 repeat: matches 6..826 of consensus"
     repeat_region   91472..91708
                     /note="L1M4c repeat: matches 1856..1612 of consensus"
     repeat_region   91818..92105
                     /note="L1MA9 repeat: matches 5877..6165 of consensus"
     repeat_region   92137..93342
                     /note="L1MEc repeat: matches 2114..3009 of consensus"
     repeat_region   93336..93573
                     /note="L1MB7 repeat: matches 5919..6167 of consensus"
     repeat_region   93693..93772
                     /note="40 copies 2 mer at 70% conserved"
     repeat_region   95365..95448
                     /note="MIR repeat: matches 111..191 of consensus"
     repeat_region   95971..96459
                     /note="L1MC4 repeat: matches 7330..7844 of consensus"
     repeat_region   96460..96768
                     /note="AluY repeat: matches 1..308 of consensus"
     repeat_region   96769..96790
                     /note="L1MC4 repeat: matches 7844..7864 of consensus"
     repeat_region   96837..96974
                     /note="L1MC4 repeat: matches 7842..7975 of consensus"
     repeat_region   97824..98028
                     /note="MER63A repeat: matches 1..210 of consensus"
     repeat_region   98056..98154
                     /note="MIR repeat: matches 77..191 of consensus"
     misc_feature    complement(98480..98863)
                     /gene="DCX"
                     /note="match: GSS AQ040335 clone 2328G12"
     misc_feature    <98884..99292
                     /note="match: GSS AQ014440 clone 2299M22"
     repeat_region   99415..99551
                     /note="MIR repeat: matches 89..216 of consensus"
     repeat_region   100290..100343
                     /note="27 copies 2 mer ga 78% conserved"
     repeat_region   100656..100894
                     /note="L2 repeat: matches 2435..2710 of consensus"
     repeat_region   100895..101140
                     /note="L1MB6 repeat: matches 5932..6174 of consensus"
     repeat_region   101289..101584
                     /note="L1HS repeat: matches 4940..5235 of consensus"
     repeat_region   101600..102492
                     /note="L1HS repeat: matches 5254..6146 of consensus"
     repeat_region   102516..102634
                     /note="L1PA12 repeat: matches 3991..4106 of consensus"
     repeat_region   102635..102938
                     /note="AluSq repeat: matches 1..308 of consensus"
     repeat_region   102939..104313
                     /note="L1PA12 repeat: matches 4106..5495 of consensus"
     repeat_region   104314..104622
                     /note="AluSg repeat: matches 1..309 of consensus"
     repeat_region   104623..104746
                     /note="L1PA12 repeat: matches 5495..5615 of consensus"
     repeat_region   104751..104779
                     /note="L1PA3 repeat: matches 6118..6146 of consensus"
     repeat_region   104814..105368
                     /note="L1PA12 repeat: matches 5600..6165 of consensus"
     repeat_region   105417..105764
                     /note="L2 repeat: matches 1996..2351 of consensus"
     repeat_region   107170..107182
                     /note="MIR repeat: matches 249..261 of consensus"
     repeat_region   107183..107218
                     /note="L2 repeat: matches 2695..2730 of consensus"
     repeat_region   107220..107251
                     /note="16 copies 2 mer tc 94% conserved"
     repeat_region   108543..108673
                     /note="MIR repeat: matches 91..216 of consensus"
     misc_feature    109116..109437
                     /note="match: GSS B59117 clone 2014O20"
     misc_feature    <109117..>109584
                     /note="match: GSS AQ222872"
     repeat_region   109599..109820
                     /note="MIR repeat: matches 20..249 of consensus"
     repeat_region   110002..110041
                     /note="20 copies 2 mer tt 83% conserved"
     repeat_region   110143..110610
                     /note="L2 repeat: matches 1671..2201 of consensus"
     repeat_region   110872..111119
                     /note="MIR repeat: matches 13..261 of consensus"
     repeat_region   111791..111974
                     /note="MIR repeat: matches 59..234 of consensus"
     repeat_region   112031..112325
                     /note="MER33 repeat: matches 2..324 of consensus"
     repeat_region   113841..113997
                     /note="LTR37A repeat: matches 1..167 of consensus"
     repeat_region   114011..114097
                     /note="MIR repeat: matches 42..132 of consensus"
     repeat_region   114965..115163
                     /note="MLT1F repeat: matches 1..234 of consensus"
     repeat_region   115194..115424
                     /note="MLT1F repeat: matches 318..541 of consensus"
     repeat_region   115762..116105
                     /note="L2 repeat: matches 2348..2710 of consensus"
     misc_feature    complement(116015..>116403)
                     /gene="DCX"
                     /note="match: GSS AQ027136 clone 2313O8"
     misc_feature    complement(117724..>118085)
                     /gene="DCX"
                     /note="match: GSS AQ012519 clone 2298O8"
     misc_feature    <117990..118257
                     /note="match: STS L41120"
     repeat_region   118229..118354
                     /note="MIR repeat: matches 20..153 of consensus"
     repeat_region   120874..121113
                     /note="MIR repeat: matches 4..255 of consensus"
     repeat_region   121134..121208
                     /note="L2 repeat: matches 2577..2654 of consensus"
     repeat_region   121310..121377
                     /note="TAR1 repeat: matches 645..715 of consensus"
     repeat_region   122075..122386
                     /note="AluSc repeat: matches 1..308 of consensus"
     misc_feature    122703..123150
                     /note="match: STS L41120 clone 41L13"
     repeat_region   122979..123086
                     /note="MIR repeat: matches 34..146 of consensus"
     repeat_region   124183..124509
                     /note="L2 repeat: matches 2359..2702 of consensus"
     repeat_region   125251..125634
                     /note="L2 repeat: matches 1253..1593 of consensus"
     repeat_region   125635..126035
                     /note="MSTC repeat: matches 1..405 of consensus"
     repeat_region   126036..126524
                     /note="L2 repeat: matches 1593..2222 of consensus"
     repeat_region   126632..130108
                     /note="L1P3 repeat: matches 522..3664 of consensus"
     repeat_region   130109..130403
                     /note="AluSx repeat: matches 1..295 of consensus"
     repeat_region   130404..132119
                     /note="L1P3 repeat: matches 3664..5459 of consensus"
     repeat_region   132120..132680
                     /note="L1P1 repeat: matches 5119..5687 of consensus"
     repeat_region   132724..133161
                     /note="L1MB5 repeat: matches 5753..6180 of consensus"
     repeat_region   133174..133249
                     /note="L1MB8 repeat: matches 6095..6171 of consensus"
     repeat_region   133419..133484
                     /note="L2 repeat: matches 2622..2688 of consensus"
     misc_feature    complement(<133477..>133977)
                     /note="match: GSS AQ281346 clone R-80M21"
     repeat_region   134417..134568
                     /note="MIR repeat: matches 47..208 of consensus"
     repeat_region   135275..135310
                     /note="18 copies 2 mer aa 86% conserved"
     repeat_region   136202..136672
                     /note="LTR33 repeat: matches 23..518 of consensus"
     repeat_region   136877..136978
                     /note="MIR repeat: matches 50..141 of consensus"
     repeat_region   137162..137534
                     /note="THE1C repeat: matches 1..371 of consensus"
     repeat_region   138166..138433
                     /note="AluSg repeat: matches 1..265 of consensus"
     repeat_region   138436..138669
                     /note="MER20 repeat: matches 1..218 of consensus"
     repeat_region   138782..138866
                     /note="MER5A repeat: matches 31..119 of consensus"
     repeat_region   139230..139283
                     /note="27 copies 2 mer ga 91% conserved"
     repeat_region   139627..139795
                     /note="MIR repeat: matches 48..216 of consensus"
     repeat_region   140105..140232
                     /note="L2 repeat: matches 2582..2710 of consensus"
     repeat_region   141688..141750
                     /note="L2 repeat: matches 2679..2738 of consensus"
     repeat_region   142950..143038
                     /note="L2 repeat: matches 2414..2497 of consensus"
     repeat_region   143238..143345
                     /note="AluJo repeat: matches 36..134 of consensus"
     repeat_region   143600..143726
                     /note="MIR repeat: matches 1..130 of consensus"
     repeat_region   143911..144216
                     /note="AluJb repeat: matches 7..309 of consensus"
     repeat_region   144230..144536
                     /note="AluSx repeat: matches 1..306 of consensus"
     repeat_region   145173..146554
                     /note="L1MC/D repeat: matches 4302..5821 of consensus"
     repeat_region   146563..147155
                     /note="L1MA2 repeat: matches 5702..6308 of consensus"
     repeat_region   147324..147532
                     /note="MER20 repeat: matches 1..213 of consensus"
     repeat_region   149251..149491
                     /note="L2 repeat: matches 2552..2749 of consensus"
     repeat_region   150682..151099
                     /note="L1ME repeat: matches 5312..5772 of consensus"
     repeat_region   151288..151627
                     /note="AluJb repeat: matches 2..306 of consensus"
     repeat_region   151763..152061
                     /note="L1MA1 repeat: matches 5983..6296 of consensus"
     repeat_region   152257..152475
                     /note="MER20 repeat: matches 1..216 of consensus"
     repeat_region   153123..153187
                     /note="L2 repeat: matches 2632..2700 of consensus"
     repeat_region   155253..155307
                     /note="MLT2 repeat: matches 501..553 of consensus"
     repeat_region   155308..155339
                     /note="16 copies 2 mer ca 97% conserved"
     repeat_region   155341..155375
                     /note="MLT2 repeat: matches 463..497 of consensus"
BASE COUNT    48478 a  30710 c  31062 g  45125 t
ORIGIN      
        1 gatctacaga gagcggaaac gagtctggga aaggaagaaa gaggtgtgct tgtttaggtc
       61 tgggcttggc tacagagcag gaaaaaaagg aagaacagtg actttgggga agttgtagcc
      121 agctgaaatc aaagttttag gaggaggcag cgaatgcaat ttgtagcaac tgattttgtt
      181 ttaataaagg aataaagatg tttgggagga caggtgcttc aagtactaat catccaagaa
      241 gaaaagcagt ttgaacttaa ttgactatat ctgctatgat tgtgcactat tcgattttag
      301 cccaatcagt gattttctaa aagcatccca tctcctttta gtcaaaactg gatttgtcac
      361 tgacttgtat gttggcatgg aggtcctaat agctgccacc tgcatttcct gttcatcaat
      421 gaaggtattg aaaattacaa aggaatgaga gttgggggaa aggaagatgt tcatttcgta
      481 ggggtgaggg atgtatttac atgtgggttt ttaagtgtac agtactagaa ggaaaagcca
      541 aaagtttcca tgcattctag agaagcatat ggcattgtaa ttctcacaca ttgctatgca
      601 agacaatctc ctatggaatt ctgtggctct tacctttggt gtacatacga cgcagagatt
      661 ctgagttatt atgtgtgaat tgggctccag gttctcagcc agggattctc attgttggtt
      721 gtaaattgga atcacctgga gagcttaaaa atattggcac ctaggttctg ttcccagagg
      781 tttttactta atttttctgg gggtgagaac tgagcatggg gattttctaa ctctccccaa
      841 gtgattctaa tgttcagaca agtctgggac tctctagttt agcccattgc ttaattactt
      901 gggtggatta aactagttgt tcttaaagtg ttgtcctgga accagcagct tcaacatcaa
      961 caggacactt attagaatgc aaattcttgg gcccctgccc atatttactg aaactcttgg
     1021 ggtggctcag aaacctgtgt tttaacaaac cctctggatg attctgatgc atgctataaa
     1081 gttgacagcc actagattaa acggttgttc ctgacccttt ctaaagtaca tctgtgggag
     1141 ttgctaaaaa caagagatgt cttcaagagc tgatgaaaac tggcctgcaa aaggactttt
     1201 acagaaatac ctataataat aataatagaa gctagagtag agaaagggga aaggggatga
     1261 taatttgctg caaaacctgg gcaagttcag ggggaacatg tcttcatgag ttagggagtt
     1321 tgttgcatgg gctagtaata ccctggcagt acctgcaggc aaactcctac ttattcttca
     1381 ataccctttt aaaatgtcag cctcctctgt gaagctttct ctgagctgtg gtgtaaacta
     1441 cttcatcctt gtgtttactt agtgctttct agtttcctcg attagctctt ggcccactgt
     1501 actgtagctc tctatatatg tctgtctccc actaaactgc gtgatccttc aaagcaagaa
     1561 cactgtctta ttcatcctta gctctccagt gtctaataaa ctatctggta catactaaat
     1621 gctcttgaat tgtttgctta gaggggggat cagtaatggc cagctttctt gtatcaagaa
     1681 ccttcagctg ggcgcagtgg ctcacacctg taatcccagc actttgggag accgaggtgg
     1741 gtggatcatt taagttcggg agttcaagac cagcctggcc aacatgctga aaccccatct
     1801 ctactaacat ggtgacgcgt gcctgtaatc ccagctactc tggaggctga ggcgggagaa
     1861 tcacttgaac ccaggaaggc ggaggttgca gtgaactgag atcgtgcccc gcactccagc
     1921 ctgggcaaca gagcaagatt ccgtctcaaa aaaaaaacct tcaaaagatg ggtttacctt
     1981 tgtgaattaa tcgtagtcct gagttcaact ctgactagaa gactagagac attgattctc
     2041 tcaggccatc atgacttgat atctcttagt ggtggcctta taccacaaag ctctcagaga
     2101 aatatacaat tattagcaac ttattatttt accatgataa ttggcaggtg ccaagactac
     2161 tattgtcatt ttttcctcat atttcctctc attgtagatg caccttcatg ggaaatctaa
     2221 ttaaaatcaa tagggataca gctgccacag ggtaaaagta tttgagttct tgggccccac
     2281 agcttgcctc tctcatccag gcactggcat cttctggttc attgtacttc cctaattcta
     2341 taatgtactc tcttcattcc acatctattc acaaacattc ttcaaacaag ataagcactt
     2401 agcaaattac aattaccttt ccattgcact ttcccaaggt tatactcagg tcaaaatgaa
     2461 tggatgtatg tgtattcccc aagtctttag agaaacagat ttacaggcaa tcctttggtt
     2521 taattttcta gatacttaat cacatattcg ttatatgtgg gtttttactg gcttaactct
     2581 cagaggacaa ccatataacc ttatttttaa gtttttggag gtaggacgac acttcactgt
     2641 cacctcagtt actgccaata ttaatcattt ctaaatgtta tagaaacttt gaaatatatg
     2701 tgtcattctc taaagtggaa ttattaagac ccagaatgta aaccagaaga aactggattt
     2761 ttgccgtctg tgccaagcca tgattttaat atgtatgtgc tgaatggatt tttctatttt
     2821 tttctacagg cattgtatct gatagccact aatggaactc cagagctcca gaatcctgag
     2881 agactgtcag ctgtattccg tgacttttta aatcgctgtc ttgagatgga tgtggatagg
     2941 cgaggatctg ccaaggagct tttgcaggtg aaaataaaat aggacaatta caaagagagg
     3001 cattatactc agcttttcca tctcaatgtg tgacagagag ctctgtcaag agatgtctag
     3061 agttaggcta ttaccatttt attttagacc acaggcacat aactataggc acgcatctaa
     3121 tacatgattg attgacgtgt tgcctttctt ctgagaagtt ggcagtgtaa tctgtaaatc
     3181 tcttttgttc ttcctggagt cttattttgc aaggaaaaaa aaaaaagcaa tgtgaccctc
     3241 tggtgttggt gttttgttct aaactttact tgtgcccctg gatgttgtat gcaggtgctc
     3301 ttggccggag gtaattctct acataactat gcaattattt ccatggcatt tgctactcta
     3361 atgctttgct ggtcaaagcc cctttgcttg tgatgccaat tggcaggcac ctgcttgcct
     3421 gggtgagtac catccagtgg tgagtgcctc tgtaaccatt tgtccttttc acggctccca
     3481 gggattagta tacatgatga gaaataatac ttactatgta ccaaattctt tcaataaatt
     3541 atttcatttt tataacaact ttatagggta ggcattatta tcccatttta tagatgtaaa
     3601 agttgagact cacagctctt gaagggcaga gctgggatat gagccctgaa ttgtctgact
     3661 ttccaccaca cgtcacacac aaaccaattt agataaaaat atgtctgtat ttgataaagc
     3721 agtacgtgcc caggattata cacatagctt cactgctggg gagtgtttct cagggacatg
     3781 tatgaagaca tttaacatct aagaacaatg actgctattt ttctttgtga ttttgaacac
     3841 ataagccaat taaacagaag gtgcaacact tggttataca gcgcattaac cacagattgt
     3901 ctgttctagc ctactccctg aaagaaaaac ctgttcctgt tgcttctgaa ctccttccag
     3961 agacagagct gcacccaggc aaaaactatc ccaccctaac tcagatcata cctggaataa
     4021 acctatttgg agaaactgaa gagactacat cttttctttt ctgctacctg caagcatttc
     4081 acaggttatc atagggccct aaaatgtcaa atctggaatg gccctttgtt aatcctctaa
     4141 ttaatccagc ctcaaattac agatgaaaag acaacttttg agtacatcac ttttatattt
     4201 tttcacccct cccaccactc cacatacaca ttctgataca cattcctctt atatcttctt
     4261 gtaaggataa ttgaaggtgt aggttatctt caattggcta tccagataaa tgtggctatc
     4321 cagctaaata gatcattaaa tcaattatct agttaattca ttccccaaca atacttttat
     4381 atgaaccagt cagataactg atacaaaatt gtattttgag tctaaattga atagtttatt
     4441 gcgtcttttc attttatata tgtaagcaat agcctgtgtt tctctttgag tccatgcaac
     4501 tccattctct gactctagcc tattattgcc attttcaaga ttagaaaggg gagagatgca
     4561 tgacattcac cccactggaa ggctggagaa aagacacagg tcttcaatat ttgtgtgtct
     4621 ttctgtatga cctaagacag ttctgtccat caattgatat ttattgagca acttccattt
     4681 gtaaagcata agtgcagtgg gtgatacaaa cctattaaaa ggcatgattc ctccatttaa
     4741 gaagttcaca accttcttag gtacatgaaa atataatgca aagctgttta cagactcatt
     4801 caacccaaag agctgttcaa atagtaaatg atatggttga ttagagaaat acttctgtta
     4861 gctaggggtg atcagaaaat gtcttcaaac aggaaagagg tcttctgggc cttaaagaag
     4921 tgatcaggtt tagttaagtt gaagaaaata agggaaggga tttcaaatgg aaggaagtgt
     4981 tagggtataa ccaaagacat atgggcaagg tctacaggaa cagaacctag aaagtgtagc
     5041 tgaaataaag agttcaagta aagaagctgc aggggataat cctatggtgg gccttgaaca
     5101 ccaggcttaa gagttggggc tgtaggcagt ggggaaacac tgaagatgtc tgagggacaa
     5161 gcatgatgac aactttgttt atgaagatta atctggtagt atgtggtaag aattagcaac
     5221 aggaaaaaat ggagacaggg agaccaaggg ggagttatta taatttggga ttaagatgat
     5281 taggattgaa ttagaaaaag caagagatgg ccagacgtga ttgctatgat tgctatgcct
     5341 gtaatcccaa cactttgtga ggccgaggca ggcggatcac ctgaggtcag gtgtttgaga
     5401 ccagcctggc caacatggtg aaatgctgcc tctactaaaa atacaaaaat tagccaggca
     5461 tggtggcatg cgcctgtgat cctagctact caggagactg aggcaagaga atcacttgaa
     5521 cctgggaggc ggaagttgca gtgagctgag attgtgccac tgcactccag cctgggcaac
     5581 agagcaagag tccatctcaa ataataataa taataataat aataataata ataataataa
     5641 taataagcaa gagataaatg taagagacgt tgagaagaga gtacccggta gcttattgca
     5701 tttagaacat aaagaagagg aagacaaaag tgacttcagg ctatcaagtc aatggtagca
     5761 ttcagcaaga ggacaagtga tggaagacaa tataatagct aacatttctt gaccctatgt
     5821 aacttacaag ggattgccac ttacctcatt taaccatttt aataaacaga gatgaggatt
     5881 taacctaagt gctctgactc caaagcccct gtagcttaca ttacacaaga ttgccctgct
     5941 tgtgagactg taacataact ttttaaatgt tgaatttaag gtgacagaag aaaatcccgt
     6001 aggaaatatc cagcaggaaa actaggacca gagttcagga aagagatcag gtcatacact
     6061 atggatttag gagtcatctc cataaagctg atggtgggtt gaaaccatga aagagaatga
     6121 catctgtaag aaattaaatt aaaaaaagaa gatgatgaca gccaagttct tatgagtggg
     6181 aggtaggagg aggaagtgaa accagggaag aaaatggagc tggaatattc agtgcacatg
     6241 cgcatggaag agacccaata tggtagaatg tcacaggctt ctgaggcaaa aaaaataaat
     6301 taaaaaaatt caaagaagga agggatattc acttttatta agtactgcag aggggccaaa
     6361 agagaaatgg tagaggcaca gccgttgggt ttagcagtta gtaagccatt ggtttcagga
     6421 cagagttctc tagtggttaa ctaaatatac tttttccaag catgacagct tctctagata
     6481 aatatatcag tttaggaaaa tcagagagtc agactgttgc aaacacagtg aggttttaca
     6541 agatccctca gagtttctca ctctgctagt cagcttccag aggatttcat gacaatgaag
     6601 tacaaaaaca tctgaaggac aaaagctggc accgttttta attaaatgat atttaattgg
     6661 gtgttcaaaa tatatcttaa ttcaaaaggg ctgcttgaaa atgtaaagta atatacttgt
     6721 tggaggaaag aattccaata attcctcttt ttccttcctt ttgcagcatc catttttaaa
     6781 attagccaag cctctctcca gcctgactcc tctgattatc gctgcaaagg aagcaattaa
     6841 gaacagcagc cgctaagact gcaagcctta cacctcacca tctccctcat gagtaagact
     6901 gaaataaaac tctgctgcag gaaagatgga agaaaagaca gtcaaatggg gtgggggttc
     6961 tttacctttc aaatgaatag aaacttctta taagcctttt tcctactccc tcagattatg
     7021 taatttattt gtaagcctga atcgcagccc aaacagggca gcaatgttga agtgaccata
     7081 aagtggtcac ttccaccgtg aagcgaaaga gccagtagtg aatcccctca ttttgtgcat
     7141 tcactttgaa gaaaaaggtt tctcaaagat gcacactccc tcttcatagt gttgtgtttg
     7201 tttttaagtt agagagtagt ccctcttgca ttcaaacctc cttcaaaact ccttacccaa
     7261 tgtgatgttt ttcacttgca ttgtcattag atgtccagaa aaaaaaaaga tgtcaaaatg
     7321 tttttctaaa aaaagaaagc aaaaaaagca aggcaaaaaa aaaaaaaaaa acaaacaaaa
     7381 acaaaaacaa aacaaaaaca agcaaacaaa aaataccaga gcaagtactg tgtgaacatg
     7441 tggaagtcca tgccctaata gagttgcaat tttttattct tcttctatag tggtggcttg
     7501 gtttgtgtac ctatttttct gcatttgtat tggaaaaggt ttcttttaag acattttcca
     7561 aaagtggaga ggaatatgtg tgttcaggaa gggctttcaa aaaactgtat atctaaataa
     7621 agctcaaacg gtgaaatcct gtcacatttt cacaatgatg cttaaaagat aattgagtaa
     7681 accaggttgt taatctcctt aatacctgaa agaggacaca ctgaaactga aactgtgaca
     7741 tcctgctagg tgagttcagg ttctgaacct aggaaatcct cataggagaa accacattta
     7801 aacaaagatg ggactttctc tgagagccaa aaccagataa atgtagaata ctgaaatcct
     7861 tgttggacat taagtaaaca aagataatga tacctaaatt aatcctctct tgtgcttatg
     7921 aaacatatgc actgtaaaat aggcatacca ggaggaaata gatacattaa tcatcattta
     7981 cttatgatac aaattattta ttttgacaat ttataacgtt taaaaaagtt ttttaaagat
     8041 ctagagaaag gtgatatagt aaacattcaa ctctgtaaga aatgggaggt cagtgaaggc
     8101 tacatcccaa tcaatatttg gctctaagta cctcttccca tttttcctat gtatcaccta
     8161 tttctgtttc ggaatatggt gtgttcatgc ttagttcttt gggcttttga atatcaaaag
     8221 catattcata aatgtcttga aattctctcc agtggaaaat aattttaact tacaatcata
     8281 tcccaagaaa tgtcagtccg acagaattcc ttatatgact tggggaaaat aacaaaattt
     8341 gactactatt tcaccatata tctatttatt aaaaaattca acagttggca cttcctgaat
     8401 cttctgagag tagaaaaata tctgcggagt gtctgtgtag aaaaggatat gcctctcttt
     8461 tgagtgtatt gacaattttg taaattacag aaagttgttt ctctaagcct ttgaaaaact
     8521 aacaatttgt gttatagaag gcttcttaat ttgcagtata aaagaatcta aacagaactt
     8581 atgtacattc agccagaagg ggaaagagat cagttacata ggcctctctc cttctttgcc
     8641 aaggtacatc catccatcta accatccata tatccacatc ttaaaatgaa agcactttct
     8701 ttagagtttc agcaaactat atagtgtacg tgtttatgtt caggagatga ccccactggt
     8761 gtatttccta ttttccctat tgttttcttt gactgtaaaa gttgggagag gcttgacctc
     8821 ctccccttga aaatgtccac agtgggataa aacaacaaat gtgaaaagaa aatgaaacgg
     8881 taatattaat ttgaagcata ctatgttata ctttgcaaaa acgaatctgg gcctgtaatt
     8941 tttaatgcca cactgctcta atgagagaga gaggccttaa ttttgatttc atttaaaaat
     9001 aagtacttta aaaaattttt cactcatagt gccgggaaat tcaatgaaat cctgggatgc
     9061 aaataaaaat cagtacatta gtgactgtgt cctgccagtg gagagagccc aatacctggt
     9121 taggaagccc tattcattag ttagcatccc ttacatgttg agaaggcctt ttttttgttg
     9181 ttattttgga gaccttggag cagtgaccct tcagatcact gtaggcagag aaatggcttc
     9241 tctcttatgc tttcagttca gcatattaac aatgaggagc caggtacttc tttactacca
     9301 ctttgtacca agatttgata ataatatatc ccaggaggca ttacttttat aaatttgtat
     9361 tcatgtaaat tttcaaatga gaacagcttc taaagcccct tccctgtatt ggagagttat
     9421 gtatatttct aataagtatt agaaagaagc tgtttctcat gccacagtga tgctgaagga
     9481 ttcacatttg gtacaatcga gtaacttgaa cgccagattg ttaacagttt attctctttc
     9541 cctggatttt taagctcatc ttgacacagg tgagtctatc caaatctttg atgttgctag
     9601 tgtgccctga gataacgagg gcacatcttt caatgttgat tccaaaatgt cctgagttag
     9661 gaatagggca gtgggaaagt cagggaaggg tgagaagcac agtagagatt atttatttaa
     9721 aaaaggaaag aacgttaatg ttgttagcaa ggatccagtg cgttgtcata atcccatgag
     9781 gattttcaga tgacacaatc ccctcaaatc agtcaccatg ttgggtaatg acttcgttct
     9841 tgctgatctc gtgtgtgtgt cattgtaaat atttgtgtgt ccatgttcca ttttggctac
     9901 tggatggcca agccatgtaa gaagatttaa ctcaagtatt tattctttat gttattcaga
     9961 tttctttcag gcttgtgaac tgcaccccaa tgtttgagtt taaccacctg atccttacat
    10021 ctatccctcc ccggtgaagc acattccatt gctaaaagaa aaagaaacac gaaattgctt
    10081 cctgttgtct gtataactgt tttgatagtt tgagatattt gtctataaat gatatttctc
    10141 agctcaaaga tcgtgtaaat aattatattc ctttgctcaa tgggtttatt tctaatgagg
    10201 ctgccagttc tgagagattc tataatatca cttttaaata acataaacag ggattacaac
    10261 tatgtaaaaa gaaatgcata tggacaaaga ctgggaacac agataattga aatcagttgt
    10321 gttagggtgg tggaattatg tgaatttttt tttcttttta aaattttatt tgatattgtt
    10381 ataatattgc tttacaataa ataaacagca gaaagggaac tatagacaca tagaaaagat
    10441 gccagaagca gatgccttct ggccagagcg cagagcatgc agggcagaga tatttgctag
    10501 ttacaattat tccataggct ttatgcttgc ctgggtgctg aggttggcac acgctcgggt
    10561 atggcacacg ctttcttagg agactattat ctataagtta aagctaggga gatgtcacta
    10621 ttagaactcc aaacacactc ttctgcttta aaacaggttg tctgccctct gctttggtat
    10681 ggcattcggg tgtctgtttt gtggttgctt tagattggag gggtgaccat tttattagcc
    10741 cccttgataa catctgttgc agatattgcc tttctggaac gttttaacag actctcaggt
    10801 tgaattttgg aggactagaa ggataaaatc cccagctccc accattttct tgtccaacag
    10861 gatattactg tatatcattc aggtaggatt cttcttttaa taaccaatag ggcaagtccc
    10921 actaatttca atagaagtta tgacttgcaa ttaaaagctg actttgaaat cattaaacaa
    10981 atatgtagga ctgtctctgc ctgttggcat tcagttatag ttctgttaat tttggcttgg
    11041 gatggtctcc atgtgctttt ttctgcctat ttataggttg tttgcagtag ttgtgatttt
    11101 taaagagcaa gggagaccat ctaaccaaag gataacttcc ttctaactca ccaaagaaat
    11161 tttaggtgag aactttaata atgaggtagt cacctcagat atgctgctta gtttcactaa
    11221 aagcagaccc tatacctaga gaagtcactg gctttttatt ggtcattctc aatacagaaa
    11281 tacttagggg agtcttaacc ctgccatccc cggttgaatc tcttggtctt tatctaagct
    11341 acttgcagtt aatattcagt taagcaaagg tatggccagt agtgcaagta tctcccagtc
    11401 tctgagctct gaacaagagg actgaaattc agcatttgta aactgacagt ttgatgggcc
    11461 tgggatttga agtgaactca gcacacaatt ctgaacgtgt atttgcatgt ggactgggaa
    11521 ggaaataaat gggaacttgg aaataatgga atatttctcc tatgaaagaa tttttcgtag
    11581 aagatttgtt tttgatataa tctttctgtt ggttagcttt tagtgttttc attccttttc
    11641 tgatccacac tcctttaagt gaccaaatga atataaccca acatgcattg ggaatgtgtt
    11701 taatattaaa caatgtctaa ctgaatctgc aaatgcggga actgagatat cacctccatg
    11761 tgcacacctg tgtgtacgag tattctatac aacttgtagc atttactgcc acttaattgg
    11821 gttgaacttg caagataaac ttttggaaac tgcttagtgc catcggagtc tcctttagaa
    11881 gctgccatca ggcaaatgct atcccataat accagcagta agcctggcaa catgttcaac
    11941 agatttagta cccaagagga aatcaacagc gatagtagag aatgagtcag atgtagtggg
    12001 ataaatacta gcctaggaag aaggagcccc ggagtctaat atgagcttta ttactaaatt
    12061 gctatgtgac gctaggcaag tcacttaacc tctccatggc tgtttcctca tctgtaaaat
    12121 aagtgtattg gactagatga tccttagggt ctttccaaaa gtctaacatt ctatggcatt
    12181 ataggttgcc ttgcaaattc agcctgctat agtgatggca aatatcacgt ttaagcctga
    12241 gtctcttatg ttgcagttaa ataaaagaac tatgtaagat gatttttaaa attcaagcaa
    12301 atgggccggg tgcggtggct catacctgta atcccagcac tttgggaggc caaggcaggc
    12361 ggatcacctg aggtcaggag ttcgagacca gcctgaccaa catagagaaa ccccatctct
    12421 actaaaaata caaaattagc cgggtgtggt ggcgggcgcc tgtaatccca gctacttggg
    12481 aggctgaggt gggagaatcg cttgaaccca ggaggcggag gttgtggtga gctgagatca
    12541 tgccattgca ctccagcctc ggcaacaaga gtgaaacttc gtctccaaaa aaaaaaactc
    12601 aagcaaatga agttcataat aataggggat gttgataaaa cttgtggcag ccttccaatt
    12661 catttacagt tgtttcgttt tgtttttgtt ttaatgtcca ttttctgttg actgttccca
    12721 gttttcattt tccatacagt ctgtatgtaa agtctggttt tcattaagct gtggccagta
    12781 tttgccacta caacagaaac acactgtcac acttgctaga atataactgt acttgagctt
    12841 ctcctttcct gtgaagtagt gctgggcttt ctagagttta attctcaagt ggcacaagat
    12901 agcagagccc atgcatttta atggctgaga ctgctaagag tgaacctaaa cacttacaag
    12961 ttgcagagag aaatgaaaaa gtaattacat gctattagca ttgagaaatg ttgacaaatt
    13021 aatttgttgg gaaccaaaga tagcatttct gatgacaact cccacagtga ttggccagtt
    13081 gtatgatgag tacactgctg gaaagagggt aaactgggag ttagtggatg gtcccaatgc
    13141 cctgcctaca gcagagtgcc aaccagccct gagtgcaaaa ttcaagttca atgtgtgtgc
    13201 ttgtgtgtgg tgtgctttat ggacccgcaa ataccatatt cattattgat gataagatct
    13261 tcacagaatc ctgtagctac taatgcattg agtttttaat ctcagtacat cagccaggag
    13321 gagccagatc acagggtagt gatgtctact gggattatac tcataacatc tacacaaaac
    13381 aagttgagaa ggatccacgt tttcattgtt tatcagaatt gtatctcatt tggctgagca
    13441 ttacttttgt cagaatgtgt tatctgtaaa ccatgtgtag tgaaattctt ctgtaacttt
    13501 ggattaaagg tatttatggt ctttttgttt gtttgatttt taagtaagtt atttcttttg
    13561 tagacctgct gatggtatgg ttccatcctt ctgacctcag catccaatct ttttaaggat
    13621 ttttgttttc aatattgtta ttttaaattg tggttgaagc aatagaaaat tgaaatatgg
    13681 attgtgcatg actgtgtctt gagtgtaaaa atattgcagt ttgaaacttg gacctaaagt
    13741 attgcaaata aaaatgacaa acatcaatga gctattgctt ctgtcttctg tcaccttccc
    13801 atccaccccc cttggcattg accattttac ctacttcaca ggttagaggg tatcaccatc
    13861 tcataggtct ggagtcaggg aaatgcccag ccttgtcagc catacaggaa atgctctgca
    13921 aattctgcta ctaaataact cggacccaga gtctccatct gatgtttaca tggaatagct
    13981 cagagaggtg aatgtgtata ctcaatgcag gtcagaagcc ttgcagatca caaatgtaga
    14041 tgaaggggaa ctaccaaaag aaatcacgtg tagtaaataa tttgtttttg atcatctttc
    14101 cacttggaaa attctgaaca cgcaaaacca tctgcacaat gtttagtttt gtcttatatc
    14161 tgataccaaa gtccagaggt cacaggaggc atttcaacca acattactga aaaccttcat
    14221 tcattaattt gctcacacat ttgaaaagca tttattgagc tgctgctaca tgtcagacac
    14281 tgtagatatt caaaaaatca cctgtaaatg tgactaaatg acagtaaagc ttgttaaaag
    14341 ctgctagatg cacctgttgg agtcaggttt gtcagaggct ctcttgagtc ctcaagactc
    14401 aaggacacca gcaatttggt aaactggagg tttgttccac tagtattact tgtacagtgg
    14461 gcatagtact ccttctagag aacgtgtata taattttgta cctttgtgta agtgcataca
    14521 cactggtact atattcctta caaatggatt atgtatctta ccaaatgtta tttggttgtt
    14581 ttctagttca ctgtacttac atgcaagctt taattttctc acttaatctc tccatgttga
    14641 aagccaaaaa gacattttct gtaaatatga gcgggcaagt gaatgagaat tccagcaact
    14701 aggatttagc caagttctct actgcccatg tgcataatta ctgcactgcc tctcttacta
    14761 acaagctcct gctaacaagc ctcacaccct tccagccttt cttgccaggt tactccaact
    14821 atttttctat tcctattaaa tgaaacagaa atactgtcta tgagttagca aattaccaat
    14881 atcttccatc acaaaataaa atttgaaggc attttccttt gtcacaatga cttggaggag
    14941 cagggcccaa acaatgataa atatattatt ccacagctca accttataag catgattgct
    15001 ggaactctag ccttatccaa gttccttacc tcctactgaa gtgaggaggg gctaatacct
    15061 gaagtgcccc acagatttta gttgttctcc acatcccatg aggtctttgc ccccaggcac
    15121 ttgggtggca tctgctggct tcccacacac ctgtgtcaag caccactact accactaccc
    15181 tctgctaatc ctgtaccagt gtcactgtcc agctgctcct gtgcttgcca ttctacttac
    15241 ctatatctgc aatatacaca gcagtccctg atgctgacag atctctggga tgattcaacc
    15301 actttgcagt tttacagctg actacttttt atttgttttc catacctgat ggcagaaacc
    15361 cttcccatgg aatcaaatca tttcactaca tctttggacc cattttgatg ttgatgttct
    15421 ttttcagttt ttggcacctc actgaaccag ggagcaagtt tggtcttggt tttttcccct
    15481 ctcccttact ggggcttgtc aggtcaactt ttgcatatta caggcccgct gcctaccata
    15541 gaggttaatc tgaagccttt gctatagcta ttgctttata cacatttata catgttctct
    15601 caagcagcca accttggagt aaaaaggaaa ttggcaatgt ggccctctta tacgtgtact
    15661 ttagtatttt acctagagaa cccttgtcct atttcctaga catatatatt tctgcaaagt
    15721 aagaagcaag gctatattac cattgagggt aaacagataa aaggtacaca ggatctctct
    15781 tcattcttac aattacatgt gatatacaat tatcttaaca atatttgcaa taagaaaagc
    15841 aatggactgc atggcatctt tgtttatgac atttcagagg tcatggtaat gatataggct
    15901 gggtatcatg gcccagccca cttctatctc caactaccta tactttggta tcaagaagtt
    15961 cctcacaaaa ttaattccac ctgtcaccag tcccacccac cttagatagt aggacatccc
    16021 agtttctcct cagccacttg tgatttctcc ctggggatgc agggaatgca aatatcattg
    16081 tttatgagtt cctccctgtg gacatgaatt gactggcttc tcagcctaca gtattgcagg
    16141 agaataaaat tcataatgag aaacatcttt taattttata atatgtaaag ccaaattgct
    16201 ttgctttaca aaatgggagg gctatgccat gttctaatgt tgtaaattaa gactctacat
    16261 tgtatactga aaccctgcag taagaataaa taaatagccc cactaggctt tccacatttt
    16321 cattctaaac tctcttgaac agacattagc aaatggagtt aaaattcctt ttatatttaa
    16381 atatatgtag ctgacataat tcagatattt taaaattatt taaataagat agttactcct
    16441 ctagaattca agtagaaaat ggggattatc aaactattca ccataatgaa gatcttaatc
    16501 cctgctgcta ttggtccatt accaccttca gcagtgattt ttgttttgat tttgcaagca
    16561 ttgtgcttgg taagctgttt ttcactttac tcagaagtgc atatattata atctgctttc
    16621 aggatatttc tatcataagg tttgactgta gaaaagacaa taatgttccc agttggtaat
    16681 ttagagtctt ctaactcctc tggataaaca ttatctggat tccatgtgat ctcatcaatc
    16741 ttaccaacac caacacatct agcatcaaaa tgatataaat aattcattgt atgaaggatc
    16801 atattaccaa atcctgaatg ataaaggttt tatcccctac actgagagaa aatactaaaa
    16861 cgttcatgca agaagcagca tttatgaaca tttcaaatcc atgcaggcta actctccaat
    16921 ggcagagttg catggatgcc acttggactg atctgttaac caatgatgta ggcatgtgga
    16981 ttaatatcag aatagcctat actatcagga taagtatctg taatctagaa tattcctttc
    17041 atcagtgctc atatgaggag caaatgtatc aattccagga tcaatgaaac ccattttcct
    17101 tagctccatg gtgagcttaa ctcattgact gtataatttt gagcactgtg aacaccaaac
    17161 ttagcccctc ctttaaatgg tatataaact aaagcacttt gtatgttatt aaagaaacga
    17221 aagccttaat atcattttca ctaacatctg taccataagg ggtgcagtgt tagttgtgct
    17281 gggccaatgt gccctctatg accatctagg agtcactgtt cctctgatgg gaaaactaag
    17341 gcccagcaca tagttgtagg cttaatgatt tgtaaaatgt cgtgtcccca attgagttca
    17401 cctcaccttc tcagatcttc agattttcca ccagcttttc tccaaccatc atggcatcac
    17461 agttgaagaa gctacccacc ccaccctgta ataaatgatt tccaatcata ttgtccttca
    17521 tcattgtaat gatgccttgt aacagatgag gtggttccca ctgctgcaga agcaaggtca
    17581 agtacaaatt tggctcccca gcctctaagt gagaattaaa ggaattctgg aggggtactt
    17641 actttgctgg caaacatatg gatggtggcc tggtgacttt tctttaattt cctgttgact
    17701 agagtataca ctgtggccca ctgtgatagg gtgactcatt tttcttcctg cccataaaca
    17761 gggcaaccat tttggacttc ccagggaggc cctcaattat tcagtggaac tcagcacctc
    17821 tccatgcagc tcctgtcatc atactctaac aatgtaatag tgtttattcc attcgaggtt
    17881 gttgcaaaat gccttggagg ccaacctgta tgtttccagt agtttccaca ttggtttagc
    17941 tacatatttt agccgcttcc taaatctcta cacctcttct tcctccctca tagaggaagt
    18001 ctctggtaaa tggagtttcc caccaaaaat aaaattaagt gagtaaaatt atcatcccct
    18061 gtagggtggt agcccagtac caaaaatgct caatcctgtt cctctaccac ctgttttgct
    18121 tttgcttcaa tttgactcaa tcacatgctg tcaagtcaat ggaatcactg aaattataga
    18181 tccctttagg ctaatatggg ctaagagttt agatttcaga gttagaactg cattttatgt
    18241 ctctctaatc atgagtttcc cccattttaa caagagtgta aaaatacatc agaggttgtg
    18301 aagaataaaa agttaacata tcataatgct tatcctgcta tacagtagat actccataaa
    18361 tggtagccat tattattatt caccttttaa aaagtcttgg tcgattggaa cactttcaat
    18421 agtaaccact gcccggagtg aattttggcc agcaattcaa atgcagacaa gtggctcttc
    18481 cacagtagta tacccattgt tagagacctt ctggtcctaa gcaggacaag cttaggtggg
    18541 ccctggttgg tgcaatgtgt ggacatcatt agtatgcacc ccagttttct ccatctatat
    18601 ccaacggcaa gaaatctttg ctcactaaca ccagtctctt gcttgtgggg atagcatcca
    18661 gctgtgcttt ctgtattggc tagggcttgt cctatttaaa gcaagcccat gcctcacatt
    18721 tgtgtgtcaa gccctcatcc atcttaggca gaaatagccc gttctttatt agttctcaag
    18781 tatttttccc tcttatattg cccaaaccat tgtgtaatgc ttttaaaacc ataagacaat
    18841 actcttggga ggaaaatcca gctatttcca tttatttttg caaaagacat tatggcatcc
    18901 atataaaacc tgattgaccc ataattgaat tgtataatta gataaggcaa ttttattgta
    18961 tttatatttc aatgaagctg aaacgtaaaa gcaatatgag tgttgtgtgt gcgattgtat
    19021 atgtacaatt tgaccattct tgccactttt actgcaacca atctgattca agccatattc
    19081 acctatgctt tgatttttgc aattgcttct acacttgcac acctaactgt aatctcagca
    19141 cagcattcaa aataaatcag aactctccaa tggttccata cccctctcag agtaaagaca
    19201 aagtccttgt aatgacatca tacatggccc tccactgtgt gtaccacgct cttaccttct
    19261 cacctcattc ccttctattc ttcccctcat tcattctgct caaggtagaa tgcttctttg
    19321 ctattccaca aacatttcag gcatgtgttt acctcagagc ctctataggg gctgttcctt
    19381 ctgatcaaac acccttctct cagatatcca catcttcacc tttcacctca agtctatatt
    19441 caagtttttc tttttcaacg agtgctaccc tgatcacccc gtgtaaactg aacttctctg
    19501 actggaactc ctgatctgca ttccctggtc ctttttctca taacatctct aaacttcttt
    19561 catactttat tgttaattta tttgttatgt gtattgtttg actccttccc gttcaatgtt
    19621 acctctacca gagcaggcat ttttttcctt tttaattcca tgaaggcatc cccaggtaca
    19681 cagtagtcat tcaataaacc ttagttgact gagtgaatgt tgactgttcc tttcttcttg
    19741 aaactctgtt ttcttttggt tctgtgacca atctcttctg atcatttccc tacctttctg
    19801 gttgtcacta ctctagttcc tttttatatt cctgtttcta gccatccctt acatgtggat
    19861 gtcccccagg actctttcct gctctatttt gttcacaccc atgtctctag cccccatctg
    19921 catcataact tccaaattaa tatctcaagc ccatatatat ttcctctgag ttctagatgc
    19981 ttattataaa tccacctatc tactcaaaat ctccactaat gtgtttcaaa ggcacttgat
    20041 actcaagttg tcccaaactg aatgtgttct tttatatttg tcatatcaga gcctctcctc
    20101 atattctctc ttttctcacc actcaaatca gaaatctaag catcatcctt gaatttcccc
    20161 tcatcttatt tacccgtacc atttcctcag ttaccaagtt cttttgatta ttcccattaa
    20221 acatctctta aactatccct tgctctttac ccccattact cctggggaat attgcagtaa
    20281 tccaggaaaa taaataatga agacctaaaa gaattggtca cttagcttct acttttccct
    20341 tctaatttac aattctgcta gagtgtgtgt tctaaaaagc aaatctgact aggtcatgct
    20401 ctttctcaca tcctttctgt aaactctctg gccttatcag ggtgtgcctc tggctcttca
    20461 gtgcatcccc tcatgtggat cctctgctca agctcaagag cttgtctaat gctttatgaa
    20521 tataaaacat gacccctgct tacagatcca tagagagata tagaaaccac ctatgaaaat
    20581 cacatcaaaa aatcaaaatc gatgatctgt aagccagtag ggagtgaaaa aagttagtga
    20641 agaatattgg ctgtagtttg tgactgcaaa atcttcctgt gtgataatcc tatataaaag
    20701 cacataaacc caatgatgaa gctgagaaat cacaaacaga tgaggagaag gaggaggagg
    20761 tatttttgca ctgcccatca tgcagaataa tcccacaatg ctccccatcc cgtggagctt
    20821 gaatttgaac tttcagttgt tgtaggtcaa tctccatcct ggaaggtaag ctaactcttc
    20881 ctcaacttgc tcaggaaggc acagtccttg gtaaataatc tgaattggtt tgatatattt
    20941 agcattcttg gaaatgacca tataatggtc tagagctaga cacacaaaga tctattattt
    21001 tacagtaaaa tgagttatag tcattgagat tgtcactgag agctacatct caaatggttt
    21061 agtaggagag aagatctgca ctgaaataaa aagtatgagt tttttggtct ggatagatgt
    21121 cttgacatgg aacaaaagga aatgcccttc cctgagtaga gctcctctgg agagatgaac
    21181 tgtgccaggg cgcaagtcag cttggataag cactctggta cagactatgg actttccttg
    21241 taaccctcaa catctggaag gagcatctgg gaacaactca acagctacta gggttcaaag
    21301 aaagataaga cccctctgcc agatctgggc agaggtttga acagatgttt ttatattgac
    21361 agtggtctca gagtatggga atagtatctg gagtaaaacc aaccttgatt caaatcctac
    21421 ctctgccacc taaaagccct gtgagttcaa tctaagttgt taacattcat aggctttagc
    21481 ttcctcatct gcaaactgga attgttatac ctatgtaaaa ggttgttgca aagattgagt
    21541 gggactatgt aatggagaaa tgcaagttac aaggcttggc agataatatt ctcttgataa
    21601 atgagttttc atcttccctt acccagactt tgactgtatg aagagtgagc ctgcaaaacc
    21661 aagaatgctc aagatcaaaa ctgccccttt tcgaactgac ccagttctca cagcaaggtc
    21721 tttatttctg tctttaaaat accatctaga aacaacataa gatggagatg tgacacttgt
    21781 taagaaggag gccttaaaaa gggactagta cattattttg taaatgaaca atttttgttt
    21841 tactcttatc taaaaagtga tacacaccct atgttccttt taatgttgat aagttaaatg
    21901 ccctgtaata ttgttaagat ggcaatattc cccaaattaa tatacagatt caattcaatg
    21961 cctatcaaaa tcctagctgg cttctttgca gaaattgaaa aaatttgaca agctgaccca
    22021 aaataacaaa aattattcaa agataaggaa acagttggag gtctcacact ttctaatttc
    22081 aaaatttact acagagccat agcaatcaaa acagcatggt actatcatag gatagacata
    22141 gataatagaa tataattgaa aatccagaaa taaatccaca catttgtggt ccattggttt
    22201 tcaacaagag tgccaacact attcaatggg taaagaatgg tcttttcaaa caagtgaaat
    22261 aaaccatttg cacatggata tccacatgca aaaaaaaaaa aaaaaaacaa atttggattc
    22321 ttgcagtaca tcatatacaa aaattaactc aaaatggatt aaagtcctta acataaaagc
    22381 taaaacaaca aatctcttag aacacatagg cataaatctt tgtgacatcg gattaagcaa
    22441 tgatttaaac agtggattaa gcaatgatgt aagctgtcac accaaaacca caagcagcaa
    22501 aagaacaaac agataaattg ggcatcatca aaattaaaaa tgtttgtgca tgaaaggaca
    22561 aatcaagaaa tagaaggaac aacccagaaa atggcagata atatttgcca agcatatctg
    22621 ttaaatggct tagatccaga atatatatat aaagtctaca actcaataaa aaagacaacc
    22681 catttttaac atgggcaaag gatctgaata catatttttc tgagtaagat gtacaaattt
    22741 ctaataaggg cataaataga tgctcaacat cattagccat cagagaaaca caaagcaaaa
    22801 acacaatgtt atatattcat cctgcatacc cataacttta tacctgttga ccaacatttc
    22861 accatttccc cgacctcccc acccctggta gccgccattc tactgtttct atgtgtttga
    22921 cttgtttaga ttccacatat aagtgagatc atgcagtatt ttcctttctg tttccagtgt
    22981 atttcactta acataataaa taagtctaga catttaatgt acaatatgat gactagcgtt
    23041 aataatactg tattatatat tggaaatttt ctaaaagtac atttcatgtt ctgtcaccaa
    23101 aaaaagtgac tatgagagga gatggatatg ttaatttgct tgactgtagt aatcatttca
    23161 ccacgtatat gtatatcaaa acattatgtt atataatgtg atatataaga taaatataat
    23221 tttatcttaa aaataaaaac actatgagat atcacttcac acccactaag atggaagaat
    23281 aaaagcaaca gataataacc agtattggca agaatgtgga gaaattcaaa gcctcatata
    23341 ctgcaggtgg gaatgcaaaa ttgtgcagct gttttgaata atgatcgtct agttactcaa
    23401 acagctaaac acagagttac catgtgatct agcaattcct ctcctagaaa tacacccaag
    23461 agaattgaaa acacatgttc aaacaaaaac ctatacatga atgttcacag caacattatt
    23521 cataatagtc aaaaagtaga aataacccat ttccattaac tggagaattg ataaataaaa
    23581 tctggtatat ccatacaatg gactattatc tagcaaaaac aaatgaaata ctgctttgtg
    23641 ctgcaatatg gaagaacctt gaaaacactc tgctaaatga aagacgacag acacaaaagt
    23701 ctatatattg tatgattcca cttacaggaa tgtctggaat aggccaatct atagagaaga
    23761 aagtaaatta gtattttctt agcgaggaag aaaagaggtt gggagtaagt agagcgtgac
    23821 aaaggaaaaa atatttcttt ttggaggtga tgaaattctt acaaaattga ctttggtgtt
    23881 ggtggcacaa ctgttaatat actaaaaagc attgactttt acactttaaa ttggagaatt
    23941 atatggtatg tggattatat ttcaataaaa ccataataat aaaagatgac acatgttcct
    24001 tataaaaacg tgaagaaatt cagatatgca aaaggaagaa aaggaaattc cttttacttt
    24061 ttctttttct tttctattta aacttttaac ttggattcaa gggataaata tgctggtttg
    24121 ttgcatggat atgtgcatga tgctgaggtt tggggtacag atgatcctgt cacccaggta
    24181 ctgagtatag tagccaatgg gtgttgtttt tttttttttt tttttgaggc agagtctcgc
    24241 tctgtcgccc aggctggagt gcagtagcgc catctcggct cactgcaagc tccacctccc
    24301 gggttcacgc cattctcctg cctcagcctc cagaatagct gggactacag gcacccgcca
    24361 ccacgcccgg ctaatttttt tttgtatttt tagtagagac ggagtttcac tgcccaatgg
    24421 gtgttttttt agcccaagac cccctccctc cctctcccct ctaatagtcc ccagtgtgta
    24481 ttgttccctt ctttatgtcc atgtgtactc aaatgtttag ctcccactta taagtgaaaa
    24541 catgcaatat ttggttttct gttcctttgc taatccactt aggataatgg cctccagctg
    24601 catccatgtt gctgcaaaaa agacatgatg ttgctctttt ttatggcttt gtagtactcc
    24661 atagtgcata tataccatat tttacttatc caatctgccg ttgatgggca cctatgttga
    24721 ttccatgttt ctgctattgt gaatagtacc tcaataaaca tgtaagagaa tgtgtatttt
    24781 tggtagaaag atttattttc ctttgggtat atatccagta ataggattcc tgggttgaat
    24841 ggtgttctac ttttagttat ttgagaaagg accaaattgc attcctctgg cttcattctt
    24901 tttgcttagg actgctttgg ctactagtgg ctaaactaac ttacattccc accaacagag
    24961 tataagcatt ctattttctt cactgccttg ccagcatctg ttgttttttc gactttttag
    25021 taatagccat tctgactggt gtgagatgat atctctttgt ggtattgatt tgcatttctc
    25081 tgatcatcag tgatgatgag cattttttca tgtttgttgg ccgcttgtat agtattggaa
    25141 gtcctagcca aagcaatcag gcaagagaaa aaaataaaag gcatccaaat aggaaaagaa
    25201 gaataactat ctctcctcac tgacaatatg gttctatacc tagaaaaccc taaagactgc
    25261 tgaaagtttc ctagagctgg taaacaactt tagtacaatt tcagtataca aaatgtacaa
    25321 aaatcagcag cactactata caccaataat gttctagcta aggaccaaat ctagacacaa
    25381 tcccatttac aatagccaca tacacaaaaa atgagatgcc taggaatata tcaccccacg
    25441 gaggtgaaag atctctacaa ggagaactgc aaaacactgc tgaaagaaat cacagatgac
    25501 ataaataaat gaaaaaaagt ccacgctcat gaattggaag aatcaatatt attaaaatgg
    25561 ccatagtgct caaagcaatt tacagaatca atgctattcc tatcaaaata ccaatgacat
    25621 ttttcacgga atctaaaaat ctactctaaa attcatatgg aacaaagaag agctccaata
    25681 gccaaagcaa ccctaagtaa aaagaatgaa gctagaggca taacattacc tgacttcaaa
    25741 ctatactaca gggctacaat aaccaaaaga gcatggtacc actacaaaaa caaacacaaa
    25801 gaccaatgga atggaataga gaacccagaa ataaagccac acaactacaa ccatctgatc
    25861 tttgaccaag tcaacaaaaa tgaacaatga agaaagtact ccctattcag tggtgctggg
    25921 gtaactggct agccatatgc agaagaatga aaccagatcc ctacctttca ccatataaaa
    25981 aaattaactc aagctgggct gaagatttaa atgaaagacc tctaactata aaaatcctgg
    26041 aagaaaacct aaggaatatc tttctcaaca tcagctttag caaattattt ataggtgaga
    26101 ccaaaagcaa ttgtaagaaa aacaaaaatt gacaagtgat acttacttaa atgaaagagc
    26161 ttctgcacag caaaagaaat tatcaacaga gtaaacaaac agtctacaga atgggagaca
    26221 atattcccaa actatgaatc tgacaaaggt gtaatatcta gaatctacaa agaacttaaa
    26281 caaatcaaca agcaaaaaaa caacccaatt aaaaaatgag caaaggacaa gaacagccac
    26341 cttttttatt tttactttta tttttttaga gacagggtct cactctattg cccaggcaca
    26401 atcattactc attgcaggcc caaacttctt ggttcaagtg atcctcccac ctcagccttc
    26461 tgagtatcta ggactacagg cacatgccac catgccctat aaattttaac attttttgta
    26521 gagacacgtt ctgcctatgt tgcccaggct ggtcctggcc tcaagcaatc ctcccacctc
    26581 agcctcccaa agtgctggga ttataggtgt gagccattgt gtctggctcc attcacattt
    26641 ttaaaaaaac ttcctttcta tgtttcccaa ttgtttttag gtcaaaaaat ccttatcctg
    26701 aaaggaaaga tctctcatga ttcatcccat tacttctcca atatcatatt tcaatcactc
    26761 ccgagttggc atgatttgtt ccaactacaa aaaacttttc ctgctaatat tccatgctct
    26821 cttgcaagtc catacatttg cacacaatat tctcagggtt tgtagctgtt gatccctcct
    26881 ttcttcattg tgttaattgt actgatgcat catgattcag cttaggtatc acctccccca
    26941 gaaaattttt ccagaaccct ggcacgagct gtgtctcctt ggtgtctaat agttccctgt
    27001 gtttttttct atgagataat tcactgtatg tacatttttt taaataaaaa tcagaattaa
    27061 aaattatgcc aactaaaaat tgaaaaaaat ttgcatggtg cctcaaagat ctctgggaca
    27121 atatcaaaag gtctaacatt tataccatta aagttccaga aggagcagaa aaagaaattg
    27181 gtgcagaaag taatttttgc ataaataatt gcagaaaaca tcacaaagtt tgatcaaagt
    27241 tatgaattta cagattcaat aaatcagtga acttcaaact gcgtaaactt aaggaaaatc
    27301 atttctagac agataataaa acaataccaa agaaagaaaa acctttgaag tacctacaga
    27361 aaaatgacac ataatatata tggttaacag taatttggaa tactgtgaat ttcttgtcag
    27421 aaactatgga ggacagaaaa cagaagaata atgcctttaa attagaagaa gataactgcc
    27481 aaaccagaat tctataacca gtgaaaatat ccttcaggaa tgcaggcaaa tacggacatt
    27541 gttagatgat aaaaaactaa gagaactcat tgacagctct aaaagaaata ataaaagaag
    27601 ttcttcaggc tgaacataaa tgacaccaga gagaatcctg gaatgtcagg aatgaagaca
    27661 gagaaacaga aatcgtaaat agctgagtaa atattttaaa aaactatttt tctcctctta
    27721 agttctttaa aatacgtact gctattgaaa gcacaaatta taacatttcc tggtggagtt
    27781 ttaaaaatgt atttagatat atgtattttt atgtgtattt agatataaat atatgtaaac
    27841 tttaaaagat cagtgaagga tcgtgaagtg gttataaggc ttcacttaaa aaaatttatt
    27901 tacagtacaa ctcagttgaa ttttgaaaaa cacacacagt tatttaacca acacaacaat
    27961 caagatacca agtagttcca agatagtgaa cataatcatc acccccaaaa atttccttgg
    28021 gcccctttat attcctttct tctcttccct tcccacctcc ccaggcaata aatgtgcttt
    28081 ctgtcactat agagtacttt gcattttcta gattttatat gaatgaaatc aagcagtacg
    28141 tacccttttt tgtctgaggt ctttctccca gcataattat tttgaggttc aattttgttg
    28201 tgtgtatcaa agacatgctc atatatatta gtgcattgat ttttttttgg caagaccaag
    28261 tctacctaca gatgctatac atacatgaag tataagttag aggtaaagat tctgagagaa
    28321 agtatttggt atgttacctg cacagttctt ggtacagtgc tgtggataca cactgtacca
    28381 aggttagatg tcactacctt ctggaagaca gcatatttct atggaaaatg cccaagttcc
    28441 ggagttaaat aaaaacaatc ataccaaagc catctgttgt taatggatga acatccatga
    28501 gcaagttatt ttgcctccct cttttcctta actttcttat ccttcaaaaa cagggataat
    28561 aacggcactc catttggttg ttatgggcat taaaatgaga tcacatatgt aaagcattga
    28621 atgtggagtt tgatacacag cagataattt aaaatgctag ttccttttct tttctaccct
    28681 ttttaaattt tttaaaattt tgttttaagt tctgggatac atgtgcagaa catgcatagg
    28741 tatacatgtg ccatggtggt ttgctgcacc tatcaaccca tcatctaggt tttaagccct
    28801 gcatgcatta ggtatttgtc ctaatgctct ccctcccctt gacccccact cctcaacagg
    28861 ccctgatgtg tgatattccc ctccctttgt ccatgcattc tcattgttca acccccactt
    28921 ataagtgaga acatatggtg tttggttttc tgttcctatg atagtttgct gagaatgatg
    28981 gtttccggct tcatccatat ctctacaaag gacgtgaaca tattcttttt tatggctgca
    29041 tagtattcta tggtgtgtat gtgccacatt ttctttatag agtctatcat tggtgggcat
    29101 ttgagtttgt tccacgtctt tgctattgta aatagtgctg caataaagat gcatgtgcat
    29161 gtgtctttat agtaaaatga tttataatcc tttgggtata tatccagtaa tggaattgtt
    29221 gggtctaatg gaatttctgg ttctagattc ttgaggaatc accacactgt cttccacaat
    29281 ggctgaacaa attgacactc ccaccaacag tgtaaaagca ttcctatttc tccacatcct
    29341 caccagcatc tgttgtttcc tgacttttta atgattgcca ttctaactag tgtgagatgg
    29401 tatctcattg tggttttgat ttgcatttct cttatgacta gtgatgatga gctttttttc
    29461 atatgtttgt tggctgcata aatgtcttct tttgagaagt gtctgttcat atccttcgcc
    29521 tactttttga tgcgattgtt ttttattgta aatttgttta agttccttct agattctgta
    29581 tattacacct ttgtcagata gatagattgc aaaaaatttc tccaattctg taggttgcct
    29641 gttcatgctg atgatagttt cttttgctgt gaagacgctg tttagtttaa ttagatcccg
    29701 tttgtcaatt ttggcttttg ttgccattgc ttttggtgtt ttagtcatga aatctttgcc
    29761 catgcctatg ccctgaatgg tattgcctag gttttcttct agggttttca tggttttagg
    29821 ttttatattt aagtctttaa tccatcttga gttaattttt gtatgaggtg taaggaaggg
    29881 atccagtttc agctttctgc atatggttag ccagttttcc cagcaccatt tattaaatag
    29941 ggaacccatt ccccattgct tctttttgtc aggtttgtca aagatcagat ggttgtagat
    30001 gtgtggtgtt atttctgagg cctctgttct gttccatttg tctatatatc tgttttggta
    30061 ccagtaccat gatgtttttg ttgctgtagc attgtagtat agtttgaagt caggtagcat
    30121 gatgcctcca ggtttgttct ttttgcttag gattgtcttg gctatgtggg ctcttttttg
    30181 gttccatatg aaatttgaag tgtttttttt ctaattatgt gaagaaagtt catggtagct
    30241 tgatgggaat agcattcagt ctataaatta cttcaggcgg tatggccatt ttcacaatat
    30301 tgattcttcc tatccatgag catggcttta ttttgcattt gtttgtgtcc tctcttattt
    30361 ccttgagcag tggtttgtag ttctccttaa agaggtcctt cacgtctctt gtaagttata
    30421 ttcctgggta ttttattctc tttgtagcaa ttgtgaatgg gagttcactt attatttggc
    30481 tctttgcttg tctgttattg gtgtatagga gtgctttgtg atttttgcac actgattttg
    30541 tatcctgaga ctttgctgaa gttgcttatc agcttaagga gattttaagc tgagactatg
    30601 gggttttcta aatatacaat catgtcatct gcaaacagag acaatttgac atcctctctt
    30661 cctgtttgaa taccctttat ttctttctct tgcctgattg tcccggccag aacttccaat
    30721 actatgttga atatgagtgg tgaaagaggg cattctaccc ttttttgtct ggggaaaaat
    30781 tgtcatcctt ggggggaaaa aaaagcagta gcatctttga gcttaattgt aataccatag
    30841 tgccatctgg tgaccatatg gaatgactgc attatccaca tgggatattt catgtctaac
    30901 aaaaatctac aacatgtaaa tctctagtgt gttctgggtt agtgttatag ggacagacag
    30961 tgagggctgt acttgagaac tgttctcctg agctactcct ttggagatcc ctcccctggt
    31021 taggcacttt ttcatgtcag aaagaaaaat aagaattttc ccctttgcct accctaagat
    31081 ggtagatggc aggttgtata agcaaatgag tgaagtcttc acacatccct tcggctctaa
    31141 ggaagatgct gggaaagggg agttggtatt ttgtcttgtt gagagatctt cccttgaaag
    31201 atttggctac tgtacctatg gcctcgagtc caatcccaaa taggagtttc aaactcaagt
    31261 aaccccgaat agcacctagc tgtccctaaa tatgaaataa aggcccctct tcattcttcc
    31321 aactacccca gaatgcaaag aactgtacca tcagagtgta gccagggctt ttgaatccat
    31381 cacactttta cctgagctca gatttcatga agtacctcaa cccccacctc cagtggcttt
    31441 gaaacaagtc ttaacagatg gtcactcttg ccatccatcc catgccatca attcaagtca
    31501 agactgatca atgtactgaa tatgaacttt attgtgattt ggtttatgta aggcagtgta
    31561 gtgaaggcac tgcagaagtt aaacagactg gaaaacatgg taaaaaatga gcgagctaaa
    31621 aatgaatgct tgctggtcta ggtaggctaa gtactacaac tatgtttctt tttcatcatt
    31681 gcagaagcgt tgaaatattg ttgaaggtgg gggtttgggg tggaattagg gaaggcttca
    31741 tggaggaggt ggcatttata ctgggccttg aaggaggggc agaaagttga taggttaaga
    31801 tggagggaag tacatttcat aatgactgac tctagagaaa attctgtaga agtactgaaa
    31861 aatgagaggg caaacaaaca cccccagcga gtgaagcatc ccaactgaga gaaaagtctg
    31921 aggagtctgg gctgggacca aagtctcaag tccccagatg gggtcttcag tacaaatgat
    31981 attaggaggc acaggacaat gggcttgttt gtaaacctag cagcagcgag aggtttcaca
    32041 ttgcagtcag gggccccaaa ctggctgtaa tatgtctgga tgaattctct catcttttag
    32101 gtgtgtatgg tacttactta tctttcctct tgcccactga ctgcctgaat ggtagttcgt
    32161 gcgcgcccat gtgcgcgtgt gtgagtgtag gtgtgtgtgg atgggtgggg gaccgaggct
    32221 ttgtctccca ctgcagcttc ttccactttt ccgtcttgtg gcacacagca ggcttccatt
    32281 gagctgaagc tcaggcccct ctcactgtct aggactcatt gaactccagc tcttgattaa
    32341 gcaccaggaa gctttctcaa catgaaggta aaatttactg tcgggggtga cttgggttag
    32401 ggcaggacta cactttccgg aggtgatgat ggtggtgagt ggtggtggtg gtggtggtcg
    32461 tggttggtgg tggtggcagt ggtgtctgta agggttgggg acttaaagcc ggctctggct
    32521 gctgggtgaa atgctgctgg gaagtctata gtgttcagtc aggagctagg atgtcaggct
    32581 agaccagctt ggccttgatc agccctcctc cttcctcaac atgccctaca ggggagatat
    32641 atcaaacaac tgaaaacatc aaagacatct gtagggtgtc cagcctcttg ttattgtcct
    32701 acttgaatct cacccccgga agccccagag atcctttctt ggcaaatgtg tatgatgtct
    32761 agataggact gtcgcctccc catgtccagc tgctggaggt atataggtta tgtagctaca
    32821 gatgctactt ggattgggag gtatggggaa tttatcagaa gagaggaaga gttaattatt
    32881 gtgaatctca ttctttctaa agattgtgaa tcttaactaa gacctggcac ctgacggaaa
    32941 ataaaagggt ggcacgcttt gtcaggattc tctgggattg cagatttaga gctcagtgag
    33001 atcatcgctg gaaataacct tgaagctgat gtggccttgc ttgactttgg cagttggacc
    33061 acccttctta cgcaggtaca gagacttcag atcacggcag tcgctggggt cagcatccag
    33121 agtaacctgc cccaagaact gatcacagaa ttttcggctg ttccagacct ggaaagacaa
    33181 acgaagggag tgaagatgag cagccctcag tgagattagc ctttgcccag tgacatccag
    33241 atctaatgct gagcttcaaa agatttgttt taaaaccaag agactattct gtcctttcct
    33301 gatagcccat cagttgaggg gaggatacct caaagaatat acatgaaaag ccagaaaaca
    33361 gaatctccgg gtcgttcttg acctgtggga tggttaccta aaggctgctg gtcaggagta
    33421 ttaagtcaga aaaactgcca aactgcagtg attcatcctg acctagccat gtgagggtcc
    33481 tctgaactgg cataagcttt cctccatggg attttatgat acgtgtgtta acatgtgtac
    33541 tggtgagcat gtgggctttt gcccacaatt aaataaatgg aagagaggag atttaggatg
    33601 gatggtaata aggaagtcag tgtccatgct tgatgagaag agtttttgag tataaaacaa
    33661 accaaaaaat acgataattc ttgctctgca tcctaaacat atataaggta cttgggatca
    33721 tctagtatct ggtacaaagt atgagccttt ctggcgaccc ctgtacacac tcttacctgt
    33781 actataatag gaatgtcagt ggtccttctg tagaaaatgg cctgggtgtc aaaaatggca
    33841 tgaactgtat tcttctggac aggagaacgg acttcctcct ttccacattt gatgaccaaa
    33901 tatgggttta cagctggaac aaagtgacat atgtttttaa cacagcaatt aaagttcctt
    33961 ttctagatag acaagaaggt atctcgctaa gccaattctg ggactttcag catagcactc
    34021 gagcaccaga atgactgaga ttcaacactt ttattcccac ttccctttcc ttttccatat
    34081 tttatatagg aatatgaaaa tgaatctaat gtaggccttg cttgaagttg acatcccatc
    34141 ctctgccatg gttgccatcc attcccaact tctctagaca aaatatcgct ggagaaaatc
    34201 atttgtgatc atgttttcta acttatgtgc tataaatgca actgaatttt gctttcccag
    34261 acattcctgc cattcttact ttcattggca tacttcttct ccaggtcctc agcactgtga
    34321 acagtgatct gagtaactac tttcgggtag ccacgagcca ggttccagca ggacattttg
    34381 ggcatgtcca gagtcagttc cctagaacat caaaaataaa atattgtcat catatgataa
    34441 atatcctttg atgaaataag cccgccaaag cagacatcat tgctgaaaag catgagacat
    34501 tgtgtgactt agcccattcc tcctcagata tgtcttatat atattttaag catgctttcc
    34561 tttctccttt ctttaagggc caggagataa agatcaaagt aggcagggct gggacaaaaa
    34621 ggtggctttg gtgatccagc acacagaaca taatagaaga gcagggagga gtgctaccta
    34681 cttaagaaaa gaggccatca gccagaaaac agaagacaaa tttgatgtgg aagaacttag
    34741 actataggcc tgttcctctt taagggtagc tcagagctct gaagttcaga ccttcactgg
    34801 ggaatttccc ccaaccaccc tttgagaaat atttttacca ccatgatttc agtgaatcaa
    34861 gtctgattta aacaagttgt cccattgaat gtctgaagag gggagacggg tgaaggggtt
    34921 taggattgga agtaaagcaa agaacagggg tttgagggct tgcgaaggaa actgaacctg
    34981 agctggacag gcacttcaga gaagattctc aggagaaact cgctggtgcg accatgctgg
    35041 aacatggttg ggacaagcac atagttgccc ttcttcaggt acttgctcag aaacactgtg
    35101 cgggtgtcaa tataggtgga agtcccagca cgctcctgga tgtagaggtg gtggaggcgg
    35161 aatttgcggt tcatctccac ctggaaatga aatagggtaa aatattcaat tccactcaaa
    35221 tcatggtttt tgtatgctct gtctttttta tccctaggcc ttccattctt cccctaccat
    35281 taaattatcc taatttttac tctgtttgcc ttcattctct ctgccccaaa tgcccatttt
    35341 taccttgaag agctcaaagc caatgatgta attgtcaggt cttcccattc ggcggtaagt
    35401 gcgcaggtcc ttctgctgca gtgacataat gaccttgtgc ccatcctcag gcacagtgaa
    35461 gatgtactaa gggaaagaga ccaccacaga ggtgagttag cattcccatt tcatatgaac
    35521 gagcttaaat ttcagggaat aaagcttgtc attgggaagc agagccttga gttattttct
    35581 gggctttcat atttatcaga caacaaggtc ttggataatt cactgacttg ctaagcgtag
    35641 accattaaca cttgtttctt tctatgtcat tcagcatgtg ttgaaggaca accttattct
    35701 tagcctttga tgccaaagag gatatgcttt gttctaataa gttcttggta aattattgta
    35761 ttattattat tattattgtt attattatcc ctttgtattt aaatgccata tatagaaaaa
    35821 gtttggaatc cataaagaca gaaaggtatt tatttgactc aacctgacac tacagcatca
    35881 ccctgggcta gcctcctttt gccagccaga catacagctt gtggaagggc tccttggaga
    35941 attgatgttt tgctacttcc actttaagga agctacaagc ctgtgtcaga gactttccaa
    36001 acatcatctg gggaatcacg gtgctattag gagataaaac taaaaggcat cagatttttc
    36061 tctctgtctc tgtctccatt cctctctccc tttagaagga aacataaaac agctactcag
    36121 gaggctaagg cagaaggatc ccttgagccc aggaattcaa gggtgcagtg aactgtgatc
    36181 atgccactgc actccagcct gggcaacaaa gtgagacccc atctctaaaa tgaaggaaag
    36241 aaggaaggaa ggaaggaagg aaggaaggag ggagggaagg aaggaaggta ggaagaaaag
    36301 aagggaggga gggagggagg gagggaggac ttaaaaacag actctgaact aaggccccta
    36361 agttcaatcc tgtacataca tatagagtga agccattttc ccagcttagt atatgtgcct
    36421 aatacttgaa acatttccag agttgtaaat ggaggtgagg tggtgtgtgt ggggggagat
    36481 aggatatttc tttaggaaca attaagatgg gaagggtgga agagcagtat gagttaggtg
    36541 aatcagccca ctagtctcca tattctccta aatgcctgac caatgagatc catgaaaaga
    36601 aaagacgcac ccatgggcag tgaggcagca atatagtcag aattcttgtt ttctatgagt
    36661 tagctcatca gcacattttg agatgaaacc atccaggtta ttatagattc agcctttaga
    36721 tgtttggatt ttttactctg gagtgtgcgt ggacttgacc ctgttatgtc acctggaata
    36781 gcactgtaat gggagatgga tgggacagag tcgttgacag ggcaatgatt caaatcctgg
    36841 gaaaaatgac ctggatctag atgataaaat ggaggcataa taatggggtt caaaagagtc
    36901 atctgcatta gaaagaggaa gaatatagta aatgtgcacc ccagagaaca gctgggcagt
    36961 aaagagaaat gaggaagaat tgagggagca tctcaaagca gatcttctct ctccaatatg
    37021 ggaccatcac aaggaagcag ggattgtcaa agagccatac cagacaatat ggccagaaga
    37081 gaggcagcca gaggaggtga ttcagatttg atactgctgc caaagtctgt ctctgcaaaa
    37141 agacatagtc tgctgtgttt tacaagcacc tgagacaatc cgtagggatc ctggttgttg
    37201 aatattacac aatagctgca aaaggaagat ttgatgctaa agaaatactc cagagatgac
    37261 caaagcaaga tctgagttcc tattgactgg ataactttga ggcaaataaa gtaattgact
    37321 caaacctggg gattctgcag gaaggtatca cggttgttat agcagcctcc tgagcggttc
    37381 atcaggggat catcatccac agtccagcat cccaacaccg attccagctc ctttcggcca
    37441 aaaatagggt tgttcacatt gcggcagaca ttcagtttgt gaaagttgcg gcaaaagtcc
    37501 tccaagctca tcctgaatgg aaggagcaca ggaaaaaaat aaagatggta tttaatactt
    37561 aacactgtag ttccatgtag aagccccaat tcccttagtg cagaaacccc tcctcaccaa
    37621 aactctccat catcagacat aacaagcccc aggttcttgc gatctgatgc agtcagttgc
    37681 tgccactctt cagaactgaa agtaaataga aaaaaaaaaa aaagataaat cattatgttt
    37741 gccattgact tcttgtagta agtaggatgc cagagaaaca gagtgcatga ggatcagtcc
    37801 tgctttattt catctgcggc tgggtcacag accccgtgcc tgacaaaagc ccacagatgt
    37861 cactgagcat tgacagcaca ctagcagtct ggagggatgg gtctgggaga actacagggc
    37921 tttggagatg tagctgtgaa agtggaatgc aactcacatt tcactccagg ggccactcca
    37981 ttcctgtctt cccaaggggt ttctcaggcg aaccatatac accttctcag cactgaagac
    38041 ttccacaagt ctctctccaa gacgaatttt gcgaatatca gtcatggtat aggtatggcc
    38101 cttcagcaga ccccaatcag tttcaacttc ttgctcctcc tgattgggag actgagagca
    38161 aagaaagata tataaaggca caggaattgg gagacccaat cctgactcct cttcttcatt
    38221 aattggctta acccagtagg attgaagcta gttctcacac agatggaaat ttcatcctga
    38281 cacccaactt gaaacaaaaa cagtagagga aaataaataa agaatgtaat ataacaactc
    38341 aaacaagtca gattcaaact atccactgta ggacattaat atgaactgcc ataaaatttg
    38401 atggggcaga ttcatttatt gagcaagcag ccataacctc aagatctaga agcagagaca
    38461 agaggaataa tttagcagag gctactgaca caagcctgga tgctgcattg gttccatgtg
    38521 gaatgactgg tacaaaagga taaaaaacta aagcacctca gcttccacac cagggcaaaa
    38581 tgtgtatttt agctttccaa gtcccattaa tagtttttca tgggtcttcg ggttaagggc
    38641 gaatttccct accaaacttt gtagtctctt ctgccaggaa tgagtataca gtggttgttt
    38701 ttcatgagta caaacctcaa tggaacagca gatcagacca cctttggtaa atgttttgta
    38761 cagttctccg aatagcttgt acttctcctc aacaagctca gtgtatcttc ctttctgcat
    38821 gtcaacagtt tcagccaatg tgcccgtgaa gtccacaata atatcagtga tggtcaaacc
    38881 atccagggcc tcataacagc ctagcagcct gagggcaagt atgcaaggtt atctttgtaa
    38941 aattgttttt ccaggtcaag aacatcataa tctaaagaat cagaacatcc cctgcctaga
    39001 ctttaccaga tggcttttct ctaggtatga aatgagagat gttttgaaat tatattcaga
    39061 aatttctagg cttaggctat gttgatcaag gacagtctat gcagaaactt ctttctacgg
    39121 ggagatatga gataaagaaa aaatgaagtg atagtgccat gtgatctgca gatagcagga
    39181 ggaaagggta cctgggagca ggtaagagtt taaatggaaa caagaacctc tgatttctct
    39241 ctcatctaag atttctgaga gccaggccca agtgcaaagc aatgccttgg ttctgactga
    39301 catcttaccc aacttaaact ccagggttgg gctgggagag aatgagctcc caaagataaa
    39361 tgtggctttc ctttcctcat gtaccctgat agcagactag tcaatcacct ccttactttg
    39421 cataagcttt ttccagcaga gcattccaaa actcattcat ggaagtggag aaagagaaga
    39481 ccagatctcc gttaatggtg ggcaacaagt catcaatcac cacttcagtc cattctccaa
    39541 aatgccagaa acgaaagtga aatatcccag cgtatttttc tgttttttga gggtcccatt
    39601 cctgttcctt atggttggga attgtctaga aatgcaagaa catttaatca agtgttattc
    39661 accagaccac atccagaaac tgagattaca gtttactcct ggcaaatcta cagggcaaca
    39721 ggaagcatgc cacatacacc tggtttgagt ctggattgga aattcctaga tatggcattt
    39781 tgcccacaac ttctgcttcc tcagttcagt tttgctgagg atgatatcac catgcctgtc
    39841 taaatgcaca caggagtaca cagtgtggct gggacaacca gttagtggtt aaatgctttc
    39901 cacaaataat ctataaacta gatgttgaga agcggactat ataaacatat tcatgattgc
    39961 atgtttaaaa tgcaaggcat tctaaagagc aaaaccaatc aaaatcaagg cccattgtag
    40021 tttgcttttt aaatatccat tctagagatt ttatatcaat tatattaaat atggatttca
    40081 tttgcatttt gaataaaatc atgtttagga attttttttg tataacagct acacattaat
    40141 attctaaaat tattttttga ggaaaatgtt ttaactttgt cccaagtcaa gttaacactt
    40201 cattgccttg acctctacct gaatgctaaa ggaatttttg tactattaat aatttattag
    40261 aacttttcaa gtcctacttc tgtatttctc accttttttg ttattaatat gtattattga
    40321 cttcctggct tctactgact ttatacccca ataggtacag ataacataat atgcagagat
    40381 actgagtcaa tgcacggcaa atgataacca aagatcagtg agcatatatg acaaagcaag
    40441 tagagtaaaa tgttaatggt agaatctagg tgggtatatg agtgttcact ataaaattat
    40501 ttcaactttt ctgtgtttgc aaaacttctt ataatgaaat ggagggaagt cgcctagtta
    40561 atatctctgc tttcaagaaa tacaacaatc taaggaagtt atgcgctgat ttttttccta
    40621 aagttattaa agaagacaag aaagtggaaa ctgaatgagg tcccacagga gggataatac
    40681 ctttgtccaa tgagactcct gaacagccaa acaggaaaat gcagaaacca ttggcttgtg
    40741 ccccagtctc ccttgggtca gctggtggtt gctaatgttg cccacaatca gatgggggtc
    40801 atcacagatg tcctatgaat acaataattg gttatatgag agtctgggaa gagtttctca
    40861 agagtcactg tgagctgaac taggcaggac agccttcata gaggagataa ggttagggtt
    40921 gaaaaataag taggatttcc atagatggga tgaagggaag gctgagggga ttataaggaa
    40981 aaccaaaaaa cagttgatca tctagcccca tgtttggaag tggcaggaga aatgaaagga
    41041 gagggtgtca tgagattggg tgccatgcgg tggtgagggc tcttagacat agctatcatt
    41101 ggggtttggg agagttttca ccacagcagt gaactttcca aaaaaataat aactaggaat
    41161 tagtacatta cattctgatc ctattttgcc aggatgagat tttctgaagc acctgtgctt
    41221 ggaagggaga ttgtattaaa gagacaaggt ctaatgaagg ttaacattaa ttcatctcct
    41281 tctctaggaa aaccagttag gagaccttat aggaagccct tttctaacat aaagcaaaag
    41341 caacagtggt ctccatagca accagatttt tctacatttg tggatgaata tagatgacag
    41401 cagcattaat agcaagtggt aataaaccac tcacaggaac acttcaaccg acactcttcc
    41461 cagtgaaggt cctgagggga tctgggcaga gtgggggcag tatttgtttg gaatgtatgt
    41521 gtgtatcttc cccccaacat attaactgtg cctttcccac tgggagtagg aaaatgatac
    41581 ctaaatgata caattataac acaaagcgat agggcacagt agagtcattt tcaaagcata
    41641 attcatgtct aaaggaagaa tccccataat tctcctgatc acagtcagaa taggattcat
    41701 agtgttgtta acactggagt tgactagaat taggcagttg gacaagagag gggtacggtg
    41761 gcataacagg tagagagaaa ggcatgggca aaggcattgt ggcatacatg acactacata
    41821 gttgaagaca taattgggtg actcagagta tcagcatttg aagagttcaa aggcaggaaa
    41881 taccaatgca ttcttgacta gatggagaaa gcttcacgag gaggccattg gccggtacct
    41941 tgacattctg cttacgcagc agaaataaat ccttgtctca aatggaatct agacatatat
    42001 ccttttaaaa agtattagag acagctatat ttttctggtc ctcaaacctc tttttttaat
    42061 ggccataatt ctccttgacg aagggtctgg tctacagaat gaaacattcc atatattctt
    42121 taaaagccaa aataactctg gaaagaaata ggcaggcatt ctgggcaaag acaagtccaa
    42181 gctggagatt aactgggcat ccctctgcca aaaggagatg gctagatagg ttgtcagctt
    42241 tggattctca ggtctttttt tttagatgca tgcctggtag ttcacaacta ctttttgttt
    42301 taggagaatt ttctatctta aaaaatattt tttatgggct gcaaactgcc tttgttttca
    42361 cttgttgaac atagagcagg tatgttgggt aagtcaaacc ctgatgttct ttttcttctc
    42421 atttttttgg ccttttctaa aatgaaacta agcacagttt ttcctcccca ggtcaggtgt
    42481 tttcatttta ttttgagcaa gatgtcaaag tataaaataa tttctggcca ggtgtggtgg
    42541 ctctcgcctg taatcccagc actttgggag gccgaggcag gtggatcacc tgaggtcaga
    42601 agttcaagac cagcctggtc aacatggtga aaccccatct cttctaaata tacaaaaatt
    42661 agccaggcgt ggtggcaggc gcctgtaatc tcagctactc aggaggctga ggcaggagaa
    42721 tcgcttgaac ccgggaggca gaggttgcag tgagccgaga tcacaccatt gcgctccagc
    42781 ctgggcaaca agagagaaac ttcgttccaa aaaaaaaata atgatttatt tgacctagat
    42841 aatacatatc atagtttcct aacatggaaa aacaataaag tgaacactga tatatgaaaa
    42901 agtgtgcttt tagattaata aggcaaatta tttgctaatt ttatgtagag ttattagtgg
    42961 ttggagatac attatggtgt cttgaatttc cataaaattt tcatttcaca gtaggtatct
    43021 aggcttagcc aacacttttg gcatgtcaac ttctattgaa gacataaagg gtcacataag
    43081 tgagcaaaaa gacatataaa caattcaaat tgaggaatgg ggtaattatt ctcttcttga
    43141 cccaagatgg atatgattat tatatattat atgtgtctat gtatatacat gtatgtacat
    43201 ataatatata tattaaaagc tgtgtgaaat tcatgagaga atgtaaattt gggacccccc
    43261 cccccacaac attccatttc ctcttataaa ctggcccatt ctccagttat actttacccc
    43321 ttttcatacg tgatggctat ttaacaatag atttaaaaca tgacaagtta acagtcatag
    43381 aatgtcaaaa ctggaatgaa ttttggagac tatctagtaa agcctatttt tcacatgagg
    43441 aagcaagctc agagaggggc aataacttgc tactctgatc atctagctag tcagcaggag
    43501 acctaggcct aaagtttttt atttttcaga accacatatg gctagagaaa gaagctgaag
    43561 acctcagtca tccatccctg tgcctttagg ctcaacaatg gccaccacat aaaggatgat
    43621 attgtgcaac tcctcaggga cagagatgct actacctctt tcagtcattc atcccagtgt
    43681 tttatgcctc tgcttaaaac ctacctaaac actcttcact cttttgggaa atttacttac
    43741 ctcttagctt ttggttccag gatgaatatc actaataggc gttagtctgc tcaattaaat
    43801 tcaataaata tgaatcaatc tcttactata tgtaagcatc gtgatatgaa atacgggaaa
    43861 taaatccttt attaagccct atcttctgcc tctttaagga agtgaggcat taaaagacta
    43921 gaaagtagcc aaggatttag aaagggtaaa gggtaataag ccaatgcaaa cctagcagcc
    43981 aaacagacta tttggtgaca aatttgaaaa caacaaaatg actgtttgtt gacttctccc
    44041 ctaattttgg acacactgga tatcttcctg ttggggttaa ggatgtaaga cgagaaattt
    44101 ttctgtaact caaagctgag atgcagagtg cagatgggac atttcagctt gagctgtgct
    44161 ggagtggtag gaatggggtt ggaggagagg tcacagcaca catattcaca acatttgggt
    44221 ggcttaggag caaggaggga tgctgagccc ctaatggctc ccagctgtgc tggcacagtt
    44281 gaagtgatca ttttgtcacg gcaaaatgat ttcttaggaa gaaacacaca actaggaagt
    44341 tgagtaccaa gagtaaagct gaaatggtcc ccgcatgacc ccacttctga gccaaaacct
    44401 tccattaagc aaagcagctt ctctaggaag tgtgttctct tagcttagtc ttaagggact
    44461 tctcattgtc cttacttcat cctccttccc agagacctct gagcgggagg gggcatacag
    44521 ttacaagtcc tgtagccctg cacaaggcac tctatgtgac tccctttgca cctaccacat
    44581 ccaaattcag catttttttt acacaggtga aatttactga caataaacat ggaggtccag
    44641 ggtaaaccgc aggaacctca gtaaaatgaa aagctgccca gaaaaccagc agccagaata
    44701 gtggaaacac tttcaaaagt gctacgtgca atctagcagt agtgaggggg taggggtgtg
    44761 gacacctaca agtgaggtca ggcctgcagt aggtatatta tgaataagtc cagggtctaa
    44821 atagtcccag aaagctcagg ggctaatttg cacagaactg ctattcagga actgagggtt
    44881 ttgtgagcta gtccaagtag tcgttatctt acttagaaaa ttgaaccatc ccccagatgg
    44941 agcatttata ggtggactta ggctcaggga ctaatgttgc accagcctct caagaagtcc
    45001 acttttccta ctggggagtt tcaattctgc atgaagattt tattatcaat aaaatagtct
    45061 cttccttcct aacccatcca aggaaagctc aatatggaga atagaatagg atcttaggcc
    45121 caacctccac ttccaaactg ggccaccaaa tcactgtgta atcattttct cactttaacc
    45181 ttcaggattc tgctaggaga ctcaaggcac agatcccgaa ctgctttttt gggaagaatt
    45241 tatatatcta tgcaacggaa tactgttcag caataagatg gaataaattg atacatgcta
    45301 caacatggat gactcttgag aacattatgc taagaaaaag aagccagtta cttttatgat
    45361 tccattttaa gtgaaatgta cagaataggc agttctggat agaaaccaaa agtacattag
    45421 tggttggctg cagcaggggt gggggtaggg tggagggatt actcccttca aatgtgcaca
    45481 agggatctta ttgaagtgat gcagctcttc tacaactggg ttgtggtaat gtttgcacaa
    45541 ctttaaattt actaaaaatc attgctttgt atacttaaaa caggtgagtt tcacaatata
    45601 taaattgcac ttcgataaag ctgttaaagt ttgttaatgc ataatttaaa ataaatagga
    45661 cagttggaga aaagactatg ggcctattca actctcttca attgtgtaaa cagtggaaaa
    45721 cacttaggac tactcacatg gggtgatata gtctctggga agaaaaggaa attcttgagc
    45781 agactaggcc aaaactacta tacctaggtg ttgcattcag ctggtggttg gaaatttcta
    45841 ttagcacaga gagaaaagtg tttctttcct tacggttccc tcatatgctg tcacccaagc
    45901 ctgtatattc tgcttctaat gtcttccctt atctgccttt cccttccatt cctactacca
    45961 taattttaat ccagagcctc atcaccctcc cactctcaac cagacaatgt agtaaaatgg
    46021 ccccagggga atgtattcta gaggggaaga ctaggcaatg gcaatggtag cgaaggctga
    46081 acttaaaact tgacaggaga gctgacagat cttgtgcagg tgcccttttt tcctcacagc
    46141 tgacagtgca aattcccatg aggctttact cctttttagt cagggattag atgtgtctga
    46201 ttttaagaca ctattgtagg tagatatgag tcagtaagga aaaacgccag aggtgggtag
    46261 gaggggagag tggctggagg gagctggaga agagattttc tgtgggaatt tggcaggcag
    46321 atttcagcct aaaaatccta ccggcagtct tatgatcaca ctgtaactca gtataagtta
    46381 gaccttataa tttatgaaga ctatatatgt tgagctccag gagaatagaa aagtccagga
    46441 gcacatggtc tcaggtaata tacctgtgta ttcagaaagc taggacaaca gctgaggcca
    46501 atagacggct ggagaaatta agaagaactt tcctctggag ggtatgggca gatgatctgg
    46561 cattgaacaa ttgcaagaca ttcataattc ctgtcatctg aagaggctcc ttaaaacaca
    46621 gcaaagaagt ggcaagtggg aggaggggaa ctcaatcctg cccaagaatg gggcggcggg
    46681 gggccggggg ggcgccaagg cggcgagggg cggagcttga ctggagtcag tttgttaaac
    46741 tatcagtggt gtttgcaggc atggaggctt acgaaggcaa aagccttcaa cacgagaaag
    46801 gaaaaagaca gaaaatatca ggttaatcgt gttgcttcca ttttggagac ttcagaaaat
    46861 agccccagtt aagtgtgcag ctacatggca tgccaaggga caagatgtct gtaagtttcg
    46921 ctgcaaacca ggatctcaac cttgccaatg aggtggggga ggccaacggc tcatctggcc
    46981 ttcctgagcc aattcagaga ggcccagctc tgtctctcag ctttcctgat gggaggccag
    47041 agaactttga cgtggttctt gaccacaggc ttggcctcca ctctggctgg gaaggaatcc
    47101 acagcttttg gaggaggctc agatggtggt ccagaacctt gtttttttct cctgctcccc
    47161 acaagaggcc ccttctccaa ccattccaat ctatggtaac ctccccttct tcaaaaattc
    47221 tacatgattt actgtccata tgactgaaaa tagcatttca tacatccaac attcatttct
    47281 tttttaacta tttgatagct caatatccat ctccccaact agaatgggag tgcctcacca
    47341 caggccatgt ctgtgtcttg caccaaggct ggcatagagt aagtgttcag taaatatttc
    47401 tctagtgacc ctccctttgg gcttaataag attaagcaaa caccatgctc tcaaacggga
    47461 gagtgtggaa caattaatgg aagtgtggca tcaacacagt gaagaggaat atgagaactg
    47521 actggaggga agctacctat tttggtgaaa acaggggaca caaacaacta actctatcta
    47581 tctccctgat atgacagcat gatagagtga aaacttatgc aggctttgca gtcaaacgtg
    47641 gatgcaaatc cgagcttgtt agcagtctga ccctgagcaa atacttacct tctcttaaag
    47701 acacagtttc tttttatata aagatatata atgggcatgt atctactgct attttttaaa
    47761 ctaggaaatc agaattgcat caaagtaact tttaacatgc ctactttgct aagccatggt
    47821 aaataagggt gaagtaaaat aatgtataga aaacttcaaa cacagtatct ggcccatagt
    47881 acgcatgtca tgactgttga ttctttttcc tacccatccc cactgttcag cctctctctc
    47941 tgattttctc tcactttaga agggctattg tttgccttag aggcaaaatg ccgtggttgt
    48001 taagactttt tccttccctg ctaatccatc taggattcaa tccacagaat ttggaggatg
    48061 aaggcaaagg acaatggact gagccaagat agagggggaa ggatatgtct aaagacagcc
    48121 agtacaggaa acactgtctg cttggtgcac ttctggggaa tggcttcatg ttggcttgca
    48181 attttgctgc ccatcggatc tcctcccacc tcctccacct cacccctatc aatcatccct
    48241 tttctcttct gtgtcttcaa cctcttcttc tttaccatct acttcttagc ctataaatat
    48301 gttcaagtct gtcctatttc agaaagacac ctacccaata cacctctctt aatttcctgc
    48361 ttccttttta gctacttttg ccctattgac atcttttcct tcatatccaa gcatcagaaa
    48421 agagggatgc cccttattta tcaccatcca ttcataagag tggctcccag acaacgtatc
    48481 tctatttttc tccatgtatt tttgcaaata aaaaaaaaag cctgtctgca taccaaagta
    48541 ttggttaatg aattctgctc atttcctaga tttttcttgc aggtcagcta agtgatccgc
    48601 tggaatgtca ataactgcag acacagaaag gcaggaggag tgtaattttc cttcagacag
    48661 agcgaacttc tcccagtctc atactttcaa tggttgtcat tctttatagg caaaaactaa
    48721 gagatatcct ttgagttgca tagtaaacaa gtcttgatgt aaccagtgta tcttgacacc
    48781 acatggtagc aacgtgcata aacagaagta gctcttggca gaactccatt taggaattta
    48841 agctgtggaa attgcaacta aaagtgatag ttggtaaaag ggttacttga gggaggcaaa
    48901 cattgagatg atgcctgaga tggttccata atagtacaca gaaaggaacc tcattaaata
    48961 ggacctgaat gtcttcagga tattatgaag ttccaagaac ttgcatttca caacatcctt
    49021 ctgtggccta tatacctcat cacctacatt gagaaggagt ttaggtacaa tatccttcta
    49081 tccagttata ccctttaggt atagcaaggt tgcaatttat tctccagact ccatcatcct
    49141 attgttttat acactgagct cctgctcatg taggactagt gaaagggttc tgaaacttgc
    49201 tttagagaca caacatacag atgatgcaca gagaaccaca acaactatta atagttacta
    49261 ctaataggtg gtgaaagacc tacaattcta cttgaagccc acttgtccag ttgtgggatg
    49321 atacagatga cagtgtccat agagcagcaa agaagccttc ttgcccccat gaaaaattca
    49381 tcttctccct ttaaaatttg tctcatttat gctcacttta taaggagttt ttctcctcac
    49441 tggtctgaaa ttacatttaa attgttatga caggtaactc agtaaagagg caagagcata
    49501 gttgcttttt gtgttaccat atttactcaa atgaacttcc aagagaaaat aaggttggca
    49561 tatctctctg ttttttgtac agaaatattc atagcatctt tatgtgtaac agtaaaaaat
    49621 actaaaacca accccaatgc cctttagtgg gtgactagat gaacaaattt tggcatatcc
    49681 ttatcataaa ttactactca ccagtaaaaa caaaacaact attgatacac agaacacctt
    49741 gaatggatct tagattaatt tgcttaaggt taaatagcta atctcaaaaa ggtacagtgt
    49801 atttttttgt tgttttttct aagtaaatct gtcaaaattt gactaaaatt caggactcca
    49861 agctagagcc aggagcacag atccatggaa ctttcattat aaagttttct ttatgtgagg
    49921 tgatgaatac gttaatttac ctgattgtgg taattatttc acaatgtata tgtatatcaa
    49981 aacatcacat tgaacaccat aacatacaca atttttattt gttaattata tttcaataaa
    50041 gctgaaaaaa aaagaaagga aggaaggaag gttaacttct aaattggtct acacagtagg
    50101 ctttgtaagt gaaacagtgg atcacgtaga agtatgatga aggaggtcaa ggagacagaa
    50161 tgtgtaaaat agaaagttac ctggggacgt ttccacacca cctttccagg aagcagtcgg
    50221 ttgtagaaaa gagaatcatt ctcaggcaga aatgttggat cacagaaaag tctgctgtct
    50281 ttgatgcatt cctgcttcag ttcctggtat ttctggtttt tgaagagctt cagaggagga
    50341 cccatagtgt tgaactatgc ctagaatgag gaaaattttt gtctttgaag gaagaatttc
    50401 tcaggtcttt taagggttac ctgagattag acaaacagcc gtcatcccag tgacttcttc
    50461 ccagagctag gcacttctag tttggcttta aaggaaatgt caccattttc aaagaaagaa
    50521 gtgtccacac tccgcaacta attagtatgg ggccttgggt aaggcaccta accattttgg
    50581 gtctcagttt ctttatctgt caaatggggc aatatgggcc ttaatcttat tggggaaaaa
    50641 tcaataataa aagtaaagat tgtgaattaa atgtgattca ttataattta aagagaagtc
    50701 ttccatttac accagcaggg agtagttcaa ggcagagaac aacttgaaga taagggaaaa
    50761 atcactttac tgtagtcaga ataatttggt gccctggaga gtcgcacact ttgtgcatga
    50821 ggggaaaagc agcctctttc cattccagca agtcttttta cacaatttct tccgtcttgt
    50881 catttaaaga gctttgcatt ttttattatt ggccaggaag actactgcaa gaaacagctc
    50941 agcagatggc ctccaaagac aaggaaagac agaaagattg tatggtcttt cttcttaaaa
    51001 gtctaatgta taatcctttg cctgtgataa aaggcagact ctgaaattca attacactaa
    51061 tttggtgggc tatagattgt tatcccatgg caatacactc ttgctttctt ttctttaaaa
    51121 aaaaaaaaag gaggaaaaac aaaaacctta catggctaaa cagagaattc atatagtggc
    51181 tcttcaagca atgtctatca gagtgcccca cactcaagtg ctcttaacaa ctggttttta
    51241 ggaagataat ttgaaaggca aatttgattt tcaatagctt tggggtgata atagggtttt
    51301 cagtgctttt tcttgattca caaagatttg taatgatttt aatgatcctc tcctggcatt
    51361 caacagtaaa taagaatgag attgtgaaat ttacatggtg tcaactattt caaaatgtat
    51421 tttagaaata actaaagtct atattctata atcacaggag ggttgaaaca aaaaaaacct
    51481 gactaagcta cagaatgtct gctaaaaaat aaggagaata cgttagcact ttaaggagta
    51541 tctgtgtggt atatctgtga ggcagtcgct gttttataaa aagtgaaata ttgatcagac
    51601 ctctctacta gttcatttta tatcctttca gtatcacatt gaaatgcctc actggtaaag
    51661 aagtttcccc atttgaaaac cataacaatt tttaatttaa atgtttgcaa tgttattaca
    51721 atgttatttg caatgtatgt acaacaaact taccctttct ctgaatccaa ggaataagaa
    51781 gctgagaata accacatttt tttcatagcc ttatttattg gcaattgctt ttaaaattga
    51841 gagctgtggg gacactgcaa aatactgatg tccctttttt catcagattt gtctgctcat
    51901 gtgttttaaa tattaaagtg tacttcattg tatttgaaag catggatggt atcagtaaaa
    51961 atttttagtt atgcagcatc agagccaacc tgaatgtttg cccagactcc tttctggagg
    52021 ctgttacaaa tggatcccaa ttttcttgtg tgctcaaact tacaacaaac actactgcgc
    52081 ctcttgcttt ccctagaagg tttggtgaat tgccagagcc tggggactgt aaatgtggaa
    52141 aagcctgggg caattagaat atttagtttg ccagaacctc ctagtccctg gacattctgg
    52201 ataaatgagt atttcctgtt ctgccttcaa aggtcttcct tactactcgg tcaaatggca
    52261 aggcaggcaa gatcctgcaa aaggaatccc tgttataaag agggcctgtg tacagacaga
    52321 agtagcacca attgaaagga aaatcaggtt tgatttttaa agcccatgtg atcccaatag
    52381 actctttcag aggcagaatg tacagctcac atggatagat tacagcttga cggaacacct
    52441 tttgggtcag ggtgcctgtg ttctggggac acacagtaaa taaagaagag cagactatga
    52501 gcctgaactt ccactacagt acaactgtag tcccaagtcc tctcggactt ggagagtaga
    52561 agtgaccttc atgtggaggt tgcagtgagc cgagatcgtg ccattgcact ccagcctggg
    52621 caacagagcg agactccgtc tcaaaaaaaa aaaaaaaaaa aaaaaaaagt gacccgcatt
    52681 tctgactggg caacagaact gtgatccatt tgctctgtga ccttgttctc tttgtgcctc
    52741 aaatttttca tcaggaaagt gaggttatct accctgcccc tcttacacca tgggtgtacc
    52801 taatgagatg gttggtgtaa gaggaatttt gaataagtag aaagtgatcc ataaatgcaa
    52861 aatatacact gaagccctct cattgcctag cattgagctg ggcacacaaa tattgttcag
    52921 aaaatgtttg ttgaacagat aaatgtctta tgtggagaat tagtgattgc atggcaattt
    52981 cttgtttgaa atgtgcattt tacattttag ttcgggttct catcatagca gtcatgttta
    53041 atgagattat ctgtgccctt cattttgtat ctgcagaaat ggcagctcag agaagcagta
    53101 gctagaaaat ggcagagtta actcaaatcc aggttctgtg caatgccaga ttcagtgtgc
    53161 agctttccat cacaccacac tgcttctttt ggtcactgcc ttcttagaga aaggattttt
    53221 tagttcaaat gaggaaatcc tccaaagcaa acatgatact caactttgag gcattctgcc
    53281 tagaatttat tgccatggta gacattgaaa aggttcttta ttttcatggt ttgatgttca
    53341 tgggagaatc tgaactcttt acatcggcgg gttcatcatt taaacatcag aatacttttg
    53401 gaaacccaga aatttgtaag ttgagtttgt cataaacaac ttactctgtt ggaacggatg
    53461 agtgcagaga agggaacact ccctcctttc tgttctattg cacccgatgg cgatggggaa
    53521 gggagagccc ttacgttaaa gatcacatat taggaaagtc gggagacaag ataaaagtta
    53581 gggctttgag tataaaccca gggatttgaa tataaaccaa ggagagcctc tagctaacaa
    53641 aggcaatgtg gacatttgag aagagggacc ctgtggcctg agggctagtg gaatgagcgg
    53701 ttgggtgagg cagagtgtgg tggggcttag cattgaggta atggtcttag gcaatattat
    53761 tataattcag atccaccact ttgagaaaga atatgaggac aaccgtgtct gccaaggcaa
    53821 aggataggag cgggcccaga gtgcagaaag ttgagaattg ccagggaagt tgagggagac
    53881 tgccacgtag aatacaaggc tgcagtttaa agagagaggt attttttttc tactcactgg
    53941 actggaaagc ttgaggaagc taggcaagtc gtgaaagcaa aatctcttgc ttcttaacct
    54001 gcgggttagc cttagagcaa aaagcatatt ttgctctaaa aaaatctgat aaaatacatc
    54061 ctgtctgtgg ttcacaagcc actttggatt taggtgagtg ggagagaaca tgagggaact
    54121 taatggggag agggagagag ggcatggctg aggcattgtt cctgcctagt ttctcctttt
    54181 cctcagtttc atccctgttc tgtacttgga gcaaaaatca agataaaatc aataaaagtt
    54241 tgcagacaaa ttgaatacag aagacccaat tctgccctgg catgaagtag gcaccacttt
    54301 tacccacaat ccacatccct tatccccaca cctggggggt ggtatggtga gagaaaaggg
    54361 acttggatag gaaaaagtaa tacaaattca gcatccacat ggtatacaga gaatagtgct
    54421 tttcataggt acacaggcag agtcatagct tacatacatg ggcacgcatt taaccacagg
    54481 acttatgtat ggctgttcag attcaaactt ctaaccagat attctgggtc taaaaaggac
    54541 cacagagatc ctctgatcca gcccactgcc ttatatagga tgtacctact ggcccagaga
    54601 gaagagtctg ttcaaagcca ttcaaaggaa aaaaaagagg acacaatttt aattcggttg
    54661 ctcaataact ctatcaaaaa ataaaaagca attcaaaatg ataaaaggaa cacaggtggt
    54721 tacaagatcc agaaacaaat ccaaaatatc accatttagt atacataaca tctcagtccc
    54781 tcagtttcca catagcacaa tgggaatatg ttagcttccc tgtctacttc agagtgttat
    54841 tgggagaatg taatgaaatt atgtatgtga acatactctt tattgttaaa gataaggcta
    54901 aagcagggca gtagtattag gaagcactta ttttaacaag ttaaatttct tttctgtctt
    54961 taggggaaac taatgaaatt tgatcatgtg ctttaaataa gacatgcacc aacatttaaa
    55021 agactaatat aaaaattgcc cctccaccat tctctacacc tccctatttc acccctcctt
    55081 ttttggctga cttgcccttt tggccgaata agcccatcct atatgtctac ttttccagcc
    55141 tttcaagaat caaattctga ttttatagaa tgttatgttg gaaagtctgt tagaaatcac
    55201 ttagtacaac ttctttcttt tgttgtaagg taaaggcact tgcccagggt tatgtagtta
    55261 gtgacagagc ccacactaaa actgggacct cccaacgttt ttttttaaac cacaatttgt
    55321 tgtaaggtgg ggaggagggc aaacatgcag acacaatatg tcaagcaaga cgttgctttt
    55381 aaaaattcat attgtactct tctcacaaga atgcacttag aagttttgtt tttgaattgc
    55441 aagctgtcag aaggtgagat gatttcaaat caaggctaat caaagaaaca ctgttttggg
    55501 gtcacaaagg gactttttct gcttttaatt taggaactga aaactcccaa ggatatggaa
    55561 agcctgcttt ttaacctatt cctgaccatg taagtgttta gactttttcc ctagactgct
    55621 agggagtcta aggacaaata aagaacgcta tttgaataca aagtattcac acacacccag
    55681 gcacacacgt gcacataagc atgctcacac acacacacaa acttccagag gaataaaaga
    55741 gttaattagc agtccacaaa taattttttt gtttattact tcagactgta aagtgttctt
    55801 actttcacaa taaagattag caaatgaggc attagctgaa tttcaataca ccaataatac
    55861 aaaccaggat gttttatcct gaagccagaa gtaattgtta ggacagcaga atttgctgag
    55921 ttttcctatg tgcttttcag caaacttaca gttccaaatc ctcagaacca attgtttcca
    55981 ttacaatgca ccgctccttt ttcccgtccc ccacccaaag ccagttctga tcaagtggtc
    56041 cagtgaagac cctagaacag tgtacttggc ctgggagtgg aggatggagc cccatcttga
    56101 aatcaaagtt taaaccaaag ctggaatcag aggggaacaa agccaggcca gaggctgaaa
    56161 atggaatagc caagatcaac ctgtcaccag ggcaggaaga ccactggctg tgcagacagg
    56221 gagttaagat agacttggcc accacttcct cctgtacaac actgcacagc ctgctcaccc
    56281 tcctaccaga gttcacaggc cttttgcaca cagtgtggct ccaagatttt gctcgcattt
    56341 caataatttt cccacagctg tgggtcagca gaagctagaa ggaggccctg aaaggagcca
    56401 cgcttctccc cctcaacacc atcccctgtg atttttcctc acttaccagg tattttgggt
    56461 agcagaaggc aaacacatac acatgcacgc gtgtgtgcac acacacaaaa aactgaccag
    56521 atatccccag aaaagtttca ttgccagaca accacaatca tcaccccaca cacccctcaa
    56581 aataaccaac caaaaattcc attatttttg aagttctcaa actgctatta acgggcacat
    56641 atttgatcca gacaagccat aaaatatagc tgtggagttt ggttaactac aaagttgaag
    56701 tttcttcatc tttcttagac gctaagttat ccaggaaata aaactccagg gtaagtttgc
    56761 gaattccctc tctactcacc tgagttatcc caggagccct gctgctgctg ctgctgctgc
    56821 tgctgctgct gctgccgttg ccgctgctgc tgttagccag gtaaccccac taaaagtctg
    56881 ttcagtcagt gtggctgaac aaagattctg gctttcttaa cctgaaaact atttgggctg
    56941 taccaagctc tgctcacagg gggaggtgcc tcgcagcaac ttcaggaggc tttgccccgc
    57001 ccagcagtca gaaactgatc ctgagtcatg aggaaaatgt ggaaagtgcc tctaccagat
    57061 gtgcagacgg gatgcccact ccagacccac ccaccagcaa actgaaaaac tcaggaagcc
    57121 acgtttgaaa gctctttcgg tttcaaggag aaataagact ttcagcgagg attctgatag
    57181 atttcttact tttctgtccc agggttaata tcacagacat gcacattact agagagagga
    57241 catgcagaga aaaatggtga agtgttggca aagttaatat gggctaggct gcagacatgg
    57301 ccctggtcag acatggccct ggtaggcaga tctgtgatag tgatgaggac ttcaggaagt
    57361 gactgggagt gaaagagcag ctaatggggt gcaatgaaga aaaggtcact tcgtgaaaca
    57421 acaggtaaca tttggtcaaa gacagagaga agggattgtt aagaggaaag tgaatggagt
    57481 ttattccaca gcagctataa gaaccctaat ttctagcaac caggctatga agattctgca
    57541 cagctgtgtt aggtgagagg agagaaaata attgtttttg ctctgaaagg ccaagcaaga
    57601 tccctataaa tactatactg catcacagtc aaagtcctgg tcccttcaac ctgagacatt
    57661 catttggaca caattccagg aacatgattc tcaagttgtt ttgtttgttt cattttcaca
    57721 aactgtaaag atgctcatca tctcattcct cagctttcct gaccagtgta ctgttaaccc
    57781 ttgaacctga aattacttcc ttctacccct cctcctgcac caaccctcct gctggctagt
    57841 tgaagatgta agagataagg ggggttgggg ataacaagtg agcaacatag agctcaacaa
    57901 ggacaggaag catgttctct ctccactcac tgggcccata ccaatgtttt cataaaacag
    57961 ttctgagcat ttctataact ctggtctctg agatctcctt cttggcattt ccctacaact
    58021 ccctctgtac cttctatgcc aagatggaaa ttcagctaga ggtagggcac aaggaaacag
    58081 gggtccaggg caattaggcc tctaaatatt ggcaaccatt cagctagaca gtgagatagt
    58141 cctcaagcaa atatccctgg tatggaattt tggttttaaa atcaacggag gatgttttgt
    58201 ctgggctcat attctttagc catcttttcc accttcatac ttgtaaatct agcagggaat
    58261 cacagaacct gggaattgga aggaacctta gaagccatgt ttatcctctt gctcagtgca
    58321 gaaatatcct tctgacagtc tgagaaactc atccctatgc aaaacagctt attccatatt
    58381 tgatcatttc ctaattgtta aaagggtctt ccttaggttg aggaaaaaat agcttcacac
    58441 aaacaatttc taattcctct ttcacatgac agccctttaa aatatctagc atgtcacgcc
    58501 aagtcattca tcctctgggc taaatatcct cagttctttc aaaccatgcc tcataggaca
    58561 tggtttacag actttttgcc actctagttg tcttttgagc ataaatctac tcagtgtgca
    58621 gcccgtagct taatcaaaat cctctgttgt ttcttgcatg aactactact atgtgcctga
    58681 ttctaggttt gtacaattga cttttaattt tacctggatg ctgggctttg tattcattct
    58741 caaaaaccat tgccttgttt attttaatct atctccttaa gatctttctt tatcttgagc
    58801 ctcatataca aaatcatatt ctccatagta atgtttctca atcttttttt atcattatca
    58861 cggtcctaaa gagccttttt agatattttt cccaatgtct ctccgtgaaa ttttaatacc
    58921 acagatatgc tgtctatctg tttatttgct gtctatctgt ttatttgctg tatgtatatc
    58981 tctgtttcat gtaaaaaaga aatttttttt tcactcttca agaatcaatt tttgccctct
    59041 tgggggcaat atcaccccca ttgagaatgc atgatctgta tcaataaatg ttttatatgt
    59101 gtatttgatc agccagactg atgaaagtat tgaatgagag ggtccaggac agaatactgc
    59161 agcataacct taagagtctt ccccttttga taaggttgtt caaacttgaa acaatccata
    59221 taactcctgc taccacccag ctctgacttt tccatcttct tcatgaagat atcataagac
    59281 tttttcaaat gtcttgctga aatccagata cagtatacta tggtatccca gtgatctatc
    59341 cttttagata cccttttcaa aaaggaaacg gacttagtta ggcatgtctt gcttttagtc
    59401 atcccatgct tacccctagt gatttccgct ccctttctaa agcctcacaa acgatctgcc
    59461 taacagtgca ttctacactt ttgccaagga taaatgccat gctcattggt cttcagtttc
    59521 caggatctat attttgcacc ccacaccacc accacctacc ttcactaaaa attgggatat
    59581 ttccctgcct ctttgcaccc ctcatgtctt ctctttgcca taatttctca aaggttacca
    59641 agactggctc tgtgatcaca tctgcacatt ctctcagcag cctcaaatgt gatttgtgtg
    59701 ggcctggaaa ctcattgaag gcagctaagg actcctttat tatctcctca tctatcttgg
    59761 gctttaattc tctttccagt tcaaagatca tgctccttga tggagaggtt aattaacttg
    59821 cccacagcca cacacctagt aaattggcag agctgggatt taaatcaggc tctatttaac
    59881 tctgaagact ggctcttaac cagaatgcta tgagatttca aacaccaaat gagtggtatg
    59941 gcctatgggt gttgtatgag ttccaaaatg ggagagaact attttaataa gagtagtcga
    60001 ggcaggcctt atacaagagg tgaaatttga actggatctt aaataatgag taggatccca
    60061 ataggaacaa aataatgatg tgtgcatatg ttggtaggaa tcagggaaga gactgaggat
    60121 ggcatcccag tcagaggaaa taacaaagaa taataaagga taagaattgc caccttggtc
    60181 tgcctcatag tctacttggt tgatcattat gaacactaag ataccacaaa tggtcaagag
    60241 agagccagag tgttcctgcc ttaccttaat aatagtttat taaagctctg atgcttgcta
    60301 cttgtatatt ctagggcaag gcacagccct tctggtcctt ggtttcctgc tctataaaat
    60361 aggaataaaa cagtcctatt ttcaataagc atttactgag ctcctactat ataccagata
    60421 ctatgctagt tgctggagtt ataatggcag gctataataa gaaacatgat ctcaccctca
    60481 ataattaaaa atcagtcatg tatttgcttt gtggaagttt tcagtccata ctactatttt
    60541 gttccattat ctgggtaatg tggcacattt tacttatctt aaaatgacct catttaaaag
    60601 cttgtgggga tagtggtaag gaggcttttg atatggtagc ctaggatttg tggagcaagt
    60661 tggtgtctgc tgtctccaag atgttataga ttctggttaa ttccatctca gccctcaccc
    60721 aagtcttgga ttctgttctt ttgcttcctc tagcctcatt cccagcccaa tattttcccc
    60781 tatattgggc attttccatc cctagattgt aaaactgagt tccctgaggg agaaaaaaaa
    60841 gccagctgca cccagtatgc caagtggttg gcaggggcgg gggttgctgt acattggtgt
    60901 agtatggctc tgccgtgtgg ggcagtgtca tttggttttg gctagctaac tacagtcctt
    60961 gcctcttttt agcttgaaat cctctaaggc cagtgaaagt gtgaacagaa gccacggaag
    61021 aggatggact gtcagcttcc aaaccaaata gccttggggt ggagttcaac caagattttg
    61081 ggagagtgta agttggatat cttctctgtt ggcctggtta acacaaaaat atgcttcgct
    61141 ggctggcagc agaacttttt tttaatcagg ggaagcaact catttgcctt gtcaacaaaa
    61201 gggctctagt ttctcaatat tagtcaaagg aaacctctgt ccctagtaat tgaaataaat
    61261 ggttacagtg gttgctgatc tttgactata tcaaatccat acccctctgc ttggtcatca
    61321 aacctctttc tccaccaaaa gcttggtttc accactactt ggcactagtc tagctggcct
    61381 ttgtattcca tttctcaggg ctgaacaaaa ctttcaggaa cataacctcc actgggctgg
    61441 tgttagagta ggagactcag ggctggaaaa ggttgggtgg agggtgatac ttaaaataca
    61501 gatgtcatcc aagtcttagg cataaattgt agctgagaaa tggtcagccc tagagaattg
    61561 agcatagggt accaggtgag agtgggcaag gccacaggca gaagacaagg caagaagtca
    61621 ggaattgagg tgatccagaa ctcacaggtc aggaagtatt gtgtaatgtg ggatggagcc
    61681 aaataatgca aggcaacagc taagaacaaa gccacagatc aggtactcat ttttaatgga
    61741 ggttggagtc ccagcatgat tcaaccctac caccatattt ttactcaagt tatgcccccg
    61801 gctatgactg ttctttctca cccttccatc tacccaagtc ttgcttattt tttgattctc
    61861 caaccttact ttctccacga agccttctcc ctccctgagt gatctgcact ctttctgaat
    61921 aatcatctca ctttgagtca gtatcatatc acacagggtt ctagttaatc attcttcaat
    61981 gtttcatgtg tgtgcttgga tttcctcaga ctacagactc cttgaggaca gcagctctag
    62041 cttctgtact tctacactcc ccacagtgtg aagcacagag ccaggcacca agtaaatgtg
    62101 cagtaaatac ttggcaagga attccagaga tattcagaat ggaagtcagt agcaacttag
    62161 caagagactt gaaattagcc caccacacac gtgagcacat acacgcacgc acgcacacac
    62221 atacacacac acgtgcttag aaaaccaagt aaaactgtcc taaatgtgga agtcatggag
    62281 gatggtatta tactatccct ttgtcagagc taggtataca taaagagctc tttgagagct
    62341 gggaaagctt atagaattag aacaaaatgg gaggaggcag ccttgagttg taattaagct
    62401 ggagggtgaa gtaaagatag ggagggggtt agaagcaaac agaaatatga aaaatttcta
    62461 acatggaggc aagggactct ttttcagtcc ccgtaatatg tcattcatct ggcatccact
    62521 aatttgggat ttttgaaaaa tcggactaag agaggttgta gcttgttata gccagatctt
    62581 gaaagaaaaa gagctgctaa attaattaag cctcctccat gatgggagag acagtggggc
    62641 atagggaaaa atcttgtttg acttagttaa aagtttccca gttccttcct agagctcttt
    62701 agaaacctcg ttaagctact tgtaaatcct ggccattcct gtccagtttt ttccttatca
    62761 taaacacttt ggagtattaa aactacatta aaaatttaca attccaactt tttactatct
    62821 tagactcctt ctcctccttc cctcctcagc tggtaccaaa agagaagaga ctaggtatgg
    62881 aaagcctcaa agtactggga gctttcctgt gctatctcat ttcgtcttca cagaaataac
    62941 cctgtagcag agataatttt ttcttggtta tcagaggggg gcacctgctg ggtgctcggg
    63001 ggtatgttaa gtaaccaggc taaggctaca caagaaggaa attgtgaagc ccctgcttct
    63061 aaaaaacaat tcttccaact ccaaagacct cctgctttct cctagactat gttgccccta
    63121 aattgaaaat aatcattgac taacaggaga ttaaaatatt tttcttggaa aatattgctt
    63181 tattagtcat ttaagtggct gtgaattcat gagaagagca catgttcaaa aacagaggtc
    63241 atattcataa tagacaaaaa ctggaaatag cccaggtgtc ctagttttag aaatggagaa
    63301 ctgattagca tttgccaggg tttgggggtg gggagaagct gtgttgcggg agggaggtgg
    63361 gtgtggttat aaaagggcaa taccacgaag gatctttgag gtgatggcac ttttcagtat
    63421 cttgacgctg atggtagata catgaaccta atgtgtttaa aattatatag aactaaatat
    63481 atatgcacag atgaatatga gtaaaatgag aactctgaat aagactggtg aattgtatca
    63541 atgtcaatat cctagttgtg atattatact aaagttttgc aaaatattat cggggaaagc
    63601 tgagaaaagg gtaaacagaa tctctctgta ctatttctta caactgcatg tgaatacaca
    63661 ataatctttt aaaaattcaa tgaaaaagga gataatatgt agtggctata tatatatata
    63721 ttcaatatat gtaatgtgtg catatatatg tgtaaattca atatatataa tgtgtgcata
    63781 tatataaatt caatatatat aagacaattc aatgaaaaat gaggtaatat atagtgggaa
    63841 catatatatt tatatataca taaaatttta taataaatat atatatataa atcagtgtat
    63901 tatctctgat ctgacagaag tttcttttga aagataccta atgctagatg acgagttagt
    63961 gggtgcagcg caccagcatg gcacttgtat acatatgtaa ctaacctgca cattgtgcac
    64021 aggtacccta aaacttaaag tataataata ataatttaaa aaaagaaaga tgatgttaca
    64081 ataatcttgt aggtttgggt agttcagcca agacttggag acaaggtggt gggaattcta
    64141 gctcagggaa atcgttcccc agggaaaaac acaggttgct gagcccgact gagtgcaacg
    64201 gaatgggagt gtggtgtgaa atttgaagga gtaggtttca cgcttcattc aaggccataa
    64261 cttttcaggt tcctctgctc tttgctgaga gaataagctc tgctcccaga ggcctgattg
    64321 caatgagccc cagctagaac cagctacata atttgctggg cctagtacaa aatgaaaatg
    64381 tggggcctct tgttcaaaat gtattaggaa tttcaagata gccactgcag aacgttagac
    64441 taggtgtggg gccccaccaa gcacagaatg ctgtgcaact acacaggtcc tggcccaagt
    64501 taacatttct gctaaatccc cacagctgca acatctagag aacacagacc atcacccact
    64561 ttgcagccct tccctgctag tctgaattga ctgagatgac ggaggcccaa gtggctcttt
    64621 gggatccatt cactgtttag tcacagttct gagcactggt aaatccctgc tgactgatgg
    64681 actggactgc tgttccatag accagatata gagcaagaga gggtggaacc cagaggcaga
    64741 aggcatgctg gaaaactctg gtgcccattg acaaaaagga ggctggtgag aagcagatca
    64801 ggctggtggc cattcagaaa gtgtacaatg gatggagcac taaattatag cctggagacc
    64861 tgggctcaaa tgccagcttt gacaccctcg agatcattaa tttctgggcg tgtcagtttc
    64921 ctcatgtgtg aaatgaagat aatataaata gtatttttct cattgggctg tcatgaggat
    64981 caaataaaat cacggatatg aaagcactat gaaaactctc atgtgaacag aaatgtaaga
    65041 catctcagaa tgaccaccag tctgattatt acatacacat ccctaagcct ttgctagcag
    65101 gtggtggctg gcctgatctg gatgcctgaa ataagaagta aggttaaaat gcatgggcca
    65161 gatatccagt tctggccaag acagaggagt cccattcccc accctcaatc ccagtcctgg
    65221 ccaagacaga ggagtcccat tccccaccct cattcccagt cctccttctt acaattaaaa
    65281 aatgctggct ataacacaac aaacaaacat aagaaggctg aaatgtggga agaagaaggc
    65341 agacaaccta agcacctaag gacttgagaa acaacacaac cttgaatacc ctgggttttc
    65401 ttttttcctc ccatatatct cagataggat gttgtagaag actccaacct gaaacgtccc
    65461 acaagcacag atgaaaaaag gtgttccaag caaagcctgt tcactgtagc cacaggactg
    65521 gggaaaaggt tacctaaaaa cagaaaacct tttttggcaa tactcactct attccaacta
    65581 aacatcaatg gcacaactgc ataccactcc tctgggattt cagtggggct gagtggagag
    65641 ctgatattct atcccctgcc cggcagaaac agttagtact cttattcccc catcagagta
    65701 tggtcagctt agcagtgagc tgagctgctg gttagctcca ctaccagcta gcagcaaaga
    65761 gactgaacaa ggtgatgtga ggtagggcta gttagtttca acaagcaatt actaattctc
    65821 tggaaacaaa ggaaaaactg gataatctca gcaaagaaat agaaaaagtt aacaagaacc
    65881 aaagggaaat tatagaactt aaaaacacaa caactgaaat ataaaacact cactgaatgg
    65941 gctcaatggt agagataaca gatggtagaa tcagtgaact tcaagatagg ccgataacgt
    66001 ttacccaatg tgaaaaaaaa aaagagagag agaaaatgac tgaaaaaaag tgaacaggga
    66061 cccagggagc tgtgagacaa taataaaagt gctaacatgt gtatcattgt agttctggaa
    66121 ggagcaagta aggagtgtgg ggctaaaata tatttgaaga ggtactggct atgaattttc
    66181 caaatttgtc atgacataaa cctaaaaatt caagaagctt agtgacctga aacagaataa
    66241 actcaaagaa atccatacta aaacacacca taattaaaca tctaaaaatt agaagtcatg
    66301 gtccaaaaga gatcactctc tgccacccca tgggttttgg ggaattcagt acagaagatc
    66361 ttcagagctg tcagcctccc ctttctgcat cataacaggg aagaagggca agtcccacct
    66421 acttcaaacc ctggcttccc caggcttaac tatcaagcct atcattcatt cattcaacaa
    66481 acatttatga gcacctaata tttactagtc attcacctca ttgctagtga gacaactgtg
    66541 aacaagctag agtctctgcc cccatggagc tatattctaa taggtggaga tagacataca
    66601 gggcatatgt tgagaaagag ggtaactaaa cagagcagta cttatttaga cacatacttt
    66661 tcccagaaaa ggatcttaca tcagccacca tttcattcac tcaatattta ttaatcactc
    66721 aagcactcac aatgtctaag aacctgctcg gccttggaga cacagttcct gccctcaagg
    66781 aacaaatagt gttcttgtag gacacggttg cttaaccaca atactgttga cattggggat
    66841 tgaataattc tttgttatgt gtgttgaggt gggggctgtt ctgtgcattg tacagtgttt
    66901 agtaacatcc ctggcctgta ccctcagaag gcagtaacat cgttcctcca agttgtgaaa
    66961 accaaagatt tctacataca tggccaatgt ctcctggtag gcaaaaacac acccagttga
    67021 gaaccactgt tgtaaggtaa actgaaaagt ggaccaatag ttgatataaa atgtgataac
    67081 tgctatggta gagaaacctc ccttggctcc accttgcttc caggcccttt agaggtggcc
    67141 ttactccact tccagcctta tttccttcca tgtggtcaag cccccactat actgcactct
    67201 aatcatcccc agaacatggc atgccttttt ctttccccag acttttccat atgccctttt
    67261 ctctaccttg aataccctcc cctggccagc aacctcaagt ggcatagcac tccttttaaa
    67321 tctttatttg aggtatttgt ctcatgttga gggcagagac tatgtcaagc tgatctctgt
    67381 atccctagtg cttagcatag tccttacttc ctgaagtaat gcaatgatgc agaaaagaga
    67441 gaagatataa ttgtccctag ctctcagatt gcttccattc tccttgtagg aggaaagaga
    67501 aacagaagtg aacattccca ttctgtacag cccagtttgt agtagaatta agtgtgaata
    67561 tgtgcaagat gtattacaaa tgatagaaca tgggcggagg attcatgaga tgaaataatt
    67621 gaactgggcc ttgaagccgt tgatcatatg aaaggtatgg aggagtagct aagagatggg
    67681 agtgggaaaa gataatagaa cacactattc cccagacagt aagaaaacca aaccccagag
    67741 agtacagacc tgacacccac ttcatgtaca taagaggtct actcccaccc tgaggtttgc
    67801 tcctaccctg atgtgtggat aaatcttgaa tcctttaagc aggtaacaaa gggaaagggt
    67861 taaagaatga tgggagtttt ggactgctgg aaagcactgg gagtgacagg tttagagaat
    67921 taacacaggc acactgccac caaataagct gatattccaa gcacaggaag tgaactcttc
    67981 cttagcactt ctgattcagc taatttagac tgagcaccaa tcatgtacaa tgtgcttcct
    68041 atatattgtc ttgcttagct ttcacaatag taagatggtg aagtctcagt ggcaaaatag
    68101 attcaaacca aagttttctg gtgatgacaa agcctggtta ctttcattgt gccatactgc
    68161 catggttagc tcggtggacc caagtttggg atctggagat attcttagtg agctacttcc
    68221 ttccttcctt cgttactttt ctccctctct ataagcttct aatcctgaga taccccatcc
    68281 aaaagagcat ggacagtctc tggaatgggg catccacaaa tacttccaag actggtatga
    68341 caattttgcc cagatacaaa ggctcctctg gagggtttct gggggaactt ctttgtggaa
    68401 taggagagac ttgagctggg cctgaaagga taatagtgct tttagacaaa gcaggtattt
    68461 ggggtataag gtagagcgat ctatagatct taaagtcttg agtcggggga aatgtcgtgt
    68521 gtgttaggat ggccttcagg gaagaacagt gaggaagact ggtggggccg ggcttaagag
    68581 ttcatgttgg caaatagtgg gagatcagac cgggtagagc ctaaatgtag aggaattagg
    68641 agtttacgtg agttctgcaa gctggtatgt gtagacacag gagcctgcct agacaaatat
    68701 agctgtagag acagtcatgg attagttgac tggggcagct gggaaggctg ccactggggg
    68761 tgggtaatgt aattgaaact tctttcatgc caagatcctc tgaaatggaa atggtgaaca
    68821 agggcccacc aatgacgaag ttgtgtgcct tgggttcctt cacctccatt ttccaccctc
    68881 cgttctcttt ggatttcaat agatccatct tgcccttctg cacacagcgc cctcactttc
    68941 tgtcaccagc agaatctcac ccagaaccac cttctttagg aaggatcttt ttctctcact
    69001 aactagttct ctttccttcc gccaaacaaa ccgtatcatt ttttcttagg gctccatcat
    69061 gttactagca caaacctgat tagacaaatt cacagaaaat gttagtgttt tgataatgtc
    69121 ttaataatgt tcaaatttga acgagagcct gcaatgcctt tgctagaagc aagggtcgcc
    69181 atgagacttt aggcttagat aaagggagat tgtgatgggg tgagaatagg gtagggacag
    69241 gaatttgaga aggggacctt ttcatgacct ttgcataaag atagtcttgc ttgtttctta
    69301 aatatttgaa aagaggcttc cctccattcc cctccatatt ctccctacct tgttcctgca
    69361 ttccaggatc ctaccagaaa taattgtatt aattaaacag catttaggtg cagagaaaat
    69421 gcccagtttg gtttggggat agaaactttt cttgagttgt taaggacctc acagaactgt
    69481 agctgctgct attggagcca tgttcacagc tcccttgaac actaaaatta acagatggga
    69541 cacttcaatc cagggcacag ttgtcaaaaa ccacagttta aatcagtcac agacaactgt
    69601 ttcattcccc acacttgcct gggtgtaaca actgcaggaa tgcctctgct taaatcgcat
    69661 ccagtgacac cacagagaga gacatgtcta ggctgcattc attctcagga ctttctgagc
    69721 ttcttttaga gcaggggtca gcaacctttc caaaatgggg tgccagggtt ctgacccatg
    69781 ttcttttata ggactagtgc gtgtggactt atgtatgcaa acaacggcaa ggtgggaggg
    69841 tggtgaggta gggagaaaag agggagttat actgactctt gaagaaggag atatggtggc
    69901 caactattag ccagagacag agcaccgctg ggtgacaact tacaaaaata aatggtgagg
    69961 aaagggtttt acctgttgct tctgactggt gtgcctgtaa tagccggtct atgggtcatc
    70021 cctgggtata cagccaactc tttgataatg actaatccat ttattcattc actcaggcac
    70081 ttgttaagtg gccacacttt gtggaacact ggaaacccag acacaaataa gatgctttct
    70141 tccctcaagg agctcacatc acagtctagt ttctgttccc agtaattcaa ctattcattt
    70201 ggtgaactca ctgtcatgta tttcactagc acttgttgag tgtcaggagc catcctgcac
    70261 ttgtggtatg cagagatgcc gagaaaaatc agacaaggtc cttgacctca aggggattcc
    70321 agtctagtag cataaaagga caagtagaca tttccatttc tattcagtgt ggtaggagac
    70381 atttggctta gatgagcctc tgagacacct tctgatacaa gatccctgat ctagagggag
    70441 tataccttag atttggggag aatcagcctt gggggaggca gatccctgtg tctgagagtt
    70501 gtcttgtcac tgtaccagga tgaaagagat gttacccaaa acaagaataa aaactgttta
    70561 cctagatgct tggtaaacat ttgttattct tgttttggga accatttctt tcatcattct
    70621 ctttacagac gagaaataaa ctgaagttag tgaaatgaaa tagggaggcc tcctgatcaa
    70681 cagattatgt tctacttgtg gttcatgggc tctggagctt cttaagctga tagtcatcct
    70741 gtcaatgtcc tactaccttg tgctggcagg tctcaatctg ggggtctcta aaggcagcaa
    70801 atgataccct taagctattt ttcaatatta aaaaatcata gaaattgtac tttcagccaa
    70861 gacaaagcca cagagagcta agtacaatat catgtttctt tgcttttgac tgaaattata
    70921 ttagctttgg tgtggttagc tcatgtggat cagaagccct gagtgagtaa aaaattggaa
    70981 aaacgaatta ttccataata aatgcagttt cgctagataa aaatttttcc aaatgtgggg
    71041 gccacagggg gaaaaataat aaaagggatc cttggtggtg taaaggttgg aaatcactgg
    71101 cctccacgtc tggatgccaa agtttggctg cagattcaaa gcctgtgacc acagaatcct
    71161 gggattattg gcatctgaca cccctgatgt ggttcctgtt ggaaaagcaa acatgctccc
    71221 tgcccagctt tctaaagtgt tatacattaa ctggcttaca aagcaccatt cacgagtgga
    71281 caggtttttt ttggatcgcg gtttactgtg tttttgtacc tgcagtgaaa gaacgtttca
    71341 gaggaagcac aggcttgcct aaaatccaca aagctatgct attgccagac aagcacagaa
    71401 ctccttataa ctcacagagc tggaacaagc tgtccacatc tttctccttc tcattgacag
    71461 ccttcatcca gtacctctga gaacgcacat tccatgaaca cacgcaaaat agactaagaa
    71521 tatccattgg gtataagcaa agaaggaggg gccctgcagg ctagagagtg aggaaggagg
    71581 tttgaaggag actggttaga gaatcattct cactcttccc attgactgaa aaagctttct
    71641 gcaggaggct taggaggaga gctcaggaaa acttaaggga gaggaggccc gaggcttctg
    71701 gagagaaaag ggatttccag taacattgat cctagcctac atttaaaaat aaatttttaa
    71761 tttaatggtg ataaaataca cataagataa aacttactgt cttaactatt tttaagcata
    71821 caattcagta atgttaagta tattcacatt gttgtgaaac agatctgcag aactttttca
    71881 ttttgtaaaa ctgaaacact gtattcatta aataactcca cattttcccc tcccttcagc
    71941 ctctgatagc tgccactcta ctttctgttt ccatgagttt tcctacttta aataccttat
    72001 atatgtagaa tatatagtat ttgtcttttt gtgactggct tatttcactc agcataatga
    72061 cctcaagttt catccatgtt gtagcatgtg tcagaaattc cttcctttaa aaggctgaat
    72121 aatattccat tgtatgtata tattacattt tgtttaccca ttcatctgtc aataaacact
    72181 tgggttgatt ccatcttttg gatactgtga ataatgctgc gatgagcatg gttgtacaaa
    72241 tatctctttg agatcttgcc ttcagttctt ttggatatat acccagaagt gggattgctg
    72301 aattgaatgc caattctatt tttaatttct tgaggaacca ccttactatt ttctatagca
    72361 gctgcaccat tttacattcc caccaacagt gcccaaaggc tccaatttct ctatatgctt
    72421 atcaacattt cttattttct ttttttaatt tttaatttta atttttgtag gtacatagta
    72481 ggtatatata tttatggagc acatgagatg ttttgataca tgcatgcaat gcataataat
    72541 cacatcaggg taaatagggt atccatctcc tcaagcattt atcctttgtg ttacaaacaa
    72601 ttcaattata ctcagttagt tttaaatgta caattattat tgactatagt cactctattg
    72661 tgctatcaaa cacttttttt gataatagat ttcctgatcg atataagcta atatctcatt
    72721 ttgtttttga tttgcatttt cctaatgact atggatattg aacatctttt catgtacttg
    72781 ctggccattt gtatatctac tttggagaaa tgtctgttca agttgtttgc ccatttttga
    72841 taagattatt tgttttgtta ttggatttca gaattcttta tatgttctgg atattaatcc
    72901 cttatctgat atatggtttg taaatatttc tctcattcca taggttacct gtttagtctg
    72961 ttgattgtgt cttatggtgc tctagcctac atttaaacta tttttaattg ataaaataca
    73021 agctgtatat atttatggca tacaataagt ttcaaaatgt gtatacattg tggaattgtg
    73081 aaatcagtga aatcaaacta attatcacat gcattatctg tcatgcttac ctttttgtgt
    73141 atagtggaaa cacgaacacc agcccacatt tttgaaaatt atactttaag ttctggggta
    73201 catatgcaca acatacaggt ttgttacata agtatacatg tgccgtgttg gtttgctgca
    73261 cccatcaact cgtcatttac attaggtatt tctcctaatg ctctccctcc cttgcccccc
    73321 accacccctg acaggccctg gagtgtgata ttctgttccc tgtgtccatg tgttctcatt
    73381 gttcaactcc cacttatgag tgagaacatg caccagccca catttttaaa aacaggaaaa
    73441 atgtgataaa atgaagaaca aacccagcaa gaatgatgct catatgccag gggccagatg
    73501 agatgatttc tccaagtcct ttggaatcga tgttcttgag ttccagacct cctggtgatg
    73561 ctcctccaga gcccctccaa agcttttggg gcaccacttt gtctttacac tattagctct
    73621 aagctgcttc tacagagaat gggccccttt ctaagtaacc tctttaacac ccacaacttt
    73681 cttgttgctt tctccatacc tacacaatct accttttcag acaacagcat cttcagatag
    73741 caggctcaaa tctatttgta acccctttgc tgttccagtt agggaagagg ctaaactcta
    73801 attttatgag agccgtaatg gcaggactta gacatatgaa aatttttgca gaggcaagtt
    73861 ggtctaatgt aaagactatc agaccatcac tttcttgctg tgcaacagtt cacagatcac
    73921 ttcctctcca agctcccatt tcctgtctga aaacatgagg ataaccccac cccattctgc
    73981 tggggtttgg tgaggtttgt tcaggagaaa gtttgtgtgc tgcctggcac accacaggag
    74041 aactcaacta ttggtaagtc tttccactgg gcctggactt ttcagccaga gcaaagcatg
    74101 cccaaccaaa gcccatcatg ataccagccc ccagtgcatg acttgtctct tgttcttgtg
    74161 tgcctctggt ccagtctcca caaggttgca gcagaagtga tctatttaca acttttatct
    74221 gcttatcaca ctctcctgct taaaactctt taatgactct ctgagtgctc ttcaggggga
    74281 aattcaaact tcttaatgtg gctaaaagac cccacgtcgt cgcctctgtc tgcctttctc
    74341 atgtcacctc tcaccacctc tccaccataa cactccatac ttcagccaca ttggacttct
    74401 ttcagttgct tcaacaaatg gtcctgtttt attcagggcc ttcacatatt catttactct
    74461 catctggaac agctaacttt tactcatctt taagttcagt taatatgtca cttcctctgg
    74521 gaaggctttc ttaacatccc tcaggttgaa ttaggtgtgc catggggcca ttctatgctt
    74581 cctcatgtca cagcactggg catactgtgt ataattacat ctaccttagc tatatttgga
    74641 gctctgtgag gtgaagggcc atgtctattt atcccctaga gcctagcaca gcacctggca
    74701 catagcatgt tgcacatgtg tcaaacaaag gaatcctatt ttacctttcc aacattgtct
    74761 acatgcactc actcccccac actctacatc ttaggctctt ctacctcact agaagccttt
    74821 ctgatctcca gacacattgg gcacagatca tacctcttta actttgttca tatggtcttt
    74881 ctggctggaa gtctcttgtt cttccctctt ctcttccgcc cctcccctct ttgcctaaca
    74941 aactccaact tatttaagcc tcagctcaag tttcacctcc tcagtgaagc ctgtgcttga
    75001 agagaagcat taaaataaag atttatccta ggcaggataa ttcttagctc attcttcatc
    75061 attaatgctt tattatacca agtccaatat ctaacagcct tcatcacccc accctagcag
    75121 ttagactgcc ctcctctggg ctctctcagc tccttacaac acctccatca tagcacttcc
    75181 tcaatgcttt gggggataga ttttgtttgt ttagtttagg tttttctcct gaatgaggta
    75241 ggaggacatt tcatgtcctt acccactccc caccccctgc cgaccccttg taagaggacc
    75301 ttagtctcat ttcttcaatt aaaacaaaac ataaattctg tcccctcccc cctacccttt
    75361 gtcttttttt tttttttttt tttcctgcca gtgtgcccac tcattactga gacaagccag
    75421 ggctgcttcc ctttcaggaa gcataatttt ttccaattag cctttgattg gaaacagtac
    75481 cattgcagct gggctgcttt taatgcctgc tgatgggctg cctggtcata cggaaagagc
    75541 actggcccag gagtggaagc ccatctctgc gcttaaccaa gggtctgaat gtgagaaact
    75601 tcctttcctt ccttgggcct cagtttactc tttttctgaa caatgaggaa cttgaactta
    75661 gtgttttgct attttaaagt gtggtccaca gaccaggagc atgaagtacc agtgggcagc
    75721 ttgttaaaaa cacagaatct gggccccaca ccaaatctcc aaaatcatag tctgcatttt
    75781 agcaagatcc ccaggtgatt ctatccacaa gtttgggaag ctctgtaagc ttcgcagatg
    75841 aaagattcac tgaagctctt gttaaatatt cagcttccca ggccactgtc ctggagactc
    75901 aaatgaggtg gaactgggat ggggtccctg aagttgtatt tgtaacaagt acctgaagtg
    75961 actcacatcc ctagaggact ttgtgaaacg ttagacaagt ggttgataaa ggcccttcca
    76021 cctttgacat tctgcaattc tatatgcact tgttcattac caaggatcgt aaatagtgat
    76081 acctaattca gaattctggg acttacccct ctctccatta gggtaaacat tttttaaaac
    76141 atccagggct gtgccagtgg gaaaaggaag agagattaca acaaatcaag ccaagtcttc
    76201 ttcattggaa agaatcagca cctctgggat tcaaagagaa attaaattat ctgcttactc
    76261 aagcctctct ttctctgaag cattcaagct tgatataatc agatgatctc aaagaccttc
    76321 ttgggtattg aaaaggggat attagaaatg gagggtgcaa cacctgtccc acactggcaa
    76381 cggaatccag gagaggggtt ccagacaagt tgacaaggaa acctggagaa agcctctccc
    76441 tgaaaggaat tctattttcc aagccctcag tgtggatgca tggcctgttt aaaaaggggt
    76501 gttttctgat ttgtgagtgt atttatataa attatatata tagttgtatg cttaacaaat
    76561 cacctttatt tggaaaagga aagaaaaatt acataattta cataagatca cgactgccct
    76621 ggaaaattca ggatagaaag ttgccagttt ttctccatgt ttgggcttgt ttttcttata
    76681 aatatgccct aagttccatt gctggtagct ttatggcaac atctgagatt tagacaggtg
    76741 gagagaatgt ggaagggcat tcagagggta tttttctagg cagggaaaga agggtaggct
    76801 ctgtcagcaa gagtgaagtg tgattgagca ttgcatagcg agagctgaag tcggggaagg
    76861 agatgcatga gaagctattg ggctgtccgg agcagaggcc caggtctggg aacaggggac
    76921 gctcaggctg agttgctcaa gtcagggctt tctgttcttc ccatttcctc cctacatcca
    76981 atctgtacat cagcttctga caattctata tggacaactt ttcgctctta ttcctcctct
    77041 ctaccaccac tgctctggcc ttaattcaga atctgctttt gtctctatct gcacagcttg
    77101 caacagctgt gaagaccctt tccagagtgt ctttactggt gtaagtttct tctcaagctt
    77161 gaaaaggcta tctttctcta cagaaactaa atatatagac ctaactgaaa taaacagtct
    77221 cttgcatctt ttcttctcct gggagaagtg caaactaacc caaatgacca aaaagatcat
    77281 atatgctgct taaattctca ctgagtcctc aatgaaaagg aagcagaatt cacaagagca
    77341 gggggttctc aaactagaga gggtctcaga tgcctcatgg agagcttgtt aacacgggaa
    77401 tcgctgaacc ccacgcgcag agtttctgat ccagtatatg tgggtggggc aggcgtgtgt
    77461 gtggccaagg atgtgcattt ctgacaagtt cctaggtggg gctgatgtac tgctcacact
    77521 ttgtggacca ggggaggaca acggaagaga ggctgaggaa gggaagaggc aaagctgagg
    77581 gaagagaaaa gggagatgga gagagaactg aatgctttca ggaggaactt ccttcaaata
    77641 ccaaacacct ttttcctcat aactcaaccc ctagtggttt ctccatttcc ccatttcttt
    77701 tagtctaaac tagtgcttgc taaacttatt gacctcacag attatgagaa ttagaaaaag
    77761 aaaagaaaga aatatagatt ttatttgagg cccctcctag gattagtcac tttaaatttt
    77821 gcctagttag gacattcaaa aacaaataca tacctatctt ctaccatgac caaaacttca
    77881 tataaggaag gacattttta ataaccaaaa gtaaaatgtt atgatctcaa aactaaagat
    77941 agttatttaa atgaagtaca tttacttcta tgaaaaaagc atactacatt gtccttagtt
    78001 ttctctttat atcacagcct ggtgaaaact ttattacaga ccaccattgg ctgctgtacc
    78061 agcatttggg aaccactgat ctaaatcacc gaaattgtaa ttctcagact actgcacttg
    78121 tgttttcctg ccagaaccat gtgtactggc tctgtaactt tgccacctca ggtaatcatc
    78181 agttctcaga gagacccttg ctttttggga cacctttcat gctattgcct ctatttagat
    78241 ggctcactgc ccacctggaa agctcttaat ctgttttcaa atgctgccag ctccatgaag
    78301 tttttctgaa cctcatcatc ctgcagcaat cattattcca tcattggtgt tcccatagca
    78361 cttgctcctt tgcactggac tcatttattt acagatctgt cttcactatg ctggaactcc
    78421 tggaggccta ggtccaaagc ctagcaattg gcagggtacc tagaacatag tgctgtagtg
    78481 atgtcttaat aaaaaccatg aagaggagtt ggacttgatt taatggtcag agaaatgaga
    78541 caatcgaaaa tatttttatt gtagtaaaaa acaacattaa atttaccatc ttcaccattt
    78601 ttaagtgtac agttcagttg tgtcaagtat attgacattg ttgtgcaaca gatctcttga
    78661 acttttccat cctgcaacac taaaactcta tactcactaa atactacttc tccctttacc
    78721 ctccccaagc ttttggtaac cacctttcta cttcctgctt ttatgatttt cactatatta
    78781 gataattaat atgagcagaa tcttatggta tttgtccttt tgtgaccaaa atgttctttt
    78841 agaaagattt cacttgcagg ggactgcagg atagaccaag gtggatgaaa aggatgtcat
    78901 gagtaatcag ggacaaggag aacctgacgg tagtgggggt catgagtaat cagggacaag
    78961 gagaacctga tggtagtggg gataaaaact gtcagcaaga ggcctagcca gtctgattgt
    79021 tgtatccaac ttttctttgg attagaaaca ccaagaagtc ccatttccat ctactttctt
    79081 tgtctccacc agagtgatgg tcatggagcc aggtggtaaa atactgctta gatgaagaat
    79141 tcataatcat cttggtcctt gggagtgcct aggggaagct ggcatcgttg cttctctgaa
    79201 cacgggtcag ctgggagctg cagagttggc ctcacataac tgtgatgttg gtcaaactgt
    79261 ttcctagaat ctcagcctcc tgtgtttgtt tgtttgtttg tttgtttgtt ttgagatgga
    79321 gtctcgctct gtcacccagg ctggagggca gtggcaccgt ctcggctcac tgcaacctcc
    79381 gcctcccagg ttcaagtgat tctcctgctt cagcctccca agtagctggg attacaggtg
    79441 tgcaccacca cacccagcta attttggtat ttttagtaga gacggggttt caccatgatg
    79501 gccaggctgg tctcgaactc ctgacctcag gtgatctgct cgccttggcc tcccaaagtg
    79561 ctgggattac aggtgtgagc caccatgcct ggcctcagcc tcctgttttc tacgtccttt
    79621 tctctttccc tctccatcaa cctcttccca agcccagtgc tttgtgttct ttatcccatt
    79681 aaaacctgat ctgagaaaac tatgtcatca gtcatttacg tctgagggta aataagatat
    79741 ctccgtgttt cctctcacag agagtagctg caccatcagc ctattgaaga agaatagcca
    79801 cactgtaaaa gaaattgcaa gcagcaggta gaggtatttt ggggggcatg gagtgtttct
    79861 tgggcttctg aaaacatggt actagaagtc accacctgat gacacaggct gacattggcc
    79921 aaagcatctt atgcataaaa ggaggtcact ttattcctgc tcaactctgg ggggagatct
    79981 gagatcttat tttttgtaaa tgtgctgtac caataattat ggtgtttcta tcatacattt
    80041 actctcagct tcttcttcta taaaattaag aggttggaaa aaataatatc taagttccct
    80101 gccagcttgg acattttgag gttatatgat tattggttca gtgttttttg tttttgtttt
    80161 tgtttttaaa agaatgtcag ccagaaatat tgaaatcaaa ttttattttt ggaaaatgac
    80221 attacacaca tcttgcaaat gtacacagaa atgagaatgc gatgaagaac agagcttgag
    80281 agccaaaata attgcaagag cagctctttg gctgcctggt attgaatgct gcgaatcttc
    80341 agcactcaca gttcacagca ccttacacac atggcaaact tctctagaca accttgaagt
    80401 cctctgaccc aagaaagccc tcattgaatt gcggaggggg tgggggaaga ggggtgttac
    80461 aggcacagga gatgaaaagg gtctgctcca gctggtttat gggggcctca ccaaacctac
    80521 cagtccaagt ggggtcagca aaccaaaaaa ggagaatgcc tcttgaaata ctgcttttga
    80581 atcacaattt cagagtaggt taggtattgc ctaggacact gatagtaatg catttgaact
    80641 catgcccttg tttttttttc tggctataac tattctaaat aggtggctaa agttcttaaa
    80701 tatctgtttt atttctctca ttttttcttt ttttttcaaa taattattgg tcatcggtca
    80761 agcagagtct tctgaggtct ctatcttaaa acagctgcag ggataaggga catcactacc
    80821 tactgtcttt ggattacatg tgattctgaa aactattcaa tcctgaaatg taatcaaatg
    80881 gccaaataca accccaattt accactgatt tttacgtaaa gttgagtctt tgatcacaat
    80941 gctgttcctt aagaaatgat caataactgc tgagagatgg ttgagaaatg ccttttcccc
    81001 acattttggt ttgtttgttg tttgctgact ttacttggca agagttattg ggcctcaaat
    81061 cagatattta caactgtaag acaactggga gcagggagag ggagagggca agggggtggg
    81121 aagaaggact acaaagaaga atatattctt ttcagaggtt aaaacgagtt aagaaatgtg
    81181 atgtacaata ccatgcattt actctccaaa gctagtcact agcaagctaa cacctctaac
    81241 actacaacca ccaattacag ctgttttgct acaggtactc aattctaact acagctatag
    81301 ggcagaaagg gtgggctgtt tccattttaa actttctccc tttgaaaatg gccataaaaa
    81361 cattttctgt acaagtttaa tggcacaaaa aggtgatcaa aaacatttta aagaacttac
    81421 cacacgtggg aaatgcaaaa ttcaataaaa catgcattag ttgtatacaa gtcataaggg
    81481 atcattggct tcaagctgca atatattaca ggaccataca ttggagacat tttggcataa
    81541 cctcttactt gttcaaatcc cttttggata acttataaag aaaagtcatt gtaaattttg
    81601 gcattccata tggtattgcc aagcagtgac atttccaagg gctttgcact ctgtttaata
    81661 aattaacacc tacaatctgg cttctttagg tccactaatt tctcttatat ggactgtcta
    81721 ttcagtatca tgaacagttg gtgtcaagag taaatgttta ggtgactcag aaggaaaaaa
    81781 tacatgtcaa ttaagcaaat gtctcttttt atagtataga tttgttaaat tatcttttca
    81841 tttaaatcaa ctccgattgc ctaccagttt agttaaaaca gctcaacagt tagcaagccc
    81901 caaacaagat tattatagag agtttaatgt gtaatggtaa aactttaagc aattactccc
    81961 ccacccccat ccagccaaaa caaatgacag gagaaattac ctgtggtccc tcattatatt
    82021 atattgcagc tttaaaaatt atgggcacat caaacaaaat caattattag tatttagctg
    82081 atataaatcc atgggtggat tttttctctt cagctgatta tgtctctgag cacgcttcac
    82141 tgctccacaa aatgtcagac taaaaaggaa ccacaaagta ttctgtacag tgcttggaag
    82201 gcaaagacct aaagatacat gagctataag gaaactgctt gtaaagatct gctgaggggg
    82261 attacgaatg aaaatggaga agaagaaagg gccaggatca ggttgattcc caccaacata
    82321 cttctgattg gactggggta ggaccagggc tgattgaatc tggaaataat ttatcatcag
    82381 tcccaaggca aaatcacagg ctaagaagag gcacatgccc aatgcctaga cagggaggaa
    82441 ggaaaaagta caaataaaga taataagagg gagagacaga attgcacagg agaaaggatg
    82501 gatgtttgta ctgtggaaag agagactgat aacatagaat ataaactgag aacagaggaa
    82561 actagacaga gaaagaaaag tcaaagagat acacaagagg tagagcggca aaaagccatt
    82621 gattccctca ggttattgac ctagtaatga gaaacgctaa tgccttcaag tctttgcctt
    82681 acttacaagg tgacttctgt tgtgccatct gacctctccc aactaacctg cccttctgat
    82741 caggagcaga aagatgcctt aattaccaat tggaattcta tactccccta gtctcaggca
    82801 tccaaggtaa gaaaaaatca cctctgattc cagaagatgt tgaagccaat aaaccaaggg
    82861 ataaaaaatt agtcttgtca gctgggcatg gtggctcatg cctgtaatcc cagcactttg
    82921 ggaggccgag gtgggcagat aatttgaggc caagagttca agaccagtct ggccaacatg
    82981 gtgaaacccc atctctacta aaaataataa taataaaaaa atagccagat ttggtggcgc
    83041 acgcctgtaa tcccagctac tcgggagact gaggcaggag aatcgcttcg acctgggagg
    83101 tgaagggtga aggttgcagc gagccaagat tgtgccactg cactccagcc taggtgacag
    83161 agggagactc agtcaaaaaa aaaaaaaaaa aaaagttagt cttgtcaaat gatagccaat
    83221 gtcacgctag ctaagccaaa gcgggaggtc tgcatagatc cttaacaagg aaacaaggca
    83281 ggacgcaaca ctaagagtct gggtgtataa agaccatact agacttttgg agacaaaaac
    83341 aatttctggg gaccaggctt taggagaaag ggagggaagg caccttttcc tgaagggata
    83401 gatggggcaa cattctcatg cctcacccca gatgagctcc aaaacactta tggttcactt
    83461 gggagaccag tctgggactt aaagccaaag cccttcagtg aaggcaatat atgagtaagg
    83521 cctggaaatg gatccaagtg gccctatgtg gccaaagaca ggttccaatc atctttttaa
    83581 aaaatcaatg ccttttattt ccatagcaaa ctggacaacc ataaaagaag tcgttcaaac
    83641 cactgtccca gtgtttctat ctggggaagt ctcttggcac agacgtttct gaggtggggt
    83701 gatctttggg atcattccca tcagtgtcct ggagctcatt ttggggatat aggcatcctg
    83761 ttctcttgtc ctagggatat ggatttttaa aaaataaaag ccatttagaa tcaaatgtgc
    83821 agggtgattg ggagcattgt acccagagta ttgaagggat taggaacaca gacaaaattt
    83881 tgttcaaggt aagcaccaat cactggtcta agacttggac tagaggattc aaagaagaat
    83941 ctcagggttc cttgccttct tcaaatagct acacatccac tttcaggttc cttggttact
    84001 tgtgactgtg gaggagaagg aggaaggaga gagaagtacc tgaggagtgg ggtaagccca
    84061 gaaggaggtc tcttagaact gaactaataa ataccccctt agttcccagg aagagccttg
    84121 cagcaaggaa aaccatgtgt ttggaccctt ttgccctggt ttctgctgtg gaaggccaca
    84181 ggagcagtgt acacaagacg atgggaagag tgtatagagc acacagcttg ctattttggt
    84241 gcaaacacag gagccaatgt ttaacatcag ccagggtaca tttttgtcac aaagcccctt
    84301 gcttatacct ggaaattgtc ctcatcagag caggatggct ttggggacat tgtggatgag
    84361 ggcaaccaca gtgagaccag gttattttct ctagctcatc ttactcccat attctaaaag
    84421 aaatcattgc cagtgaatgg tagccacact agttttttta aaatgttgaa aagccatctc
    84481 agaaagttag gatgaacaaa gctattgttc cactctttct ttttctactc tggaaacaag
    84541 aaacatttcc ctcagcctgg aaagaccaac cattatgaga tgacacaagg tgaactgatg
    84601 gagaaaatca atcttcctcc attccatttg ggcaaggcaa gcattcagtg gggggctatt
    84661 cagagggctc ctcacgatgg agcagtgatc tcctgaatca gatggacaag gccagggacc
    84721 ccattgctcc tattggccct caatgcctca ctctcttcaa ggacatctgc aacccatgtg
    84781 gatgggaacc aggtgaatgc agggaaaagg ggctgtatta ggttgctggg tggatttccc
    84841 actgagatca gggcagcagc atgtggctcc aagatgcaac atgaccctga gaaataactc
    84901 tcaggagcct aatggaaggc aatactactt agtgggaaaa ggccctggaa taagagccag
    84961 gagatgtggg ttgtaaaccc aatcctgccc cagccagctg tgtgaccttg gggaaattac
    85021 tttacctctg tgtctcggtt tcctggcctg aagaatgaga aatttggagt agaggagtag
    85081 atgatgtgat cttaaggttc cttccagcta gaaaacgcta taattcaatc taggctaata
    85141 ctaaagtcca taaggacaca aagcagtcat gctaatggct atcctttcca aaacagtgaa
    85201 gagaaaagga gcttttctct caatttaccc ctagttaata ctacttatga agttagttgt
    85261 tcagcccagc cccaatatat acattttccc cttctttgtc cctatttcta tttctgtagg
    85321 gaagaaaaat caggaggtag gaacaagcaa gagatggatt gtgtgtgtgt gtgtgtgtgt
    85381 gtgtgtaaga aggagggaga agtataggga gtggtggcaa atgcttgctg acttgatgtt
    85441 tgttgaccgt gcagaaatta acacaatgga acttgagtgc taaaattgga ttgggtaaaa
    85501 ttggaagtga atgacagggg tggaggagct cagttacaga cacacaaaaa aaattgcctt
    85561 gaaagtgggc caatttaaaa tgcaattaat tattggagac ttaaattttg gactatgcca
    85621 agcctcatcc cctgctccct ccaaaatggc cataaagaaa catttcctta tactcgatta
    85681 aaagtaagat ggaagatgta ctattgttat tgaagattgt tggttcaaag gttatcctaa
    85741 agaaaatata tccatttggt attttctagt agtacatata ccgcaatcaa ggaaatactc
    85801 agagagctag tgtgaggccc aagcataagg aaatctggat tcttcctctg gtcctgctct
    85861 ttaccagcct ccacttcctc aacttaaaat aaaggtgatg gactaaatga gacctaaagt
    85921 gagactatgg gtggagtcat tgggggttct ggcctgaagt ttaaaatcag gcctcccacc
    85981 attctttatg cccacacccc ctaatccaag tatcactttt tcccaattaa ctacaaaatg
    86041 ttctgggagg cactaagaaa accctgagct tctagaaaga aataggggct tagtaaaatg
    86101 ggcttattaa tggttttgaa tgggtctaca aaaaaaggag agggggcaaa aagagatcct
    86161 catcatttgg ggttcttcaa tttcaccaga aaccagtact accatcatct cacttttcct
    86221 gaaccgaaaa tgtttctggt agctacagac atctgaagct tgtctagggg catctgagct
    86281 accccaccac cactgggttc ccccctacac agtgcacatc agcagtgcca cccagttgcc
    86341 accaatccat tacaggggtg tcctgtagta tttcctaatt ttgaagcagc tgcagatgca
    86401 cacacacaca cacacatgca ccaccccctc ccaccaccac cacacacaca cacagcaagc
    86461 tcattccatt tagagtggag gaacaactgc aagccattca gtagccaaac agcagcttgg
    86521 cttcgaatgc tgtgcctttc aaaatatctg cacaattaca aagaaataag gaaatcttca
    86581 tatctgctaa aagcaaacac agaattggca tgcatgacct tcacttaaat ttagttgtct
    86641 ttgccattat gggctatgat tacagaaaaa aaaaagcagt gttataaacc tgtatctcaa
    86701 aagactgaaa actatttcaa atgatcagtc cttagtaaag agaaatcctt tctaatgtta
    86761 aagacaagtg aaagatttct gtgtcttcag tatgaaatat taggattata gataaaaaga
    86821 gttgcttaaa atatgctgta acatttcatt taaaatataa ttagaatgtc tggtccacat
    86881 tgaaattatt aatgcaagca taactgttaa aaagctatca aaaaactgat tctattgaat
    86941 tggcctctgc tattcatctt gttttctcct ggaggttact gtttggcagt gacagcataa
    87001 tttctaaact acattattct tctgcttttt tttttttttt tggtaaattt tcttcatgtg
    87061 ttttttagac aaaagaaaaa tcaccagttt ataaaaaagg agcctgaaga gtttgctcct
    87121 gaaatacaga catgaatttt tataaaaagg atattgattt cttgctataa aaagagactg
    87181 aaaggtgtca gacctgaaaa caaattctca cagtgttttc agtgttggct gttaacaagg
    87241 cattcaatag aacagagata gatggatacc atacagttca tgtaatcatg tggtatgtta
    87301 gacagccttc tagaggaatc aggtatatga agtcccttac tccaaactat tatcaattgc
    87361 ttattgattc caaattgcct cagtggagag attgaccagg aattcactgt gctttataaa
    87421 ctcttcggaa atgtctgcag tattttccaa ttctgtctat tctcttagct aacccagcac
    87481 caggatccta agcatctggt gtcacaatta taagttgttg gtttatttct ccccattgac
    87541 agccagaatt ggtgctggac aacaaacccg ggcctcaatc aggagtgaag agtgaggctg
    87601 gggagctgcc ttgctgtcct tccagggtcc tccaccattc ttgagatcca gagaaaaggg
    87661 gcacttgtgt ttgtcattct tggatctgcc aatgagtttt ttttttccca cacaaacatt
    87721 ttaagtgctg tatatgtaaa cagccctcta cagaaggaag actgtatggg caaaattaga
    87781 cagggcaaat ataacgcagg cacctaagct ttccattgaa aggtcatgga ctagcacatt
    87841 ttgcatccct ggaatgctgc cccaaaggat ggttatcaat ctatctctca taattggtaa
    87901 ctgtggatca gtggcccaga ggagaaatca caggaaaata aacccaacat attacaatgt
    87961 gtttttcaaa ataacaacaa caacaaaata aaaacttgaa agcaccaata gccctgttgg
    88021 acacttgagc agaagtaccc tactacaatg ataggcttgg atttgtactc tggactctga
    88081 gcactctccc ctcctttaca tggaatcacc aagcgagtcc gagtcatcca aggacagagg
    88141 caggtacagg tcctataaga agagaagaga caaagttaat tttccttttc ttgatatctt
    88201 gagctctgga atgtctgctt tagttttttt tattaatact ttaattttta caacataata
    88261 attatagatt cacaggaagt tgaaaaaatg tacagggagg tacccttgta ccaatgtacc
    88321 cttcattcaa ttttccctaa tggtaccatc ttgcagttta acatgtgatc atttgtgtgt
    88381 ggggtgtgta tgtgtgtgtg tgtatattcc tatgcaattt tatcatgtgt gtatatttgt
    88441 gtgaccatcg ccacagtcaa gatacagaac tgtaccttca acacaaggct cccttgtgct
    88501 accccattag agctccatcc aaccctcccc tcttccctga gccctggcaa caaataatct
    88561 gtttcctgtc tctgtaattt tggaatttca acaatgttat atgaatgaaa ccatacagta
    88621 tataaccatt tgggattggc cattttcatt cagcataatt ccccggagac tcatgtaagc
    88681 cattatatgt atcaatagtt tgcttcttta tattgctggt gtagatgcac cacagtttgt
    88741 ttagccattc acctgttgaa gcacacgttg aatgattcaa gttttgggct attgcaaata
    88801 aagctgccat aaacgctcct gtactgtaca agtttttgtg tgatcataca ttttcatttc
    88861 tctgggacag acacccagga gtaaaattgc tgagtcatat ggtaattcca ttttcagttt
    88921 tgtaaggaat gccatattgc tttccagtgt ggctatacca tttacagtct cacctcctgc
    88981 cagcaaagta ttagtagttt ggtttctcca cattctctcc agtatttgat gtttttatta
    89041 tttttcattt tagccattct tgtggatttg tagtgacatc tcattgtggt tataatttgc
    89101 attaccatga tggctaatga tgtttaacat cttttcatgt gcttagtttt catctgtatt
    89161 tcttcctcaa ttaaatgttt cttcatgatt ttattcattt ctgaattaga tttttttctt
    89221 actggtaagt tttaacagtt attttatatt ctctaaataa gcctttaatt agatattggg
    89281 ttgcaaatat tttctcccag tctgtatctt gtctttttac acaccttcac agcttttgca
    89341 tggtctttgg cagagcaaac atttttaatt tttgagatcc aatttaccca atttatcagt
    89401 ttttatttta tggattttac ttttgttgtc aagtctaaga attctttgct tataatccat
    89461 ttttaatttt tgcatatgat atgaggtcta ggttaagaaa cttttttttt ttttttgaca
    89521 gagtccctct ctgtcaccca ggctggagtg cagtggtgtg atctccactc actgcaacct
    89581 ccgcctcctg ggctcaagtg attctcctac ctcagcctcc cgagtagctg ggattacagg
    89641 cgcccaccac cacgcctggc taatttttta tacatgttat cagtaagata ccaaagagaa
    89701 ctggaaagta tagaatcaat tctgtggcca cttaagagca cccccacccc caccgccttg
    89761 ccactgcttt tgtatcataa ggacaagaac ttcagtagaa gattgttcaa gaaaattgga
    89821 gaacatagct tctccttgag gtagagaaaa aagtggaaat gagaaagagg gacaagagtg
    89881 gggaggagaa ggacttattt atttaacatt gggaatttat ctaacattga aaatgctttt
    89941 tgctgagaaa gaagtgacag gaaaacctgt aggcacctta gaatggaaag acgagtaaag
    90001 catattcaga aacccgaggt ttaactctgg ctaattaatg ctatgacatt gaacaaatct
    90061 ttgagtattg gtttctttat cagtgaacaa tctgtaaggt ttcttccagc tctgagacta
    90121 taatttgggg gtcctaaagg gatattgaat gattaagaaa caaaacaaag gcaactacag
    90181 caaatagact cagttctcgt caacctacca tcccttcact gtaagtcagg gattagcaaa
    90241 attttctgta aaagattaga tagtaaatat tttaagtttt ctttgctggt cacatatggt
    90301 ctctgtcaca cattcttcat tatttgtttg gggtttgctc ttttacaact ctttaaatat
    90361 ttaaaaaaac atattttaga tggggtgggt catccatgca aatataacca agacacaggc
    90421 aggatctggc atccctgttc taaactacca aacaaaagtt tcttagaaaa ctgataaaca
    90481 ggtaggatgt acaatgccaa ggaaatacta gaaatgcaac aaatacatgt aaaatggaat
    90541 tgaagcaaag tggcatacaa tggagttgca tctctgaagg gattaattct gaagtactgg
    90601 agtaaagtga acatagtcaa aagtattttt gagctctgga cttccagaaa ggtggcataa
    90661 gcccctctga aaatctgttc ctcaacaaat gcagtaacac tggcaaaact gtcaaaatta
    90721 actttttcag gactctaaaa atcaaccaaa agcttgcaac aatccaaaca gtgttgaaga
    90781 aaaactgatg aatcttgtaa aaatactaag atttttggtg tttcaacttg ccttatttcc
    90841 atttttcctc tccccagctc catggtagcc ttgaaaacca acagtcacaa ctactgtacc
    90901 ggtgaaaaac agcaatctag tagttactgg aggacacaat tgggggtttg gagctcccca
    90961 aagaactcat tcccagagaa ttgtcaatct ttgacctgtg gggtagctcc ctggaaactc
    91021 ccatttgcta ggttcttctt tttttaacct gactcagagc acactcgctg tgaacagcct
    91081 tttcaggggg cattgatcag ttgcaattgt ttaaaaatca cagccgtatg aggaagaggg
    91141 caaaagttgg agcaaatcaa gctgctgaaa accttgaaag gaaaaagtca tagaattatg
    91201 tgcctataag gatctttgaa aagctcagac atattcctgg gaatctagaa ggtcaggcac
    91261 aagttcagga ctgggtatat tcttagaaaa gaagcgagat ggccctaatc tttccccatt
    91321 gactgaccct gaggccctgt gcaagcggca agagaatgct aagatggagt tgtaaactgc
    91381 ctggcagagc attgaaatca tgtcccaaca caggaacaga gcccctcggc aaaggctggg
    91441 agtcttactg gatcaaggca tttaaggtaa gcaaacaaaa ctatcctatt cattcaaaat
    91501 tatcatcaca aataaaggag aaattaaaac attccctgat agaaacacag aatccattgc
    91561 tagcagacct gtcttgcaaa agaatactaa agagaatcat ccacagagaa attaaaggac
    91621 actagaaagt aattcaaatc cacataatta aaaaaaatag taaatgcgta agtaaacaca
    91681 aaagactgta caaatttatt tttgtaactc ttttcttctt ttttaaaaat aaattttact
    91741 gtgtatattt gaagtttgca acataatgtt atgggataca tatagatagt aaaatagtta
    91801 ctatagtgaa acaaataatt acttttttgc tgacaagagc agctaaaatc tacttactcc
    91861 ctaatacaat tttattacct atagtcctca tattttatat tagatctcta gatgttcatc
    91921 ctacatatct gttattttgt atcctgtaac ctacatctcc tcatttcttc cctttctatc
    91981 cccggcttct ggtaacctct gtgttattct ccatctctgt atatttgacc tttttttttt
    92041 ttttttagat ccctcatata tgtgagatta tacaatattt ttctttctgt gtctggctta
    92101 tttcaaaaga caactacatg aactagatga actagtaatt ataagaatgt gttgatggac
    92161 ttactatgta taaagatgta atttgtatgt cagtaggaga aaaaaggagt gggaaagaaa
    92221 cagagctatg ctatggcaaa atttttatat ataatttaaa tcaagtttgt attaatctta
    92281 agtagagtgt attaaaatgt aaattgtaat cctcagggca accactaaaa aagctactca
    92341 aaatgaaatt agtaaaaata caacaagtga attaaaatag tgcactaaga aatatattta
    92401 acacataaaa ggcaagcatc aagtaatgaa cgaaaaatga tataacatat atagaaaatc
    92461 gcaaaatggt agatgtaatt ctcgctttac cagtaattac attaaacata agtagataaa
    92521 gcaatccaat taaaggcaga gactggcaaa atagattttt taaaaacatg atccaactca
    92581 atgctgtcta aaagagaaac aatttagatt caaacccaca aatatgttga aagtaaaatg
    92641 atggaaaaag atttgccatg caatcagtaa ctaaagagag ctaaaattgc tatactgatg
    92701 tcagaaaaaa aatagacttt aagacaaaaa ttattactag agacaaagaa cattttataa
    92761 tgataaaaag agtcaatata tccagaaaac ataaatacaa atatttatat atcttaacag
    92821 agtaccaaaa tacatgaagc aaaaactgat ggaattgaaa ggagaaacaa acaattcaac
    92881 tatattaggt gaagacttaa atatcccact ttcaataatg gatagaacaa ctagacagaa
    92941 gaccaacaag taaatagagg acttgaataa tactataaac aaatacacat ctatggaata
    93001 ttctatccaa cgagagcaga atacatattc ttttcaagct cccatggaac atcctccagg
    93061 atagatcata tgctaggcca taaaacaatt ctcaataaat ttagagggat ttaaatgata
    93121 caaattatat tatttggcca cagtggaatt aaattagaaa caataatgga aagatattca
    93181 ggaaattcat atatatgttg aaattaaaca acgcacagat gactaaccaa aaaagaaatc
    93241 acaggaaaaa tcacaaaata ccttgagata aaggaaaatg aaatcacaac acaccaaaac
    93301 ttataggatt cagctaaagc agttttcaga gggaaatgta tatgaaatgt ttcaaatatg
    93361 ctatcccata gaaacagaat tgattagtgg ttggttgcca ggggttggat gaaaagggaa
    93421 ggggctgcga ctgctaatgg gtaaggggtt tattttgagg gtgatgaata tgtcctaaaa
    93481 gtagatagtg gtatactaaa ttaccttaaa aaccaccaaa ctataccctt taaaaggagt
    93541 gaattctata ctatgtgaat tatatctcaa taagctgtta tttttttaag tacctttgaa
    93601 aatcctttat attgacaatt aagtactgtt tatgtgattt ctgaatatgc agccgtacac
    93661 agcaatatgc tgtacatata ttaatatata tgatatatac atatatatat acacgtgtat
    93721 atatatgata tataccatat atacacgtgt atatatatgt atgtagtgat attcaggatg
    93781 taatgatgtg tacaaatgta agagaaggtt ttatagttgt tcgctctatt actatgttat
    93841 gactattgat gaattccatt aagttaaact tgcatatgat gaaacaacac tgtaatataa
    93901 aaagcacagg actaagagtc tggagactta gattctaggc ctacattttc catttaactt
    93961 cccttttcag ggtccttgct tctagatctt aaaattgagg aagatcttta aggttccttc
    94021 tagttccaaa attctataaa ctgcaatcat tctatctttt tctttagtta ccccaaagca
    94081 agagaaatgg tacagacatt gccaagttaa tgaactccct gggtattttt caggccgtac
    94141 aaaaggacct tttaataata tggaggccac tttgagtccc agcacactga accactgggc
    94201 tttagtatta ttcatgtaac agacccttca gtggagttgc ttggttggtg gaaaaaatta
    94261 tctgagaaga ttgttccaga cctttatctc agaggaagca gggataaggc ctgagggtag
    94321 aatcctgaag agatttcact tgtagctgct gtgagactct gaattaatgg cgtgagatgc
    94381 agaagatgga gagaattcat agaactaggc ctaattatga acttctgaac actgtatctc
    94441 catcaattta tttcgtattc agaatgtatt ttcctgtatg ggtagaacaa tgttcaggaa
    94501 aaggaattat ctcaacattc tgtttttgtt gattgatatg tagaaataca gaaacacaca
    94561 tattgcttag caatacaaat aatttgcacc tgaaattcat ttgcttattc ccttggagcc
    94621 ctacagaaca aaatgtttgt aacaaataca gctgtgctgg cgatactatt gtattagaga
    94681 aaacaaaaag ttggtctctg gtccaaggag cttatagtcc aaaggaagtc agggaggagt
    94741 tttgagaaac ctggcttcta gtttgggttt tctatttccc aaacgtgccc tgtgctttta
    94801 tgatggtgtg gttttgtttc ttgctaattt ttttttaaag ctggcatgcc tgttctcttc
    94861 accagccact taaatgtcac ctctgtgaaa agctttcccc atctcccact ctctcccgtc
    94921 tttcccaccc tcctttgtgc tcatgcagtc ctttcgttag ctcttgatca cactacccaa
    94981 agccacttat attagagtta gttgccttct gtgatttaag ggtcaggacc ctactttact
    95041 catcaaaggg cctctccaag cacgtataag aggacaaaag aagtctggaa acagaggaaa
    95101 gtgcatattt aaataatgtt atttatattt tcaaataata tgttgaatat actttcctct
    95161 agtctctaga caactttgca tacaaaatgc ctctaaattt caagccattg gtgcaatcac
    95221 tgatctgaaa atttaaaaat taaataaatg aaagagtttc cttacgtttc tagatttttt
    95281 ttttttttga aaatctttac tatgtacttt cttcaaaggg gatccttaat aaatttttgt
    95341 tgaatcacac tgggactgtc tttcctctgt acctcttttt ttctcatctg taaaagggaa
    95401 atgctatctg ccctgtctct ttcacgggga tatcacaagg gtaaatgata aaatggctgt
    95461 gcgatagaaa aaagcagtat gaaaatgcaa aatgttgttc tcattaacta tccaaggatt
    95521 tacttcgtgt ccatattgta acacttaagt atgcggactt tggctaaccc cattgcagct
    95581 gcattaaaga agattctcca tgaactgctt aaaagaagaa gccaagaagc tgattacagc
    95641 ttagggcccc accacagcca tcaaaaatta ccaaagggat gctgggactt caagtttttt
    95701 ccagttgtgc aggaggaaag tcatggcctc agccatctgc ttcctgtctg catcatgact
    95761 cacatctttc aggtccactt ggacagaagt tcagatgaga tctgctcagg cctgctgccc
    95821 agagacttga tggtgctaag gttcaggtag ctgtgtgaaa ggaggagcat gaatctagga
    95881 cctgagacct agggggactg agacagggtc acaagctttt tatcttattc agaaaaacca
    95941 gttcaatctt cattactggt ggtgtaagca tgacactatt gcttcttgat aggatgcact
    96001 gaaaggatac aacatcactc ctatgatatt cctgccccaa attcagaatc tgaatctaat
    96061 cacagagaaa catcagacaa gcccaaattg aaaggcgttc taccaaataa ctggcctgga
    96121 ccttttattt ctataatata ctgtactgag agatagagag tctcaggaac ttttccagat
    96181 taaagaagac taaagagaca tgagaacttg caacatgtga ttctagactg gattttggac
    96241 cagaaaaaga acattattaa agcaattgat aaagtttgaa tatggactgc agagtatata
    96301 atagtattgt gccagtgtta aatttcctaa ttttgatgat catgctatgg ttatggaaga
    96361 aaaaatcctt gttcttagga actacatact gacatattga gggataaagg ggcataatgt
    96421 ttgcctgtga tattaatctc aaattgttca gaaaaaatag gcccagtgcg gtggctcatg
    96481 cctgtaatcc cagcacttcg gaaggccgag gcaggcggat cacgaggtca ggaaatcgag
    96541 accatcctgg ctaacacggt gaaacctcgt ctctactaaa aatacaaaaa attagccagg
    96601 cgtggtggct ggtgcctgta gtcccagctg ctcgggaggc tgaggcagga gaatggcttg
    96661 aacccgggag tgcggagctt gcagtgagcc gagatcgtgc cactgcactc cagcctggga
    96721 gacagagcga gactccatct caaaaaataa ataaataaat aaataaaata aataaaagaa
    96781 aaaataataa catgtactat atgtatttat ttgtgtatta ttatgtctac atatattata
    96841 catagagaga gagaacataa aaaataagta aatgaggcaa aatataaaca actagtgaat
    96901 ccctgtaaag agatacataa gttgtttatt atttttgcaa ctcttctgta agtttgaagt
    96961 gatataaaaa taaagagctt taaaaatagt gagaattata ccaaatttaa atgtttactt
    97021 tatttaaaag cttgctctag aaaatgatat aatgtattac ttaacatttg cttgcttaac
    97081 tgtctttaaa aagagtaatt ttaaaaataa aatgaaacac tgaaaattcc acagttaaga
    97141 aaacagtgaa gctctgacaa tgctgccacc tgtgagcttg ttctagtcct tcaatggtta
    97201 gaattgagtc aataaacaaa caagctataa gtgtctgcaa gcccaaccag gaaaaatagt
    97261 ttaacaatgc aaacaactca ccttggtggg cttgtggatg atgggtcctc agcataaatg
    97321 acctgaacat gtcagttgaa aaactgaata acttgagtgt gcagaacaaa tgtccaagct
    97381 gccttcacac tttagggaag gtaccaatgg aaggtgctag ggtaaatcat gcttaccaaa
    97441 aggaaggtaa gggcagaaag gcatatagga ggtgctgtgt ggacagtgga acaacagagg
    97501 tagacttcct atcttggcat gtccatattt acaagcttta ctgctcagca aagaataact
    97561 aaaagccaac tgtaaaaata attttcatcc atctcataat ctctaggtaa cactggttac
    97621 acagagttca gcaaatactt ctttggtaaa atgaaatagc tcaacaggac aatcagaaag
    97681 ttctggaagg ttcaatagca gaagttaaag gggccattaa agcaatggaa cagacattag
    97741 aaattatggg gcaggcagta gccagattag ccaggaaatt aaaagactga tatagaaaaa
    97801 ataaaatgaa aatggatgtc gggcagtggt atgctattaa atgtttagca actggctgtg
    97861 agaaaaaaac aaccttgact tatagtattt gttaatgtcc atggtgtaaa tattcccacc
    97921 atggccaaat tcaagttaaa aacgtgatgt cactcaatac agagttgaga agcaatgtac
    97981 acaatcagct ctcaccacct actgtgagct ggctctggca caccactggt ggtgacttgg
    98041 gtaccactct tatagctaga tatgttaccc tagtaagtaa cttaaccttt tggctctcaa
    98101 tttcctcatc tgtaaaatgt caaacctcat aagattattt agagaatcaa ataaaactaa
    98161 tttatctcca tctaaggtgt cttccagctc tgaccatctc tgattctgtg atgcagttag
    98221 tcatattggt ggctttttgc cttttaccat ctcacaccaa gtcttatgag caggagaaag
    98281 attagagaag gctagaggaa gttgctccac agtatggtaa tcagagaaga agccacatca
    98341 acactgcctt aggtataaga gattcaaggc agtaaccact ccacattttc tacacagaga
    98401 tttctgggtg agaatctgtt tggatagcgg ggctagggaa tttatgtgct tttgttattt
    98461 aatgatatac ttatttggaa tagtttaagg ggtatacatg gtggccaagg ccaatgcaat
    98521 gtgtatatgt gctctcttcc tattcagcat cagaatctgc ctggtaatga aaagtgcaca
    98581 gcaaaagtag gagctattca tttgttgatg tccagtaaat ctaaatggag taagtaactc
    98641 agtgcaactt tttgggactg ctatctttgt tatttccact gactggtgcc tggtgacata
    98701 aagtgccctg aacaccccct ttgctatgtc tctgaactgg ggagacaagt gttcaaccca
    98761 tgtcccaaag catggtttgg ttacctcatt agctttggag accttttgga aaccctttgg
    98821 aagaagtgct agaatgtagc aggaaataag tcaggaattg gaggaataaa taaaagtgac
    98881 tagaagcttg ctccctgctt ggattcgcag aacttcaagc taatgaagcc agatgaagaa
    98941 aatttctggc cttaatgtgc tgttgagtta gaatggaaga gaaaaccttc accaagccat
    99001 tcaggaaact gagtgcaaag aggtttagta aggtataaga gacataatac cttgtgcttc
    99061 cggaggctgc caggactggt gggcgtagag atgggagact gcttagactt gggggtagag
    99121 agctggctgc tggaggttcc gtttgctagc ccaaagcaag agaaaaggaa gggataaaaa
    99181 tcaacagcat acaaaggagc aagttatcct tcccctcaga agacactaag aacattcaaa
    99241 caaggtacac agacaaccaa gtgatttggt gagtagaaca aaggtgaaaa gaaacacatt
    99301 cctcaaaagg gcaagaactg caatcctcag ttaattggat ccctatgcag tgcaaggccg
    99361 tcatcctccc accctttcct cctcagtgct tcccaatcac aggggtctgt ttcttttcaa
    99421 acatttttca tattcattcc caggtttgcc cctaacatct ctgggaggta gtctgggaag
    99481 gtattatttt gctttaattt tatagaggac aaaagtgagg cacagagagg ttatgtgcct
    99541 taccttaaag gggaagaaat atgggcaatt tggaccagtg cacatttcct ttggtgccag
    99601 atgatggggt aggtgaaaga cagccccatt gttagtctct ccccctctct tttcctgctc
    99661 atgccacacc attgtgcctt tcctacctca ttctgcatgt gtgaaaagaa caggaggaaa
    99721 gaaagagaaa ataggctagt caggaagaat gaaagagaca gtggaggaga aacagaaggt
    99781 gaaacattaa aagaaggctc aagcagcaga atgggcagag gagaaaatca aaaaagagag
    99841 aatgacaaag agattgggaa tggaggagaa aagagacaag gaagatgata agatattaaa
    99901 agagaaggag gaagttgaaa caggaagaaa ggcatagaga cacataaaga tccaaaatgg
    99961 tagccagagt cagggttcac ccacatttgg ctctagccaa gaaaatgctg aaaatcagag
   100021 tttcttttta aacatcttta tattctattg tgttgctaaa ttgttttata ttcccttgct
   100081 aatgagcttt ttaatatcca atcatgaaga aagaaaagtg gaatattaaa atggaacaat
   100141 taaacagctt tcaaagtgga ttcttgttca gacacttcat actaaagtat ttttttttta
   100201 atcttaagac ctaaattgtt aaacagagat attctaaaat gtgctttaaa aacaaaggca
   100261 aatatcctga gggtgtgtgg agagtggggg agaggaagag agagggggag ggagaaagag
   100321 tgggggggag ggagacagag agaataaata gaaaattgaa aacattaagc agaaagatgg
   100381 cctgatagtg ttcctgtcag tcccataggt aaagcttaaa ctgctgtatc atgaatattt
   100441 cctgaggata cattccttct gcaccactta tggaaatatt tccctaggtt ctggaacatg
   100501 cacatcaact gtagagatgg aagagacatc agtgtagtgg ttggtaagtt tatttaagcg
   100561 caagtgagct gaaaaatgga cacttaagca ttcactgtct ggcttatatc ctcactcatc
   100621 tctcacccac tcgttgagaa gccctcctcc ctgtcattct ttcatttagc cagtagctat
   100681 ggagctctac ctatgagcca aacaaagtgt tagacatcaa ggtttcagag aggagtaagc
   100741 tacttcctgg gggaggcagg caagaattca gtgaaataaa cgctaagaca tagccttgca
   100801 tagggtatta ttggaattca gagaacagac tttgaaatct ggggcctgga agacaaacaa
   100861 gaggaggcaa cacctgagaa atcttgaaag atgaatctga aataggcaaa tcaatagaga
   100921 cagaaagtag attaatggtt tccaagggtt ggaggaaggg caggagtttg aaagtctgct
   100981 aatgggtaca agttttcttt ggggaaggca aagattctga cattaaatag tgatgatggt
   101041 tgtacaactc tgtggtcaaa ctaaaaccca ctgagttgca tatgttaaat tagtaaatta
   101101 tatgttatgt aaattgtatc tcaataagcc cttaaaaaaa gaagaatcag agatagaatc
   101161 attcccataa caacacatag gcagagagga taggaagggt attctaggca ttagaattgc
   101221 caacatcctt tcatgttaac aacccacaat aatctaggca ttgaaggaac acacttcaaa
   101281 atgttaaggg tcgcaaaaat tttttcccat gttgtaggtt gcctgttcac tctgatggta
   101341 gtttcttttg ctgtgcagaa gctctttagt ttaattagat cccatttgtc aattttgtct
   101401 tttgttgcca ttgcttttgg tgttttggac atgaagtcct tgcccacgcc tatgtcctga
   101461 atggtaatgc ctaggttttc ttctagggtt tttatggttt taggtttaac gtttaaatct
   101521 ttaatccatc ttgaattgat ttttgtataa ggtgtaagga agggatccag tttcagcttt
   101581 ctactcatct gacaaagggc taatatccag aatctacaat gaactcaaac aaatttacaa
   101641 gaaaaaaaca aacaacccca tcaaaaagtg ggcgaaggac atgaacagac acttctcaaa
   101701 agaagacatt tatgcagcca aaaaacacat gaagaaatgc tcatcatcac tggccatcag
   101761 agaaatgcaa atcaaaacca ctatgagata tcatctcaca ccagttagaa tggcaatcat
   101821 taaaaagtca ggaaacaaca ggtgctggag aggatgtgga gaaataggaa cacttttaca
   101881 ctgttggtgg gactgtaaac tagttcaacc attgtggaag tcagtgtggc gattcctcag
   101941 ggatctagaa ctagaaatac catttgaccc agccatccca ttactgggta tatacccaaa
   102001 tgagtataaa tcatgctgct ataaagacac atgcacacgt atgtttattg cggcactatt
   102061 cacaatagca aagacttgga accaacccaa atgtccaaca atgatagact ggattaagaa
   102121 aatgtggcac atatacacca tggaatacta tgcagccata aaaaatgatg agttcatatc
   102181 ctttgtaggg acatggatga aattggaaac catcattctc agtaaactat cgcaagaaca
   102241 aaaaaccaaa caccgcatat tctcactcat aggtgggaat tgaacaatga gatcacatgg
   102301 acacaggaag gggaatatca cactctgggg actgtggtgg ggtcggggga ggggggaggg
   102361 atagcattgg gagatatacc taatgctaga tgacacgtta gtgggtgcag cgcaccagca
   102421 tggcacatgt atacatatgt aactaacctg cacaatgtgc acatgtaccc taaaacttag
   102481 agtataataa aaaataaaaa aaaaaaaaaa aaaaaaaaat gttaagggtc atctacaaca
   102541 aacctatagt taacatcata ctgaatgggc aaaagctgga agcattcccc ttgaaaacca
   102601 gcacaagaca aggatgtcct ctctcaccac tccctttttt gttgttgttt ttgttttttg
   102661 agacggagtc ttgctctgcc accaggctgg agtgcagtgg cgcaatctcg gctcactgca
   102721 acctccacct ctcaggttca agtgattctc ctgcctcagc ctcccaagta gctgggacta
   102781 caggcatgca ccaccatgcc cagctaattt ttgtattttt agtagagatg gggtttcacc
   102841 atattggcca ggatgatctc gatctcttga cctcatgatc ctcctgcctc agcctcccaa
   102901 agtgctgcag ttacaggcat gagccacttc gcctggcccc tctctcacca ctcttattca
   102961 acatagtatt ggaaattctg gccagagcaa tcagtcaaga caagtaaata aaaggcatcc
   103021 aaataggaag agaggaagtc aaactatctc tgtaggtgat atgattctat atctagaaaa
   103081 ccccatagtt atctgcccaa aaactccttg atctcataaa caatgtcagc aaagtctcag
   103141 gatacaaaat caatgcacaa acatcactag ccttccgaga caccaacaac agccaagctg
   103201 agagccaaat caggaatgca attccattca tacttgttag agaaagaata aaatacctag
   103261 caatacagct aaccagggag gtgagagatc tccacaatga gaattacaaa acacttctca
   103321 aagaaatcag agaagacata aacaaatgga aaaacattct atgctcatgg atagtaagaa
   103381 tcaatatcat taaaatggcc atactgccca aaacaatgta cagattaaat gttattccta
   103441 tcaaactacc aatgacattc ttcacagaac tagggaaaac tattttaaaa ttcatatgga
   103501 accaaaaaaa tagcccaaat agccaaagca aggataagca aaaagaacaa agctggaggc
   103561 atcacattac ctgactttaa actataccat agggctacag taaccaaaac agcatagtac
   103621 tggtacaaaa acagacacat agaccaatgc aacataacag agagcccaga aataaggcca
   103681 caaacctaca accatctgat ctttgacaaa gttgacaaaa ctaagcaata gggaaaggac
   103741 tccctattta ataaatggtg ctaggataac tggctagcaa tatgcagaag gttggatctc
   103801 ggccctttcc ttacaccata tacaaaaatt aactcaaggc ggatcaaaga cttaaatgaa
   103861 aaaccctgga agataaccta ggcaatacca ttcagcacat aggacttggc aaggatttaa
   103921 taatgaagac atcaaaagca attgaaaaaa aaaagcaaaa attgacaaat aggatctaat
   103981 taaactaaag agcttctgca cagtgaaaga aactatcaac agagtgagca gacaacctac
   104041 agaatggtag aaaatacttg caaattatgc atctgacaaa ggtctaatat ccatcatcta
   104101 taaggaactt aagcaaattt acaagaaata aacaacccca ttaaaaagtg ggcaaagggc
   104161 atgaacaggc agacactttt caaaagaaga cattcatgtg gccaacaagc atatgaaaac
   104221 atgcttcaca tcactaatca ttagagaaat gcaaatcaaa accacaatga gataccatct
   104281 cacaccagtc agaatggcta ttaataaaaa gttggccagg cacggtggct catgcctgta
   104341 atcccagcac tttgggaggc tgaggcaggc tgatcacaag gtcaggagtt cgagaccatc
   104401 ctggccaaca tggtgaaacc ctgtctctac taaaaatata aaaattagct gggtgtggtg
   104461 gcgtgcacct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgaaccagg
   104521 gaggcggagg ttgcagtgag ctgagatcat gccactacac tccagcctgg gtgtcagagt
   104581 gagactcggt ctcaaaaaaa aaaaaaaaaa aaagtaaaaa aataatatgc tggcaaggtt
   104641 gtaaagaaaa gggaatgctt atgcactgct agtgggagtg taaattactt caaccattgt
   104701 gaaaagcagt gtggtggttc ctcaaagaac ctaaaactga attacctcag tgtaccctaa
   104761 aacttaaagt ataataataa taataataat aataataaca ataataataa taataaaact
   104821 gaattaccat ttgacccagc aatctcatta ttgggtatat accccaagga atatgaatca
   104881 ttctaccata acaacacatg catgtgtata atcattgtag cactattcaa atagcaaaga
   104941 catggaatca acctaaattc ccatcaatga tagactggat aatgaaaatg tggtgcatat
   105001 acaccataga aaactatgca gccatgaaaa agaacaagat tatgtacttt gcggtaccat
   105061 ggatgaagct ggaggccatt atccttagca aactaacaca ggaacagaaa accaaatact
   105121 taatttccca cttataagtg ggagctaaat gatgagaaca catggacaca aagaggtgaa
   105181 caacagtcaa tggggcctaa ttgagggtgg agggtggaag gagggagagg atcagaaaag
   105241 ctacctatca ggtactatgc ttattacctg gttaacaaaa taatctgtac accaaactcc
   105301 catgacatgc agtttaccta tataacaaat gtgcacatgt atccctgaac ctaaaataaa
   105361 agttaaaaaa atgcaccagc agggacaaag atatgaatgt gtgagagaat acattatgtt
   105421 tacagaagtg caagtaactc ataatggctc aagcacaggg tgccagcaga gagtttcagg
   105481 agatgaggct ggagacagag gataagggtc agcccctggg ggcctttttg tgtagcctgt
   105541 taagaagttt gcatttggtt tctgaaggag agtcacagaa gaattttaag ctgaaagtga
   105601 tatgatcaaa cttgcatttt aaagctccag ctgctgtggt tggttaagac tgagggtaga
   105661 gtgggaaggg gtagggtaga agcaaagagg ttggttagaa gacattgcac taatccaggt
   105721 gagggatgat gatggttcac taaggccacg ggggagggag agaatgaaca ccatggtaga
   105781 tagggaccta gaggccagat aagcatagag tagtgtgttt atataccata caaccaccca
   105841 aagaacatac atacatacaa ccacccaaag aacatgacct gtgccaaata gcaccaagaa
   105901 tgtgggcact agactttgtc aaggcactag gctttttgtg aataacaaat gcactttgga
   105961 aaaggcaaga gtctgtggag tcgcaatggg gcaaagcagt caattactct ttgtaggaaa
   106021 tgaaggagtt aatagttgaa acagttaatg gaccacctac agtgcctgag ggcttcccca
   106081 gcaggacctg acctgtcgca ctcaaggtaa gaagcaatct tggagaccta tatgagctga
   106141 cacaatagaa ctcttccaaa agagctgtga cctcggccat aaagtctgag tggtgggcaa
   106201 ccccggtaag ttctcctctg gcattttgtt ttacatgatt gctatatact ttacagattc
   106261 tagtgtccca gtatttgtta gccaagcttt atagcctctc tatatattct ctgcatatgt
   106321 gtacgtttat atatgttccc aattcccctc cacccattac tctccctttt aaaattaaga
   106381 attaatctta agctgtctgt aaataaagga cttaattttg tcatcaacat tcaacaagca
   106441 ttttggaaat cagcttttaa taatttaact accattcact tcactgttgt aaaaatgcag
   106501 acactttgtc agacgttccc agagttccct ggtagaatga aatccaaaag tggacataga
   106561 gaaattaagt aatgtagact tgcacagtta aatgtagatt cattacaagc taaaggaggc
   106621 cctacattta cagactagat aaaatgtatt caagtgcaca ctttgtctat ggattctgtg
   106681 gggaaaaaaa tctggaaagg tgccatttga catggattct agagccaatt cacataaatg
   106741 tcaagtaaaa gaagcagcag gaaataatac aaatagggtc tgatgaactt tgtgtccttg
   106801 gattggtaaa tccatgtaga tccacatttc attttaagag tggctggaaa tggcaccact
   106861 gtttgatatc cttcattttc ctttattctt ctttgctaac catccttagc tcttgtctct
   106921 gagctttcat ttctaagcgg caattaggaa atgccttggt ggatcttctc cttaacttgt
   106981 ctctctttct catttccctt ctcagaacca cagcattgtc tcactgttct gaccttttat
   107041 ttatatatgg tacaaattct ctcatctctt tcaactcttt tatagtccat ttttcccctt
   107101 ggcaacaagg ggcatcaaaa acttttctat ctcctaggcc accttttcaa tcttatcaac
   107161 aaaaatagga taaaaattac tctttattga gtgtctatga tgggtcaggc actcttcttt
   107221 ctctctcttt ctttctctct ctctctctct cacacacaca cacacacaca catgcacaca
   107281 catgttccag ctaggggtag caaacacaag atttgaatgt aagtctcttt gattccaaag
   107341 cctatatggt ttttttgttt ttgctaacat aggctcttta aaaaataata ataataattg
   107401 cttctgaaat tccttcctgt gacaattttc tgatggaggc tagggaggag ggtgaaagtt
   107461 gtggctgaga aaagagaaag taatgaaaag aagatagaca ccagagctga cacccatctt
   107521 cagggcttct atactatttg tctggtttag atttagaaag tggttagcaa gagctttctt
   107581 tcattcaccg aaccagctgg gtatcataca cagtgtcata aaacaatagc agacagagca
   107641 ttagcatgac atgtaactac cctcatgtta cagagttctc tgggaagata gtatctatgc
   107701 cctgcctgtc aaccctctca gttggtgccc ttcagctacg cagcacaatt gccaagcttg
   107761 gtcctgcagt cttggttcac tggctcctct gaatgattat gcctgttctt gcctttgtac
   107821 tcatctttag ctccttggct ctgtctgcct ttcctgtcag aatggaacag gacaagccca
   107881 aacaggatgg ttggtaaaga aactaggctg acttatctag atttacagag atttaaatat
   107941 atgggggaaa atgggtccct gggggatgag gaataagggg tatgtgtgtg gatggggtag
   108001 ggagaggtta gttcccaaag aacaatgaga gagtccagtt tttggtgcga gtggccaagg
   108061 acagagtggg gtcactagcc aggcagagtg gatgctaaac tcccaaactg tggataccct
   108121 tgtctgggag gctggaccaa ttcctggaaa ctaggaagca actgagtaca agttgggcat
   108181 aagaggcccc aggtgtagca attatagagc tgcctaggca gcaggttaca tgagtcatta
   108241 tactttctgg gctcaagttt gcatgaccag aatggtacag aaaataaagg gacagcttgg
   108301 catctacttt tctcccaacc tttgaccaca gaccaaacca agttaatgct cattcactta
   108361 gccctggctt tccctcccat cctaagccca acatttgagc ccaactcaaa atcaagcctc
   108421 gtggttttta tttgatgtca ttttgaaata ttgctcattg aaaacaaccc cagtttatca
   108481 aaagagctat gataataatt tcagagccct ttttataatt tcagctgttt ttctgcagtt
   108541 atttttagag cattttacca tacatgaact cacattattc tcctgatagc tctaagagat
   108601 ggtggccagg tattactagt ctcctttcaa ttaacaagga tataggggct cagagagggt
   108661 atggcttgcg taaatggtat aaactctgcc ccttctcact ggatagtggc ttctggactt
   108721 gctcacagat tctatttcaa aatttcctta aagttagcac accacatacc tcccactgtg
   108781 aacttctagt gattcatttt ctgtcttgtg ttctgtaact ccctaatgcc ttgcagaggg
   108841 agatctcaac acacaattct gccacttcct catttttctc atactgagat gctctaagat
   108901 tttattcatt ccagtccaaa gataccttgt tttcccctta caaatagagg gaattttcct
   108961 ggaaggccat ccaagtggat ccaaggtggt ttcaacttac ttcagaacaa ggtgaagaga
   109021 cagaccctgt ggttttcaca aatacattcc aaagagccct tgggcagggt gtctggattg
   109081 ctttggctaa ttgtttctgc tcaatggaaa agcttttgtc cctctcccta gacccatcca
   109141 ttaggggagc aagaactgag agcctgggaa tagttcttca atgatttgat catagggttt
   109201 gctctcaagc caaattcaat tgtaggttgt gagacactgt aacttcagaa gatggaattt
   109261 gaattttaaa tatatttcct acggaatctt ggacagcatg aagttatgaa tcctttaaat
   109321 aagggggaga agtaagagtc ttccttgtta caaaacaaat gtatataccc aatcctgcaa
   109381 tattttgaca cagatgagtt tccctcaaga agaggacatc tgtagagggg ctgatggaag
   109441 tgccagaatt gttttctcag ggttgacaat gctgtcaaca tttatttcct actgagaaac
   109501 agaacatgtt tctggtggag ctagcctgtt gtcaacgctg actgctggtt ttcctggtgc
   109561 agggactagg ataaactgca ctgttcttaa tttgtgaaaa gaacacaggc tttggaatca
   109621 gtcaaagttt taaatcctgg cttagccatt tcctagattt gtgatcctgg ggctaaccta
   109681 acctctctga gtctcagttt ccccatctgt aatatttgaa taataattat ctcaattgtt
   109741 gctgttttga tgattacatg gcagactata cataaaatgc ttatctaagt gcctgccaca
   109801 tagaaaagag ttcaataaat tgtattttct gtcaatatta ctaccaccac tactagcact
   109861 gagtaagtat tcaatttata ggcactgagt taggaaatcc tagcccacaa gagatgagat
   109921 atcattctat gtaggctgag ggctagttct ttggctagtt ggtcaagtat cagataatcg
   109981 aagctttcct taggtgaagc ctattatttc tgtttttttt tttttttttt ttttgcattt
   110041 tactgctgtg ctccccaaag caaaatgatt ggccctatcc tgttctctct ttttaccctc
   110101 tcagcattat tatggcaggg gaagctgtgt tggtcaatga acagtgtgga gccactgaaa
   110161 gttttggagc agggaagtag cacaatgaaa atggtgtttt aggatgagaa atgtggtggc
   110221 accacaccac atggataagt ggacagagat ttgaagcaga aatattgcta aggagtctgc
   110281 gaaataatcg agacatcagg tcataagaga aaggaaagga ctgctgtata agtcacatgg
   110341 agatggttca catgtccaac aatgataaca acagcacaca gtgggtatgc aggattaatg
   110401 agtgggaagt caaaaatagc accagggtgt tgaacccggg aagcatgata attctattgt
   110461 cagacaaaga gatgggaggt ggaaatgctt ttgattttta gatagcttgt atttgtcaca
   110521 tataagccat atacacagga ttgtttagta ggcattatgg gtctgacatc cagaaggaag
   110581 atcggggctg gagaggtaga gttaagagcc tcctttgata agccatattc atcaccacct
   110641 ctgccccttt gttcatgctt ccctgacagc cagaagtatc catctcctta tcaatttaca
   110701 tatagccctt cttcaagttc cagttctccc aggaattcct cccttagcac cttagtccct
   110761 aatgagctct ctcccatgaa ctccaatagt tcccataata cttactacca gtatctatta
   110821 tgcattttgc atttaaataa gttattacca ttaacatggc attaatggtt cagtggttaa
   110881 aattgtggac tgtggagtca gattgcttgg gtttaaattc cagttctatc atcttactag
   110941 ctgtgtgatc ttgggcaggt cactaaactt ttgtatgcct cagtttcttc atctgcaaaa
   111001 tggggataat agtaattgcc tcacaaggtt gttgtaaaga ttaaattcgt taatacaagt
   111061 aaagcgatta gaacagttgc ctgacacata ataggagcta tataagtatt tgtaaatata
   111121 tttttaaaat ttacatgtat actggttacc caaataaatc ataaatttct tagtggcagg
   111181 gaattcatct taacatccct tttcttcata gcacctagca ctactaagta ctttagaggg
   111241 gtactttgca cattcagttg tatgtaaaca caagtcccaa gccaagactg gctgataaga
   111301 ctcatgtagg gaaatatcac catataccat gcactatggg gtgctcactg gcacccacta
   111361 tactgcttgc tccaatcatt ttctaagcag gttttagaag aatcagattg gaggaggaac
   111421 tggggaagtt tttctcttcc cacagctctg tcaaggacac cacttccctt tggagttgct
   111481 ggtaggggag ctaggggagc tgaaggaatg tctaagctgt cttgtttcag tgctctgtag
   111541 cactgaaagg cacctaatta cttgtgggac tttgctcaaa gcagcacaaa ggaaatgaca
   111601 cagacaccaa cccacccctt gcctttccac ctccctagcc tggagatggg agaaggggaa
   111661 aagaagccaa gaagcaagga ccatccaacg tagacctttg cattcaccag acagcacatc
   111721 ccacatacta actactgtgt gtcagtggga cactgcgaat aatgcttgtt gggaagtcag
   111781 gagagcgatg ttctggctct tgggtgctaa gtaatgtgtg acatggaaga tcccaagatc
   111841 ctaaactttg tatcttagtt ccccaatgaa aaaaaaagat aattatcaat ttccaactta
   111901 attcatagtg caattatgag gaacaactga ggtaatagat ttggaaatat tttgcaaagt
   111961 gcatagcatc taattcacgt aaaaagattg ttatcatgaa tctctcttag ctagagaaga
   112021 aaggtattct tgtgctgttt aaaacagtag ccaatagtca catggctgtt gaacatttga
   112081 agtgcggcta gtcagaactg agatacgttg taagtgtaaa atacaactca atttcaaaga
   112141 atgtgtaatt tattacatta attatatgtt aaaatgataa tatcttgaat gtatcattat
   112201 atgaaatatg tcaaaattaa ttttgcccat tttttaactt tttaagatgt ggctactaga
   112261 aaatgtaaaa tcacatacat gatttacatt tatggctcac attctatttc tattgagcag
   112321 agctggtcta gacagtcact ttcaggaagg tcagaatgac caagtccctc tatcttaggc
   112381 aacccaaagc agcctactag tttcagaaaa gaaggaccct gggatacagg tctttcatga
   112441 accctgctgt tttgtgtccc ttgacctccc cttctaaagg ctgtatattt tgttagagaa
   112501 ggctttcttt cttatgaatt tgggaacgga aaagatcgga ctttcagaaa taggtgcaca
   112561 tgagttgggg gaaggggaat ggagaacatt ggattgctac aacaatgact tactgttgaa
   112621 gaatagcaaa ctaattatca cattacaatt ctctcagcct accagggtgg gtaattagct
   112681 tccttcgggg gctccttaaa aaaaaagaaa gaaagagaaa gattccttcc cagcaactag
   112741 ctattttgtc cttttaattc aattcaattc aactggggac aaagttacat aaagggaaaa
   112801 ttaacacatt aatggctgaa atggaaacag agagagacat gtctttcttt agttctgaag
   112861 ccaggcttac cttcgctcat caaaatggca agtccggggt taatcaaagt gtggcactgc
   112921 ataagctttt ttcaaatatc ccataggagc ctcctgggct cagattttat cttttttacc
   112981 acagttccat ggcctgattt aaagacagac atagctaata tgacttggat tctaagagtc
   113041 catcagttct gcccggtgaa aatgagctcc ttacagaaat attatataga caacaaaatc
   113101 cttacttaac ttgagtttag ggcttagcaa cagacattcc atttggggca aactcaatat
   113161 tggcagaagt gaggattgaa cttgaaggaa ataaagtaga aggatttgat ttattcatgg
   113221 ggcagaacag gaaagggaca acttaaggtc taaagactaa ggaataaaga atcaaattat
   113281 agcacaaatt acagaccaaa tagcatcaat ttactaaacc tcaagtcaga atgaacctag
   113341 ctattaactg aaccatgaat ttgaccatat agatatagat atagatatct agcaaacttg
   113401 ttcaaaccag aatcaaacct gtatattttc caatgaagta gaacatcaaa acattcctta
   113461 aactgaacct gaacccagcc aatgtttcat ttggttagaa ttcctgctaa agaggagcac
   113521 ttttggtctt tcttttgtct tcaaaggaag acaatcttcc attggtctat cttccataga
   113581 cttggattct aagagtcttt tgttactcag gttgaaagaa gttttataaa tctgttcaca
   113641 agttccacat ccaagactga ctaacacagg atggaaacag ggtggtacta ccatgtccta
   113701 gaaatgaaca aagcctgttt gccccaccta acctattatc atttggttaa gtgtatttta
   113761 acctctacat gtccctggct aacagctaaa aaaaattgac attttatgaa acaattgtta
   113821 ctttttaaat ctcaaatgtc tgtaaggaga aatttatgcc atatgaaaag tatgcactgt
   113881 ctatagctta tcattctggt tatccgcttt tgtttctgtg ggagctattt tacaactctg
   113941 taaattgtaa ataactgata acaatggaaa attatcctta tctatagaga ttcaaattct
   114001 gaattatcta acgatgatat cgaatctcag aaaaggcaaa aatcttgcac aaatcacaaa
   114061 gctaggaagt ggcgtagctg ggacttctta gagagtcatc tccctctcct ctttcatatc
   114121 ttaccaatct cctcgtccca gcttggaatg gtcctgagtg aggtggaagt ggagagggaa
   114181 aatgggtata gcaaatccaa actttagcta aggtactgtt ttgcctgaga tcagctttgc
   114241 ccttatggaa tgttcactga atggatgcct tggactaggc tttaaacaaa ctccgatgaa
   114301 atgggcttgg ttatgttaca gcccttcagg gttggaatca tgaggaggct atagacacat
   114361 tatctgagat gagcccaaac attctggcaa ttgccataaa gattgtctag actagggttt
   114421 gacttcattt atgttcatga acatcagaat ataaatgaat attccaattc tttaaactga
   114481 gaacaccatg ttagattggt ggtaggaaac agaggttatt tgtaattctg ctatctgcca
   114541 agcaaattcc cctcatacac ccaaattggt tactctgttt atcgtgtggg ttggaataaa
   114601 tcctcatttc tttttttctt tttttttagt gaaggagtta tgaaaaagca agagcagatc
   114661 ttgcctgcca aagcctgggt tgggatctct tatattgtaa ctcccttcaa tttgacaaat
   114721 attcattgag ttctaatata ttcacgttat atctctgata ttttagagtc actgaggtgc
   114781 atattttgta gaagggaaat tctgtaggtg gcattctgga gatcaatgat ttgacataga
   114841 tttaaactgt gtagttatag gtagttgtgt tcatgtgtgc tgggtaacca attggatgca
   114901 gtctgcaaag aggatctatt tacttcaagc tgataccacc tgagaatttc taactgatgg
   114961 gttatatggt ggtttaaaaa cagatctaca aattcttcaa cactctttcc tttaataaat
   115021 agagtctggt tcctctctct ttgaatgtag gttggactta gtgactgctt ccaatgaata
   115081 taataaagga gaagtgacag tgttggctat ggagactaga tcacaaaagg caatacatct
   115141 tgttctttat tcattctctt ggatcacaaa ctaaggtagg agatcttgcc aacagccata
   115201 ttgtaagtgg atcctacagc cccagttgag ccttcagatg actgcagccc ctactaatat
   115261 tttaatgata cccacatgag agactctgag ccagaaccac ctagataaaa cactctagaa
   115321 tttcttgccc accaaaactg tgagataaaa aaaaaatgcc ttttctttaa gccactaagt
   115381 tttggggtaa tgaatttgtt tcacagcagg atacaagtaa tacaggttag aaatcccaga
   115441 aatagctact gctgtttctg agaaatgatt agctaagcaa ttattgccag actttattaa
   115501 tcttgaccaa gtgttttaaa gtgtattttt tgcaattctc tgcttaaaaa ctgtcaaatt
   115561 ttattctcag cccaggtgtt aaaacctagg tctttgtagt ttcagcaata taaatttgct
   115621 tagatataga tttatgtgct cttctgtaca tttgaacaag atagcatcaa ggcacatgta
   115681 ttttttattt cttcagcaaa cattatctaa atgcttgatt ggtctctagc accatgttgg
   115741 gtaaaatagg tactgttgtg catctattta ttcacaaata ttactggagc atctactagg
   115801 ggccaggcac tatgctctgc actgatgata tgaaaatgaa cctaccacat tacctttgag
   115861 gggctcagca tttaattggg gaagtagaca ggtaaacaca tacttgacaa cacagtgtgg
   115921 tgaatattac cccaaagata tgaatggcat gctatggtat tgcagagact ggaacaatta
   115981 tctctatgtg aggtcttgaa ggatgaatag gagttttcaa gcaaaaaagg ctgatgggtt
   116041 agaggatggc attccaggta tgggggaaca gggagtgcaa agacatggaa gtactaaaag
   116101 cctgggttgc cttttcatac tttaatttcc tcaacccaag aataaagaga tggctggtaa
   116161 tatgcatgct cctggcagtc tgatgcaggc agtacaggct attcatcttg gcaccagcat
   116221 ttcatagcca tcaggctact catgatgccc caaactgaag ggcccttttg ttttcccaag
   116281 gctctcagac tgcaagaatt ctcatgataa attctgagcc ttgggatgac aaatgtataa
   116341 gtattttgga ccatatagtt tccttagtgt gaagaacctc aaattgtgca taaatctaag
   116401 cttctgaact tggacttcta aggcctttta tttttgatat ttgttataaa agaatggaag
   116461 acattgcctc ttagatcagt gtcactcaaa ctgtgggtcc aagacaagat agggttcaca
   116521 tcagaatgca aatcaacata ctgctgcctt catcaataaa atcaggctct gaaaaaaagg
   116581 cagctgagct aaatggtttg cttaatggca tagtttattt acatttggca caagctccta
   116641 atcttgccac agactgactg gtaataaata gttcatgggt tggcagtgat caatagacga
   116701 catgttttag ataactcact atccactaaa tagaatcttc tatggcaaat ctctttgttt
   116761 tcttttagaa cctggtctgt gctttacttt ccggttgtga ggtttataca atttatcttg
   116821 tccccctatt tatatggcat atatatgtag tgtacttgtt tccctttttg ttttggcttc
   116881 taattacttc agagtcaaat ttgaagctca agtgagtcaa ctatgttgac actttggggt
   116941 gttgtttttt atattctttt tcttgattct gatttttaat ttcaggttta tggttaattt
   117001 gattattcca ttgaatatgt ttttgccaca tgatgttgaa gatgatgaaa atgatgacaa
   117061 ccatgaaaat aaacattaca gcaagcatag acaatgggcc aggcttctgg tttgcatcca
   117121 tttgaatgga ataatggttg tagctgtgcc tccatgactc caaacaagtt tgactgtttt
   117181 agtccctgaa ttgatagaat agggtttcat ggagacacag tggtgttcat gaggaactga
   117241 ttcctctgca tgcagaaagt attgtcctcc ataaatgaag tcagcgtgca cagttaggaa
   117301 aagagcactc accgtcttgg tcgttacctg agtcagctgg agacttgctt cggcgcatag
   117361 gaccagggct cttggctgaa gtcttctgag gtgttgggga tgcctttggg ccagctgtgg
   117421 ctgatgggtt tcccttcatg actcggcatt ctggggcaaa aggacacaga cagcttagta
   117481 gaggataaaa caggctcagc atgttatcag cctctggaaa ccgcaataac catcacaggc
   117541 caaaggacct aaatgggtca ggactacaat gtggtcctta ttattagtag gagccatggt
   117601 tctggggagg gaggcattag ggaaaccttc aaaagccacc tgggctaagc taggtgttta
   117661 agcccctgaa ttacgaaaac aagggcaaga gtccataaac aggtcaagta ggcaattaaa
   117721 tagtcaattc aatcatttat ggacaagcta gctatgtctc agaatagcac ctattgatat
   117781 aatgtttaat taatttccct gaattagaga ttaacaactc agaggttaca gctacacaga
   117841 tgaaacatcc attgagtaat ttgaaggaga ggtagagagc taccacttac tattacatgc
   117901 cacacgttga tagacacatg atggatatca tgtcacttta tataaataag cactatgacc
   117961 accaccatct tgcatttatc aagttctttt cgatttccca tgtatgttat tgccatccct
   118021 tatggggggt taatgaataa atgggccagc agtgactctg ccactaaaaa cttaagaaag
   118081 cttgctggtg tgtgggagag tctgccaaat gttcttgtat ttgccctagc ctcagtctga
   118141 cacctttgga acaccctttc tatgcaggtc actcagcatc cccttctatg gtcattttgg
   118201 gcactaatag tgctaattta ttagaaggat tattattctc attttgctga tgagagtaca
   118261 aaaaacttaa cccctttgcc tagggccaca aaacaagaaa ttagtaaaac aaggtctaaa
   118321 agtcaagtcc cctgagttcc agtttagtgc tctttctact acagtttttc caaccttgtg
   118381 ctgtgatgca caaaggggaa tcaaacagca gagtagagga tatagtctct cctttccttt
   118441 aaacactttc ttgttgcctc ctacttaaag gtgcattgaa atataaatta aacagaaaga
   118501 ttagaggagc ttttctggta aattgaatat tttaggaaac tgagttaagc agaaagcccc
   118561 cttttgtttg tttcacttct tgttttggtt ggtgaccttt agtgatcact gtccagagct
   118621 gaagatacag aaggtatagg aaattggaca taacctgagg tttgtttagc tattctatag
   118681 tcattgcttc ttcctaaagg gaacacagca gaaatgccac ctcttacagg gttcttgtca
   118741 cttgagttgt ttataaagtc tatattagca aaattccctt tgattttttt tcacactttg
   118801 cttctcctaa atgggaaata tgctctgaat gtatctttct atcagcatca cattctaccc
   118861 ttgttgctgc gtgagctaat taacctttaa tgtatacctg gagcttggac tcaaaccaag
   118921 ggagatggga agacgattag atccttctta ttattaacaa ccctctgtaa acctgaatca
   118981 caaggcagac gaatggggca gcaagtctga tggtgtgaaa agggtttgac actaagaata
   119041 ctgtggaaaa atctctctaa attggaacca taatcacaag ctaaatagtt ggaggacgtt
   119101 gaatggagaa aaaaggatat gtacaatatc caagttctga ctgaaaatat tccttcctgt
   119161 ggtctgattg tgcctgttat cccatgtgac acaatatatg gttcaagaaa tatggtcact
   119221 attggggcag atagtgagtt ttattgagtt tggagcctgg gactctggga tatccagtca
   119281 gtctttcagt cttacatggg tcatgaccca tgtatccaaa tatacaggag aaagaccaac
   119341 attataagcc cttgaaggat catgcataga tgttttccac accatacaaa cccatggaaa
   119401 tcctaaagga tagaagggga gagaacaatg gagcaataaa acccagtggt tatgcttacc
   119461 attttcatcc agagaaaaat catcctgagc atagcgaaat ttttcaggac cacaggcaat
   119521 aaacacatca tcatcaccaa agaaatcatg gagacaagtt acctatggag aaagcagaaa
   119581 cactgattgt atatgatggt cctgtgacaa tgaacctcac actacactga caaggagaaa
   119641 agcttcccat ctaggccatc atcttcagct cccaggagtt ttctctttct taaaacacct
   119701 accacattca tagtctgaat catgacccct agtgagggat gatatactga gtaccattgt
   119761 ctatccatgt ttcatgtatg gtactttcaa ttgcatcaag agtaagactg agaccatgtc
   119821 caaatggaat gccgttttct gaagaaaagt caaccagcag atggccctat aaatgtcacc
   119881 ttttctcaga gcctcaaaga cgggaagctc ctggcccaac tctagggcaa gggactgggc
   119941 caaggaagat gagaaaaatc ttctgcagcc tgactgtagc tgacccagaa gtggaaatgt
   120001 gctgcttcat ctgaatgagt tctaaaagaa ggacatcatt tcttaactgc tttggacaca
   120061 caaaaattga acttttgcta aaagctggcc gaaaacttgg tagtggccca gaataccatt
   120121 tcttcctgtg aaaccagaag aggaaaactt catcatgagc aatgggtccc aataggcctg
   120181 agtttctact ggtagagcca tgttcccagg accctcatgc ctgctagcat cctaagcatt
   120241 actgcttctc aacaacagac agacagtctc cccaaccccc tttctgcaca aagagatgag
   120301 ggtccctggg aaagtatgct gcttcctgca gtctccccac ttctcagctt tcctaattac
   120361 atctgttaat tagtccagca catcagggaa acccctggag gaaccataaa gtcaaaacat
   120421 gcttatacca cctcttagtt tgaaagtgtt caaagcctct aattgttcta agtttcccat
   120481 tcaagctgcc agcaatggct ttccaaaggc atggtgctca caaactaaca acagcttgct
   120541 gggtgcagag agcaatatgt tacattagaa aaatatttat attaataaca tcacccccac
   120601 ctaaggttaa ctcaatccct gttttcccaa tcaaaacgat tgcaagcctt acacaaaagg
   120661 agatatactt aaagattatc cccattgtgt cagcctccca atcatccaag agatgcccag
   120721 agaagagaag gtgacttgaa ggactgcagg gtcactctca ttaagcctct ttccagtcct
   120781 gggtcttgac ctcatcaaat tggccaacac ctttcttcgg cccacttgtc cttcatccat
   120841 tcattccatg atcacagaat gacaatcaga gctaccaact aactgaatgc ttactctgtg
   120901 acaggcactc tggtaagtac ttccaaatct gtcatctcat tttaccctca caagaaactc
   120961 tagaagtcag tatttattgt cattactatt ttaggcttga gaaaactgtg acccagagac
   121021 aataagtaag ttttccaggt ggagagcaat gaagccaaat aaacacagat ctatctaact
   121081 tcaaagcctg tccttttaac cacttggcaa tacaaagcac ttacatatag atcagtcact
   121141 ggggagacaa agatgaataa gactcaggcc atgacttcat ggagctcaca gtctgatgag
   121201 gtgagataca tcgaaataga gcatttcaat ttagtgtcat gagtactagg atgggattat
   121261 ctgttggtgc tgtgagagct attgattacc atgtccaaat gaccctagtc actggttagc
   121321 actgttagtt caataggcat cctgcatatt gtcctagtag tggatccaat gacagaggct
   121381 acatggtttt gagcctagct gcagacctcc taccaatatc ttggaccttg gctcaataaa
   121441 acacagagcc cttaactctg atacaccaaa tatctctgaa ctatttacct gccgccacat
   121501 ccttgaccct gaagatctct gccatatctt cctatttgag gctgcctttg accctgacta
   121561 gctgacttta tttttggtca tgagtcagga atacctcacc tgtgagtgtc cctctgttag
   121621 tggtctcaag gatttggcac tttgtttcaa gacctgcctg tattttcatg aggactcact
   121681 tagttctccc caccggatca caggtttctg aatctaacag ccaaattcat accttcctct
   121741 atatcttcct gtggttcctt gtagagagct tatcacatag tgaaaattta cttagggctg
   121801 tttctgcctc agctgcatgc aaagaaggta agaaacagga atattcaggg gctttgtaag
   121861 gatacacaaa gacagaaaag taggaataag aaagatgaaa tcaattgtgg cagacaacgg
   121921 agattagcat ttagagactt taaaaggcaa cacaaatata caaatttagg aacaagaaaa
   121981 gtagaaataa tgtgaattca gaaactattc ttcacgtaac tatgctaatt agtttgaaaa
   122041 caaatgaaat ggatgatttt taaagaaaag tctaggccag gtgcagtgac ttacgcctgt
   122101 aatcccagca ctttgggagg ccaaggcagg tggatcacga tgtcaggaga ttgagaccat
   122161 cctggccaac atggtgaaac cccatcttta ctaaaataca aaaaactagt ccggcgtggt
   122221 ggtgcacacc tgtaatccca gctactgggg gtgcggggga gctgaggcag gagaattgct
   122281 cgaacctggg aggcggagat tgcagtgagc cgagatcaca ccactgcact ccagcccggc
   122341 aacagagaga gactccgaaa aaaaagaaag aaagaaagaa agaaaagtct caataattaa
   122401 attagttcaa aaagaaatag aaaattgaaa ctagctaata gctgtaaaaa aaaattagta
   122461 aatgcttcaa acagcaacct tccctgcaaa gtcaccagag cattcatttt ggaggtattt
   122521 gtggaaattg cttcctaaga agaaaaccta aaagctccaa agcagccatc tcagacacat
   122581 gccatgatat cctattgtag gacaataatg tcttacatat tgctctaggg gttgctgtta
   122641 caatgatcca ccctcctcca accacaagcc tctctaccca cacccaggct ttgggaaaag
   122701 cctttggata acccctcttt gggggagcct ctaagaaagg caaagtcccc tgtgcttctg
   122761 acagactcac aatggtgtgg ctggcttcaa acacaatcat gctacaatcc agctgttcag
   122821 catcagatgt ctttgtgtga tcccattcca cattttactg gctcccaaat gcagacaaat
   122881 gcactttgtg catgtggata cccagctttt ctaaatgtct tggctattga gacacggcag
   122941 ccccccatct ttgttacagg gttgggaaaa tagaatgctc tccatcttaa ataagaagaa
   123001 ctcaaagcgc agagagtcaa agtgtcccga agatacacag ctggctcatg gcagagttgg
   123061 gactagcatt caggtgttct gcttcaattc aggacaaaat aactcaattg cctgaaagaa
   123121 catattatcc taaagcactg acttcaaagg catatctata gcctagacag gaaaatctat
   123181 atttgtccta atcaatgcct gctatgcatt agaacatctt actggattga aaaaataagc
   123241 ttctttaagg atatgtttaa tctatggaca acaaaactcc tattcattac tgtaacaata
   123301 accaagtata tggtggagaa ggaggacaag gagagggaga aggagggaga aaagaaagaa
   123361 gaagaagaag gaaaagaagg aggaggagaa gaagaatgga atataagaaa taattctcaa
   123421 atgaaactca cactcactta acacctgctt tttaaaattc tatggaggta atgtgataca
   123481 aagtccacct tagctaacat tacacatgta tgattttcca agttagtttt tttgagatat
   123541 aatttccatt acaagggggc aacagaaata gtaaatataa taatcatcat aatatcactt
   123601 acagcattac agcatcctca gggatcaaag gtcttgctga catcagcttc agtatgaatc
   123661 atacttggat ttttaagtca gggaagagaa cacaaaggca tgacacctac cattataatt
   123721 aatcaacttc cagcacaaaa tcaataaaat caactcacat taacattgaa ccattaactc
   123781 aatgctcttc ctttgaaaaa aaaaaccagc aggtgttggg ggaagggaag ccagcttggt
   123841 tgtgcattag catgaatgat agattatatg gatagtccag tggcttaaat gtcaaattac
   123901 acatggggaa atggagacaa cagatcacgg tgtcagagaa gtttgagtct attagctgca
   123961 aatagaaact tcactgtgtg acttgattta agcttgatta gattctccat ccacatatta
   124021 tagatgctgg cagttcgttt ttgaacatgc ctggttctag tagattctac atttcactta
   124081 cctaactatt cagaaataaa tgttttctcg agaagtaagg tcttgctcac attatctatg
   124141 atgctcactt ttaacttgta aaatactttc ttcaaaagtt caattctact aatattgata
   124201 cattaagcac ttactagggc aaaggaactc tgtaagagat acagaaatgt gtaataaagg
   124261 gccatgtctt caaagagttg acaacataat aagcaagtca ggattacaca aataattgga
   124321 atacaaggca ggctgttgtt agcactacta taaagatgca aagaacaagg gaagcacaga
   124381 agataaagag aacaattcta attatgaact ggggaaggct tcacagagaa aaactgagtc
   124441 ttgaaggatg agtgggatat aggttgatgg aggtattcct ggcagaggtg attgcaaaac
   124501 aaaggtaggt acctttttct taacagccct aggaagagtt gcatcctgtt gactacagta
   124561 aggcattctg attttccaaa gctgtgcaga gcacaattaa ttgcctagac tcacctgcca
   124621 accttctgca caggcaaaac atagagttat ctattggtgg aattcagggt gaagaggagg
   124681 gcagagagtg tctatgagga ggagtggcca gagtggtaaa gaagagggaa acaccgtggc
   124741 aaggaacaca aaccacactg atttttcagt cttgccaaca gtgtgccttg cctatcaaga
   124801 atggcaggga aattttgcaa aatgagttaa taaaatctca gtaaacataa cagatgaggc
   124861 tttttctaaa tgttactgct actcgtattt tggaacattt cccacctacc agccctggac
   124921 cagctccagt cagaggttga cacagagttg gcttggtgaa ctgcagcaac ctttcggaaa
   124981 caatcctgac tttttttttt ttttttatca aactcatgtt tctggggtta tcttgaagcc
   125041 ttacttgggg gctttttatg acaacatttt tgcctgcgca acaagttcct tggttctgta
   125101 gtgccttttt tttttttttg cacagttctc tttattcaat gccctgaact tccctaaaat
   125161 ctgtgctgaa ataataacaa tggcacttac atttgtatgg tatttttact gtttttgaaa
   125221 cacttccaca tttatggtat tttatgtttt ctttcatact tgtaattctc ctctcaactc
   125281 aatccgatca ggcttccatt cccattgctc caccaaggct gctcttgtta aggccaacaa
   125341 tggcttccat gttgtcaaat ccaatggtcc ttccctggcc tcatcttact caacccatca
   125401 gcaacactgg atataagttg atcgagttct tttccctaaa tacttttcct tctttgcttc
   125461 tgtgagatca ccatcttctg aatttccttc tgcttcaatc tcttttgcta gattcttttc
   125521 cttttttaga cttccagtat ttgaaagccc cagtgctcag gtcttacatc tcttttattt
   125581 tttatttaca tatttttact aggtgatcat atccagttcc aaggctttac acactgctat
   125641 ggtttcaatg tttgtcccct ccaaaactca tgttgaaact tactccccaa tgtggcatta
   125701 ttgaaaggtg gggtttttta gagatgattg ggtcatgaag gctctggatg aatgggttaa
   125761 tggattaatg agttaccatg ggaggagaac tgatggcttt tataagaaga gaaagtgaga
   125821 cctgagctag catgttcagt ccccttacca tgtgatgccc tataccacct cagaactctg
   125881 cagaaagtcc ctaccagcaa gaaggccctc accagatttg gccccttgac cttgaacttc
   125941 acagcctcca ggactgtaag aaataaatta tgtttcttta tatattaccc agcttcaggt
   126001 aatctgttat aaacaacaga caattaacta agacacatgc catatatata ctgaatactc
   126061 acacattgat atctccaggt ccttcttacc cctgtactct agacatgtat ttccatccaa
   126121 ctgcctgttc aatatctcca cttgcttgtc tgataggcat cccatgttta atatgcccaa
   126181 gactaaactg ttatttctct cctccatatt ttacttccct tcagctttcc ttatcttaaa
   126241 gaaaaggcac aattctcaca attacccaaa ccagaaacct gagagtcatt tttcattagg
   126301 agccactttt ctccctctcc ctggtcacgg tactagtcta aaatacacta caacatagcc
   126361 tcataactgt tctctaggca gcatgccctc cccatgcccc acaaatacct attctgtgtt
   126421 gcataaagca gtcagagtag tattttacaa ttttaatcag gaatatcact cttctgttta
   126481 aaacccttga aagtctgccc atcactgata gaataaaatc caaacatttt agctccctgg
   126541 gtcggggaag ggcagcactc atctctatag ctccaggcca cacttttccc ttgctggagc
   126601 cagggagctg tactgcttgg tcccaagaca tgtccacaac agcccagcac actggctgtg
   126661 gcagactgtg gccagagtgc ctcttcaggc ctgaccctga ctcatccttc ctcactgggt
   126721 ggggattccc tgcaggaact ttattaactc tagtcagagg ctcaggggca gaactctgat
   126781 ctccgtgggc ctgaacctgt aggagtaagg gggtctgcag tctctgtaga ccagcagact
   126841 tagcctttct tcctggtagt tctgaggaat cagggcagcc cagatgagtg ggttttcccc
   126901 cagtgaggca caccccttcc accaagggac aaagtgcttc attaaatggg tcctgttccc
   126961 catgccactc gactgaatga gacccttcaa caggggttgt cagacaaact catacaggag
   127021 tgatcctact ggcatcaggt ttgtgcccct cgaggtcaga gataccagaa gaaggaagag
   127081 acacccatct ttgctgttct ccagcctcct tgagtgacat ctctaggcat ggagtgaatc
   127141 agatgaatag ggcctgaagt gaacccccag caaaagacag ctaccctaca gaagagggac
   127201 ctgaccattg aaagaaaaac aaacaaacag aaagcaacaa caacagcatc aacaacaacc
   127261 aaagagcccc cacaaaaacc ccatgtaaag gtcagcagcc tcaaagatcg aaactagaca
   127321 aactcatgaa gatgagaaag aatcaatgaa aaaatgctga aaacccaaaa gcccagagtt
   127381 cctcttctcc aaatgtctct tcaagggcac agaattagaa ggagggtaag actgacgaat
   127441 tgacagaagt aggcttcaga agatgggtaa tgaaaaaact atgctgagct aaaggaacat
   127501 gttctaaccc aatgcaaaga agctaagaac cttgattaaa ggttagagga gctgctaact
   127561 agaatagtca gtttagagag gaacataaat gacctgatgg agctgaacaa cacagcacga
   127621 gaacttcatg aagcatacac aaatattaat agccgactcg accaagcaga agaaaggata
   127681 tcagagtttg aagaccacct tactgaaata aggcatgcag aaaagactag aggaaaaaga
   127741 atgaaaagga atgaacaaac cctccaagaa atatgggact tcgcaaaaag actgaaccta
   127801 cgattgattg cagtacataa aggagacagg gagaatggaa acaagccaaa aaacacactt
   127861 cagggtatta tccaggagaa cttccccaac ctagcaagac aggccaatat gcaaattcag
   127921 gaaatacaga gaacaccact aagatactcc atgagaagat caaccccaag acacataatc
   127981 atcagattct ccaaggtcta aatgaaggaa aaaatgttaa gggcagccag agagaaaggc
   128041 caggtcacct acagaggtaa gcccatcaga ctaacagcag acttctcagc agaaactcta
   128101 caagccagaa gagattgggg gccaatattt aacattctta tagaaaagaa ttttccactg
   128161 agaatttcac atccagtcaa actaagcttc ataagcaaag gagaaataaa atcatttcca
   128221 gacaaacaaa tgccgaggga tttcattacc actaggcctg ccttgaaaga gctcctgaaa
   128281 gaagcactaa atatggaagg gaaaaactgg taccagccac tgcaaaaaca caccaaaata
   128341 taaaaaccaa tgacactatg aagaaactgc atcaaatagt gtgcaaaata accagatagc
   128401 atcattatga caggaccaga ttaacagata tcaatactaa ccttacattt aaatggacta
   128461 aatgccccaa ttaaaacaac agactggaaa attggataaa gagtcaagac ctattggtgt
   128521 gctatattca ggagatgcat ctcacgtgca aagacacaca ccagctcaaa ataaagggat
   128581 ggaggaaaat ttaccaagca aaaggaaagc aaaaaaaaaa aaaaaaaagc aggagttgca
   128641 atcccagtct ctgaccaaac agactttaaa ccaacaaaga tcaaaaaaag acaaagaaga
   128701 acattacata attgtaaagg gaacaattca acaagaagag ctaactatta tatatatata
   128761 tatatatata tatatatata tatatatata tccatccaat acaggagcat ccatattcaa
   128821 aaaacaagtt cttagaggcc tacaaagaga cttagactcc cacacaataa tagtgggaga
   128881 ctttaacacc ccactgtcaa tattagacat atcaacgaga cagaaaatta acaaggatat
   128941 tccagacttt aactcagctc tggatcaagt ggacctaaaa gacatctaca gaactctcca
   129001 ccccaaatca agagaataca cattcttctc aggaccacat ggcacttatt ctaaaatcaa
   129061 ccacataatt ggaagtaaaa cactcctcag caaaagcaaa agaactgaaa tcctaacaaa
   129121 cggtctctca gaccacagtg caatcaaatt agaactcagg attaagaaac tcactcaaaa
   129181 ccacacaatt acatggaaat tgaacaaact gctcctgaat gactcctggt taaataatga
   129241 aattaaggca gaaatcaaga agttctttga aatcaaagag aacaaagaga ctacgtacca
   129301 gaatccctag gacacagcta aagcagtgtt aagagggaaa tttacagcac taaatgccca
   129361 catcagaaac ctagaaagat atcaaattga caacctaaca tcacaattaa aagagctaga
   129421 gaggcaagag caaactaatc caaaagctag cagaagacaa gaaataacta agatcagagc
   129481 agaactgaag gagataagca cacaaactcc cccccacccc caaaatcagt gaatccaaca
   129541 gctgggtttt ttttttttta attaacaaaa tagataaaca gctagctagc ctaataaaaa
   129601 gaaaagagag aagaacaaaa tagacacaat aaaaatgata aaggggatat caccactgac
   129661 ctcacagaaa tacaaactat catcagagaa tactataaac acctctatgc caataaacta
   129721 gaaaatctag aagaaatgga taaattcttg gatacataca ccctcccaag actaaaccag
   129781 gaagaaattg aatccctgac tggaccaata acaagttctg aaattgagtc agtaattaat
   129841 agcctaccaa ccaagaaaaa aaaaaaaaaa agcccaggac cagacagatt cacagccaaa
   129901 ttctacaaga ggtacaaaga gaagctggta ccattatttc tgaaacaatt ccaaacgact
   129961 gaaaaggagg gactcctccc tacctcattt catgaagcca gcatcatcct gataccaaaa
   130021 ccaggaagaa atacaacaaa aaatagaaaa tttcaggcca atatccctga tgaacataga
   130081 tgcaaaaatt ttcaataaaa tactggcagg ctgggtgcgg cagctcactc ttgtaatccc
   130141 agcactttgg gaggctaagg tggtcagatc acctgaggtc aggagtttga gaccagcctg
   130201 gccaacatgg tgaaacccca tctctactaa aaatacaaaa atgagctggg catggtggtg
   130261 ggcacctgta gtcccagcta ctcaggagtc tgaggcagga gaatggcttg aatctgggag
   130321 gcagaggttg cagtgagtgg agatcgcgcc actgcactgc aacctgggcg acagaacgag
   130381 actccatctc aaaaaaaata aaataactgg tgaaccgaat caagtagcac atcaaaaaaa
   130441 cttatgcatc atgaacaagt cggcttcatc ctgggatgca aggctggttg aacatatgca
   130501 aatcaataaa cataatctat cacataaaca gaaccaaaga caaaaaccac atgattatct
   130561 caatagatgc agaaaaggct ttgataaaat tcaacattcc ttcatgttaa aatctctcaa
   130621 taaactaggt attgatggaa catatctcaa aatagtgaga acaatttata acaaacccac
   130681 agccaatatc aaattaaatg ggcaaaagct agaagcattt cttttgaaaa ctggtacaag
   130741 acaaggatgc cctctcttac cattcctatt catcatagta ttagaagttc tggccagggc
   130801 aatcagtcaa gagaaataaa taaacggtat tcaagtagga agagaggaag taaaattgta
   130861 tatgtttgta gatgacatga ttttacattt agaaaacccc atcatctcag ccccaaaact
   130921 ccttaaacta acaagcaatt tcagcaaaat ctcaggatac aaaatcaatg tgcaaaaatc
   130981 acaagcattc ctttacacca acaatagaca agcagagaac aaaatcatga atgaactccc
   131041 actcacaatc actacaaaaa aagaataaaa tacctaggaa tacagctaac aagggatatg
   131101 aaggacctct tcaaggagaa ctacaaacca ctgctcaagg aaataagaga ggacacaaac
   131161 aaatggaaaa acattccatc ctcatggata ggaaaaatca atatcgtgaa aatggccata
   131221 ctgcccaaag taatttatat attcagtgct attcccatca aactaccatt gacattcttc
   131281 acagaattag aaaaaaaaca ctttaaattt catatggaat caaagaagac cctgtatagc
   131341 caagacaatc ttaagcaaaa gaacaaagct ggaggcatca cgctacttga cttcaaacta
   131401 tactagaagg ctacagtaac caaaacagca cgttactggt agcaaaacag tcatatagac
   131461 tgatggaaca aaacagcaac ctcagacata ataccacaca tctacaacca tctgatcttc
   131521 gacaaacctg acaaaaacaa acaatgggga aagcatctcc tattcaataa atggtactgg
   131581 gaaaaactgg ctagccatat gtagaaaact gaaactggac cccttcctta tatcttacac
   131641 aaaaattaac tcaatatgga ttaaaaactt aaatgtaaaa cccaaaacca taagaaccct
   131701 agaagaaaac ctaggcaata ccattcagga cataggcacg gacaaagact tcacgacaaa
   131761 aacgtcaaaa gcaattgcaa caaaagccaa aaatgacaaa taggttctaa ttaaactaaa
   131821 gagcttctgc aaggcaaaag aaactatcat cagagtgaac aggcaaccta cagaatggga
   131881 gaaaattttt gcaatgtacc catctgacaa aggtctaata tccagaattt acaaggaact
   131941 taaacaaaat ttacaagaaa aaaacaaaca accccatgaa aaagtgggca aaggatatga
   132001 acagacttct cagaataaga cgtttacatg gccaacaaac ataggaaaaa aagctcaaca
   132061 tcactgatca ttagagaaat gcaaatcaaa accacaatga gatactatct cacgccagtc
   132121 atgcgtcttt atagtagaat gatttataat cctttgggta tatacccagt aatgggattg
   132181 ctgggtcaaa tggtatttct agttgtagat ccctgaggaa tcgccctact gccttccaca
   132241 atggttgaac taatttacat tcccaccaac agcgtaaaag cattcccatt tctccacagc
   132301 cttgccagca tctattgttt cttgactttt taataatcac cattctgact ggtgtgagat
   132361 agcaactcgt tgtggttttg atttgcatta ctctaatgat cagtgatgtt gagctttttt
   132421 tcctatgttt gctggccaca taatgtctta ttttgagaag tctgttcata tcccttgccc
   132481 actttttcat gggttttttt ttttcttgta aatttgttta agtttcttgt aattctggat
   132541 attagacctt tgtcagatgg gtacattgca aaatttttct cccattctgt aggttgcctg
   132601 ttcactctga tgatagcttc ttttgctgtg cagaagctct ttagtttaat tagaacccat
   132661 ttgtcaattt tggcttttgt gttcaatctg gaatgtttca tagactgtca gagctagaag
   132721 agacaacaga tgaaaaggat aaacaaaatg tggtatatcc atacaatgga atattattca
   132781 gctacaaaaa agaatgcagt tctgatatat gttacaacat ggactttgaa aacattatgt
   132841 taagtgaaat aaccagacag aaaaggacaa agattgtgtg attccacttg tatgaaatat
   132901 ctagaagaga caaattcata ggatcaaaga ctggattaga agttgccagc agctagtggg
   132961 aggaaggaat aagaagtcat tgcttaatgg taacagttgg ttgtcatgtt aaaagtttaa
   133021 aaagttctag aactagatac tgatcgaacg atactgtgaa tgtaactaaa gccactgaat
   133081 tgtacacata agattattaa aattaattgc acacattaaa atggcaaaat ttatgttaca
   133141 catatttgac caaattaaaa acattaataa tataatatac ccaaaccatt gaattataca
   133201 ctttaaatgg gtgaattata cattatgtga ataatataac aataaaactg ttaaaaaaag
   133261 aaatgtaaaa atgattatta aaatttattc agaatttact tatattgtag ccaaaaggcc
   133321 ttttaaaata cttattctga catattacag taacagaagt caaagttcac ctatatggag
   133381 atgtcttacc tattgcccaa tataaactag aaccaccacc accataagaa tttaggctcc
   133441 acaagagtag agacctcacc tgtttgttca ccattctatc cctactaagt taaagtacaa
   133501 aaaaacacat atttaaaaat tattattagt agtagtaaca agcaagagta agaagaatct
   133561 aaaacccata attaaatctt agtttcataa ttaaggatat tgcaatataa taaggttatc
   133621 tgtggactac tagtgatcac tgtcctctgc atcttatgga agagtaagtc ataagagatg
   133681 aaactgactt gcttacttaa ttcttgatct ttttgttttc tgattatttg cttaaaaaca
   133741 acacaatgtc accccccttc ccaactaaaa tccatttctt tgagtgcctg gatcattata
   133801 tgatgagttt tgtcttttct ccctctttca cccatcaaag catccatttc ttgaagacac
   133861 cgatttcctt tggcttgcct tgtggggctt agctacagaa aatgacccat cccaggccct
   133921 taagaaaaag tgttgactag ctgattgtga agggaaaaat tgatattctt ttcctactat
   133981 ctgtatcatt ttctgggaat tcccagcccc tgtgcagcca gtttacctat tcctatagca
   134041 agtcatattg agactgacgc ccgtctgatc ctccagccac tagatctgac tgaatttctc
   134101 tccactcttt cctggtgaag agtgactggt cctcagaagc tctttacatt ggtcttttgg
   134161 ctgtaaagtg ccataataaa cttttcgaat agatcgtgtt ctgtcaactc atttttccac
   134221 tgatcttctt agccccatag ccaatggtga ctctcctctc cccattttag aatttccccc
   134281 aggcaaccca gctatattaa atctgggctg tcagatctca gctgagatat tggtcttcaa
   134341 atttattgtt ttcaaccttt aaaatagttt atctttgaat ggtgaagggc cataatgatc
   134401 ctacatagct tttgctgttt tacatgtgtc aggtgccctt atcttcatta cctcatttgg
   134461 ttcattggcc tcatttttaa taaatgaaat gtcaaggtcc atgaaggttc agtgacctgt
   134521 acaaggtcac attgctggct aatgacagag ctgatactag aactcaagat tgtctaattt
   134581 attctgacac tataggtacc taaggtatac tgcaaaggca tttcctagac aggggagacc
   134641 caggtaagga aaaacagcag gagactctga gtagcatgaa aataccagtg gtatgggaaa
   134701 gaagggcaca gaggggagat gtgaaggaat aagactttct accctatttt acctggagct
   134761 agtggtgatt taataacctg agctaggcaa taccagcctg attggaaggc tggcagcccc
   134821 tcgatgctca agggtgttct ttccttattt tgtctttgcc atttttccaa gagtctgtgt
   134881 cagctcccac tgggggcctc tcaaggatga gaacagctgc agaggcgctg gagttgggct
   134941 gttgagtaca tgcccaggac tgacacctgc ccttccacaa ggccctgggg acaaattggc
   135001 tcagagaaac agagacgaga cagaactcca ttcatcagaa agaactgggg tggaaaaacc
   135061 ctgcttgatc ccttggcacc attaagccct aaggttggct ctgggcccca cacaactcta
   135121 ggttctgttt agtaaggtca attaaaataa gagagtgtgc ttgtagggcc cattacagtt
   135181 atctgttgca tttaacagtt tcctggtgtc tgctagacac aaagcatgag agggatattg
   135241 cttttgaatc acgtagggga aggaaaggtt tgttaaaaaa aaaaaaagaa aagaaaagaa
   135301 aagaaaagaa agttattgtc tacctgcatt ctaggttaca acttgacact ggctcactgc
   135361 tttctctctg ggtctgtttc ttctgctata aggagaaggt gctgatctag agccttttga
   135421 gatctatatc ttttttgtgt cacaagccct ttagagaatt tgatgaaagc tgtaggtctt
   135481 ctccctagaa aaaatgcata tatgcccata gatagctagt tttctatata gtttaggggg
   135541 gttaaggatt cctttgaagc ttgtctatag atcccaaatt aagagtctct agactagttg
   135601 atttctgagg tctcttctag ctcagacagt ctatgaaaca ttccaggttg gacacagtta
   135661 atgagcagtg acacccgggc cagaatgctg gcagctgtcg tggaatccca gaggcttaac
   135721 tgtacccccc ttcccaggct tcctgagact ccagtccaaa ggaagcaaaa ggatataagc
   135781 tcattctctg caaactgagt ccaagggaag caaaggagtg caaactgatt tacggcaaac
   135841 caaagactag agtgtgcatc agacccattg cctcccaggt tgctagaatt taggaaggat
   135901 gctaactttt caggagagtg taggcatgag aggcatgacc tctaaatgtt ccttagtggt
   135961 tggttgagga caggaaaaaa aaagcatgac tgcccagttg gtaagggaca tacagcttga
   136021 ttatagatat tctgaaattg ttacttagga gctgacctca acatggctgg acagaatact
   136081 aatgaccata aacatactgc atttccttac ataatatccc ctactctgta agtgtatatt
   136141 ttgctatgca agaagaaggt taggagagaa gaaagaacga gttctcatgg gtacatcaac
   136201 catcagtcag tgtaccagta ggaaatatat gacactccta gagtggataa attgagtgga
   136261 gtttactaaa ggggattgat tacaaaagta tgggcagggt atagaagaac cacagagggc
   136321 actgtagaat tttaggaata gtaatagtga agttcattac cactcttagg cctcaaggag
   136381 agaggggagg tatcaggaac cagaactcag agggaaggag agagaatgac aattagatat
   136441 agagaacact ctgataagag ctatgacctt cagtcaagga atgcccagca acctctcagg
   136501 cagggtgcct ggaaatagat agcctaatct cacatttctc cttgttagtg ctccccattg
   136561 gctaaataca actggaatcc aaaaaacaag ggagcctgtt gataagagtc agcttcccag
   136621 gacacagagc agggtggaat agagtagaga gtagaaagga gatctggagg ggaagtcctg
   136681 gagagtagat ctgccactgt aagcaaaagg aaggagctgt tgttatagcc tagaatatag
   136741 gaaatgtgtg aatagtgatt aagcaataat aataatacca ttttgcattt gtaaaagcct
   136801 ttacgcattt tgtgcttcag tcaccccatt gtctcctctg aaccatatag ttaccatgta
   136861 ggcaaacagg taatttattt tatggataac gaaactgagg gtcaaccaag tttaggtgat
   136921 ttgcaaagac caaatgtcta caaaaaaagg acatggacta gaacttgggt ttgagctcct
   136981 ggtctagtac aatttccact atatcatgtt atctcttctg tacccttagt ccacagagct
   137041 attctgatag ttcttcagcg atctcctgct tcttctgaca cacagtcaat tccaccacaa
   137101 gagcaacgat aagagggaaa attatataca cttcttcctc tcatttttgg gaaagggtta
   137161 gtgtattagt ccattctcac actgctaata aagacatact ggagactggg caattataaa
   137221 agaaagaggt ttaactgact cacagcttca catggctggg gaggccttac aatcatggtg
   137281 gaaggcaagg atgagcaagt cacatcttac ttacatggtg gcagagaaga gggaatgaga
   137341 accaagtgaa aggggtttcc cccttataaa accatcagat cttgtgagac ttattcacta
   137401 ccatgagaac agtatggagg aaaccgcccc catgattcaa ttatctccca ctgagtccct
   137461 tccacaacat gtgggaatta tgggagctac aattcaagat gggatttggg tgaggacaca
   137521 gccaaaccat accagtcagt atgggggctc atgagggaac aagctgagtt ggaagatagg
   137581 taagaggata cagtgttctt tgcactcact attttccaga acagaggttc ttagacccag
   137641 tttttgccaa acgactttag tcaagattcc ctatcctgtc tccctcttct ctcttctggc
   137701 ctctgggtgt tacaccttat cgactatata cttccagtgt aacagatatg attgaatgga
   137761 agtagggaaa taaatgaaaa tgaagaaatg acaagctcaa ggtaaacaat aagaaaacag
   137821 aaacagtata tcttttagaa ttcaactgca aaaagtaaaa agtatttaca catgtgttcc
   137881 agttagcatg taaaccatct attaagtg



&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: AA044211. zk50g07.r1 Soares...[gi:1522068] Links  


IDENTIFIERS

dbEST Id:       659586
EST name:       zk50g07.r1
GenBank Acc:    AA044211
GenBank gi:     1522068
GDB Id:         3760014

CLONE INFO
Clone Id:       IMAGE:486300 (5')
Source:         IMAGE Consortium, LLNL
DNA type:       cDNA

PRIMERS
Sequencing:     -28M13 rev2 from Amersham
PolyA Tail:     Unknown

SEQUENCE
                GCTGCCCCTCTGCCCACGGNCGGGCTCAGATGTCGCCCTCTGGTCGCCTGTGTCTTCTCA
                CCATCGTTGGCCTGATTCTCCCCACCAGAGGACAGACGTTGAAAGATACCACGTCCAGTT
                CTTCAGCAGACTCAACTATCATGGACATTCAGGTCCCGACACGAGCCCCAGATGCAGTCT
                ACACAGAACTCCAGCCCACCTCTCCAACCCCAACCTGGCCTGCTGATGAAACACCACAAC
                CCCAGACCCAGACCCAGCAACTGGAAGGAACGGATGGGCCTCTAGTGACAGATCCAGAGA
                CACACAAGAGCACCAAAGCAGCTCATCCCACTTGATGACACCACGACGCTCTCTGAGAGA
                CCATCCCCAAGCACAGACGTCCAGACAGACCCCCAGACCCTCAAGCCATCTGGTTTTCAT
                GAGGATGACCCCTTCTTCTATGATGAACACANNCTCCGGAAACGGGGGCTGTTGGTCGCA
                GCTGTGCTGTTCATCACAGGCATCATCATCCTCACCAGTGGCAAGTGCAGG
Quality:        High quality sequence stops at base: 437

Entry Created:  Sep 4 1996
Last Updated:   Sep 4 1996

COMMENTS
                This clone is available royalty-free through LLNL ; contact
                the IMAGE Consortium (info@image.llnl.gov) for further
                information.
                Possible reversed clone: similarity on wrong strand

PUTATIVE ID     Assigned by submitter
                PIR:A40533 A40533 cAMP-dependent protein kinase major
                membrane substrate ;

LIBRARY
Lib Name:       Soares_pregnant_uterus_NbHPU
Organism:       Homo sapiens
Sex:            female
Organ:          uterus
Develop. stage: adult
Lab host:       DH10B
Vector:         pT7T3-Pac
R. Site 1:      Not I
R. Site 2:      Eco RI
Description:    1st strand cDNA was primed with a Not I - oligo(dT) primer
                [5' AACTGGAAGAATTCGCGGCCGCCTTTTTTTTTTTTTTTTTT 3'],
                double-stranded cDNA was ligated to Eco RI adaptors
                (Pharmacia), digested with Not I and cloned into the Not I
                and Eco RI sites of the modified pT7T3 vector. Library went
                through one round of normalization. Library constructed by
                M. Fatima Bonaldo.

SUBMITTER
Name:           Wilson RK
Institution:    Washington University School of Medicine
Address:        4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108
Tel:            314 286 1800
Fax:            314 286 1810
E-mail:         est@watson.wustl.edu

CITATIONS
Title:          The WashU-Merck EST Project
Authors:        Hillier,L., Clark,N., Dubuque,T., Elliston,K., Hawkins,M.,
                Holman,M., Hultman,M., Kucaba,T., Le,M., Lennon,G., Marra,M.
                , Parsons,J., Rifkin,L., Rohlfing,T., Soares,M., Tan,F.,
                Trevaskis,E., Waterston,R., Williamson,A., Wohldmann,P.,
                Wilson,R.
Year:           1995
Status:         Unpublished


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMProbeSetProbeSetTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: AA296696. EST112418 Aorta e...[gi:1949188] Links  


IDENTIFIERS

dbEST Id:       938163
EST name:       EST112418
GenBank Acc:    AA296696
GenBank gi:     1949188

CLONE INFO
Clone Id:       (5' end)
Source:         ATCC
Id in host:     115692
Other ESTs on clone:THC178829
DNA type:       cDNA

PRIMERS
Sequencing:     M13 Reverse
PolyA Tail:     Unknown

SEQUENCE
                CAGCAGCCACCGCCGCGTCCCTNTNTCCACGAGGCTGCCGGCTTAGGNCCCCCAGCTCCG
                ACATGTCGCCCTCTGGTCGCCTGTTTCTTCTNACCATCGTTGGCCTGATTCTCCCCACCA
                GAGGACAGACGTTGAAAGATACCACGTCCAGTTNTTCAGCAGACTCAACTATCATGGACA
                TTCAGGTCCCGACACGAGCCCCAGATGCAGTNTACACAGAACTCCAGCCCACCTTTNCAA
                CCCCAACCTGGCCTGCTGATGAAACACCACAACCCCAGACCCAGACCCAGNAACTGGAAG
                GAACGGTATGGGCCTCTAGT

Entry Created:  Apr 18 1997
Last Updated:   Apr 18 1997

COMMENTS
                For clone availability, additional sequence and expression
                information related to this EST, please check the TIGR Human
                Gene Index (http://www.tigr.org/tdb/hgi/hgi.html)

LIBRARY
Lib Name:       Aorta endothelial cells, TNF alpha-treated
Organism:       Homo sapiens
Organ:          aorta
Cell type:      endothelial cell
Develop. stage: adult
Vector:         pBluescript SK-
R. Site 1:      EcoRI
R. Site 2:      XhoI

SUBMITTER
Name:           Kerlavage, AR
Lab:            Bioinformatics
Institution:    The Institute for Genomic Research
Address:        9712 Medical Center Drive, Rockville, MD 20850 USA
Tel:            3018699056
Fax:            3018699423
E-mail:         arkerlav@tigr.org

CITATIONS
Medline UID:    96026280
Title:          Initial assessment of human gene diversity and expression
                patterns based upon 83 million nucleotides of cDNA sequence
Authors:        Adams,M.D., Kerlavage,A.R., Fleischmann,R.D., Fuldner,R.A.,
                Bult,C.J., Lee,N.H., Kirkness,E.F., Weinstock,K.G., Gocayne
                ,J.D., White,O., Sutton,G., Blake,J.A., Brandon,R.C.,
                Man-Wai,C., Clayton,R.A., Cline,T.R., Cotton,M.D.,
                Earle-Hughes,J., Fine,L.D., Fitzgerald,L.M., Fitzhugh,W.M.,
                Fritchman,J.L., Geoghagen,N.S., Glodek,A., Gnehm,C.L., Hanna
                ,M.C., Hedblom,E., Hinkle,P.S.Jr., Kelley,J.M., Kelley,J.C.,
                Liu,L.-I., Marmaros,S.M., Merrick,J.M., Moreno-Palanques
                ,R.F., McDonald,L.A., Nguyen,D.T., Pelligrino,S.M., Phillips
                ,C.A., Ryder,S.E., Scott,J.L., Saudek,D.M., Shirley,R.,
                Small,K.V., Spriggs,T.A., Utterback,T.R., Weidman,J.F., Li
                ,Y., Bednarik,D.P., Cao,L., Cepeda,M.A., Coleman,T.A.,
                Collins,E.J., Dimke,D., Feng,D.-F., Ferrie,A., Fischer,C.,
                Hastings,G.A., He,W.W., Hu,J.S., Greene,J.M., Gruber,J.,
                Hudson,P., Kim,A.K., Kozak,D.L., Kunsch,C., Hungjun,J., Li
                ,H., Meissner,P.S., Olsen,H., Raymond,L., Wei,Y.F., Wing,J.,
                Xu,C., Yu,G.L., Ruben,S.M., Dillion,P.J., Fannon,M.R., Rosen
                ,C.A., Haseltine,W.A., Fields,C., Fraser,C.M., Venter,J.C.
Citation:       Nature 377 (6547 Suppl): 3-174 1995


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U16752. Human cytokine SD...[gi:1272194] Links  


LOCUS       HSU16752                3541 bp    mRNA    linear   PRI 18-APR-1996
DEFINITION  Human cytokine SDF-1-beta mRNA, complete cds.
ACCESSION   U16752
VERSION     U16752.1  GI:1272194
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3541)
  AUTHORS   Spotila,L.D.
  TITLE     Novel sequences expressed by mineralizing human osteoblasts in
            culture
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3541)
  AUTHORS   Spotila,L.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (31-OCT-1994) Loretta D. Spotila, Biochemistry and
            Molecular Biology, Thomas Jefferson University, 233 South Tenth
            Street, Philadelphia, PA 19107, USA
COMMENT     On Apr 18, 1996 this sequence version replaced gi:571507.
FEATURES             Location/Qualifiers
     source          1..3541
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="sdf5-8.7"
                     /cell_type="fibroblast"
                     /clone_lib="lambda ZAP human fibroblast cDNA library"
                     /note="a shorter clone with an identical sequence was
                     purified from a human osteoblast cDNA library"
     CDS             81..362
                     /codon_start=1
                     /product="cytokine SDF-1-beta"
                     /protein_id="AAA97434.1"
                     /db_xref="GI:571508"
                     /translation="MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANV
                     KHLKILNTPNCALQIVARLKNNNRQVCIDPKLKWIQEYLEKALNKRFKM"
BASE COUNT      912 a    875 c    799 g    935 t     20 others
ORIGIN      
        1 cgcggccgca gccgcattgc ccgctcggcg tccggccccc gacccgcgct cgtccgcccg
       61 cccgcccgcc cgcccgcgcc atgaacgcca aggtcgtggt cgtgctggtc ctcgtgctga
      121 ccgcgctctg cctcagcgac gggaagcccg tcagcctgag ctacagatgc ccatgccgat
      181 tcttcgaaag ccatgttgcc agagccaacg tcaagcatct caaaattctc aacactccaa
      241 actgtgccct tcagattgta gcccggctga agaacaacaa cagacaagtg tgcattgacc
      301 cgaagctaaa gtggattcag gagtacctgg agaaagcttt aaacaagagg ttcaagatgt
      361 gagagggtca gacgcctgag gaacccttac agtaggagtc cagctctgaa accagtgtta
      421 gggaagggcc tgccacagcc tcccctgcca gggcagggcc ccaggcattg ccaagggctt
      481 tgttttggac actttgccat attttcacca tttgattatg tagcaaaata catgacattt
      541 atttttcatt tagtttgatt attcagtgtc actggcgaca cgtagcagct tagactaagg
      601 ccattattgt acttgcctta ttagagtgtc tttccacgga gccactcctc tgactcaggg
      661 ctcctgggtt ttggattctc tgagctgtgc aggtggggag actgggctga gggagcctgg
      721 ccccatggtc agccctaggg tggagagcca ccaagaggga cgcctggggg tgtcaggacc
      781 agtcaacctg ggcaaagcct agtgaaggct tctctctgtg ggatgggatg gtggagggcc
      841 acatgggagg ttcaccccct tctccatcca catggtgagc cgggtctgcc tcttctggga
      901 gggcagcagg gctaccctga gctgaggcag cagtgtgagg ccagggcaga gtgagaccca
      961 gccctcatcc cgagcacctc cacatcctcc acgttctgct catcattctc tgtctcatcc
     1021 atcatcatgt gtgtccacga ctgtctccat ggccccgcaa aaggactctc aggaccaaag
     1081 ctttcatgta aactgtgcac caagcaggaa atgaaaatgt cttgtgttac ctgaaaacac
     1141 tgtgcacatc tgtgtcttgt ttggaatatt gtccattgtc caatcctatg tttttggtca
     1201 aagccagcgt cctcctctgt gaccaatgtc ttgatgcatg cactgttccc cctgtgcagc
     1261 cgctgagcga ggagatgctc cntgggccct ttgagtgcag tcctgatcag agccgtggtc
     1321 ctttggggtg aactaccttg gttcccccac tgatcacaaa aacatggtgg gtccatgggc
     1381 agagcccaag ggaattcggt gtgcaccagg gttgacccca gaggattgct gccccatcag
     1441 tgctccctca catgtcagta ccttcaaact agggccaagc ccagcactgc ttgaggaaaa
     1501 caagcattca caacttgttt tnggttttta aaacccagtc cacaaaataa ccaatcctgg
     1561 acatgaagat tctttcccaa ttcacatcta acctcatctt cttcaccatt tggcaatgcc
     1621 atcatctcct gccttcctcc tgggccctct ctgctctgcg tgtcacctgt gcttcgggcc
     1681 cttcccacag gacatttctc taagagaaca atgtgctatg tgaagagtaa gtcaacctgc
     1741 ctgacatttg gagtgttccc cttccactga gggcagtcga tagagctgta ttaagccact
     1801 taaaatgttt gtcactttgc caaggcaagc acttgtgggn nttgnttgtt ntcantcagt
     1861 cttncgaata ctttttcccc ttgataaaga ctccagttaa aanaaatttt aatgaagaaa
     1921 gtggaaacaa ggaagtcaaa gcaaggaaac tatgtaacat gtaggaagta ggaagtaaat
     1981 tatagtgatg taatcttgaa ttgtaactgt tcttgaattt aataatctgt agggtaatta
     2041 gtaacatgtg ttaagtattt tcataagtat ttcaaattgg agcttcatgg cagaaggcaa
     2101 acccatcanc aaaaattgtc ccttaaacaa aaattaaaat cctcaatcca gctatgttat
     2161 attgaaaaaa tagagcctga gggatcttta ctagttataa agatacagaa ctctttcnaa
     2221 accttttgaa attaacctct cactatacca gtataattga gttttcagtg gggcagtcat
     2281 tatccaggta atccaagata ttttaaaatc tgtcacgtag aacttggatg tacctgcccc
     2341 caatccatga accaagacca ttgaattctt ggttgaggaa acaaacatga ccctaaatct
     2401 tgactacagt caggaaagga atcatttcta tttctcctcc atgggagaaa atagataaga
     2461 gtagaaactg cagggnaaaa ttatttgnat aacaattcct ctactaacaa tcagctcctt
     2521 cctggagact gcccagctaa agcaatatgc atttaaatac agtcttccat ttgnaaggga
     2581 aaagtctctt gtaatccgaa tctctttttg gtttcgaact gctagtcaag tgcgtccacg
     2641 agctgtttac tagggatccc tcatctgtcc ctccgggacc tggtgctgcc tctacctgac
     2701 actcccttgg gctccctgta acctcttcag aggncctcgc tgccagctct gtntcaggac
     2761 ccagaggaag gggncagagg ctcgttgact ggctgtgtgt tgggattgag tctgtgccac
     2821 gtgtttgtgc tgtggtgtgt cccctctgtc caggcactga gataccagcg aggaggctcc
     2881 agagggcgct ctgcttgtta ttagagatta cctcctgaga aaaaaggttc cgcttggagc
     2941 agaggggctg aatagcagaa ggttgcacct cccccaacct tagatgttct aagtctttcc
     3001 attggatctc attggaccct tccatggtgt gatcgtctga ctggtgttat caccgtgggc
     3061 tccctgactg ggagttgatc gcctttccca ggtgctacac ccttttccag ctggatgaga
     3121 atttgagtgc tctgatccct ctacagagct tccctgactc attctgaagg agccccattc
     3181 ctgggaaata ttccctagaa acttccaaat cccctaagca gaccactgat aaaaccatgt
     3241 agaaaatttg ttattttgna acctcgctgg actctcagtc tctgagcagt gaatgattca
     3301 gtgttaaatg tgatgaatac tgtattttgt attgtttcaa ttgcatctcc cagataatgt
     3361 gaaaatggtc caggagaagg ncaattccta tacgcagngt gctttaaaaa ataaataaga
     3421 aacaactctt tgagaaacaa caatttctac tttgaagtca taccaatgaa aaaatgtata
     3481 tgcacttata attttcctaa taaagttctg tactcaaatg taaaaaaaaa aaaaaaaaaa
     3541 a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AB021288. Homo sapiens mRNA...[gi:4038732] Links  


LOCUS       AB021288                 925 bp    mRNA    linear   PRI 22-DEC-1998
DEFINITION  Homo sapiens mRNA for beta 2-microglobulin, complete cds.
ACCESSION   AB021288
VERSION     AB021288.1  GI:4038732
KEYWORDS    beta 2-microglobulin.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 925)
  AUTHORS   Matsumoto,K. and Minamitani,T.
  TITLE     Human mRNA for beta 2-microglobulin
  JOURNAL   Published Only in DataBase (1998)
REFERENCE   2  (bases 1 to 925)
  AUTHORS   Matsumoto,K. and Minamitani,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-DEC-1998) Ken-ichi Matsumoto, Graduate School of
            Pharmaceutical Sciences, Hokkaido University, Department of
            Molecular Biology; Kita 12, Nishi 6, Kita-ku, Sapporo, Hokkaido
            060-0812, Japan (E-mail:kematsum@pharm.hokudai.ac.jp,
            Tel:81-11-706-3731(ex.3731), Fax:81-11-706-4988)
FEATURES             Location/Qualifiers
     source          1..925
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="HeLa"
                     /clone_lib="HeLa cDNA library"
     CDS             14..373
                     /codon_start=1
                     /product="beta 2-microglobulin"
                     /protein_id="BAA35182.1"
                     /db_xref="GI:4038733"
                     /translation="MSRSVALAVLALLSLSGLEAIQRTPKIQVYSRHPAENGKSNFLN
                     CYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKDWSFYLLYYTEFTPTEKDEYACRV
                     NHVTLSQPKIVKWDRDM"
     polyA_signal    908..913
BASE COUNT      261 a    168 c    194 g    302 t
ORIGIN      
        1 ggcacgagcc gagatgtctc gctccgtggc cttagctgtg ctcgcgctac tctctctttc
       61 tggcctggag gctatccagc gtactccaaa gattcaggtt tactcacgtc atccagcaga
      121 gaatggaaag tcaaatttcc tgaattgcta tgtgtctggg tttcatccat ccgacattga
      181 agttgactta ctgaagaatg gagagagaat tgaaaaagtg gagcattcag acttgtcttt
      241 cagcaaggac tggtctttct atctcttgta ctacactgaa ttcaccccca ctgaaaaaga
      301 tgagtatgcc tgccgtgtga accatgtgac tttgtcacag cccaagatag ttaagtggga
      361 tcgagacatg taagcagcat catggaggtt tgaagatgcc gcatttggat tggatgaatt
      421 ccaaattctg cttgcttgct ttttaatatt gatatgctta tacacttaca ctttatgcac
      481 aaaatgtagg gttataataa tgttaacatg gacatgatct tctttataat tctactttga
      541 gtgctgtctc catgtttgat gtatctgagc aggttgctcc acaggtagct ctaggagggc
      601 tggcaactta gaggtgggga gcagagaatt ctcttatcca acatcaacat cttggtcaga
      661 tttgaactct tcaatctctt gcactcaaag cttgttaaga tagttaagcg tgcataagtt
      721 aacttccaat ttacatactc tgcttagaat ttgggggaaa atttagaaat ataattgaca
      781 ggattattgg aaatttgtta taatgaatga aacattttgt catataagat tcatatttac
      841 ttcttataca tttgataaag taaggcatgg ttgtggttaa tctggtttat ttttgttcca
      901 caagttaaat aaatcataaa acttg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M74444. Human secretory l...[gi:338232] Links  


LOCUS       HUMSLPIEX1              1472 bp    DNA     linear   PRI 13-JAN-1995
DEFINITION  Human secretory leukoprotease inhibitor (SLPI) gene, 5' end.
ACCESSION   M74444
VERSION     M74444.1  GI:338232
KEYWORDS    secretory leukoprotease inhibitor protein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1472)
  AUTHORS   Abe,T., Kobayashi,N., Yoshimura,K., Trapnell,B.C., Kim,H.,
            Hubbard,R.C., Brewer,M.T., Thompson,R.C. and Crystal,R.G.
  TITLE     Expression of the secretory leukoprotease inhibitor gene in
            epithelial cells
  JOURNAL   J. Clin. Invest. 87 (6), 2207-2215 (1991)
  MEDLINE   91250579
   PUBMED   1674946
COMMENT     Original source text: Homo sapiens DNA.
FEATURES             Location/Qualifiers
     source          1..1472
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            69..1472
                     /gene="SLPI"
     repeat_unit     69..373
                     /gene="SLPI"
                     /rpt_family="Alu repetitive sequence"
     CAAT_signal     1286..1290
                     /gene="SLPI"
     TATA_signal     1347..1352
                     /gene="SLPI"
     mRNA            1374..>1472
                     /gene="SLPI"
                     /product="secretory leukoprotease inhibitor protein"
                     /note="1 of two possible transcripts"
     mRNA            1375..>1472
                     /gene="SLPI"
                     /product="secretory leukoprotease inhibitor protein"
                     /note="1 of two possible transcripts"
     CDS             1396..>1472
                     /gene="SLPI"
                     /codon_start=1
                     /product="secretory leukoprotease inhibitor protein"
                     /protein_id="AAA60559.1"
                     /db_xref="GI:338233"
                     /translation="MKSSGLFPFLVLLALGTLAPWAVEGS"
BASE COUNT      361 a    321 c    345 g    445 t
ORIGIN      
        1 gaattccaag catgaagata atgagtcaag agcttggagt ttgtagctag atgagctttg
       61 gttgaatttt attttatttt atttttttaa gacagggtat cgctctgtcc cccaagctgg
      121 aatgcagtgg cacaatcatg gctcactgca gcctcaaact cctgggctaa agcgatcctc
      181 ctggctcagc ctcccaagta gctgggacta caggcatacg tacgtcatca tgcctggctg
      241 attttttaca tttttttgta gagatggggt ctcaatatgt ggccagggct ggtctcaaac
      301 tcctactctc aaggaatcca tacacctcag cctcctgggc agctgagaca gcaagtgtgc
      361 gaccctacac tcagctatgg gctgaatttt agagataatg gtcgctctct ttataattag
      421 aagcaaccta tgcagactgg gtagcaaata gaatgggttt aattttttgc tgtcatgtga
      481 gatctgtaag ggattttggg gaattttagg aagcaatcct ctaagatctc aaattatctc
      541 acagctaaat gtagattaca gtgactgatg agctgctttc cccctttatc tcagattcat
      601 ttcaattctc tttagtggga agggatacta ttcatttgtt cttttcattc agagtccctt
      661 catgccctta atttcataac cctctgagaa gggctgactt gttagtatca tttcatttca
      721 cagctgagac aactgagctc cagagagatt tgtggagagc ggagctcttc ttcagctttc
      781 atttgtgagt gcttttcctg tgtcaggcac agaacaggca ctggggatat aacggtgtaa
      841 atatttcagg gaactaagta tcagttggtt gaacgagctg aacttttgag aaagaaactg
      901 cattgagtaa tcagcagagt ttcacaatgc ctgagagtcc agtaatgtga gaatcagaat
      961 tagcaatgtg agaatagaat gtattgcaca aagtctcagc agggagtctg tgtctggttt
     1021 tagttccagg tccgggtagc acctttgcaa ttgaccactt cttccctctc tccacctata
     1081 aggctaatgg cctgggatct tgtgatgttt agggctcaga tggacactga gatggcctct
     1141 ttaatcaacc aacttcccag gccaatctct tccctttctt ttctgatagt tgctgtgttg
     1201 gcctcatagc cttacctggc ataggaaaga taaacaatct ccttggtgtc aggatttctg
     1261 gtctctggct acgtttcctg cttatgcaat agtagctggg agaggccgaa agaattctgg
     1321 tggggccaca cccactggtg aaagaataaa tagtgaggtt tggcattggc catcagagtc
     1381 actcctgcct tcaccatgaa gtccagcggc ctcttcccct tcctggtgct gcttgccctg
     1441 ggaactctgg caccttgggc tgtggaaggc tc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X04470. Human mRNA for an...[gi:28638] Links  


LOCUS       HSALPR                   594 bp    mRNA    linear   PRI 21-MAR-1995
DEFINITION  Human mRNA for antileukoprotease (ALP) from cervix uterus.
ACCESSION   X04470
VERSION     X04470.1  GI:28638
KEYWORDS    antileukoprotease; elastase inhibitor; protease; signal peptide.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 594)
  AUTHORS   Heinzel,R., Appelhans,H., Gassen,G., Seemuller,U., Machleidt,W.,
            Fritz,H. and Steffens,G.
  TITLE     Molecular cloning and expression of cDNA for human
            antileukoprotease from cervix uterus
  JOURNAL   Eur. J. Biochem. 160 (1), 61-67 (1986)
  MEDLINE   87030258
COMMENT     Data kindly reviewed (05-DEC-1986) by H. Appelhans.
FEATURES             Location/Qualifiers
     source          1..594
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             19..417
                     /note="precursor ALP"
                     /codon_start=1
                     /protein_id="CAA28158.1"
                     /db_xref="GI:28639"
                     /db_xref="SWISS-PROT:P03973"
                     /translation="MKSSGLFPFLVLLALGTLAPWAVEGSGKSFKAGVCPPKKSAQCL
                     RYKKPECQSDWQCPGKKRCCPDTCGIKCLDPVDTPNPTRRKPGKCPVTYGQCLMLNPP
                     NFCEMDGQCKRDLKCCMGMCGKSCVSPVKA"
     sig_peptide     19..93
                     /note="put. signal peptide (aa -25 to -1)"
     mat_peptide     94..414
                     /product="put. mature peptide (aa 1-107)"
     misc_feature    564..569
                     /note="pot. polyA signal"
     misc_feature    568..573
                     /note="pot. polyA signal"
BASE COUNT      132 a    156 c    155 g    151 t
ORIGIN      
        1 gtcactcctg ccttcaccat gaagtccagc ggcctcttcc ccttcctggt gctgcttgcc
       61 ctgggaactc tggcaccttg ggctgtggaa ggctctggaa agtccttcaa agctggagtc
      121 tgtcctccta agaaatctgc ccagtgcctt agatacaaga aacctgagtg ccagagtgac
      181 tggcagtgtc cagggaagaa gagatgttgt cctgacactt gtggcatcaa atgcctggat
      241 cctgttgaca ccccaaaccc aacaaggagg aagcctggga agtgcccagt gacttatggc
      301 caatgtttga tgcttaaccc ccccaatttc tgtgagatgg atggccagtg caagcgtgac
      361 ttgaagtgtt gcatgggcat gtgtgggaaa tcctgcgttt cccctgtgaa agcttgattc
      421 ctgccatatg gaggaggctc tggagtcctg ctctgtgtgg tccaggtcct ttccaccctg
      481 agacttggct ccaccactga tatcctcctt tggggaaagg cttggcacac agcaggcttt
      541 caagaagtgc cagttgatca atgaataaat aaacgagcct atttctcttt gcac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AB008109. Homo sapiens mRNA...[gi:2554613] Links  


LOCUS       AB008109                2076 bp    mRNA    linear   PRI 20-FEB-1999
DEFINITION  Homo sapiens mRNA for RGS5, complete cds.
ACCESSION   AB008109
VERSION     AB008109.1  GI:2554613
KEYWORDS    RGS5.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Seki,N., Sugano,S., Suzuki,Y., Nakagawara,A., Ohira,M.,
            Muramatsu,M., Saito,T. and Hori,T.
  TITLE     Isolation, tissue expression, and chromosomal assignment of human
            RGS5, a novel G-protein signaling regulator gene
  JOURNAL   J. Hum. Genet. 43 (3), 202-205 (1998)
  MEDLINE   98419174
REFERENCE   2  (bases 1 to 2076)
  AUTHORS   Seki,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-OCT-1997) Naohiko Seki, Kazusa DNA Research
            Institute, Gene Structure I; 1532-3 Yana, Kisarazu, Chiba 292,
            Japan (E-mail:nseki@kazusa.or.jp, Tel:+81-438-52-3932,
            Fax:+81-438-52-3931)
FEATURES             Location/Qualifiers
     source          1..2076
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="1"
                     /map="1q23"
                     /clone="nb-20"
                     /tissue_type="neuroblastoma"
     CDS             82..627
                     /function="G protein signaling regulator"
                     /codon_start=1
                     /product="RGS5"
                     /protein_id="BAA22889.1"
                     /db_xref="GI:2554614"
                     /translation="MCKGLAALPHSCLERAKEIKIKLGILLQKPDSVGDLVIPYNEKP
                     EKPAKTQKTSLDEALQWRDSLDKLLQNNYGLASFKSFLKSEFSEENLEFWIACEDYKK
                     IKSPAKMAEKAKQIYEEFIQTEAPKEVNIDHFTKDITMKNLVEPSLSSFDMAQKRIHA
                     LMEKDSLPRFVRSEFYQELIK"
BASE COUNT      674 a    399 c    371 g    632 t
ORIGIN      
        1 agacagtttt gaagttttca aagactggct ctgctgttaa gaagttgtac ttaaagcgga
       61 ggagctaagc cacctgccaa aatgtgcaaa ggacttgcag ctttgcccca ctcatgcctg
      121 gaaagggcca aggagattaa gatcaagttg ggaattctcc tccagaagcc agactcagtt
      181 ggtgaccttg tcattccgta caatgagaag ccagagaaac cagccaagac ccagaaaacc
      241 tcgctggacg aggccctgca gtggcgtgat tccctggaca aactcctgca gaacaactat
      301 ggacttgcca gtttcaaaag tttcctgaag tctgaattca gtgaggaaaa ccttgagttc
      361 tggattgcct gtgaggatta caagaagatc aagtcccctg ccaagatggc tgagaaggca
      421 aagcaaattt atgaagaatt cattcaaacg gaggctccta aagaggtgaa tattgaccac
      481 ttcactaagg acatcacaat gaagaacctg gtggaacctt ccctgagcag ctttgacatg
      541 gcccagaaaa gaatccatgc cctgatggaa aaggattctc tgcctcgctt tgtgcgctct
      601 gagttttatc aggagttaat caagtagtaa tttagccagg ctatgaaatc atcctgtgag
      661 ttatttcctc cataataacc ctgcatttcc cattaatcta catatcttcc cacagcagct
      721 ttgctcagtg atacccacat gggaaaaatc ccaggggatg ttgcttactc tttttgccca
      781 cactgctttg gatacttatc tactgtccga aggccttctt tccccactca attcttcctg
      841 ccctgttatt aattaagata tcttcagctt gtagtcagac ccaatcagaa tcacagaaaa
      901 atcctgccta aggcaaagaa atataagaca agactatgat atcaatgaat gtgggttaag
      961 taatagattt ccagctaaat tggtctaaaa aagaatatta agtgtggaca gacctatttc
     1021 aaaggagctt aattgatctc acttgtttta gttctgatcc agggagatca cccctctaat
     1081 tatttctgaa cttggttaat aaaagtttat aagattttta tgaagcagcc actgtatgat
     1141 attttaagca aatatgttat ttaaaatatt gatccttccc ttggaccacc ttcatgttag
     1201 ttgggtatta taaataagag atacaaccat gaatatatta tgtttataca aaatcaatct
     1261 gaacacaatt cataaagatt tctcttttat accttcctca ctggccccct ccacctgccc
     1321 atagtcacca aattctgttt taaatcaatg acctaagatc aacaatgaag tattttataa
     1381 atgtatttat gctgctagac tgtgggtcaa atgtttccat tttcaaatta tttagaattc
     1441 ttatgagttt aaaatttgta aatttctaaa tccaatcatg taaaatgaaa ctgttgctcc
     1501 attggagtag tctcccacct aaatatcaag atggctatat gctaaaaaga gaaaatatgg
     1561 tcaagtctaa aatggctaat tgtcctatga tgctattatc atagactaat gacatttatc
     1621 ttcaaaacac caaattgtct ttagaaaaat taatgtgatt acaggtagag gccttctagg
     1681 tgagacactt ttaaggtaca ctgcattttg cagaaaaaaa aaaaaaaaag taatctttta
     1741 gcaaccccag tattccttca ctatttcgct tcctgcatta gcaaatttta cttacagtca
     1801 aaagtgcaga tttatactcc tgacgtgtct cattcacagc taaataatag gccataggac
     1861 ttttggtagg tttaaacttt taattctgta tttcatgatt ataagtcttg ctagaatttt
     1921 ttctaatctt tagtagattt gattaaataa tgattcacag aatttagtaa cagaatcaaa
     1981 ctaagccatg tatgagggta atcgagatga ggatattaac tcaaaagaaa tagggtgatt
     2041 tttaaaggat taataaaatt ctgaaatgtt aagtag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M77349. Human transformin...[gi:339567] Links  


LOCUS       HUMTGFBIG               2691 bp    mRNA    linear   PRI 14-JAN-1995
DEFINITION  Human transforming growth factor-beta induced gene product (BIGH3)
            mRNA, complete cds.
ACCESSION   M77349
VERSION     M77349.1  GI:339567
KEYWORDS    transforming growth factor-beta induced protein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2691)
  AUTHORS   Skonier,J., Neubauer,M., Madisen,L., Bennett,K., Plowman,G.D. and
            Purchio,A.F.
  TITLE     cDNA cloning and sequence analysis of beta ig-h3, a novel gene
            induced in a human adenocarcinoma cell line after treatment with
            transforming growth factor-beta
  JOURNAL   DNA Cell Biol. 11 (7), 511-522 (1992)
  MEDLINE   93000472
   PUBMED   1388724
COMMENT     Original source text: Homo sapiens cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..2691
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="A549"
                     /cell_type="adenocarcinoma"
     gene            1..2691
                     /gene="BIGH3"
     CDS             48..2099
                     /gene="BIGH3"
                     /note="putative"
                     /codon_start=1
                     /product="transforming growth factor induced protein"
                     /protein_id="AAA61163.1"
                     /db_xref="GI:339568"
                     /translation="MALFVRLLALALALALGPAATLAGPAKSPYQLVLQHSRLRGRQH
                     GPNVCAVQKVIGTNRKYFTNCKQWYQRKICGKSTVISYECCPGYEKVPGEKGCPAALP
                     LSNLYETLGVVGSTTTQLYTDRTEKLRPEMEGPGSFTIFAPSNEAWASLPAEVLDSLV
                     SNVNIELLNALRYHMVGRRVLTDELKHGMTLTSMYQNSNIQIHHYPNGIVTVNCARLL
                     KADHHATNGVVHLIDKVISTITNNIQQIIEIEDTFETLRAAVAASGLNTMLEGNGQYT
                     LLAPTNEAFEKIPSETLNRILGDPEALRDLLNNHILKSAMCAEAIVAGLSVETLEGTT
                     LEVGCSGDMLTINGKAIISNKDILATNGVIHYIDELLIPDSAKTLFELAAESDVSTAI
                     DLFRQAGLGNHLSGSERLTLLAPLNSVFKDGTPPIDAHTRNLLRNHIIKDQLASKYLY
                     HGQTLETLGGKKLRVFVYRNSLCIENSCIAAHDKRGRYGTLFTMDRVLTPPMGTVMDV
                     LKGDNRFSMLVAAIQSAGLTETLNREGVYTVFAPTNEAFRALPPRERSRLLGDAKELA
                     NILKYHIGDEILVSGGIGALVRLKSLQGDKLEVSLKNNVVSVNKEPVAEPDIMATNGV
                     VHVITNVLQPPANRPQERGDELADSALEIFKQASAFSRASQRSVRLAPVYQKLLERMK
                     H"
     sig_peptide     48..92
                     /gene="BIGH3"
                     /note="putative"
     mat_peptide     93..2096
                     /gene="BIGH3"
                     /product="transforming growth factor-beta induced protein"
BASE COUNT      679 a    729 c    695 g    588 t
ORIGIN      
        1 gcttgcccgt cggtcgctag ctcgctcggt gcgcgtcgtc ccgctccatg gcgctcttcg
       61 tgcggctgct ggctctcgcc ctggctctgg ccctgggccc cgccgcgacc ctggcgggtc
      121 ccgccaagtc gccctaccag ctggtgctgc agcacagcag gctccggggc cgccagcacg
      181 gccccaacgt gtgtgctgtg cagaaggtta ttggcactaa taggaagtac ttcaccaact
      241 gcaagcagtg gtaccaaagg aaaatctgtg gcaaatcaac agtcatcagc tacgagtgct
      301 gtcctggata tgaaaaggtc cctggggaga agggctgtcc agcagcccta ccactctcaa
      361 acctttacga gaccctggga gtcgttggat ccaccaccac tcagctgtac acggaccgca
      421 cggagaagct gaggcctgag atggaggggc ccggcagctt caccatcttc gcccctagca
      481 acgaggcctg ggcctccttg ccagctgaag tgctggactc cctggtcagc aatgtcaaca
      541 ttgagctgct caatgccctc cgctaccata tggtgggcag gcgagtcctg actgatgagc
      601 tgaaacacgg catgaccctc acctctatgt accagaattc caacatccag atccaccact
      661 atcctaatgg gattgtaact gtgaactgtg cccggctcct gaaagccgac caccatgcaa
      721 ccaacggggt ggtgcacctc atcgataagg tcatctccac catcaccaac aacatccagc
      781 agatcattga gatcgaggac acctttgaga cccttcgggc tgctgtggct gcatcagggc
      841 tcaacacgat gcttgaaggt aacggccagt acacgctttt ggccccgacc aatgaggcct
      901 tcgagaagat ccctagtgag actttgaacc gtatcctggg cgacccagaa gccctgagag
      961 acctgctgaa caaccacatc ttgaagtcag ctatgtgtgc tgaagccatc gttgcggggc
     1021 tgtctgtaga gaccctggag ggcacgacac tggaggtggg ctgcagcggg gacatgctca
     1081 ctatcaacgg gaaggcgatc atctccaata aagacatcct agccaccaac ggggtgatcc
     1141 actacattga tgagctactc atcccagact cagccaagac actatttgaa ttggctgcag
     1201 agtctgatgt gtccacagcc attgaccttt tcagacaagc cggcctcggc aatcatctct
     1261 ctggaagtga gcggttgacc ctcctggctc ccctgaattc tgtattcaaa gatggaaccc
     1321 ctccaattga tgcccataca aggaatttgc ttcggaacca cataattaaa gaccagctgg
     1381 cctctaagta tctgtaccat ggacagaccc tggaaactct gggcggcaaa aaactgagag
     1441 tttttgttta tcgtaatagc ctctgcattg agaacagctg catcgcggcc cacgacaaga
     1501 gggggaggta cgggaccctg ttcacgatgg accgggtgct gaccccccca atggggactg
     1561 tcatggatgt cctgaaggga gacaatcgct ttagcatgct ggtagctgcc atccagtctg
     1621 caggactgac ggagaccctc aaccgggaag gagtctacac agtctttgct cccacaaatg
     1681 aagccttccg agccctgcca ccaagagaac ggagcagact cttgggagat gccaaggaac
     1741 ttgccaacat cctgaaatac cacattggtg atgaaatcct ggttagcgga ggcatcgggg
     1801 ccctggtgcg gctaaagtct ctccaaggtg acaagctgga agtcagcttg aaaaacaatg
     1861 tggtgagtgt caacaaggag cctgttgccg agcctgacat catggccaca aatggcgtgg
     1921 tccatgtcat caccaatgtt ctgcagcctc cagccaacag acctcaggaa agaggggatg
     1981 aacttgcaga ctctgcgctt gagatcttca aacaagcatc agcgttttcc agggcttccc
     2041 agaggtctgt gcgactagcc cctgtctatc aaaagttatt agagaggatg aagcattagc
     2101 ttgaagcact acaggaggaa tgcaccacgg cagctctccg ccaatttctc tcagatttcc
     2161 acagagactg tttgaatgtt ttcaaaacca agtatcacac tttaatgtac atgggccgca
     2221 ccataatgag atgtgagcct tgtgcatgtg ggggaggagg gagagagatg tactttttaa
     2281 atcatgttcc ccctaaacat ggctgttaac ccactgcatg cagaaacttg gatgtcactg
     2341 cctgacattc acttccagag aggacctatc ccaaatgtgg aattgactgc ctatgccaag
     2401 tccctggaaa aggagcttca gtattgtggg gctcataaaa catgaatcaa gcaatccagc
     2461 ctcatgggaa gtcctggcac agtttttgta aagcccttgc acagctggag aaatggcatc
     2521 attataagct atgagttgaa atgttctgtc aaatgtgtct cacatctaca cgtggcttgg
     2581 aggcttttat ggggccctgt ccaggtagaa aagaaatggt atgtagagct tagatttccc
     2641 tattgtgaca gagccatggt gtgtttgtaa taataaaacc aaagaaacat a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L34083. Human (clone 13) ...[gi:619809] Links  


LOCUS       HUMMHDQAAB               681 bp    mRNA    linear   PRI 14-MAR-1996
DEFINITION  Human (clone 13) MHC class II HLA-DQA1*01021 mRNA, partial cds.
ACCESSION   L34083
VERSION     L34083.1  GI:619809
KEYWORDS    cell surface glycoprotein; class II gene; integral membrane
            protein; major histocompatibility complex.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 681)
  AUTHORS   Yasunaga,S., Kimura,A., Hamaguchi,K., Ronningen,K.S. and
            Sasazuki,T.
  TITLE     Different contribution of HLA-DR and -DQ genes in susceptibility
            and resistance to insulin-dependent diabetes mellitus (IDDM)
  JOURNAL   Tissue Antigens 47 (1), 37-48 (1996)
  MEDLINE   97083137
   PUBMED   8929711
COMMENT     Original source text: Homo sapiens cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..681
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="6p21.3"
                     /cell_line="EMJ"
                     /cell_type="B lymphoblastoid"
     gene            1..681
                     /gene="HLA-DQA1"
     CDS             <1..>681
                     /gene="HLA-DQA1"
                     /function="immune regulation"
                     /note="allele DQA1*01021; putative"
                     /codon_start=1
                     /product="MHC class II HLA-DQ-alpha chain"
                     /protein_id="AAC41951.1"
                     /db_xref="GI:619810"
                     /db_xref="GDB:G00-120-638"
                     /translation="EDIVADHVASCGVNLYQFYGPSGQYTHEFDGDEQFYVDLERKET
                     AWRWPEFSKFGGFDPQGALRNMAVAKHNLNIMIKRYNSTAATNEVPEVTVFSKSPVTL
                     GQPNTLICLVDNIFPPVVNITWLSNGQSVTEGVSETSFLSKSDHSFFKISYLTFLPSA
                     DEIYDCKVEHWGLDQPLLKHWEPEIPAPMSELTETVVCALGLSVGLMGIVVGTVFIIQ
                     GLRSVGASR"
     exon            <1..13
                     /gene="HLA-DQA1"
                     /note="allele DQA1*01021; G00-120-638"
                     /number=1
     exon            14..262
                     /gene="HLA-DQA1"
                     /note="allele DQA1*01021; G00-120-638"
                     /number=2
     exon            263..544
                     /gene="HLA-DQA1"
                     /note="allele DQA1*01021; G00-120-638"
                     /number=3
     exon            545..>681
                     /gene="HLA-DQA1"
                     /note="allele DQA1*01021; G00-120-638"
                     /number=4
BASE COUNT      148 a    176 c    177 g    180 t
ORIGIN      
        1 gaagacattg tggctgacca cgttgcctct tgtggtgtaa acttgtacca gttttacggt
       61 ccctctggcc agtacaccca tgaatttgat ggagatgagc agttctacgt ggacctggag
      121 aggaaggaga ctgcctggcg gtggcctgag ttcagcaaat ttggaggttt tgacccgcag
      181 ggtgcactga gaaacatggc tgtggcaaaa cacaacttga acatcatgat taaacgctac
      241 aactctaccg ctgctaccaa tgaggttcct gaggtcacag tgttttccaa gtctcccgtg
      301 acactgggtc agcccaacac cctcatttgt cttgtggaca acatctttcc tcctgtggtc
      361 aacatcacat ggctgagcaa tgggcagtca gtcacagaag gtgtttctga gaccagcttc
      421 ctctccaaga gtgatcattc cttcttcaag atcagttacc tcaccttcct cccttctgct
      481 gatgagattt atgactgcaa ggtggagcac tggggcctgg accagcctct tctgaaacac
      541 tgggagcctg agattccagc ccctatgtca gagctcacag agactgtggt ctgtgccctg
      601 gggttgtctg tgggcctcat gggcattgtg gtgggcactg tcttcatcat ccaaggcctg
      661 cgttcagttg gtgcttccag a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L46875. Homo sapiens HLA-...[gi:1448998] Links  


LOCUS       HUMHLADJ                  82 bp    mRNA    linear   PRI 24-JUL-1996
DEFINITION  Homo sapiens HLA-DQA1 gene (DQA1*0101, DQA1*01021, DQA1*01022 and
            DQA1*0103), exon 1.
ACCESSION   L46875
VERSION     L46875.1  GI:1448998
KEYWORDS    cell surface glycoprotein; class II gene; integral membrane
            protein; major histocompatibility complex.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 82)
  AUTHORS   Yasunaga,S., Kimura,A. and Sasazuki,T.
  TITLE     Sequence polymorphisms in HLA-DQA1 exon1
  JOURNAL   Unpublished (1995)
COMMENT     Original source text: Homo sapiens blood cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..82
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="6p21.3"
                     /haplotype="DQB1*0501-DQA1*0101, DQB1*0604-DQA1*01021,
                     DQB1*0502-DQA1*01022 and DQB1*06011-DQA1*0103"
                     /cell_line="KAS116(9003), EMJ(9097), KAS011(9009) and
                     E4181324(9011)"
                     /cell_type="B-lymphocyte (B-lymphoblastoid cell)"
                     /tissue_type="blood"
     gene            1..82
                     /gene="HLA-DQA1"
     exon            1..82
                     /gene="HLA-DQA1"
                     /note="G00-120-638"
                     /number=1
                     /function="immune regulation"
BASE COUNT       17 a     22 c     25 g     18 t
ORIGIN      
        1 atgatcctaa acaaagctct gctgctgggg gccctcgctc tgaccaccgt gatgagcccc
       61 tgtggaggtg aagacattgt gg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M20431. Human MHC class I...[gi:188136] Links  


LOCUS       HUMMHDQAR               1030 bp    mRNA    linear   PRI 07-JAN-1995
DEFINITION  Human MHC class II HLA-DQ-alpha (DR2-DQw1/DR4 DQw3) mRNA, partial
            cds, clone ROF1.1-alpha.
ACCESSION   M20431
VERSION     M20431.1  GI:188136
KEYWORDS    cell surface glycoprotein; class II gene; integral membrane
            protein; major histocompatibility complex.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1030)
  AUTHORS   Lock,C.B., So,A.K., Welsh,K.I., Parkes,J.D. and Trowsdale,J.
  TITLE     MHC class II sequences of an HLA-DR2 narcoleptic
  JOURNAL   Immunogenetics 27 (6), 449-455 (1988)
  MEDLINE   88226788
   PUBMED   3259543
COMMENT     Original source text: Human (haplotype DR2-DQw1/DR4 DQw3)
            B-lymphoblastoid cell line, cDNA to mRNA, clone ROF1.1-alpha, from
            a 58 year old male (ROF) with narcolepsy.
FEATURES             Location/Qualifiers
     source          1..1030
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="6p21.3"
     gene            1..1030
                     /gene="HLA-DQA1"
     mRNA            <1..1030
                     /gene="HLA-DQA1"
                     /note="MHC HLA-DQA mRNA; G00-120-638"
     CDS             <1..702
                     /gene="HLA-DQA1"
                     /note="MHC HLA-DQ-alpha chain precursor old gene name
                     'HLA-DQA'"
                     /codon_start=1
                     /protein_id="AAA59758.1"
                     /db_xref="GI:386924"
                     /db_xref="GDB:G00-120-638"
                     /translation="GEDIVADHVASCGVNLYQFYGPSGQYTHEFDGDEQFYVDLERKE
                     TAWRWPEFSKFGGFDPQGALRNMAVAKHNLNIMIKRYNSTAATNEVPEVTVFSKSPVT
                     LGQPNTLICLVDNIFPPVVNITWLSNGQSVTEGVSETSFLSKSDHSFFKISYLTFLPS
                     ADEIYDCKVEHWGLDQPLLKHWEPEIPAPMSELTETVVCALGLSVGLMGIVVGTVFII
                     QGLRSVGASRHQGPL"
     sig_peptide     <1..3
                     /gene="HLA-DQA1"
                     /note="MHC HLA-DQA-alpha chain signal peptide;
                     G00-120-638"
     mat_peptide     4..699
                     /gene="HLA-DQA1"
                     /product="MHC HLA-DQA-alpha chain; G00-120-638"
BASE COUNT      235 a    275 c    231 g    289 t
ORIGIN      Chromosome 6p21.3.
        1 ggtgaagaca ttgtggctga ccacgttgcc tcttgtggtg taaacttgta ccagttttac
       61 ggtccctctg gccagtacac ccatgaattt gatggagatg agcagttcta cgtggacctg
      121 gagaggaagg agactgcctg gcggtggcct gagttcagca aatttggagg ttttgacccg
      181 cagggtgcac tgagaaacat ggctgtggca aaacacaact tgaacatcat gattaaacgc
      241 tacaactcta ccgctgctac caatgaggtt cctgaggtca cagtgttttc caagtctccc
      301 gtgacactgg gtcagcccaa caccctcatt tgtcttgtgg acaacatctt tcctcctgtg
      361 gtcaacatca catggctgag caatgggcag tcagtcacag aaggtgtttc tgagaccagc
      421 ttcctctcca agagtgatca ttccttcttc aagatcagtt acctcacctt cctcccttct
      481 gctgatgaga tttatgactg caaggtggag cactggggcc tggaccagcc tcttctgaaa
      541 cactgggagc ctgagattcc agcccctatg tcagagctca cagagactgt ggtctgtgcc
      601 ctggggttgt ctgtgggcct catgggcatt gtggtgggca ctgtcttcat catccaaggc
      661 ctgcgttcag ttggtgcttc cagacaccaa gggccattgt gaatcccatc ctggaaggga
      721 aggtgcatcg ccatctacag gagcagaaga atggacttgc taaatgacct agcactattc
      781 tctggcccga tttatcatat cccttttctc ctccaaatat ttctcctctc accttttctc
      841 tgggacttaa gctgctatat cccctcagag ctcacaaatg cctttacatt ctttccctga
      901 cctcctgatt ttttttttct tttctcaaat gttacctaca atacatgcct ggggtaagcc
      961 acccggctac ctaattcctc agtaacctcc atctaaaatc tccaaggaag caataaattc
     1021 cttttatgag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X87838. H.sapiens mRNA fo...[gi:1154853] Links  


LOCUS       HSRNABECA               3362 bp    mRNA    linear   PRI 20-DEC-1999
DEFINITION  H.sapiens mRNA for beta-catenin.
ACCESSION   X87838
VERSION     X87838.1  GI:1154853
KEYWORDS    beta-catenin.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 17 to 2596)
  AUTHORS   Hulsken,J., Birchmeier,W. and Behrens,J.
  TITLE     E-cadherin and APC compete for the interaction with beta-catenin
            and the cytoskeleton
  JOURNAL   J. Cell Biol. 127 (6 Pt 2), 2061-2069 (1994)
  MEDLINE   95105247
REFERENCE   2
  AUTHORS   Nollet,F., Berx,G., Molemans,F. and van Roy,F.
  TITLE     Genomic organization of the human beta-catenin gene (CTNNB1)
  JOURNAL   Genomics 32 (3), 413-424 (1996)
  MEDLINE   96435919
REFERENCE   3
  AUTHORS   Nollet,F.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-JUN-1995) F.H. Nollet, Laboratory of Molecular
            Biology, Section Molecular Cell Biology, K.L. Ledeganckstraat 35,
            B-9000 Ghent, Belgium
  REMARK    Revised by [4]
REFERENCE   4  (bases 1 to 3362)
  AUTHORS   Nollet,F.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JAN-1996) F.H. Nollet, Laboratory of Molecular
            Biology, Section Molecular Cell Biology, K.L. Ledeganckstraat 35,
            B-9000 Ghent, Belgium
COMMENT     On Jan 13, 1996 this sequence version replaced gi:860987.
FEATURES             Location/Qualifiers
     source          1..3362
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="3"
                     /map="p21"
                     /cell_line="5637"
     CDS             215..2560
                     /codon_start=1
                     /product="beta-catenin"
                     /protein_id="CAA61107.1"
                     /db_xref="GI:860988"
                     /db_xref="SWISS-PROT:P35222"
                     /translation="MATQADLMELDMAMEPDRKAAVSHWQQQSYLDSGIHSGATTTAP
                     SLSGKGNPEEEDVDTSQVLYEWEQGFSQSFTQEQVADIDGQYAMTRAQRVRAAMFPET
                     LDEGMQIPSTQFDAAHPTNVQRLAEPSQMLKHAVVNLINYQDDAELATRAIPELTKLL
                     NDEDQVVVNKAAVMVHQLSKKEASRHAIMRSPQMVSAIVRTMQNTNDVETARCTAGTL
                     HNLSHHREGLLAIFKSGGIPALVKMLGSPVDSVLFYAITTLHNLLLHQEGAKMAVRLA
                     GGLQKMVALLNKTNVKFLAITTDCLQILAYGNQESKLIILASGGPQALVNIMRTYTYE
                     KLLWTTSRVLKVLSVCSSNKPAIVEAGGMQALGLHLTDPSQRLVQNCLWTLRNLSDAA
                     TKQEGMEGLLGTLVQLLGSDDINVVTCAAGILSNLTCNNYKNKMMVCQVGGIEALVRT
                     VLRAGDREDITEPAICALRHLTSRHQEAEMAQNAVRLHYGLPVVVKLLHPPSHWPLIK
                     ATVGLIRNLALCPANHAPLREQGAIPRLVQLLVRAHQDTQRRTSMGGTQQQFVEGVRM
                     EEIVEGCTGALHILARDVHNRIVIRGLNTIPLFVQLLYSPIENIQRVAAGVLCELAQD
                     KEAAEAIEAEGATAPLTELLHSRNEGVATYAAAVLFRMSEDKPQDYKKRLSVELTSSL
                     FRTEPMAWNETADLGLDIGAQGEPLGYRQDDPSYRSFHSGGYGQDALGMDPMMEHEMG
                     GHHPGADYPVDGLPDLGHAQDLMDGLPPGDSNQLAWFDTDL"
     misc_feature    2573..2731
                     /note="alternatively spliced fragment"
BASE COUNT      880 a    707 c    822 g    953 t
ORIGIN      
        1 aagcctctcg gtctgtggca gcagcgttgg cccggccccg ggagcggaga gcgaggggag
       61 gcggagacgg aggaaggtct gaggagcagc ttcagtcccc gccgagccgc caccgcaggt
      121 cgaggacggt cggactcccg cggcgggagg agcctgttcc cctgagggta tttgaagtat
      181 accatacaac tgttttgaaa atccagcgtg gacaatggct actcaagctg atttgatgga
      241 gttggacatg gccatggaac cagacagaaa agcggctgtt agtcactggc agcaacagtc
      301 ttacctggac tctggaatcc attctggtgc cactaccaca gctccttctc tgagtggtaa
      361 aggcaatcct gaggaagagg atgtggatac ctcccaagtc ctgtatgagt gggaacaggg
      421 attttctcag tccttcactc aagaacaagt agctgatatt gatggacagt atgcaatgac
      481 tcgagctcag agggtacgag ctgctatgtt ccctgagaca ttagatgagg gcatgcagat
      541 cccatctaca cagtttgatg ctgctcatcc cactaatgtc cagcgtttgg ctgaaccatc
      601 acagatgctg aaacatgcag ttgtaaactt gattaactat caagatgatg cagaacttgc
      661 cacacgtgca atccctgaac tgacaaaact gctaaatgac gaggaccagg tggtggttaa
      721 taaggctgca gttatggtcc atcagctttc taaaaaggaa gcttccagac acgctatcat
      781 gcgttctcct cagatggtgt ctgctattgt acgtaccatg cagaatacaa atgatgtaga
      841 aacagctcgt tgtaccgctg ggaccttgca taacctttcc catcatcgtg agggcttact
      901 ggccatcttt aagtctggag gcattcctgc cctggtgaaa atgcttggtt caccagtgga
      961 ttctgtgttg ttttatgcca ttacaactct ccacaacctt ttattacatc aagaaggagc
     1021 taaaatggca gtgcgtttag ctggtgggct gcagaaaatg gttgccttgc tcaacaaaac
     1081 aaatgttaaa ttcttggcta ttacgacaga ctgccttcaa attttagctt atggcaacca
     1141 agaaagcaag ctcatcatac tggctagtgg tggaccccaa gctttagtaa atataatgag
     1201 gacctatact tacgaaaaac tactgtggac cacaagcaga gtgctgaagg tgctatctgt
     1261 ctgctctagt aataagccgg ctattgtaga agctggtgga atgcaagctt taggacttca
     1321 cctgacagat ccaagtcaac gtcttgttca gaactgtctt tggactctca ggaatctttc
     1381 agatgctgca actaaacagg aagggatgga aggtctcctt gggactcttg ttcagcttct
     1441 gggttcagat gatataaatg tggtcacctg tgcagctgga attctttcta acctcacttg
     1501 caataattat aagaacaaga tgatggtctg ccaagtgggt ggtatagagg ctcttgtgcg
     1561 tactgtcctt cgggctggtg acagggaaga catcactgag cctgccatct gtgctcttcg
     1621 tcatctgacc agccgacacc aagaagcaga gatggcccag aatgcagttc gccttcacta
     1681 tggactacca gttgtggtta agctcttaca cccaccatcc cactggcctc tgataaaggc
     1741 tactgttgga ttgattcgaa atcttgccct ttgtcccgca aatcatgcac ctttgcgtga
     1801 gcagggtgcc attccacgac tagttcagtt gcttgttcgt gcacatcagg atacccagcg
     1861 ccgtacgtcc atgggtggga cacagcagca atttgtggag ggggtccgca tggaagaaat
     1921 agttgaaggt tgtaccggag cccttcacat cctagctcgg gatgttcaca accgaattgt
     1981 tatcagagga ctaaatacca ttccattgtt tgtgcagctg ctttattctc ccattgaaaa
     2041 catccaaaga gtagctgcag gggtcctctg tgaacttgct caggacaagg aagctgcaga
     2101 agctattgaa gctgagggag ccacagctcc tctgacagag ttacttcact ctaggaatga
     2161 aggtgtggcg acatatgcag ctgctgtttt gttccgaatg tctgaggaca agccacaaga
     2221 ttacaagaaa cggctttcag ttgagctgac cagctctctc ttcagaacag agccaatggc
     2281 ttggaatgag actgctgatc ttggacttga tattggtgcc cagggagaac cccttggata
     2341 tcgccaggat gatcctagct atcgttcttt tcactctggt ggatatggcc aggatgcctt
     2401 gggtatggac cccatgatgg aacatgagat gggtggccac caccctggtg ctgactatcc
     2461 agttgatggg ctgccagatc tggggcatgc ccaggacctc atggatgggc tgcctccagg
     2521 tgacagcaat cagctggcct ggtttgatac tgacctgtaa atcatccttt agctgtattg
     2581 tctgaacttg cattgtgatt ggcctgtaga gttgctgaga gggctcgagg ggtgggctgg
     2641 tatctcagaa agtgcctgac acactaacca agctgagttt cctatgggaa caattgaagt
     2701 aaactttttg ttctggtcct ttttggtcga ggagtaacaa tacaaatgga ttttgggagt
     2761 gactcaagaa gtgaagaatg cacaagaatg gatcacaaga tggaatttag caaaccctag
     2821 ccttgcttgt taaaattttt tttttttttt ttttaagaat atctgtaatg gtactgactt
     2881 tgcttgcttt gaagtagctc tttttttttt tttttttttt tttttttgca gtaactgttt
     2941 tttaagtctc tcgtagtgtt aagttatagt gaatactgct acagcaattt ctaattttta
     3001 agaattgagt aatggtgtag aacactaatt aattcataat cactctaatt aattgtaatc
     3061 tgaataaagt gtaacaattg tgtagccttt ttgtataaaa tagacaaata gaaaatggtc
     3121 caattagttt cctttttaat atgcttaaaa taagcaggtg gatctatttc atgtttttga
     3181 tcaaaaacta tttgggatat gtatgggtag ggtaaatcag taagaggtgt tatttggaac
     3241 cttgttttgg acagtttacc agttgccttt tatcccaaag ttgttgtaac ctgctgtgat
     3301 acgatgcttc aagagaaaat gcggttataa aaaatggttc agaattaaac ttttaattca
     3361 tt
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  






&&&&&&&





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC014411. Homo sapiens, clo...[gi:15680136] Links  


LOCUS       BC014411                 823 bp    mRNA    linear   PRI 19-SEP-2001
DEFINITION  Homo sapiens, clone MGC:19914 IMAGE:4548425, mRNA, complete cds.
ACCESSION   BC014411
VERSION     BC014411.1  GI:15680136
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 823)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-SEP-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
            Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
            Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
            Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
            Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
            George Yang, Scott Zuyderduyn, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 29 Row: a Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 8924249.
FEATURES             Location/Qualifiers
     source          1..823
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:19914 IMAGE:4548425"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             18..788
                     /codon_start=1
                     /product="Unknown (protein for MGC:19914)"
                     /protein_id="AAH14411.1"
                     /db_xref="GI:15680137"
                     /translation="MSELPGDVRAFLREHPSLRLQTDARKVRCILTGHELPCRLPELQ
                     VYTRGKKYQRLVRASPAFDYAEFEPHIVPSTKNPHQLFCKLTLRHINKCPEHVLRHTQ
                     GRRYQRALCKYEECQKQGVEYVPACLVHRRRRREDQMDGDGPRPREAFWEPTSSDEGG
                     AASDDSMTDLYPPELFTRKDLGSTEDGDGTDDFLTDKEDEKAKPPREKATDEGRRETT
                     VYRGLVQKRGKKQLGSLKKKFKSHHRKPKSFSSCKQPG"
BASE COUNT      217 a    227 c    262 g    117 t
ORIGIN      
        1 ccgcgggcgc gtcggccatg agcgagttgc cgggcgacgt gcgggcgttt ctgcgggagc
       61 acccgagcct gcggctccag acggacgccc gcaaggtgag gtgcatcctg acaggtcacg
      121 agctgccctg ccgcctgccg gagctccagg tctacacccg cggcaaaaag taccagcggc
      181 tggtccgcgc ctccccggcc ttcgactatg cagagttcga gccgcacatc gtgcccagca
      241 ccaagaaccc gcaccagttg ttctgcaaac tcaccctgcg gcacatcaac aagtgcccag
      301 aacacgtgct gaggcacacc cagggccggc ggtaccagcg agctctgtgt aaatatgaag
      361 aatgtcagaa gcaaggggtg gagtacgtgc ctgcctgcct ggtgcaccgg aggaggagga
      421 gggaggacca gatggacggt gacgggcctc gcccgcggga agccttctgg gagcccacat
      481 ccagtgatga ggggggagct gcaagtgatg acagcatgac agacctgtac ccacctgagc
      541 tattcaccag aaaggacctt ggaagcacgg aggatgggga tggcactgat gactttttga
      601 cagacaaaga ggatgagaag gcaaagcccc caagagagaa ggccactgat gagggcagga
      661 gagagacgac cgtgtaccga gggctggtcc agaagcgcgg gaagaagcag ttgggctcgt
      721 tgaaaaagaa gttcaagagt catcaccgca aacccaagag cttcagctcc tgtaaacagc
      781 caggttaata aaagcacatg ccgtgaaaaa aaaaaaaaaa aaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: BM789997. K-EST0069714 S17N...[gi:19138229] Links  


IDENTIFIERS

dbEST Id:       11387293
EST name:       K-EST0069714
GenBank Acc:    BM789997
GenBank gi:     19138229

CLONE INFO
Clone Id:       S17N258215-5-A10 (5')
Plate:          5 Row: A Column: 10
DNA type:       cDNA

PRIMERS
PolyA Tail:     Unknown

SEQUENCE
                GCGAGCGGCTTCCGCCGGGCTGCTCCGCGGGCGCGTCGGCCATGAGCGAGTTGCCGGGCG
                ACGTGCGGGCGTTTCTGCGGGAGCACCCGAGCCTGCGGCTCCAGACGGACGCCCGCAAGG
                TGAGGTGCATCCTGACAGGTCACGAGCTGCCCTGCCGCCTGCCGGAGCTCCAGGTCTACA
                CCCGCGGCAAAAAGTACCAGCGGCTGGTCCGCGCCTCCCCGGCCTTCGACTATGCAGAGT
                TCGAGCCGCACATCGTGCCCAGCACCAAGAACCCGCACCAGTTGTTCTGCAAACTCACCC
                TGCGGCACATCAACAAGTGCCCAGAACACGTGCTGAGGCACACCCAGGGCCGGCGGTACC
                AGCGAGCTCTGTGTAAATATGAAGAATGTCAGAAGCAAGGGGTGGAGTACGTGCCTGCCT
                GCCTGGTGCACCGGAGGAGGAGGAGGGAGGACCANATGGACGGTGACGGGCCTCGCCCGC
                GGGAAGCCTTCTGGGAGCCCACATCCAGTGATGAGGGGGGAGCTGCAAGTGATGACAGCA
                TGACAGACCTGTACCCACCTGAGCTATTCA
Quality:        High quality sequence stops at base: 570

Entry Created:  Mar 5 2002
Last Updated:   Mar 5 2002

LIBRARY
Lib Name:       S17N258215
Organism:       Homo sapiens
Sex:            M
Organ:          Stomach
Lab host:       Top10F'
Vector:         pCNS
R. Site 1:      EcoRI
R. Site 2:      NotI
Description:    The poly (A)+ RNA was dephosphorylated with bacterial
                alkaline phosphatase (BAP) and then decapped with tabacco
                acid pyrophosphatase (TAP). The decapped intact mRNA was
                ligated with DNA-RNA linker including EcoR I site by
                treatment of T4 RNA ligase and the first strand cDNA was
                synthesized from oligo dT-selected mRNA by priming with
                dT-tailed vector. The dT-tailed vector was adjusted to have
                about 60nt. The cDNA vector was circularized with E. coli
                DNA ligase after digestion of EcoRI which site is also
                included in vector. An RNA strand converted to a DNA strand
                by Okayama-Berg method. The obtained cDNA vectors were used
                for transformation of competent cells E. coli Top10F' by
                electroporation method. The cDNA libraries constructed by
                this method are full-length enriched cDNA library.

SUBMITTER
Name:           Kim YS
Lab:            Genome Research Center
Institution:    Korea Research Institute of Bioscience & Biotechnology
Address:        52 Eoeun-dong Yuseong-gu, Daejeon 305-333, South Korea
Tel:            +82-42-860-4470
Fax:            +82-42-860-4409
E-mail:         yongsung@mail.kribb.re.kr

CITATIONS
Title:          21C Frontier Korean EST Project 2001
Authors:        Kim,N.S., Hahn,Y., Oh,J.H., Lee,J.Y., Ahn,H.Y., Chu,M.Y.,
                Kim,M.R., Oh,K.J., Cheong,J.E., Sohn,H.Y., Kim,J.M., Park
                ,H.S., Kim,S., Kim,Y.S.
Year:           2002
Status:         Unpublished


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  






&&&&&&&&&&




Genbank source (part 2)


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M55153. Human transglutam...[gi:339520] Links  


LOCUS       HUMTGASE                3257 bp    mRNA    linear   PRI 07-MAR-1995
DEFINITION  Human transglutaminase (TGase) mRNA, complete cds.
ACCESSION   M55153
VERSION     M55153.1  GI:339520
KEYWORDS    transglutaminase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3257)
  AUTHORS   Gentile,V., Saydak,M., Chiocca,E.A., Akande,O., Birckbichler,P.J.,
            Lee,K.N., Stein,J.P. and Davies,P.J.
  TITLE     Isolation and characterization of cDNA clones to mouse macrophage
            and human endothelial cell tissue transglutaminases
  JOURNAL   J. Biol. Chem. 266 (1), 478-483 (1991)
  MEDLINE   91093168
   PUBMED   1670766
COMMENT     Original source text: Human umbilical vein endotehlial cell, cDNA
            to mRNA, clone 1.
FEATURES             Location/Qualifiers
     source          1..3257
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="14"
                     /clone="1"
                     /cell_type="endothelial cell"
                     /tissue_type="umbilical vein"
                     /tissue_lib="lambda-ZAP"
     gene            1..3257
                     /gene="TGM1"
     CDS             136..2199
                     /gene="TGM1"
                     /EC_number="2.3.2.13"
                     /codon_start=1
                     /product="transglutaminase"
                     /protein_id="AAA63261.1"
                     /db_xref="GI:339521"
                     /db_xref="GDB:G00-125-299"
                     /translation="MAEELVLERCDLELETNGRDHHTADLCREKLVVRRGQPFWLTLH
                     FEGRNYQASVDSLTFSVVTGPAPSQEAGTKARFPLRDAVEEGDWTATVVDQQDCTLSL
                     QLTTPANAPIGLYRLSLEASTGYQGSSFVLGHFILLFNAWCPADAVYLDSEEERQEYV
                     LTQQGFIYQGSAKFIKNIPWNFGQFQDGILDICLILLDVNPKFLKNAGRDCSRRSSPV
                     YVGRVGSGMVNCNDDQGVLLGRWDNNYGDGVSPMSWIGSVDILRRWKNHGCQRVKYGQ
                     CWVFAAVACTVLRCLGIPTRVVTNYNSAHDQNSNLLIEYFRNEFGEIQGDKSEMIWNF
                     HCWVESWMTRPDLQPGYEGWQALDPTPQEKSEGTYCCGPVPVRAIKEGDLSTKYDAPF
                     VFAEVNADVVDWIQQDDGSVHKSINRSLIVGLKISTKSVGRDEREDITHTYKYPEGSS
                     EEREAFTRANHLNKLAEKEETGMAMRIRVGQSMNMGSDFDVFAHITNNTAEEYVCRLL
                     LCARTVSYNGILGPECGTKYLLNLTLEPFSEKSVPLCILYEKYRDCLTESNLIKVRAL
                     LVEPVINSYLLAERDLYLENPEIKIRILGEPKQKRKLVAEVSLQNPLPVALEGCTFTV
                     EGAGLTEEQKTVEIPDPVEAGEEVKVRMDLVPLHMGLHKLVVNFESDKLKAVKGFRNV
                     IIGPA"
BASE COUNT      716 a    974 c    945 g    622 t
ORIGIN      
        1 aacaggcgtg acgccagttc taaacttgaa acaaaacaaa acttcaaagt acaccaaaat
       61 agaacctcct taaagcataa atctcacgga gggtctcggc cgccagtgga aggagccacc
      121 gcccccgccc cgaccatggc cgaggagctg gtcttagaga ggtgtgatct ggagctggag
      181 accaatggcc gagaccacca cacggccgac ctgtgccggg agaagctggt ggtgcgacgg
      241 ggccagccct tctggctgac cctgcacttt gagggccgca actaccaggc cagtgtagac
      301 agtctcacct tcagtgtcgt gaccggccca gcccctagcc aggaggccgg gaccaaggcc
      361 cgttttccac taagagatgc tgtggaggag ggtgactgga cagccaccgt ggtggaccag
      421 caagactgca ccctctcgct gcagctcacc accccggcca acgcccccat cggcctgtat
      481 cgcctcagcc tggaggcctc cactggctac cagggatcca gctttgtgct gggccacttc
      541 attttgctct tcaacgcctg gtgcccagcg gatgctgtgt acctggactc ggaagaggag
      601 cggcaggagt atgtcctcac ccagcagggc tttatctacc agggctcggc caagttcatc
      661 aagaacatac cttggaattt tgggcagttt caagatggga tcctagacat ctgcctgatc
      721 cttctagatg tcaaccccaa gttcctgaag aacgccggcc gtgactgctc ccggcgcagc
      781 agccccgtct acgtgggccg ggtgggtagt ggcatggtca actgcaacga tgaccagggt
      841 gtgctgctgg gacgctggga caacaactac ggggacggcg tcagccccat gtcctggatc
      901 ggcagcgtgg acatcctgcg gcgctggaag aaccacggct gccagcgcgt caagtatggc
      961 cagtgctggg tcttcgccgc cgtggcctgc acagtgctga ggtgcctagg catccctacc
     1021 cgcgtcgtga ccaactacaa ctcggcccat gaccagaaca gcaaccttct catcgagtac
     1081 ttccgcaatg agtttgggga gatccagggt gacaagagcg agatgatctg gaacttccac
     1141 tgctgggtgg agtcgtggat gaccaggccg gacctgcagc cggggtacga gggctggcag
     1201 gccctggacc caacgcccca ggagaagagc gaaggaacgt actgctgtgg cccagttcca
     1261 gttcgtgcca tcaaggaggg cgacctgagc accaagtacg atgcgccctt tgtctttgcg
     1321 gaggtcaatg ccgacgtggt agactggatc cagcaggacg atgggtctgt gcacaaatcc
     1381 atcaaccgtt ccctgatcgt tgggctgaag atcagcacta agagcgtggg ccgagacgag
     1441 cgggaggata tcacccacac ctacaaatac ccagaggggt cctcagagga gagggaggcc
     1501 ttcacaaggg cgaaccacct gaacaaactg gccgagaagg aggagacagg gatggccatg
     1561 cggatccgtg tgggccagag catgaacatg ggcagtgact ttgacgtctt tgcccacatc
     1621 accaacaaca ccgctgagga gtacgtctgc cgcctcctgc tctgtgcccg caccgtcagc
     1681 tacaatggga tcttggggcc cgagtgtggc accaagtacc tgctcaacct aaccctggag
     1741 cctttctctg agaagagcgt tcctctttgc atcctctatg agaaataccg tgactgcctt
     1801 acggagtcca acctcatcaa ggtgcgggcc ctcctcgtgg agccagttat caacagctac
     1861 ctgctggctg agagggacct ctacctggag aatccagaaa tcaagatccg gatccttggg
     1921 gagcccaagc agaaacgcaa gctggtggct gaggtgtccc tgcagaaccc gctccctgtg
     1981 gccctggaag gctgcacctt cactgtggag ggggccggcc tgactgagga gcagaagacg
     2041 gtggagatcc cagaccccgt ggaggcaggg gaggaagtta aggtgagaat ggacctcgtg
     2101 ccgctccaca tgggcctcca caagctggtg gtgaacttcg agagcgacaa gctgaaggct
     2161 gtgaagggct tccggaatgt catcattggc cccgcctaag ggacccctgc tcccagcctg
     2221 ctgagagccc ccaccttgat cccaatcctt atcccaagct agtgagcaaa atatgcccct
     2281 tattgggccc cagaccccag ggcagggtgg gcagcctatg ggggctctcg gaaatggaat
     2341 gtgcccctgg cccatctcag cctcctgagc ctgtgggtcc ccactcaccc cctttgctgt
     2401 gaggaatgct ctgtgccaga aacagtggga gccctgacct gtgctgactg gggctggggt
     2461 gagagaggaa agacctacat tccctctcct gcccagatgc cctttggaaa gccattgacc
     2521 acccaccata ttgtttgatc tacttcatag ctccttggag caggcaaaaa agggacagca
     2581 tgcccttggc tggatcagga atccagctcc ctagactgca tcccgtacct cttcccatga
     2641 ctgcacccag ctccaggggc ccttgggaca cccagagctg ggtggggaca gtgataggcc
     2701 caaggtcccc tccacatccc agcagcccaa gcttaatagc cctccccctc aacctcacca
     2761 ttgtgaagca cctactatgt gctgggtgcc tcccacactt gctggggctc acggggcctc
     2821 caacccattt aatcaccatg ggaaactgtt gtgggcgctg cttccaggat aaggagactg
     2881 aggcttagag agaggaggca gccccctcca caccagtggc ctcgtggtta taagcaaggc
     2941 tgggtaatgt gaaggcccaa gagcagagtc tgggcctctg actctgagtc cactgctcca
     3001 tttataaccc cagcctgacc tgagactgtc gcagaggctg tctggggcct ttatcaaaaa
     3061 aagactcagc caagacaagg aggtagagag gggactgggg gactgggagt cagagccctg
     3121 gctgggttca ggtcccacgt ctggccagcg actgccttct cctctctggg cctttgtttc
     3181 cttgttggtc agaggagtga ttgaacctgc tcatctccaa ggatcctctc cactccatgt
     3241 ttgcaataca caattcc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D31767. Human mRNA for KI...[gi:505091] Links  


LOCUS       HUMORFKG1F              1897 bp    mRNA    linear   PRI 06-OCT-2001
DEFINITION  Human mRNA for KIAA0058 gene, complete cds.
ACCESSION   D31767
VERSION     D31767.1  GI:505091
KEYWORDS    KIAA0058.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S.,
            Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S.
  TITLE     Prediction of the coding sequences of unidentified human genes. II.
            The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by
            analysis of cDNA clones from human cell line KG-1
  JOURNAL   DNA Res. 1 (5), 223-229 (1994)
  MEDLINE   96051398
REFERENCE   2  (bases 1 to 1897)
  AUTHORS   Ohara,O., Nagase,T., Kikuno,R. and Nomura,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-JUN-1994) Osamu Ohara, Kazusa DNA Research Institute;
            1532-3, Yana, Kisarazu, Chiba 292-0812, Japan
            (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913)
FEATURES             Location/Qualifiers
     source          1..1897
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /sex="male"
                     /cell_line="KG-1"
                     /cell_type="myeloblast"
     gene            1..1897
                     /gene="KIAA0058"
     5'UTR           1..69
                     /gene="KIAA0058"
     CDS             70..576
                     /gene="KIAA0058"
                     /codon_start=1
                     /protein_id="BAA06545.1"
                     /db_xref="GI:505092"
                     /translation="MNSKGQYPTQPTYPVQPPGNPVYPQTLHLPQAPPYTDAPPAYSE
                     LYRPSFVHPGAATVPTMSAAFPGASLYLPMAQSVAVGPLGSTIPMAYYPVGPIYPPGS
                     TVLVEGGYDAGARFGAGATAGNIPPPPPGCPPNAAQLAVMQGANVLVTQRKGNFFMGG
                     SDGGYTIW"
     3'UTR           577..1897
                     /gene="KIAA0058"
BASE COUNT      478 a    455 c    391 g    573 t
ORIGIN      
        1 ctccgaacag gaagaggacg aaaaaaataa ccgtccgcga cgccgagaca aaccggaccc
       61 gcaaccacca tgaacagcaa aggtcaatat ccaacacagc caacctaccc tgtgcagcct
      121 cctgggaatc cagtataccc tcagaccttg catcttcctc aggctccacc ctataccgat
      181 gctccacctg cctactcaga gctctatcgt ccgagctttg tgcacccagg ggctgccaca
      241 gtccccacca tgtcagccgc atttcctgga gcctctctgt atcttcccat ggcccagtct
      301 gtggctgttg ggcctttagg ttccacaatc cccatggctt attatccagt cggtcccatc
      361 tatccacctg gctccacagt gctggtggaa ggagggtatg atgcaggtgc cagatttgga
      421 gctggggcta ctgctggcaa cattcctcct ccacctcctg gatgccctcc caatgctgct
      481 cagcttgcag tcatgcaggg agccaacgtc ctcgtaactc agcggaaggg gaacttcttc
      541 atgggtggtt cagatggtgg ctacaccatc tggtgaggaa ccaaggccac ctctgtgccg
      601 ggaaagacat cacatacctt cagcacttct cacaatgtaa ctgctttagt catattaacc
      661 tgaagttgca gtttagacac atgttgttgg ggtgtctttc tggtgcccaa actttcaggc
      721 acttttcaaa tttaataagg aaccatgtaa tggtagcagt acctccctaa agcattttga
      781 ggtaggggag gtatccattc ataaaatgaa tgtgggtgaa gccgccctaa ggattttcct
      841 ttaatttctc tggagtaata ctgtaccata ctggtctttg cttttagtaa taaaacatca
      901 aattaggttt ggagggaact ttgatcttcc taagaattaa agttgccaaa ttattctgat
      961 tggtctttaa tctcctttaa gtctttgata tatattactt gttataaatg gaacgcatta
     1021 gttgtctgcc ttttcctttc catcccttgc cccacccatc ccatctccaa ccctagtctt
     1081 ccatttcctc ccgccagtct ccattgaatc aatggtgcag gacagaaagc cagtcagact
     1141 aatttccttc tttcctcgca cttctcccca ctcgtcatct tttaactagt gtttcacaag
     1201 gatcctctga aaccctctct gtgccccaag tacagatgcc attacttctg ctttcgtatc
     1261 tcctcaggca aaagtggagg gtgccttatg ggccctcctc ataggttgtc tctgcataca
     1321 cgaacctaac ccaaatttgc tttggtgcca gaaaaactga gctatgtttg aacaaagatg
     1381 tcgtgcaaac tgtactgtga acaacagttg gtttaaaata tgaggggcaa ggaggaggat
     1441 gcatttcaaa agcttgattg atgtgttcag agctaaatta agaggagttt tcagatcaaa
     1501 aactggttac cattttttgt cagagtgtct gatgcggcca ctcattcggc tccccagaat
     1561 tcctagactg ggttaatagg gtcatattgt gaatgtctca ctacaaaatg acttgagtcc
     1621 agtgaaatct cattagggtt taagaatatt tcagggatcc ttaatgtttt gatttttgtt
     1681 ttctgaaatt ggattttatt ttattttatc ttataatttc agttcatcta aattgtgtgt
     1741 tctgtacatg tgatgtttga ctgtaccatt gactgttatg gaagttcagc gttgtatgtc
     1801 tctctctaca ctgtggtgca cttaacttgt ggaattttta tactaaaaat gtagaataaa
     1861 gactattttg aagatttgaa taaagtgatg aagttgc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D13666. Homo sapiens osf-...[gi:393316] Links  


LOCUS       HUMOSF2OS               3213 bp    mRNA    linear   PRI 13-APR-1999
DEFINITION  Homo sapiens osf-2 mRNA for osteoblast specific factor 2 (OSF-2os),
            complete cds.
ACCESSION   D13666
VERSION     D13666.1  GI:393316
KEYWORDS    osteoblast specific factor 2; OSF-2os.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3213)
  AUTHORS   Takeshita,S., Kikuno,R., Tezuka,K. and Amann,E.
  TITLE     Osteoblast-specific factor 2: cloning of a putative bone adhesion
            protein with homology with the insect protein fasciclin I
  JOURNAL   Biochem. J. 294 (Pt 1), 271-278 (1993)
  MEDLINE   93371373
REFERENCE   2  (bases 1 to 3213)
  AUTHORS   Kikuno,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-NOV-1992) Reiko Kikuno, Hoechst Japan Ltd., Pharma
            Research Labs.; 1-3-2 Minami-dai, Kawagoe, Saitama 350-11, Japan
            (E-mail:rkikuno@ddbj.nig.ac.jp, Tel:0492-43-6149, Fax:0492-43-2479)
FEATURES             Location/Qualifiers
     source          1..3213
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="pKOT158"
                     /cell_type="osteoblast"
                     /tissue_type="osteosarcoma"
     gene            1..3213
                     /gene="osf-2"
     CDS             12..2522
                     /gene="osf-2"
                     /note="OSF-2os"
                     /codon_start=1
                     /product="osteoblast specific factor 2"
                     /protein_id="BAA02837.1"
                     /db_xref="GI:393317"
                     /translation="MIPFLPMFSLLLLLIVNPINANNHYDKILAHSRIRGRDQGPNVC
                     ALQQILGTKKKYFSTCKNWYKKSICGQKTTVLYECCPGYMRMEGMKGCPAVLPIDHVY
                     GTLGIVGATTTQRYSDASKLREEIEGKGSFTYFAPSNEAWDNLDSDIRRGLESNVNVE
                     LLNALHSHMINKRMLTKDLKNGMIIPSMYNNLGLFINHYPNGVVTVNCARIIHGNQIA
                     TNGVVHVIDRVLTQIGTSIQDFIEAEDDLSSFRAAAITSDILEALGRDGHFTLFAPTN
                     EAFEKLPRGVLERFMGDKVASEALMKYHILNTLQCSESIMGGAVFETLEGNTIEIGCD
                     GDSITVNGIKMVNKKDIVTNNGVIHLIDQVLIPDSAKQVIELAGKQQTTFTDLVAQLG
                     LASALRPDGEYTLLAPVNNAFSDDTLSMVQRLLKLILQNHILKVKVGLNELYNGQILE
                     TIGGKQLRVFVYRTAVCIENSCMEKGSKQGRNGAIHIFREIIKPAEKSLHEKLKQDKR
                     FSTFLSLLEAADLKELLTQPGDWTLFVPTNDAFKGMTSEEKEILIRDKNALQNIILYH
                     LTPGVFIGKGFEPGVTNILKTTQGSKIFLKEVNDTLLVNELKSKESDIMTTNGVIHVV
                     DKLLYPADTPVGNDQLLEILNKLIKYIQIKFVRGSTFKEIPVTVYTTKIITKVVEPKI
                     KVIEGSLQPIIKTEGPTLTKVKIEGEPEFRLIKEGETITEVIHGEPIIKKYTKIIDGV
                     PVEITEKETREERIITGPEIKYTRISTGGGETEETLKKLLQEEVTKVTKFIEGGDGHL
                     FEDEEIKRLLQGDTPVRKLQANKKVQGSRRRLREGRSQ"
     polyA_signal    3118..3123
                     /gene="osf-2"
BASE COUNT     1096 a    590 c    654 g    873 t
ORIGIN      
        1 agagactcaa gatgattccc tttttaccca tgttttctct actattgctg cttattgtta
       61 accctataaa cgccaacaat cattatgaca agatcttggc tcatagtcgt atcaggggtc
      121 gggaccaagg cccaaatgtc tgtgcccttc aacagatttt gggcaccaaa aagaaatact
      181 tcagcacttg taagaactgg tataaaaagt ccatctgtgg acagaaaacg actgttttat
      241 atgaatgttg ccctggttat atgagaatgg aaggaatgaa aggctgccca gcagttttgc
      301 ccattgacca tgtttatggc actctgggca tcgtgggagc caccacaacg cagcgctatt
      361 ctgacgcctc aaaactgagg gaggagatcg agggaaaggg atccttcact tactttgcac
      421 cgagtaatga ggcttgggac aacttggatt ctgatatccg tagaggtttg gagagcaacg
      481 tgaatgttga attactgaat gctttacata gtcacatgat taataagaga atgttgacca
      541 aggacttaaa aaatggcatg attattcctt caatgtataa caatttgggg cttttcatta
      601 accattatcc taatggggtt gtcactgtta attgtgctcg aatcatccat gggaaccaga
      661 ttgcaacaaa tggtgttgtc catgtcattg accgtgtgct tacacaaatt ggtacctcaa
      721 ttcaagactt cattgaagca gaagatgacc tttcatcttt tagagcagct gccatcacat
      781 cggacatatt ggaggccctt ggaagagacg gtcacttcac actctttgct cccaccaatg
      841 aggcttttga gaaacttcca cgaggtgtcc tagaaaggtt catgggagac aaagtggctt
      901 ccgaagctct tatgaagtac cacatcttaa atactctcca gtgttctgag tctattatgg
      961 gaggagcagt ctttgagacg ctggaaggaa atacaattga gataggatgt gacggtgaca
     1021 gtataacagt aaatggaatc aaaatggtga acaaaaagga tattgtgaca aataatggtg
     1081 tgatccattt gattgatcag gtcctaattc ctgattctgc caaacaagtt attgagctgg
     1141 ctggaaaaca gcaaaccacc ttcacggatc ttgtggccca attaggcttg gcatctgctc
     1201 tgaggccaga tggagaatac actttgctgg cacctgtgaa taatgcattt tctgatgata
     1261 ctctcagcat ggttcagcgc ctccttaaat taattctgca gaatcacata ttgaaagtaa
     1321 aagttggcct taatgagctt tacaacgggc aaatactgga aaccatcgga ggcaaacagc
     1381 tcagagtctt cgtatatcgt acagctgtct gcattgaaaa ttcatgcatg gagaaaggga
     1441 gtaagcaagg gagaaacggt gcgattcaca tattccgcga gatcatcaag ccagcagaga
     1501 aatccctcca tgaaaagtta aaacaagata agcgctttag caccttcctc agcctacttg
     1561 aagctgcaga cttgaaagag ctcctgacac aacctggaga ctggacatta tttgtgccaa
     1621 ccaatgatgc ttttaaggga atgactagtg aagaaaaaga aattctgata cgggacaaaa
     1681 atgctcttca aaacatcatt ctttatcacc tgacaccagg agttttcatt ggaaaaggat
     1741 ttgaacctgg tgttactaac attttaaaga ccacacaagg aagcaaaatc tttctgaaag
     1801 aagtaaatga tacacttctg gtgaatgaat tgaaatcaaa agaatctgac atcatgacaa
     1861 caaatggtgt aattcatgtt gtagataaac tcctctatcc agcagacaca cctgttggaa
     1921 atgatcaact gctggaaata cttaataaat taatcaaata catccaaatt aagtttgttc
     1981 gtggtagcac cttcaaagaa atccccgtga ctgtctatac aactaaaatt ataaccaaag
     2041 ttgtggaacc aaaaattaaa gtgattgaag gcagtcttca gcctattatc aaaactgaag
     2101 gacccacact aacaaaagtc aaaattgaag gtgaacctga attcagactg attaaagaag
     2161 gtgaaacaat aactgaagtg atccatggag agccaattat taaaaaatac accaaaatca
     2221 ttgatggagt gcctgtggaa ataactgaaa aagagacacg agaagaacga atcattacag
     2281 gtcctgaaat aaaatacact aggatttcta ctggaggtgg agaaacagaa gaaactctga
     2341 agaaattgtt acaagaagag gtcaccaagg tcaccaaatt cattgaaggt ggtgatggtc
     2401 atttatttga agatgaagaa attaaaagac tgcttcaggg agacacaccc gtgaggaagt
     2461 tgcaagccaa caaaaaagtt caaggttcta gaagacgatt aagggaaggt cgttctcagt
     2521 gaaaatccaa aaaccagaaa aaaatgttta tacaacccta agtcaataac ctgaccttag
     2581 aaaattgtga gagccaagtt gacttcagga actgaaacat cagcacaaag aagcaatcat
     2641 caaataattc tgaacacaaa tttaatattt ttttttctga atgagaaaca tgagggaaat
     2701 tgtggagtta gcctcctgtg gtaaaggaat tgaagaaaat ataacacctt acaccctttt
     2761 tcatcttgac attaaaagtt ctggctaact ttggaatcca ttagagaaaa atccttgtca
     2821 ccagattcat tacaattcaa atcgaagagt tgtgaactgt tatcccattg aaaagaccga
     2881 gccttgtatg tatgttatgg atacataaaa tgcacgcaag ccattatctc tccatgggaa
     2941 gctaagttat aaaaataggt gcttggtgta caaaactttt tatatcaaaa ggctttgcac
     3001 atttctatat gagtgggttt actggtaaat tatgttattt tttacaacta attttgtact
     3061 ctcagaatgt ttgtcatatg cttcttgcaa tgcatatttt ttaatctcaa acgtttcaat
     3121 aaaaccattt ttcagatata aagagaatta cttcaaattg agtaattcag aaaaactcaa
     3181 gatttaagtt aaaaagtggt ttggacttgg gaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  


&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: J04182. Homo sapiens lyso...[gi:186927] Links  


LOCUS       HUMLAMP1A               2455 bp    mRNA    linear   PRI 11-JAN-1995
DEFINITION  Homo sapiens lysosomal membrane glycoprotein-1 (LAMP1) mRNA,
            complete cds.
ACCESSION   J04182
VERSION     J04182.1  GI:186927
KEYWORDS    LAMP1 gene; lysosomal membrane glycoprotein-1; membrane
            glycoprotein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2455)
  AUTHORS   Fukuda,M., Viitala,J., Matteson,J. and Carlsson,S.R.
  TITLE     Cloning of cDNAs encoding human lysosomal membrane glycoproteins,
            h-lamp-1 and h-lamp-2. Comparison of their deduced amino acid
            sequences
  JOURNAL   J. Biol. Chem. 263 (35), 18920-18928 (1988)
  MEDLINE   89066687
   PUBMED   3198605
COMMENT     Original source text: Homo sapiens placenta cDNA to mRNA.
            Computer readable copy of sequence [1] kindly submitted by M.Fukuda
            24-OCT-1988.
FEATURES             Location/Qualifiers
     source          1..2455
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="13q34"
                     /clone="P-hL1-15B"
                     /tissue_type="placenta"
     gene            1..2455
                     /gene="LAMP1"
     CDS             191..1441
                     /gene="LAMP1"
                     /note="precursor"
                     /codon_start=1
                     /product="lysosomal membrane glycoprotein-1"
                     /protein_id="AAA60382.1"
                     /db_xref="GI:307109"
                     /db_xref="GDB:G00-120-137"
                     /translation="MAPRSARRPLLLLLPVAAARPHALSSAAMFMVKNGNGTACIMAN
                     FSAAFSVNYDTKSGPKNMTFDLPSDATVVLNRSSCGKENTSDPSLVIAFGRGHTLTLN
                     FTRNATRYSVQLMSFVYNLSDTHLFPNASSKEIKTVESITDIRADIDKKYRCVSGTQV
                     HMNNVTVTLHDATIQAYLSNSSFSRGETRCEQDRPSPTTAPPAPPSPSPSPVPKSPSV
                     DKYNVSGTNGTCLLASMGLQLNLTYERKDNTTVTRLLNINPNKTSASGSCGAHLVTLE
                     LHSEGTTVLLFQFGMNASSSRFFLQGIQLNTILPDARDPAFKAANGSLRALQATVGNS
                     YKCNAEEHVRVTKAFSVNIFKVWVQAFKVEGGQFGSVEECLLDENSTLIPIAVGGALA
                     GLVLIVLIAYLVGRKRSHAGYQTI"
     sig_peptide     191..271
                     /gene="LAMP1"
                     /note="G00-120-137"
     mat_peptide     272..1438
                     /gene="LAMP1"
                     /product="lysosomal membrane glycoprotein-1"
BASE COUNT      530 a    671 c    677 g    577 t
ORIGIN      
        1 gaattcgggc gggcttcttc gctgccgacg tacgacgagt ggccgggctc ttgcgtctgg
       61 taacgcgctg tctctaacgc cagcgccgtc tcgcgcgcac tgcgcacaga ccacccgcag
      121 acgcccggca gtccgcaggc ccaaacgcgc acgcgacccc gctctccgca ccgtacccgg
      181 ccgcctcggc atggcgcccc gcagcgcccg gcgacccctg ctgctgctac tgcctgttgc
      241 tgctgctcgg cctcatgcat tgtcgtcagc agccatgttt atggtgaaaa atggcaacgg
      301 gaccgcgtgc ataatggcca acttctctgc tgccttctca gtgaactacg acaccaagag
      361 tggccccaag aacatgacct ttgacctgcc atcagatgcc acagtggtgc tcaaccgcag
      421 ctcctgtgga aaagagaaca cttctgaccc cagtctcgtg attgcttttg gaagaggaca
      481 tacactcact ctcaatttca cgagaaatgc aacacgttac agcgttcagc tcatgagttt
      541 tgtttataac ttgtcagaca cacacctttt ccccaatgcg agctccaaag aaatcaagac
      601 tgtggaatct ataactgaca tcagggcaga tatagataaa aaatacagat gtgttagtgg
      661 cacccaggtc cacatgaaca acgtgaccgt aacgctccat gatgccacca tccaggcgta
      721 cctttccaac agcagcttca gcaggggaga gacacgctgt gaacaagaca ggccttcccc
      781 aaccacagcg ccccctgcgc cacccagccc ctcgccctca cccgtgccca agagcccctc
      841 tgtggacaag tacaacgtga gcggcaccaa cgggacctgc ctgctggcca gcatggggct
      901 gcagctgaac ctcacctatg agaggaagga caacacgacg gtgacaaggc ttctcaacat
      961 caaccccaac aagacctcgg ccagcgggag ctgcggcgcc cacctggtga ctctggagct
     1021 gcacagcgag ggcaccaccg tcctgctctt ccagttcggg atgaatgcaa gttctagccg
     1081 gtttttccta caaggaatcc agttgaatac aattcttcct gacgccagag accctgcctt
     1141 taaagctgcc aacggctccc tgcgagcgct gcaggccaca gtcggcaatt cctacaagtg
     1201 caacgcggag gagcacgtcc gtgtcacgaa ggcgttttca gtcaatatat tcaaagtgtg
     1261 ggtccaggct ttcaaggtgg aaggtggcca gtttggctct gtggaggagt gtctgctgga
     1321 cgagaacagc acgctgatcc ccatcgctgt gggtggtgcc ctggcggggc tggtcctcat
     1381 cgtcctcatc gcctacctcg tcggcaggaa gaggagtcac gcaggctacc agactatcta
     1441 gcctggtgca cgcaggcaca gcagctgcag gggcctctgt tcctttctct gggcttaggg
     1501 tcctgtcgaa ggggaggcac actttctgca aacgtttctc aaatctgctt catccaatgt
     1561 gaagttcatc ttgcagcatt tactatgcac aacagagtaa ctatcgaaat gacggtgtta
     1621 attttgctaa ctgggttaaa tattttgcta actggttaaa cattaatatt taccaaagta
     1681 ggattttgag ggtgggggtg ctctctctga gggggtgggg gtgccgctgt ctctgagggg
     1741 tgggggtgcc gctgtctgag gggtgggggt gccgctctct ctgagggggt gggggtgccg
     1801 ctttctctga gggggtgggg gtgccgctct ctctgagggg gtgggggtgc tgctctctcc
     1861 gaggggtgga atgccgctgt ctctgagggg tgggggtgcc gctctaaatt ggctccatat
     1921 cattgagttt agggttctgg tgtttggttt cttcattctt tactgcactc agatttaagc
     1981 cttacaaagg gaaacctctg gccgtcacac gtaggacgca tgaaggtcac tcgtgtgagg
     2041 ctgacatgct cacacattac aacagtagag agggaaaatc ctaagacaga ggaactccag
     2101 agatgagtgt ctggagcggc ttcagttcag ctttaaaggc caggacgcgc gacacgtggc
     2161 tggcggcctc gttccagtgg cggcacgtcc ttggcgtctc taatgtctgc agctcaaggg
     2221 ctggcacttt tttaaatata aaaatggtgt tatttttatt tttttttgta aagtgatttt
     2281 tggtcttctg ttgacattcg ggtgatcctg ttctgcgctg tgtacaatgt gagatcggtg
     2341 cgttctcctg atgttttgcc gtggcttggg gattgtacac gggaccagct cacgtaatgc
     2401 attgcctgta acaatgtaat aaaaagcctc tttctttcaa aaaaaccccg aattc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  






&&&&&&&


    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AH000829. Human peripheral ...[gi:488422] Links  


LOCUS       HSPBR1                   253 bp    DNA     linear   PRI 20-MAY-1994
DEFINITION  Human peripheral benzodiazepine receptor gene, exon 1.
ACCESSION   L21951
VERSION     L21951.1  GI:483402
KEYWORDS    benzodiazepine receptor; peripheral benzodiazepine receptor.
SEGMENT     1 of 4
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 253)
  AUTHORS   Lin,D., Chang,Y.J., Strauss,J.F. III and Miller,W.L.
  TITLE     The human peripheral benzodiazepine receptor gene: cloning and
            characterization of alternative splicing in normal tissues and in a
            patient with congenital lipoid adrenal hyperplasia
  JOURNAL   Genomics 18 (3), 643-650 (1993)
  MEDLINE   94140364
   PUBMED   8307574
COMMENT     Original source text: Homo sapiens DNA.
FEATURES             Location/Qualifiers
     source          1..253
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="placenta"
                     /tissue_lib="placenta"
     exon            150..207
                     /note="peripheral benzodiazepine receptor"
BASE COUNT       29 a     69 c    128 g     27 t
ORIGIN      
        1 ccgcgggaga ggtggctttg aggagtgagc tcccggtccg cggggacgcg agtgggccca
       61 gtgccgggct gccaggcggg gcggggcggg gccggggcac tgagaggggc ggggcctggc
      121 ggctgggagg ggcggggcgg atgcgggaca gcggcctggc taactcctgc caggcagtgc
      181 ccttcccgga gcgtgccctc gccgctggtg agtgaggacg ggacgcggag ggggcagcgg
      241 gaagtggggg ccc
//
LOCUS       HSPBR2                   487 bp    DNA     linear   PRI 20-MAY-1994
DEFINITION  Human peripheral benzodiazepine receptor gene, exon 2.
ACCESSION   L21952
VERSION     L21952.1  GI:483403
KEYWORDS    benzodiazepine receptor; peripheral benzodiazepine receptor.
SEGMENT     2 of 4
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 487)
  AUTHORS   Lin,D., Chang,Y.J., Strauss,J.F. III and Miller,W.L.
  TITLE     The human peripheral benzodiazepine receptor gene: cloning and
            characterization of alternative splicing in normal tissues and in a
            patient with congenital lipoid adrenal hyperplasia
  JOURNAL   Genomics 18 (3), 643-650 (1993)
  MEDLINE   94140364
   PUBMED   8307574
COMMENT     Original source text: Homo sapiens DNA.
FEATURES             Location/Qualifiers
     source          1..487
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="placenta"
                     /tissue_lib="placenta"
     exon            166..376
                     /product="peripheral benzodiazepine receptor"
BASE COUNT       67 a    168 c    156 g     96 t
ORIGIN      
        1 ctgaaatgcg ttcactcagc cactcccacc ccgcacatct ctgcgcctcc tgatcagctg
       61 actggttgat ctgtgggtga caggcctttc ggggatgctg gaggagacac gggcctgacc
      121 ccatggcctc ggaatgccct cacgcagccc tgtcttctct ttcagagctc ccctgaacag
      181 cagctgcagc agccatggcc ccgccctggg tgcccgccat gggcttcacg ctggcgccca
      241 gcctggggtg cttcgtgggc tcccgctttg tccacggcga gggtctccgc tggtacgccg
      301 gcctgcagaa gccctcgtgg cacccgcccc actgggtgct gggccctgtc tggggcacgc
      361 tctactcagc catggggtag gtgggcgtgc actggcctgg ggataagcct ggccctttgc
      421 aaggggaggc ctggcccagg acagagggtc ttctccaggc gggccatgga ccatggcatg
      481 gtttccc
//
LOCUS       HSPBR3                   645 bp    DNA     linear   PRI 20-MAY-1994
DEFINITION  Human peripheral benzodiazepine receptor gene, exon 3.
ACCESSION   L21953
VERSION     L21953.1  GI:483404
KEYWORDS    benzodiazepine receptor; peripheral benzodiazepine receptor.
SEGMENT     3 of 4
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 645)
  AUTHORS   Lin,D., Chang,Y.J., Strauss,J.F. III and Miller,W.L.
  TITLE     The human peripheral benzodiazepine receptor gene: cloning and
            characterization of alternative splicing in normal tissues and in a
            patient with congenital lipoid adrenal hyperplasia
  JOURNAL   Genomics 18 (3), 643-650 (1993)
  MEDLINE   94140364
   PUBMED   8307574
COMMENT     Original source text: Homo sapiens DNA.
FEATURES             Location/Qualifiers
     source          1..645
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="placenta"
                     /tissue_lib="placenta"
     exon            224..362
                     /product="peripheral benzodiazepine receptor"
BASE COUNT      138 a    177 c    192 g    138 t
ORIGIN      
        1 gtgtcctaaa tgctgggtat cgcatgaagc actgccatgt gctaactaga atctctacag
       61 caaccctatg aaaaatgaag gcccagacag gtcaagcaac ttccctgaga tcacacagca
      121 gtcagtgtca ggtcacgtat gaaccatcac tgtgccactc ggaggtgggc aacgctcctg
      181 gccttgttcc taatggtgct ctgaactgcg gcctctgttt caggtacggc tcctacctgg
      241 tctggaaaga gctgggaggc ttcacagaga aggctgttgg ttcccctggg cctctacact
      301 gggcagctgg ccctgaactg ggcatggccc cccatcttct tggtgcccga caaatgggct
      361 gggtaagtgt ggccacagca tgtgtccctg atccctggat ccgacccttg gaggacgtgg
      421 ggcatcacat atgacactgg gtcagtgtct atggcggggc caggggagac aaaaggccat
      481 gtctctctag ctgtaagcag cccacaccct ggagcctccc ctctactgac tcctccggtg
      541 agggaagcta ttaaagcaga aggggttgca ggggtgggtt tgggggccac tgtgtaggaa
      601 aacccacacg aagcctggta ctgtgtgggc agggtcactg gcccc
//
LOCUS       HSPBR4                   684 bp    DNA     linear   PRI 20-MAY-1994
DEFINITION  Human peripheral benzodiazepine receptor gene, exon 4.
ACCESSION   L21954
VERSION     L21954.1  GI:483405
KEYWORDS    benzodiazepine receptor; peripheral benzodiazepine receptor.
SEGMENT     4 of 4
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 684)
  AUTHORS   Lin,D., Chang,Y.J., Strauss,J.F. III and Miller,W.L.
  TITLE     The human peripheral benzodiazepine receptor gene: cloning and
            characterization of alternative splicing in normal tissues and in a
            patient with congenital lipoid adrenal hyperplasia
  JOURNAL   Genomics 18 (3), 643-650 (1993)
  MEDLINE   94140364
   PUBMED   8307574
COMMENT     Original source text: Homo sapiens DNA.
FEATURES             Location/Qualifiers
     source          1..684
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="placenta"
                     /tissue_lib="placenta"
     CDS             join(L21952.1:195..376,L21953.1:224..362,164..352)
                     /codon_start=1
                     /product="peripheral benzodiazepine receptor"
                     /protein_id="AAA18228.1"
                     /db_xref="GI:488425"
                     /translation="MAPPWVPAMGFTLAPSLGCFVGSRFVHGEGLRWYAGLQKPSWHP
                     PHWVLGPVWGTLYSAMGYGSYLVWKELGGFTEKAVGSPGPLHWAAGPELGMAPHLLGA
                     RQMGWALVDLLLVSGAAAATTVAWYQVSPLAARLLYPYLAWLAFATTLNYCVWRDNHG
                     WHGGRRLPE"
     exon            164..602
                     /product="peripheral benzodiazepine receptor"
BASE COUNT      113 a    200 c    225 g    146 t
ORIGIN      
        1 aaactagctg cgatggggag ggcttggcca ggtcactcaa gggtggagtg ggggtgagtg
       61 aggctcctga ctcccaaatc cagtgggagt tgggcagtgg gacaggcact tgggtgaacg
      121 cggtgcctca ggcctcccca tcctccgtcc ccaatctctg caggccttgg tggatctcct
      181 gctggtcagt ggggcggcgg cagccactac cgtggcctgg taccaggtga gcccgctggc
      241 cgcccgcctg ctctacccct acctggcctg gctggccttc gcgaccacac tcaactactg
      301 cgtatggcgg gacaaccatg gctggcatgg gggacggcgg ctgccagagt gagtgcccgg
      361 cccaccaggg actgcagctg caccagcagg tgccatcacg cttgtgatgt ggtggccgtc
      421 acgctttcat gaccactggg cctgctagtc tgtcagggcc ttggcccagg ggtcagcaga
      481 gcttcagagg ttgccccacc tgagccccca cccgggagca gtgtcctgtg ctttctgcat
      541 gcttagagca tgttcttgga acatggaatt ttataagctg aataaagttt ttgacttcct
      601 ttaccatggc ctttttgctt gggtgggacc cctggccaca gggaagaggg ggagctgggg
      661 ctgcactgga gctcgtctct gcag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

OMIMOMIMProteinProteinPubMedPubMedTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M36035. Human peripheral ...[gi:184333] Links  


LOCUS       HUMHPBS                  821 bp    mRNA    linear   PRI 20-DEC-1993
DEFINITION  Human peripheral benzodiazepine receptor (hpbs) mRNA, complete cds.
ACCESSION   M36035
VERSION     M36035.1  GI:184333
KEYWORDS    peripheral benzodiazepine receptor.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 821)
  AUTHORS   Riond,J., Mattei,M.G., Kaghad,M., Dumont,X., Guillemot,J.C., Le
            Fur,G., Caput,D. and Ferrara,P.
  TITLE     Molecular cloning and chromosomal localization of a human
            peripheral-type benzodiazepine receptor
  JOURNAL   Eur. J. Biochem. 195 (2), 305-311 (1991)
  MEDLINE   91146565
   PUBMED   1847678
REFERENCE   2  (bases 1 to 821)
  AUTHORS   Riond,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUN-1990) J. Riond, Sanofi Elf Bio-Recherches, BP137,
            31328 Labege Cedex, France
COMMENT     Original source text: Human cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..821
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="p-hPBS11"
                     /cell_line="hystiocytic lymphoma monocyte-like cell line
                     U937"
     mRNA            <1..811
                     /product="peripheral benzodiazepine receptor mRNA"
     CDS             62..571
                     /note="peripheral benzodiazepine receptor"
                     /codon_start=1
                     /protein_id="AAA03652.1"
                     /db_xref="GI:306883"
                     /translation="MAPPWVPAMGFTLAPSLGCFVGSRFVHGEGLRWYAGLQKPSWHP
                     PHWVLGPVWGTLYSAMGYGSYLVWKELGGFTEKAVVPLGLYTGQLALNWAWPPIFFGA
                     RQMGWALVDLLLVSGAAAATTVAWYQVSPLAARLLYPYLAWLAFATTLNYCVWRDNHG
                     WHGGRRLPE"
     polyA_signal    800..805
BASE COUNT      118 a    271 c    260 g    171 t      1 others
ORIGIN      Chromosome 22, map position q13.3.
        1 agtgcccttc ccggagcgtg ccctcgccgc tgagctcccc tgaacagcag ctgcagcagc
       61 catggccccg ccctgggtgc ccgccatggg cttcacgctg gcgcccagcc tggggtgctt
      121 cgtgggctcc cgctttgtcc acggcgaggg tctccgctgg tacgccggcc tgcagaagcc
      181 ctcgtggcac ccgccccact gggtgctggg ccctgtctgg ggcacgctct actcagccat
      241 ggggtacggc tcctacctgg tctggaaaga gctgggaggc ttcacagaga aggctgtggt
      301 tcccctgggc ctctacactg ggcagctggc cctgaactgg gcatggcccc ccatcttctt
      361 tggtgcccga caaatgggct gggccttggt ggatctcctg ctggtcagtg gggcggcggc
      421 ngccactacc gtggcctggt accaggtgag cccgctggcc gcccgcctgc tctaccccta
      481 cctggcctgg ctggccttcg cgaccacact caactactgc gtatggcggg acaaccatgg
      541 ctggcatggg ggacggcggc tgccagagtg agtgcccggc ccaccaggga ctgcagctgc
      601 accagcaggt gccatcacgc ttgtgatgtg gtggccgtca cgctttcatg accactgggc
      661 ctgctagtct gtcagggcct tggcccaggg gtcagcagag cttcagaggt tgccccacct
      721 gagcccccac ccgggagcag tgtcctgtgc tttctgcatg cttagagcat gttcttggaa
      781 catggaattt tataagctga ataaagtttt tgacttcctt t
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  






    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U12421. Human mitochondri...[gi:529945] Links  


LOCUS       HSU12421                4258 bp    DNA     linear   PRI 14-DEC-1995
DEFINITION  Human mitochondrial benzodiazepine receptor (MBR) gene, complete
            cds.
ACCESSION   U12421
VERSION     U12421.1  GI:529945
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 2080 to 3683)
  AUTHORS   Yakovlev,A.G., Ruffo,M., Jurka,J. and Krueger,K.E.
  TITLE     Comparison of repetitive elements in the third intron of human and
            rodent mitochondrial benzodiazepine receptor-encoding genes
  JOURNAL   Gene 155 (2), 201-205 (1995)
  MEDLINE   95237610
   PUBMED   7721091
REFERENCE   2  (bases 1 to 4258)
  AUTHORS   Krueger,K.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-JUL-1994) Karl E. Krueger, Dept. of Cell Biology,
            Georgetown University School of Medicine, Washington, DC 20007, USA
FEATURES             Location/Qualifiers
     source          1..4258
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="22q13.3"
     intron          <1..99
                     /number=1
                     /evidence=experimental
     exon            100..310
                     /product="mitochondrial benzodiazepine receptor"
                     /number=2
                     /evidence=experimental
     5'UTR           100..128
     gene            129..3872
                     /gene="MBR"
     CDS             join(129..310,1941..2079,3684..3872)
                     /gene="MBR"
                     /codon_start=1
                     /evidence=experimental
                     /product="mitochondrial benzodiazepine receptor"
                     /protein_id="AAA83252.1"
                     /db_xref="GI:529946"
                     /translation="MAPPWVPAMGFTLAPSLGCFVGSRFVHGEGLRWYAGLQKPSWHP
                     PHWVLGPVWGTLYSAMGYGSYLVWKELGGFTEKAVVPLGLYTGQLALNWAWPPIFFGA
                     RQMGWALVDLLLVSGAAAATTVAWYQVSPLAARLLYPYLAWLAFTTTLNYCVWRDNHG
                     WRGGRRLPE"
     intron          311..1940
                     /gene="MBR"
                     /number=2
                     /evidence=experimental
     exon            1941..2079
                     /gene="MBR"
                     /product="mitochondrial benzodiazepine receptor"
                     /number=3
                     /evidence=experimental
     intron          2080..3683
                     /gene="MBR"
                     /number=3
                     /evidence=experimental
     repeat_region   2401..3306
                     /note="4 Alus"
                     /rpt_family="Alu"
     repeat_region   3497..3581
                     /rpt_family="MIR"
     exon            3684..4122
                     /product="mitochondrial benzodiazepine receptor"
                     /number=4
                     /evidence=experimental
     3'UTR           3873..4122
BASE COUNT      794 a   1246 c   1221 g    997 t
ORIGIN      
        1 gatctgtggg tgacaggcct tccggggatg ctggaggaga cacgggcctg accccatggc
       61 ctcaggaatg ccctcacgca gccctgtctt ctctttcaga gctcccctga acagcagctg
      121 cagcagccat ggccccgccc tgggtgcccg ccatgggctt cacgctggcg cccagcctgg
      181 ggtgcttcgt gggctcccgc tttgtccacg gcgagggtct ccgctggtac gccggcctgc
      241 agaagccctc gtggcacccg ccccactggg tgctgggccc tgtctggggc acgctctact
      301 cagccatggg gtaggtgggc gtgcactggt cctggggata agcctggccc tttgcaaggg
      361 gaggccaggc caggacagag ggtcttctcc aggcgggcca tggaccatgg catggtttcc
      421 cctgccccat ttggcaagcc agggtgggga agctgtgggc ctgtcgattg cacaactgaa
      481 cttctcctcc tgggcccctg ggggatgggt cagatgctca ccccaccctg gcatccccgc
      541 catgttcaca gctccaggca gaggccggct tagtgtgtga aggttggtct tgggcagagg
      601 catggacgct aagattaaca acggcagcaa cagccccaag agcagctcct ttgttgaact
      661 cctgcccagt gccaggctcg tctcattcat actcaacgac ccccataata agaggcctcc
      721 caaggacaca gggcttgcaa gtgtggaccc cccacagccc cttcaggctc agtctggcct
      781 gagtcccctg ggaatagacc aggggaattc cacccggaaa ggacgtgctg ggaggggcag
      841 ggcgcctctt ggtggctgta gcggggacca cacccaggag gcagggaatt gactgccttc
      901 tgaaatgttc cctgcatcct ggttctgtgc cagcgcagtg cagggtgggg acgcaggtgc
      961 caggcactct agggggcgga tagcagaggc caggccaggt gcccggctct gatgggaata
     1021 aggccctgga acttgggtgt cacgtgggcc tcacccctcc tggcccaggc cctgcccagc
     1081 cagggtcctg cacttccctg gctccagtgt ccccaccctg cttggccttc ctacaccacc
     1141 cagtcccagc ctctcttcgg gggaaatttt ggggctccct gtggctctgg ataaggcgaa
     1201 cccctcatta gtaggtgaca gccggggctc tggcctcgtg accagacttt atccactgat
     1261 ggcctgccaa gagatctcag tcacttcacc gcctcagtct cctcactgta aagggggtgc
     1321 tgattactgc ctgccttaat ggcatcattc atgagactga aatgagtgaa tgcttgggaa
     1381 gtggtagggc ctggctcctg tgagtgctgg agggcatcgg ctgctgttac tggtggtgtt
     1441 atcgttgccc tgtgttaaca gctcccatga tcaggcctcc ccctggacct caccctgcag
     1501 tcacagactg actcagggtc tcacatctgc ctcaaagcct ttgcatgtcc tgtttcctcc
     1561 cctggaaaac tcctattcat catttaaggc ccaggccaaa tgtcccatcc tctatgcagt
     1621 gccctggcta ggttaagagc ctcctctcta cctattcaca ccatgtgctt ccctttcata
     1681 gcagtgacag tgataataat ggcaggtatt tactgagtgt cctaaatgct gggtatcgca
     1741 taagcactgc caatgtgcta actagaatct ctacagcaac cctatgaaaa atgaaggccc
     1801 agacaggtca aggcaacttc cctgagatca cacagcagtc agtgtcaggt cacgtatgaa
     1861 ccatcactgt gccactcgga ggtgggcaac gctcctggcc ttgttcctaa tggtgctctg
     1921 aactgcggcc tctgtttcag gtacggctcc tacctggtct ggaaagagct gggaggcttc
     1981 acagagaagg ctgtggttcc cctgggcctc tacactgggc agctggccct gaactgggca
     2041 tggcccccca tcttctttgg tgcccgacaa atgggctggg taagtgtggc cacagcatgt
     2101 gtccctgatc cctggatccg acccttggag gacgtggggc atcacatatg acactgggtc
     2161 agtgtctata ggcggggcca ggggagacaa aaggccatgt ctctctagct gtaacagccc
     2221 acaccctgga gcctcccctc tactgactcc tccggtgagg gaagctatta aagcagaagg
     2281 ggttgcaggg gtgggtttgg gggccactgt gtaggaaaac ccacacaagc ctggtactgt
     2341 gtgggcaggg tcactggccc ctctacataa cagcttcttc tttttttttt ttgagacaga
     2401 gtctcactct ctcacccagg ctggagtgca gtggtgagat ctcggcccac tgcaacctcc
     2461 acctcctggg ttcaagtgat tcccttgcct cagcctcccg agtagctggg actacggtgt
     2521 gcaccaccac tcctggctaa tttttgtatt tgtatttgta tttttttagt agagatgggg
     2581 tttcactatg ttggccaggc tggtcttgaa ctcctgacct caggtgatct acccgccttg
     2641 gcctctcaaa gtgctgggat tacaggtgta agccaccgcg cccggctgac tgcttctttt
     2701 ttaaagtttt aaactttttt aagagatgaa gactcgctgt gttgcccagg gcagactcga
     2761 cttcctgggc tcaaacgatt gattctcccg cctttacctc ttgagtagct gggactccag
     2821 gtcacgtcac tgtgcccggc agcctttttt ttttggagac agagtcttac ctgttgccca
     2881 ggctggagtg cagtggcatg atctcggctc actgcagact ccacctccca ggttcaagtg
     2941 attctcctgt ctcagcctcc cgagtagctg ggattacagg catgcagcac catgtccagc
     3001 taattttctg tatttttttt tttttgagac ggagtcgctc tgttgcccag gctggagtgc
     3061 agtggcgtga tctcagctca ctgcaacctc cgtctcccag gttcaagcaa ttctcctgcc
     3121 tcagcctcct gagtagctgg gactacaggc acgtgctaca cgcctggcta atttttttat
     3181 ttttagtaga gacggggttt taccatattg gtcaggttgg tcttgaactc ctgacctgca
     3241 ggtgatccac ccgtctcagc ctcccaaagt gctgggattc acaggcgtga gccactgcac
     3301 ctggacaaat agcaagtttt tgtttgggcc tagtagtaat gctgggaggt cagcatcgct
     3361 ctcagtggag gctctaagct caggggaagg acgtgcaaga cctcctaagc caccaccgca
     3421 cccttgcaac aaaccgagtt cacttcaggc atgtcatgtg caccccacac aaggcatggg
     3481 tcaggtggca tactgttccc attttacaga tgaggaaact gaggctgcga tgggggaggg
     3541 gcttggccag gtcactcaag ggtggagtgg gggtgagtga ggctcctgac tcccaaatcc
     3601 agtgggagtt gggcagtggg acaggcactt gggtgaacgc ggtgcctcag gcctccccat
     3661 cctccgtccc ccaatctctg caggccttgg tggatctcct gctggtcagt ggggcggcgg
     3721 cagccactac cgtggcctgg taccaggtga gcccgctggc cgcccgcctg ctctacccct
     3781 acctggcctg gctggccttc acgaccacac tcaactactg cgtatggcgg gacaaccatg
     3841 gctggcgtgg gggacggcgg ctgccagagt gagtgcccgg cccaccaggg actgcagctg
     3901 caccagcagg tgccatcacg cttgtgatgt ggtggccgtc acgctttcat gaccactggg
     3961 cctgctagtc tgtcagggcc ttggcccagg ggtcagcaga gcttcagagg tggccccact
     4021 gagcccccac ccgggagcag tgtcctgtgc tttctgcatg cttagagcat gttcttggaa
     4081 catggaattt tataagctga ataaagtttt tgacttcctt taccatggcc tttttgcttg
     4141 ggtgggaccc ctggccacag gaagaggggg agctggggct gcactggagc tcgtctctgc
     4201 aggatctgcc tgggctgcct tctccagaca gggctggatc cagctggggc tgccccac
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  





&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF039704. Homo sapiens lyso...[gi:4063840] Links  


LOCUS       AF039704                7947 bp    DNA     linear   PRI 29-DEC-1998
DEFINITION  Homo sapiens lysosomal pepstatin insensitive protease (CLN2) gene,
            complete cds.
ACCESSION   AF039704
VERSION     AF039704.1  GI:4063840
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 7947)
  AUTHORS   Sleat,D.E., Donnelly,R.J., Lackland,H., Liu,C.G., Sohar,I.,
            Pullarkat,R.K. and Lobel,P.
  TITLE     Association of mutations in a lysosomal protein with classical
            late-infantile neuronal ceroid lipofuscinosis
  JOURNAL   Science 277 (5333), 1802-1805 (1997)
  MEDLINE   97442529
   PUBMED   9295267
REFERENCE   2  (bases 1 to 7947)
  AUTHORS   Liu,C.G., Sleat,D.E., Donnelly,R.J. and Lobel,P.
  TITLE     Structural organization and sequence of CLN2, the defective gene in
            classical late infantile neuronal ceroid lipofuscinosis
  JOURNAL   Genomics 50 (2), 206-212 (1998)
  MEDLINE   98317534
   PUBMED   9653647
REFERENCE   3  (bases 1 to 7947)
  AUTHORS   Liu,C.G., Sleat,D.E., Donnelly,R.J. and Lobel,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-DEC-1997) CABM, UMDNJ, 679 Hoes Lane, Piscataway, NJ
            08854, USA
REFERENCE   4  (bases 1 to 7947)
  AUTHORS   Liu,C.G., Sleat,D.E., Donnelly,R.J. and Lobel,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-DEC-1998) CABM, UMDNJ, 679 Hoes Lane, Piscataway, NJ
            08854, USA
  REMARK    Sequence update by submitter
COMMENT     On Dec 29, 1998 this sequence version replaced gi:3282687.
FEATURES             Location/Qualifiers
     source          1..7947
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="11"
                     /map="11p15"
                     /tissue_type="placenta"
                     /clone_lib="Clontech genomic library, Cat number HL1067j"
     gene            1..7947
                     /gene="CLN2"
     misc_feature    1..1297
                     /gene="CLN2"
                     /note="5'-flanking region and putative promoter elements"
     misc_feature    26..940
                     /gene="CLN2"
                     /note="similar to last intron in Homo sapiens
                     anti-thrombin III"
     mRNA            join(1298..1343,1460..1531,1811..1950,2935..3085,
                     3283..3410,3557..3735,3851..4049,4206..4394,4637..4706,
                     5151..5271,5384..5542,5721..5846,6026..7947)
                     /gene="CLN2"
                     /product="lysosomal pepstatin insensitive protease"
     CDS             join(1327..1343,1460..1531,1811..1950,2935..3085,
                     3283..3410,3557..3735,3851..4049,4206..4394,4637..4706,
                     5151..5271,5384..5542,5721..5846,6026..6166)
                     /gene="CLN2"
                     /note="deficient in late-infantile neuronal ceroid
                     lipofuscinosis"
                     /codon_start=1
                     /product="lysosomal pepstatin insensitive protease"
                     /protein_id="AAC98480.1"
                     /db_xref="GI:4063841"
                     /translation="MGLQACLLGLFALILSGKCSYSPEPDQRRTLPPGWVSLGRADPE
                     EELSLTFALRQQNVERLSELVQAVSDPSSPQYGKYLTLENVADLVRPSPLTLHTVQKW
                     LLAAGAQKCHSVITQDFLTCWLSIRQAELLLPGAEFHHYVGGPTETHVVRSPHPYQLP
                     QALAPHVDFVGGLHRFPPTSSLRQRPEPQVTGTVGLHLGVTPSVIRKRYNLTSQDVGS
                     GTSNNSQACAQFLEQYFHDSDLAQFMRLFGGNFAHQASVARVVGQQGRGRAGIEASLD
                     VQYLMSAGANISTWVYSSPGRHEGQEPFLQWLMLLSNESALPHVHTVSYGDDEDSLSS
                     AYIQRVNTELMKAAARGLTLLFASGDSGAGCWSVSGRHQFRPTFPASSPYVTTVGGTS
                     FQEPFLITNEIVDYISGGGFSNVFPRPSYQEEAVTKFLSSSPHLPPSSYFNASGRAYP
                     DVAALSDGYWVVSNRVPIPWVSGTSASTPVFGGILSLINEHRILSGRPPLGFLNPRLY
                     QQHGAGLFDVTRGCHESCLDEEVEGQGFCSGPGWDPVTGWGTPNFPALLKTLLNP"
BASE COUNT     1949 a   2115 c   1787 g   2096 t
ORIGIN      
        1 ggatccacgc ttagtgggaa actggaggga tcaatggaca gagagaagtc agattaatag
       61 gagtgaggaa aaacaatggc ttgaaaaaac agaaggttat agtctaagaa agcattctat
      121 aattataatt tcaaaggtag agcagttttg agagatagca agcccaatgc agccatggga
      181 ctggtttgct taagtgaagt gaaggtaatg ctcactgggt ttaatgagat gaagaaatgg
      241 aaggttaggg tgttgaatgg gtcttttacc tagatatttt gatcatctag gtcaaatgat
      301 cgcaggagtt gggatggaca gaaaggtgaa gtgccataat cttcagtgtg ggaatataat
      361 ctataggtta atggaagcca gtggcctgta ggacaacagc aggagtagag ggatctatct
      421 aaaaagtatt gaggtcagaa caagggattt cacatgagta ttggattaat ggttttggaa
      481 gttaaaatga cactgctgac accatcttcc atatggattt ttggggggaa aaaaaggaag
      541 actggaaggg aattgaatcc tcagggtttt aactctcatt gaaaatcctc ttgtaatgac
      601 cccagttgga agtagtaaac tcagacctcc ttgtgggctt tgagttacca ttagaagaag
      661 caaaggtccg tttccatttt atccacctga gcccactaaa caagatgcaa gattcctcca
      721 tggcccttag aacaaagttg atttccttga aaaccttggg aacccataat aaacaggacc
      781 ccagctgatt gtagatctcc tgggaagcta cttaaaactg gaaactgatt tgaataaaag
      841 gatcctggtc ttgcaaaagc tttactctgt tcataaatgt agtcgtaaaa tgaagacaaa
      901 attgataata tggggacata aaggataaaa gaggaaaaaa attttttcac tgcaatcttt
      961 cgaggcagat cacttttata taagtctcca tattgtaaat gcagaaacct aggctctgag
     1021 aggttatttt gctgatgtca tacagctagg aagaacccca gtctgtgggt ctctacagcc
     1081 cactcacttg tggtgattcc caggggacca ggctcaggac gcaggtgggg agccctgggc
     1141 ttactcagtg cttgactggc cagaggggag aatccgggtg gcggccccac ccttgcctag
     1201 catttgggac cacccatgca agggaggagc cagtaccgtc actagttact aggcagaggg
     1261 gtagtggtgg tggaatatag agctcatgtg atccgtcaca tgacagcaga tccgcggaag
     1321 ggcagaatgg gactccaagc ctggtgagaa attgagaggg ctcgggagaa agggatcacg
     1381 ttggagggag cacacattgg gagggtggga acacaaggac aacgtagctc ccactgaaac
     1441 ccacctgctg cccctacagc ctcctagggc tctttgccct catcctctct ggcaaatgca
     1501 gttacagccc ggagcccgac cagcggagga cgtgagttga cttagcacag actgccccct
     1561 tccccatacc ctgttctgcc tcccactgtc ctggtcctag cttctctcac ccccgtgcca
     1621 gctcctagta cgcactccat agcttcatct ctatgttcct agcctcagtc cccttcattc
     1681 acctcatgtg atctgtatct gctcctagga tcctccccaa ggtctcagct cctaatctgg
     1741 aaccttccat gaccaatatt ttccatctcc accctaacca aagccatgtc cctgacccct
     1801 gaccctacag gctgccccca ggctgggtgt ccctgggccg tgcggaccct gaggaagagc
     1861 tgagtctcac ctttgccctg agacagcaga atgtggaaag actctcggag ctggtgcagg
     1921 ctgtgtcgga tcccagctct cctcaatacg gtgccttttg ggactgagga caggatgtgg
     1981 gatgcggtgg agggacacag ggctgggttg ggcatggaat ggcgatcatg tcagagcctg
     2041 ccaagacact tgtgttcctc aggctagaaa cctaaaaggg gatgtggttc agtatacagg
     2101 ctttatgatc aaatacgaac tcaaattctg actctgctac ttactagcta ccaggcatct
     2161 agtacaactt acgttctttc ccctaccctc aaatccattc taatttcctc ttttgttcac
     2221 atactaaggc tgccagttct aactcccaaa gagcctttgg attaatctcc ttcttgccgt
     2281 cttttgttac gaccaactcc gttatttatt cccactcctg gactactgcc cagctcccca
     2341 gctgatctgc agtctcctcc ctccacccca cccctcactg tactccccca ctctgcttca
     2401 aaacagtctc tccaacagtc aaaatggatt ggtctctctg tactcctaca gcaggggagt
     2461 gtgtgcttgt gaactgcaga ggctggggac ggggcagtgt gacatagtgt gcatggcagg
     2521 aaacagtgac tcaccattcg tgttaagctt aaaatcaaaa agtatgcaaa tttgagtaaa
     2581 ctgaataact ggtgccgaaa gatttgaaaa catttaccgg acataagctt aaaattaaaa
     2641 tggaaaattt atatgtatta actcattcat tttcatgatc atggcaatac atgctgtggt
     2701 gcgacatttg caacttactg gcctttaggg taaatccata ttattggcca tggcatttca
     2761 agtctctcca gccttttctt ctctgcttcc caagtacacc tacacctgca cacatagccc
     2821 tcctttaccc tgttccaggc tttgggaagt cctgatgtct catagttgag gtccaaaagg
     2881 gggagtttgg gaaagcaatg aatgagggca agtgcctctt ctgaatccct gcaggaaaat
     2941 acctgaccct agagaatgtg gctgatctgg tgaggccatc cccactgacc ctccacacgg
     3001 tgcaaaaatg gctcttggca gccggagccc agaagtgcca ttctgtgatc acacaggact
     3061 ttctgacttg ctggctgagc atccggtgag aggaaatgat tgctccatgg agggcaccag
     3121 tcatcccatc agtgagatgg atgggaggga gttgagagct tgctggggct tgtgggtggg
     3181 agctaatgca tggggagaca gtgactgact gcccagggat gctcagaggt agcttcttct
     3241 gttccgtttt gagctttctg acctctgttc tctgacctcc agacaagcag agctgctgct
     3301 ccctggggct gagtttcatc actatgtggg aggacctacg gaaacccatg ttgtaaggtc
     3361 cccacatccc taccagcttc cacaggcctt ggccccccat gtggactttg gtaacaccta
     3421 tggggtgaat gggggatggg gcacacagat ccaggggctg aggaagttta gatgccattg
     3481 gggactgggg gtggggtggt tgtaaggtgg gcattacagt ctataagatc tcctcaagcc
     3541 tgacttctcc ctacagtggg gggactgcac cgttttcccc caacatcatc cctgaggcaa
     3601 cgtcctgagc cgcaggtgac agggactgta ggcctgcatc tgggggtaac cccctctgtg
     3661 atccgtaagc gatacaactt gacctcacaa gacgtgggct ctggcaccag caataacagc
     3721 caagcctgtg cccaggtgag ccaagcaaag agccccaggg tcctcatagc ctccccacag
     3781 tgtcctcaat tccttaccac cctgggactc accctcggac ccacgatctc tgctctgact
     3841 ccctccatag ttcctggagc agtatttcca tgactcagac ctggctcagt tcatgcgcct
     3901 cttcggtggc aactttgcac atcaggcatc agtagcccgt gtggttggac aacagggccg
     3961 gggccgggcc gggattgagg ccagtctaga tgtgcagtac ctgatgagtg ctggtgccaa
     4021 catctccacc tgggtctaca gtagccctgg tactaccaag aggactggac agtggggaag
     4081 ggggtgggag atgggtgttg atccctgctc cctcaaggga atgctataag ctggagagag
     4141 atcctgacaa cccccagtga ctatctttgt gcccatccct caaaaaaaaa aaaaaaaaaa
     4201 tccaggccgg catgagggac aggagccctt cctgcagtgg ctcatgctgc tcagtaatga
     4261 gtcagccctg ccacatgtgc atactgtgag ctatggagat gatgaggact ccctcagcag
     4321 cgcctacatc cagcgggtca acactgagct catgaaggct gccgctcggg gtctcaccct
     4381 gctcttcgcc tcaggtgacc tcctacccta aacttagaca acgcttacac ctctgcacgc
     4441 ctgggtgctt tgactccaca gtgatccctg agcctggtct ctgactcata atctgacact
     4501 cagaccttcc agtagggacc actgacctga cctctacact ctgacctcct acagtaacaa
     4561 atttcccctc tgacatccga acccacatac taagccctaa ccaattaata tgaatgctac
     4621 acttggtctc tctcaggtga cagtggggcc gggtgttggt ctgtctctgg aagacaccag
     4681 ttccgcccta ccttccctgc ctccaggtaa gtactctagc ctaccactca ggtataacca
     4741 ccacctttca cttgtgatct catgatgtag aacctttgtc ttgaccccac catgtgctcc
     4801 tgtggttcag ccttaagctt tgcctgccct ggttgctgta ctcctgtctc ttcttcctgc
     4861 aggtcccagg ccccaaatct cttgtgtggg atacagggtc atagctgttc cttttcgtca
     4921 gttcccaggc atttgagtgg aagatttggt gggtgttctg tacagaaaag tgtgcacagt
     4981 cacctcaggc catgccttga aggctcaaaa tctcttagtc aatcccatat acatgcttcc
     5041 ccacagagtc tagttcctcc agcaagacct gggctatact cacccctccc cacatatctt
     5101 ggaggtcccc ttgggtcccc tactatccaa atgctgtctt ctcccctcag cccctatgtc
     5161 accacagtgg gaggcacatc cttccaggaa cctttcctca tcacaaatga aattgttgac
     5221 tatatcagtg gtggtggctt cagcaatgtg ttcccacggc cttcatacca ggtacgtgtg
     5281 tttgtgtgga tggatgcagg gtaagagtga ggatggggga tcctcagttc agctgactgc
     5341 tgggcaggcc acatgccaat actcactcaa aaatgccttt caggaggaag ctgtaacgaa
     5401 gttcctgagc tctagccccc acctgccacc atccagttac ttcaatgcca gtggccgtgc
     5461 ctacccagat gtggctgcac tttctgatgg ctactgggtg gtcagcaaca gagtgcccat
     5521 tccatgggtg tccggaacct cggtgagaat cagcccatct ccaaactctc actcaggaac
     5581 tacccttacc ccctaacacc ttgaacacct tgcacctaga acccctgact ccttagagat
     5641 gtctgatact ttaaagcatc actcccaaaa agtccaatca ctcagaaccc ctgaccttac
     5701 ttgcaccttc actcttgtag gcctctactc cagtgtttgg ggggatccta tccttgatca
     5761 atgagcacag gatccttagt ggccgccccc ctcttggctt tctcaaccca aggctctacc
     5821 agcagcatgg ggcaggactc tttgatgtaa gtatggaagg gaagggtgtg gacgttttca
     5881 aacaactatg gggagtgcta agggggactt gggggcagtt agggtggtgt ggaatagcct
     5941 ttgaaatgtg agtacagggt gaggagatat actctttaag tactggtact agtaggccca
     6001 gatctgatgc cagcctcctc cctaggtaac ccgtggctgc catgagtcct gtctggatga
     6061 agaggtagag ggccagggtt tctgctctgg tcctggctgg gatcctgtaa caggctgggg
     6121 aacacccaac ttcccagctt tgctgaagac tctactcaac ccctgaccct ttcctatcag
     6181 gagagatggc ttgtcccctg ccctgaagct ggcagttcag tcccttattc tgccctgttg
     6241 gaagccctgc tgaaccctca actattgact gctgcagaca gcttatctcc ctaaccctga
     6301 aatgcggtga gcttgacttg actcccaacc ctaccatgct ccatcatact caggtctccc
     6361 tactcctgcc ttagattcct caataagatg ctgtaactag cattttttga atgcctctcc
     6421 ctccgcatct catctttctc ttttcaatca ggcttttcca aagggttgta tacagactct
     6481 gtgcactatt tcacttgata ttcattcccc aattcactgc aaggagacct ctactgtcac
     6541 cgtttactct ttcctaccct gacatccaga aacaatggcc tccagtgcat acttctcaat
     6601 ctttgcttta tggcctttcc atcatagttg cccactccct ctccttactt agcttccagg
     6661 tcttaacttc tctgactact cttgtcttcc tctctcatca atttctgctt cttcatggaa
     6721 tgctgacctt cattgctcca tttgtagatt tttgctcttc tcagtttact cattgtcccc
     6781 tggaacaaat cactgacatc tacaaccatt accatctcac taaataagac tttctatcca
     6841 ataatgattg atacctcaaa tgtaagatgc gtgatactca acatttcatc gtccaccttc
     6901 ccaaccccaa acaattccat ctcgtttctt cttggtaaat gatgctatgc tttttccaac
     6961 caagccagaa acctgtgtca tcttttcacc ccaccttcaa tcaacaagtc ctcaatcaac
     7021 aagtcctact gactgcacat cttaaatata tctttatcag tccacaagtc cttccaatta
     7081 tatttcccaa gtatatctag aacttatcca cttatatccc cactgctact accttagttt
     7141 agggctatat tctcttgaaa aaaagtgtcc ttacttcctg ccaatcccca agtcatcttc
     7201 cagagtaaaa tgcaaatccc atcaggccac ttggatgaaa acccttcaag gattactgga
     7261 tagaattcag gctttcccct ccagccccca atcatagctc acaaaccttc cttgctattt
     7321 gttcttaagt aaaaaatcat ttttcctcct ccctccccaa accccaagga actctcactc
     7381 ttgctcaagc tgttccgtcc ccttaccacc cctgatacaa ctgccaggtt aatttccaga
     7441 attcttgcaa gactcagttc agaagtcacc ttctttcgtg aatgttttga ttccctgagg
     7501 ctactttatt ttggtatggc tgaaaaatcc tagattttct aaacaaaacc tgtttgaatc
     7561 ttggttctga tatggactag gagagagact gggtcaagta agcttatctc cctgaggctg
     7621 tttcctcgtc tgttaagtgt gaatatcaat acctgccttt cataatcacc agggaataaa
     7681 gtggaataat gttgataaca gtgcttggca cctggaagta ggtggcagat gttaacgccc
     7741 ttcctccctt gcactgcgcc ccctgtgcct acctctagca ttgtaacgac cacatagtat
     7801 tgaaatggcc agtttacttg tctgccttcc tttccaagac cgttggtgcc tagaggacta
     7861 gaatcgtgtc ctatttaact ttgtgttccc aggtcctagc tcaggagttg gcaaataaga
     7921 attaaatgtc tgctacaccg aaacaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC004998. Homo sapiens, Sim...[gi:13436457] Links  


LOCUS       BC004998                4190 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, Similar to membrane bound C2 domain containing
            protein, clone MGC:4422 IMAGE:2958094, mRNA, complete cds.
ACCESSION   BC004998
VERSION     BC004998.1  GI:13436457
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4190)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
            Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
            Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
            Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
            Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
            George Yang, Scott Zuyderduyn, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 9 Row: j Column: 12.
FEATURES             Location/Qualifiers
     source          1..4190
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:4422 IMAGE:2958094"
                     /tissue_type="Kidney, renal cell adenocarcinoma"
                     /clone_lib="NIH_MGC_14"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             51..3365
                     /codon_start=1
                     /product="Similar to membrane bound C2 domain containing
                     protein"
                     /protein_id="AAH04998.1"
                     /db_xref="GI:13436458"
                     /translation="MERSPGEGPSPSPMDQPSAPSDPTDQPPAAHAKPDPGSGGQPAG
                     PGAAGEALAVLTSFGRRLLVLIPVYLAGAVGLSVGFVLFGLALYLGWRRVRDEKERSL
                     RAARQLLDDEEQLTAKTLYMSHRELPAWVSFPDVEKAEWLNKIVAQVWPFLGQYMEKL
                     LAETVAPAVRGSNPHLQTFTFTRVELGEKPLRIIGVKVHPGQRKEQILLDLNISYVGD
                     VQIDVEVKKYFCKAGVKGMQLHGVLRVILEPLIGDLPFVGAVSMFFIRRPTLDINWTG
                     MTNLLDIPGLSSLSDTMIMDSIAAFLVLPNRLLVPLVPDLQDVAQLRSPLPRGIIRIH
                     LLAARGLSSKDKYVKGLIEGKSDPYALVRLGTQTFCSRVIDEELNPQWGETYEVMVHE
                     VPGQEIEVEVFDKDPDKDDFLGRMKLDVGKVLQASVLDDWFPLQGGQGQVHLRLEWLS
                     LLSDAEKLEQVLQWNWGVSSRPDPPSAAILVVYLDRAQDLPLKKGNKEPNPMVQLSIQ
                     DVTQESKAVYSTNCPVWEEAFRFFLQDPQSQELDVQVKDDSRALTLGALTLPLARLLT
                     APELILDQWFQLSSSGPNSRLYMKLVMRILYLDSSEICFPTVPGCPGAWDVDSENPQR
                     GSSVDAPPRPCHTTPDSQFGTEHVLRIHVLEAQDLIAKDRFLGGLVKGKSDPYVKLKL
                     AGRSFRSHVVREDLNPRWNEVFEVIVTSVPGQELEVEVFDKDLDKDDFLGRCKVRLTT
                     VLNSGFLDEWLTLEDVPSGRLHLRLERLTPRPTAAELEEVLQVNSLIQTQKSAELAAA
                     LLSIYMERAEDLPLRKGTKHLSPYATLTVGDSSHKTKTISQTSAPVWDESASFLIRKP
                     HTESLELQVRGEGTGVLGSLSLPLSELLVADQLCLDRWFTLSSGQGQVLLRAQLGILV
                     SQHSGVEAHSHSYSHSSSSLSEEPELSGGPPHITSSAPELRQRLTHVDSPLEAPAGPL
                     GQVKLTLWYYSEERKLVSIVHGCRSLRQNGRDPPDPYVSLLLLPDKNRGTKRRTSQKK
                     RTLSPEFNERFEWELPLDEAQRRKLDVSVKSNSSFMSRERELLGKVQLDLAETDLSQG
                     VARWYDLMDNKDKGSS"
BASE COUNT      933 a   1136 c   1154 g    967 t
ORIGIN      
        1 caacctccag ccagtccctg ggtcgggcgg atcctcccag aggtggcaca atggagcgat
       61 ctccaggaga gggccccagc cccagcccca tggaccagcc ctctgctccc tccgacccca
      121 ctgaccagcc ccccgctgct cacgcaaagc cagacccagg ttctgggggc caacctgctg
      181 gccctggcgc ggcgggtgag gccctggcgg tgctgacttc attcgggagg cggttgctgg
      241 tgctgatacc tgtgtatttg gccggggcag tgggactcag cgtgggtttc gtgctcttcg
      301 gcctcgccct ctacctgggc tggcgccggg tccgcgacga gaaagaacgg agccttcgag
      361 cagcgaggca gctactggac gacgaggagc agctcactgc gaaaactctc tatatgagtc
      421 atcgagagct acctgcctgg gtcagcttcc cagacgtgga aaaggctgaa tggctcaata
      481 agattgtggc ccaggtctgg cccttcctgg gccagtatat ggagaagctt ctggctgaaa
      541 ctgtggctcc ggctgttagg ggatctaacc cccatctgca aacatttaca tttacacgag
      601 tggaactggg tgaaaagcca ttgcgcatca ttggagtcaa ggttcaccca ggtcagagaa
      661 aagagcagat cctgctggac ttgaacatca gctatgtagg tgatgtgcag attgatgtgg
      721 aagtgaagaa atatttttgc aaagcaggag tcaagggcat gcagctacat ggcgttttgc
      781 gggtgatact ggagccactc attggggacc ttcccttcgt gggggctgtg tcaatgttct
      841 tcatccgacg cccgacccta gacatcaact ggacagggat gaccaacctg ctggatatcc
      901 caggacttag ctcactctct gacaccatga tcatggactc cattgctgcc ttcctcgtgt
      961 tgcccaaccg attactggtg ccccttgtgc ctgaccttca agatgtggct cagttgcgtt
     1021 cccctctgcc caggggcatt attcgaattc acctgctggc tgctcgaggg ctgagttcca
     1081 aggacaaata tgtgaagggc ctgattgagg gcaagtcaga cccatatgca cttgtgcgtt
     1141 tgggtaccca gacattctgc agtcgtgtca ttgatgaaga actcaaccca cagtggggag
     1201 agacttatga ggtgatggta cacgaggtcc cagggcagga gattgaagtg gaggtgttcg
     1261 acaaggatcc agataaagat gactttctgg gcagaatgaa gctggatgta gggaaggtgt
     1321 tacaggctag cgttctggat gattggttcc ctctacaagg tgggcaaggc caagttcact
     1381 tgaggctaga atggctgtca cttttgtcag atgcagagaa actggagcag gttctacagt
     1441 ggaattgggg agtctcctct cgaccagatc ccccgtcagc tgccatctta gttgtctacc
     1501 tggatcgggc ccaggatctt cctctgaaga aggggaacaa ggaacccaac cctatggtac
     1561 aactgtcaat tcaggatgtg actcaggaga gcaaggctgt ctacagtacc aactgcccag
     1621 tgtgggagga agcgttccgg ttcttcctac aagaccctca aagccaggag ctcgatgtgc
     1681 aagtgaagga tgattccagg gccctgactt taggagcact gacgctgcct ctggcccgcc
     1741 tgctgactgc cccagaactc atcctggacc agtggttcca gctcagcagc tctggtccaa
     1801 actccagact ctatatgaaa ctagtcatga ggatcctgta cttggattca tcagaaatat
     1861 gcttccccac ggtgcctggt tgtcctggtg cttgggacgt ggacagtgag aatccccaga
     1921 gaggcagcag tgtggatgcc ccacctcgac cctgtcacac gactcctgat agccagtttg
     1981 ggactgagca tgtgcttcgg atccatgtat tagaggccca ggacctgatt gccaaagacc
     2041 gtttcttggg gggactggtg aagggcaagt cagaccccta tgtcaaacta aagttggcag
     2101 gacgaagctt ccggagccat gttgttcggg aagatctcaa tccccgctgg aatgaggttt
     2161 ttgaggtgat cgtcacatca gttccaggcc aagagctaga ggttgaagtc tttgacaagg
     2221 acttggacaa ggatgatttt ctgggcaggt gtaaagtgcg tctcaccaca gtcttaaaca
     2281 gtggcttcct tgatgagtgg ctgaccctgg aggatgtccc atctggccgc ctgcacttgc
     2341 gcctggagcg tctcaccccc cgtcccactg ctgctgagtt agaggaggtg ctgcaggtga
     2401 atagtttgat ccagactcag aagagtgcgg agctggctgc ggccctgcta tccatctata
     2461 tggagcgggc agaggacctc ccgctgcgaa aaggcaccaa gcacctcagc ccttatgcta
     2521 ctctcactgt gggagatagt tctcataaaa ccaagactat ttcgcaaact tcagcccctg
     2581 tctgggatga gagtgcctcc tttctcatca ggaaaccaca cactgagagc ctagagttgc
     2641 aggttcgggg tgagggcact ggcgtgctgg gctcattatc cctgcccctc tcagagctcc
     2701 tcgtggctga ccagctctgc ttggaccgct ggtttacact cagcagtggt caggggcagg
     2761 tgctactgag agcacagcta gggatcctgg tgtcccagca ctcgggagtg gaagctcata
     2821 gccacagcta cagccacagc tcctcatcgc tgagtgaaga accagagctc tcggggggac
     2881 cccctcacat cacctcctca gccccagagc tccggcagcg cctaacacat gttgacagtc
     2941 cccttgaggc tccagccggg cctctgggcc aggtgaaact gactctgtgg tactacagtg
     3001 aagaacgaaa gctggtcagc attgttcatg gttgccggtc ccttcgacag aatggacgtg
     3061 atcctcctga tccctatgtg tcactgttgc tactgccaga caagaaccga ggcaccaaga
     3121 ggaggacctc acagaagaag aggaccctga gtcctgaatt taatgaacgg tttgagtggg
     3181 aactccccct ggatgaggcc cagagacgaa agctggatgt ctctgtcaag tctaattcct
     3241 ccttcatgtc aagagagcgt gagctgctgg ggaaggtgca gctggaccta gctgagacag
     3301 acctttccca gggtgtagcc cggtggtatg acctgatgga caacaaggac aagggcagct
     3361 cctaggagct ggcgagtccc agcctgactg ctctgtcttc ctgccttcgt ctcgctccat
     3421 caccgcctca atgtgatgag cctaaagcta gggtccaagg gcagagcctg tgcccttcag
     3481 ccctttcacc taacaggccc atattcgggc ctttgcctga ccaaagagaa gaaccgtatg
     3541 ttccctttac tgcacggcct ttatccttct gggcccctgg ggcggggacc tgagctggct
     3601 gtttcctgct ttgcctgcac attgttctcc cttcctccca actcctcagg gccttctgta
     3661 tctgtgcctg gccagtggca gcactagcag tggtattagc ttatgccaaa tacagctttg
     3721 gaaggatctt tttttcttta actagatggt caccttcttc cctaccacac atgggtggga
     3781 aggtggacag gctaacctct ccagctgtga gcctcttaga ctactgcatg tagcaaatgt
     3841 tcagcagctc aggcccccat gtccagttct gtccccactg tcctcaaccc tgtcctgaaa
     3901 attctactgc tttgatggct ggggccagtc tcttgtcact ttggaaactg aggacgcgtg
     3961 gattctactc aagcctccaa gtagtggcat atcagtcttg gagctcctag ctggtgatac
     4021 ggagagggct ttggaggact tgggacagca gggccaattt ttttgcccaa gtgcctaggc
     4081 tgctaactca ctgactagaa cttaatctgg tactttacag ttttgcacca actctgccaa
     4141 gccactggat cttacattaa acatcatact caaaaaaaaa aaaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X00737. Human mRNA for pu...[gi:35564] Links  


LOCUS       HSPNP                   1418 bp    mRNA    linear   PRI 19-JAN-1995
DEFINITION  Human mRNA for purine nucleoside phosphorylase (PNP; EC 2.4.2.1).
ACCESSION   X00737 K02574
VERSION     X00737.1  GI:35564
KEYWORDS    phosphorylase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1418)
  AUTHORS   Williams,S.R., Goddard,J.M. and Martin,D.W. Jr.
  TITLE     Human purine nucleoside phosphorylase cDNA sequence and genomic
            clone characterization
  JOURNAL   Nucleic Acids Res. 12 (14), 5779-5787 (1984)
  MEDLINE   84272252
   PUBMED   6087295
COMMENT     Data kindly reviewed (30-JAN-1986) by S.R. Williams.
FEATURES             Location/Qualifiers
     source          1..1418
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             110..979
                     /note="PNP"
                     /codon_start=1
                     /protein_id="CAA25320.1"
                     /db_xref="GI:35565"
                     /db_xref="SWISS-PROT:P00491"
                     /translation="MENGYTYEDYKNTAEWLLSHTKHRPQVAIICGSGLGGLTDKLTQ
                     AQIFDYSEIPNFPRSTVPGHAGRLVFGFLNGRACVMMQGRFHMYEGYPLWKVTFPVRV
                     FHLLGVDTLVVTNAAGGLNPKFEVGDIMLIRDHINLPGFSGQNPLRGPNDERFGDRFP
                     AMSDAYDRTMRQRALSTWKQMGEQRELQEGTYVMVAGPSFETVAECRVLQKLGADAVG
                     MSTVPEVIVARHCGLRVFGFSLITNKVIMDYESLEKANHEEVLAAGKQAAQKLEQFVS
                     ILMASIPLPDKAS"
BASE COUNT      352 a    358 c    362 g    346 t
ORIGIN      
        1 aactgtgcga accagacccg gcagccttgc tcagttcagc atagcggagc ggatccgatc
       61 ggatcggagc acaccggagc aggctcatcg agaaggcgtc tgcgagacca tggagaacgg
      121 atacacctat gaagattata agaacactgc agaatggctt ctgtctcata ctaagcaccg
      181 acctcaagtt gcaataatct gtggttctgg attaggaggt ctgactgata aattaactca
      241 ggcccagatc tttgactaca gtgaaatccc caactttcct cgaagtacag tgccaggtca
      301 tgctggccga ctggtgtttg ggttcctgaa tggcagggcc tgtgtgatga tgcagggcag
      361 gttccacatg tatgaagggt acccactctg gaaggtgaca ttcccagtga gggttttcca
      421 ccttctgggt gtggacaccc tggtagtcac caatgcagca ggagggctga accccaagtt
      481 tgaggttgga gatatcatgc tgatccgtga ccatatcaac ctacctggtt tcagtggtca
      541 gaaccctctc agagggccca atgatgaaag gtttggagat cgtttccctg ccatgtctga
      601 tgcctacgac cggactatga ggcagagggc tctcagtacc tggaaacaaa tgggggagca
      661 acgtgagcta caggaaggca cctatgtgat ggtggcaggc cccagctttg agactgtggc
      721 agaatgtcgt gtgctgcaga agctgggagc agacgctgtt ggcatgagta cagtaccaga
      781 agttatcgtt gcacggcact gtggacttcg agtctttggc ttctcactca tcactaacaa
      841 ggtcatcatg gattatgaaa gcctggagaa ggccaaccat gaagaagtct tagcagctgg
      901 caaacaagct gcacagaaat tggaacagtt tgtctccatt cttatggcca gcattccact
      961 ccctgacaaa gccagttgac ctgccttgga gtcgtctggc atctcccaca caagacccaa
     1021 gtagctgcta ccttctttgg ccccttgctg gagtcatgtg cctctgtcct taggttgtag
     1081 cagaaaggaa aagattcctg tccttcacct ttcccacttt cttctaccag acccttctgg
     1141 tgccagatcc tcttctcaaa gctgggatta caggtgtgag catagtgaga ccttggcgct
     1201 acaaaataaa gctgttctca ttcctgttct ttcttacaca agagctggag cccgtgccct
     1261 accacacatc tgtggagatg cccaggattt gactcgggcc ttagaacttt gcatagcagc
     1321 tgctactagc tctttgagat aatacattcc gaggggctca gttctgcctt atctaaatca
     1381 ccagagacca aacaaggact aatccaatac ctcttgga
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M91196. Homo sapiens DNA-...[gi:2275152] Links  


LOCUS       HUMDNABP                1538 bp    mRNA    linear   PRI 23-JUL-1997
DEFINITION  Homo sapiens DNA-binding protein mRNA, complete cds.
ACCESSION   M91196
VERSION     M91196.1  GI:2275152
KEYWORDS    DNA-binding protein; ICSBP; interferon consensus sequence binding
            protein.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1538)
  AUTHORS   Weisz,A., Marx,P., Sharf,R., Appella,E., Driggers,P.H., Ozato,K.
            and Levi,B.Z.
  TITLE     Human interferon consensus sequence binding protein is a negative
            regulator of enhancer elements common to interferon-inducible genes
  JOURNAL   J. Biol. Chem. 267 (35), 25589-25596 (1992)
  MEDLINE   93094284
   PUBMED   1460054
REFERENCE   2  (bases 1 to 1538)
  AUTHORS   Levi,B.-Z.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-APR-1993) Dept. of Food Engineering & Biotechnology,
            Technion, Haifa 32,000, Israel
REFERENCE   3  (bases 1 to 1538)
  AUTHORS   Schmidt,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-JUL-1997) Innere Medizin mS. Haematologie/Onkologie,
            Virchow Klinikum der HU Berlin, Forschungshaus, Hs 37, R. 2.0314,
            Augustenburger Platz 1, Berlin 13353, Germany
  REMARK    Sequence update
COMMENT     On Jul 23, 1997 this sequence version replaced gi:181611.
FEATURES             Location/Qualifiers
     source          1..1538
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             48..1328
                     /function="involved in the transcription regulation of
                     interferon-inducible genes"
                     /codon_start=1
                     /product="DNA-binding protein"
                     /protein_id="AAB63813.1"
                     /db_xref="GI:2275153"
                     /translation="MCDRNGGRRLRQWLIEQIDSSMYPGLIWENEEKSMFRIPWKHAG
                     KQDYNQEVDASIFKAWAVFKGKFKEGDKAEPATWKTRLRCALNKSPDFEEVTDRSQLD
                     ISEPYKVYRIVPEEEQKCKLGVATAGCVNEVTEMECGRSEIDELIKEPSVDDYMGMIK
                     RSPSPPEACRSQLLPDWWAQQPSTGVPLVTGYTTYDAHHSAFSQMVISFYYGGKLVGQ
                     ATTTCPEGCRLSLSQPGLPGTKLYGPEGLELVRFPPADAIPSERQRQVTRKLFGHLER
                     GVLLHSSRQGVFVKRLCQGRVFCSGNAVVCKGRPNKLERDEVVQVFDTSQFFRELQQF
                     YNSQGRLPDGRVVLCFGEEFPDMAPLRSKLILVQIEQLYVRQLAEEAGKSCGAGSVMQ
                     APEEPPPDQVFRMFPDICASHQRSFFRENQQITV"
BASE COUNT      333 a    402 c    495 g    308 t
ORIGIN      
        1 atggatgggg gaaccgggcg gcgagacggc ggcaggacgg cggcaggatg tgtgaccgga
       61 atggtggtcg gcggcttcga cagtggctga tcgagcagat tgacagtagc atgtatccag
      121 gactgatttg ggagaatgag gagaagagca tgttccggat cccttggaaa cacgctggca
      181 agcaagatta taatcaggaa gtggatgcct ccatttttaa ggcctgggca gtttttaaag
      241 ggaagtttaa agaaggggac aaagctgaac cagccacttg gaagacgagg ttacgctgtg
      301 ctttgaataa gagcccagat tttgaggaag tgacggaccg gtcccaactg gacatttccg
      361 agccatacaa agtttaccga attgttcctg aggaagagca aaaatgcaaa ctaggcgtgg
      421 caactgctgg ctgcgtgaat gaagttacag agatggagtg cggtcgctct gaaatcgacg
      481 agctgatcaa ggagccttct gtggacgatt acatggggat gatcaaaagg agcccttccc
      541 cgccggaggc ctgtcggagt cagctccttc cagactggtg ggcgcagcag cccagcacag
      601 gcgtgccgct ggtgacgggg tacaccacct acgacgcgca ccattcagca ttctcccaga
      661 tggtgatcag cttctactat gggggcaagc tggtgggcca ggccaccacc acctgccccg
      721 agggctgccg cctgtccctg agccagcctg ggctgcccgg caccaagctg tatgggcccg
      781 agggcctgga gctggtgcgc ttcccgccgg ccgacgccat ccccagcgag cgacagaggc
      841 aggtgacgcg gaagctgttc gggcacctgg agcgcggggt gctgctgcac agcagccggc
      901 agggcgtgtt cgtcaagcgg ctgtgccagg gccgcgtgtt ctgcagcggc aacgccgtgg
      961 tgtgcaaagg caggcccaac aagctggagc gtgatgaggt ggtccaggtc ttcgacacca
     1021 gccagttctt ccgagagctg cagcagttct ataacagcca gggccggctt cctgacggca
     1081 gggtggtgct gtgctttggg gaagagtttc cggatatggc ccccttgcgc tccaaactca
     1141 ttctcgtgca gattgagcag ctgtatgtcc ggcaactggc agaagaggct gggaagagct
     1201 gtggagccgg ctctgtgatg caggcccccg aggagccgcc gccagaccag gtcttccgga
     1261 tgtttccaga tatttgtgcc tcacaccaga gatcattttt cagagaaaac caacagatca
     1321 ccgtctaagt gcgtcgcttg ggcgccccac cccgtctgcg tcctgcatcc atctccctgt
     1381 tacagtggcc cgcatcatga ttaaagaatg tggatccctc tgtctggggt gggatgcctt
     1441 actttgcact taatttaata agggcattct cggaggagta gacgtttaat acgaagtggc
     1501 gcatagccct gccgagatgt cggtgatggc ctgatgcg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: L76079. Homo sapiens beta...[gi:4584711] Links  


LOCUS       L76079                  2245 bp    DNA     linear   PRI 16-JUN-1999
DEFINITION  Homo sapiens beta 1,4-N-acetylgalactosaminyltransferase gene.
ACCESSION   L76079
VERSION     L76079.1  GI:4584711
KEYWORDS    beta-1,4 N-acetylgalactosaminyltransferase.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2245)
  AUTHORS   Furukawa,K., Soejima,H., Niikawa,N. and Shiku,H.
  TITLE     Genomic organization and chromosomal assignment of the human beta1,
            4-N-acetylgalactosaminyltransferase gene. Identification of
            multiple transcription units
  JOURNAL   J. Biol. Chem. 271 (34), 20836-20844 (1996)
  MEDLINE   96355429
   PUBMED   8702839
COMMENT     GSDB:S:73982
FEATURES             Location/Qualifiers
     source          1..2245
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="placenta"
     exon            1156..1586
                     /note="1a; putative"
                     /number=1
     exon            1415..1586
                     /note="1b; putative"
                     /number=1
     intron          1587..2227
                     /note="putative"
                     /number=1
     exon            1709..1817
                     /note="1c; putative"
                     /number=1
     intron          1818..2227
                     /note="alt.; putative"
                     /number=1
     exon            2228..>2245
                     /note="putative"
                     /number=2
BASE COUNT      445 a    653 c    654 g    480 t     13 others
ORIGIN      
        1 gtcgactcta ggcctaaatg gccatttagg tgacactata gaagagctcc aggatcataa
       61 acaaaacaaa aattccatga tttcttcttt tgagacagag tctcgctctg tcacccaggc
      121 tacagtgcca tggcaccatc tcagctcact gcagcttcaa cttcccgagc tcaagcaatc
      181 ttcccacttc agcctcctaa gtagttggac tacaggtgtg agccatcttg ctgggctaat
      241 ttttaaattt tttgtagaga cggggggggg ggggtctcac catgttgccc agnctggtct
      301 cgaactcctg gtctcctatc gcagcctccc aaagtgctga gattacaagc atgagccact
      361 gcgccgggtt tcatgatttc tttcactact tttatgattt cacctttaat attaaaatat
      421 ttaatacatt tggaattttg gtgtaagaag tgtagagatt ccactttcac cccagctgat
      481 tagtcatttg tttgtataaa attcattcat tcaataatcc accacacaca cacacacaca
      541 catattttcg ttatgttcta tatttcttgt tatatctttt tttcctattg aatcttgatg
      601 catcttttat tagaaccacg gagccacaga actccgattc tcggcaatgt gnggtatcag
      661 gctatgacct aaggcttgca agcagaacta ccgccaatct ggcctctctg gggttccctt
      721 cctcctgcaa ggttgcacca agcggcagtt cggggaaata gggccttcaa ttacccagtg
      781 tgtatctctt gagtgtctgt taaataatgg ggggcaggnc ggggaggagg gggagaggga
      841 gnctgtncct cagtctttgt tagccaggca gagcgtttat tttccagggg ttgtttcctt
      901 tatcttcatc ctaacattca aacttgactt gagttttagc gcgcccccgc cctacctcct
      961 cccaaagccc ttatttgtcc atccttgcct acaagctcag aacgagcagn caatggaatg
     1021 gatgtcgtgg gagcaggcgg ttgagatgcc agtgtttgga ggctaaggac tgggggctgt
     1081 gagcgaggac cttcccgcgt ttgctccgcg cccccgcagt ttcctccgcg cccactagaa
     1141 ggcgccgggg cgctcgcatt cccccgcgcg gagccgaagc agccgcaacg agccgggagc
     1201 tgagccgcgc tgcgctgcgg tgcgaagagc cgggcggcgg ccagagccct ccccgcgctg
     1261 ccagtgggac gcggagccag ggatccccgg ctttgccccg gggctggggt gcaacagact
     1321 cttaacttgg tcctcgcgcc ccgccaaccg cccccgggca ggaaacggcc agaaccacca
     1381 ccgcgccgcg ctccacaaaa gccccgggag ttccaagacg ggagggatcg ggcgcgctcc
     1441 agaggcagga gggtccccca caccctcagg ctcatgccca gctcccccat cggacagccc
     1501 ccagccccat gaggaccctc aggcccgggc gagagcccgg cacagcccgg accgaaattt
     1561 tgccgctgcc ttagagcgtt agacaggtca gtgcccgggg gtgggtgggg gtgggattgg
     1621 ttggtgaggg gaggaggagg aggcgggagg aggcggnagg aggcgcgctt cccggtccac
     1681 tctgtccccc gcgccgcttg ncgaggncac tcatcgggtc acctccgccg ggggcgcccg
     1741 ccagcacggg catgattctg cattttttcc tgacaggcta ggtgtggagc cgccctccgc
     1801 agcccacccc ggcccaggtg atgctcagac tgaggagtcg ggggcgcgcc ggccagggcg
     1861 ggcggcggnc ttggngactg ccactgccct gatagagcgt gggggctttt ccncttacgc
     1921 acggccgcgt cgaaaccgcg gcatccaaat gtcttcagca ttaagatgcc agncctcggg
     1981 tgtgtgagga aaccgagatg caggagaggg cgagaagaaa ggaggccggg agaccctggg
     2041 gcctgggaag tgaggggagg ggtaccggga aaaccgcggc tggcagtgca aaccctagcg
     2101 cgatgtgtgg agtggagagc gctcttggga gccggcagga gcttctgagg acgcgggggg
     2161 caagggcccc atcccttgcc ctcccttctt ccctcccttt ctcactcccc accctacccc
     2221 cacctaggat gtggctgggc cgccg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMPubMedPubMedSNPSNPTaxonomyTaxonomyLinkOutLinkOutHelpHelp  







    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M83651. Human beta-1,4 N-...[gi:431032] Links  


LOCUS       HUMAGT                  2512 bp    mRNA    linear   PRI 30-NOV-1993
DEFINITION  Human beta-1,4 N-acetylgalactosaminyltransferase mRNA, complete
            cds.
ACCESSION   M83651
VERSION     M83651.1  GI:431032
KEYWORDS    beta-1,4 N-acetylgalactosaminyltransferase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2512)
  AUTHORS   Nagata,Y., Yamashiro,S., Yodoi,J., Lloyd,K.O., Shiku,H. and
            Furukawa,K.
  TITLE     Expression cloning of beta 1,4 N-acetylgalactosaminyltransferase
            cDNAs that determine the expression of GM2 and GD2 gangliosides
  JOURNAL   J. Biol. Chem. 267 (17), 12082-12089 (1992)
  MEDLINE   92291088
   PUBMED   1601877
COMMENT     On Dec 1, 1993 this sequence version replaced gi:178260.
            Original source text: Homo sapiens cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..2512
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="YT"
     CDS             61..1662
                     /codon_start=1
                     /product="beta-1,4 N-acetylgalactosaminyltransferase"
                     /protein_id="AAA35516.1"
                     /db_xref="GI:431033"
                     /translation="MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQ
                     SPRRPELPDLAPEPRYAHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDL
                     TKAFDPAELRAASATREQEFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSI
                     LVPGLSLQAASGQEVYQVNLTASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNR
                     QLQLVTYSSRSYQTNTADTVRFSTEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNIS
                     ALVTIATKTFLRYDRLRALITSIRRFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGK
                     GWFAGRNLAVSQVTTKYVLWVDDDFVFTARTRLERLVDVLERTPLDLVGGAVREISGF
                     ATTYRQLLSVEPGAPGLGNCLRQRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVG
                     FDPRLSRVAHLEFFLDGLGSLRVGSCSDVVVDHASKLKLPWTSRDAGAETYARYRYPG
                     SLDESQMAKHRLLFFKHRLQCMTSQ"
BASE COUNT      491 a    802 c    705 g    514 t
ORIGIN      
        1 ggcgagagcc cggcacagcc cggaccgaaa ttttgccgct gccttagagc gttagacagg
       61 atgtggctgg gccgccgggc cctgtgcgct ctggtccttc tgctcgcctg cgcctcgctg
      121 gggctcctgt acgcgagcac ccgggacgcg cccggcctcc ggctacctct tgcgccgtgg
      181 gcgcccccgc aaagcccccg caggcccgag ctgccagatc ttgctcctga gccccgctac
      241 gcacacatcc cggtcaggat caaggagcaa gtagtggggc tgctggcttg gaacaactgc
      301 agttgtgagt ccagtggggg gggcctcccc ctccccttcc agaaacaagt ccgagctatt
      361 gacctcacca aggcctttga ccctgcagag ctgagggctg cctctgccac aagagagcag
      421 gagttccagg cctttctgtc gaggagccag tccccagctg accagctgct catagcccct
      481 gccaactccc cgctccagta ccccctacag ggtgtggaag ttcagcccct caggagcatc
      541 ttggtgccag ggctgagcct tcaggcagct tctggtcagg aggtatacca ggtgaacctg
      601 actgcctccc taggcacctg ggacgtggca ggggaagtga ctggagttac tctcactgga
      661 gagggtcagg cagatctcac ccttgtcagc ccagggctgg accaactcaa caggcaacta
      721 caactggtca cttacagcag ccgaagctac cagaccaaca cagcagacac agtccggttc
      781 tccaccgagg gacatgaggc tgctttcact atccgcataa gacacccgcc caaccctcgg
      841 ctgtacccac ctgggtctct accccaggga gcccagtaca acatcagcgc tctagtcacg
      901 attgccacca agaccttcct ccgttatgat cggctacggg ctctcatcac cagtatccgc
      961 cgcttctacc caacggttac cgtggtcatc gctgacgaca gcgacaagcc agagcgcgtt
     1021 agtggcccct acgtggaaca ctatctcatg cccttcggca agggctggtt cgcaggccgg
     1081 aacctggccg tgtctcaagt aaccaccaag tacgtgctgt gggtggacga cgacttcgtc
     1141 ttcacggcgc ggacgcggct ggagaggctt gtggacgtgc tggagcggac gccgctggac
     1201 ctggtggggg gcgcggtgcg cgagatctcc ggctttgcca ccacttatcg gcagctgctg
     1261 agcgtggagc ccggcgcccc aggcctcggg aactgcctcc ggcaaaggcg cggcttccac
     1321 cacgagctcg tcggcttccc aggctgcgtg gtcaccgacg gcgtggttaa cttcttcctg
     1381 gcgcggactg acaaggtgcg cgaggtcggt ttcgaccccc gcctcagccg cgtggctcat
     1441 ctggaattct tcttggatgg gcttggttcc cttcgggttg gctcctgctc cgacgtcgtg
     1501 gtggatcatg catccaaact gaagctgcct tggacatcaa gggatgccgg agcagagact
     1561 tacgcccggt accgttaccc aggatcactg gacgagagcc agatggccaa acaccggctg
     1621 ctcttcttca aacaccggct gcagtgcatg acctcccagt gatggcccgc tggggatttc
     1681 tgactgtcag gctgggcctg cctccttgtc cctgccagga atttccaaca aaccccacca
     1741 ccctgtgagc actctactgg ctgtccctga gcctctagtt cctcactctt ccttttcaga
     1801 acctgatgcc cagtaggggt tgtcctggtg acacccctcc tttttccagt gcccagaggc
     1861 ctggtggagc cataacctct cccacagcca gtgccaagtc ctccccctgc ccattctcat
     1921 ggggcaggaa atggggggat cactttccaa gtgccaaaga gcccagaggg actctaagaa
     1981 cctaaggtgg aaacactgtc ctctcatctt gggaccgagg gggtggggaa gttccccaac
     2041 acataatccc aagactgtgc ccctcatctg catcttcaga tccagtactc tgtgtacctg
     2101 ctccagcccc acccccacag agagaacttg tggctctggg gctggggtga gggctggtgg
     2161 ttggtgaaag ccattcttag ttgtgtctct gcaatgctgt gggcacaaaa gaaggggcac
     2221 cagagtccct gtgcaaacac ctagactcac ttcatggatt ccaaagctct cagcttcatt
     2281 ttattagtta cgttaggtaa gggggttcaa gggtcatggt cctcatcaca cacatgtcat
     2341 cagggccctc ctgcactcca catgatgagg tcagacccac acggtgcaaa tctttgggtc
     2401 agtgagctcc tggagaagag aggagacatg tcaggaatag attaggcacc cctcttcctt
     2461 aatgaaatgt ggcagtcctc tcaggggtac cccacctact tagggatctg ag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC015391. Homo sapiens, clo...[gi:15929933] Links  


LOCUS       BC015391                1470 bp    mRNA    linear   PRI 04-OCT-2001
DEFINITION  Homo sapiens, clone MGC:21698 IMAGE:4425229, mRNA, complete cds.
ACCESSION   BC015391
VERSION     BC015391.1  GI:15929933
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1470)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-OCT-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 27 Row: l Column: 7
            This clone was selected for full length sequencing because it
            passed the following selection criteria: GenomeScan gene
            prediction.
FEATURES             Location/Qualifiers
     source          1..1470
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MGC:21698 IMAGE:4425229"
                     /tissue_type="Kidney, hypernephroma"
                     /clone_lib="NIH_MGC_89"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     CDS             23..484
                     /codon_start=1
                     /product="Unknown (protein for MGC:21698)"
                     /protein_id="AAH15391.1"
                     /db_xref="GI:15929934"
                     /translation="MGAPLLSPGWGAGAAGRRWWMLLAPLLPALLLVRPAGALVEGLY
                     CGTRDCYEVLGVSRSAGKAEIARAYRQLARRYHPDRYRPQPGDEGPGRTPQSAEEAFL
                     LVATAYETLKVSQAAAELQQYCMQNACKDALLVGVPAGSNPFREPRSCALL"
BASE COUNT      407 a    301 c    344 g    418 t
ORIGIN      
        1 agagccgcca gcgaggctgg ggatgggggc gccgctgctc tctcccggct ggggagccgg
       61 ggctgccggc cggcgctggt ggatgctgct ggcgcccctg ctgccggcgc tgctgctggt
      121 gcggcccgcg ggggccctgg tggaggggct ctactgcggc acgcgggact gctacgaggt
      181 gctgggcgtg agccgctcgg cgggcaaggc ggagatcgcg cgggcctacc gccagctggc
      241 ccggcgctac caccctgacc gctaccggcc ccagcccgga gacgagggcc ccgggcggac
      301 gccgcagagc gccgaggagg ctttcctgct ggtggcaacc gcctacgaga cactcaaggt
      361 ctctcaggca gctgcagagc ttcaacagta ctgtatgcag aatgcctgca aggatgccct
      421 gctggtgggt gttccagctg gaagtaaccc cttccgggag cctagatcct gtgctttact
      481 ctgaagactc gagagaagtt tgctgaggaa tgccttcaag cacaaagtga tgaatgactg
      541 ccttcaagtc tcaagaaaac acttttccct aacttttaga gatatttcag ccctttcctg
      601 tggcctggtc ctatagccaa aatcacagat attcatgagt ttctacttga gtgagaaaac
      661 tgggtgaagg aatagaattt taaatagtaa taactgcttg ttttttttgt gcaagtactt
      721 ttatacataa gataaacaaa aaccttacca ccaaacatac caaaatgcac ctctttcata
      781 agtgagttac taagatttct atacctggaa tatcatgtat gtttcattta ctggatgttt
      841 acattttagg aaggaaaata gttttgttta tttaaacaac tgaatactta taaactgttg
      901 ttcctggaag ttatttattc cataaaaaat ttgttctttt gtcatgaatt tataattcct
      961 aaatgaagac cagaaagtac aaattgctgg gaggaagaat aggctttatt aatcaactga
     1021 tgtcttgatt tttctaaatg ggaagattgc tttattttta acactaatta tgggagcaga
     1081 ttcttagcaa acttctttgg aaaagttaat gttatgatgt gcattaggct gccccatcgt
     1141 gtatataaat gaagcagatt tgatttttgt attcttacgt ttctctgctt tgtagttgtg
     1201 gctgtactta aagaaataca gaatttcata tatttaaaaa tgtttaaaat gtgacccaca
     1261 gaacattgta aatgattaaa aactaacatg aaaatattac aacctaaaag aattcttaac
     1321 ttcacaagtg ttttacttcg acgatgtgcc tttgatttaa tttgggacac ttttttagaa
     1381 ggatacatta ttcgtgtttg caacggtctt tgaagagctt ggaaataaaa tttctgctta
     1441 attaatcaaa aaaaaaaaaa aaaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  






&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF100756. Homo sapiens coat...[gi:5410297] Links  


LOCUS       AF100756                3075 bp    mRNA    linear   PRI 08-JUL-1999
DEFINITION  Homo sapiens coat protein gamma-cop mRNA, complete cds.
ACCESSION   AF100756
VERSION     AF100756.1  GI:5410297
KEYWORDS    FLI_CDNA.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3075)
  AUTHORS   Peng,Y., Song,H., Dai,M., Huang,Q., Mao,Y., Zhang,Q., Mao,M.,
            Fu,G., Luo,M., Chen,J. and Hu,R.
  TITLE     Human coat protein gamma-cop gene
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3075)
  AUTHORS   Peng,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-OCT-1998) Rui-Jin Hospital, Shanghai Institute of
            Endocrinology, Molecular Medicine Center, 197, Rui-Jin Road II,
            Shanghai, P.R. China, 200025
FEATURES             Location/Qualifiers
     source          1..3075
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="pituitary"
     CDS             76..2700
                     /codon_start=1
                     /product="coat protein gamma-cop"
                     /protein_id="AAD43020.1"
                     /db_xref="GI:5410298"
                     /translation="MLKKFDKKDEESGGGSNPFQHLEKSAVLQEARVFNETPINPRKC
                     AHILTKILYLINQGEHLGTTEATEAFFAMTKLFQSNDPTLRRMCYLTIKEMSCIAEDV
                     IIVTSSLTKDMTGKEDNYRGPAVRALCQITDSTMLQAIERYMKQAIVDKVPSVSSSAL
                     VSSLHLLKCSFDVVKRWVNEAQEAASSDNIMVQYHALGLLYHVRKNDRLAVNKMISKV
                     TRHGLKSPFAYCMMIRVASKQLEEEDGSRDSPLFDFIESCLRNKHEMVVYEAASAIVN
                     LPGCSAKELAPAVSVLQLFCSSPKAALRYAAVRTLNKVAMKHPSAVTACNLDLENLVT
                     DSNRSIATLAITTLLKTGSESSIDRLMKQISSFMSEISDEFKVVVVQAISALCQKYPR
                     KHAVLMNFLFTMLREEGGFEYKRAIVDCIISIIEENSESKETGLSHLCEFIEDCEFTV
                     LATRILHLLGQEGPKTTNPSKYIRFIYNRVVLEHEEVRAGAVSALAKFGAQNEEMLPS
                     ILVLLKRCVMDDDNEVRDRATFYLNVLEQKQKALNAGYILNGLTVSIPGLERALQQYT
                     LEPSEKPFDLKSVPLATAPMAEQRTESTPITAVKQPEKVAATRQEIFQEQLAAVPEFR
                     GLGPLFKSSPEPVALTESETEYVIRCTKHTFTNHMVFQFDCTNTLNDQTLENVTVQME
                     PTEAYEVLCYVPARSLPYNQPGTCYTLVALPKEDPTAVACTFSCMMKFTVKDCDPTTG
                     ETDDEGYEDEYVLEDLEVTVADHIQKVMKLNFEAAWDEVGDEFEKEETFTLSTIKTLE
                     EAVGNIVKFLGMHPCERSDKVPDNKNTHTLLLAGVFRGGHDILVRSRLLLLDTVTMQV
                     TARSLEELPVDIILASVG"
BASE COUNT      779 a    833 c    781 g    682 t
ORIGIN      
        1 ggcactgtag caccgctact ccgtgccgcg cccgtcgagc attgcgttgc tgcattgcgc
       61 cccaccgact ccactatgtt gaagaaattc gacaagaagg atgaggagtc aggtggaggc
      121 tccaacccat tccagcacct tgagaagagt gcggtactcc aggaggcccg tgtatttaat
      181 gaaactccca tcaaccctcg gaaatgtgcc cacatcctca ccaagattct ttatctcata
      241 aaccaggggg agcacctggg gaccacggaa gcgaccgagg ccttctttgc catgaccaag
      301 ctctttcagt ccaatgaccc cacactccgt cggatgtgct acttgaccat caaggagatg
      361 tcttgcattg cagaggatgt catcattgtc accagcagcc taacaaaaga catgactggg
      421 aaagaagaca actaccgggg cccggccgtg cgagccctct gccagatcac tgatagcacc
      481 atgctgcagg ctattgagcg ctacatgaaa caagccattg tggacaaggt gcccagtgtc
      541 tccagctctg ccctcgtgtc ttccttgcac ctgctgaagt gcagctttga cgtggtcaag
      601 cgctgggtga atgaggctca ggaggcagca tccagtgata acatcatggt ccagtaccac
      661 gcactagggc tcctgtacca tgtgcgtaag aatgaccgcc tagccgtcaa taagatgatc
      721 agcaaggtca cacggcatgg ccttaagtct ccctttgcct actgcatgat gatccgggtg
      781 gccagcaagc agctggaaga ggaggatggc agccgtgaca gcccactgtt tgacttcatc
      841 gagagctgct tgcgcaacaa gcacgagatg gtggtgtatg aagccgcctc ggccattgtc
      901 aacctgcctg ggtgcagcgc caaggagctg gccccagctg tctcagtgct ccagctcttc
      961 tgcagctccc ccaaggccgc cctccgttac gccgccgtcc gcaccctcaa caaggtggcc
     1021 atgaagcacc cgtccgctgt gacagcttgt aatctggatc tggagaacct ggtcacagat
     1081 tcaaaccgca gcattgccac gctggccatc accaccctcc ttaagacggg cagcgagagc
     1141 agcatcgacc gcctcatgaa gcagatctcc tccttcatgt cagaaatctc ggatgaattc
     1201 aaggtggtgg ttgtccaggc catcagtgcc ctgtgtcaga aatatcctcg caaacacgcc
     1261 gtccttatga acttcctgtt caccatgctg cgggaagagg gtggctttga gtataagcgc
     1321 gctatcgtgg actgcatcat cagcatcatt gaagagaact cagagagcaa ggagacaggg
     1381 ctgtcacatc tgtgcgagtt catcgaggac tgcgagttca cagtgctggc cacccgtatt
     1441 ctacatctcc tgggccagga ggggcccaag accaccaatc cctcaaagta catccgcttc
     1501 atctataacc gagtggtctt ggagcatgag gaggtccggg caggtgctgt gagtgctctg
     1561 gcgaagtttg gagcccagaa tgaagagatg ttacccagta tcttggtgtt gctgaagagg
     1621 tgtgtgatgg atgatgacaa tgaagtaagg gaccgagcca ccttctacct aaatgtcctg
     1681 gagcagaagc agaaggccct taatgcaggc tatatcctaa atggtctgac tgtgtccatc
     1741 cctggtctgg agagggctct gcagcagtac actctagaac catcagaaaa accttttgac
     1801 ctcaagtctg tgcccctggc cacggcgccc atggcagagc agagaacaga aagtaccccc
     1861 atcacagcag tcaaacagcc tgagaaagtg gcagctacca ggcaggagat cttccaggag
     1921 cagttggcag cagtgccaga gttccgcggt cttgggcccc tcttcaagtc ctcgcctgag
     1981 cccgtggccc tcaccgagtc agagacggag tatgtcatcc gctgcaccaa acacaccttc
     2041 accaaccaca tggtttttca gtttgactgc acaaacacac tcaatgacca gaccttggag
     2101 aatgtcacag tgcagatgga gcccactgag gcctatgagg tgctctgtta cgtgcctgcc
     2161 cggagcctgc cctacaacca gcccgggacc tgctacacac tggtggcact gcccaaagaa
     2221 gaccccacag ctgtggcctg cacattcagc tgcatgatga agttcactgt caaggactgt
     2281 gatcccacca ctggggagac tgatgacgaa ggctatgagg atgagtatgt gctggaagat
     2341 ctggaagtta ctgtagctga tcacattcaa aaggtcatga aactgaactt cgaagcagcc
     2401 tgggatgagg taggggatga atttgagaag gaggaaacgt tcaccttgtc taccatcaag
     2461 acacttgaag aggctgtggg taatattgtg aagttcttgg gaatgcaccc ttgtgagagg
     2521 tcagacaaag tgccggataa caagaacacc cacacgttgc tcctggctgg tgtgttccgg
     2581 ggtggtcatg acatcctggt gcgctcccgg ctgctgcttt tggacacagt gacaatgcag
     2641 gtgacagcca gaagtttgga ggagctgcca gtagacatca tcttggcatc tgtgggataa
     2701 gaggccagcc tgcataggac ctcataccct tccccaacac tacctggaag ttgtgccttc
     2761 ctcatgaaac tggcagaaac cccttcccaa gcttctgtat tgaaaaacaa ttaggaatca
     2821 ttgcagattt ttttttattc tgctcccacc tcccacccgg gactacttgc tggtgacttt
     2881 tttttttttt ttttttaaat aggggatgat tttagcttgt cctaaatctt gctgtccacc
     2941 cttccaggaa agggacattg taaatgaata aaacattctc aactcctctt gaatctatcc
     3001 cccaagaaac catcttatcc ctgtaataaa tcagcatgta tttattgaaa aaaaaaaaaa
     3061 aaaaaaaaaa aaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  



&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X62744. Human RING6 mRNA ...[gi:36062] Links  


LOCUS       HSRING6                 1100 bp    mRNA    linear   PRI 14-OCT-1994
DEFINITION  Human RING6 mRNA for HLA class II alpha chain-like product.
ACCESSION   X62744
VERSION     X62744.1  GI:36062
KEYWORDS    HLA class II antigen.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1100)
  AUTHORS   Kelly,A.P., Monaco,J.J., Cho,S.G. and Trowsdale,J.
  TITLE     A new human HLA class II-related locus, DM
  JOURNAL   Nature 353 (6344), 571-573 (1991)
  MEDLINE   92018223
REFERENCE   2  (bases 1 to 1100)
  AUTHORS   Kelly,A.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1992) Adrian P Kelly, Human Immunogenetics,
            Imperial Cancer Research Fund, 44 Lincoln's Inn Fields, London,
            WC2A 3PX, United Kingdom
FEATURES             Location/Qualifiers
     source          1..1100
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /clone="RING6"
                     /cell_type="B lymphoblastoid"
     gene            1..1100
                     /gene="RING6"
     CDS             46..831
                     /gene="RING6"
                     /note="HLA class II alpha chain-like"
                     /codon_start=1
                     /product="RING6"
                     /protein_id="CAA44606.1"
                     /db_xref="GI:36063"
                     /db_xref="SWISS-PROT:P28067"
                     /translation="MGHEQNQGAALLQMLPLLWLLPHSWAVPEAPTPMWPDDLQNHTF
                     LHTVYCQDGSPSVGLSEAYDEDQLFFFDFSQNTRVPRLPEFADWAQEQGDAPAILFDK
                     EFCEWMIQQIGPKLDGKIPVSRGFPIAEVFTLKPLEFGKPNTLVCFVSNLFPPMLTVN
                     WHDHSVPVEGFGPTFVSAVDGLSFQAFSYLNFTPEPSDIFSCIVTHEIDRYTAIAYWV
                     PRNALPSDLLENVLCGVAFGLGVLGIIVGIVLIIYFRKPCSGD"
BASE COUNT      247 a    305 c    264 g    284 t
ORIGIN      
        1 ctaaagctgg gttggtagct cctacctact gtgtggcaag aaggtatggg tcatgaacag
       61 aaccaaggag ctgcgctgct acagatgtta ccacttctgt ggctgctacc ccactcctgg
      121 gccgtccctg aagctcctac tccaatgtgg ccagatgacc tgcaaaacca cacattcctg
      181 cacacagtgt actgccagga tgggagtccc agtgtgggac tctctgaggc ctacgacgag
      241 gaccagcttt tcttcttcga cttttcccag aacactcggg tgcctcgcct gcccgaattt
      301 gctgactggg ctcaggaaca gggagatgct cctgccattt tatttgacaa agagttctgc
      361 gagtggatga tccagcaaat agggccaaaa cttgatggga aaatcccggt gtccagaggg
      421 tttcctatcg ctgaagtgtt cacgctgaag cccctggagt ttggcaagcc caacactttg
      481 gtctgttttg tcagtaatct cttcccaccc atgctgacag tgaactggca cgatcattcc
      541 gtccctgtgg aaggatttgg gcctactttt gtctcagctg tcgatggact cagcttccag
      601 gccttttctt acttaaactt cacaccagaa ccttctgaca ttttctcctg cattgtgact
      661 cacgaaattg accgctacac agcaattgcc tattgggtac cccggaacgc actgccctca
      721 gatctgctgg agaatgtgct gtgtggcgtg gcctttggcc tgggtgtgct gggcatcatc
      781 gtgggcattg ttctcatcat ctacttccgg aagccttgct caggtgactg attcttccag
      841 accagagttt gatgccagca gcttcggcca tccaaacaga ggatgctcag atttctcaca
      901 tcctgcccag gatctcctct tagggtagaa gaagtctctg ggacatccct ggggtgtgtg
      961 tgtagatttc ccacctgggg actctgctgt ccctgggctt gcatcccagg gatcccagag
     1021 tggcctgcct atcacaacca catcccttcc ccccacaagg caataaatct catttcttta
     1081 aaaaaaaaaa aaaaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: X89750. H.sapiens mRNA fo...[gi:1150425] Links  


LOCUS       HSTGIFPRO               1562 bp    mRNA    linear   PRI 06-JAN-1996
DEFINITION  H.sapiens mRNA for TGIF protein.
ACCESSION   X89750
VERSION     X89750.1  GI:1150425
KEYWORDS    tgif gene; TGIF protein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Bertolino,E., Reimund,B., Wildt-Perinic,D. and Clerc,R.G.
  TITLE     A novel homeobox protein which recognizes a TGT core and
            functionally interferes with a retinoid-responsive motif
  JOURNAL   J. Biol. Chem. 270 (52), 31178-31188 (1995)
  MEDLINE   96125101
REFERENCE   2  (bases 1 to 1562)
  AUTHORS   Clerc,R.G.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JUL-1995) R.G. Clerc, Roche Ltd., Room 69-209,
            Grenzacherstr.124, CH- 4002 Basel, SWITZERLAND
FEATURES             Location/Qualifiers
     source          1..1562
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="liver"
                     /clone_lib="Clontech"
     gene            1..1562
                     /gene="tgif"
     CDS             312..1130
                     /gene="tgif"
                     /codon_start=1
                     /product="TGIF protein"
                     /protein_id="CAA61897.1"
                     /db_xref="GI:1150426"
                     /db_xref="SWISS-PROT:Q15583"
                     /translation="MKGKKGIVAASGSETEDEDSMDIPLDLSSSAGSGKRRRRGNLPK
                     ESVQILRDWLYEHRYNAYPSEQEKALLSQQTHLSTLQVCNWFINARRRLLPDMLRKDG
                     KDPNQFTISRRGAKISETSSVESVMGIKNFMPALEETPFHSCTAGPNPTLGRPLSPKP
                     SSPGSVLARPSVICHTTVTALKDVPFSLCQSVGVGQNTDIQQIAAKNFTDTSLMYPED
                     TCKSGPSTNTQSGLFNTPPPTPPDLNQDFSGFQLLVDVALKRAAEMELQAKLTA"
BASE COUNT      421 a    382 c    371 g    388 t
ORIGIN      
        1 ctggaattcg gggcgccgag caggagcagg gaacaaagga gcggagaggg gaggggagag
       61 agttgggcga gggagagccc ccggccggct gccagaagat cctggcggga ggaagcccaa
      121 gtgtcacttg aattccaccc aaggagcggg cgcctgggat cagagcgtcc tgtttagcaa
      181 taacggctgg agcacgtcct acaagttacg ggagagtcgg ctgtgaagga gacgttcgct
      241 tatcccctgt gtccccgctc ctggcccctc cagacccccg ccttgcctcg cgctgggagg
      301 ggagatccag aatgaaaggc aagaaaggta ttgttgcagc atctggcagt gagactgagg
      361 atgaggacag catggacatt cccttggacc tttcttcatc cgctggctca ggcaagagaa
      421 ggagaagggg caacctaccc aaggagtctg tgcagattct tcgggattgg ctgtatgagc
      481 accgttacaa tgcctatcct tcagagcaag aaaaagcgtt gctgtcccag caaacacacc
      541 tgtctacgct acaggtctgt aactggttca tcaacgcccg ccgcaggctc ctccctgaca
      601 tgctgagaaa ggatggcaaa gatccaaatc agttcacaat ttcccgccgt ggggccaaga
      661 tttctgaaac gagctctgtg gagtccgtga tgggcatcaa aaacttcatg ccagctctag
      721 aggagacccc atttcattcc tgtacagctg ggccaaaccc aaccctaggg aggccactgt
      781 ctcctaagcc gtcatccccg ggatcagttt tggctcgtcc atcagtgatc tgccatacca
      841 ctgtgactgc attgaaagat gtccctttct ctctctgcca gtcggtcggt gtgggacaaa
      901 acacagatat acagcagata gcggccaaaa acttcacaga cacctctctc atgtacccag
      961 aggacacttg taaatctgga ccaagtacga atacacagag tggtcttttc aacactcctc
     1021 cccctactcc accggacctc aaccaggact tcagtggatt tcagcttcta gtggatgttg
     1081 cactcaaacg ggctgcagag atggagcttc aggcaaaact tacagcttaa cccattttca
     1141 agcaaaacag ttctcagaaa tgtcatgatt gccggggtga aggcaagaga tgaattgcat
     1201 tattttatat attttttatt aatatttgca catgggattg ctaaaacagc ttcctgttac
     1261 tgagatgtct tcaatggaat acagtcattc caagaactat aaacttaaag ctactgtaga
     1321 aacaaagggt tttctttttt aaatgtttct tggtagatta ttcataatgt gagatggttc
     1381 ccaatatcat gtgatttttt tttttcctcc ccttcccttt ttttgttatt ttttcagact
     1441 gtgcaatact tagagaacct atagcatctt ctcattccca tgtggaacag gatgcccaca
     1501 tactgtctaa ttaataaatt ttccattttt tttcaaacaa gtaaaaaaaa aaaaaaaaaa
     1561 aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AJ006266. Homo sapiens mRNA...[gi:3287172] Links  


LOCUS       HSAJ6266                4050 bp    mRNA    linear   PRI 30-JUN-1998
DEFINITION  Homo sapiens mRNA for AND-1 protein.
ACCESSION   AJ006266
VERSION     AJ006266.1  GI:3287172
KEYWORDS    AND-1 protein; DNA-binding protein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Kohler,A., Schmidt-Zachmann,M.S. and Franke,W.W.
  TITLE     AND-1, a natural chimeric DNA-binding protein, combines an HMG-box
            with regulatory WD-repeats
  JOURNAL   J. Cell. Sci. 110 (Pt 9), 1051-1062 (1997)
  MEDLINE   97318764
REFERENCE   2  (bases 1 to 4050)
  AUTHORS   Koehler,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-MAY-1998) Koehler A., German Cancer Research Center,
            Division of Cell Biology, Im Neuenheimer Feld 280, Heidelberg,
            D-69120, GERMANY
COMMENT     nt 1-101 from EST (AC: AA315413), nt 102-4050 from human fetal
            brain cDNA
            library.
FEATURES             Location/Qualifiers
     source          1..4050
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="fetal brain"
                     /clone_lib="Stratagene Human fetal brain lambda Uni-ZAP
                     library"
     CDS             40..3429
                     /function="DNA-binding protein"
                     /codon_start=1
                     /evidence=experimental
                     /product="AND-1 protein"
                     /protein_id="CAA06932.1"
                     /db_xref="GI:3287173"
                     /db_xref="SPTREMBL:O75717"
                     /translation="MPATRKPMRYGHTEGHTEVCFDDSGSFIVTCGSDGDVRIWEDLD
                     DDDPKFINVGEKAYSCALKSGKLVTAVSNNTIQVHTFPEGVPDGILTRFTTNANHVVF
                     NGDGTKIAAGSSDFLVKIVDVMDSSQQKTFRGHDAPVLSLSFDPKDIFLASASCDGSV
                     RVWQISDQTCAISWPLLQKCNDVINAKSICRLAWQPKSGKLLAIPVEKSVKLYRRESW
                     SHQFDLSDNFISQTLNIVTWSPCGQYLAAGSINGLIIVWNVETKDCMERVKHEKGYAI
                     CGLAWHPTCGRISYTDAEGNLGLLENVCDPSGKTSSSKVSSRVEKDYNDLFDGDDMSN
                     AGDFLNDNAVEIPSFSKGIINDDEDDEDLMMASGRPRQRSHILEDDENSVDISMLKTG
                     SSLLKEEEEDGQEGSIHNLPLVTSQRPFYDGPMPTPRQKPFQSGSTPLHLTHRFMVWN
                     SIGIIRCYNDEQDNAIDVEFHDTSIHHATHLSNTLNYTIADLSHEAILLACESTDELA
                     SKLHCLHFSSWDSSKEWIIDLPQNEDIEAICLGQGWAAAATSALLLRLFTIGGVQKEV
                     FSLAGPVVSMAGHGEQLFIVYHRGTGFDGDQCLGVQLLELGKKKKQILHGDPLPLTRK
                     SYLAWIGFSAEGTPCYVDSEGIVRMLNRGLGNTWTPICNTREHCKGKSDHYWVVGIHE
                     NPQQLRCIPCKGSRFPPTLPRPAVAILSFKLPYCQIATEKGQMEEQFWRSVIFHNHLD
                     YLAKNGYEYEESTKNQATKEQQELLMKMLALSCKLEREFRCVELADLMTQNAVNLAIK
                     YASRSRKLILAQKLSELAVEKAAELTATQVEEEEEEEDFRKKLNAGYSNTATEWSQPR
                     FRNQVEEDAEDSGEADDEEKPEIHKPGQNSFSKSTNSSDVSAKSGAVTFSSQGRVNPF
                     KVSASSKEPAMSMNSARSTNILDNMGKSSKKSTALSRTTNNEKSPIIKPLIPKPKPKQ
                     ASAASYFQKRNSQTNKTEEVKEENLKNVLSETPAICPPQNTENQRPKTGFQMWLEENR
                     SNILSDNPDFSDEADIIKEGMIRFRVLSTEERKVWANKAKGETASEGTEAKKRKRVVD
                     ESDETENQEEKAKENLNLSKKQKPLDFSTNQKLSAFAFKQE"
BASE COUNT     1306 a    713 c    896 g   1135 t
ORIGIN      
        1 gaattcggca cgaggtcacc cggataggta aaggaaaaca tgcctgccac acggaagcca
       61 atgagatatg ggcatacaga gggacacacg gaggtctgtt ttgatgattc tgggagtttt
      121 attgtgactt gtggaagtga tggtgatgtg aggatttggg aagacttgga tgatgatgat
      181 cctaagttca ttaatgttgg agaaaaggca tattcatgtg ctttgaagag tggaaaactg
      241 gtcactgcag tttctaataa tactattcaa gtccacacat ttcctgaagg agttccagat
      301 ggtatattga ctcgcttcac tacaaatgca aaccatgtgg tctttaatgg ggatggtact
      361 aaaattgctg ctggatctag tgattttcta gtcaaaattg tggatgtgat ggatagcagc
      421 caacagaaaa catttcgagg acatgatgcc cctgttttaa gtctttcctt tgatcctaag
      481 gacatctttc tggcatcagc tagttgtgat ggatctgtca gagtgtggca aatttcagat
      541 cagacatgtg ctattagttg gccactgcta caaaaatgca acgatgtgat aaatgcaaaa
      601 tcaatctgca gacttgcttg gcagccaaaa agtgggaagt tactggcaat tcctgtggaa
      661 aaatctgtta agctatatag aagagaatct tggagtcatc aatttgatct ttcagataat
      721 ttcatctctc agaccctcaa tatagtaacc tggtctccct gtgggcaata tttagctgca
      781 ggtagtatta atggtctaat catagtttgg aatgtggaaa ccaaagactg catggaaagg
      841 gtgaaacatg agaaaggtta tgcaatttgt ggtctggcat ggcatcctac ttgtggtcga
      901 atatcgtata ctgatgcgga aggaaatcta gggcttctag agaatgtttg tgaccccagt
      961 ggaaagacat caagcagtaa ggtatctagc agagtggaaa aggattataa tgatcttttt
     1021 gatggagatg atatgagtaa tgctggtgat tttctaaatg acaatgcagt tgagatccct
     1081 tctttttcaa aagggattat aaatgatgat gaggatgatg aagacctcat gatggcttca
     1141 ggtcgtccta gacagcgaag tcacatccta gaagatgatg aaaactcagt tgatatttca
     1201 atgctaaaaa ctggttctag tcttctcaaa gaggaggagg aagatggtca agaaggcagc
     1261 attcacaatc taccacttgt aacatcccaa aggccatttt atgatggacc catgccaact
     1321 ccccggcaaa agccatttca gtcaggttct acaccgttgc atctcactca cagattcatg
     1381 gtgtggaact ctattggaat tattcgctgc tataatgatg agcaagacaa tgccatagat
     1441 gtggagttcc atgatacctc catacaccat gcaacacact tatcaaacac tttgaattat
     1501 acaatagcag atctttccca cgaagctatt ttgttggcat gtgaaagcac tgatgaacta
     1561 gcaagcaagc ttcactgcct gcactttagt tcttgggatt caagcaaaga gtggataata
     1621 gacttgcctc agaatgagga tattgaagcc atatgtctcg gtcaaggatg ggctgctgcc
     1681 gctactagtg ccctgcttct tcgattgttt actattggag gggttcaaaa agaggtattc
     1741 agccttgctg gacctgtggt gtcaatggca ggacatggag aacagctttt cattgtttat
     1801 cacagaggta caggatttga tggggatcag tgccttggag ttcaactgct agagctgggg
     1861 aaaaagaaaa aacaaatttt gcatggtgac cctcttcctc ttacaaggaa atcctacctt
     1921 gcatggattg ggttttcagc tgaaggtacc ccttgttacg tggattcaga aggaattgtt
     1981 cgaatgctta acagaggact tggtaatacg tggactccta tatgtaatac aagagagcac
     2041 tgcaaaggaa aatctgatca ctactgggtg gttggtatcc atgaaaatcc ccagcaacta
     2101 aggtgcattc cttgtaaagg ttctcggttt cccccaaccc ttccacgccc tgctgttgct
     2161 atattatcct ttaagcttcc ttactgtcag attgcaacag agaaaggaca aatggaggag
     2221 caattttggc gttcagttat atttcacaac caccttgatt atttagctaa aaatggttat
     2281 gaatatgaag agagcactaa aaatcaagca acaaaagagc aacaggaact tttaatgaaa
     2341 atgcttgcgc tttcttgtaa actggagcga gaattccgtt gtgtggaact tgctgatcta
     2401 atgactcaaa atgctgtgaa tttagccatt aaatatgctt ctcgctctcg gaaattaata
     2461 ctggctcaaa aactaagtga actggctgta gagaaggcag ccgaattgac agcaacccag
     2521 gtggaagagg aagaagaaga agaagatttc agaaaaaagc tgaatgctgg ttacagcaat
     2581 actgctacag agtggagcca accaaggttc agaaatcaag ttgaagaaga tgctgaggac
     2641 agtggagaag ctgatgatga agaaaaacca gaaatacata agcctggaca gaactcgttt
     2701 tccaaaagta caaattcctc tgatgtttca gctaagtcag gtgcagttac ctttagcagc
     2761 caaggacgag taaatccctt taaggtatca gccagttcca aagaaccagc catgtcaatg
     2821 aattcagcac gttcaactaa tattttagac aatatgggca aatcatccaa gaaatccact
     2881 gcacttagtc gaactacaaa taatgaaaag tctcccatta taaagcctct gattccaaag
     2941 ccgaagccta agcaggcatc tgcagcatcc tatttccaga aaagaaattc tcaaactaat
     3001 aaaactgagg aagtgaaaga agaaaatctt aaaaatgtat tatctgaaac cccagctata
     3061 tgtcctcctc aaaacactga aaaccaaagg ccaaagaccg ggttccagat gtggttagaa
     3121 gaaaatagaa gtaatatttt gtctgacaat cctgactttt cagatgaagc agacataata
     3181 aaagaaggaa tgattcgatt tagagtattg tcaactgaag aaagaaaggt gtgggctaac
     3241 aaagccaaag gagaaacggc aagtgaagga actgaagcaa agaagcgaaa acgtgtggtt
     3301 gatgaaagtg atgaaacaga aaaccaggaa gaaaaagcaa aagagaacct gaatttgtct
     3361 aaaaagcaga aacctttaga tttttctaca aatcagaaac tatcagcttt tgcatttaag
     3421 caggagtaaa ggaagaaagt gaccctaggg aagtaatgga ttttttttac tcatctttga
     3481 atatagactc gagtctttgg gaaactcatt atatatatat tttttaaaga gtttgaagca
     3541 actgtttgtc tttataagat aatgtagtaa ttatattggt gtaggtaaca ggacatatgt
     3601 aaaaactatc atctttgcag attactctgc ctccaaatgc agggcctttc agagatgcat
     3661 tgtgattgta attactgagt tgaagctcca accaatttga atttgtttct taaccttgaa
     3721 aaatcattaa agccaaggta ttaaaacctt tgtgcattaa taccttctag gggtttggtt
     3781 catttggttt ttgtcatgtg caaggaagga caatagtcct ctttccaagt gtgttagcat
     3841 agacttctct atatgtttct actagaccta ggggatgacg tcttttaata atactggccc
     3901 taaacatgta aataatcttg taggtgagac tttttctttt gtgtttcgga aatttcctat
     3961 gtggctttca gttgtctgtt tgtatagcct ggattttttt gaggtaaatg aaactttctc
     4021 atttgtaaaa aaaaaaaaaa aaaactcgag
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerProbeSetProbeSetProteinProteinPubMedPubMedTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AJ251595. Homo sapiens mRNA...[gi:6491738] Links  


LOCUS       HSA251595               3091 bp    mRNA    linear   PRI 29-NOV-1999
DEFINITION  Homo sapiens mRNA for transmembrane glycoprotein (CD44 gene).
ACCESSION   AJ251595
VERSION     AJ251595.1  GI:6491738
KEYWORDS    alternative splicing; CD44 gene; transmembrane glycoprotein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Gunthert,U.
  TITLE     CD44: a multitude of isoforms with diverse functions
  JOURNAL   Curr. Top. Microbiol. Immunol. 184, 47-63 (1993)
  MEDLINE   94147793
REFERENCE   2  (bases 1 to 3091)
  AUTHORS   Gunthert,U.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-NOV-1999) Gunthert U., Basel Institute for
            Immunology, Grenzacherstrasse 487, CH-4005 Basel, SWITZERLAND
FEATURES             Location/Qualifiers
     source          1..3091
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..3091
                     /gene="CD44"
     prim_transcript <1..>3091
                     /gene="CD44"
     exon            <1..245
                     /gene="CD44"
                     /number=1
     CDS             179..2407
                     /gene="CD44"
                     /function="hyaluronate receptor"
                     /codon_start=1
                     /product="transmembrane glycoprotein"
                     /protein_id="CAB61878.1"
                     /db_xref="GI:6491739"
                     /translation="MDKFWWHAAWGLCLVPLSLAQIDLNITCRFAGVFHVEKNGRYSI
                     SRTEAADLCKAFNSTLPTMAQMEKALSIGFETCRYGFIEGHVVIPRIHPNSICAANNT
                     GVYILTYNTSQYDTYCFNASAPPEEDCTSVTDLPNAFDGPITITIVNRDGTRYVQKGE
                     YRTNPEDIYPSNPTDDDVSSGSSSERSSTSGGYIFYTFSTVHPIPDEDSPWITDSTDR
                     IPATTLMSTSATATETATKRQEAWDWFSWLFLPSESKNHLHTTTQMAGTSSNTISAGW
                     EPNEENEDERDRHLSFSGSGIDDDEDFISSTISTTPRAFDHTKQNQDWTQWNPSHSNP
                     EVLLQTTTRMTDVDRNGTTAYEGNWNPEAHPPLIHHEHHEEEETPHSTSTIQATPSST
                     TEETATQKEQWFGNRWHEGYRQTPREDSHSTTGTAAASAHTSHPMQGRTTPSPEDSSW
                     TDFFNPISHPMGRGHQAGRRMDMDSSHSTTLQPTANPNTGLVENLDRTGPLSMTTQQS
                     NSQSFSTSHEGLEEDKDHPTTSTLTSSNRNDVTGGRRDPNHSEGSTTLLEGYTSHYPH
                     TKESRTFIPVTSAKTGSFGVTAVTVGDSNSNVNRSLSGDQDTFHPSGGSHTTHGSESD
                     GHSHGSQEGGANTTSGPIRTPQIPEWLIILASLLALALILAVCIAVNSRRRCGQKKKL
                     VINSGNGAVEDRKPSGLNGEASKSQEMVHLVNKESSETPDQFMTADETRNLQNVDMKI
                     GV"
     sig_peptide     179..249
                     /gene="CD44"
     exon            246..431
                     /gene="CD44"
                     /number=2
     mat_peptide     250..2404
                     /gene="CD44"
                     /product="transmembrane glycoprotein"
     exon            432..545
                     /gene="CD44"
                     /number=3
     exon            546..614
                     /gene="CD44"
                     /number=4
     exon            615..845
                     /gene="CD44"
                     /number=5
     exon            846..974
                     /gene="CD44"
                     /note="alternative"
                     /number=7
     exon            975..1100
                     /gene="CD44"
                     /note="alternative"
                     /number=8
     exon            1101..1214
                     /gene="CD44"
                     /note="alternative"
                     /number=9
     exon            1215..1331
                     /gene="CD44"
                     /note="alternative"
                     /number=10
     exon            1332..1460
                     /gene="CD44"
                     /note="alternative"
                     /number=11
     exon            1461..1592
                     /gene="CD44"
                     /note="alternative"
                     /number=12
     exon            1593..1694
                     /gene="CD44"
                     /note="alternative"
                     /number=13
     exon            1695..1784
                     /gene="CD44"
                     /note="alternative"
                     /number=14
     exon            1785..1988
                     /gene="CD44"
                     /note="alternative"
                     /number=15
     exon            1989..2051
                     /gene="CD44"
                     /number=16
     exon            2052..2126
                     /gene="CD44"
                     /number=17
     exon            2127..2202
                     /gene="CD44"
                     /number=18
     exon            2203..>3091
                     /gene="CD44"
                     /number=20
BASE COUNT      901 a    792 c    679 g    719 t
ORIGIN      
        1 gggagaccca agcttctaga gatccctcga cctcgagatc cattgtgctc taaagagcgg
       61 accccagcct ctgccaggtt cggtccgcca tcctcgtccc gtcctccgcc ggcccctgcc
      121 ccgcgcccag ggatcctcca gctcctttcg cccgcgccct ccgttcgctc cggacaccat
      181 ggacaagttt tggtggcacg cagcctgggg actctgcctc gtgccgctga gcctggcgca
      241 gatcgatttg aatataacct gccgctttgc aggtgtattc cacgtggaga aaaatggtcg
      301 ctacagcatc tctcggacgg aggccgctga cctctgcaag gctttcaata gcaccttgcc
      361 cacaatggcc cagatggaga aagctctgag catcggattt gagacctgca ggtatgggtt
      421 catagaaggg catgtggtga ttccccggat ccaccccaac tccatctgtg cagcaaacaa
      481 cacaggggtg tacatcctca catacaacac ctcccagtat gacacatatt gcttcaatgc
      541 ttcagctcca cctgaagaag attgtacatc agtcacagac ctgcccaatg cctttgatgg
      601 accaattacc ataactattg ttaaccgtga tggcacccgc tatgtccaga aaggagaata
      661 cagaacgaat cctgaagaca tctaccccag caaccctact gatgatgacg tgagcagcgg
      721 ctcctccagt gaaaggagca gcacttcagg aggttacatc ttttacacct tttctactgt
      781 acaccccatc ccagacgaag acagtccctg gatcaccgac agcacagaca gaatccctgc
      841 taccactttg atgagcacta gtgctacagc aactgagaca gcaaccaaga ggcaagaagc
      901 ctgggattgg ttttcatggt tgtttctacc atcagagtca aagaatcatc ttcacacaac
      961 aacacaaatg gctggtacgt cttcaaatac catctcagca ggctgggagc caaatgaaga
     1021 aaatgaagat gaaagagaca gacacctcag tttttctgga tcaggcattg atgatgatga
     1081 agattttatc tccagcacca tttcaaccac accacgggcc tttgaccaca caaaacagaa
     1141 ccaggactgg acccagtgga acccaagcca ttcaaatccg gaagtgctac ttcagacaac
     1201 cacaaggatg actgatgtag acagaaatgg caccactgct tatgaaggaa actggaaccc
     1261 agaagcacac cctcccctca ttcaccatga gcatcatgag gaagaagaga ccccacattc
     1321 tacaagcaca atccaggcaa ctcctagtag tacaacggaa gaaacagcta cccagaagga
     1381 acagtggttt ggcaacagat ggcatgaggg atatcgccaa acacccagag aagactccca
     1441 ttcgacaaca gggacagctg cagcctcagc tcataccagc catccaatgc aaggaaggac
     1501 aacaccaagc ccagaggaca gttcctggac tgatttcttc aacccaatct cacaccccat
     1561 gggacgaggt catcaagcag gaagaaggat ggatatggac tccagtcata gtacaacgct
     1621 tcagcctact gcaaatccaa acacaggttt ggtggaaaat ttggacagga caggacctct
     1681 ttcaatgaca acgcagcaga gtaattctca gagcttctct acatcacatg aaggcttgga
     1741 agaagataaa gaccatccaa caacttctac tctgacatca agcaatagga atgatgtcac
     1801 aggtggaaga agagacccaa atcattctga aggctcaact actttactgg aaggttatac
     1861 ctctcattac ccacacacga aggaaagcag gaccttcatc ccagtgacct cagctaagac
     1921 tgggtccttt ggagttactg cagttactgt tggagattcc aactctaatg tcaatcgttc
     1981 cttatcagga gaccaagaca cattccaccc cagtgggggg tcccatacca ctcatggatc
     2041 tgaatcagat ggacactcac atgggagtca agaaggtgga gcaaacacaa cctctggtcc
     2101 tataaggaca ccccaaattc cagaatggct gatcatcttg gcatccctct tggccttggc
     2161 tttgattctt gcagtttgca ttgcagtcaa cagtcgaaga aggtgtgggc agaagaaaaa
     2221 gctagtgatc aacagtggca atggagctgt ggaggacaga aagccaagtg gactcaacgg
     2281 agaggccagc aagtctcagg aaatggtgca tttggtgaac aaggagtcgt cagaaactcc
     2341 agaccagttt atgacagctg atgagacaag gaacctgcag aatgtggaca tgaagattgg
     2401 ggtgtaacac ctacaccatt atcttggaaa gaaacaaccg ttggaaacat aaccattaca
     2461 gggagctggg acacttaaca gatgcaatgt gctactgatt gtttcattgc gaatcttttt
     2521 tagcataaaa ttttctactc tttttgtttt ttgtgttttg ttctttaaag tcaggtccaa
     2581 tttgtaaaaa cagcattgct ttgtaaatta gggcccaatt aataatcagc aagaatttga
     2641 tcgttcagtt ccacttggag gccttcatcc tcgggtgtgc tatggatggc ttctaacaaa
     2701 aactacacat atgtattcct gatcgccaac ctttccccca ccagctaagg acatttccca
     2761 gggttaatag ggcctggtcc ctgggaggaa atttgaatgg gtccattttg cccttccata
     2821 gcctaatccc tgggcattgc tttccactga ggttggggtg tactagttac acatcttcaa
     2881 cagaccccct ctagaaattt ttcagatgct tctgggagac accaaagggt gaagctattt
     2941 atctgtagta aactatttat ctgtgttttt gaaatattaa accctggatc agtcctttga
     3001 tcagtataat tttttaaagt tactttgtca gaggcacaaa agggtttaaa ctgattcata
     3061 ataaatatct gtacttcttc gatcttcaaa a
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&&





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U50062. Homo sapiens RIP ...[gi:3426026] Links  


LOCUS       HSU50062                2617 bp    mRNA    linear   PRI 18-AUG-1998
DEFINITION  Homo sapiens RIP protein kinase mRNA, complete cds.
ACCESSION   U50062
VERSION     U50062.1  GI:3426026
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2617)
  AUTHORS   Hsu,H., Huang,J., Shu,H.B., Baichwal,V. and Goeddel,D.V.
  TITLE     TNF-dependent recruitment of the protein kinase RIP to the TNF
            receptor-1 signaling complex
  JOURNAL   Immunity 4 (4), 387-396 (1996)
  MEDLINE   96200892
   PUBMED   8612133
REFERENCE   2  (bases 1 to 2617)
  AUTHORS   Huang,J., Hsu,H., Baichwal,V.R. and Goeddel,D.V.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-FEB-1996) Biology, Tularik Inc., 270 East Grand
            Avenue, South San Francisco, CA 94080, USA
REFERENCE   3  (bases 1 to 2617)
  AUTHORS   Huang,J., Hsu,H., Baichwal,V.R. and Goeddel,D.V.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-AUG-1998) Biology, Tularik Inc., 270 East Grand
            Avenue, South San Francisco, CA 94080, USA
  REMARK    Sequence update by submitter
COMMENT     On Aug 18, 1998 this sequence version replaced gi:1236942.
FEATURES             Location/Qualifiers
     source          1..2617
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /tissue_type="umbilical vein endothelium"
     CDS             1..2016
                     /note="Ser/Thr protein kinase; protein has death domain
                     sequence at the carboxyl terminus"
                     /codon_start=1
                     /product="RIP protein kinase"
                     /protein_id="AAC32232.1"
                     /db_xref="GI:3426027"
                     /translation="MQPDMSLNVIKMKSSDFLESAELDSGGFGKVSLCFHRTQGLMIM
                     KTVYKGPNCIEHNEALLEEAKMMNRLRHSRVVKLLGVIIEEGKYSLVMEYMEKGNLMH
                     VLKAEMSTPLSVKGRIILEIIEGMCYLHGKGVIHKDLKPENILVDNDFHIKIADLGLA
                     SFKMWSKLNNEEHNELREVDGTAKKNGGTLYYMAPEHLNDVNAKPTEKSDVYSFAVVL
                     WAIFANKEPYENAICEQQLIMCIKSGNRPDVDDITEYCPREIISLMKLCWEANPEARP
                     TFPGIEEKFRPFYLSQLEESVEEDVKSLKKEYSNENAVVKRMQSLQLDCVAVPSSRSN
                     SATEQPGSLHSSQGLGMGPVEESWFAPSLEHPQEENEPSLQSKLQDEANYHLYGSRMD
                     RQTKQQPRQNVAYNREEERRRRVSHDPFAQQRPYENFQNTEGKGTVYSSAASHGNAVH
                     QPSGLTSQPQVLYQNNGLYSSHGFGTRPLDPGTAGPRVWYRPIPSHMPSLHNIPVPET
                     NYLGNTPTMPFSSLPPTDESIKYTIYNSTGIQIGAYNYMEIGGTSSSLLDSTNTNFKE
                     EPAAKYQAIFDNTTSLTDKHLDPIRENLGKHWKNCARKLGFTQSQIDEIDHDYERDGL
                     KEKVYQMLQKWVMREGIKGATVGKLAQALHQCSRIDLLSSLIYVSQN"
BASE COUNT      794 a    586 c    659 g    574 t      4 others
ORIGIN      
        1 atgcaaccag acatgtcctt gaatgtcatt aagatgaaat ccagtgactt cctggagagt
       61 gcagaactgg acagcggagg ctttgggaag gtgtctctgt gtttccacag aacccaggga
      121 ctcatgatca tgaaaacagt gtacaagggg cccaactgca ttgagcacaa cgaggccctc
      181 ttggaggagg cgaagatgat gaacagactg agacacagcc gggtggtgaa gctcctgggc
      241 gtcatcatag aggaagggaa gtactccctg gtgatggagt acatggagaa gggcaacctg
      301 atgcacgtgc tgaaagccga gatgagtact ccgctttctg taaaaggaag gataattttg
      361 gaaatcattg aaggaatgtg ctacttacat ggaaaaggcg tgatacacaa ggacctgaag
      421 cctgaaaata tccttgttga taatgacttc cacattaaga tcgcagacct cggccttgcc
      481 tcctttaaga tgtggagcaa actgaataat gaagagcaca atgagctgag ggaagtggac
      541 ggcaccgcta agaagaatgg cggcaccctc tactacatgg cgcccgagca cctgaatgac
      601 gtcaacgcaa agcccacaga gaagtcggat gtgtacagct ttgctgtagt actctgggcg
      661 atatttgcaa ataaggagcc atatgaaaat gctatctgtg agcagcagtt gataatgtgc
      721 ataaaatctg ggaacaggcc agatgtggat gacatcactg agtactgccc aagagaaatt
      781 atcagtctca tgaagctctg ctgggaagcg aatccggaag ctcggccgac atttcctggc
      841 attgaagaaa aatttaggcc tttttattta agtcaattag aagaaagtgt agaagaggac
      901 gtgaagagtt taaagaaaga gtattcaaac gaaaatgcag ttgtgaagag aatgcagtct
      961 cttcaacttg attgtgtggc agtaccttca agccggtcaa attcagccac agaacagcct
     1021 ggttcactgc acagttccca gggacttggg atgggtcctg tggaggagtc ctggtttgct
     1081 ccttccctgg agcacccaca agaagagaat gagcccagcc tgcagagtaa actccaagac
     1141 gaagccaact accatcttta tggcagccgc atggacaggc agacgaaaca gcagcccaga
     1201 cagaatgtgg cttacaacag agaggaggaa aggagacgca gggtctccca tgaccctttt
     1261 gcacagcaaa gaccttacga gaattttcag aatacagagg gaaaaggcac tgtttattcc
     1321 agtgcagcca gtcatggtaa tgcagtgcac cagccctcag ggctcaccag ccaacctcaa
     1381 gtactgtatc agaacaatgg attatatagc tcacatggct ttggaacaag accactggat
     1441 ccaggaacag caggtcccag agtttggtac aggccaattc caagtcatat gcctagtctg
     1501 cataatatcc cagtgcctga gaccaactat ctaggaaata cacccaccat gccattcagc
     1561 tccttgccac caacagatga atctataaaa tataccatat acaatagtac tggcattcag
     1621 attggagcct acaattatat ggagattggt gggacgagtt catcactact agacagcaca
     1681 aatacgaact tcaaagaaga gccagctgct aagtaccaag ctatctttga taataccact
     1741 agtctgacgg ataaacacct ggacccaatc agggaaaatc tgggaaagca ctggaaaaac
     1801 tgtgcccgta aactgggctt cacacagtct cagattgatg aaattgacca tgactatgag
     1861 cgagatggac tgaaagaaaa ggtttaccag atgctccaaa agtgggtgat gagggaaggc
     1921 ataaagggag ccacggtggg gaagctggcc caggcgctcc accagtgttc caggatcgac
     1981 cttctgagca gcttgattta cgtcagccag aactaaccct ggatgggcta cggcagctga
     2041 agtggacgcc tcacttagcg gataacccca gaaagttggc tgcctcagag cattcagaat
     2101 tctgtcctca ctgatagggg ttctgtgtct gcagaaattt ngtttcctgt acttcatagc
     2161 tggagaatgg ggaaagaaat ctgcagcaaa ggggtctcac tctgttgcca ggctggtctc
     2221 aaacttctgg actcaagtga tcctcccgcc tcggccttcc aaagtgctgg gatatcaggc
     2281 actgagccac tgcgcccagt caacaatccg ntctgaggaa agcgtaagca ggaagacctc
     2341 ttaatggcat agcaccaata aaaaaatgac tcctagttgt gtttggaaag ggagagaaga
     2401 gatgtctgag gaaggtcatg ttctttcagc ttatggcatt tcctagagtt tngttgaagc
     2461 aagaagaaaa actcagagaa tataaaatca actttnaaaa ttgtgtgctc tcttcttcac
     2521 gtaggctcct gttaaaaaca aagtgcagtc agattctaag ccctgttcag agacttcgcg
     2581 gatcacagct gcagctcacc gccacatcac aggatcc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  



&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: BC009290. Homo sapiens, ATP...[gi:14424533] Links  


LOCUS       BC009290                1122 bp    mRNA    linear   PRI 12-JUL-2001
DEFINITION  Homo sapiens, ATPase, H+ transporting, lysosomal (vacuolar proton
            pump) 16kD, clone MGC:16615 IMAGE:4111426, mRNA, complete cds.
ACCESSION   BC009290
VERSION     BC009290.1  GI:14424533
KEYWORDS    MGC.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1122)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JUN-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site:       http://www.nisc.nih.gov/
            Contact:        nisc_mgc@nhgri.nih.gov
            Shevchenko,Y., Wetherby,K.D., Beckstrom-Sternberg,S.M.,
            Benjamin,B., Blakesley,R.W., Bouffard,G.G., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Guan,X., Gupta,J., Ho,S.-L., Karlins,E., Legaspi,R.,
            Lim,M., Maduro,Q.L., Masiello,C., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Snyder,B., Stantripop,S., Thomas,P.J.,
            Tiongson,E.E., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Zhang,L.-H. and Green,E.D.

            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 26 Row: c Column: 5.
FEATURES             Location/Qualifiers
     source          1..1122
                     /organism="Homo sapiens"
                     /db_xref="LocusID:527"
                     /db_xref="taxon:9606"
                     /clone="MGC:16615 IMAGE:4111426"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     CDS             149..616
                     /codon_start=1
                     /product="ATPase, H+ transporting, lysosomal (vacuolar
                     proton pump) 16kD"
                     /protein_id="AAH09290.1"
                     /db_xref="GI:14424534"
                     /translation="MSESKSGPEYASFFAVMGASAAMVFSALGAAYGTAKSGTGIAAM
                     SVMRPEQIMKSIIPVVMAGIIAIYGLVVAVLIANSLNDDISLYKSFLQLGAGLSVGLS
                     GLAAGFAIGIVGDAGVRGTAQQPRLFVGMILILIFAEVLGLYGLIVALILSTK"
BASE COUNT      202 a    391 c    296 g    233 t
ORIGIN      
        1 ggcacgaggg gtatttagag cgcagcggct gacgggccgg atcgccttcg ccgccgcccg
       61 cccgcaaacc ttcgtgcccg gcccgtcctc gcccccgcct ccgccaccgc ctcggcccgc
      121 agagcttgcc ccctccccac ccgcagacat gtccgagtcc aagagcggcc ccgagtatgc
      181 ttcgtttttc gccgtcatgg gcgcctcggc cgccatggtc ttcagcgccc tgggcgctgc
      241 ctatggcaca gccaagagcg gtaccggcat tgcggccatg tctgtcatgc ggccggagca
      301 gatcatgaag tccatcatcc cagtggtcat ggctggcatc atcgccatct acggcctggt
      361 ggtggcagtc ctcatcgcca actccctgaa tgacgacatc agcctctaca agagcttcct
      421 ccagctgggc gccggcctga gcgtgggcct gagcggcctg gcagccggct ttgccatcgg
      481 catcgtgggg gacgctggcg tgcggggcac cgcccagcag ccccgactat tcgtgggcat
      541 gatcctgatt ctcatcttcg ccgaggtgct cggcctctac ggtctcatcg tcgccctcat
      601 cctctccaca aagtagaccc tctccgagcc caccagccac agaatattat gtaaagacca
      661 cccctcctca ttccagaacg aacagcctga cacatacgca cggggccgcc gcccccagta
      721 gttggtcttg tacatgcgca gtgtcctagt gcccatcgtc tgtttccccg gccttgcccc
      781 cgcccgcccc gtgccgtgga catctgggcc cactcatcgc ccctccaggc ccccggcgcc
      841 ccacccccta gagtgctctg tgtatgcgga tgatttagaa ttgtcatttc tctttactgg
      901 atgtttattt ataaagatct ggcctgttcc tgcgtctgcg gagcggccct tgtctcccag
      961 ctatctataa ccttagctag agtgtcgcct tgtgggttcc tgttgctgag acttcctgga
     1021 tggagccgcc ctcaccgccg ggcccgtggc cctgcgcgga gctgtgtcca ataaagttct
     1081 tggatgtgaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 EST FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default        
 
 

1: BI548787. 603189188F1 NIH_M...[gi:15436099] Links  


IDENTIFIERS

dbEST Id:       9396101
EST name:       603189188F1
GenBank Acc:    BI548787
GenBank gi:     15436099

CLONE INFO
Clone Id:       IMAGE:5260677 (5')
Plate:          LLAM11656 Row: p Column: 22
DNA type:       cDNA

PRIMERS
PolyA Tail:     Unknown

SEQUENCE
                AGCGGCATTTTGTTCTGCGGTGCTGGTATTTAGAGCGCAGCGGCTGACGGGCCGGATCGC
                CTTCGCCGCCGCCCGCCCGCAAACCTTCGTGCCCGGCCCGTCCTCGCCCGCGCCTCCGCC
                ACCGCCTCGGCCCGCAGAGCTGGCCCCCTCCCCACCCGCAGACATGTCCGAGTCCAAGAG
                CGGCCCGAGTATGCTTCGTTTTTCGCCGTCATGGGCGCCTCGGCCGCCATGGTCTTCAGC
                GCCCTGGGCGCTGCCTATGGCACAGCCAAGAGCGGTACCGGCATTGCGGCCATGTCTGTC
                ATGCGGCCGGAGCAGATCATGAAGTCCATCATCCCAGTGGTCATGGCTGGCATCATCGCC
                ATCTACGGCCTGGTGGTGGCAGTCCTCATCGCCAACTCCCTGAATGACGACATCAGCCTC
                TACAAGAGCTTCCTCCAGCTGGGCGCCGGCCTGAGCGTGGGCCTGAGCGGCCTGGCAGCC
                GGCTTTGCCATCGGCATCGTGGGGGACGCTGGCGTGCGGGGCACCGACCAGCAGAACCCG
                ACTATACGTGGGCATGATCCTGATTCTCATCTTCGCCGAGGTGCTCGGCCTCTACGGACT
                CATCGTCGCCCTCATCCTCTCCACAAAGTAGAACCA
Quality:        High quality sequence stops at base: 628

Entry Created:  Sep 4 2001
Last Updated:   Sep 5 2001

COMMENTS
                Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
                cDNA Library Preparation: Michael J. Brownstein (NHGRI),
                Shiraki Toshiyuki and Piero Carninci (RIKEN)
                cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
                DNA Sequencing by: Incyte Genomics, Inc.
                Clone distribution: MGC clone distribution information can
                be found through the I.M.A.G.E. Consortium/LLNL at:
                http://image.llnl.gov

LIBRARY
Lib Name:       NIH_MGC_95
Organism:       Homo sapiens
Organ:          brain
Tissue type:    hippocampus
Lab host:       DH10B
Vector:         pBluescriptR (modified pBluescript KS+)
R. Site 1:      BamHI
R. Site 2:      SalI-XhoI (gtcgag)
Description:    Oligo-dT primed using primer 5'-TTTTTTTTTTTTTTTTVN-3',
                size-selected for average insert size 2.5 kb and normalized
                to ROT 5. This is a primary library enriched for full-length
                clones and constructed using the Cap-trapper method
                (Carninci, in preparation). Library constructed by M.
                Brownstein (NIMH/NHGRI, National Institutes of Health).
                Note: this is a NIH_MGC Library.

SUBMITTER
Name:           Robert Strausberg, Ph.D.
E-mail:         cgapbs-r@mail.nih.gov

CITATIONS
Title:          National Institutes of Health, Mammalian Gene Collection
                (MGC)
Authors:        NIH-MGC http://mgc.nci.nih.gov/
Year:           1999
Status:         Unpublished


MAP DATA
--------------------------------------------------------------------------------



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Map ViewerMap ViewerOMIMOMIMTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  









&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: D21853. Human mRNA for KI...[gi:434770] Links  


LOCUS       HUMORFJA                1682 bp    mRNA    linear   PRI 06-OCT-2001
DEFINITION  Human mRNA for KIAA0111 gene, complete cds.
ACCESSION   D21853
VERSION     D21853.1  GI:434770
KEYWORDS    KIAA0111.
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S.,
            Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N.
  TITLE     Prediction of the coding sequences of unidentified human genes.
            III. The coding sequences of 40 new genes (KIAA0081-KIAA0120)
            deduced by analysis of cDNA clones from human cell line KG-1
  JOURNAL   DNA Res. 2 (1), 37-43 (1995)
  MEDLINE   95308325
REFERENCE   2  (bases 1 to 1682)
  AUTHORS   Ohara,O., Nagase,T., Kikuno,R. and Nomura,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-OCT-1993) Osamu Ohara, Kazusa DNA Research Institute;
            1532-3, Yana, Kisarazu, Chiba 292-0812, Japan
            (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913)
FEATURES             Location/Qualifiers
     source          1..1682
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="HA0659"
                     /sex="male"
                     /cell_line="KG-1"
                     /cell_type="myeloblast"
     gene            1..1682
                     /gene="KIAA0111"
     5'UTR           1..214
                     /gene="KIAA0111"
     CDS             215..1450
                     /gene="KIAA0111"
                     /codon_start=1
                     /protein_id="BAA04879.1"
                     /db_xref="GI:2104219"
                     /translation="MATTATMATSGSARKRLLKEEDMTKVEFETSEEVDVTPTFDTMG
                     LREDLLRGIYAYGFEKPSAIQQRAIKQIIKGRDVIAQSQSGTGKTATFSISVLQCLDI
                     QVRETQALILAPTRELAVQIQKGLLALGDYMNVQCHACIGGTNVGEDIRKLDYGQHVV
                     AGTPGRVFDMIRRRSLRTRAIKMLVLDEADEMLNKGFKEQIYDVYRYLPPATQVVLIS
                     ATLPHEILEMTNKFMTDPIRILVKRDELTLEGIKQFFVAVEREEWKFDTLCDLYDTLT
                     ITQAVIFCNTKRKVDWLTEKMREANFTVSSMHGDMPQKERESIMKEFRSGASRVLIST
                     DVWARGLDVPQVSLIINYDLPNNRELYIHRIGRSGRYGRKGVAINFVKNDDIRILRDI
                     EQYYSTQIDEMPMNVADLI"
     3'UTR           1451..1682
                     /gene="KIAA0111"
BASE COUNT      429 a    399 c    474 g    380 t
ORIGIN      
        1 cagcggcaca gcgaggtcgg cagcggcaca gcgaggtcgg cagcggcaca gcgaggtcgg
       61 cagcggcaca gcgaggtcgg cagcggcaca gcgaggtcgg cagcggcaca gcgaggtcgg
      121 cagcggcagc gaggtcggca gcggcacagc gaggtcggca gcggcagcga ggtcggcagc
      181 ggcgcgcgct gtgctcttcc gcggactctg aatcatggcg accacggcca cgatggcgac
      241 ctcgggctcg gcgcgaaagc ggctgctcaa agaggaagac atgactaaag tggaattcga
      301 gaccagcgag gaggtggatg tgacccccac gttcgacacc atgggcctgc gggaggacct
      361 gctgcggggc atctacgctt acggttttga aaaaccatca gcaatccagc aacgagcaat
      421 caagcagatc atcaaaggga gagatgtcat cgcacagtct cagtccggca caggaaaaac
      481 agccaccttc agtatctcag tcctccagtg tttggatatt caggttcgtg aaactcaagc
      541 tttgatcttg gctcccacaa gagagttggc tgtgcagatc cagaaggggc tgcttgctct
      601 cggtgactac atgaatgtcc agtgccatgc ctgcattgga ggcaccaatg ttggcgagga
      661 catcaggaag ctggattacg gacagcatgt tgtcgcgggc actccagggc gtgtttttga
      721 tatgattcgt cgcagaagcc taaggacacg tgctatcaaa atgttggttt tggatgaagc
      781 tgatgaaatg ttgaataaag gtttcaaaga gcagatttac gatgtataca ggtacctgcc
      841 tccagccaca caggtggttc tcatcagtgc cacgctgcca cacgagattc tggagatgac
      901 caacaagttc atgaccgacc caatccgcat cttggtgaaa cgtgatgaat tgactctgga
      961 aggcatcaag caatttttcg tggcagtgga gagggaagag tggaaatttg acactctgtg
     1021 tgacctctac gacacactga ccatcactca ggcggtcatc ttctgcaaca ccaaaagaaa
     1081 ggtggactgg ctgacggaga aaatgaggga agccaacttc actgtatcct caatgcatgg
     1141 agacatgccc cagaaagagc gggagtccat catgaaggag ttccggtcgg gcgccagccg
     1201 agtgcttatt tctacagatg tctgggccag ggggttggat gtccctcagg tgtccctcat
     1261 cattaactat gatctcccta ataacagaga attgtacata cacagaattg ggagatcagg
     1321 tcgatacggc cggaagggtg tggccattaa ctttgtaaag aatgacgaca tccgcatcct
     1381 cagagatatc gagcagtact attccactca gattgatgag atgccgatga acgttgctga
     1441 tcttatctga agcagcagat cagtgggatg agggagactg ttcacctgct gtgtactcct
     1501 gtttggaagt atttagatcc agattctact taatggggtt tatatggact ttcttctcat
     1561 aaatggcctg ccgtctccct tcctttgaag aggatatggg gattctgctc tcttttctta
     1621 tttacatgta aataatacat tgttctaagt ctttttcatt aaaaatttaa aacttttccc
     1681 at
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U22055. Human 100 kDa coa...[gi:799176] Links  


LOCUS       HSU22055                3480 bp    mRNA    linear   PRI 03-NOV-1995
DEFINITION  Human 100 kDa coactivator mRNA, complete cds.
ACCESSION   U22055
VERSION     U22055.1  GI:799176
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3480)
  AUTHORS   Tong,X., Drapkin,R., Yalamanchili,R., Mosialos,G. and Kieff,E.
  TITLE     The Epstein-Barr virus nuclear protein 2 acidic domain forms a
            complex with a novel cellular coactivator that can interact with
            TFIIE
  JOURNAL   Mol. Cell. Biol. 15 (9), 4735-4744 (1995)
  MEDLINE   95379816
   PUBMED   7651391
REFERENCE   2  (bases 1 to 3480)
  AUTHORS   Tong,X.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAR-1995) Xiao Tong, Dept. of Microbiology and
            Molecular Genetics, Harvard University, 75 Francis St., Boston, MA
            02115, USA
FEATURES             Location/Qualifiers
     source          1..3480
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_type="EBV transformed B cells (IB4 cells)"
     CDS             268..2925
                     /function="associates with the EBV nuclear protein 2
                     acidic domain"
                     /codon_start=1
                     /product="100 kDa coactivator"
                     /protein_id="AAA80488.1"
                     /db_xref="GI:799177"
                     /translation="MVLSGCAIIVRGQPRGGPPPERQINLSNIRAGNLARRAAATQPD
                     AKDTPDEPWAFPAREFLRKKLIGKEVCFTIENKTPQGREYGMIYLGKDTNGENIAESL
                     VAEGLATRREGMRANNPEQNRLSECEEQAKAAKKGMWSEGNGSHTIRDLKYTIENPRH
                     FVDSHHQKPVNAIIEHVRDGSVVRALLLPDYYLVTVMLSGIKCPTFRREADGSETPEP
                     FAAEAKFFTESRLLQRDVQIILESCHNQNIVGTILHPNGNITELLLKEGFARCVDWSI
                     AVYTRGAEKLRAAERFAKERRLRIWRDYVAPTANLDQKDKQFVAKVMQVLNADAIVVK
                     LNSGDYKTIHLSSIRPPRLEGENTQDKNKKLRPLYDIPYMFEAREFLRKKLIGKKVNV
                     TVDYIRPASPATETVPAFSERTCATVTIGGINIAEALVSKGLATVIRYRQDDDQRSSH
                     YDELLAAEARAIKNGKGLHSKKEVPIHRVADISGDTQKAKQFLPFLQRAGRSEAVVEY
                     VFSGSRLKLYLPKETCLITFLLAGIECPRGARNLPGLVQEGEPFSEEATLFTKELVLQ
                     REVEVEVESMDKAGNFIGWLHIDGANLSVLLVEHALSKVHFTAERSSYYKSLLSAEEA
                     AKQKKEKVWAHYEEQPVEEVMPVLEEKERSASYKPVFVTEITDDLHFYVQDVETGTQF
                     QKLMENMRNDIASHPPVEGSYAPRRGEFCIAKFVDGEWYRARVEKVESPAKIHVFYID
                     YGNREVLPSTRLGTLSPAFSTRVLPAQATEYAFAFIQVPQDDDARTDAVDSVVRDIQN
                     TQCLLNVEHLSAGCPHVTLQFADSKGDVGLGLVKEGLVMVEVRKEKQFQKVITEYLNA
                     QESAKSARLNLWRYGDFRADDADEFGYSR"
BASE COUNT      838 a    934 c    961 g    747 t
ORIGIN      
        1 ggcggagatc gcgtctcttt cgctccgtgt ccgctgctgc tcctgtgagc gcccggcgag
       61 tccgtcccgt ccaccgtccg cagctggtag ccagcctgcc cctcgcctcg actccctttc
      121 accaacaccg acacccacat tgacacctcc agtccggcca gccgctccac tcgttgcctt
      181 tgcatctcca cacatggcgt cctcgcgcag agcggcggct cctccggggg acccgcggtc
      241 cccaccgtgc agcggggcat catcaagatg gtcctctcag ggtgcgccat cattgtccga
      301 ggtcagcctc gtggtgggcc tcctcctgag cggcagatca acctcagcaa cattcgtgct
      361 ggaaatcttg ctcgccgggc agccgccaca caacctgatg caaaggatac ccctgatgag
      421 ccctgggcat ttccagctcg agagttcctt cgaaagaagc tgattgggaa ggaagtctgt
      481 ttcacgatag aaaacaagac tccccagggg cgagagtatg gcatgatcta ccttggaaaa
      541 gataccaatg gggaaaacat tgcagaatca ctggttgcag agggcttagc cacccggaga
      601 gaaggcatga gagctaataa tcctgagcag aaccggcttt cagaatgtga agaacaagca
      661 aaggcagcca agaaagggat gtggagtgag gggaacggtt cacatactat ccgggatctc
      721 aagtatacca ttgaaaaccc aaggcacttt gtggactcac accaccagaa gcctgttaat
      781 gctatcatcg agcatgtgcg ggacggcagt gtggtcaggg ccctgctcct cccagattac
      841 tacctggtta cagtcatgct gtcaggcatc aagtgcccaa cttttcgacg ggaagcagat
      901 ggcagtgaaa ctccagagcc ttttgctgca gaagccaaat ttttcactga gtcgcgactg
      961 cttcagagag atgttcagat cattctggag agctgccaca accagaacat tgtgggtacc
     1021 atccttcatc caaatggcaa catcacagag ctcctcctga aggaaggttt cgcacgctgt
     1081 gtggactggt cgattgcagt ttacacccgg ggcgcagaaa agctgagggc ggcagagagg
     1141 tttgccaaag agcgcaggct gagaatatgg agagactatg tggctcccac agctaatttg
     1201 gaccaaaagg acaagcagtt tgttgccaag gtgatgcagg ttctgaatgc tgatgccatt
     1261 gttgtgaagc tgaactcagg cgattacaag acgattcacc tgtccagcat ccgaccaccg
     1321 aggctggagg gggagaacac ccaggataag aacaagaaac tgcgtcccct gtatgacatt
     1381 ccttacatgt ttgaggcccg ggaatttctt cgaaaaaagc ttattgggaa gaaggtcaat
     1441 gtgacggtgg actacattag accagccagc ccagccacag agacagtgcc tgccttttca
     1501 gagcgtacct gtgccactgt caccattgga ggaataaaca ttgctgaggc tcttgtcagc
     1561 aaaggtctag ccacagtgat cagataccgg caggatgatg accagagatc atcacactac
     1621 gatgaactgc ttgctgcaga ggccagagct attaagaatg gcaaaggatt gcatagcaag
     1681 aaggaagtgc ctatccaccg tgttgcagat atatctgggg atacccaaaa agcaaagcag
     1741 ttcctgcctt ttcttcagcg ggcaggtcgt tctgaagctg tggtggaata cgtcttcagt
     1801 ggttctcgtc tcaaactcta tttgccaaag gaaacttgcc ttatcacctt cttgcttgca
     1861 ggcattgaat gccccagagg agcccgaaac ctcccaggct tggtgcagga aggagagccc
     1921 ttcagcgagg aagctacact tttcaccaag gaactggtgc tgcagcgaga ggtggaggtg
     1981 gaggtggaga gcatggacaa ggccggcaac tttatcggct ggctgcacat cgacggtgcc
     2041 aacctgtccg tcctgctggt ggagcacgcg ctctccaagg tccacttcac cgccgaacgc
     2101 agctcctact acaagtccct gctgtctgcc gaggaggccg caaagcagaa gaaagagaag
     2161 gtctgggccc actatgagga gcagcccgtg gaggaggtga tgccagtgct ggaggagaag
     2221 gagcgatctg ctagctacaa gcccgtgttt gtgaccgaga tcactgatga cctgcacttc
     2281 tacgtgcagg atgtggagac cggcacccag ttccagaagc tgatggagaa catgcgcaat
     2341 gacattgcca gtcacccccc tgtagagggc tcctatgccc cccgcagggg agagttctgc
     2401 attgccaaat ttgtagatgg agaatggtac cgtgcccgag tagagaaagt cgagtctcct
     2461 gccaaaatac atgtcttcta cattgactac ggcaacagag aggtcctgcc atccacccgc
     2521 ctgggtaccc tatcacctgc cttcagcact cgggtgctgc cagctcaagc cacggagtat
     2581 gccttcgcct tcatccaggt gccccaagat gatgatgccc gcacggacgc cgtggacagc
     2641 gtagttcggg atatccagaa cactcagtgc ctgctcaacg tggaacacct gagtgccggc
     2701 tgcccccatg tcaccctgca gtttgcagat tccaagggcg atgtggggct gggcttggtg
     2761 aaggaagggc tggtcatggt ggaggtgcgc aaggagaaac agttccagaa agtgatcaca
     2821 gaatacctga atgcccaaga gtcagccaag agcgccaggc tgaacctgtg gcgctatgga
     2881 gactttcgag ctgatgatgc agacgaattt ggctacagcc gctaaggagg ggatcgggtt
     2941 tggcccccag cccccgtcac gccagtccct cttcctctgc cgggagggtg ttttcaactc
     3001 caaaccccag agaggggttg tacattgggt ccagctttgc ttcagtgtgt ggaaatgtct
     3061 cgtggggtgg catcggggct gcggggtggg gaccccaagg ctttctgggg cagacccttg
     3121 tcctctggga tgatgggcac tgctatccac agtctctgcc agttggtttt atttggaggt
     3181 ttgtgggctt ttttaaaaaa aaaaaagtcc tcaaatcagg aagaaacatc aaagactatg
     3241 tcctagtgga gggagtaatc ctaacaccca ggctggccgc cagctggcac ctgcctctat
     3301 cccagactgc cctcgtccca gctctctgtc caactgttga ttatgtgatt tttctgatac
     3361 gtccattctc aaatgccagt gtgttcacat cttcgctctg gccagcccat tctgtattta
     3421 aagctttttg aggcccaata aaatagtacg tgctgctgca gcccttattg atcaaaaaaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  




&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: J00129. Human fibrinogen ...[gi:182429] Links  


LOCUS       HUMFBRB                 1883 bp    mRNA    linear   PRI 08-NOV-1994
DEFINITION  Human fibrinogen beta-chain mRNA, partial cds.
ACCESSION   J00129
VERSION     J00129.1  GI:182429
KEYWORDS    beta-fibrinogen; fibrin; fibrinogen; glycoprotein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1883)
  AUTHORS   Chung,D.W., Que,B.G., Rixon,M.W., Mace,M. Jr. and Davie,E.W.
  TITLE     Characterization of complementary deoxyribonucleic acid and genomic
            deoxyribonucleic acid for the beta chain of human fibrinogen
  JOURNAL   Biochemistry 22 (13), 3244-3250 (1983)
  MEDLINE   83283433
   PUBMED   6688356
COMMENT     Original source text: Human liver, cDNA to mRNA, clone
            pHI-beta-[1-6].
            The authors identified three potential translation initiation
            codons.  Two of these codons were located upstream of position 1,
            while the third is located at positions 19-21.  The exact
            initiation codon was not confirmed.
FEATURES             Location/Qualifiers
     source          1..1883
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="4q28"
     gene            1..1883
                     /gene="FGB"
     mRNA            <1..1883
                     /gene="FGB"
                     /note="b-fibrinogen mRNA (alt.) [2]"
     mRNA            <1..1621
                     /gene="FGB"
                     /note="b-fibrinogen mRNA (alt.)"
     mRNA            <1..1618
                     /gene="FGB"
                     /note="b-fibrinogen mRNA (alt.)"
     mRNA            <1..1552
                     /gene="FGB"
                     /note="b-fibrinogen mRNA (alt.)"
     CDS             <1..1452
                     /gene="FGB"
                     /note="beta-fibrinogen precursor"
                     /codon_start=1
                     /protein_id="AAA52429.1"
                     /db_xref="GI:182430"
                     /db_xref="GDB:G00-119-130"
                     /translation="FHKLKTMKHLLLLLLCVFLVKSQGVNDNEEGFFSARGHRPLDKK
                     REEAPSLRPAPPPISGGGYRARPAKAAATQKKVERKAPDAGGCLHADPDLGVLCPTGC
                     QLQEALLQQERPIRNSVDELNNNVEAVSQTSSSSFQYMYLLKDLWQKRQKQVKDNENV
                     VNEYSSELEKHQLYIDETVNSNIATNLRVLRSILENLRSKIQKLESDVSAQMEYCRTP
                     CTVSCNIPVVSGKECEEIIRKGGETSEMYLIQPDSSVKPYRVYCDMNTENGGWTVIQN
                     RQDGSVDFGRKWDPYKQGFGNVATNTDGKNYCGLPGEYWLGNDKISQLTRMGPTELLI
                     EMEDWKGDKVKAHYGGFTVQNEANKYQISVNKYRGTAGNALMDGASQLMGENRTMTIH
                     NGMFFSTYDRDNDGWLTSDPRKQCSKEDGGGWWYNRCHAANPNGRYYWGGQYTWDMAK
                     HGTDDGVVWMNWKGSWYSMRKMSMKIRPFFPQQ"
     sig_peptide     <1..66
                     /gene="FGB"
                     /note="beta-fibrinogen signal peptide"
     mat_peptide     67..1449
                     /gene="FGB"
                     /product="beta-fibrinogen"
BASE COUNT      612 a    351 c    431 g    489 t
ORIGIN      114 bp upstream of TaqI site; chromosome 4q31.
        1 ttccacaaac ttaaaaccat gaaacatcta ttattgctac tattgtgtgt ttttctagtt
       61 aagtcccaag gtgtcaacga caatgaggag ggtttcttca gtgcccgtgg tcatcgaccc
      121 cttgacaaga agagagaaga ggctcccagc ctgaggcctg ccccaccgcc catcagtgga
      181 ggtggctatc gggctcgtcc agccaaagca gctgccactc aaaagaaagt agaaagaaaa
      241 gcccctgatg ctggaggctg tcttcacgct gacccagacc tgggggtgtt gtgtcctaca
      301 ggatgtcagt tgcaagaggc tttgctacaa caggaaaggc caatcagaaa tagtgttgat
      361 gagttaaata acaatgtgga agctgtttcc cagacctcct cttcttcctt tcagtacatg
      421 tatttgctga aagacctgtg gcaaaagagg cagaagcaag taaaagataa tgaaaatgta
      481 gtcaatgagt actcctcaga actggaaaag caccaattat atatagatga gactgtgaat
      541 agcaatatcg caactaacct tcgtgtgctt cgttcaatcc tagaaaacct gagaagcaaa
      601 atacaaaagt tagaatctga tgtctcagct caaatggaat attgtcgcac cccatgcact
      661 gtcagttgca atattcctgt ggtgtctggc aaagaatgtg aggaaattat caggaaagga
      721 ggtgaaacat ctgaaatgta tctcattcaa cctgacagtt ctgtcaaacc gtatagagta
      781 tactgtgaca tgaatacaga aaatggagga tggacagtga ttcagaaccg tcaagacggt
      841 agtgttgact ttggcaggaa atgggatcca tataaacagg gatttggaaa tgttgcaacc
      901 aacacagatg ggaagaatta ctgtggccta ccaggtgaat attggcttgg aaatgataaa
      961 attagccagc ttaccaggat gggacccaca gaacttttga tagaaatgga ggactggaaa
     1021 ggagacaaag taaaggctca ctatggagga ttcactgtac agaatgaagc caacaaatac
     1081 cagatctcag tgaacaaata cagaggaaca gccggtaatg ccctcatgga tggagcatct
     1141 cagctgatgg gagaaaacag gaccatgacc attcacaacg gcatgttctt cagcacgtat
     1201 gacagagaca atgacggctg gttaacatca gatcccagaa aacagtgttc taaagaagac
     1261 ggtggtggat ggtggtataa tagatgtcat gcagccaatc caaacggcag atactactgg
     1321 ggtggacagt acacctggga catggcaaag catggcacag atgatggtgt agtatggatg
     1381 aattggaagg ggtcatggta ctcaatgagg aagatgagta tgaagatcag gcccttcttc
     1441 ccacagcaat agtccccaat acgtagattt ttgctcttct gtatgtgaca acatttttgt
     1501 acattatgtt attggaattt tctttcatac attatattcc tctaaaactc tcaagcagac
     1561 gtgagtgtga ctttttgaaa aaagtatagg ataaattaca ttaaaatagc acatgatttt
     1621 cttttgtttt cttcatttct cttgctcacc aagaagtaac aaaagtatag ttttgacaga
     1681 gttggtgttc ataatttcag ttctagttga ttgcgagaat tttcaaataa ggaagagggg
     1741 tcttttatcc ttgtcgtagg aaaaccatga cggaaaggaa aaactgatgt ttaaaagtcc
     1801 acttttaaaa ctatatttat ttatgtagga tctgtcaaag aaaacttcca aaaagattta
     1861 ttaattaaac cagactctgt tgc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: M64983. Homo sapiens fibr...[gi:182597] Links  


LOCUS       HUMFIBRB                8878 bp    DNA     linear   PRI 18-MAY-2000
DEFINITION  Homo sapiens fibrinogen beta chain (FGB), complete cds.
ACCESSION   M64983
VERSION     M64983.1  GI:182597
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 8878)
  AUTHORS   Chung,D.W., Harris,J.E. and Davie,E.W.
  TITLE     Nucleotide sequences of the three genes coding for human fibrinogen
  JOURNAL   (in) Liu,C.Y. and Chien,S. (Eds.);
            FIBRINOGEN, THROMBOSIS, COAGULATION AND FIBRINOLYSIS: 39-48;
            Plenum Press, New York (1991)
FEATURES             Location/Qualifiers
     source          1..8878
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            <470..8878
                     /gene="FGB"
     mRNA            join(<470..583,3258..3449,3939..4122,5043..5270,
                     5831..5944,6633..6758,6967..7252,7871..8537)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
     mRNA            join(<470..583,3258..3449,3939..4122,5043..5270,
                     5831..5944,6633..6758,6967..7252,7871..8534)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
                     /note="putative"
     mRNA            join(470..583,3258..3449,3939..4122,5043..5270,5831..5944,
                     6633..6758,6967..7252,7871..8273)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
                     /note="putative"
     mRNA            join(<470..583,3258..3449,3939..4122,5043..5270,
                     5831..5944,6633..6758,6967..7252,7871..8268)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
                     /note="putative"
     mRNA            join(<470..583,3258..3449,3939..4122,5043..5270,
                     5831..5944,6633..6758,6967..7252,7871..8202)
                     /gene="FGB"
                     /product="fibrinogen beta chain"
                     /note="putative"
     CDS             join(470..583,3258..3449,3939..4122,5043..5270,5831..5944,
                     6633..6758,6967..7252,7871..8102)
                     /gene="FGB"
                     /note="start codon not confirmed"
                     /codon_start=1
                     /product="fibrinogen beta chain"
                     /protein_id="AAA18024.2"
                     /db_xref="GI:7924018"
                     /translation="MKRMVSWSFHKLKTMKHLLLLLLCVFLVKSQGVNDNEEGFFSAR
                     GHRPLDKKREEAPSLRPAPPPISGGGYRARPAKAAATQKKVERKAPDAGGCLHADPDL
                     GVLCPTGCQLQEALLQQERPIRNSVDELNNNVEAVSQTSSSSFQYMYLLKDLWQKRQK
                     QVKDNENVVNEYSSELEKHQLYIDETVNSNIPTNLRVLRSILENLRSKIQKLESDVSA
                     QMEYCRTPCTVSCNIPVVSGKECEEIIRKGGETSEMYLIQPDSSVKPYRVYCDMNTEN
                     GGWTVIQNRQDGSVDFGRKWDPYKQGFGNVATNTDGKNYCGLPGEYWLGNDKISQLTR
                     MGPTELLIEMEDWKGDKVKAHYGGFTVQNEANKYQISVNKYRGTAGNALMDGASQLMG
                     ENRTMTIHNGMFFSTYDRDNDGWLTSDPRKQCSKEDGGGWWYNRCHAANPNGRYYWGG
                     QYTWDMAKHGTDDGVVWMNWKGSWYSMRKMSMKIRPFFPQQ"
     3'UTR           8103..8537
                     /gene="FGB"
     misc_feature    8538..8878
                     /gene="FGB"
                     /note="3' flanking sequence"
BASE COUNT     2912 a   1478 c   1663 g   2825 t
ORIGIN      
        1 gaattcatgc cccttttgaa atagacttat gtcattgtca gaaaacataa gcatttatgg
       61 tatatcatta atgagtcacg attttagtgg ttgccttgtg agtaggtcaa atttactaag
      121 cttagatttg ttttctcaca tattctttcg gagcttgtgt agtttccaca ttaatttacc
      181 agaaacaaga tacacactct ctttgaggag tgccctaact tcccatcatt ttgtccaatt
      241 aaatgaattg aagaaattta atgtttctaa actagaccaa caaagaataa tagttgtatg
      301 acaagtaaat aagctttgct gggaagatgt tgcttaaatg ataaaatggt tcagccaaca
      361 agtgaaccaa aaattaaata ttaactaagg aaaggtaacc atttctgaag tcattcctag
      421 cagaggactc agatatatat aggattgaag atctctcagt taagtctaca tgaaaaggat
      481 ggtttcttgg agcttccaca aacttaaaac catgaaacat ctattattgc tactattgtg
      541 tgtttttcta gttaagtccc aaggtgtcaa cgacaatgag gaggtgaatt ttttaaagca
      601 ttattatatt attagtagta ttattaatat aagatgtaac ataatcatat tatgtgctta
      661 ttttaatgaa attagcattg cttatagtta tgaaatggaa ttgttaacct ctgacttatt
      721 gtatttaaag aatgtttcat agtatttctt atataaaaac aaagtaattt cttgttttct
      781 agtttatcac ctttgttttc ttaagatgag gatggcttag ctaatgtaag atgtgttttt
      841 ctcacttgct attctgagta ctgtgatttt catttacttc tagcaataca ggattacaat
      901 taagaggaca agatctgaaa atctcacaaa ctataaaata ataaaagagc agaattttaa
      961 gataaaagaa actggtggta ggtagattgt tctttggtga aggaaggtaa tatatattgt
     1021 tactgagatt actatttata aaaattataa ctaagcctaa aagcaaaata catcaagtgt
     1081 aatgatagaa aatgaaatat tgcttttttc agatgaaaag ttcaaattag agttagtgtg
     1141 tattgttatt attaatagtt atgaaacacg gttcagtcta atttatttat ttgtagaaca
     1201 gtttgtcctc aactattatt tttgctgact tattgctgtt aatttgcagt tactaaaaat
     1261 acagaaatgc atttaggaca atggatattt aagaaattta aattttatca tcaaacgtat
     1321 catggccaaa tttcttacat atagcatagt atcattaaac tagaaataag aatacacaat
     1381 aatatttaaa tgaagtgatt catttcggat cattattgag tttcaaggga acttgagtgt
     1441 tgtacttatc agactctaca tgtaagaaca tatagttaat ctggttgtgt gtgtaaaaac
     1501 atatggttaa tctggttaag tctggttaat catattaggt aagaaaaatg taaagaatgt
     1561 gtaagacgaa atttttgtaa agtactctgc aaagcacttt cacatttctg cttatcaact
     1621 aaacctcaca gagatagttt aatagtttag gctttaaaat ggattttgat tattcaacaa
     1681 gtggccttca taatttcttt aagtgttttt ctttaagtat atactttctt taaatatttt
     1741 ttaaaatttc cttttctcta gtaaagccag accatccatg ctacctctct agtggcactc
     1801 tgaaataaaa agaaaatagt tttctctgtt ataattgtat ttgtaataag cagatgaatc
     1861 acatttctta aaatttgttt tagagagggt aagctctgac taggaccatg acttcaatgt
     1921 gaaatatgta tatatcctcc gaatctttac atattaagaa tgtatatagt caactggtta
     1981 aacaggaaaa tctggaacag cctggctggg ttttaatctt agcaccatcc tactaaatgt
     2041 taaataatat tataatctaa tgaataaatg acaatgcaat tccaaataga gttcatctga
     2101 tgacttctag actcacaaaa ttgcaagaga gctcagttgt tgctcagttg ttccaaatca
     2161 tgtcgtttgt taatttgtaa ttaagctcca aaggatgtat agctactgac aaaaaaaaaa
     2221 atgagaatgt agttaatcca aatcaaaact ttcctattgc aatgcgtatt ttctgcttca
     2281 ttatccttta atataatatt ttaagttagc aagtaatttt aattacaatg cacaagcctt
     2341 gagaattatt ttaaatataa gaaaatcata atgtttgata aagaaatcat gtaagaaatt
     2401 tcaagataat ggtttaacaa ataattttgt tgatagaaga taagactaaa agtgaaattc
     2461 gaagtggaga ggacacttaa actgtagtac ttgttatgtg tgattccagt aaaaatagta
     2521 atgagcactt attattgcca agtactgttc tgagggtacc atatgcaata agttatttaa
     2581 tccttacaat aatcttgtaa ggcagattca aactatcatt acacttattt tacagatgag
     2641 aaaactgggg cacagataaa gcaacttgcc caaggtctca tagctgtaag tcaaccctac
     2701 ggtcaagacc tacaagtagc cgagctccag agtacattat gagggtcaaa gattgtctta
     2761 ttacaaataa attccaagta gaatcaacct ttaataagtc tttaatgtct cttaaatatg
     2821 tttatatagg agtctaatca ccaattcaca aaaatgaaag tagggaaatg attaacaata
     2881 atcataggaa tctaacaatc caagtggctt gagaatattc attcttcttg acagtataga
     2941 ttctttacaa tttcgtaagt tccaatgtat gttttaggaa tatgaggtca ttactattca
     3001 taatctgata cagctttatc ctaaggcctc tctttaaaaa ctacactgca tcatagcttt
     3061 tttgtgcagt tggtctttct actgttactg aacagtaagc aacctacaga ttcactatca
     3121 ccaaccagcc agttgatgga tcttaagcaa attatcaagc ttgtgataac ctaaattata
     3181 aaatgagggt gttggaatag ttacattcca aatcttctat aacactctgt attatatttc
     3241 tgcctcattc cttgtagggt ttcttcagtg cccgtggtca tcgacccctt gacaagaaga
     3301 gagaagaggc tcccagcctg aggcctgccc caccgcccat cagtggaggt ggctatcggg
     3361 ctcgtccagc caaagcagct gccactcaaa agaaagtaga aagaaaagcc cctgatgctg
     3421 gaggctgtct tcacgctgac ccagacctgg tgggtgcact gatgtttctt gcagtggtgg
     3481 ctctctcatg cagagaaagc ctgtagtcat ggcagtctgc taatgtttca ctgacccaca
     3541 ttaccatcac tgttattttg tttgtttatt ttggaaataa aattcaaaac ataaacatat
     3601 tgggcctttg gtttaggctt tctttcttgt tttctttggt ctgggcccaa aatttcaaat
     3661 taggatatgt gggtgccacc tttccatttg tattttgcca ctgcctttgt ttagttggta
     3721 aaattttcat agcccaatta tattttttct ggggtaagta atattttaaa tctctatgag
     3781 agtatgatga tgactttcga atttctggtc ttacagaaaa ccaaataata aatttttatg
     3841 ttggctaatc gtatcgctga attttcctat gtgctatttt aacaaatgtc catgacccaa
     3901 atccttcatc taatgcctgc tattttcttt gtttttaggg ggtgttgtgt cctacaggat
     3961 gtcagttgca agaggctttg ctacaacagg aaaggccaat cagaaatagt gttgatgagt
     4021 taaataacaa tgtggaagct gtttcccaga cctcctcttc ttcctttcag tacatgtatt
     4081 tgctgaaaga cctgtggcaa aagaggcaga agcaagtaaa aggtagatat ccttgtgctt
     4141 tccattcgat tttcagctat aaaattggaa ccgttagact gccacgagaa tgcatggttg
     4201 tgagaagatt aacatttctg ggttagtgaa tagcattcat acgcttttgg gcaccttccc
     4261 ctgcaacttg ccagataagc actattcagc tcttattccc agtctgacat cagcaagtgt
     4321 gattttctat gaaaaattct actatgactc cttattttaa gtatacaaga aacttgtgac
     4381 tcagaagata atatttacag agtggaaaaa aacccctagc atttatagtt ttaacatttg
     4441 aggttttgaa tgagagagtt atccataata tattcaattg tgttgtggat aatgacacct
     4501 aacctgtgaa tcttgaggtc agaatgttga gtgctgttga cttggtggtc aggaaacagc
     4561 tagtgcgtga gcctggcaca ggcatctcag tgagtagcat acccacagtt ggaaattttt
     4621 caaagaaatc aaaggaatca tgacatctta taaatttcaa ggttctgcta tacttatgtg
     4681 aaatggataa ataaatcaag catatccact ctgtaagatt gaacttctca gatggaagac
     4741 cccaatactg ctttctcctc ttttccctca ccaaagaaat aaacaaccta tttcatttat
     4801 tactggacac aatctttagc gtatacctat ggtaaattac tagtatggtg gttaggattt
     4861 atgttaattt gtatatgtca tgcgccaaat catttccact aaatatgact atatatcata
     4921 actgcttggt gatagctcag tgtttaatag tttattctca gaaaatcaaa attgtatagt
     4981 taaatacatt agttttatga ggcaaaaatg ctaactattt ctacataatt tcatttttcc
     5041 agataatgaa aatgtagtca atgagtactc ctcagaactg gaaaagcacc aattatatat
     5101 agatgagact gtgaatagca atatcccaac taaccttcgt gtgcttcgtt caatcctgga
     5161 aaacctgaga agcaaaatac aaaagttaga atctgatgtc tcagctcaaa tggaatattg
     5221 tcgcacccca tgcactgtca gttgcaatat tcctgtggtg tctggcaaag gtaactgatt
     5281 cataaacata tttttagaga gttccagaag aactcacaca ccaaaaataa gagaacaaca
     5341 acaacaacaa aaatgctaag tggattttcc caacagatca taatgacatt acagtacatc
     5401 ataaaaatat ccttagccag ttgtgttttg gactggcctg gtgcatttgc tggttttgat
     5461 gagcaggatg gggcacaggt agtcccaggg gtggctgatg tgtgcatctg cgtactggct
     5521 tgaacagatg gcagaaccac agatagatgt agaagtttct ccattttgtg tgttctggga
     5581 gctcatggat attccaggac acaaaaggtg gagaagagct ttgttcatcc tcttagcaga
     5641 taaacgtcct caaaactggg ttggacttac taaagtaaaa tgaaaatcta atatttgtta
     5701 tattattttc aaaggtctat aataacacac tccttagtaa cttatgtaat gttattttaa
     5761 agaattggtg actaaataca aagtaattat gtcataaacc cctgaacata atgttgtctt
     5821 acatttgcag aatgtgagga aattatcagg aaaggaggtg aaacatctga aatgtatctc
     5881 attcaacctg acagttctgt caaaccgtat agagtatact gtgacatgaa tacagaaaat
     5941 ggaggtaagc tttcgacagt tgttgacctg ttgatctgta attatttgga taccgtaaaa
     6001 tgccaggaaa caaggccagg tgtggtggct catacctgta attccagcac cttgggaggc
     6061 caaagtgggc tgatagcttg agcctaggag tttgaaacta gcctgggcaa cataatgaga
     6121 ccctaactct acaaaaaaaa aaaaaatacc aaaaaaaaaa aaaaaatcag ctgtgttggt
     6181 agtatgtgcc tgtagtccca gctatccagg aggctgagat gggagatcac ctgagcccac
     6241 aacctggagt cttgatcatg ctactgaact gtagcctggg caacagagga tagtgagatc
     6301 ctgtctcaaa aaaaaaaatt aattaaaaag ccaggaaaca agacttagct ctaacatcta
     6361 acatagctga caaaggagta atttgatgtg gaattcaacc tgatatttaa aagttataaa
     6421 atatctataa ttcacaattt ggggtaagat aaagcacttg cagtttccaa agattttaca
     6481 agtttacctc tcatatttat ttccttattg tgtctatttt agagcaccaa atatatacta
     6541 aatggaatgg acaggggatt cagatattat tttcaaagtg acattatttg ctgttggtta
     6601 atatatgctc tttttgtttc tgtcaaccaa aggatggaca gtgattcaga accgtcaaga
     6661 cggtagtgtt gactttggca ggaaatggga tccatataaa cagggatttg gaaatgttgc
     6721 aaccaacaca gatgggaaga attactgtgg cctaccaggt aacgaacagg catgcaaaat
     6781 aaaatcattc tatttgaaat gggatttttt ttaattaaaa aacattcatt gttggaagcc
     6841 tgttttaggc agttaagagg agtttcctga caaaaatgtg gaagctaaag ataagggaag
     6901 aaaggcagtt tttagtttcc caaaatttta tttttggtga gagattttat tttgtttttc
     6961 ttttaggtga atattggctt ggaaatgata aaattagcca gcttaccagg atgggaccca
     7021 cagaactttt gatagaaatg gaggactgga aaggagacaa agtaaaggct cactatggag
     7081 gattcactgt acagaatgaa gccaacaaat accagatctc agtgaacaaa tacagaggaa
     7141 cagccggtaa tgccctcatg gatggagcat ctcagctgat gggagaaaac aggaccatga
     7201 ccattcacaa cggcatgttc ttcagcacgt atgacagaga caatgacggc tggtatgtgt
     7261 ggcactcttt gctcctgctt taaaaatcac actaatatca ttactcagaa tcattaacaa
     7321 tatttttaat agctaccact tcctgggcac ttactgtcag ccactgtcct aagctcttta
     7381 tgcatcactc gaaagcattt caactataag gtagacattc ttattctcat tttacagatg
     7441 agatttagag agattacgtg atttgtccaa tgtcacacaa ctacccagag ataaaactag
     7501 aatttgagca cagttacttt ctgaataatg agcatttaga taaataccta tatctctata
     7561 ttctaaagtg tgtgtgaaaa ctttcatttt catttccagg gttctctgat actaagggtt
     7621 gtaaaagcta ttattccagt ataaagtaac aaacacagtc cctagatgga ttgccacaaa
     7681 ggcccagtta tctctctttc ttgctatagg gcacaggagg tctttggtgt attagtgtga
     7741 ctctatgtat agcacccaaa ggaaagacta ctgtgcacac gagtgtagca gtcttttatg
     7801 ggtaatctgc aaaacgtaac ttgaccaccg tagttctgtt tctaataacg ccaaacacat
     7861 tttctttcag gttaacatca gatcccagaa aacagtgttc taaagaagac ggtggtggat
     7921 ggtggtataa tagatgtcat gcagccaatc caaacggcag atactactgg ggtggacagt
     7981 acacctggga catggcaaag catggcacag atgatggtgt agtatggatg aattggaagg
     8041 ggtcatggta ctcaatgagg aagatgagta tgaagatcag gcccttcttc ccacagcaat
     8101 agtccccaat acgtagattt ttgctcttct gtatgtgaca acatttttgt acattatgtt
     8161 attggaattt tctttcatac attatattcc tctaaaactc tcaagcagac gtgagtgtga
     8221 ctttttgaaa aaagtatagg ataaattaca ttaaaatagc acatgatttt cttttgtttt
     8281 cttcatttct cttgctcacc caagaagtaa caaaagtata gttttgacag agttggtgtt
     8341 cataatttca gttctagttg attgcgagaa ttttcaaata aggaagaggg gtcttttatc
     8401 cttgtcgtag gaaaaccatg acggaaagga aaaactgatg tttaaaagtc cacttttaaa
     8461 actatattta tttatgtagg atctgtcaaa gaaaacttcc aaaaagattt attaattaaa
     8521 ccagactctg ttgcaataag ttaatgtttt cttgttttgt aatccacaca ttcaatgagt
     8581 taggctttgc acttgtaagg aaggagaagc gttcacaacc tcaaatagct aataaaccgg
     8641 tcttgaatat ttgaagattt aaaatctgac tctaggacgg gcacggtggc tcacgactat
     8701 aatcccaaca ctttgggagg ctgaggcggg cggtcacaag gtcaggagtt caagaccagc
     8761 ctgaccaata tggtgaaacc ccatctctac taaaaataca aaaattagcc aggcgtggtg
     8821 gcaggtgcct gtaggtccca gctagcctgt gaggtggaga ttgcattgag ccaagatc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AK024070. Homo sapiens cDNA...[gi:10436354] Links  


LOCUS       AK024070                3812 bp    mRNA    linear   PRI 01-AUG-2002
DEFINITION  Homo sapiens cDNA FLJ14008 fis, clone Y79AA1002416, highly similar
            to Mus musculus CTP synthetase homolog (CTPsH) mRNA.
ACCESSION   AK024070
VERSION     AK024070.1  GI:10436354
KEYWORDS    oligo capping; fis (full insert sequence).
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Isogai,T., Ota,T., Hayashi,K., Sugiyama,T., Otsuki,T., Suzuki,Y.,
            Nishikawa,T., Nagai,K., Sugano,S., Shiratori,A., Sudo,H.,
            Wagatsuma,M., Hosoiri,T., Kaku,Y., Kodaira,H., Kondo,H.,
            Sugawara,M., Takahashi,M., Chiba,Y., Ishida,S., Murakawa,K.,
            Ono,Y., Takiguchi,S., Watanabe,S., Kimura,K., Murakami,K.,
            Ishii,S., Kawai,Y., Saito,K., Yamamoto,J., Wakamatsu,A.,
            Nakamura,Y., Nagahari,K., Masuho,Y., Ninomiya,K. and Iwayanagi,T.
  TITLE     NEDO human cDNA sequencing project
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3812)
  AUTHORS   Isogai,T. and Otsuki,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-AUG-2000) Takao Isogai, Helix Research Institute,
            Genomics Laboratory; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan
            (E-mail:genomics@hri.co.jp, Tel:81-438-52-3975, Fax:81-438-52-3986)
COMMENT     NEDO human cDNA sequencing project supported by Ministry of
            International Trade and Industry of Japan; cDNA full insert
            sequencing: Research Association for Biotechnology; cDNA library
            construction, 5'- & 3'-end one pass sequencing and clone selection:
            Helix Research Institute (supported by Japan Key Technology Center
            etc.) and Department of Virology, Institute of Medical Science,
            University of Tokyo.
FEATURES             Location/Qualifiers
     source          1..3812
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="Y79AA1002416"
                     /cell_line="Y79"
                     /cell_type="retinoblastoma"
                     /clone_lib="Y79AA1"
                     /note="cloning vector: pME18SFL3"
     CDS             239..1999
                     /note="unnamed protein product"
                     /codon_start=1
                     /protein_id="BAB14814.1"
                     /db_xref="GI:10436355"
                     /translation="MKYILVTGGVISGIGKGIIASSIGTILKSCGLRVTAIKIDPYIN
                     IDAGTFSPYEHGEVFVLNDGGEVDLDLGNYERFLDINLYKDNNITTGKIYQHVINKER
                     RGDYLGKTVQVVPHITDAVQEWVMNQAKVPVDGNKEEPQICVIELGGTIGDIEGMPFV
                     EAFRQFQFKAKRENFCNIHVSLVPQLSATGEQKTKPTQNSVRALRGLGLSPDLIVCRS
                     SSPIEMAVKEKISMFCHVNPEQVICIHDVSSTYRVPVLLEEQSIVKYFKERLHLPIGD
                     SASNLLFKWRNMADRYERLQKICSIALVGKYTKLRDCYASVFKALEHSALAINHKLNL
                     MYIDSIDLEKITETEDPVKFHEAWQKLCKADGILVPGGFGIRGTLGKLQAISWARTKK
                     IPFLGVCLGMQLAVIEFARNCLNLKDADSTEFRPNAPVPLVIDMPEHNPGNLGGTMRL
                     GIRRTVFKTENSILRKLYGDVPFIEERHRHRFEVNPNLIKQFEQNDLSFVGQDVDGDR
                     MEIIELANHPYFVGVQFHPEFSSRPMKPSPPYLGLLLAATGNLNAYLQQGCKLSSSDR
                     YSDASDDSFSEPRIAELEIS"
BASE COUNT     1064 a    786 c    869 g   1093 t
ORIGIN      
        1 tttccttttc cttctctcct gagcgctcct gcagttcctg gggcgtagta ggggatccac
       61 aagcgtttgt gaccagtgaa gttctttaca agggtgagat ctgcacggga ggacccgagc
      121 gagggtctcg gcttgccagg aagccggggt tccccgggaa gcgtggagtt cacccgcgca
      181 ctcgaagtgc ctttgcaaaa ttatatctgg gtgttggcac ccagccacta ttctgccaat
      241 gaagtacatc ctggtcacgg gtggggtcat ctcaggcatt ggtaaaggga tcattgccag
      301 cagcattgga acgattctaa aatcatgtgg actccgagtt actgccataa aaatcgaccc
      361 ctatattaac atcgatgctg gcactttttc accttatgaa cacggtgaag tcttcgtctt
      421 aaatgatggt ggagaagttg atttagacct tggaaattat gaaagatttt tggatattaa
      481 tctttataaa gacaacaata tcaccacggg gaagatatat cagcatgtga tcaataaaga
      541 gaggcgtggt gattacctgg ggaaaacagt gcaagttgtc cctcacatta ctgatgctgt
      601 ccaggagtgg gttatgaatc aagccaaggt gccggtggat ggtaataagg aagagcccca
      661 aatatgcgtt attgagctgg gaggcaccat tggagacatc gaaggaatgc cgtttgtgga
      721 ggcgtttaga caattccagt ttaaggcgaa aagagagaat ttctgtaata tccacgttag
      781 ccttgtccca cagctcagtg ctaccggaga acaaaaaacc aaacccaccc aaaacagcgt
      841 ccgcgcactg aggggtttag gcctgtctcc agatctgatt gtctgccgaa gttcatcgcc
      901 cattgagatg gccgtgaagg agaagatttc tatgttttgt cacgtgaacc ctgaacaggt
      961 catatgtatc catgatgttt cttccacata ccgagttcct gtgcttttag aggaacaaag
     1021 cattgtgaaa tattttaagg agagattgca cctgcccatc ggtgattctg caagtaattt
     1081 gctttttaag tggagaaata tggctgacag gtatgaaagg ttacagaaaa tatgctccat
     1141 agccctggtt ggcaaataca ccaagctcag agactgctac gcctctgtgt tcaaagccct
     1201 ggaacactca gccctggcca tcaaccacaa gttgaatctg atgtacatag actccattga
     1261 tctggagaag atcactgaaa ccgaggaccc tgtgaaattt catgaagctt ggcagaagct
     1321 atgcaaagct gatggtattc ttgtgcctgg aggctttgga atcagaggaa cattgggaaa
     1381 actccaggcg atttcttggg caaggacaaa gaagattcct tttctgggag tttgtcttgg
     1441 gatgcaacta gcagtgatag agtttgcaag aaactgcctt aacttgaaag atgctgattc
     1501 cacagagttt aggccaaatg ccccagttcc tctggtgatt gatatgcccg agcacaaccc
     1561 tggcaatttg ggaggaacaa tgagactggg aataagaaga actgttttca aaactgaaaa
     1621 ttcaatatta aggaaacttt atggtgatgt tccttttata gaagaaagac acagacatcg
     1681 gttcgaggta aaccctaacc tgatcaaaca atttgagcag aatgacttaa gttttgtagg
     1741 tcaggatgtc gatggagaca ggatggaaat cattgaactg gcaaatcatc cttattttgt
     1801 tggtgtccag ttccatcctg agttttcttc taggccgatg aagccttccc ctccgtatct
     1861 ggggctgtta cttgcagcaa ctgggaacct gaatgcctac ttgcaacagg gttgcaaact
     1921 gtcttccagt gatagataca gtgatgccag tgatgacagc ttttcagagc caaggatagc
     1981 tgagttggaa ataagctgaa atgaatacat gactgggaat aatggggact gcctgtgagg
     2041 cctctgaaat aattgaaggc aagatgaagg aactatctga agaaatcact acactcttag
     2101 agaatccctc tgttctccag caaacatggg atgtaaagcc tcacagggaa tctgataata
     2161 catacttctg tcaaccagaa ccagaggggt agttttcttt tccctccaga ggcagccttt
     2221 ggtacttaaa atatctgtag ctgattaaat ttttcccaac aacctcactg gggagaaagt
     2281 gtgttcatgt tttgtccagc ggatcaggat gttaggatga cgagcaagag tccaggtcac
     2341 tgtgcctttg ctgtgttgta tggaaaggat ggcagggaac atgctgtaag taattttgag
     2401 taagaaaatg agtcactgtg ttacctggaa ctcagccaca gatttgtgtg tggtccaaga
     2461 tcattgcagt ttctcaccct gtttatttcc tggtaaaagt aaaattgaat aggtccaaga
     2521 cttgggggtg gcaagtaagg ctttgcctca ggcacaaaat ttaagggggc tccaaaaaac
     2581 tcaggaatca agatcagcaa tacagtctga gtatccctta tgtgaaatgc ttggggctag
     2641 aagtgttttg aatttcagat tttggaatat ttgcatatac atgcgatatc ttggggatga
     2701 gactcaagac taaacatgaa attcatttat gcttcatata caccttatat acatagccta
     2761 aaggtaattt gatacaatat tttaaataat tttgtgcatg aaacaaagtt tcgactgcat
     2821 tttgactgtg atttctggca tgagatcagt tatggaattt tccacttcta gcgtcatgtt
     2881 ggcattcaga aattttgaaa ttttggagca ttttggattt tcagattagg gatgctcaac
     2941 ctgtatatat attttttaat cgacgtgaaa ttcacgtaac atagaattaa ccattttgaa
     3001 gtgaacaatt tggttgcatt cactgatgtt gagcaaccac cacctttaac tatttccaaa
     3061 acattttcat cactccaaaa taaatgcctg tacacactag cagtcactcc ctatcttccc
     3121 ctccacctgt ccgctggcaa ccactgatct cctttttatt tctgtggctt tttctattct
     3181 ggatatttca tataagtgga attacacaat atatgtggtc ttttgtgtct ggcttcttct
     3241 gagacagtag gaagggggct tggctttggc tcacccccac tagagcattt tttcatgcat
     3301 tcccactgat cacaaaaccc atactactac ctcattgaca ccatacctgc taacctcgag
     3361 gctttagtca tacaaagaag atggcctttc tgtattgttc ttctgtgctc tcataatgct
     3421 taaccatgtc ttttacttaa acaattccag gaactggcct taggagatcc aaatagggaa
     3481 ccaagattgc agagtgtccc atcttgggag ggaatgctga ataattaatt gatttacagc
     3541 cttgttgccg ctggccagac caccaggtgg cccattactc gagatgatca tcacaaccag
     3601 atgatgctaa cctatatcct ctacccttcg cgtgctttgt ctgggaagtc ttttggcccc
     3661 atgtcagttt ctattgcatt gagagcccaa gagcccctgg tcagtcaggc ttccatttag
     3721 catggcgttt gcaaggttta cccatgttgt agcatgtgtc agaatttcat tcctttctat
     3781 ggctgaataa aattccattg tatgaatata cc
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProteinProteinSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  





&&&&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U70660. Human copper tran...[gi:1945364] Links  


LOCUS       HSU70660                 502 bp    mRNA    linear   PRI 19-APR-1997
DEFINITION  Human copper transport protein HAH1 (HAH1) mRNA, complete cds.
ACCESSION   U70660
VERSION     U70660.1  GI:1945364
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 502)
  AUTHORS   Klomp,L.W., Lin,S.J., Yuan,D.S., Klausner,R.D., Culotta,V.C. and
            Gitlin,J.D.
  TITLE     Identification and functional expression of HAH1, a novel human
            gene involved in copper homeostasis
  JOURNAL   J. Biol. Chem. 272 (14), 9221-9226 (1997)
  MEDLINE   97238857
   PUBMED   9083055
REFERENCE   2  (bases 1 to 502)
  AUTHORS   Klomp,L.W.J. and Gitlin,J.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-SEP-1996) Pediatrics, Washington University, One
            Children's Place, St. Louis, MO 63110, USA
FEATURES             Location/Qualifiers
     source          1..502
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="5"
                     /map="5q32-33"
                     /tissue_type="liver"
     gene            1..502
                     /gene="HAH1"
     5'UTR           1..113
                     /gene="HAH1"
     CDS             114..320
                     /gene="HAH1"
                     /note="human ATX1 homolog"
                     /codon_start=1
                     /product="copper transport protein HAH1"
                     /protein_id="AAC51227.1"
                     /db_xref="GI:1945365"
                     /translation="MPKHEFSVDMTCGGCAEAVSRVLNKLGGVKYDIDLPNKKVCIES
                     EHSMDTLLATLKKTGKTVSYLGLE"
     3'UTR           321..502
                     /gene="HAH1"
     polyA_signal    479..484
                     /gene="HAH1"
BASE COUNT      102 a    149 c    142 g    109 t
ORIGIN      
        1 ctgtcgccct gcacggtgac ccgcgtgtgc gaggccttca tggccaggat ccgggtggag
       61 aggcgctgct gacaccgccg ccacaccgcc gccacaccgc cgctgcctca gtcatgccga
      121 agcacgagtt ctctgtggac atgacctgtg gaggctgtgc tgaagctgtc tctcgggtcc
      181 tcaataagct tggaggagtt aagtatgaca ttgacctgcc caacaagaag gtctgcattg
      241 aatctgagca cagcatggac actctgcttg caaccctgaa gaaaacagga aagactgttt
      301 cctaccttgg ccttgagtag caggggcctg gtccccacag cccacaggat ggaccaaagg
      361 gggcaggatg ctgatcctcc cgctggcttc cagacagacc tgggacttgg cagtcatgcc
      421 gggtgatcgt gttcctgcgg agaccctcag ttgtcctatt ccttcctagc ttccctgcaa
      481 taaatcaagc tgcttttgtt gg
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  




&&&&&&&




    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: AF013263. Homo sapiens apop...[gi:2330014] Links  


LOCUS       AF013263                7042 bp    mRNA    linear   PRI 23-AUG-1997
DEFINITION  Homo sapiens apoptotic protease activating factor 1 (Apaf-1) mRNA,
            complete cds.
ACCESSION   AF013263
VERSION     AF013263.1  GI:2330014
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 7042)
  AUTHORS   Zou,H., Henzel,W.J., Liu,X., Lutschg,A. and Wang,X.
  TITLE     Apaf-1, a human protein homologous to C. elegans CED-4,
            participates in cytochrome c-dependent activation of caspase-3
  JOURNAL   Cell 90 (3), 405-413 (1997)
  MEDLINE   97410306
   PUBMED   9267021
REFERENCE   2  (bases 1 to 7042)
  AUTHORS   Zou,H., Henzel,H.J., Liu,X., Lutschg,A. and Wang,X.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JUL-1997) Biochemistry, University of Texas
            Southwestern Medical Center at Dallas, 5323 Harry Hines Blvd.,
            Dallas, TX 75235, USA
FEATURES             Location/Qualifiers
     source          1..7042
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_line="HeLa S3"
     gene            1..7042
                     /gene="Apaf-1"
     CDS             578..4162
                     /gene="Apaf-1"
                     /function="cytochrome c-dependent activation of caspase-3"
                     /note="similar to C. elegans cell death gene ced-4"
                     /codon_start=1
                     /product="apoptotic protease activating factor 1"
                     /protein_id="AAC51678.1"
                     /db_xref="GI:2330015"
                     /translation="MDAKARNCLLQHREALEKDIKTSYIMDHMISDGFLTISEEEKVR
                     NEPTQQQRAAMLIKMILKKDNDSYVSFYNALLHEGYKDLAALLHDGIPVVSSSSVRTV
                     LCEGGVPQRPVVFVTRKKLVNAIQQKLSKLKGEPGWVTIHGMAGCGKSVLAAEAVRDH
                     SLLEGCFPGGVHWVSVGKQDKSGLLMKLQNLCTRLDQDESFSQRLPLNIEEAKDRLRI
                     LMLRKHPRSLLILDDVWDSWVLKAFDSQCQILLTTRDKSVTDSVMGPKYVVPVESSLG
                     KEKGLEILSLFVNMKKADLPEQAHSIIKECKGSPLVVSLIGALLRDFPNRWEYYLKQL
                     QNKQFKRIRKSSSYDYEALDEAMSISVEMLREDIKDYYTDLSILQKDVKVPTKVLCIL
                     WDMETEEVEDILQEFVNKSLLFCDRNGKSFRYYLHDLQVDFLTEKNCSQLQDLHKKII
                     TQFQRYHQPHTLSPDQEDCMYWYNFLAYHMASAKMHKELCALMFSLDWIKAKTELVGP
                     AHLIHEFVEYRHILDEKDCAVSENFQEFLSLNGHLLGRQPFPNIVQLGLCEPETSEVY
                     QQAKLQAKQEVDNGMLYLEWINKKNITNLSRLVVRPHTDAVYHACFSEDGQRIASCGA
                     DKTLQVFKAETGEKLLEIKAHEDEVLCCAFSTDDRFIATCSVDKKVKIWNSMTGELVH
                     TYDEHSEQVNCCHFTNSSHHLLLATGSSDCFLKLWDLNQKECRNTMFGHTNSVNHCRF
                     SPDDKLLASCSADGTLKLWDATSANERKSINVKQFFLNLEDPQEDMEVIVKCCSWSAD
                     GARIMVAAKNKIFLWNTDSRSKVADCRGHLSWVHGVMFSPDGSSFLTSSDDQTIRLWE
                     TKKVCKNSAVMLKQEVDVVFQENEVMVLAVDHIRRLQLINGRTGQIDYLTEAQVSCCC
                     LSPHLQYIAFGDENGAIEILELVNNRIFQSRFQHKKTVWHIQFTADEKTLISSSDDAE
                     IQVWNWQLDKCIFLRGHQETVKDFRLLKNSRLLSWSFDGTVKVWNIITGNKEKDFVCH
                     QGTVLSCDISHDATKFSSTSADKTAKIWSFDLLLPLHELRGHNGCVRCSAFSVDSTLL
                     ATGDDNGEIRIWNVSNGELLHLCAPLSEEGAATHGGWVTDLCFSPDGKMLISAGGYIK
                     WWNVVTGESSQTFYTNGTNLKKIHVSPDFKTYVTVDNLGILYILQTLE"
BASE COUNT     1985 a   1355 c   1580 g   2122 t
ORIGIN      
        1 aagaagaggt agcgagtgga cgtgactgct ctatcccggg caaaagggat agaaccagag
       61 gtggggagtc tgggcagtcg gcgacccgcg aagacttgag gtgccgcagc ggcatccgga
      121 gtagcgccgg gctccctccg gggtgcagcc gccgtcgggg gaagggcgcc acaggccggg
      181 aagacctcct ccctttgtgt ccagtagtgg ggtccaccgg agggcggccc gtgggccggg
      241 cctcaccgcg gcgctccggg actgtggggt caggctgcgt tgggtggacg cccacctcgc
      301 caaccttcgg aggtccctgg gggtcttcgt gcgccccggg gctgcagaga tccaggggag
      361 gcgcctgtga ggcccggacc tgccccgggg cgaagggtat gtggcgagac agagccctgc
      421 acccctaatt cccggtggaa aactcctgtt gccgtttccc tccaccggcc tggagtctcc
      481 cagtcttgtc ccggcagtgc cgccctcccc actaagacct aggcgcaaag gcttggctca
      541 tggttgacag ctcagagaga gaaagatctg agggaagatg gatgcaaaag ctcgaaattg
      601 tttgcttcaa catagagaag ctctggaaaa ggacatcaag acatcctaca tcatggatca
      661 catgattagt gatggatttt taacaatatc agaagaggaa aaagtaagaa atgagcccac
      721 tcaacagcaa agagcagcta tgctgattaa aatgatactt aaaaaagata atgattccta
      781 cgtatcattc tacaatgctc tactacatga aggatataaa gatcttgctg cccttctcca
      841 tgatggcatt cctgttgtct cttcttccag tgtaaggaca gtcctgtgtg aaggtggagt
      901 accacagagg ccagttgttt ttgtcacaag gaagaagctg gtgaatgcaa ttcagcagaa
      961 gctctccaaa ttgaaaggtg aaccaggatg ggtcaccata catggaatgg caggctgtgg
     1021 gaagtctgta ttagctgcag aagctgttag agatcattcc cttttagaag gttgtttccc
     1081 agggggagtg cattgggttt cagttgggaa acaagacaaa tctgggcttc tgatgaaact
     1141 gcagaatctt tgcacacggt tggatcagga tgagagtttt tcccagaggc ttccacttaa
     1201 tattgaagag gctaaagacc gtctccgcat tctgatgctt cgcaaacacc caaggtctct
     1261 cttgatcttg gatgatgttt gggactcttg ggtgttgaaa gcttttgaca gtcagtgtca
     1321 gattcttctt acaaccagag acaagagtgt tacagattca gtaatgggtc ctaaatatgt
     1381 agtccctgtg gagagttcct taggaaagga aaaaggactt gaaattttat ccctttttgt
     1441 taatatgaag aaggcagatt tgccagaaca agctcatagt attataaaag aatgtaaagg
     1501 ctctcccctt gtagtatctt taattggtgc acttttacgt gattttccca atcgctggga
     1561 gtactacctc aaacagcttc agaataagca gtttaagaga ataaggaaat cttcgtctta
     1621 tgattatgag gctctagatg aagccatgtc tataagtgtt gaaatgctca gagaagacat
     1681 caaagattat tacacagatc tttccatcct tcagaaggac gttaaggtgc ctacaaaggt
     1741 gttatgtatt ctctgggaca tggaaactga agaagttgaa gacatactgc aggagtttgt
     1801 aaataagtct cttttattct gtgatcggaa tggaaagtcg tttcgttatt atttacatga
     1861 tcttcaagta gattttctta cagagaagaa ttgcagccag cttcaggatc tacataagaa
     1921 gataatcact cagtttcaga gatatcacca gccgcatact ctttcaccag atcaggaaga
     1981 ctgtatgtat tggtacaact ttctggccta tcacatggcc agtgccaaga tgcacaagga
     2041 actttgtgct ttaatgtttt ccctggattg gattaaagca aaaacagaac ttgtaggccc
     2101 tgctcatctg attcatgaat ttgtggaata cagacatata ctagatgaaa aggattgtgc
     2161 agtcagtgag aattttcagg agtttttatc tttaaatgga caccttcttg gacgacagcc
     2221 atttcctaat attgtacaac tgggtctctg tgagccggaa acttcagaag tttatcagca
     2281 agctaagctg caggccaagc aggaggtcga taatggaatg ctttacctgg aatggataaa
     2341 caaaaaaaac atcacgaatc tttcccgctt agttgtccgc ccccacacag atgctgttta
     2401 ccatgcctgc ttttctgagg atggtcagag aatagcttct tgtggagctg ataaaacctt
     2461 acaggtgttc aaagctgaaa caggagagaa acttctagaa atcaaggctc atgaggatga
     2521 agtgctttgt tgtgcattct ctacagatga cagatttata gcaacctgct cagtggataa
     2581 aaaagtgaag atttggaatt ctatgactgg ggaactagta cacacctatg atgagcactc
     2641 agagcaagtc aattgctgcc atttcaccaa cagtagtcat catcttctct tagccactgg
     2701 gtcaagtgac tgcttcctca aactttggga tttgaatcaa aaagaatgtc gaaataccat
     2761 gtttggtcat acaaattcag tcaatcactg cagattttca ccagatgata agcttttggc
     2821 tagttgttca gctgatggaa ccttaaagct ttgggatgcg acatcagcaa atgagaggaa
     2881 aagcattaat gtgaaacagt tcttcctaaa tttggaggac cctcaagagg atatggaagt
     2941 gatagtgaag tgttgttcgt ggtctgctga tggtgcaagg ataatggtgg cagcaaaaaa
     3001 taaaatcttt ttgtggaata cagactcacg ttcaaaggtg gctgattgca gaggacattt
     3061 aagttgggtt catggtgtga tgttttctcc tgatggatca tcatttttga catcttctga
     3121 tgaccagaca atcaggctct gggagacaaa gaaagtatgt aagaactctg ctgtaatgtt
     3181 aaagcaagaa gtagatgttg tgtttcaaga aaatgaagtg atggtccttg cagttgacca
     3241 tataagacgt ctgcaactca ttaatggaag aacaggtcag attgattatc tgactgaagc
     3301 tcaagttagc tgctgttgct taagtccaca tcttcagtac attgcatttg gagatgaaaa
     3361 tggagccatt gagattttag aacttgtaaa caatagaatc ttccagtcca ggtttcagca
     3421 caagaaaact gtatggcaca tccagttcac agccgatgag aagactctta tttcaagttc
     3481 tgatgatgct gaaattcagg tatggaattg gcaattggac aaatgtatct ttctacgagg
     3541 ccatcaggaa acagtgaaag actttagact cttgaaaaat tcaagactgc tttcttggtc
     3601 atttgatgga acagtgaagg tatggaatat tattactgga aataaagaaa aagactttgt
     3661 ctgtcaccag ggtacagtac tttcttgtga catttctcac gatgctacca agttttcatc
     3721 tacctctgct gacaagactg caaagatctg gagttttgat ctccttttgc cacttcatga
     3781 attgaggggc cacaacggct gtgtgcgctg ctctgccttc tctgtggaca gtaccctgct
     3841 ggcaacggga gatgacaatg gagaaatcag gatatggaat gtctcaaacg gtgagcttct
     3901 tcatttgtgt gctccgcttt cagaagaagg agctgctacc catggaggct gggtgactga
     3961 cctttgcttt tctccagatg gcaaaatgct tatctctgct ggaggatata ttaagtggtg
     4021 gaacgttgtc actggggaat cctcacagac cttctacaca aatggaacca atcttaagaa
     4081 aatacacgtg tcccctgact tcaaaacata tgtgactgtg gataatcttg gtattttata
     4141 tattttacag actttagaat aaaatagtta agcattaatg tagttgaact ttttaaattt
     4201 ttgaattgga aaaaaattct aatgaaaccc tgatatcaac tttttataaa gctcttaatt
     4261 gttgtgcagt attgcattca ttacaaaagt gtttgtggtt ggatgaataa tattaatgta
     4321 gctttttccc aaatgaacat acctttaatc ttgtttttca tgatcatcat taacagtttg
     4381 tccttaggat gcaaatgaaa atgtgaatac ataccttgtt gtactgttgg taaaattctg
     4441 tcttgatgca ttcaaaatgg ttgacataat taatgagaag aatttggaag aaattggtat
     4501 tttaatactg tctgtattta ttactgttat gcaggctgtg cctcagggta gcagtggcct
     4561 gctttttgaa ccacacttac cccaaggggg ttttgttctc ctaaatacaa tcttagaggt
     4621 tttttgcact ctttaaattt gctttaaaaa tattgtgtct gtgtgcatag tctgcagcat
     4681 ttcctttaat tgactcaata agtgagtctt ggatttagca ggccccccca cctttttttt
     4741 ttgtttttgg agacagagtc ttgctttgtt gccaggctgg agtgcagtgg cgcgatctcg
     4801 gctcaccaca atcgctgcct cctgggttca agcaattctc ctgcctcagc ctcccgagta
     4861 gctgggacta caggtgtgcg cacatgccag gctaattttt gtatttttag tagagacggg
     4921 gtttcaccat gttggccggg atggtctcga tctcttgacc tcatgatcta cccgccttgg
     4981 cctcccaaag tgctgagatt acaggcgtga gccaccgtgc ctggccaggc cccttctctt
     5041 ttaatggaga cagggtcttg cactatcacc caggctggag tgcagtggca taatcatacc
     5101 tcattgcagc ctcagactcc tgggttcaag caatcctcct gcctcagcct cccaagtagc
     5161 tgagactgca ggcacgagcc accacaccca gctaattttt aagttttctt gtagagacag
     5221 ggtctcacta tgttgtctag gctggtcttg aactcttggc ctcaagtaat cctcctgcct
     5281 cagcctccca aagtgttggg attgcagata tgagccactg gcctggcctt cagcagttct
     5341 ttttgtgaag taaaacttgt atgttggaaa gagtagattt tattggtcta cccttttctc
     5401 actgtagctg ctggcagccc tgtgccatat ctggactcta gttgtcagta tctgagttgg
     5461 acactattcc tgctccctct tgtttcttac atatcagact tcttacttga atgaaacctg
     5521 atctttccta atcctcactt ttttcttttt taaaaagcag tttctccact gctaaatgtt
     5581 agtcattgag gtggggccaa ttttaatcat aagccttaat aagatttttc taagaaatgt
     5641 gaaatagaac aattttcatc taattccatt tacttttaga tgaatggcat tgtgaatgcc
     5701 attcttttaa tgaatttcaa gagaattctc tggttttctg tgtaattcca gatgagtcac
     5761 tgtaactcta gaagattaac cttccagcca acctattttc ctttcccttg tctctctcat
     5821 cctcttttcc ttccttcttt cctttctctt cttttatctc caaggttaat caggaaaaat
     5881 agcttttgac aggggaaaaa actcaataac tagctatttt tgacctcctg atcaggaact
     5941 ttagttgaag cgtaaatcta aagaaacatt ttctctgaaa tatattatta agggcaatgg
     6001 agataaatta atagtagatg tggttcccag aaaatataat caaaattcaa agattttttt
     6061 tgtttctgta actggaacta aatcaaatga ttactagtgt taatagtaga taacttgttt
     6121 ttattgttgg tgcatattag tataactgtg gggtaggtcg gggagagggt aagggaatag
     6181 atcactcaga tgtattttag ataagctatt tagcctttga tggaatcata aatacagtga
     6241 atacaatcct ttgcattgtt aaggaggttt tttgttttta aatggtgggt caaggagcta
     6301 gtttacaggc ttactgtgat ttaagcaaat gtgaaaagtg aaaccttaat tttatcaaaa
     6361 gaaatttctg taaatggtat gtctccttag aatacccaaa tcataatttt atttgtacac
     6421 actgttaggg gctcatctca tgtaggcaga gtataaagta ttaccttttg gaattaaaag
     6481 ccactgactg ttataaagta taacaacaca catcaggttt taaaaagcct tgaatggccc
     6541 ttgtcttaaa aagaaattag gagccaggtg cggtggcacg tgcctgtagt cccagctcct
     6601 tgggaggctg agacaggagg attccttgag ccctggagtt tgagtccagc ctgggtgaca
     6661 tagcaagacc ctgtcttaaa agaaaaatgg gaagaaagac aaggtaacat gaagaaagaa
     6721 gagataccta gtatgatgga gctgcaaatt tcatggcagt tcatgcagtc ggtcaagagg
     6781 aggattttgt tttgtagttt gcagatgagc atttctaaag cattttccct tgctgtattt
     6841 ttttgtatta taaattacat tggacttcat atatataatt tttttttaca ttatatgtct
     6901 cttgtatgtt ttgaaactct tgtatttatg atatagctta tatgattttt ttgccttggt
     6961 atacatttta aaatatgaat ttaaaaaatt tttgtaaaaa taaaattcac aaaattgttt
     7021 tgaaaaacaa aaaaaaaaaa aa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneUniSTSUniSTSLinkOutLinkOutHelpHelp  




&&&&&&&



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: U41514. Human UDP-GalNAc:...[gi:1136284] Links  


LOCUS       HSU41514                2185 bp    mRNA    linear   PRI 25-DEC-1995
DEFINITION  Human UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
            mRNA, complete cds.
ACCESSION   U41514
VERSION     U41514.1  GI:1136284
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2185)
  AUTHORS   Meurer,J.A., Naylor,J.M., Baker,C.A., Thomsen,D.R., Homa,F.L. and
            Elhammer,A.P.
  TITLE     cDNA cloning, expression, and chromosomal localization of a human
            UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase
  JOURNAL   J. Biochem. 118 (3), 568-574 (1995)
  MEDLINE   96115928
   PUBMED   8690719
REFERENCE   2  (bases 1 to 2185)
  AUTHORS   Meurer,J.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-NOV-1995) Janet A. Meurer, Department of
            Biochemistry, The Upjohn Company, 301 Henrietta Street, Kalamazoo,
            MI 49001, USA
FEATURES             Location/Qualifiers
     source          1..2185
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="18"
                     /clone="pCR-5'A, pCR-HumGalNAc-1, pCR-EH"
                     /tissue_type="salivary gland"
     5'UTR           1..31
     CDS             32..1711
                     /codon_start=1
                     /product="UDP-GalNAc:polypeptide
                     N-acetylgalactosaminyltransferase"
                     /protein_id="AAC50327.1"
                     /db_xref="GI:1136285"
                     /translation="MRKFAYCKVVLATSLIWVLLDMFLLLYFSECNKCDEKKERGLPA
                     GDVLEPVQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVR
                     LEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASER
                     DFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGW
                     LEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMD
                     RRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTL
                     EIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKVD
                     YGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRKEETNQCLDNMARKE
                     NEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEY
                     DPVKLTLQHVNSNQCLDKATEEDSQVPSIRDCNGSRSQQWLLRNVTLPEIF"
     3'UTR           1712..2185
BASE COUNT      707 a    395 c    490 g    593 t
ORIGIN      
        1 tttttaaatt ttgcatttga cttaaagtgc catgagaaaa tttgcatact gcaaggtggt
       61 cctagccacc tccttgattt gggtactctt ggatatgttc ctgctgcttt acttcagtga
      121 atgcaacaaa tgtgatgaaa aaaaggagag aggacttcct gctggagatg ttctagagcc
      181 agtacaaaag cctcatgaag gtcctggaga aatggggaaa ccagtcgtca ttcctaaaga
      241 ggatcaagaa aagatgaaag agatgtttaa aatcaatcag ttcaatttaa tggcaagtga
      301 gatgattgca ctcaacagat ctttaccaga tgttaggtta gaagggtgta aaacaaaggt
      361 gtatccagat aatcttccta caacaagtgt ggtgattgtt ttccacaatg aggcttggag
      421 cacacttctg cgaactgtcc atagtgtcat taatcgctca ccaagacaca tgatagaaga
      481 aattgttcta gtagatgatg ccagtgaaag agactttttg aaaaggcctt tagagagtta
      541 tgtgaaaaaa ctaaaagtac cagttcatgt aattcgaatg gaacaacgtt ctggattgat
      601 cagagctaga ttaaaaggag ctgctgtgtc taaaggccaa gtgatcacct tcctggatgc
      661 ccattgtgag tgtacagtgg gatggctgga gcctctcttg gccaggatca aacatgacag
      721 gagaacagtg gtgtgtccca tcatcgatgt gatcagtgat gatacttttg agtacatggc
      781 aggctctgat atgacctatg gtgggttcaa ctggaagctc aattttcgct ggtatcctgt
      841 tccccaaaga gaaatggaca gaaggaaagg tgatcggact cttcctgtca ggacacctac
      901 catggcagga ggcctttttt caatagacag agattacttt caggaaattg gaacatatga
      961 tgctggaatg gatatttggg gaggagaaaa cctagaaatt tcctttagga tttggcagtg
     1021 tggaggaact ttggaaattg ttacatgctc acatgttgga catgtgtttc ggaaagctac
     1081 accttacacg tttccaggag gcacagggca gattatcaat aaaaataaca gacgacttgc
     1141 agaagtgtgg atggatgaat tcaagaattt cttctatata atttctccag gtgttacaaa
     1201 ggtagattat ggagatatat cgtcaagagt tggtctaaga cacaaactac aatgcaaacc
     1261 tttttcctgg tacctagaga atatatatcc tgattctcaa attccacgtc actatttctc
     1321 attgggagag atacgaaaag aggaaacgaa tcagtgtcta gataacatgg ctagaaaaga
     1381 gaatgaaaaa gttggaattt ttaattgcca tggtatgggg ggtaatcagg ttttctctta
     1441 tactgccaac aaagaaatta gaacagatga cctttgcttg gatgtttcca aacttaatgg
     1501 cccagttaca atgctcaaat gccaccacct aaagggcaac caactctggg agtatgaccc
     1561 agtgaaatta accctgcagc atgtgaacag taatcagtgc ctggataaag ccacagaaga
     1621 ggatagccag gtgcccagca ttagagactg caatggaagt cggtcccagc agtggcttct
     1681 tcgaaacgtc accctgccag aaatattctg agaccaaatt tacaaaaaaa cgaaaaaaat
     1741 aaggattgac tgggctacct cagcatacat ttctgccaca ttcttaagta gcaaaaaagg
     1801 aaaagtgctt tcctcctctg caggatgtaa ggtttatcag ccattaaaac ttagacttct
     1861 ctagcttttc actagctgtg aaccagcctt cctgtccatg gacgtgaaac tgcatagtaa
     1921 tgagactgtg cacactgatg tttacaagat tgaaagagtc tttctccgaa aatcatggta
     1981 aagaatactg agacaatgaa aaaaaatcaa caaaatatgc tttctggaga actgtacctt
     2041 ctatggtttg cttgcacatc agtagtttct gctgaacgtg ctgtcataat gaagagattt
     2101 ccaagatttt ttttcctgat tagaacgggt agccagtata ttaaatattg atagaaaaat
     2161 aaaagaactg gaaccagatt cagaa
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMProbeSetProbeSetProteinProteinPubMedPubMedSNPSNPTaxonomyTaxonomyUniGeneUniGeneLinkOutLinkOutHelpHelp  



    
 
PubMed Nucleotide Protein Genome Structure PopSet Taxonomy OMIM Books 
 
   Search PubMed Protein Nucleotide PopSet Taxonomy Genome OMIM Structure Domains GEO Books Books2 MapViewDr TestDb UniSTS CDD SNP Journals UniGene  for        
 
    Limits  Preview/Index  History  Clipboard  Details  
 
 
  Summary ASN.1 FASTA TinySeq XML GenBank GBSeq XML GI List Graphics XML default             
 
 

1: Y10343. H.sapiens GalNAc-...[gi:2292903] Links  


LOCUS       HSY10343                2541 bp    DNA     linear   PRI 27-OCT-2000
DEFINITION  H.sapiens GalNAc-T1 gene, 3'UTR.
ACCESSION   Y10343
VERSION     Y10343.1  GI:2292903
KEYWORDS    GalNAc-T1 gene.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Bennett,E.P., Weghuis,D.O., Merkx,G., van Kessel,A.G., Eiberg,H.
            and Clausen,H.
  TITLE     Genomic organization and chromosomal localization of three members
            of the UDP-N-acetylgalactosamine: polypeptide
            N-acetylgalactosaminyltransferase family
  JOURNAL   Glycobiology 8 (6), 547-555 (1998)
  MEDLINE   98256183
REFERENCE   2  (bases 1 to 2541)
  AUTHORS   Bennett,E.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JAN-1997) E.P. Bennett, School Of Dentistry, Histo
            Lab, Noerre Alle 20, Copenhagen, 2200N, DENMARK
FEATURES             Location/Qualifiers
     source          1..2541
                     /organism="Homo sapiens"
                     /isolate="P1 2190"
                     /db_xref="taxon:9606"
                     /clone_lib="P1"
     gene            1..2541
                     /gene="GalNAc-T1"
     CDS             <1..3
                     /gene="GalNAc-T1"
                     /codon_start=1
     3'UTR           4..2541
                     /gene="GalNAc-T1"
     polyA_signal    449..454
                     /gene="GalNAc-T1"
                     /note="putative (box I)"
     polyA_signal    1006..1011
                     /gene="GalNAc-T1"
                     /note="putative (box II)"
     polyA_signal    1206..1211
                     /gene="GalNAc-T1"
                     /note="putative (box III)"
     polyA_signal    1841..1846
                     /gene="GalNAc-T1"
                     /note="putative (box IV)"
BASE COUNT      824 a    402 c    440 g    875 t
ORIGIN      
        1 tgagaccaaa tttacaaaaa aacgaaaaaa ataaggattg actgggctac ctcagcatac
       61 atttctgcca cattcttaag tagcaaaaaa ggaaaagtgc tttcctcctc tgcaggatgt
      121 aaggtttatc agccattaaa acttagactt ctctagcttt tcactagctg tgaaccagcc
      181 ttcctgtcca tggacgtgaa actgcatagt aatgagactg tgcacactga tgtttacaag
      241 attgaaagag tctttctccg aaaatcatgg taaagaatac tgagacaatg aaaaaaaatc
      301 aacaaatatg ctttctggag aactgtacct tttatggttt gcttgcacat cagtagtttc
      361 tgctgaacgt gctgtcataa tgaagagatt tccaggattt tttttcctga ttagaactgg
      421 tagccagtat attaaatatt gatataaaaa taaaagaact ggaaccagat tccgaatctt
      481 gaaaacaaca ttttttacaa caaacaaaaa aactatatta aacagggttt aaaggaaaat
      541 taaaacagaa ctatgaagaa gtacaatttg ttatagtata gtatcaaatt tctatataga
      601 ttttatacct cagtggggaa aaataactga ttccaatgac attcattttg ttttcatctg
      661 tgatagtcat ggatgctttt attttccttg gggtgctgaa attgagctga aaaaaaaagg
      721 ctctttgaat atagttttaa tttctctcta cagttttttt tgtttggttt gtgggctgtt
      781 ggaattgtaa tttttaattg ccttctaaaa aatggaaatt taacaatgtc tgatctcagc
      841 tgaacaaatt agatgtttca gttgctcttg ggtcaactgg cttacagatt tacatgtgca
      901 cacacacaca aatttcttat cacattttcg acttcttcac ttgacctaac tgattatgcg
      961 aaatacccaa gattcatgct actgttccac atttgttttc acagcaataa atcttcagtt
     1021 ctgttgttta tgattccact taacaagggg cctgcaaatg tgatttatta tttgggtatt
     1081 tggagataat acatttgagg gttttttgga aaaccttttt cactccatac tcaaatatgc
     1141 ttcattgtca aatgcatatt taaattaaat tattgaattg taatgtttat ctgctgcttt
     1201 ttttaaataa aatttgactg aaaatgttta attggcattt tttaatgact tacccaagaa
     1261 aagtgcagct attattccat attaataggc ttgcatttct tttcctaaat cttatttagg
     1321 ctaaatcagt tttattgtcc tctgattttt tttaatacca cagaaatcac ctgagtgtca
     1381 attgaaaagt tgtcaattaa aaggtaacct tttaactctc gtaggaggaa tctcattaag
     1441 acatttttcc tgatatgtag agcagtctgt tggcaaaaat gcatatattt tctttcatat
     1501 ttgtaaaatt atatttaatg gaattctttt ctttgattat caaggacttt cactgcaggc
     1561 agtgctattt cttgtgccta agaatgtttc caaaagtcgc atcgctaatg atatttgcca
     1621 agttgagtgt acacaaagtt tctcatatcc tgttcaagtt aatcaacatc aaacacatgg
     1681 ggatgcttta gggtgagtct ataatacaaa atgcataaac catgtcccca ggaaatttga
     1741 aaggaagcaa gtgctgaatg gaattttttt ccttttccat gagctgtgtt aattctatct
     1801 ccagtaggcc taatgcttga aataagcaag atgtctaatc aataaattat tttcatgctc
     1861 agaatttcag gtttttgtac tccagcatag cttggtctta tttcttactg tatgaaagct
     1921 taacagcaat gtgatttaag gttttgtttt aaatgggaga tgtaagtgat ttaattcatg
     1981 ggtactttta gaacctgata gataatccca ttgcctttat ttttctaatt aaagaatcct
     2041 aaatactttg aaaatacaaa atattcctga atagttgtgc cttacattgg ttttatttga
     2101 agttagagtg gatccctgtt tatcataatt acttataatt aggcagatgc ttatcactct
     2161 attataactc aaattgggaa gaaaagaatg aaaaagaaaa caatggctgg gtactggctc
     2221 atacctgtaa ttttagcatt ttgggaaggt gagacgatag gaccacttga gggcaggagt
     2281 ttgaggttgc agtgagctat gatggtgcca ctgcactcca gcctgggcaa cagagaagaa
     2341 accatgtatc ttaaaaaaaa agaaaaaatg aaggatgaaa aagtgggatt attatttaac
     2401 cagtgaagaa ttcataaatg tgacaattaa ggataatttg aatctcacaa atctattcac
     2461 cccaccccag tcaaggtcta tttaatatat ctatcttcat tggcattctt gagaaataat
     2521 gctgccattg aacattgggg g
//



Revised: July 5, 2002.
 
 


Disclaimer | Write to the Help Desk
NCBI | NLM | NIH 

 

Oct 21 2002 11:56:56 

Related SequencesRelated SequencesMap ViewerMap ViewerOMIMOMIMPubMedPubMedSNPSNPTaxonomyTaxonomyUniSTSUniSTSLinkOutLinkOutHelpHelp  






&&&&&&&






