(WO/2002/079750) MARKOVIAN DOMAIN FINGERPRINTING IN STATISTICAL SEGMENTATION OF PROTEIN SEQUENCES

(WO/2002/079750) MARKOVIAN DOMAIN FINGERPRINTING IN STATISTICAL SEGMENTATION OF PROTEIN SEQUENCES

Latest bibliographic data on file with the International Bureau
Pub. No.:  WO/2002/079750   International Application No.:  PCT/IL2002/000278
Publication Date:10.10.2002 International Filing Date:04.04.2002
IPC: C07K 1/00 (2006.01)
Applicants:YISSUM RESEARCH DEVELOPMENT COMPANY OF THE HEBREW UNIVERSITY OF JERUSALEM [IL/IL]; P.O. Box 4279, Jabotinsky Street 46, 91 042 Jeursalem (IL) (All Except US).
TISHBY, Naftali [IL/IL]; (IL) (US Only).
SELDIN, Yevgeny [IL/IL]; (IL) (US Only).
BEJERANO, Gill [IL/IL]; (IL) (US Only).
MARGALIT, Hanah [IL/IL]; (IL) (US Only).
Inventors:TISHBY, Naftali; (IL).
SELDIN, Yevgeny; (IL).
BEJERANO, Gill; (IL).
MARGALIT, Hanah; (IL).
Agent:G. E. EHRLICH (1995) LTD.; Bezalel Street 28, 52 521 Ramat Gan (IL).
Priority Data:
60/281,627 30.03.2001 US
Title: MARKOVIAN DOMAIN FINGERPRINTING IN STATISTICAL SEGMENTATION OF PROTEIN SEQUENCES
Abstract: Apparatus for automatic segmentation of non-aligned data sequences comprising structural domains to identify and construct models of the structural domains. The apparatus comprises a soft clustering unit, a refinement unit and an annealing unit. The soft clustering unit iteratively partitions the data sequences and trains variable memory Markov sources, created using a prediction suffix tree data structure, on the data until convergence is reached. The clustering unit also eliminates sources showing low relationships with the data. The refinement unit is connected to the soft clustering unit and splits and perturbs the sources following convergence, to repeat the iterative partitioning at the soft clustering unit, thereby to refine the model. The annealing unit increases the resolution with which the relationships between data and sources is shown, thereby governing the way in which less competitive sources are rejected, and the apparatus outputs the surviving variable memory Markov sources to provide models for subsequent identification of the structural domains.
Designated States: AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZM, ZW.
African Regional Intellectual Property Org. (ARIPO) (GH, GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW)
Eurasian Patent Organization (EAPO) (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM)
European Patent Office (EPO) (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE, TR)
African Intellectual Property Organization (OAPI) (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG).
Publication Language:English (EN)
Filing Language:English (EN)

PATENTSCOPE®

Related Links

E-Newsletters