发明名称 |
Field-based similarity search system and method |
摘要 |
A field-based similarity search system includes an input device which inputs a query molecule, and a processor which partitions a conformational space of the query molecule into a fragment graph including an acyclic graph including plural fragment nodes connected by rotatable bond edges, computes a property field on fragment pairs of fragments of the query molecule from the fragment graph, the property field including a local approximation of a property field of the query molecule, constructs a set of features of the fragment pairs based on the property field, the features including a set of local, rotationally invariant, and moment-based descriptors generated from all conformations of the fragment graph of the query molecule, and weights the descriptors according to importance as perceived from a training set of descriptors to generate a context-adapted descriptor-to-key mapping which maps the set of descriptors to a set of feature keys. |
申请公布号 |
US8805622(B2) |
申请公布日期 |
2014.08.12 |
申请号 |
US201213597385 |
申请日期 |
2012.08.29 |
申请人 |
International Business Machines Corporation |
发明人 |
Pitman Michael C.;Fitch Blake G.;Horn Hans W.;Huber Wolfgang;Rice Julia E.;Swope William C. |
分类号 |
G01N33/48;G01N31/00;G06F7/60;G06F17/10;G06F19/22;G06F19/12;G06F19/24 |
主分类号 |
G01N33/48 |
代理机构 |
McGinn IP Law Group, PLLC |
代理人 |
Alexanian Vazken;McGinn IP Law Group, PLLC |
主权项 |
1. A field-based similarity search system, comprising:
an input device which inputs a query molecule; and a processor which:
partitions a conformational space of said query molecule into a fragment graph comprising an acyclic graph including plural fragment nodes connected by rotatable bond edges;computes a property field on fragment pairs of fragments of said query molecule from said fragment graph, said property field comprising a local approximation of a property field of said query molecule;constructs a set of fragment pairs for description of each candidate molecule wherein each set is known as a set of candidate molecule fragment pairs;constructs a set of features of said fragment pairs based on said property field, said features comprising a set of local, rotationally invariant, and moment-based descriptors generated from all conformations of said fragment graph of said query molecule; andweights said descriptors according to importance as perceived from a training set of descriptors to generate a context-adapted descriptor-to-key mapping which maps said set of descriptors to a set of feature keys comprising indices that label grid cells in discriminant space, wherein a candidate molecule fragment pair is within a set of candidate molecule fragment pairs that describe a candidate molecule stored in a feature database to said query molecule, and wherein the number of candidate molecule fragment pairs in the set, Cfp, is given by Cfp=(n−1)×(Cfrag)2×Crbe, where n is the number of fragments in said candidate molecule and n−1 is the number of rotatable bond edges connecting n fragments, Cfrag is a number of conformations in a fragment of said fragments and Crbe is a number of steps in which said rotatable bond edges are sampled. |
地址 |
Armonk NY US |