发明名称 METHOD AND SYSTEM FOR IDENTIFYING DEPENDENT COMPONENTS
摘要 Embodiments include processing a data structure representing a dependency matrix having columns representing respective first components and rows representing respective second components. Aspects include assigning each cell of the matrix a value indicative of the level of dependency or indicative of an unknown dependency of a pair of first and second components forming the cell and assigning each component of the first and second components an affiliation vector indicative of the strength of affiliation of the component to N predefined initial clusters of cells of the matrix. Aspects also include determining a probability model using the affiliations vectors parameters and estimating the parameters of the probability model for a plurality of different numbers of clusters starting from the initial number N of clusters. Aspects further include computing a score for the parameters of the probability model estimated and selecting the parameters of the probability model with the highest computed score.
申请公布号 US2016063392(A1) 申请公布日期 2016.03.03
申请号 US201514935476 申请日期 2015.11.09
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Heckel Reinhard Wolfram;Vasileiadis Vasileios;Vlachos Michail
分类号 G06N7/00 主分类号 G06N7/00
代理机构 代理人
主权项 1. A computer implemented method for processing a data structure representing a dependency matrix having a plurality of columns representing respective first components and a plurality of rows representing respective second components, the method comprising: assigning each cell of the matrix a value indicative of the level of dependency or indicative of an unknown dependency of a pair of first and second components forming the cell; assigning each component of the first and second components an affiliation vector fu and fi respectively indicative of the strength of affiliation of the component to N predefined initial clusters of cells of the matrix, and initializing the affiliation vectors with predefined values; determining a probability model using the affiliations vectors fu and fi as parameters, wherein the probability model presents the probabilities for a first component of the first components to depend on a second component of the second components; estimating the parameters of the probability model for a plurality of different numbers of clusters starting from the initial number N of clusters using the matrix; computing a score for the parameters of the probability model estimated for each of the plurality of different numbers of clusters; selecting the parameters of the probability model with the highest computed score; using the selected parameters of the probability model to identify cells of unknown dependencies pairs of first and second components that depend on each other.
地址 ARMONK NY US