Abstract |
PROBLEM TO BE SOLVED: To provide a similarity calculation apparatus and a similarity calculation method that prevent the reliability of a similarity calculation result from degrading when calculating the similarity between data having missing values and other data. SOLUTION: The similarity calculation apparatus includes: a reception means for receiving first and second data, each having, for each attribute, an area in which an attribute value for that attribute is either described or not described; a storage means for storing, for each attribute, distribution information indicating the distribution of values that the attribute value can take; a first calculation means that, for each attribute whose attribute value is described in both the first and second data, calculates the similarity between the attribute values, and, for each attribute whose attribute value is missing in at least one of the first and second data, calculates the expected value of the similarity between the attribute values from the distribution information; and a second calculation means for calculating the similarity between the first and second data from the similarities and expected similarities calculated by the first calculation means. COPYRIGHT: (C)2010,JPO&INPIT |
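
The two calculation steps described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: attributes are treated as categorical, the per-attribute similarity is an exact-match score (1.0 or 0.0), missing values are assumed independent of each other and of the known values, and the overall similarity is taken as the plain mean of the per-attribute scores. The names `attribute_similarity`, `expected_similarity`, and `record_similarity`, and the sample distributions, are hypothetical.

```python
from typing import Dict, Optional

Record = Dict[str, Optional[str]]   # attribute -> value, None if missing
Distribution = Dict[str, float]     # possible value -> probability


def attribute_similarity(a: str, b: str) -> float:
    """Assumed per-attribute similarity: exact match on categorical values."""
    return 1.0 if a == b else 0.0


def expected_similarity(a: Optional[str], b: Optional[str],
                        dist: Distribution) -> float:
    """Expected similarity when at least one of the two values is missing.

    Missing values are assumed to follow `dist` independently.
    """
    if a is None and b is None:
        # Both missing: average over every pair of possible values.
        return sum(pa * pb * attribute_similarity(va, vb)
                   for va, pa in dist.items()
                   for vb, pb in dist.items())
    known = a if a is not None else b
    # One missing: average over the possible values of the missing side.
    return sum(p * attribute_similarity(known, v) for v, p in dist.items())


def record_similarity(x: Record, y: Record,
                      dists: Dict[str, Distribution]) -> float:
    """Combine per-attribute scores (assumed here: unweighted mean)."""
    scores = []
    for attr, dist in dists.items():
        a, b = x.get(attr), y.get(attr)
        if a is not None and b is not None:
            scores.append(attribute_similarity(a, b))       # both described
        else:
            scores.append(expected_similarity(a, b, dist))  # value missing
    return sum(scores) / len(scores)


# Hypothetical distribution information for two attributes.
dists = {
    "color": {"red": 0.5, "blue": 0.5},
    "size": {"S": 0.25, "M": 0.5, "L": 0.25},
}
x = {"color": "red", "size": "M"}
y = {"color": "red", "size": None}   # "size" is missing in the second record

print(record_similarity(x, y, dists))  # 0.75
```

In this example the "color" attribute matches (score 1.0), while the missing "size" contributes the probability that it equals "M" (0.5), so the missing value neither counts as a match nor as a mismatch.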