Abstract |
An object of the present invention is to provide a method for configuring a pattern recognizer using versatile, readily available data, comprehensive protein data, and comprehensive chemical data; a further object is to provide a method for predicting an unknown interaction of a pair using a pattern recognizer configured by that method. In particular, an interaction such as the coupling between a protein and a chemical is used as an index. For each of a first pair having a first interaction and a second pair having a second interaction, at least one of four parameters derived from mass spectrum data obtained from each chemical is vectorized: the position of a peak, the set of the position and intensity of the peak, the distance between two peaks, and the set of the positions and intensities of the two peaks. An amino acid sequence of each protein is also vectorized. A vector is then created that contains the elements of the vector derived from each protein and the elements of the vector derived from the chemical paired with that protein. A support vector machine (SVM) is trained on these vectors, whereby the pattern recognizer is configured to discriminate between the class to which the first pair belongs and the class to which the second pair belongs.
|
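The vector construction described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the method as claimed: the abstract does not say how the amino acid sequence is vectorized, so amino acid composition is assumed here; the bin width, the m/z range, and all function names (`protein_vector`, `peak_position_vector`, `peak_distance_vector`, `pair_vector`) are hypothetical. Only two of the four mass spectrum parameters (peak position and distance between two peaks) are shown.

```python
from itertools import combinations

# Assumption: the 20 standard amino acids, one vector element per residue type.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def protein_vector(sequence):
    """Amino acid composition (assumed encoding): fraction of each residue."""
    seq = sequence.upper()
    if not seq:
        return [0.0] * len(AMINO_ACIDS)
    return [seq.count(aa) / len(seq) for aa in AMINO_ACIDS]


def peak_position_vector(peaks, max_mz=100, bin_width=10):
    """Binary vector with 1.0 in every m/z bin that contains a peak.

    `peaks` is a list of (m/z, intensity) tuples; binning is an assumption.
    """
    n_bins = max_mz // bin_width
    vec = [0.0] * n_bins
    for mz, _intensity in peaks:
        idx = int(mz // bin_width)
        if 0 <= idx < n_bins:
            vec[idx] = 1.0
    return vec


def peak_distance_vector(peaks, max_mz=100, bin_width=10):
    """Binary vector with 1.0 for every binned distance between two peaks."""
    n_bins = max_mz // bin_width
    vec = [0.0] * n_bins
    for (mz_a, _), (mz_b, _) in combinations(peaks, 2):
        idx = int(abs(mz_a - mz_b) // bin_width)
        if idx < n_bins:
            vec[idx] = 1.0
    return vec


def pair_vector(sequence, peaks):
    """Concatenate protein-derived and chemical-derived elements into one vector."""
    return (
        protein_vector(sequence)
        + peak_position_vector(peaks)
        + peak_distance_vector(peaks)
    )


# Toy protein-chemical pair (made-up values, for illustration only).
x = pair_vector("MKVA", [(15.0, 100.0), (43.0, 40.0), (58.0, 10.0)])
```

Vectors built this way for first-interaction pairs and second-interaction pairs, each labeled with its class, would then be passed to any SVM implementation for training; the trained classifier plays the role of the pattern recognizer.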