摘要 |
<P>PROBLEM TO BE SOLVED: To automatically find out a candidate for a pair of a substance name and its chemical formula in an electronic document. <P>SOLUTION: A generation device extracts a character string "ethane (H3CH3)" estimated to have its substance name and rational formula described in synonymous expression in document information 100-1. The generation device extracts a word "ethane" right before the parentheses in the character string "ethane (H3CH3)". The generation device determines whether or not the word "ethane" is included in a substance name DB in which substance names, and characters and character strings related to the substance names are registered. When the word "ethane" is included, the generation device specifies the word "ethane" as a substance name, and specifies the character string "CH3CH3" having the English letters in the parentheses as a rational formula. The generation device registers the specified substance name "ethane" and rational formula "CH3CH3" in a correspondence relation DB 200. <P>COPYRIGHT: (C)2013,JPO&INPIT |