摘要 |
PROBLEM TO BE SOLVED: To robustly normalize a sentence even when the sentence is hardly normalized robustly by syntax analysis.SOLUTION: A sentence normalization system 10 includes: an input part 11 which inputs a sentence; a morphological analysis part 12 which divides the sentence into a word string and estimates a part of speech of each divided word; a separation part 13 separates the divided word string into a content part including a content of the sentence and a sentence end based on the estimated part of speech of each word; a content word string extraction part 14 which extracts a content word string which is content information showing the content of the sentence from an independent word included in the content part; a semantic label string extraction part 15 which extracts a semantic label string which is function information showing a functional expression of the sentence from the sentence end; and a symbol string combining part 16 which combines the content word string and the semantic label string, and outputs it as a normalized expression of the sentence. |