摘要 |
The invention concerns linguistic analysis. In particular the invention involves a method of operating a computer to perform linguistic analysis. In another aspect the invention is a computer system which implements the method, and in a further aspect the invention is software for programming a computer to perform the method. The method comprising the steps of: Splitting an input text into words and sentences. For each sentence, comparing phrases in the sentence with known phrases stored in a database, as follows: For each word in the sentence, comparing its value and the values of the words following it with the values of the words of stored phrases, starting with the longest stored phrase that starts with that word, and working from longest to shortest. In the event a match is found for two or more consecutive words, and considering the words around the phrase, labelling the matched phrase with an overphrase that describes the grammar use of the matched phrase. After the penultimate word has been compared, recasting the sentence by replacing the matched phrases by their respective overphrases. Then repeating the comparison process with the recast sentence until there is no further recasting. WO 2008/080190 PCT/AU2007/002005 Split Input Text into Words Sentence for Parse Round j to Wj(1) Wj(2) ... Wj (Lj) TEXT PARSING (Parse Round j) Match unknown phrases in the Input Sentence with known phrases in the database. See Fig. 2. 140 Changes Made? OI IDOTITI IT OLICCT IDI II [ nL.% DrtIA I I |