Abstract |
A device and a method for analyzing syntax by recognizing parallel structures are provided. They split the excessively long sentences found in patent documents with high accuracy, and improve the efficiency and accuracy of syntax analysis by dividing each sentence into smaller syntactic units. A chunking part (100) tags and partially parses an input raw English document. A node recognizer (200) recognizes the starting points of parallel nodes in the raw document. A similarity calculator (300) calculates a similarity weight between parallel nodes based on the lexical and part-of-speech similarity of the starting word, the head, and the word following the head. A parallel structure recognizer (400) searches all available parallel structures based on the calculated similarity weights, calculates a weight for each structure found, and recognizes the parallel structure of the raw document based on those structure weights. A parallel structure parser (500) parses the recognized parallel structure. A whole sentence parser (600) then parses the whole raw sentence again, taking the parallel structure parsing result as input.
|
Applicant |
ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE |
Inventors |
ROH, YOON HYUNG;CHOI, SUNG KWON;LEE, KI YOUNG;KWON, OH WOOG;PARK, SANG KYU;KIM, YOUNG KIL;KIM, CHANG HYUN;SEO, YOUNG AE;YANG, SEONG IL;HONG, MUN PYO |
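
The similarity weighting and structure selection described in the abstract can be illustrated with a minimal Python sketch. The abstract gives no formulas, so everything here is an assumption made for illustration: the `Node` type, the scores `LEXICAL_MATCH` and `POS_MATCH`, the averaging in `node_similarity` and `structure_weight`, and the exhaustive search in `best_parallel_structure` are hypothetical and are not the patented method. The part-of-speech tags in the usage note are likewise only example labels.

```python
from dataclasses import dataclass
from itertools import combinations
from typing import Sequence, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag)


@dataclass(frozen=True)
class Node:
    """A candidate parallel node, reduced to the three positions the abstract names."""
    start: Token       # starting word of the node
    head: Token        # head of the node
    after_head: Token  # word following the head


# Hypothetical scores: an identical word counts more than a shared part of speech.
LEXICAL_MATCH = 1.0
POS_MATCH = 0.5


def token_similarity(a: Token, b: Token) -> float:
    """Score one pair of tokens by lexical match first, then part-of-speech match."""
    if a[0].lower() == b[0].lower():
        return LEXICAL_MATCH
    if a[1] == b[1]:
        return POS_MATCH
    return 0.0


def node_similarity(x: Node, y: Node) -> float:
    """Average the token scores over the starting word, head, and word after the head."""
    pairs = [(x.start, y.start), (x.head, y.head), (x.after_head, y.after_head)]
    return sum(token_similarity(a, b) for a, b in pairs) / len(pairs)


def structure_weight(nodes: Sequence[Node]) -> float:
    """Assumed structure weight: mean similarity of adjacent nodes."""
    sims = [node_similarity(a, b) for a, b in zip(nodes, nodes[1:])]
    return sum(sims) / len(sims)


def best_parallel_structure(candidates: Sequence[Node]):
    """Try every in-order subset of two or more candidates; keep the highest weight."""
    best, best_weight = None, -1.0
    for size in range(2, len(candidates) + 1):
        for subset in combinations(candidates, size):
            weight = structure_weight(subset)
            if weight > best_weight:
                best, best_weight = subset, weight
    return best, best_weight
```

As a usage example, build three nodes such as `Node(("a", "DT"), ("chunking", "NN"), ("part", "NN"))`, `Node(("a", "DT"), ("node", "NN"), ("recognizer", "NN"))`, and `Node(("which", "WDT"), ("parses", "VBZ"), ("the", "DT"))`, then pass them as a list to `best_parallel_structure`. Under these assumed scores the first two nodes are grouped, because they share a starting word and have matching parts of speech at the other two positions. Exhaustive enumeration is exponential in the number of candidates, so it only suits short candidate lists.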