发明名称 |
APPARATUS AND METHOD FOR RECOGNIZING AND CLASSIFYING NAMED ENTITIES FROM TEXT DOCUMENT USING REPEATED LEARNING |
摘要 |
PURPOSE: An apparatus and a method for recognizing and classifying named entities from text document using repeated learning are provided to gradually extend a range for recognizing the object name by automatically extending a pattern rule and a vocabulary dictionary through the repeated learning. CONSTITUTION: A language quality extractor(10) extracts a language quality needed for recognizing the object name from a general text document set(11) inputted from the outside. A vocabulary dictionary extender(20) decides an object name list(22) and a vocabulary candidate to be added by applying the current pattern rule to the document set, and extends the vocabulary dictionary(12). A pattern rule extender(30) extends the pattern rule list(21) by generating/certifying a new pattern rule candidate as applying the vocabulary dictionary to the document set.
|
申请公布号 |
KR20040038559(A) |
申请公布日期 |
2004.05.08 |
申请号 |
KR20020067571 |
申请日期 |
2002.11.01 |
申请人 |
ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE |
发明人 |
LEE, HYEON SUK;WANG, JI HYEON;YOON, BO HYEON |
分类号 |
G06F17/21;(IPC1-7):G06F17/21 |
主分类号 |
G06F17/21 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|