Title of invention: METHOD AND APPARATUS FOR TEXT NORMALIZATION USING EXTENSIBLE MARKUP LANGUAGE (XML)
Abstract: A device and a method for normalizing text using XML (eXtensible Markup Language) are provided. Applying XML makes text normalization easier to maintain, simplifies tuning to the characteristics of the input data, and reduces errors in text normalization and rule induction by making it easy to apply dynamic features and to verify the data in advance. A storing part stores text sentences. A parser splits each stored sentence into word units and parses the resulting text data into semantic units for semantic analysis. An XML data converter converts the parsed word-unit text data into XML data based on a first predefined XML DTD (Document Type Definition). An attribute setting part reads the converted XML data, checks the preceding and following context of each word unit, and sets an attribute on the word unit based on that context. Once an attribute value has been set for each word unit, an XML data generator produces the final XML data used to convert the text data according to the text normalization.
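The pipeline described in the abstract (split into word units, convert to XML, set an attribute per word from its neighbouring context, emit the final XML) can be illustrated with a minimal sketch. This is not the patented implementation: the element names `sentence` and `word`, the `type` attribute, its values, and the context rules are all hypothetical, and the record does not reproduce the predefined DTD, so no DTD validation is performed here.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical context cues; the patent record does not list actual rules.
UNIT_WORDS = {"kg", "km", "cm"}
YEAR_CUES = {"in", "since"}


def parse_words(sentence):
    """Split a stored sentence into word units (whitespace split as a stand-in)."""
    return sentence.split()


def to_xml(words):
    """Convert word units to XML; element names are assumptions, not the patent's DTD."""
    root = ET.Element("sentence")
    for word in words:
        ET.SubElement(root, "word").text = word
    return root


def set_attributes(root):
    """Set a 'type' attribute on each word from its preceding/following context."""
    words = list(root)
    for i, el in enumerate(words):
        prev_text = words[i - 1].text.lower() if i > 0 else ""
        next_text = words[i + 1].text.lower() if i + 1 < len(words) else ""
        if re.fullmatch(r"\d+", el.text):
            if next_text in UNIT_WORDS:
                el.set("type", "quantity")
            elif prev_text in YEAR_CUES:
                el.set("type", "year")
            else:
                el.set("type", "number")
        else:
            el.set("type", "plain")
    return root


def normalize_to_xml(sentence):
    """Run the full sketch and return the final XML as a string."""
    root = set_attributes(to_xml(parse_words(sentence)))
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(normalize_to_xml("Since 1999 it weighs 3 kg"))
```

A later normalization step could then read each `type` value to decide how to expand a token, for example reading a `year` differently from a `quantity`. Keeping the rules in the attribute-setting stage is what would let them be tuned without touching the parser or the generator.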
Publication number: KR100631086(B1) Publication date: 2006.09.26
Application number: KR20050066713 Filing date: 2005.07.22
Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE Inventor: PARK, KYOUNG HYUN
Classification: G06F17/27 Main classification: G06F17/27
Agency: Agent:
Principal claim:
Address: