发明名称 System and method for automated classification of text by time slicing
摘要 For use in an information processing system, there is disclosed a system and method for automatically classifying text. The system comprises a text classifier controller that reads text having one or more keywords contained within one or more story segments within the text. The text classifier controller identifies keywords within each line, and, in response to identifying at least one keyword within a line of text, classifies that line of text as a part of a story segment within the text. The text classifier controller also identifies keyword transition points in the text where the number of detected keywords in a particular category of keywords decreases below a threshold number. The text classifier controller also identifies keyword transition points in the text where the number of detected keywords in a particular category of keywords increases above a threshold number. The text classifier controller classifies story segments based on the location of the keyword transition points.
申请公布号 US6990496(B1) 申请公布日期 2006.01.24
申请号 US20000616631 申请日期 2000.07.26
申请人 KONINKLIJKE PHILIPS ELECTRONICS N.V. 发明人 MCGEE, III THOMAS FRANCIS;DIMITROVA NEVENKA
分类号 G06F17/00;G06F17/30;G06K9/62 主分类号 G06F17/00
代理机构 代理人
主权项
地址