发明名称 System and method for parsing a document using one or more break characters
摘要 A parsing system and method are provided in which the break characters in the document are used to rapidly parse the document and extract one or more key phrases from the document which characterize the document. The break characters in the document may include explicit break characters, such as punctuation, soft stop words and hard stop words. The determination of which phrases in the document are extracted depends upon the type of break character appearing after the phrase in the document.
申请公布号 US6424982(B1) 申请公布日期 2002.07.23
申请号 US19990288994 申请日期 1999.04.09
申请人 SEMIO CORPORATION 发明人 VOGEL CLAUDE
分类号 G06F17/30;G06F17/27;(IPC1-7):G06F17/21 主分类号 G06F17/30
代理机构 代理人
主权项
地址