发明名称 Retrieval of records using phrase chunking
摘要 Methods are provided for generating phrase chunking rules for titles of records in a database. According to one method, the title of each record in a first set of records is part-of-speech tagged, and a plurality of phrase chunking rules are created based on patterns of part-of-speech tags in the tagged titles. The phrase chunking rules are applied to the titles of records in a second set of records so as to generate indexes for the records in the second set of records. In a preferred embodiment, the phrase chunking rules are modified if coverage of the second set of records by the phrase chunking rules does not reach a predetermined threshold. Also provided are methods for retrieving records from a database and systems for generating phrase chunking rules.
申请公布号 US2003105622(A1) 申请公布日期 2003.06.05
申请号 US20010005447 申请日期 2001.12.03
申请人 NETBYTEL, INC. 发明人 HOROWITZ DAVID M.;SANDERS JAMES E.;KASHIMBA JARED P.;SIMONE JOSEPH E.
分类号 G06F17/27;G06F17/30;(IPC1-7):G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址