发明名称 AUTOMATED DISCOVERY USING TEXTUAL ANALYSIS
摘要 An example method includes receiving text from a plurality of documents, segmenting text received text of the plurality of documents, calculating a frequency statistic for each segment of each document, determining segments of potential interest of each document based on calculated frequency statistic, calculating distances between each document of the plurality of documents based on a text metric, and storing segments of potential interest of each document and the distances in a search database. The method may further include receiving a search query and performing a search of information contained in the search database, partitioning documents of search results using the distances, for each partition, determining labels of segments of potential interest for documents of that particular partition, the labels being determined based on a plurality of frequency statistics, and providing determined labels of segments of potential interest for documents of each partition.
申请公布号 US2015074124(A1) 申请公布日期 2015.03.12
申请号 US201414481546 申请日期 2014.09.09
申请人 Ayasdi, Inc. 发明人 Sexton Harlan;Kloke Jennifer
分类号 G06F17/30;G06F17/27 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method comprising: receiving text from a plurality of documents; segmenting text received text of the plurality of documents; calculating a frequency statistic for each segment of each document; determining segments of potential interest of each document based on calculated frequency statistic; calculating distances between each document of the plurality of documents based on a text metric; and storing segments of potential interest of each document and the distances in a search database.
地址 Menlo Park CA US