发明名称 Bulletin Board Data Mapping and Presentation
摘要 A computer-implemented method performed at a server system having one or more processors and memory, the method comprising receiving a set of curated documents comprising one or more documents identified as being relevant to a sector, analyzing the set of curated documents to determine one or more words and a count of each of the one or more words for all documents of the curated set of documents, further analyzing the set of curated documents, by analyzing one or more n-grams based on the one or more words, determining a first score based on a term frequency and a global document frequency of each of the one or more words of each of the one or more n-grams, determining a document vector based on averages of the first score, where the document vector comprises a perfect document for the sector, and storing the document vector in the data store.
申请公布号 US2016004705(A1) 申请公布日期 2016.01.07
申请号 US201514855290 申请日期 2015.09.15
申请人 Bitvore Corporation 发明人 Petrocik John;Chaney Alan;Bolcer Greg;Mogilev Andrey;Watters Kevin;Bollampalli Nirmisha
分类号 G06F17/30;G06K9/00 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer-implemented method performed at a server system having one or more processors and memory, the method comprising: receive a set of curated documents from a data store comprising one or more documents identified as being relevant to a sector; analyze, by a processor, the set of curated documents to determine one or more words and a count of each of the one or more words for all documents of the curated set of documents; further analyze the set of curated documents, by the processor, by analyzing one or more n-grams based on the one or more words; determine, by the processor, a first score based on a term frequency and a global document frequency of each of the one or more words of each of the one or more n-grams; determine, by the processor, a document vector based on averages of the first score, where the document vector comprises a perfect document for the sector; and store the document vector in the data store.
地址 Irvine CA US