发明名称 CONCEPTUAL DOCUMENT ANALYSIS AND CHARACTERIZATION
摘要 Data files are received from data sources that include textual content. The data files are categorized using a taxonomy of categories, where each category has sample textual content that defines a concept for the category. The categorizing includes comparing the textual content of the data file with the sample textual content for the category. A file score is calculated for each data file to compare the degree of similarity between the defined concept of the category and a determined concept for the data file. Each data file is associated with the category if the file score is equal to or greater than a pre-determined minimum score for the category. A portion of the data file and/or file score is be provided.
申请公布号 WO2016176310(A1) 申请公布日期 2016.11.03
申请号 WO2016US29532 申请日期 2016.04.27
申请人 ALTEP INC. 发明人 MILLER, Roger, W.;VAN DEN BERGE, Willem, R.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址