Title HIERARCHICAL DATA CLASSIFICATION USING FREQUENCY ANALYSIS
Abstract A method of classifying individual documents in a document collection according to a hierarchy may include selecting an object from the hierarchy, generating one or more variants for the object, and for each of the one or more variants, determining a frequency threshold based at least in part on how frequently the one or more variants occurs in the document collection. The method may also include selecting a first document in the document collection, where the first document includes one or more objects that match at least one of the one or more variants. The method may additionally include determining that the number of the one or more objects exceeds the frequency threshold and classifying the first document with the object in the hierarchy.
Publication number US2016342589(A1) Publication date 2016.11.24
Application number US201514716554 Filing date 2015.05.19
Applicant Oracle International Corporation Inventors Brugger Gerhard;Baum John Eric;Beghelli Filippo Ferdinando Paolo;Wilson Charles
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim 1. A method of classifying individual documents in a document collection according to a hierarchy, the method comprising: selecting an object from the hierarchy; generating one or more variants for the object; for each of the one or more variants, determining a frequency threshold based at least in part on how frequently the one or more variants occurs in the document collection; selecting a first document in the document collection, wherein the first document includes one or more objects that match at least one of the one or more variants; determining that the number of the one or more objects exceeds the frequency threshold; and based at least in part on the determination that the number of the one or more objects exceeds the frequency threshold, classifying the first document with the object in the hierarchy.
Address Redwood Shores CA US
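The steps recited in the principal claim (select an object, generate variants, derive a frequency threshold from the collection, count matches in a document, classify when the count exceeds the threshold) can be illustrated with a minimal Python sketch. This is not the patented implementation. Every name below (generate_variants, count_matches, frequency_threshold, classify, and the factor parameter) is hypothetical, the variant rule (lowercase form plus a naive plural) is an assumption, and so is the threshold rule (mean matches per document, shared by all variants of an object rather than computed per variant). The record itself does not specify any of these details.

```python
import re


def generate_variants(obj):
    """Hypothetical variant generator: lowercase form plus a naive plural."""
    base = obj.lower()
    return {base, base + "s"}


def count_matches(text, variants):
    """Count the tokens in `text` that equal one of the variants."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return sum(1 for token in tokens if token in variants)


def frequency_threshold(documents, variants, factor=1.0):
    """Assumed rule: factor times the mean number of matches per document."""
    if not documents:
        return 0.0
    total = sum(count_matches(doc, variants) for doc in documents)
    return factor * total / len(documents)


def classify(documents, hierarchy_objects, factor=1.0):
    """Map each document index to the hierarchy objects it is classified with."""
    labels = {index: [] for index in range(len(documents))}
    for obj in hierarchy_objects:
        variants = generate_variants(obj)
        threshold = frequency_threshold(documents, variants, factor)
        for index, doc in enumerate(documents):
            matches = count_matches(doc, variants)
            # Classify only when the document matches at least once and
            # its match count exceeds the collection-derived threshold.
            if matches > 0 and matches > threshold:
                labels[index].append(obj)
    return labels


documents = [
    "Oracle database tuning. The database index speeds up database queries.",
    "A recipe blog that mentions a database once.",
    "Gardening tips for spring.",
]
print(classify(documents, ["Database"]))
```

In this toy collection the first document contains three matches against a mean of 4/3 matches per document, so only that document is classified with the object "Database". A passing mention, as in the second document, stays below the threshold.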