发明名称 AUTOMATIC DOCUMENT CLASSIFICATION VIA CONTENT ANALYSIS AT STORAGE TIME
摘要 Techniques are disclosed for efficiently and automatically classifying textual documents or files. In some embodiments, the classification process is integrated into or otherwise made part of the storage function, such that when the user initiates a save process for a given file, the file is processed through a classifier prior to (or contemporaneously with) completing the save function. In some such embodiments, textual content of the file is analyzed using natural language processing to identify a main or substantial concept discussed in the file, and one or more corresponding tags are then assigned to that file. Subsequently, the user can access that file based on the one or more tags, for instance, through a user interface that allows the user to select one or more content categories associated with the assigned tags. The files can be text-based, but may include other content as well, such as images, video, and audio.
申请公布号 US2014156665(A1) 申请公布日期 2014.06.05
申请号 US201213692699 申请日期 2012.12.03
申请人 ADOBE SYSTEMS INCORPORATED 发明人 Kraley Michael
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A file classification system, comprising: a content extraction module configured to, in response to a storage request for a file, extract textual content of the file; and a classification engine configured to analyze the extracted textual content to determine a concept category to which the file can be assigned, and to assign corresponding tag information to the file.
地址 San Jose CA US