发明名称 Systems and methods for finding project-related information by clustering applications into related concept categories
摘要 A system, method, and computer-readable medium, is described that finds similarities among programming applications based on semantic anchors found within the source code of such applications. The semantic anchors may be API calls, such as Java's package and class calls of the JDK. Latent Semantic Indexing may be used to process the application and semantic anchor data and automatically develop a similarity matrix that contains numbers representing the similarity of one program to another.
申请公布号 US8832655(B2) 申请公布日期 2014.09.09
申请号 US201113248877 申请日期 2011.09.29
申请人 Accenture Global Services Limited 发明人 Grechanik Mark
分类号 G06F9/44;G06F9/45;G01N33/48;G01N33/50;G06F17/00;G06F17/30;G06F7/00 主分类号 G06F9/44
代理机构 Finnegan, Henderson, Farabow, Garrett & Dunner, L.L.P. 代理人 Finnegan, Henderson, Farabow, Garrett & Dunner, L.L.P.
主权项 1. A computer-implemented method of determining whether applications are similar, comprising: receiving, by a computer, source code for a plurality of applications; associating, for each application, semantic anchors found in the source code for that application with the application, wherein associating semantic anchors comprises building at least one weighted term document matrix from the semantic anchors, the at least one weighted term document matrix comprising at least a first term weighted based on at least a number of the plurality of applications in which a first semantic anchor is present in the source code for those applications; comparing, based on the semantic anchors, a similarity of the first application to a second application; and assigning, based on the comparison, a number representing the similarity of the first and second applications.
地址 Dublin IE