发明名称 INTERACTIVE RECOMMENDATION OF DATA SETS FOR DATA ANALYSIS
摘要 A data analysis platform provides recommendations for datasets for analysis. Given a user selected dataset, for example resulting from a search, automatically identifies other datasets based a variety of different types of relationships, including lineage, structural, content, usage, classification, and organizational/social. Datasets for each type of relationship are identified and scored for relevance, and ranked. Selected ones of the ranked data sets are presented in a recommendation interface. As the user selects from recommended dataset, additional datasets are automatically recommended based in inferences made according to the selected dataset and relationship.
申请公布号 US2016328406(A1) 申请公布日期 2016.11.10
申请号 US201615150296 申请日期 2016.05.09
申请人 Informatica LLC 发明人 Convertino Gregorio;Gujjewar Abhiram;Kanchwala Firoz
分类号 G06F17/30;G06F3/0482;G06F3/0484 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer executed method of recommending datasets for data analysis, comprising: receiving a user selection of a first dataset; determining a context corresponding to the user selection of the first dataset; determining, based on the first dataset and determined context, one or more dataset recommenders, each of the one or more recommenders corresponding to a relationship type between datasets; determining a plurality of second datasets related to the first dataset based on the relationship types; scoring each of the plurality of second datasets using a relevance ranking algorithm specific to the corresponding relationship type to score the relevance of the of the second dataset to first dataset; ranking the plurality of second datasets based on the scoring; selecting a subset of the ranked datasets as the recommended datasets; and presenting the recommended datasets in a graphical user interface, wherein the recommended datasets are grouped by relationship type to the first dataset.
地址 Redwood City CA US