发明名称 |
INTERACTIVE RECOMMENDATION OF DATA SETS FOR DATA ANALYSIS |
摘要 |
A data analysis platform provides recommendations for datasets for analysis. Given a user selected dataset, for example resulting from a search,
automatically identifies other datasets based a variety of different types of relationships, including lineage, structural, content, usage, classification, and organizational/social. Datasets for each type of relationship are identified and scored for relevance, and ranked. Selected ones of the ranked data sets are presented in a recommendation interface. As the user selects from recommended dataset, additional datasets are automatically recommended based in inferences made according to the selected dataset and relationship. |
申请公布号 |
US2016328406(A1) |
申请公布日期 |
2016.11.10 |
申请号 |
US201615150296 |
申请日期 |
2016.05.09 |
申请人 |
Informatica LLC |
发明人 |
Convertino Gregorio;Gujjewar Abhiram;Kanchwala Firoz |
分类号 |
G06F17/30;G06F3/0482;G06F3/0484 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A computer executed method of recommending datasets for data analysis, comprising:
receiving a user selection of a first dataset; determining a context corresponding to the user selection of the first dataset; determining, based on the first dataset and determined context, one or more dataset recommenders, each of the one or more recommenders corresponding to a relationship type between datasets; determining a plurality of second datasets related to the first dataset based on the relationship types; scoring each of the plurality of second datasets using a relevance ranking algorithm specific to the corresponding relationship type to score the relevance of the of the second dataset to first dataset; ranking the plurality of second datasets based on the scoring; selecting a subset of the ranked datasets as the recommended datasets; and presenting the recommended datasets in a graphical user interface, wherein the recommended datasets are grouped by relationship type to the first dataset. |
地址 |
Redwood City CA US |