发明名称 Automatic generation of task scripts from web browsing interaction history
摘要 Embodiments of the invention relate to automatically identifying web browsing tasks based on a web browsing interaction history. According to one embodiment of the invention, a web browsing interaction history of a user is analyzed to identify web browsing actions associated with web sites. Abstracted action sequences for the web browsing actions that are identified are generated, and action subsequences for the abstracted action sequences are generated. A similarity between each of the action subsequences is determined, and similar action subsequences are designated as a web browsing task.
申请公布号 US9626159(B2) 申请公布日期 2017.04.18
申请号 US201213561378 申请日期 2012.07.30
申请人 International Business Machines Corporation 发明人 Mahmud Jalal U.
分类号 G06F11/34;G06F9/44;G06F17/30 主分类号 G06F11/34
代理机构 Fleit Gibbons Gutman Bongini Bianco PL 代理人 Fleit Gibbons Gutman Bongini Bianco PL ;Grzesik Thomas S.
主权项 1. A method comprising: analyzing a web browsing interaction history of a user to identify a plurality of web browsing actions previously performed by the user at one or more web sites; generating a plurality of abstracted action sequences for a sequence of web browsing actions in the plurality of web browsing actions for at least one website of the one or more websites, wherein generating the plurality of abstracted sequences comprises constructing a feature vector for each of the plurality of web browsing actions identified for the at least one website, wherein the feature vector at least identifies a web content element and an action performed on the web content element; generating, for the website, a plurality of action subsequences for the abstracted action sequences, wherein generating the plurality of action subsequences comprises organizing each abstracted sequence in the plurality of abstracted sequences into one of a plurality of clusters based on the feature vector constructed for the abstracted sequence, wherein each cluster in the plurality of clusters represents a different feature vector class and is associated with an action-class label identifying the feature vector class, replacing each abstracted sequence in the plurality of abstracted sequences with the action-class label of the cluster corresponding to the abstracted sequence, and generating the plurality of action subsequences based on each action-class label representing each abstracted sequence in the plurality of abstracted sequences, wherein each action subsequence in the plurality of action subsequences comprises an increasing number of action-class labels representing an abstracted sequence; determining a similarity between each of the action subsequences in the plurality of action subsequences; designating, based on the similarity that has been determined, similar action subsequences in the plurality of action subsequences as a web browsing task previously accomplished by the user at the web site, wherein the web browsing task is composed of at least one sequence of actions previously performed by the user at the web site to accomplish a goal, and wherein at least one action in the at least one sequence of actions is performed on at least one web content element of the web site; and automatically generating a set of executable scripts based on the similar action subsequences designated as the web browsing task, where web browsing actions utilized to generate the set of executable scripts are web browsing actions only from the web browsing interaction history, and wherein the set of executable scripts automatically accomplishes the web browsing task at one or more websites without requiring the user to perform an initial action at the one or more websites.
地址 Armonk NY US