Abstract |
Systems and methods of performing a process are provided, including receiving a live document video stream of a remote collaboration session, detecting a cursor action in the live document video stream, classifying the detected cursor action into an action category, detecting key frames of the live document video stream, indexing the detected key frames based on the action category, detecting a keyword in the indexed key frames, indexing the key frames using the category, visualizing the cursor action in the key frames based on the action category, and displaying the visualized cursor action. |
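The classification and indexing steps named in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration only: the `KeyFrame` record, the category names (`"none"`, `"point"`, `"gesture"`), the `still_radius` threshold, and the distance heuristic are hypothetical and are not taken from the source, which does not say how cursor actions are classified or how the index is structured. The sketch also assumes that cursor positions and on-screen words have already been extracted from each key frame.

```python
from collections import defaultdict
from dataclasses import dataclass
from math import hypot
from typing import Dict, List, Tuple


@dataclass
class KeyFrame:
    """Hypothetical record for one detected key frame."""
    timestamp: float                        # seconds into the stream
    words: List[str]                        # keywords found in the frame (e.g. via OCR)
    cursor_path: List[Tuple[float, float]]  # cursor positions observed near the frame


def classify_cursor_action(path: List[Tuple[float, float]],
                           still_radius: float = 5.0) -> str:
    """Assumed heuristic: no cursor -> 'none'; cursor stays within
    still_radius pixels of its first position -> 'point'; otherwise 'gesture'."""
    if not path:
        return "none"
    x0, y0 = path[0]
    if all(hypot(x - x0, y - y0) <= still_radius for x, y in path):
        return "point"
    return "gesture"


def build_index(key_frames: List[KeyFrame]) -> Dict[Tuple[str, str], List[float]]:
    """Map (action category, lower-cased keyword) to the timestamps of the
    key frames in which that keyword appears with that cursor action."""
    index = defaultdict(list)
    for frame in key_frames:
        category = classify_cursor_action(frame.cursor_path)
        for word in frame.words:
            index[(category, word.lower())].append(frame.timestamp)
    return dict(index)
```

Under these assumptions, a lookup such as `build_index(frames).get(("point", "budget"), [])` would return the timestamps of key frames in which the cursor rested while the word "budget" was on screen. The visualization and display steps of the abstract are not covered by this sketch.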