发明名称 METHOD FOR ESTIMATING FORMAT OF LOG MESSAGE AND COMPUTER AND COMPUTER PROGRAM THEREFOR
摘要 A technique for estimating a format of a log message (LM) according to the present invention includes creating a first directed graph structure by dividing a first LM by predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the first LM; creating a second directed graph structure by performing on a second LM the same processing as that performed on the first LM; comparing nodes in the first directed graph structure with nodes in the second directed graph structure to detect nodes other than nodes including a corresponding character string; adding to the first directed graph structure the node detected in the second directed graph structure among the detected nodes as a first branch node; and estimating the format, based on the first directed graph structure including the first branch node added thereto.
申请公布号 US2017060724(A1) 申请公布日期 2017.03.02
申请号 US201615349033 申请日期 2016.11.11
申请人 International Business Machines Corporation 发明人 Mizutani Masayoshi
分类号 G06F11/34;G06N5/04;G06F17/30 主分类号 G06F11/34
代理机构 代理人
主权项 1. A computer for estimating a format of a log message, the computer comprising: directed graph structure creation means for creating a first directed graph structure by dividing a first log message by predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the first log message, and creating a second directed graph structure by dividing the second log message by the predetermined characters to define divided portions as nodes and arranging the nodes in order from the beginning of the second log message; node detection means for comparing nodes in the first directed graph structure with nodes in the second directed graph structure to detect a node in the first directed graph structure and a node in the second directed graph structure that are nodes other than nodes including a corresponding character string; directed graph structure change means for adding to the first directed graph structure the node detected in the second directed graph structure among the detected nodes as a first branch node; and format estimation means for estimating the format, based on the first directed graph structure including the first branch node added thereto, wherein the format includes a first portion associated with a node including a corresponding character string, a second portion associated with a node whose appearance tendency of character string is similar between the node detected in the first directed graph structure and the node detected in the second directed graph structure, and, optionally, a third portion associated with a node other than nodes having a similar appearance tendency of character string.
地址 Armonk NY US