Unlock instant, AI-driven research and patent intelligence for your innovation.

Log variable semantic annotation method

A semantic annotation and log technology, applied in semantic analysis, natural language data processing, file system, etc., can solve inappropriate problems

Active Publication Date: 2021-11-16
SICHUAN UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

There are big defects in the description of entity similarity in this method. Only the similarity of variable values ​​is considered. In addition, it is not appropriate to use exact matching to measure the similarity between variable values ​​in the log.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Log variable semantic annotation method
  • Log variable semantic annotation method
  • Log variable semantic annotation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments. The present invention is divided into two links on the whole, and its structure is as follows figure 1 As shown, first, the part-of-speech tagging is performed on the words in the log pattern, and the words that can be used to represent the semantics of the log variables are found; second, the similarity characterization method and semantic inference algorithm between the log variables are proposed, from the content similarity The log variables are similarly described in two dimensions such as structural similarity and structural similarity. By describing the similarity between the log variables that have been marked in the first link and the remaining unmarked log variables, it is inferred that the log variables that have not been marked The semantics of the annotated log variables.

[0071] Part-of-speech tagging is the first step in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a log variable semantic annotation method, which comprises the following steps of: firstly, analyzing a log based on a log analysis algorithm to obtain a log mode of a log set; according to the position of the variable in the log mode, judging the part-of-speech of words near the position of the variable, and obtaining a log variable list with known semantics and a log variable list with unknown semantics; secondly, according to a variable value set in the log mode, describing the similarity between log variables from six dimensions including the overlapping performance of variable values, the distribution characteristic of the variable values, the diversity similarity of the variable values, the statistical characteristic similarity of the variable values, the variable position similarity and the neighbor variable similarity; and finally, based on an inference algorithm, judging whether the two log variables subjected to similarity description are matched or not, further identifying the log variables with unknown semantics as log variables with known semantics, and completing semantic annotation of the log variables. According to the method, the accuracy of the log variable labeling result can be effectively improved.

Description

technical field [0001] The invention relates to the technical field of log automatic parsing, in particular to a log variable semantic labeling method. Background technique [0002] With the rapid development and widespread popularization of Internet applications, log messages have exploded. At present, many studies analyze log messages to mine potential valuable information. However, log messages are mostly unstructured or semi-structured text data. Before analysis, structured fields need to be extracted from log messages. This process is called log parsing; after extracting fields, in order to help analysts understand the fields In order to facilitate the structural analysis of log messages using the log analysis platform, it is also necessary to assign appropriate semantics to the structured fields. This process is called log variable annotation. [0003] The problem that log variable labeling actually solves is how to assign reasonable semantics to log variables after ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/30G06F40/284G06K9/62G06F16/18
CPCG06F40/30G06F40/284G06F16/1815G06F18/22
Inventor 罗永刚陈兴蜀邹峰袁磊刘朋黄铁脉廖志红宋可儿王海舟王文贤
Owner SICHUAN UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More