Method and device for extracting log data

A log and data technology, applied in the field of data analysis, can solve the problem of low accuracy and achieve the effect of accurate extraction results

Active Publication Date: 2017-04-26
NEUSOFT CORP
View PDF11 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of the above problems, the present invention provides a method and device for extracting log data to solve the problem of low accuracy of specific content in existing extracted logs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for extracting log data
  • Method and device for extracting log data
  • Method and device for extracting log data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0065] In order to solve the problem of low accuracy of specific content in existing extracted logs, an embodiment of the present invention provides a method for extracting log data, such as figure 1 As shown, the method includes:

[0066] 101. Obtain a target field.

[0067] Wherein, the target field is selected by the user from the preset log samples through the input device and is used to extract data of the same category as th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and device for extracting log data, and relates to the technical field of data analysis. The problem of relatively low accuracy of specific contents in the existing extraction logs is solved. The method disclosed by the invention comprises the following steps: acquiring a target field; generating a regular expression set corresponding to the target field according to different generation strategies; respectively performing regular matching on a to-be-matched log according to each regular expression in the regular expression set, wherein each regular expression is matched with one matching datum at most; calculating a weight sum of all regular expressions corresponding to each matching datum and a weight value of the weight sum of all regular expressions to obtain a matching value of the corresponding matching datum; and determining the matching datum with the maximum matching value as the datum having the same category as the target field in the to-be-matched log. The method and device disclosed by the invention are used in a log analysis process.

Description

technical field [0001] The invention relates to the technical field of data analysis, in particular to a method and device for extracting log data. Background technique [0002] When analyzing a large number of logs, it is usually necessary to extract some specific content in each log, such as IP address, generation time, and so on. Although log content usually follows a certain pattern, this pattern is often cryptic and not easy to obtain intuitively. Therefore, when extracting some specific content, the corresponding regular expression is usually designed according to the extracted content, and then the specific content in the log is extracted according to the regular expression. [0003] Usually the accuracy of the regular expression directly affects the accuracy of the extracted content, so the generation of the regular expression is very important. There are mainly two existing methods for generating regular expressions: one is a manual method, and the other is an aut...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/34
CPCG06F11/3476
Inventor 吴擒龙
Owner NEUSOFT CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products