Log carrier format extraction method and device based on natural language
A natural language and extraction method technology, applied in the field of natural language-based log carrier format extraction, can solve problems such as lack of, unrecognizable and extracted information, and achieve the effect of reducing manual intervention and improving analysis
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0045] figure 1 A method for extracting a log carrier format based on natural language in an embodiment of the present invention may include:
[0046] S101. Split the accessed original log stream into streams corresponding to each log data segment through context word segmentation. Preferably, in this step, character strings separated by delimiters in the original log stream are extracted as streams corresponding to each log data segment.
[0047] The present invention is applicable to an original log stream in a predetermined format, preferably a log in which a plurality of log data segments are separated by delimiters, and each log data segment includes a data field (key), a connector or an operator, and a data value (value ). For example, the power plant equipment status information log is referred to as the power plant equipment log for short. The log data segments contained in the power plant equipment log include but are not limited to: log date, log time, power plant...
Embodiment 2
[0084] On the basis of Embodiment 1, the present invention also provides a power plant equipment log parsing method, the flow chart of which is as follows figure 2 shown.
[0085]The power plant equipment log parsing method of the second embodiment includes the following steps:
[0086] S1, access to the original log stream;
[0087] S2. Obtain the stored log carrier format;
[0088] S3. Use the stored log carrier format to match and analyze the accessed original log stream. If the matching and analysis is successful, go to step S6; otherwise, go to step S4; in this step, you can use all stored log carrier formats to match the original log stream in turn. Parsing, as long as a log carrier format can be successfully matched, the matching and parsing is considered successful; if all log carrier formats cannot be matched, the matching and parsing is considered to have failed;
[0089] In this step, the following regular expressions can be used to match and analyze the incomin...
Embodiment 3
[0097] On the basis of the first embodiment, the present invention also provides a method for judging abnormality of a power plant equipment log.
[0098] The log carrier format extraction method based on natural language in the embodiment 3 can be used to extract the log carrier format of the power plant equipment during the normal operation period and save it; and then use the saved log carrier format to update The obtained logs are matched and analyzed. If the matching analysis is successful, it is judged that the power plant equipment is normal; if the matching analysis is unsuccessful, it is judged that the power plant equipment is faulty, and an alarm message is generated. Preferably, the above matching method is regular matching, for example, the following regular expression is used:
[0099] $pattern = ' / date = (.* ),time=(.* ), devname=(.* ),device is (.* ), sever is not (.* ) / ';
[0100] preg_match_all($pattern, original log, $matches), $matches is the ma...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com