Method, device and electronic equipment for extracting triples from security report text

A technique for extracting triples from security report text, applicable to unstructured text data retrieval and electronic digital data processing. It addresses the low accuracy and the time cost of manual screening, improving efficiency and eliminating ambiguity.

Active Publication Date: 2021-08-27
北京天际友盟信息技术有限公司

AI Technical Summary

Problems solved by technology

The reliability of an information source can be ensured, but existing approaches usually rely on manual work to screen and summarize threat intelligence from massive volumes of security reports. This is time-consuming and laborious, and screening accuracy is low because a human reviewer cannot stay focused on the information itself at all times, so there is no guarantee that key information is extracted. Optimizing text extraction methods for the security field has therefore become a new direction of development.



Examples


Detailed description of embodiments

[0033] To make the above technical solution easier to understand, it is described in detail below with reference to the accompanying drawings and specific embodiments.

[0034] Triples can be extracted from events on the basis of dependency syntax and semantic role labeling. Syntactic analysis is an important part of natural language processing. It analyzes the syntactic structure of a sentence (the subject-verb-object structure) and the dependency relationships between words (coordination, subordination, etc.). Syntactic analysis lays a solid foundation for NLP applications in the security field, such as semantic analysis, sentiment orientation analysis, and opinion extraction. It is divided into syntactic structure parsing and dependency parsing. Syntactic analysis that aims to obtain the syntactic structure of an entire sentence, or its complete phrase structure, is called constituent structure parsing or phr...
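The relationship between a dependency parse and a subject-predicate-object triple can be illustrated with a small sketch. It is not the patented implementation: the `Token` structure, the `nsubj`/`obj` label names, and the hand-annotated example sentence are all assumptions made for illustration, and a real system would obtain the parse from a trained dependency parser.

```python
from typing import List, NamedTuple, Tuple


class Token(NamedTuple):
    idx: int      # 1-based position in the sentence
    text: str
    head: int     # index of the governing token; 0 marks the root
    deprel: str   # dependency label (assumed label set: nsubj, obj, det, ...)


def subtree_text(tokens: List[Token], root_idx: int) -> str:
    """Join a head word and all of its dependents, in sentence order."""
    keep = {root_idx}
    changed = True
    while changed:  # grow the set until no new dependents are found
        changed = False
        for tok in tokens:
            if tok.head in keep and tok.idx not in keep:
                keep.add(tok.idx)
                changed = True
    return " ".join(t.text for t in tokens if t.idx in keep)


def extract_triples(tokens: List[Token]) -> List[Tuple[str, str, str]]:
    """Return (subject, predicate, object) for every head with both arguments."""
    triples = []
    for verb in tokens:
        subj = next((t for t in tokens
                     if t.head == verb.idx and t.deprel == "nsubj"), None)
        obj = next((t for t in tokens
                    if t.head == verb.idx and t.deprel == "obj"), None)
        if subj is not None and obj is not None:
            triples.append((subtree_text(tokens, subj.idx),
                            verb.text,
                            subtree_text(tokens, obj.idx)))
    return triples


# Hand-annotated parse of a made-up sentence:
# "The attackers exploited a known vulnerability"
SENTENCE = [
    Token(1, "The", 2, "det"),
    Token(2, "attackers", 3, "nsubj"),
    Token(3, "exploited", 0, "root"),
    Token(4, "a", 6, "det"),
    Token(5, "known", 6, "amod"),
    Token(6, "vulnerability", 3, "obj"),
]

TRIPLES = extract_triples(SENTENCE)
```

Expanding each argument to its full subtree keeps modifiers such as "known" attached to the object, which is one simple way to preserve the span of each triple element.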


Abstract

The invention discloses a method for extracting triples from security report text, including: obtaining the security report text; performing sentence segmentation on the security report text; performing word segmentation and part-of-speech tagging and outputting the results; performing syntactic analysis on the word segmentation and part-of-speech tagging results using dependency parsing to obtain the grammatical components and the relationships between them; performing semantic role labeling to obtain the arguments of a given predicate; and outputting triples in subject-predicate-object form. Corresponding devices, electronic equipment and computer-readable storage media are also disclosed. The security report text is analyzed by syntactic analysis and triples are extracted. Proper nouns from the security field are added during word segmentation so that the model can better recognize the position and length of the subject and better express the meaning of the subject, predicate and object in each triple, meeting the needs of accurate intelligence analysis and screening.
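The first two steps of the method, sentence segmentation and word segmentation with security-domain terms added, can be sketched as follows. This is a minimal illustration under stated assumptions, not the method as claimed: it works on whitespace-delimited English text, the `DOMAIN_TERMS` lexicon and the sample report are hypothetical, and forward maximum matching is one common way to honor a custom dictionary rather than a technique the source names.

```python
import re
from typing import List, Set, Tuple

# Hypothetical domain lexicon; a real one would hold curated security terms.
DOMAIN_TERMS: Set[Tuple[str, ...]] = {
    ("remote", "access", "trojan"),
    ("spear", "phishing"),
}
MAX_TERM_LEN = max(len(term) for term in DOMAIN_TERMS)


def split_sentences(text: str) -> List[str]:
    """Split on whitespace that follows sentence-ending punctuation."""
    parts = re.split(r"(?<=[.!?])\s+", text)
    return [p.strip() for p in parts if p.strip()]


def tokenize(sentence: str) -> List[str]:
    """Lowercase, split into words, then merge multi-word lexicon entries."""
    words = re.findall(r"\w+", sentence.lower())
    tokens: List[str] = []
    i = 0
    while i < len(words):
        # Forward maximum matching: try the longest lexicon entry first.
        for size in range(min(MAX_TERM_LEN, len(words) - i), 1, -1):
            if tuple(words[i:i + size]) in DOMAIN_TERMS:
                tokens.append(" ".join(words[i:i + size]))
                i += size
                break
        else:
            # No multi-word entry starts here; emit the single word.
            tokens.append(words[i])
            i += 1
    return tokens


# Made-up report text, for illustration only.
REPORT = ("The group used spear phishing emails. "
          "The attachment installed a remote access trojan.")

SENTENCES = split_sentences(REPORT)
TOKENIZED = [tokenize(s) for s in SENTENCES]
```

Keeping a term such as "remote access trojan" as a single token means later part-of-speech tagging and dependency parsing see it as one noun, which is the effect the abstract attributes to adding domain proper nouns during word segmentation.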

Description

Technical field

[0001] The invention relates to the technical field of natural language recognition, analysis and processing, and in particular to a method, device and electronic equipment for extracting triples from security report text.

Background

[0002] Threat intelligence is security information that has been researched and judged. It can be used to support decision-making or security analysis. Security information of unknown origin and authenticity undermines the correctness of decisions and the accuracy of analysis results. Research and judgment of information covers two aspects: the source of the information and the information itself. The reliability of the source can be ensured, but existing approaches usually rely on manual work to screen and summarize threat intelligence from massive volumes of security reports, which is very time-consuming and laborious, and the accuracy of screening is biased due to the inability to always focus...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/211, G06F40/289, G06F40/253, G06F40/242, G06F16/36
CPC: G06F16/367, G06F40/211, G06F40/242, G06F40/253, G06F40/289
Inventor: 周子楠, 董龙飞
Owner: 北京天际友盟信息技术有限公司