Triad extraction method and device of safety report text and electronic equipment

A technology for reporting text and triples, which is applied in unstructured text data retrieval, electronic digital data processing, special data processing applications, etc., and can solve the problems of low screening accuracy, time-consuming and labor-intensive, etc.

Active Publication Date: 2021-04-23
北京天际友盟信息技术有限公司
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

It can ensure that the source of information is reliable, but the existing technology usually uses manual methods to screen and summarize threat intelligence from massive security reports, which is very time-consuming and laborious, and the accuracy of screening is biased due to the inability to always focus on the information itself. Low, that is, it is impossible to ensure that the key information can be extracted manually. Therefore, it is necessary to optimize the method and method of text extraction in the security field, which has become a new development direction.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Triad extraction method and device of safety report text and electronic equipment
  • Triad extraction method and device of safety report text and electronic equipment
  • Triad extraction method and device of safety report text and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033]In order to better understand the above technical solution, the above technical solution will be described in detail below with reference to the drawings and specific embodiments.

[0034]The ternary group can be extracted based on an event based on the syntax and the semantic role. Syntactic analysis is an important part of the natural language. It analyzes the syntactic structure of the sentence (subject to a host structure) and the dependency between the vocabulary (side, dependent, etc.). Through syntactic analysis, you can make a solid foundation for NLP application scenes in Semantic analysis, emotional tendency, viewpoint extraction. Syntactic analysis is divided into syntactic structure paings and dependency paingsing. The syntactic analysis of the syntactic structure or the full phrase structure of the entire sentence is called the constituent structure paings or the phraseStructure Parsing; the other is the syntactic analysis of the local component. , Known as dependenc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a triad extraction method of a safety report text. The method comprises the steps of obtaining the safety report text; performing clause processing on the security report text; carrying out word segmentation processing and part-of-speech tagging on the result of sentence segmentation processing by utilizing a word segmentation and part-of-speech tagging joint model constructed by fusing external knowledge, and outputting a word segmentation and part-of-speech tagging result; performing syntactic analysis on the segmented words and the part-of-speech tagging result based on a dependency syntactic analysis method, and obtaining grammatical components and the relation between the grammatical components in the segmented words and the part-of-speech tagging result; performing semantic role labeling, and obtaining arguments of a given predicate; and outputting the triad in the main and called guest form. The invention further discloses a corresponding device, electronic equipment and a computer readable storage medium; the safety report text is analyzed according to a syntactic analysis mode, triples are extracted, proper nouns of the security field are added in the word segmentation process, the model can better recognize the position and length of the subject, and the meaning of the subject-called bin in the triad can be better expressed to meet the requirements of accurate information analysis and screening.

Description

Technical field[0001]The present invention relates to the field of natural language identification and analysis processing, and more particularly to a three-way set extraction method, apparatus and electronic device for a safety report text.Background technique[0002]Threat intelligence is a student safety information, threat information can be used to assist in support decision-making or security analysis, unknown source and authenticity security information will affect the correctness of decision making and analysis results. Information is divided into two aspects, information source and information itself. The source of information can be ensured reliable, but the prior art is usually used in manual ways to screen and summarize threat information from massive safety reports. It is very expensive, and the accuracy of screening is not possible to pay attention to the information itself. Low, that is, manual is unable to ensure that the key information of it can be extracted, so it i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/211G06F40/289G06F40/253G06F40/242G06F16/36
CPCG06F16/367G06F40/211G06F40/242G06F40/253G06F40/289
Inventor 周子楠董龙飞
Owner 北京天际友盟信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products