Automatic extraction method for case semantic elements

An automatic extraction method for case semantic elements, applied in the fields of natural language processing and machine learning. It addresses problems such as the vocabulary gap and its impact on algorithm performance, and it characterizes the similarity between words, so as to improve extraction efficiency, improve case handling efficiency, and reduce the cost of manual extraction.

Inactive Publication Date: 2018-04-10
武汉烽火普天信息技术有限公司

AI Technical Summary

Problems solved by technology

[0007] To address the deficiencies of the prior art, the present invention provides an automatic extraction method for the semantic elements of a case, which solves the problem of the curse of dimensionality, especially when used a



Examples


Example Embodiment

[0019] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0020] The embodiment of the present invention provides a method for automatically extracting the semantic elements of a case. As shown in Figure 1, it includes the following steps:

[0021] S1. First, according to the possible parts of speech of the elements, use the pre-trained sequence tagging module to tag the brief case text and identify the nouns, verbs and gerunds in it; if the item involved is a noun, the tool of committing the crime is...
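
The filtering part of step S1 can be illustrated with a minimal sketch. It assumes the tagger has already produced (word, tag) pairs and that the tags follow an ICTCLAS/PKU-style scheme in which `n` marks nouns, `v` verbs and `vn` nominal verbs (gerunds). The tag names, the function `candidate_elements` and the sample tokens are illustrative assumptions, not part of the patent text.

```python
from typing import Iterable, List, Tuple

# Assumed tag set (ICTCLAS/PKU style): "n" noun, "v" verb, "vn" nominal verb (gerund).
CANDIDATE_TAGS = {"n", "v", "vn"}


def candidate_elements(tagged: Iterable[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Keep only the tokens whose tag marks them as possible case elements."""
    return [(word, tag) for word, tag in tagged if tag in CANDIDATE_TAGS]


# Hypothetical output of a sequence tagger for one brief case description.
tagged_case = [
    ("suspect", "n"), ("used", "v"), ("a", "m"),
    ("crowbar", "n"), ("for", "p"), ("burglary", "vn"),
]

candidates = candidate_elements(tagged_case)
print(candidates)
```

In practice the (word, tag) pairs would come from the pre-trained Chinese sequence tagging model; the sketch only shows how tagged output might be narrowed to nouns, verbs and gerunds.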



Abstract

The invention discloses an automatic extraction method for case semantic elements. The method comprises the following steps: (1) performing part-of-speech tagging on a brief case text with a sequence tagging module, according to a pre-trained Chinese sequence tagging model, so as to recognize the nouns, verbs and gerunds in the text; (2) enriching a seed lexicon through a word meaning similarity algorithm, using a word meaning similarity module; (3) carrying out several rounds of manual correction with an adjusting-optimizing module, so as to further adjust and optimize the element extraction result; and (4) creating a classifier library from the elements, organized by category, using a classifying lexicon module. With this method, the case semantic elements are automatically extracted from a detailed, complicated and irregular brief case text, the efficiency of semantic element extraction is greatly improved, and the cost of manual extraction is lowered. The system can also be applied to the semantic comprehension analysis of other unstructured text in the public security industry, and it provides an effective method for the correlation analysis of cases, so that case handling efficiency is improved.
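
As a rough illustration of steps (2) and (4), the sketch below grows a per-category seed lexicon by adding candidate words that are close to an existing seed. It assumes that words are represented as dense vectors and that "word meaning similarity" is measured by cosine similarity; the patent text shown here does not specify either. The function names, the toy vectors, the category labels and the threshold are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_lexicon(
    seeds: Dict[str, List[str]],
    vectors: Dict[str, Sequence[float]],
    candidates: List[str],
    threshold: float = 0.8,
) -> Dict[str, List[str]]:
    """Add each candidate to the category of its most similar seed word,
    provided that similarity reaches the (assumed) threshold."""
    expanded = {category: list(words) for category, words in seeds.items()}
    for word in candidates:
        if word not in vectors:
            continue
        best_category, best_sim = None, threshold
        for category, words in seeds.items():
            for seed in words:
                if seed in vectors:
                    sim = cosine(vectors[word], vectors[seed])
                    if sim >= best_sim:
                        best_category, best_sim = category, sim
        if best_category is not None and word not in expanded[best_category]:
            expanded[best_category].append(word)
    return expanded


# Toy two-dimensional vectors; real embeddings would be learned from text.
vectors = {
    "knife": [1.0, 0.1], "dagger": [0.9, 0.2],
    "wallet": [0.1, 1.0], "purse": [0.2, 0.9],
    "tree": [-1.0, -1.0],
}
seeds = {"tool": ["knife"], "item": ["wallet"]}
lexicon = expand_lexicon(seeds, vectors, ["dagger", "purse", "tree"])
print(lexicon)
```

The manual correction of step (3) would sit between expansion rounds: a reviewer removes wrongly added words before the enlarged lexicon is used as the seed set for the next pass.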

Description

Technical field

[0001] The invention relates to the technical fields of natural language processing and machine learning, and in particular to an automatic extraction method for case semantic elements.

Background

[0002] With the rapid development of natural language processing and machine learning, these technologies have been widely used in search engines, intelligent translation, machine question answering, user recommendation, public opinion monitoring and other related industries. Public security organs have accumulated a large amount of data in the long-term process of informatization, but at this stage the use of that data is still limited to simple queries over structured data, and unstructured or semi-structured data has not yet been applied in depth. At the same time, these unstructured or semi-structured data still have many problems: the information format is not uniform, and the information is described in diverse ways. Based o...


Application Information

IPC(8): G06F17/27
CPC: G06F16/3344, G06F40/289, G06F40/30
Inventor: 金勇, 邱爽, 李力, 陈云浩, 朱阳光
Owner: 武汉烽火普天信息技术有限公司