Model training method and device, entity extraction method and device and storage medium

A model training and model technology, applied in instruments, electrical digital data processing, calculation, etc., can solve the problem of low accuracy and achieve the effect of improving accuracy

Pending Publication Date: 2021-09-21
ZHEJIANG DAHUA TECH CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The present invention provides a method for model training, a method for entity extraction, a device, and a storage medium to solve the technical problem that the accuracy of entity extraction in public security documents is not high in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and device, entity extraction method and device and storage medium
  • Model training method and device, entity extraction method and device and storage medium
  • Model training method and device, entity extraction method and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] Embodiments of the present invention provide a model training method, entity extraction method, device and storage medium to solve the technical problem in the prior art that the accuracy of entity extraction in public security documents is not high.

[0052] Terminology Explanation

[0053] Entity extraction: A subtask of information extraction that extracts predefined entities, such as time and place, from text data.

[0054] word2vec model: is a model used to generate word vectors. This model is a shallow, two-layer neural network that is trained to reconstruct linguistic word texts. The network is represented by words and needs to guess the input words in adjacent positions. Under the assumption of the word bag model in word2vec, the order of words is not important. After the training is completed, the word2vec model can be used to map each word to a vector, which can be used to represent the relationship between words and words, and the vector is the hidden layer...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a model training method, an entity extraction method, a model training device, an entity extraction device and a storage medium, which are used for solving the technical problem of low accuracy of entity extraction in public security texts in the prior art. The model training method comprises the following steps: performing word segmentation and part-of-speech tagging on each historical public security text in a historical public security text set; counting vocabularies in each word segmentation result in the tagged historical public security text set, the frequency of occurrence in each part-of-speech and the frequency of occurrence in adjacent positions of each part-of-speech to obtain corresponding word frequency probabilities and word use probabilities; generating a word vector of each vocabulary, and combining the word vector with the corresponding word frequency probability and word use probability to form a feature vector of the corresponding vocabulary; dividing all the feature vectors into a training set and a test set, wherein the training set is used for training the conditional random field model, and the test set is used for testing the trained conditional random field model; and stopping the training until the accuracy rate of a test result reaches a set threshold value to obtain the trained conditional random field model.

Description

technical field [0001] The invention relates to the field of public security information processing, in particular to a model training method, an entity extraction method, a device and a storage medium. Background technique [0002] In the field of public security, when handling various cases, investigators usually need to consult criminal investigation records and past case records related to the current case, and analyze these records to form a relevant knowledge map to analyze the current case. , Digging clues, stringing together cases, etc., in order to quickly solve the case. [0003] However, with the passage of time, more and more information about cases and persons involved in the public security system has been recorded. After finding the records related to the current case from the massive data, the investigators need to extract key elements from these case records ( Such as task, time, location, time, etc.), and establish the relationship between key elements, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06F40/216
CPCG06F40/295G06F40/216
Inventor 操涛涛陈立力周明伟
Owner ZHEJIANG DAHUA TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products