Sequence labeling method based on deep learning and application

A technology of sequence labeling and deep learning, applied in the field of NLP natural language processing, can solve the problems of not using multiple models for joint labeling, low efficiency, labeling speed and accuracy affecting model update, etc., to shorten the time of manual labeling, labeling The accuracy of the improved, the effect of rapid labeling

Pending Publication Date: 2022-07-01
NAT UNIV OF DEFENSE TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the past, data labeling required a lot of manpower and material resources for labeling, which was inefficient. Common labeling software could not adapt to multilingual labeling. The entity pronoun model of the corpus could not be well identified and extracted, and common labeling tools could not be targeted. Thesaurus, for the tagged corpus, cannot be automatically tagged. Although the previous ta

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sequence labeling method based on deep learning and application
  • Sequence labeling method based on deep learning and application
  • Sequence labeling method based on deep learning and application

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not conflict with each other.

[0038] like figure 1 and figure 2 As shown, the deep learning-based sequence labeling method according to the embodiment of the present invention includes the following steps:

[0039] Step 1: Obtain the text to be annotated and preprocess the text.

[0040] Preferably, the training data used is Wikipedia data; data source: crawling from Wikipedia.

[0041] Classify the acquired structured data...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a sequence labeling method based on deep learning and application. The method comprises the following steps: acquiring a to-be-labeled text, and preprocessing the text; performing text translation processing and error correction processing on the preprocessed text; constructing a rule dictionary and a regular expression, pre-annotating the text after error correction processing based on the rule dictionary and the regular expression, and outputting pre-annotated content; and inputting the pre-labeled text into a plurality of sequence labeling models based on deep learning, and calculating the optimal labeling content according to the weight values of the plurality of sequence labeling models for the output result. According to the method, the sequence labeling efficiency and accuracy can be improved.

Description

technical field [0001] The invention belongs to the technical field of NLP natural language processing, and relates to a sequence labeling method and application based on a deep learning model. Background technique [0002] In the application process of knowledge graph, sequence tagging is an indispensable link. Sequence tagging models are widely used in text processing related fields, such as word segmentation, part-of-speech tagging, and named entity recognition. In the past, data labeling required a lot of manpower and material resources for labeling, which was inefficient. Ordinary labeling software could not adapt to multi-language labeling. The entity pronoun model of the corpus could not be well identified and extracted, and ordinary labeling tools could not be included in a targeted manner. Thesaurus cannot realize automatic labeling of labeled corpus. Although the previous labeling software has multiple models, it does not use multiple models to jointly label, or si...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/117G06F40/211G06F40/242G06F40/295G06F40/56G06N3/04G06N3/08
CPCG06F40/295G06F40/56G06F40/242G06F40/211G06F40/117G06N3/08G06N3/044
Inventor 贾国辉谢伟张晨王玮昕王敏闫凯张友根
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products