Unlock instant, AI-driven research and patent intelligence for your innovation.

Chinese named entity recognition method and system based on pen painting convolution and word vectors

A technology of named entity recognition and word vectors, applied in neural learning methods, instruments, biological neural network models, etc., can solve problems such as insufficient semantics, reduced accuracy of named entity recognition, and inability of NER model to correctly identify entities, etc., to achieve improved results , the effect of improving accuracy

Pending Publication Date: 2022-04-08
BEIJING UNIV OF TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The word-based tagging method first uses the word segmentation tool to segment the text, and then performs entity recognition. The word boundary of this type of method is also an entity boundary. If an error occurs in the word segmentation stage, the subsequent NER model will not be able to correctly identify the entity.
The word-based sequence tagging method usually has insufficient semantics, so people mainly consider how to make better use of word information. Some applications introduce external lexical information based on the word-based sequence tagging method and integrate it into the word vector at the input layer. In the representation, this makes the model itself changed, and the introduction of external word vectors also makes the model training efficiency low, and finally the accuracy of named entity recognition is reduced; The ElMo model established by the stroke sequence has defects in the effectiveness and accuracy of named entity recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese named entity recognition method and system based on pen painting convolution and word vectors
  • Chinese named entity recognition method and system based on pen painting convolution and word vectors
  • Chinese named entity recognition method and system based on pen painting convolution and word vectors

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0064] Below in conjunction with accompanying drawing, the present invention is described in further detail:

[0065] Such as figure 1 As shown, the Chinese named entity recognition method based on stroke convolution and word vector provided by the present invention includes:

[0066] S1. Obtain the stroke sequence corresponding to each Chi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a Chinese named entity recognition method and system based on pen and picture convolution and word vectors, and relates to the technical field of named entity recognition, and the method comprises the steps: obtaining a stroke sequence corresponding to each Chinese character in a text and a character feature vector of each Chinese character; inputting the stroke sequence into a pen-painting convolutional neural network to obtain a stroke feature vector; setting a sliding window according to the maximum length of an entity in the text, and obtaining a word vector of each character in the sliding window through a self-attention mechanism; splicing the stroke feature vector, the word vector and the character feature vector of each Chinese character in the text, and inputting into the BiLSTM network to obtain the score of each Chinese character corresponding to each entity tag; and determining an optimal entity label for each Chinese character in the text by adopting a CRF model. According to the method, the influence of the stroke sequence of the Chinese character on the Chinese character is considered, and the named entity recognition is performed after the stroke feature vector, the word feature vector and the character feature vector of the Chinese character are combined, so that the named entity recognition effect is improved.

Description

technical field [0001] The present invention relates to the technical field of named entity recognition, in particular to a Chinese named entity recognition method and system based on stroke convolution and word vectors. Background technique [0002] With the rapid development of Internet technology and the continuous growth of unstructured data, we are in an era of massive and unstructured data. How to efficiently manage data and extract effective information from unstructured data has become an urgent problem to be solved. [0003] The purpose of Named Entity Recognition (NER) is to identify defined named entities from unstructured text, such as person names, place names, organization names, etc., which is the basic core task of information retrieval and information extraction. Chinese NER is a division of NER in the Chinese field. Due to the characteristics of Chinese characters, there are still many problems in Chinese NER. The main difficulties of Chinese NER are as f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/295G06N3/04G06N3/08
Inventor 何东之张震王鹏飞孙亚茹郭隆杭
Owner BEIJING UNIV OF TECH