Unlock instant, AI-driven research and patent intelligence for your innovation.

Chinese named entity recognition method and system

A technology for named entity recognition and Chinese, which is applied in instruments, biological neural network models, electrical digital data processing, etc., can solve problems such as complex Chinese structures, and achieve the effect of improving accuracy and accuracy

Pending Publication Date: 2022-04-08
SHANDONG NORMAL UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Second, the Chinese structure is more complex

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese named entity recognition method and system
  • Chinese named entity recognition method and system
  • Chinese named entity recognition method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0036] Such as figure 1 As shown, this embodiment provides a Chinese named entity recognition method, and this embodiment uses the method applied to a server for illustration. It can be understood that this method can also be applied to terminals, and can also be applied to terminals and servers and system, and through the interaction between the terminal and the server. The server can be an independent physical server, or a server cluster or distributed system composed of multiple physical servers, or it can provide cloud services, cloud database, cloud computing, cloud function, cloud storage, network server, cloud communication, intermediate Cloud servers for basic cloud computing services such as software services, domain name services, security service CDN, and big data and artificial intelligence platforms. The terminal may be a smart phone, a tablet computer, a laptop computer, a desktop computer, a smart speaker, a smart watch, etc., but is not limited thereto. The t...

Embodiment approach

[0052] As one or more implementations, the determining the vector representation of the n-gram feature according to the extracted n-gram feature specifically includes: generating the feature vector of each character and the different byte length fragments corresponding to the beginning of the character The vectors are concatenated to obtain a vector representation of the n-gram features corresponding to each character.

[0053] As one or more implementation manners, after obtaining the vector representation of the n-gram feature includes: performing quantitative coding on the vector representation of the n-gram feature.

[0054] Specifically, in the second step, use the n-gram language model to extract byte segments with lengths of 2, 3, 4, and 5. Given a sequence, the 2-gram, 3-gram, 4-gram, 5-gram of the sequence such as image 3 shown.

[0055] Each byte fragment is called a gram, which counts the frequency of occurrence of all grams, and filters them according to a prese...

Embodiment 2

[0060] This embodiment provides a Chinese named entity recognition system.

[0061] A Chinese named entity recognition system, comprising:

[0062] A segmentation module, which is configured to: obtain the segment to be processed, and segment the processed segment by character;

[0063] The gated graph neural network module is configured to: obtain the node information corresponding to each character in the segment by using the gated graph neural network based on the segment to be processed;

[0064] A feature extraction module, which is configured to: extract an n-gram feature from the sentence to be processed, and determine a vector representation of the n-gram feature according to the extracted n-gram feature;

[0065] The recognition module is configured to: after splicing the node information and the vector representation of the n-gram feature, through a two-way long-short-term memory network, obtain a Chinese named entity recognition result.

[0066] What needs to be e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of natural language processing, and provides a Chinese named entity recognition method and system, and the method comprises the steps: obtaining a to-be-processed phrase, and segmenting the processed phrase according to characters; based on the to-be-processed phrase, obtaining node information corresponding to each character in the phrase by adopting a gated graph neural network; extracting an n-gram feature from the to-be-processed phrase, and determining a vector representation of the n-gram feature according to the extracted n-gram feature; and after splicing the node information and the vector representation of the n-gram feature, obtaining a Chinese named entity recognition result through a bidirectional long-short-term memory network.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and in particular relates to a Chinese named entity recognition method and system. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] Named Entity Recognition (NER) is a very basic and important task in the field of Natural Language Processing (NLP). Its purpose is to automatically discover named entities from plain text and identify their corresponding categories, where entities Including personal names, place names, organization names, time and digital expressions (including time, date, currency amount and percentages, etc.), etc., are widely used in machine translation, information retrieval, information extraction, entity connection and other fields. Existing named entity recognition models are usually a sequence labeling task, that is, to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06N3/02
Inventor 徐卫志董继坤龙开放于惠孙胤泰朱继然赵晗耿艳芳曹洋
Owner SHANDONG NORMAL UNIV