Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Chinese text named entity recognition method

A named entity recognition and named entity technology, applied in the field of big data, can solve the problems of poor named entity recognition performance and low recognition accuracy, and achieve the effect of improving recognition accuracy, robustness and performance.

Pending Publication Date: 2019-11-19
GUANGDONG UNIV OF TECH
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing Chinese named entity recognition method is to recognize the named entity of the text through single character recognition or word recognition, the performance of named entity recognition is poor, and the recognition accuracy is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese text named entity recognition method
  • Chinese text named entity recognition method
  • Chinese text named entity recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0045] see figure 1 , figure 1 It is an implementation flowchart of a method for identifying Chinese text named entities in an embodiment of the present invention, and the method may include the following steps:

[0046] S101: When a named entity recognition request is received, the named entity recognition request is parsed to obtain the Chinese text to be recognized.

[0047] When named entity recognition is required, a named entity recognition request can be sent to the recognition system. The recognition system receives the named entity recognition request, and parses the named entity recognition request to obtain the Chinese text to be recognized.

[0048] S102: Taking the Chinese text to be recognized as an analysis unit, extract character features, word features and entire sentence features in each sentence.

[0049] The Chinese text to be recognized generally contains multiple sentences, and the Chinese text to be recognized can be analyzed using sentences as the an...

Embodiment 2

[0059] see figure 2 , figure 2 It is another implementation flowchart of the method for identifying Chinese text named entities in the embodiment of the present invention, and the method may include the following steps:

[0060] S201: When a named entity recognition request is received, parse the named entity recognition request to obtain the Chinese text to be recognized.

[0061] S202: Taking the sentence as the analysis unit of the Chinese text to be recognized, removing stop words and punctuation marks of each sentence, and performing word segmentation processing on each sentence according to a preset vocabulary, to obtain each word.

[0062] After analyzing the Chinese text to be recognized, the Chinese text to be recognized can be analyzed with sentences as the unit of analysis, and the stop words and punctuation marks of each sentence can be removed. The interference to the subsequent named entity recognition is avoided, and the recognition accuracy is further impro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese text named entity recognition method which comprises the following steps: when a named entity recognition request is received, analyzing the named entity recognitionrequest to obtain a to-be-recognized Chinese text; respectively extracting character features, word features and whole sentence features in each sentence of the to-be-recognized Chinese text by takingthe sentence as an analysis unit; respectively splicing the character features and the word features of each word in the Chinese text to be recognized and the sentence features of the sentence wherethe word features are located to obtain a feature sequence respectively corresponding to each word; extracting a context feature of each feature sequence to obtain a context feature extraction result;and according to a context feature extraction result, labeling each named entity of the to-be-recognized Chinese text from each word by utilizing a Markov transfer matrix method. According to the method, the named entity recognition performance is greatly improved, and the recognition accuracy is improved. The invention furthermore discloses a Chinese text named entity identification apparatus and device, and a storage medium, which have corresponding technical effects.

Description

technical field [0001] The present invention relates to the field of big data technology, in particular to a method, device, equipment and computer-readable storage medium for identifying Chinese text named entities. Background technique [0002] Named Entity Recognition (NER) refers to the process of identifying specific object transaction names or symbols from text. Named entity recognition technology is an indispensable part of many natural language processing tasks such as information extraction, information retrieval, machine translation, and question answering systems, enabling subsequent natural language processing tasks such as relationship extraction to obtain more information based on entity recognition. Knowledge. Therefore, the research on it has important research significance and value. [0003] At present, English named entity recognition technology is relatively mature. Compared with English, Chinese named entities do not have clear boundary information an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 程良伦邓健峰张凡龙
Owner GUANGDONG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products