Unlock instant, AI-driven research and patent intelligence for your innovation.

A Chinese named entity recognition method and system

A technology for named entity recognition and named entities, which is applied in the fields of instruments, computing, and electrical digital data processing, etc., can solve problems such as slow recognition speed, low recall rate, company name recognition, and difficult application, and achieve fast recognition speed and improved The effect of recall

Active Publication Date: 2021-12-07
GUANGZHOU WANLONG SECURITIES CONSULTING CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, due to the lack of regularity in the naming of Chinese company names, they are used more casually and often appear in the form of abbreviations, such as "Bank of China Co., Ltd." often appears in the form of abbreviations, such as "Bank of China" or "Bank of China", This brings difficulties to the identification and application of company names
In general, there are the following difficulties in identifying Chinese named entities such as Chinese company abbreviations: 1. In different fields and scenarios, the extension of the abbreviation is different
2. Certain types of entity names change frequently, and there are no strict rules to follow
4. The number is huge, cannot be enumerated, and it is difficult to include them all in the dictionary
In general, in the processing of Chinese target text, the effect of Chinese word segmentation greatly affects the recognition effect of Chinese named entities, which in turn affects the analysis and processing effect of the target text, resulting in low recall rate and slow recognition speed.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Chinese named entity recognition method and system
  • A Chinese named entity recognition method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] refer to figure 1 , the invention provides a Chinese named entity recognition method, comprising the following steps:

[0045] S1. Perform entity recognition based on rule matching on the target text to obtain a first named entity set;

[0046] S2. Using a statistical algorithm to perform entity recognition on the target text to obtain a second named entity set;

[0047] S3. Obtain a recognition result after cleaning the first named entity set and the second named entity set.

[0048] Among them, the target text refers to the text that requires Chinese named entity recognition.

[0049] This method performs entity recognition on the target text based on rule matching and statistical algorithms respectively, and after cleaning the recognition results of the two, obtains the final Chinese entity recognition result, which can greatly improve the accuracy of Chinese entity recognition while ensuring the accuracy of Chinese entity recognition. The recall rate of Chinese e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese named entity recognition method and system. The method comprises the following steps: S1, performing entity recognition based on rule matching on the target text to obtain a first named entity set; S2, using a statistical algorithm to perform entity recognition on the target text Identifying, obtaining a second named entity set; S3, after cleaning the first named entity set and the second named entity set, obtaining a recognition result. The present invention performs entity recognition on the target text based on rule matching and statistical algorithms respectively, and after cleaning the recognition results of the two, obtains the final Chinese entity recognition result, which can greatly improve the accuracy of Chinese entity recognition while ensuring the accuracy of Chinese entity recognition. The recall rate of the Chinese entity recognition is high, and the automatic recognition of the Chinese entity is carried out by the method, and the recognition speed is fast, which can be widely used in the field of text information processing.

Description

technical field [0001] The invention relates to the field of computer application and information processing, in particular to a Chinese named entity recognition method and system. Background technique [0002] Named entity is the basic information element in the target text, and it is the basis for correct understanding of the target text. Chinese entity naming recognition is an important basic tool in information extraction, syntactic analysis, machine learning and other application fields, and plays an important role in the process of natural language processing technology becoming practical. Chinese named entity recognition is to determine whether a string represents a named entity. In information extraction research, Chinese named entity recognition is currently the most practical technology. The commonly used method is purely based on hidden Markov, maximum entropy model recognition method. [0003] At present, due to the lack of regularity in the naming of Chinese ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/284
CPCG06F40/284
Inventor 吴远辉
Owner GUANGZHOU WANLONG SECURITIES CONSULTING CO LTD