Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A text processing method, device and equipment

A text processing and text technology, applied in the computer field, can solve problems such as poor flexibility, misjudgment of names and entities, poor transplantation performance, etc., and achieve the effect of improving accuracy

Active Publication Date: 2021-04-06
TENCENT TECH (SHENZHEN) CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the prior art, the methods for carrying out name recognition mainly include the following: 1. The name recognition method based on sequence labeling, which regards the name recognition problem as a simple sequence labeling problem; this simple labeling method , making the accuracy of name recognition lower
②Name recognition method based on recurrent neural network, this recognition method is based on the premise of complex network training, the efficiency is low, and it depends on manually specifying context rules, which has poor flexibility, low recall rate and transplantation Poor performance and other issues
③The name recognition method based on the statistical model. This recognition method is based on the ngram (that is, an algorithm based on the statistical language model) model for name recognition, which cannot provide sufficient reference features, and may easily cause misjudgment of the name entity. less accurate
To sum up, it can be seen that the existing methods for name recognition have the problem of low accuracy. Therefore, how to accurately identify the name entity from the text to be recognized has become an important topic of research.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A text processing method, device and equipment
  • A text processing method, device and equipment
  • A text processing method, device and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0061] The target text can include multiple characters, and these characters are not limited to Chinese characters, English characters, punctuation marks, etc.; for example, the target text is "As the CEO of the group, Xiaohua makes suggestions", and the target text includes multiple Chinese characters, such as : as, for, group, group, of...; English characters, such as: E, O, T; and punctuation marks ",". The computer device can detect whether there is a trigger event for the person name recognition, and if there is a trigger event, the target text to be recognized is obtained. Among them, when the computer device detects a new target text event, it means that there is a trigger event for name recognition detected; To the event triggered by the user to send the target text, or the event that the computer device generates a new target text. In this embodiment, the computer device may use the detected new target text as the target text to be recognized.

[0062] S202. Analyzi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the application discloses a text processing method, device and equipment. The method includes: obtaining the target text to be recognized; analyzing the grammatical relationship of the target text to obtain the target grammatical relationship sequence corresponding to the target text; matching the target grammatical relationship sequence with each reference sequence pattern in the reference sequence pattern set, selecting A matching sequence pattern matching the target grammatical relationship sequence; obtaining the target grammatical relationship identifier after the mth reference grammatical relationship identifier in the target grammatical relationship sequence, and determining the character corresponding to the target grammatical relationship identifier in the target text as a personal name character . The embodiment of the present application can better recognize the name entity from the text, and improve the accuracy of name recognition.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a text processing method, a text processing device and a text processing device. Background technique [0002] Many application scenarios (for example, portrait construction scenarios, map construction scenarios, event mining scenarios, public opinion analysis scenarios, etc.) involve name recognition. The so-called name recognition can refer to the process of identifying the name entity in the text (or field) to be recognized. [0003] In the prior art, the methods for carrying out name recognition mainly include the following: 1. The name recognition method based on sequence labeling, which regards the name recognition problem as a simple sequence labeling problem; this simple labeling method , making the accuracy of name recognition lower. ②Name recognition method based on recurrent neural network, this recognition method is based on the premise of complex networ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/253G06F40/295
CPCG06F40/253G06F40/295
Inventor 刘志煌胡林红罗朝亮朱虹王苑瑜李小宁王利
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products