Unlock instant, AI-driven research and patent intelligence for your innovation.

Method, device, server group and storage medium for identifying noise words in text

A recognition method and technology of noise words, applied in character and pattern recognition, instruments, calculations, etc., can solve problems such as difficult recognition of noise words

Active Publication Date: 2021-06-15
LENOVO (BEIJING) LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of this, the present invention provides a method, device, server group and storage medium for identifying noise words in a text, in order to solve the problem that noise words are difficult to identify in the prior art, and its technical solution is as follows:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device, server group and storage medium for identifying noise words in text
  • Method, device, server group and storage medium for identifying noise words in text
  • Method, device, server group and storage medium for identifying noise words in text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0052] The embodiment of the present invention provides a method for identifying noise words in a text, please refer to figure 1 , showing a schematic flow chart of the identification method, which may include:

[0053] Step S101: Obtain text to be recognized.

[0054] Step S102: Convert each character in the text to be recognized into a word vector in turn, and obtain a set of word vectors corresponding to the text to be recognized.

[0055] Step S103: Inpu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The application provides a recognition method, device, server group and storage medium for noise words in a text. The method includes: obtaining the text to be recognized, converting each character in the text to be recognized into a word vector in turn, and obtaining the text to be recognized The corresponding word vector set, input the word vector set corresponding to the text to be recognized into the pre-established noise word recognition model, and obtain the recognition result of the noise word in the text to be recognized output by the noise word recognition model, wherein the noise word recognition model is marked with The word vector set corresponding to the training text containing noise words is obtained by training as the training samples. The method for identifying noise words in the text provided by this application can identify the text to be recognized through the pre-established noise word recognition model. Since the noise word recognition model is trained based on the training text marked with noise words, therefore, through the noise word recognition model Noise words can be identified from text to be recognized.

Description

technical field [0001] The invention relates to the technical field of artificial intelligence, in particular to a method, device, server group and storage medium for recognizing noise words in text. Background technique [0002] Natural language processing is one of the most important subfields in the field of artificial intelligence, and it is the technical core of the current popular translation systems, human-computer dialogue systems, and question-answering systems. The irregularity of text generated in the real world is one of the most important factors affecting the performance of natural language processing, and the irregularity caused by noise words is particularly significant. [0003] Among them, noise words refer to words that are not in the range of stop words but meaningless in the current context. Noise words are different from relatively fixed stop words. They are not fixed. The noise words in some texts may not be noise words in other texts. For example, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/284G06K9/62
CPCG06F40/284G06F18/2411G06F18/214
Inventor 金宝宝杨帆张成松
Owner LENOVO (BEIJING) LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More