Recognizing and verifying method and system for anagram

A verification method and technology of variable words, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as lack of extended variable word thesaurus and concept library, poor algorithm automatic update performance, and prone to misjudgment, etc. problem, to achieve the effect of improving automatic update performance, improving accuracy, and reducing the probability of misjudgment

Active Publication Date: 2017-11-17
DATAGRAND TECH INC
View PDF6 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The disadvantage of the existing inflected word recognition technology is that the system and method are based on a fixed inflected thesaurus and concept library, and the number and quality of inflected words and training samples have relatively large limitations, which is prone to misjudgment ; The automatic update performance of the algorithm is poor, and it does not have the ability to expand the variable word thesaurus and concept library

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recognizing and verifying method and system for anagram
  • Recognizing and verifying method and system for anagram
  • Recognizing and verifying method and system for anagram

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] In order to enable those skilled in the art to better understand the technical solutions of the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0071] like figure 1 , 11 As shown, a method for identifying and verifying an inflected word provided by an embodiment of the present invention includes the following steps:

[0072] S101. Obtain a sensitive word set and training samples;

[0073] Sensitive words refer to the collection of words in the text that violate laws, regulations, or moral standards; the collection of sensitive words is stored in the sensitive thesaurus, and the number of sensitive words in the sensitive thesaurus will continue to accumulate as it is updated. The training sample refers to a collection of multiple texts containing deformed words; all the deformed words are stored in the deformed word library; the deformed words in the training sample are determined, so t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a recognizing and verifying method and system for an anagram. The method has the following beneficial effects: an anagram library can be expanded through the pronunciation and character pattern expansion, and thus the quantity of the anagrams in the anagram library can be increased, the quality can be improved, and the misjudgement probability can be decreased; a training sample is used for training context probability, thus the misjudgement probability in semantic verification of the anagram can be further decreased, and moreover, the accuracy can be improved; the training sample is updated based on the verification result, so that the automatic updating performance of algorithm can be improved, and as a result, a concept base for the semantic verification can be expanded; as the accumulation of the verification results, the misjudgement probability continuously decreases. The recognizing and verifying system comprises an acquiring unit, an anagrammatizing training unit, a recognizing unit and a semantic verification unit. The system has the same beneficial effects as the method.

Description

technical field [0001] The invention relates to the field of machine recognition of deformed words, in particular to a method and system for identifying and verifying deformed words. Background technique [0002] When we browse post bars, forums, news media and other similar platforms, we often see deformed sensitive words. The way of thinking of the human brain allows us to discover these inflected words very naturally, because these inflected words are "abnormal" parts in the sentence, and this "abnormal" feeling will focus our attention on this area, and then Gradually discover complete inflected words. However, when the machine directly faces these deformed words (including intermixed special symbols, homonym conversion, near-form conversion, simplified-to-traditional conversion, radical splitting, etc.), the recognition of variant words is an important problem in solving Chinese spam content filtering. . [0003] At present, in the Chinese patent application with the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/3344G06F40/284G06F40/30
Inventor 张健江永青纪传俊陈运文高翔
Owner DATAGRAND TECH INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products