Text recognition method, device and equipment

A text recognition and text technology, applied in the Internet field, can solve problems such as recognition, affecting abnormal information recognition results, difficult abnormal information and normal text, etc., to achieve the effect of guaranteeing recognition results and improving flexibility

Pending Publication Date: 2021-07-16
KOUBEI SHANGHAI INFORMATION TECH CO LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of this, the present application provides a text recognition method, device and equipment, the main purpose of which is to solve the difficulty of combining abnormal information with normal text in the compilation process of the machine translation model in the prior art, which affects the recognition of abnormal information The problem with the result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text recognition method, device and equipment
  • Text recognition method, device and equipment
  • Text recognition method, device and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0096] Hereinafter, the present application will be described in detail with reference to the drawings and embodiments. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other.

[0097] In related technologies, the products on the Internet platform must be subject to the supervision of relevant departments, so that no sensitive texts can appear online. Usually, based on the establishment of a large number of corpora, the machine translation model can be used to comprehensively study and train the texts to obtain The word vector of the text realizes mutual translation of the text, and matches the sensitive characters of the word vector of the text to identify whether there is abnormal information in the text. However, because the text generated on the Internet platform is usually continuous and readable, the context relevance of the training corpus in the machine translat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text recognition method, device and equipment, relates to the technical field of internet, and aims at abnormal information deformed in a to-be-recognized text, abnormal information recognition can be performed after the abnormal information is translated into an original text in combination with a machine model, and the flexibility of abnormal information recognition is improved while the accuracy of a recognition result is ensured. The method comprises the steps of obtaining a plurality of character elements formed by character-level segmentation of a to-be-recognized text; encoding each character element to form a phonetic code vector of the character element through encoding processing; inputting the phonetic code vectors of the character elements into a pre-constructed recognition model to obtain an original text mapped by the to-be-recognized text, the recognition model having a function of performing semantic translation on deformation information in the phonetic code vectors; and judging whether an original text mapped by the to-be-recognized text contains abnormal information or not by utilizing a pre-constructed sensitive word bank.

Description

technical field [0001] The present application relates to the technical field of the Internet, in particular to a text recognition method, device and equipment. Background technique [0002] With the rapid development of the Internet, the problem of information overload has become increasingly prominent. There are more and more words appearing on the Internet. Once these words contain abnormal information such as harmful, sensitive, and illegal, how to effectively and reasonably identify such abnormal information from normal texts is of great significance for network supervision and purification. [0003] In related technologies, products on the Internet platform must be supervised by relevant departments, so that no abnormal texts appear online. Usually, on the basis of building a large number of corpora, the machine translation model can be used to comprehensively study and train texts to obtain The word vector of the text realizes mutual translation of the text, and matc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06F40/30G06F40/129G06K9/34
CPCG06F40/58G06F40/30G06F40/129G06V30/153G06V30/287
Inventor 贾伟汪安辉
Owner KOUBEI SHANGHAI INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products