Deep learning-based garbage text filtering method

A deep learning and garbage technology, applied in the field of big data processing, can solve problems such as inability to intercept or prompt, and low ability to discriminate graphic data, and achieve the effects of easy split and combination, improved recognition ability, and fast switching

Active Publication Date: 2018-11-13
HUAZHONG UNIV OF SCI & TECH
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of the above defects or improvement needs of the prior art, the present invention provides a spam text filtering method based on deep learning, thereby solving the pr

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning-based garbage text filtering method
  • Deep learning-based garbage text filtering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0032] The present invention provides a garbage text filtering method based on deep learning. On the premise of retaining the expression content of character data and graphic data, the text information in the graphic data is recognized and converted into character data, combined with the original character data, through Compared with the prohibited words, prohibited symbols and the text informat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep learning-based garbage text filtering method. The method comprises the steps of filter character data at first, removing unnecessary symbols, spaces and modal particles,classifying according to different data types in the garbage text, marking and distinguishing character data and graph data without changing sequence and position of the two types of data, convertinggraph data into character data through a deep learning algorithm, wherein data conversion is integral to the deep learning method, comparing with forbidden words in a cloud server through the deep learning algorithm by combining original character data so as to obtain a garbage text, wherein text comparison represents important promotion of the deep learning method and can realize effective, deepinterception and prompt. The existing text filtering method cannot screen out the garbage text consisting both character data and graph data. The method solves the above problem, and uses the deep learning algorithm in garbage text processing, thereby improving screening accuracy.

Description

technical field [0001] The invention belongs to the technical field of big data processing, and more specifically, relates to a method for filtering garbage text based on deep learning. Background technique [0002] Text data is the most common type of semi-structured data in computer science. Many information in the real world needs to be expressed through text, and communication between users can also be realized through the exchange of text information. In this way, it is possible to generate junk text information that is useless to users. [0003] With the increasing enrichment of text data generation and processing methods in computer science and technology, coupled with the rapid development of data transmission speed, text information is not only generated by encoding types such as ASCⅡ, GBK and BIG5, but may also be enriched by means of graphic data. text information. Furthermore, junk text information may be hidden in graphic data, and character data and graphic d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/34G06K9/62
CPCG06V30/153G06F18/214
Inventor 冯丹尹祎施展苏毅
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products