Junk text filtering method and device, electronic device and storage medium

A text filtering and text technology, applied in neural learning methods, text database clustering/classification, unstructured text data retrieval, etc. The effect of improving accuracy

Pending Publication Date: 2019-10-01
BEIJING QIYI CENTURY SCI & TECH CO LTD
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method only relies on the matching of simple words to filter, and cannot understand the semantic information of the text, and cannot deeply grasp the connotation of the text
In addition, spam publishers will convert the text by adding punctuation marks, replacing traditional characters, or using pinyin. Such a word se

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Junk text filtering method and device, electronic device and storage medium
  • Junk text filtering method and device, electronic device and storage medium
  • Junk text filtering method and device, electronic device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0057] The embodiment of the present application discloses a spam text filtering method, device, electronic equipment, storage medium and computer program product including instructions, which will be described respectively below.

[0058] The embodiment of the present application provides a spam text filtering method, see figure 1 , figure 1 It is a schematic diagram of the spam text filtering method of the embodiment of the present application, including the following steps:

[00...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a junk text filtering method and device, an electronic device and a storage medium. Characteristic information of a word segmentation result is extracted by using a pre-trained deep learning network model, different weights are given to different components in the junk text through the attention mechanism model, feature information is combined through the attention weights, local key information of the text is captured, the to-be-filtered text is classified, the junk text is filtered, and the junk text filtering accuracy is improved.

Description

technical field [0001] The present application relates to the technical field of computer communication networks, in particular to a spam text filtering method, device, electronic equipment and storage medium. Background technique [0002] With the rapid development of Internet technology, the text system provides users with a place to exchange information, and users have a more free space to express their opinions. With the increasing application of text systems, various types of junk texts such as advertisements, pornography, politics, and terrorism have emerged. Spam text disrupts the normal online order, affects the normal experience of users, and brings major hidden dangers to the healthy development of the website. For the healthy and sustainable development of the website and to improve the user experience in the network environment, it is necessary to find and filter junk text. [0003] The existing filtering method is based on the keyword matching method to filter...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F16/35G06F16/335G06F17/21G06N3/04G06N3/08
CPCG06F16/35G06F16/335G06N3/049G06N3/08G06F40/103G06F40/279G06N3/045Y02D10/00
Inventor 张毓
Owner BEIJING QIYI CENTURY SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products