Method and apparatus for filtering rubbish contents

A content filtering and content technology, applied in the Internet field, can solve the problems of filtering coverage limitations, increasing labor investment, machine maintenance capital, etc., and achieve the effects of wide coverage, labor saving, and reducing workload

Inactive Publication Date: 2009-08-19
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, in the prior art, this method of relying on dirty word filtering to filter spam content is passive and leads to limitations in filtering coverage;
[0006] (2) There is also a certain degree of passivity in the later inspection process of the p

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for filtering rubbish contents
  • Method and apparatus for filtering rubbish contents
  • Method and apparatus for filtering rubbish contents

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0131] Example 1:

[0132] Embodiment 1 of the present invention takes ordinary posts as an example to illustrate the method of the present invention, combined with Figure 4 shown.

[0133] Step S501: A netizen publishes an ordinary post on Soba, and the content of the post first goes through three stages of content filtering:

[0134] (1) Duplication judgment stage; judge whether the content of the post is repeated with the content of the previously published post (this can be achieved by comparing the content of the posts of the same IP), if it is repeated, it will be automatically blocked; if it is not repeated, no processing will be done;

[0135] (2) The first-level swear word matching stage; the first-level swear word matching is performed on the content of the post that has been judged repetitively. If it matches, the content of the post with swear words will be automatically blocked; if it is not matched, it will enter the semantic analysis. process;

[0136] (3) S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a content spam filtering method and adopts a technical proposal as follows: judgment is carried out to posted content according to preset semantic analysis conditions; the content meeting the preset semantic analysis conditions in the posted content is taken as content spam and shielded; and the posted content after shielding treatment is published on a network after reviewing. The invention also discloses a content spam filtering device. The technical proposal can effectively realize the shielding of the content spam of communities, save the input capital of labor and material resources and improve working efficiency.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a method and device for filtering junk content. Background technique [0002] At present, traditional filtering methods are generally adopted in Internet technologies for filtering community spam content. combine figure 1 As shown, before the content posted by the user is published on the Internet, it must first pass through the first-level dirty word filtering, and the words in the post that match the first-level dirty word will be blocked as spam words; The second-level dirty word filtering in the manual review stage will be performed on the content after the manual review, and the words that match the second-level dirty word in the post will be blocked again as spam words; the content after the second-level dirty word filtering will be successfully published to On the Internet; for the spam content that is not filtered out in the first-level or second-level dirty word f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04L29/06H04L29/08H04L12/24
Inventor 李京晶于章涛张萌萌祝锐赵琳霖
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products