Method and apparatus for filtering rubbish contents

A content-filtering technology applied in the Internet field. It addresses the limited filtering coverage and the increased labor and machine-maintenance costs of existing approaches, and achieves wide coverage, labor savings, and a reduced workload.

Status: Inactive
Publication Date: 2009-08-19
TENCENT TECH (SHENZHEN) CO LTD
0 Cites, 35 Cited by

AI Technical Summary

Problems solved by technology

Therefore, in the prior art, relying on dirty-word filtering to filter spam content is passive and limits filtering coverage;
[0006] (2) The later inspection of post content already published on the Internet is also passive to a degree. The management server must actively browse and inspect the published posts and delete the spam words it finds one by one, which increases labor input and machine-maintenance costs.



Examples


Embodiment 1

[0132] Embodiment 1 of the present invention uses an ordinary post as an example to illustrate the method, with reference to Figure 4.

[0133] Step S501: A netizen publishes an ordinary post on Soba. The post content must first pass through three stages of content filtering:

[0134] (1) Repetition check: judge whether the post content duplicates the content of a previously published post (this can be done by comparing post content from the same IP). If it is a duplicate, it is automatically blocked; if it is not, no action is taken;

[0135] (2) First-level swear-word matching: post content that has passed the repetition check is matched against the first-level swear words. If a match is found, the post content containing the swear word is automatically blocked; if not, the content proceeds to the semantic analysis process;

[0136] (3) Semantic analysis process: by analyzing the semantics of...
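The three stages above can be sketched roughly as follows. This is a minimal illustration rather than the patented implementation: the class name `PostFilter`, the whitespace and case normalization, the substring matching, and the return labels are all assumptions. The semantic analysis stage is left as a caller-supplied predicate because the text is truncated before it describes the semantic conditions.

```python
from typing import Callable, Dict, Optional, Set


class PostFilter:
    """Hypothetical three-stage filter: repetition, swear words, semantics."""

    def __init__(
        self,
        swear_words: Set[str],
        semantic_check: Optional[Callable[[str], bool]] = None,
    ) -> None:
        # Assumed first-level swear-word list; the source does not provide one.
        self.swear_words = {w.lower() for w in swear_words}
        # Placeholder for stage 3; by default nothing is flagged.
        self.semantic_check = semantic_check or (lambda text: False)
        # Previously submitted post bodies, grouped by poster IP.
        self._seen_by_ip: Dict[str, Set[str]] = {}

    def check(self, ip: str, content: str) -> str:
        # Assumption: compare posts after lowercasing and collapsing whitespace.
        normalized = " ".join(content.lower().split())

        # Stage 1: repetition check against earlier posts from the same IP.
        seen = self._seen_by_ip.setdefault(ip, set())
        if normalized in seen:
            return "blocked:repeat"
        seen.add(normalized)

        # Stage 2: first-level swear-word matching (simple substring match).
        if any(word in normalized for word in self.swear_words):
            return "blocked:swear_word"

        # Stage 3: semantic analysis, delegated to the supplied predicate.
        if self.semantic_check(normalized):
            return "blocked:semantic"

        return "pass"
```

For example, `PostFilter({"badword"}).check("1.2.3.4", "Hello world")` returns `"pass"` on the first call and `"blocked:repeat"` if the same IP submits the same text again.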


Abstract

The invention discloses a method for filtering spam content. The technical solution is as follows: posted content is judged against preset semantic analysis conditions; the parts of the posted content that meet those conditions are treated as spam and blocked; and the posted content, after blocking, is published on the network following review. The invention also discloses a device for filtering spam content. The technical solution effectively blocks community spam content, saves labor and material costs, and improves working efficiency.

Description

Technical field

[0001] The invention relates to the technical field of the Internet, in particular to a method and device for filtering junk content.

Background art

[0002] At present, Internet technologies generally use traditional filtering methods to filter community spam content. As shown in Figure 1, before content posted by a user is published on the Internet, it must first pass through first-level dirty-word filtering, and words in the post that match a first-level dirty word are blocked as spam words. The content then goes through manual review, after which second-level dirty-word filtering is performed, and words in the post that match a second-level dirty word are again blocked as spam words. Content that has passed second-level dirty-word filtering is then published on the Internet; for the spam content that is not filtered out in the first-level or second-level dirty word f...
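The prior-art flow described in the background (first-level dirty-word filtering, manual review, then second-level dirty-word filtering) might look roughly like the sketch below. The function names `mask_words` and `legacy_publish`, the asterisk masking, and the case-insensitive matching are illustrative assumptions; the source only says that matching words are blocked.

```python
import re
from typing import Callable, Iterable


def mask_words(text: str, dirty_words: Iterable[str], mask_char: str = "*") -> str:
    """Replace every occurrence of a listed word with mask characters."""
    words = [w for w in dirty_words if w]
    if not words:
        return text
    # Longest entries first, so a longer word wins over one of its prefixes.
    alternatives = sorted(words, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(w) for w in alternatives), re.IGNORECASE)
    return pattern.sub(lambda m: mask_char * len(m.group(0)), text)


def legacy_publish(
    text: str,
    level1_words: Iterable[str],
    level2_words: Iterable[str],
    manual_review: Callable[[str], str],
) -> str:
    """Hypothetical prior-art pipeline: level 1, manual review, level 2."""
    after_level1 = mask_words(text, level1_words)
    reviewed = manual_review(after_level1)  # human step, modeled as a callback
    return mask_words(reviewed, level2_words)
```

Because both levels only match words already on a list, spam phrased with unlisted words passes through untouched, which is the coverage limitation the background points to.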


Application Information

IPC(8): H04L29/06, H04L29/08, H04L12/24
Inventor: 李京晶, 于章涛, 张萌萌, 祝锐, 赵琳霖
Owner: TENCENT TECH (SHENZHEN) CO LTD