A method and system for identifying rubbish information

一种垃圾信息、预定义的技术,应用在信息过滤领域,能够解决垃圾信息不合理等问题,达到提高效果、提高准确性的效果

Inactive Publication Date: 2008-04-23
ALIBABA CLOUD COMPUTING LTD
View PDF0 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to provide a method and system for determining garbage information, so as to solve the problem of unreasonable predefined garbage information, and improve the effect of information filtering by reasonably determining the content of garbage information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and system for identifying rubbish information
  • A method and system for identifying rubbish information
  • A method and system for identifying rubbish information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0029] The core idea of ​​the present invention is: the user selects representative information as a spam sample, and defines the keywords of the spam, calculates the sample in the system to obtain a keyword score, and then the system uses the keyword score to perform The filter index value is obtained by simulation, and by comparing with the evaluation index, the keyword or keyword score can be continuously adjusted and optimized, and finally a reasonable spam keyword and keyword score can be obtained.

[0030] The spam information is information with similar characteristics, such as malicious mass advertisements, engaging in some illegal activities or selling illegal products, and some characteristics customized by users accordin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is in use for solving the issue of unreasonable predefined garbage info. The invention includes steps: predefining key words of garbage info in sampled data; calculating point value corresponding to each key word; using the said point value of key word simulates the data of filtered sample so as to obtain filtered index value (FIV); determining whether the filtered index value is accorded with evaluating index; if not, adjusting key word or point value of key word, then simulating and calculating FIV; if yes, then ending adjustment. The invention can help user to determine key words and point values of key words of garbage info reasonably, and determine whether the received info is garbage info based on key words and point values of key words so as to raise effect for filtering info. The invention is applicable to different applications and systems widely such as feedback system, system for leaving word, forum, and task for treating garbage mail.

Description

technical field [0001] The invention relates to the field of information filtering, in particular to a method and system for determining garbage information. Background technique [0002] Nowadays, more and more users send and receive a large amount of information through the network, making full use of the Internet for information exchange and resource sharing. However, such information often contains a large amount of spam information, which is of no value to users, and even some malicious batch-published information with illegal purposes. The most common is spam email, in which users may receive advertisements, promotions for illegal activities, or even virus emails. These spam emails occupy a large amount of network resources, causing huge pressure on servers and network traffic, and some illegal information has greatly caused network security risks. [0003] In view of the above situation, the current website usually has a spam filtering function, and uses various ant...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L12/58H04L12/28
CPCH04L51/12H04L12/585H04L51/212
Inventor 叶静俊王聪智王皓马小龙
Owner ALIBABA CLOUD COMPUTING LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products