Method and device for filtering junk files based on key words

A keyword-based filtering method, applied in the computer field, that addresses the low keyword-matching efficiency of existing approaches and the resulting low efficiency in filtering junk documents. It reduces the number of keywords, improves matching efficiency, and improves the recognition rate.

Inactive. Publication Date: 2013-04-24
BEIJING 263 ENTERPRISE COMM
4 Cites, 10 Cited by

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a method and device for filtering junk documents based on keywords. They address the prior-art problem that low keyword-matching efficiency leads to low efficiency in filtering junk documents.

Examples

Embodiment Construction

[0019] To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described below clearly and completely with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art from these embodiments without creative work fall within the protection scope of the present invention.

[0020] Figure 1 is a flowchart of an embodiment of the keyword-based method for filtering spam documents of the present invention. As shown in Figure 1, the method includes:

[0021] 101. The junk document filtering device obtains Chinese documents to be checked.

[0022] Among them, the Chinese documents to be checked can be documents cont...

Abstract

The invention provides a method and a device for filtering junk files based on keywords. The method comprises: acquiring a Chinese file to be checked; converting the Chinese characters of the file into pinyin (phonetic spelling) characters to obtain a pinyin file; matching the pinyin file against the pinyin corresponding to the Chinese keywords; and determining that the Chinese file is a junk file if characters in the pinyin file match the pinyin of a Chinese keyword. Because one pinyin keyword can correspond to several Chinese keywords, the number of keywords is reduced and keyword-matching efficiency improves. A pinyin keyword also covers homophones of the listed Chinese keywords that were not themselves listed, which improves the recognition of junk files.
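The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patent's implementation. It assumes that "spelling" in the translated text means toneless pinyin, and it uses a small hand-written character table (`PINYIN`, hypothetical) in place of a full dictionary. The function names `to_pinyin` and `is_junk` and the example keyword are also illustrative only.

```python
# Hypothetical toy table mapping Chinese characters to toneless pinyin.
# A real system would need a complete dictionary; this is an assumption
# made only to keep the sketch self-contained.
PINYIN = {
    "发": "fa", "法": "fa",
    "票": "piao", "漂": "piao",
    "代": "dai", "开": "kai",
}


def to_pinyin(text):
    """Convert each known Chinese character to pinyin, separated by spaces.

    Characters missing from the table are passed through unchanged.
    """
    return " ".join(PINYIN.get(ch, ch) for ch in text)


def is_junk(document, chinese_keywords):
    """Return True if the pinyin form of the document contains the pinyin
    form of any keyword."""
    # Pad with spaces so a keyword only matches whole syllables
    # (e.g. "xi" must not match inside "xian").
    pinyin_doc = f" {to_pinyin(document)} "
    # Homophonous Chinese keywords collapse into one pinyin keyword.
    pinyin_keywords = {to_pinyin(k) for k in chinese_keywords}
    return any(f" {kw} " in pinyin_doc for kw in pinyin_keywords)


# Only "发票" is listed, yet the homophone spelling "法漂" is also caught.
print(is_junk("代开法漂", ["发票"]))
print(is_junk("你好", ["发票"]))
```

Matching on space-delimited syllables rather than a concatenated string is a design choice of this sketch; it avoids false matches across syllable boundaries.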

Description

Technical field

[0001] The present invention relates to computer technology, in particular to a method and device for filtering spam documents based on keywords.

Background technique

[0002] The most common method used in the current anti-spam technology is keyword filtering, that is, a keyword search is performed on the document to be inspected to determine whether the document to be inspected is a spam document. Generally, each keyword corresponds to a filter rule. The process of keyword search for the document to be checked is the process of matching the filtering rules of the document to be checked.

[0003] In the prior art, keyword filtering rules need to be set for each keyword to ensure the anti-spam effect. However, this method leads to low keyword matching efficiency, which in turn leads to low efficiency in filtering spam documents, and in this method, keywords may be listed incompletely, resulting in low efficiency in filtering spam documents.

Summary of the inventi...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30, G06F17/28
Inventor: 黄福昌, 田飞, 李雪明
Owner: BEIJING 263 ENTERPRISE COMM