Text content filtering method and system

A text content filtering technology, applied in the fields of text information processing and computing, that addresses the heavy workload of manual analysis, the coarse granularity of classification methods, and their inability to distinguish content accurately, and that reduces processing time.

Status: Inactive
Publication Date: 2008-04-09
Owner: INST OF SOFTWARE - CHINESE ACAD OF SCI
Cites: 0; Cited by: 26

AI Technical Summary

Problems solved by technology

Although this method solves the problem of user configuration, its purely topic-based classification is coarse-grained, and it often cannot accurately distinguish content that contains...


Examples

Embodiment Construction

[0030] The present invention is described in more detail below through a preferred embodiment, with reference to the accompanying drawings. The input to the invention is the text content to be filtered. This input can come from any network information-bearing device, such as a gateway, routing module, service module, or personal computer. Once the data streams on these devices have been processed by the corresponding preprocessing devices, the extracted text can serve as the input to the invention.

[0031] To make the invention easier to understand, a system implementing the text content filtering method is introduced first. As shown in Figure 2, the system includes:

[0032] A configuration information parsing module, which extracts the effective filtering rules from the keywords and/or topics to be detected, as configured by the user; the configuration information par...
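As a rough illustration of what such a parsing module might do, the sketch below normalizes a user configuration into a set of effective rules. The dictionary layout, the `parse_config` name, and the normalization steps (trimming, lowercasing, dropping blanks and duplicates) are assumptions made for this example; the source does not describe the configuration format.

```python
def parse_config(config):
    """Extract effective filtering rules from a user configuration.

    `config` is a hypothetical dict with optional "keywords" and "topics"
    lists; this layout is an assumption, not taken from the patent text.
    """
    def clean(items):
        # Trim, lowercase, and drop blank or duplicate entries.
        return {s.strip().lower() for s in items if s and s.strip()}

    keywords = clean(config.get("keywords", []))
    topics = clean(config.get("topics", []))
    if not keywords and not topics:
        # The source says rules come from keywords and/or topics,
        # so a configuration with neither yields no effective rule.
        raise ValueError("configuration must name at least one keyword or topic")
    return {"keywords": keywords, "topics": topics}
```

For instance, a configuration listing " Lottery ", an empty string, and "lottery" as keywords would reduce to the single effective keyword "lottery".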


Abstract

A text content filtering method is provided, comprising: 1) parsing the user configuration information and extracting the effective filtering rules; 2) analyzing and scanning the text to be filtered according to the effective filtering rules; 3) performing exact keyword matching on the results of step 2) and outputting the detection results; 4) performing fuzzy keyword matching on the results of step 2) and outputting the detection results; 5) performing text topic detection on the results of steps 2) and 4), determining the topic of the text being filtered, and outputting the detection results. The invention supports fine-grained filtering on exact keywords, limited filtering on fuzzy keywords, and coarse-grained filtering on topics. Because the text-scanning front end is separated from the three filtering methods and then recombined with them, only one full-text scan of the text to be filtered is needed, which greatly reduces the processing time required for filtering.
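The five steps can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `Rules` structure, the regex tokenizer, the use of edit distance at most 1 as the "fuzzy" criterion, and the term-count topic score are all stand-ins chosen for the example. What it does reflect from the abstract is the data flow: the text is scanned once, and the exact, fuzzy, and topic detectors all work from that single scan, with topic detection also consuming the fuzzy results.

```python
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Rules:
    # Hypothetical output of step 1 (configuration parsing).
    exact_keywords: set
    fuzzy_keywords: set
    topics: dict              # topic name -> set of indicator terms
    topic_threshold: int = 2  # assumed minimum score to report a topic


def scan(text):
    """Step 2: the only full-text pass; returns lowercase word tokens."""
    return re.findall(r"\w+", text.lower())


def edit_distance(a, b):
    """Levenshtein distance, used here as an assumed fuzzy criterion."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def filter_text(text, rules):
    tokens = scan(text)

    # Step 3: exact keyword matching on the scan results.
    exact = sorted({t for t in tokens if t in rules.exact_keywords})

    # Step 4: fuzzy keyword matching on the same scan results.
    fuzzy = {}
    for t in set(tokens):
        for kw in rules.fuzzy_keywords:
            if t != kw and edit_distance(t, kw) <= 1:
                fuzzy[t] = kw

    # Step 5: topic detection over the scan results plus the fuzzy matches,
    # so misspelled indicator terms still count toward a topic.
    counts = Counter(fuzzy.get(t, t) for t in tokens)
    topics = sorted(
        name for name, terms in rules.topics.items()
        if sum(counts[w] for w in terms) >= rules.topic_threshold
    )
    return {"exact": exact, "fuzzy": fuzzy, "topics": topics}
```

Note that `filter_text` calls `scan` exactly once; none of the three detectors rereads the raw text, which is the property the abstract credits for the reduced processing time.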

Description

Technical field

[0001] The invention belongs to the field of computer technology and relates to a method for filtering text information in the field of text information processing, in particular to a text content filtering method based on text topic analysis and keyword detection.

Background

[0002] With the rapid development and wide application of computer and Internet technology, the amount of information on the Internet has increased dramatically, and people are increasingly accustomed to obtaining information through it. However, information on the Internet is highly varied and of mixed quality. It is therefore necessary to monitor, analyze, and filter information on the Internet, to discover and stop the dissemination of harmful information in time, and to keep the Internet environment clean.

[0003] At present, technologies for filtering text information on the Internet fall mainly into three categories: one is t...


Application Information

IPC(8): G06F17/30
Inventor: 应凌云, 苏璞睿, 冯登国
Owner: INST OF SOFTWARE - CHINESE ACAD OF SCI