Text filtering system and method

A text filtering technology, applied to text filtering systems based on entity relationship extraction. It addresses two weaknesses of existing approaches, namely that the user's filtering requirements are expressed inaccurately and that the method of building the filtering model is inaccurate, and it achieves high filtering accuracy.

Active Publication Date: 2013-04-10
云中开源数据技术(上海)有限公司
3 Cites, 17 Cited by

AI Technical Summary

Problems solved by technology

Some current text filtering methods use fuzzy clustering based on a genetic algorithm: each individual in the population is clustered directly with a fuzzy similarity matrix, and a proposed fitness function then evaluates the population according to the clustering results. The filtering accuracy of this approach depends on the quality of the clustering, and it cannot express the user's filtering needs well.
Some methods use improved classification algorithms to filter undesirable text, improving the traditional KNN algorithm at the data layer. These also do not express the user's needs accurately enough.
Some methods use an ontology to express the user's filtering needs, but the way the ontology library is built is not accurate enough, which greatly reduces filtering accuracy.
Some methods use adaptive-learning text filtering. Although the user's filtering template can be learned adaptively and the filtering model adjusted, a feature-vector representation cannot accurately express the user's filtering needs.

Examples

Detailed Description of Embodiments

[0040] The implementation of the present invention is described below through specific examples and in conjunction with the accompanying drawings, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific examples, and various modifications and changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention.

[0041] Figure 1 is a structural diagram of the text filtering system of the present invention. As shown in Figure 1, the text filtering system includes at least a filtering model building module 10, an adaptive learning module 11, and a text filtering module 12.
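The three-module structure can be pictured with a minimal Python sketch. Everything here is an assumption made for illustration: the class name `TextFilteringSystem`, the callable signatures, and the keyword-based toy functions are hypothetical stand-ins, not the modules as the patent implements them.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class TextFilteringSystem:
    """Hypothetical wiring of modules 10, 11 and 12 as plain callables."""
    build_model: Callable[[str], dict]          # stands in for module 10
    learn: Callable[[dict, List[str]], dict]    # stands in for module 11
    matches: Callable[[dict, str], bool]        # stands in for module 12

    def run(self, requirement: str, samples: List[str], texts: List[str]) -> List[str]:
        # Build the model, refine it on samples, then keep texts that do not match.
        model = self.learn(self.build_model(requirement), samples)
        return [t for t in texts if not self.matches(model, t)]


# Toy stand-ins (keyword sets), used only to make the sketch runnable.
def toy_build(requirement: str) -> dict:
    return {"terms": set(requirement.lower().split())}


def toy_learn(model: dict, samples: List[str]) -> dict:
    terms = set(model["terms"])
    for sample in samples:
        terms.update(sample.lower().split())
    return {"terms": terms}


def toy_match(model: dict, text: str) -> bool:
    return bool(model["terms"] & set(text.lower().split()))
```

In this sketch, `run` returns the texts that survive filtering; a real system would replace the toy callables with the entity-relationship logic the specification describes.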

[0042] The filtering model building module 10 is used for building a filtering model according t...

Abstract

The invention discloses a text filtering system and a text filtering method. The method comprises the following steps: establishing a filtering model according to the user's filtering requirements; training on a group of filtering samples to form an ontology library that closely matches the user's filtering requirements; and extracting the feature words of a text to be filtered, identifying the entities among the feature words, extracting the entity relationships to form an entity relationship vector for the text, calculating the similarity between the filtering model and the text, and filtering out any text whose similarity exceeds a threshold. Because the features of the texts to be filtered are expressed accurately through entity relationship extraction against the user's filtering model, filtering accuracy can be increased.
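The similarity-and-threshold step can be sketched as follows. This is a minimal illustration under stated assumptions: entity relationships are represented as hypothetical `(head, relation, tail)` triples that have already been extracted, the vector is a simple count of triples, and cosine similarity is used as the measure. The abstract does not name a similarity measure, and the threshold value of 0.5 is arbitrary.

```python
import math
from collections import Counter
from typing import Iterable, Tuple

# Assumed representation: one entity relationship = (head, relation, tail).
Triple = Tuple[str, str, str]


def relation_vector(triples: Iterable[Triple]) -> Counter:
    """Count each triple; the counts act as a sparse entity relationship vector."""
    return Counter(triples)


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity of two sparse count vectors (0.0 if either is empty)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def should_filter(model: Counter, text_triples: Iterable[Triple],
                  threshold: float = 0.5) -> bool:
    """Filter the text when its similarity to the model exceeds the threshold."""
    return cosine_similarity(model, relation_vector(text_triples)) > threshold
```

A filtering model here is just another `Counter` of triples; the entity recognition and relationship extraction that would produce those triples are outside this sketch.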

Description

Technical field

[0001] The present invention relates to a text filtering system and method, and in particular to a text filtering system and method based on entity relationship extraction.

Background

[0002] Text filtering has received considerable attention for many years and has good application prospects in information retrieval and filtering. Some current text filtering methods use fuzzy clustering based on a genetic algorithm: each individual in the population is clustered directly with a fuzzy similarity matrix, and a proposed fitness function then evaluates the population according to the clustering results. The filtering accuracy of this approach depends on the quality of the clustering, and it cannot express the user's filtering needs well. Some methods use improved classification algorithms to filter undesirable text, improving the traditional KNN algorithm at the data layer, which is also not accurate e...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
Inventor: 闫俊英
Owner: 云中开源数据技术(上海)有限公司