News content sensitive word filtering method and system

A sensitive word filtering method and technology, applied in the field of data processing, that can solve problems such as the inability to provide accurate filtering.

Active Publication Date: 2016-10-26
TSINGHUA UNIV
6 Cites, 30 Cited by

AI Technical Summary

Problems solved by technology

Therefore, keyword filtering cannot provide po




Embodiment Construction

[0047] Embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0048] In the embodiments of the present invention, sensitive words are words that need to be prohibited or controlled in news content. Such words are often used to spread harmful information, which can have extremely adverse effects on society. However, a news article that contains them may also be positive news that combats this harmful information, so the emotional tendency of the news must be analyzed further to determine whether the information should be prohibited or placed under stricter control.

[0049] In the method for filtering sensitive words in news content of the present invention, a sensitive word database is established first, and the news content is then filtered according to the sensitive words in that database.
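As a minimal sketch of the "build the database first, then filter against it" order described in [0049], the Python below stores words in a character trie and scans a text for every stored word. The class name `SensitiveWordLibrary`, its methods, and the sample words are hypothetical; the source does not say how the database is stored or searched, so the trie is only one plausible choice.

```python
class SensitiveWordLibrary:
    """Hypothetical character-trie store for sensitive words."""

    _END = object()  # marker key: a complete word ends at this node

    def __init__(self, words=()):
        self._root = {}
        for word in words:
            self.add(word)

    def add(self, word):
        """Insert one word into the trie, one character per level."""
        node = self._root
        for ch in word:
            node = node.setdefault(ch, {})
        node[self._END] = word

    def find_all(self, text):
        """Return (start_index, word) for every library word found in text."""
        matches = []
        for start in range(len(text)):
            node = self._root
            for ch in text[start:]:
                node = node.get(ch)
                if node is None:
                    break  # no stored word continues with this character
                if self._END in node:
                    matches.append((start, node[self._END]))
        return matches


# Step 1: establish the library. Step 2: filter a text against it.
library = SensitiveWordLibrary(["gambling", "fraud"])
hits = library.find_all("illegal gambling and fraud")
```

Matching character by character means the sketch does not depend on word segmentation, which may matter for Chinese news text; that design choice is an assumption, not something the source states.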

[0050] In this embodiment, the constructed sensitive thesaurus can be maintaine...


Abstract

The invention provides a method and system for filtering sensitive words in news content. The method comprises: S1, preprocessing the obtained news texts; S2, filtering the sensitive words in the news texts with a priority-based multi-level sensitive word filtering algorithm, according to a pre-established sensitive word library; S3, when preset sensitive words are present in the news texts, judging the filtered sensitive words with an emotion analysis model based on a Markov logic network; and S4, marking the news texts as negative news when the filtered sensitive words are judged to be bad sensitive words, and otherwise marking them as positive news. By establishing the emotion analysis model based on the Markov logic network, the method and system perform a secondary judgment on the filtered sensitive words to determine whether they carry negative information. Negative news is therefore filtered, positive news that fights against negative information is not, and the reliability of sensitive word filtering of news content is improved.
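The four steps S1 to S4 can be sketched end to end in Python. Everything concrete here is an assumption made for illustration: the library contents and priority numbers, the cue words, the two weighted rules, and their weights are hypothetical and do not come from the patent. Step S3 is a toy Markov logic network with a single query atom, `Negative`, whose probability is computed by enumerating its two possible worlds; the patent's actual model and its multi-level algorithm are not described in this excerpt.

```python
import math
import re

# Hypothetical sensitive word library: word -> priority (1 = highest).
SENSITIVE_LIBRARY = {"fraud": 1, "gambling": 2, "rumor": 3}

# Hypothetical cue words suggesting the article opposes the bad content.
COMBAT_CUES = {"crackdown", "arrested", "banned", "combat"}

# Hypothetical weights for two soft rules (not values from the patent):
#   R1: ContainsSensitive(d) => Negative(d)
#   R2: ContainsCombatCue(d) => not Negative(d)
W_SENSITIVE = 1.5
W_COMBAT = 2.5


def preprocess(text):
    """S1: lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def multi_level_filter(tokens, library):
    """S2: scan priority levels from highest to lowest and collect hits."""
    hits = []
    for level in sorted(set(library.values())):
        level_words = {w for w, p in library.items() if p == level}
        hits.extend((level, t) for t in tokens if t in level_words)
    return hits


def implies(a, b):
    """Truth value of the logical implication a => b."""
    return (not a) or b


def prob_negative(has_sensitive, has_cue):
    """S3: P(Negative | evidence), enumerating both worlds of the query atom."""
    def score(negative):
        total = 0.0
        if implies(has_sensitive, negative):
            total += W_SENSITIVE  # R1 is satisfied in this world
        if implies(has_cue, not negative):
            total += W_COMBAT     # R2 is satisfied in this world
        return math.exp(total)
    return score(True) / (score(True) + score(False))


def classify(text):
    """S4: mark the article as negative or positive news."""
    tokens = preprocess(text)
    hits = multi_level_filter(tokens, SENSITIVE_LIBRARY)
    if not hits:
        return "positive", hits
    has_cue = any(t in COMBAT_CUES for t in tokens)
    label = "negative" if prob_negative(True, has_cue) > 0.5 else "positive"
    return label, hits
```

Under these assumed weights, an article that contains a sensitive word but no cue word is marked negative, while one that also contains a cue word such as "crackdown" is marked positive, which mirrors the secondary judgment the abstract describes. The sketch tokenizes English words only; Chinese news text would need a word segmentation step in S1.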

Description

Technical field

[0001] The invention relates to the technical field of data processing, in particular to a method and system for filtering sensitive words in news content based on Markov logic network sentiment analysis.

Background technique

[0002] Security control of news content through sensitive word filtering is interdisciplinary, drawing on linguistics, computer science, cognitive science, mathematics and other fields. News content security control operates on the content of a single news article and provides coarse-grained content filtering at the word level. Building on semantic data processing technology and natural language processing technology, its purpose is to respond quickly to news and public opinion: to collect public opinion information in real time, process and analyze it rapidly, capture hot spots, grasp the direction of public opinion, predict the level of crisis, and then assist the managers and decision makers of the management and control platform g...


Application Information

IPC(8): G06F17/27
CPC: G06F40/30
Inventor: 张新钰, 刘聪, 吴新刚
Owner: TSINGHUA UNIV