Method and equipment for determining sensitivity of target text

A technique for determining the sensitivity of target text, applied in the field of information processing. It addresses the problems of poor recognition of target-text sensitivity, failure to recognize some sensitive words, and the inability to expand the sensitive vocabulary automatically, with the effect of improving accuracy, broadening the scope of application, and reducing the cost of manual review.

Inactive Publication Date: 2011-09-14
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

[0003] The above-mentioned method of identifying the sensitivity of target text requires sensitive words to be added manually on an ongoing basis and cannot expand the sensitive word list automatically. In addition, some words often appear together with sensitive words that have a high sensitivity assignment but do not themselves have an obviously pornographic, violent, or reactionary meaning. The above method cannot identify such words, so the sensitivity of the target text is identified poorly.


Examples


Embodiment Construction

[0021] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0022] Figure 1 is a schematic diagram of a device according to one aspect of the present invention, showing a device for determining the sensitivity of target text. The sensitivity determination device 1 includes a text acquisition means 11, a sensitive word acquisition means 12, and a sensitivity determining means 13. Specifically, the text acquisition means 11 obtains the target text whose sensitivity is to be determined; the sensitive word acquisition means 12 then performs a matching query in the preset sensitive thesaurus according to the target text, so as to obtain the explicit sensitive words and hidden sensitive words in the target text; the sensitivity determining means 13 then determines the sensitivity of the target text by weighting, according to the sensitivity assignment of the explicit sensitive words and the sensitive assign...
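The three-stage flow described in paragraph [0022] can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the thesaurus entries, the whitespace tokenization, the weights `w_explicit` and `w_hidden`, and the capped weighted-sum formula are all assumptions, since the excerpt does not specify how the weighting is performed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SensitiveWord:
    word: str
    score: float   # sensitivity assignment, assumed to lie in [0, 1]
    hidden: bool   # True for a hidden sensitive word, False for an explicit one


# Hypothetical preset sensitive thesaurus (entries invented for illustration).
THESAURUS = {
    "violence": SensitiveWord("violence", 0.6, hidden=False),
    "midnight": SensitiveWord("midnight", 0.4, hidden=True),
}


def acquire_text(raw):
    """Stage 1 (text acquisition): normalize and tokenize the target text."""
    return raw.lower().split()


def match_sensitive_words(tokens):
    """Stage 2 (sensitive word acquisition): look up each token in the thesaurus."""
    hits = [THESAURUS[t] for t in tokens if t in THESAURUS]
    explicit = [h for h in hits if not h.hidden]
    hidden = [h for h in hits if h.hidden]
    return explicit, hidden


def determine_sensitivity(explicit, hidden, w_explicit=1.0, w_hidden=0.5):
    """Stage 3 (sensitivity determination): assumed weighted sum, capped at 1.0."""
    total = (w_explicit * sum(e.score for e in explicit)
             + w_hidden * sum(h.score for h in hidden))
    return min(1.0, total)


if __name__ == "__main__":
    explicit, hidden = match_sensitive_words(acquire_text("Graphic violence at midnight"))
    print(determine_sensitivity(explicit, hidden))
```

Keeping explicit and hidden words in separate lists lets the two groups carry different weights, which is the point of the weighted determination. A real system would also need tokenization suited to the language of the target text.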



Abstract

The invention provides a method and equipment for determining the sensitivity of a target text. The method comprises the following steps: sensitivity determination equipment acquires the target text whose sensitivity is to be determined; performs a matching query in a preset sensitive word base according to the target text, so as to obtain the explicit sensitive words and hidden sensitive words in the target text; and determines the sensitivity of the target text by weighting, according to the sensitivity assignment of the explicit sensitive words and the sensitivity assignment of the hidden sensitive words. Compared with the prior art, identifying both the explicit and the hidden sensitive words in the target text improves the accuracy with which a machine determines the sensitivity of the target text. It also reduces the cost of any manual re-checking that may be required afterwards, so that the target text is checked more efficiently and the range of application is greatly expanded.

Description

Technical field

[0001] The present invention relates to the technical field of information processing, in particular to a technique for determining the sensitivity of target text.

Background

[0002] In the prior art, the sensitivity of target text is mostly identified manually, or a sensitive vocabulary is established manually and a machine performs a simple matching query on the target text based on that vocabulary to determine the sensitivity of the target text.

[0003] The above-mentioned method of identifying the sensitivity of target text requires sensitive words to be added manually on an ongoing basis and cannot expand the sensitive word list automatically. In addition, some words often appear together with sensitive words that have a high sensitivity assignment but do not themselves have an obviously pornographic, violent, or reactionary meaning. The above method cannot identify such words, resulting in poor sensitivity to id...
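For contrast, the prior-art approach described in the background (a simple matching query against a manually built vocabulary) might look like the sketch below. The vocabulary contents and the function name `is_sensitive_simple` are hypothetical; the excerpt does not describe any concrete implementation.

```python
# Hypothetical manually maintained sensitive vocabulary (prior-art baseline).
MANUAL_VOCABULARY = {"violence", "pornography"}


def is_sensitive_simple(text):
    """Flag the text if any token appears in the manual vocabulary."""
    tokens = set(text.lower().split())
    return bool(tokens & MANUAL_VOCABULARY)


if __name__ == "__main__":
    print(is_sensitive_simple("graphic violence reported"))
    print(is_sensitive_simple("meet at midnight"))
```

A text that contains only words that tend to co-occur with sensitive terms, but none of the listed terms themselves, passes this check unflagged. That is the gap the background section attributes to the simple matching approach.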


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
Inventor: 李彦宏舒迅袁聃帅帅李岩
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD