Sensitive word detection method and device, computer equipment and storage medium

A detection method and technology for sensitive words, applied in the field of sensitive word filtering, can solve the problem of low recognition accuracy of sensitive words

Pending Publication Date: 2020-10-27
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The embodiment of the present invention provides a sensitive word detection method, device, computer equipment and storag

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sensitive word detection method and device, computer equipment and storage medium
  • Sensitive word detection method and device, computer equipment and storage medium
  • Sensitive word detection method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0041] It should be understood that when used in this specification and the appended claims, the terms "comprising" and "comprises" indicate the presence of described features, integers, steps, operations, elements and / or components, but do not exclude one or Presence or addition of multiple other features, integers, steps, operations, elements, components and / or collections thereof.

[0042] It should also be understood that the terminology used ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a sensitive word detection method and device, computer equipment and a storage medium. The method belongs to the field of artificial intelligence, and the data processed by the method can be stored in a block chain. The method comprises the steps of obtaining a sensitive word bank; constructing a harmonic word bank; constructing a sensitive word indexer and a harmonic word indexer; if the to-be-tested text is received, filtering the to-be-tested text through a sensitive word indexer to obtain a first sensitive word set; removing non-Chinese charactersin the to-be-tested text to obtain a redundancy-removed text, and filtering the redundancy-removed text through a sensitive word indexer to obtain a second sensitive word set; filtering the to-be-tested text through a harmonic word indexer to obtain a third sensitive word set; filtering the redundancy-removed text through the harmonic word indexer to obtain the fourth sensitive word set, so that sensitive words in the to-be-detected text can be recognized, deformation words of the sensitive words can also be recognized, and the recognition accuracy is greatly improved.

Description

technical field [0001] The present invention relates to the technical field of sensitive word filtering, in particular to a sensitive word detection method, device, computer equipment and storage medium. Background technique [0002] Sensitive word filtering refers to the accurate and efficient identification of political, pornographic, insulting, prohibited, spam and other illegal content in various scenarios based on advanced artificial intelligence technology, so as to prevent content risks in advance and improve user experience. Currently, commonly used sensitive word filtering algorithms include finite automata matching algorithms based on sensitive thesaurus, classification and sequence tagging algorithms based on machine learning models. [0003] The disadvantage of the above existing sensitive word filtering method is: only the sensitive word itself can be identified, and the variant words of the sensitive word, such as homophonic words and redundant insertion words,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/31G06F16/335G06F16/903G06F40/216
CPCG06F16/322G06F16/335G06F16/90344G06F40/216
Inventor 程华东李剑锋汪伟
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products