Bad text detection method and device based on Bi-LSTM
A bad text detection technology, applied in the fields of neural learning methods, special data processing applications, instruments, etc., that addresses the problems of significant limitations, ambiguous keyword matching, and insufficient coverage, and achieves a high recall rate.
Examples
Embodiment 1
[0048] This embodiment provides a method for detecting bad text based on Bi-LSTM, which can be executed by a computer with an information processing function, a network server, or the like. Bad text refers to text content that contains harmful information related to pornography, gambling, or drugs. As an application scenario of the present invention, in this embodiment a web server detects webpage text that travels through the network in data stream form, according to the method provided by the present invention. It can be understood that, for detection, the webpage text in data stream form can be restored to webpage text in natural language form. The Bi-LSTM-based bad text detection method provided in this embodiment is described below.
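The restoration step mentioned above can be illustrated with a minimal sketch. It assumes that the data stream arrives as chunks of UTF-8 encoded HTML bytes; the patent text shown here does not specify the stream format, and the names `restore_text` and `_VisibleText` are hypothetical, introduced only for illustration.

```python
from html.parser import HTMLParser


class _VisibleText(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def restore_text(stream_chunks, encoding="utf-8"):
    """Join raw byte chunks, decode them, and return the visible page text.

    Assumption: the stream carries HTML; other payload types would need
    their own decoding step.
    """
    raw = b"".join(stream_chunks)
    parser = _VisibleText()
    parser.feed(raw.decode(encoding, errors="replace"))
    parser.close()
    return " ".join(parser.parts)


if __name__ == "__main__":
    chunks = [b"<html><body><h1>Hel", b"lo</h1><p>world</p></body></html>"]
    print(restore_text(chunks))
```

Joining the chunks before decoding avoids splitting a multi-byte character or a tag across chunk boundaries.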
[0049] Referring to Figure 1, this embodiment discloses a bad text detection method based on Bi-LSTM. As shown in Figure 1, the method mainly includes:
[0050] S0, acquiring text data, and performing type marking on the acquir...
Embodiment 2
[0077] Based on the same inventive concept, this embodiment discloses a bad text detection device based on Bi-LSTM, including a training module and a detection module, wherein the training module includes a training data acquisition unit, a preprocessing unit, and a model training unit.
[0078] The training data acquisition unit is used to acquire text data and to label the acquired text data by type;
[0079] The preprocessing unit is used to preprocess the text data to form a training set;
[0080] The model training unit is used to train the parameters of the Bi-LSTM bidirectional recurrent neural network model on the training set; when the iterative change of the loss value produced by the Bi-LSTM bidirectional recurrent neural network model is no longer lower than the set threshold, training is terminated to obtain a trained Bi-LSTM bidirectional recurrent neural network model;
[0081] The det...
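The stopping rule used by the model training unit in [0080] can be sketched as a generic loop. The translated wording of the condition is ambiguous, so this sketch assumes one common reading: training stops once the change in loss between consecutive iterations falls below a set threshold. The names `train_until_converged` and `make_toy_step`, the default threshold, and the iteration cap are all hypothetical, and a toy quadratic loss stands in for the Bi-LSTM model so the example stays self-contained.

```python
def train_until_converged(step, threshold=1e-4, max_iters=10_000):
    """Call step() repeatedly and stop when the loss change is below threshold.

    step: hypothetical callable that runs one training iteration and
          returns the loss for that iteration.
    Returns (iterations_run, last_loss).
    """
    previous = step()
    for iteration in range(2, max_iters + 1):
        current = step()
        # Assumed stopping rule: loss change between iterations is small.
        if abs(previous - current) < threshold:
            return iteration, current
        previous = current
    return max_iters, previous


def make_toy_step(learning_rate=0.1):
    """Stand-in for one training iteration: gradient descent on (w - 3)^2."""
    state = {"w": 0.0}

    def step():
        loss = (state["w"] - 3.0) ** 2
        state["w"] -= learning_rate * 2.0 * (state["w"] - 3.0)
        return loss

    return step


if __name__ == "__main__":
    iterations, loss = train_until_converged(make_toy_step())
    print(f"stopped after {iterations} iterations, loss={loss:.6f}")
```

In a real device, `step` would run one pass of Bi-LSTM training on the training set and return the resulting loss value; the iteration cap guards against a loss that never settles.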
Embodiment 3
[0093] Based on the same inventive concept, this embodiment discloses a bad text detection system based on Bi-LSTM, including a memory and a processor. A computer program is stored in the memory, and the processor can run the computer program to implement the method described in Embodiment 1.
[0094] Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by a computer program instructing related hardware, and the program can be stored in a computer-readable storage medium. When executed, the program may include the processes of the embodiments of the above methods. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM), etc.
[0095] Those skilled in the art should understand that the embodiments of the present invention may be provided as methods, systems, or co...