Method and system for training sensitive word detection model
A technology for detecting models and training methods, which is applied in the field of training of sensitive word detection models, and can solve problems such as dependence
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0080] Figure 6It is a sensitive word detection model training method based on a single training corpus. Such as Figure 6 The sensitive word detection model shown includes a bidirectional long short-term memory network BLSTM model and a conditional random field CRF model, and the BLSTM model includes a first BLSTM model and a second BLSTM model. In addition, the model training method also introduces a CNN model containing a convolutional neural network.
[0081] The training method is as Figure 7 As shown, based on the training corpus such as Figure 6 X shown 正&火 , execute steps A-1 and A-2 iteratively until the end of the iterative procedure:
[0082] Step A-1 (S101): keep the current parameters of the CNN model from updating, train the first BLSTM model, the second BLSTM model and the CRF model: input the sample data of the training corpus into the first BLSTM model and the second BLSTM model, and apply the first BLSTM model to the second BLSTM model. The output of...
Embodiment 2
[0119] The present embodiment is the training method of the sensitive word detection model of multi-training corpus, as Figure 9The sensitive word detection model shown includes a bidirectional long-short memory network BLSTM model and a conditional random field CRF model. The BLSTM model includes the first BLSTM model and the second BLSTM model. The model training method also includes a convolutional neural network CNN model and N training corpora , n is the label of the training corpus, n=1,2,...,N.
[0120] Figure 9 for Figure 8 method, 4 examples from the training corpus, with Figure 6 the difference is, Figure 9 The second BLSTM model and CRF model in are in one-to-one correspondence with the training corpus n, identifying the second BLSTM n Models and CRFs n The superscript n of the model indicates the corresponding relationship with the training corpus n.
[0121] Such as Figure 8 As shown, the training method of the present embodiment includes:
[0122] S...
Embodiment 3
[0143] The present invention also includes a sensitive word detection model, including the first BLSTM model, the second BLSTM model and the CRF model obtained after training in Example 1 and Embodiment 2 of the present invention.
[0144] Input the test text into the first BLSTM model and the second BLSTM model, input the output of the first BLSTM model and the second BLSTM model into the CRF model, and the CRF model outputs the sensitive word recognition result of the test text.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com