Text multi-label analysis method, device, electronic equipment and storage medium
An analysis method and multi-label technology, applied in the field of text multi-label analysis methods, devices, electronic equipment and storage media, can solve the problem that there is no solution, the subject term correspondence matrix is difficult to meet the precise requirements of Chinese semantics, and the word segmentation results have the essence Impact and other issues
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0046] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.
[0047] The text multi-label analysis method of the present embodiment comprises the following steps:
[0048] S1. Acquire training text data, where the training text data includes multiple texts, and perform word segmentation on the training text data.
[0049] The word segmentation algorithm is an algorithm that divides a sentence into a series of word combinations. For example, "I pass by Peking University" can be segmented into "I / passing / Peking University". You can use the pkuseg word segmentation model provided by Peking University for word segmentation. Since this model has subdivided pre-training models in different fields, and also supports the use of brand-new labeled data for training, you can obtain a self-training model and get more accurate word segmentation results.
[0050] S2, use N-gram (a language m...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


