Corpus text processing method and device and electronic equipment
A processing method and text technology, applied in special data processing applications, unstructured text data retrieval, text database clustering/classification, etc., can solve problems such as limited ability to express corpus text, low accuracy of labeling information of intent category, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0036] The embodiment of the present invention provides a kind of processing method of corpus text, such as figure 2 As shown, the method includes the following steps:
[0037] Step S202, input the corpus text set to be processed into the language model to obtain the feature vector of the corpus text in the corpus text set; wherein, the feature vector is used to represent the semantic information of the corpus text; the language model is a model obtained by training the original training samples .
[0038] Among them, the language model is the BERT (Bidirectional Encoder Representations from Transformers, bidirectional encoding representation based on the converter) language model. Before clustering the corpus text set, it is necessary to vectorize the corpus text in the corpus text set. Specifically, input the corpus text set to be processed into the BERT language model so that the BERT language model The text set is processed by vector mapping to obtain the feature vector...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com