Real-time text data flow specific information identification method and system
A technology for specific information and identification methods, which is applied in text database clustering/classification, unstructured text data retrieval, neural learning methods, etc. Data analysis, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0076] An information recognition framework for massive real-time text data streams and the key technical points involved in the system mainly include domain language model pre-training, deep network recognition modules, and cascaded model processing frameworks. The main technical key points and technical effects are explained as follows.
[0077] Key point 1, training domain language model. For tasks related to natural language processing, it is usually necessary to represent the text as a computable numerical vector first, and the language model is a way to represent the text as a vector. First of all, it is necessary to accumulate a large amount of domain corpus data and a certain amount of category labeling data, and preprocess the text data such as removing special symbols, and then use the domain corpus data to perform an unsupervised language model pre-training process. On the basis, using category labeling data, a supervised language model fine-tuning process is perfo...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com