Training data set generation method and device
A technology for training data sets and text data, which is applied in the field of data processing, can solve problems such as lack of training data information, data waste, and affect model training effects, so as to meet the requirements of training data, avoid waste, and improve effectiveness.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0068]In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.
[0069] Embodiments of the present invention provide a method and device for generating a training data set. When grabbing training data from a webpage, not only the text data in the text of the webpage is extracted, but also the pictures are obtained when the text of the webpage contains pictures, and the The picture is identified to obtain picture text data, and a training data set is generated according to the text data in the text and the picture text data.
[0070] Such as figure 1 Shown is a flowchart of a method for generating a training data set in an embodiment of the present invention, including the following steps:
[0071] Step 101, grabbing the text of the webpage.
[0072] Since the training data i...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com