Comment label extraction method based on albert pre-training model and kmean algorithm
A comment tag, pre-training technology, applied in computing, computer parts, character and pattern recognition, etc., can solve problems such as slow training speed, many arithmetic resources, and text correlation logic deviation, and achieve accurate prediction accuracy. , the training speed is fast, the model is small
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0033] The technical solution provided in this embodiment is: a method for extracting comment labels based on the albert pre-training model and the kmean algorithm, and the steps of the method are as follows:
[0034] Step 1. Crawl the review data of the store and import the data into the database;
[0035] Step 2, performing data cleaning on the data in the database;
[0036] Step 3, use the albert pre-training model to obtain word vectors;
[0037] Step 4. Evaluate the average accuracy of the model.
[0038] As a preference of this embodiment, the cleaning step in step 2 includes: removing stop words, removing html format, removing spaces, manually labeling a small amount of data, importing the cleaned data into the database, and analyzing with actual examples below , the data is shown in the table below:
[0039]
[0040]
[0041] As a preference of this embodiment, the specific operation of step 3 is: based on a small amount of labeled data, take the last layer of...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com