Emotion dictionary construction method in field of automobile product based on word2vec
A technology of emotional dictionary and construction method, applied in the direction of semantic tool creation, unstructured text data retrieval, special data processing application, etc., can solve the problems of emotional new word recognition, low efficiency of emotional dictionary, poor field applicability, etc., and achieve convenience The effect of emotional orientation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0051] The embodiment of the present invention provides a kind of sentiment dictionary construction method based on word2vec automobile product field, see figure 1 , the method includes the following steps:
[0052] 101: Use MRQ (Python's distributed task queue based on Redis, Mongo and gevent) distributed data collection mechanism to capture user comment modules of multiple automobile vertical websites, and store them in a tree structure of "website-model-word-of-mouth" into the postgresql database;
[0053] 102: Extract the word-of-mouth data in the database. The word-of-mouth data includes "most satisfactory point", "least satisfied point", "space", "power", "handling", "fuel consumption", "comfort", " Appearance", "Interior" and "Cost-effective" ten parts, select the "most satisfactory point" and "least satisfied point" in the word-of-mouth data, and preprocess the selected data: remove abnormal comments, and use punctuation marks as cutting points Cutting and turning lo...
Embodiment 2
[0060] The scheme in embodiment 1 is further introduced below in conjunction with specific calculation formulas and examples, see the following description for details:
[0061] 201: Obtain user word-of-mouth data of multiple automobile vertical websites through data capture and store them in the database;
[0062] Wherein, the step 201 is specifically:
[0063] 1) Through the python language based on the MRQ distributed data collection mechanism, write programs for the automobile vertical websites to be captured, capture the source code of the required information webpage, and analyze the source code of the webpage based on regular expressions to obtain the specific vehicle model information and the vehicle type Comment data below.
[0064] The data captured above includes website links, web page source code, user information, car model information, posting time, etc., among which are mainly user word-of-mouth data. User word-of-mouth data is further classified into: "Most ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com