NLP-based short text data processing method
A data processing and short text technology, which is applied in the fields of electrical digital data processing, natural language data processing, instruments, etc., can solve the problems of inaccuracy and low efficiency of manual processing of short text data, so as to reduce manpower and solve the low efficiency of manual processing. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0038] figure 1 Shown is a flow chart of a method for NLP-based short text data processing according to an embodiment of the present invention, the method comprising steps:
[0039] S101 obtains short text data:
[0040] We synchronize the short text data in the business database to the local TXT file through the DataX tool.
[0041] S102 jieba participle:
[0042] The short text data obtained in step S101 is cut into the sentence most precisely by using the precise mode in units of lines.
[0043] S103 to stop words (stopwords):
[0044] By loading our accumulated Chinese stop words, use NLTK to delete the stop words contained in the jieba segmented words in step S102.
[0045] S104 obtain word bag:
[0046] By using the gensim library, a unique integer id is assigned to all words that appear in the corpus, for example: {'restaurant': 0, 'fried': 1, 'pull': 2, 'open': 3, 'member' : 4, 'restaurant': 5, 'Ramen': 6, 'hotel': 7, 'restaurant': 8, 'hot pot': 9, 'catering': 10...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com