UGC text content mining method, system, equipment and storage medium
A text and content technology, applied in the field of OTA, can solve problems such as the inability to dig out the topics that users are interested in, and achieve the effect of improving mining efficiency, improving accuracy and saving time.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0075] This embodiment provides a method for mining UGC text content. refer to figure 1 , mining methods include:
[0076] S11. Obtain UGC text content.
[0077] S12. Acquiring the subject words input by the user.
[0078] S13. Obtain an extended word set of the subject word based on the subject word, wherein the extended word set includes extended words similar to the subject word, and the extended word is output by a model trained based on UGC text content.
[0079] S14. Output the expanded word set.
[0080] S15. Use the selected expanded word in the expanded word set as the subject word selection result.
[0081] S16. Calculate the correlation degree between the selection result of the keyword and the UGC text content, sort in descending order according to the correlation degree, and output several UGC text contents whose correlation degree of the extended word is ranked first.
[0082] Wherein, the UGC text content may include review information of scenic spots, revi...
Embodiment 2
[0120] This embodiment also provides a mining system for UGC text content. refer to Figure 6 , the mining system includes: a text content acquisition module 1 , a subject term acquisition module 2 , an extended word set calculation module 3 , an output module 4 , a subject term selection module 5 and a first correlation degree calculation module 6 .
[0121] The text content acquisition module 1 is used to acquire UGC text content.
[0122] The keyword acquisition module 2 is used to acquire the keyword input by the user.
[0123] The extended word set calculation module 3 is used to obtain the extended word set of the subject word based on the subject word, wherein the extended word set includes extended words similar to the subject word, and the extended word is output by a model trained based on UGC text content.
[0124] The output module 4 is used for outputting the expanded word set.
[0125] The subject word selection module 5 is used to use the selected extended wo...
Embodiment 3
[0164] Figure 7 It is a schematic structural diagram of an electronic device provided by Embodiment 3 of the present invention. The electronic device includes a memory, a processor, and a computer program stored on the memory and operable on the processor. The processor implements the method for mining UGC text content in Embodiment 1 when executing the program. Figure 7 The electronic device 30 shown is only an example, and should not limit the functions and scope of use of the embodiments of the present invention.
[0165] Electronic device 30 may take the form of a general-purpose computing device, which may be a server device, for example. Components of the electronic device 30 may include, but are not limited to: at least one processor 31 , at least one memory 32 , and a bus 33 connecting different system components (including the memory 32 and the processor 31 ).
[0166] The bus 33 includes a data bus, an address bus, and a control bus.
[0167] The memory 32 may i...
PUM

Abstract
Description
Claims
Application Information

- Generate Ideas
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com