UGC text content mining method, system, equipment and storage medium
A text and content technology, applied in the field of OTA, can solve problems such as the inability to dig out the topics that users are interested in, and achieve the effect of improving mining efficiency, improving accuracy and saving time.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0074] Example 1
[0075] This embodiment provides a method for mining UGC text content. refer to figure 1 , mining methods include:
[0076] S11. Obtain the UGC text content.
[0077] S12. Obtain the subject heading input by the user.
[0078] S13. Obtain an extension word set of the subject word based on the subject word, wherein the extension word set includes extension words similar to the subject word, and the extension word is output by a model trained based on the UGC text content.
[0079] S14, outputting the extended word set.
[0080] S15. Use the selected expanded word in the expanded word set as the subject word selection result.
[0081] S16: Calculate the correlation between the subject word selection result and the UGC text content, sort in descending order of the correlation, and output several UGC text contents that are ranked first in the correlation of the expanded words.
[0082] Among them, the UGC text content may include review information of sceni...
Example Embodiment
[0119] Example 2
[0120] This embodiment also provides a mining system for UGC text content. refer to Image 6 , the mining system includes: a text content acquisition module 1 , a subject word acquisition module 2 , an expanded word set calculation module 3 , an output module 4 , a subject word selection module 5 and a first correlation degree calculation module 6 .
[0121] The text content acquisition module 1 is used to acquire UGC text content.
[0122] The subject heading obtaining module 2 is used to obtain the subject heading input by the user.
[0123] The expanded word set calculation module 3 is used to obtain an expanded word set of the subject word based on the subject word, wherein the expanded word set includes the expanded word similar to the subject word, and the expanded word is output by a model trained based on the UGC text content.
[0124] The output module 4 is used for outputting the extended word set.
[0125] The subject word selection module 5 i...
Example Embodiment
[0163] Example 3
[0164] Figure 7 This is a schematic structural diagram of an electronic device according to Embodiment 3 of the present invention. The electronic device includes a memory, a processor, and a computer program stored on the memory and executable on the processor. When the processor executes the program, the method for mining UGC text content in Embodiment 1 is implemented. Figure 7 The electronic device 30 shown is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present invention.
[0165] The electronic device 30 may take the form of a general-purpose computing device, which may be, for example, a server device. Components of the electronic device 30 may include, but are not limited to, the above-mentioned at least one processor 31 , the above-mentioned at least one memory 32 , and a bus 33 connecting different system components (including the memory 32 and the processor 31 ).
[0166] The ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap