Method, device, system, and medium for extracting live comment (bullet chat) topics based on an N-gram model
The extraction method and system apply to bullet chat topic extraction based on the N-gram model. They address the problems of high labor and material cost, low efficiency, and inaccurate bullet chat topic extraction, with the effect of representing bullet chat content accurately.
Embodiment Construction
[0030] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0031] As shown in Figure 1, the embodiment of the present invention provides a method for extracting bullet chat topics based on the N-gram model, comprising the following steps:
[0032] S1. Data preparation: extract the bullet chat data;
[0033] S2. Building bullet chat features: extract the feature words that represent a specific intention and add them to a custom lexicon; add words that carry no practical meaning to a custom stop-word lexicon;
[0034] S3. Data preprocessing: remove the records whose "bullet chat content" field is empty; remove the punctuation marks in the "bullet chat content" field;
[0035] S4. Use the N-gram model to represent the bullet chat content as word vectors: the preprocessed bullet chat content is represented with the N-gram model, and the N-gram model indicates that the ...
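Steps S2 to S4 can be illustrated with a minimal Python sketch. It is not the patented implementation: the description of S4 is cut off in this text, so the sketch assumes a plain count vector over word-level N-grams. The record field name `content`, the sample records, the stop-word set, and all function names are hypothetical, and whitespace tokenization stands in for a real word segmenter that would load the custom lexicon from S2.

```python
import string
from collections import Counter

# Hypothetical sample records; "content" stands in for the
# "bullet chat content" field named in step S3.
records = [
    {"content": "Nice play, nice play!"},
    {"content": ""},
    {"content": "what a nice play"},
    {"content": None},
]

# S2 (assumed form): words with no practical meaning go in a stop-word set.
CUSTOM_STOP_WORDS = {"a", "what"}

# ASCII punctuation plus common full-width marks.
PUNCT = string.punctuation + "，。！？、；：“”‘’（）《》…"


def preprocess(recs):
    """S3: drop records with empty content and strip punctuation.

    Assumption: a record that becomes empty after punctuation
    removal is dropped as well.
    """
    table = str.maketrans("", "", PUNCT)
    cleaned = []
    for rec in recs:
        text = (rec.get("content") or "").translate(table).strip()
        if text:
            cleaned.append(text)
    return cleaned


def tokenize(text, stop_words):
    """Whitespace tokenization (an assumption), minus stop words."""
    return [t for t in text.lower().split() if t not in stop_words]


def ngrams(tokens, n):
    """Return all contiguous n-token windows as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_vectors(docs, n, stop_words):
    """S4 (assumed form): one N-gram count vector per bullet chat."""
    grams_per_doc = [ngrams(tokenize(d, stop_words), n) for d in docs]
    vocab = sorted({g for grams in grams_per_doc for g in grams})
    index = {g: i for i, g in enumerate(vocab)}
    vectors = []
    for grams in grams_per_doc:
        vec = [0] * len(vocab)
        for gram, count in Counter(grams).items():
            vec[index[gram]] = count
        vectors.append(vec)
    return vocab, vectors


docs = preprocess(records)
vocab, vectors = ngram_vectors(docs, 2, CUSTOM_STOP_WORDS)
print(vocab)    # [('nice', 'play'), ('play', 'nice')]
print(vectors)  # [[2, 1], [1, 0]]
```

With `n = 2`, the two surviving comments share the bigram `('nice', 'play')`, which is the kind of overlap a downstream topic-extraction step could count. For Chinese bullet chats, `tokenize` would need to be replaced by a segmenter that accepts a user dictionary, since whitespace splitting does not apply.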


