Text processing method, system, and medium
A text processing technology applied in the field of data processing. It addresses problems such as the erroneous exclusion of repeated content and the inability to distinguish long sentences from short sentences, and it improves robustness, reduces the risk of explosion, and improves clarity.
Detailed Description of the Embodiments
[0031] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.
[0032] At present, vehicle manufacturers mostly deduplicate automobile user comments in one of two ways: by exact matching of content, or by converting the text into a vector space model (VSM) and then performing similarity analysis on the resulting high-dimensional vectors. Because the comments are short and their semantic structure is complex, the deduplication results are unstable. Locality-sensitive hashing, which ignores sentence order, finds locally similar words, and then determines duplication by Hamming distance, is effective for duplicate checking of long texts, but it very easily removes content by mistake in short and ultra-short texts such as car reviews.
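The locality-sensitive hashing approach mentioned above can be illustrated with a minimal sketch. It assumes a SimHash-style fingerprint with equal token weights; the source does not name the specific variant. The function names, the MD5-based token hash, the 64-bit width, and the distance threshold of 3 are all illustrative assumptions, not part of the described invention.

```python
import hashlib


def simhash(tokens, bits=64):
    """Compute a SimHash-style fingerprint from a list of tokens.

    Assumption: every token has weight 1. Token order does not matter,
    which mirrors the "ignores sentence order" property noted in the text.
    """
    counts = [0] * bits
    for tok in tokens:
        # Stable 64-bit hash of the token (first 8 bytes of its MD5 digest).
        h = int.from_bytes(hashlib.md5(tok.encode("utf-8")).digest()[:8], "big")
        for i in range(bits):
            # Vote +1 if bit i of the token hash is set, otherwise -1.
            counts[i] += 1 if (h >> i) & 1 else -1
    fingerprint = 0
    for i in range(bits):
        if counts[i] > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming(a, b):
    """Number of bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


def is_duplicate(tokens_a, tokens_b, max_distance=3):
    """Treat two token lists as duplicates if their fingerprints are close.

    max_distance=3 is a hypothetical threshold chosen for illustration.
    """
    return hamming(simhash(tokens_a), simhash(tokens_b)) <= max_distance
```

In a sketch like this, a long text contributes many votes per bit, so changing one token rarely flips more than a few fingerprint bits. A review of only a handful of tokens has few votes per bit, so a single changed token can flip many bits at once, and any fixed threshold becomes unreliable. This is one plausible reading of the instability on short and ultra-short texts described above.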
[0033] In the present invention, in order to digitize the text, Chinese word segmentation and stop words (high-frequency words that do not affect semantics) are used to con...