Short text similarity computing method based on searched result quantity
A similarity calculation and short text technology, which is applied in computing, electrical digital data processing, special data processing applications, etc., can solve problems such as irregular terms and insufficient features
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0021] Embodiments of the present invention are now described with reference to the accompanying drawings, as figure 1 , the present embodiment takes two short texts S1 and S2 as an example to illustrate the short text similarity calculation method based on the number of retrieval results, including the following steps:
[0022] Step S1, preprocessing short texts with a length less than or equal to 200 characters, the specific steps are
[0023] Step S1-1, using a common stop words list (stop words list) to filter the short text, the common stop words are modal particles, adverbs, prepositions and conjunctions;
[0024] Step S1-2, filtering the endings of word segmentation transformation forms of each word forming the short text, extracting word stems, and calculating the word frequency of the word stems.
[0025] In step S2, a single short text and a pairwise combination of short texts are respectively submitted as search query words to a large-scale corpus, and the corpus u...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com