Text similarity matching method based on subject terms
A text similarity and matching method technology, applied in the field of text similarity matching for fast retrieval of similar articles, can solve the problems of unsatisfactory accuracy and insufficient retrieval efficiency, improve the efficiency and accuracy of duplicate checking, and reduce manpower The effect of wasting resources
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0022] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the embodiments and accompanying drawings.
[0023] like figure 1 As shown, the flow of the text similarity matching method based on keywords includes the following steps:
[0024] In step 10, the text is fragmented, the texts in various formats are unified into the database, and the data is cleaned to form texts in a unified format;
[0025] Step 20 performs word segmentation and removal of stop words to the text, and stores the document id and word segmentation results in the database;
[0026] Step 30 uses the inverted index algorithm to perform statistical calculations on all word-segmented texts in the database to form a word-document list matrix, and store the results in the database;
[0027] Step 40 extracts the keywords of each text through the tf-idf algorithm and calculates the t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com