Semantic similarity calculation method and device based on CTW and KM algorithms
A technology of semantic similarity and KM algorithm, applied in the computer field, can solve the problem of low accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0073] This embodiment provides a method for computing semantic similarity based on CTW and KM algorithms, please refer to figure 1 , the method includes:
[0074] First, step S1 is performed: select the preset corpus, and train through the method of preset word vector combined with neural network learning to obtain a word vector space, wherein each word vector in the word vector space is used to represent the semantic information of the word segment.
[0075] Specifically, the Word2Vec deep learning platform can be used to train the preset corpus to obtain word vectors, and finally obtain word vector data with 200-dimensional features to form a word segmentation vector library (word vector space).
[0076] Word2Vec comes from the word vector computing model developed by Google, which uses the idea of deep learning to automatically learn the essential information of word data from large-scale text data. Deep-Learning (deep learning) learns more useful features in the data b...
Embodiment 2
[0185] This embodiment provides a device for computing semantic similarity based on CTW and KM algorithms, please refer to Figure 4 , the device consists of:
[0186] The word vector space obtaining module 401 is used to select a preset corpus, and train through preset word vectors combined with neural network learning to obtain a word vector space, wherein each word vector in the word vector space is used to represent the word segmentation semantic information;
[0187] The word component array building module 402 is used to carry out word segmentation between the text to be compared and the source text, and then according to the word vector space, respectively establishes a word component array corresponding to the text to be compared and the source text;
[0188] CTW distance calculation module 403, for calculating the CTW distance of each participle in the text to be compared and each participle in the source text in turn;
[0189] CTW matrix construction module 404, fo...
Embodiment 3
[0220] Based on the same inventive concept, the present application also provides a computer-readable storage medium 400, please refer to Figure 5 , on which a computer program 411 is stored, and the method in Embodiment 1 is implemented when the program is executed.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com