Synonym mining method and apparatus

A technology concerning synonyms and parts of speech, applied to synonym mining methods and devices, that addresses problems such as filtering results being affected when the synonyms of a feature word are overlooked.

Active Publication Date: 2017-05-10
SHANGHAI XIAOI ROBOT TECH CO LTD

AI Technical Summary

Problems solved by technology

When a feature word has synonyms, entering only the feature word without considering its synonyms will inevitably affect the filtering results.



Embodiment Construction

[0020] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0021] Embodiments of the present invention propose a synonym mining method and device. Because the specific meaning of a word is closely related to its context, the embodiments represent each word's meaning as a word vector and then apply a clustering algorithm to cluster the resulting word vectors semantically, yielding generalized synonym sets. Preferably, in the embo...


Abstract

The present invention discloses a synonym mining method and apparatus. The method comprises the steps of: performing word segmentation on acquired corpus data to obtain multiple separate words; calculating a word vector for each separate word; and clustering the separate words according to their word vectors to obtain a synonym set. The meaning of each word is expressed as a word vector, and a clustering algorithm then clusters the obtained word vectors by meaning, so that a generalized synonym set is mined effectively. The method is a new way of mining synonyms in natural language processing. When the mined synonym set is applied in the field of natural language processing, the accuracy of knowledge point filtering, keyword extraction, text classification, and meaning clustering tasks is improved.
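The three steps named in the abstract (segmentation, word vectors, clustering) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: whitespace splitting stands in for word segmentation, context co-occurrence counts stand in for word vectors, and a greedy cosine-similarity threshold stands in for the clustering algorithm. The names `segment`, `word_vectors`, `cosine`, and `cluster`, and the `window` and `threshold` parameters, are hypothetical.

```python
import math
from collections import Counter, defaultdict


def segment(sentence):
    # Assumption: whitespace splitting stands in for a real word segmenter.
    return sentence.lower().split()


def word_vectors(sentences, window=2):
    # Assumption: each word is represented by counts of the words that
    # appear within `window` positions of it (a simple context vector).
    vectors = defaultdict(Counter)
    for sentence in sentences:
        words = segment(sentence)
        for i, word in enumerate(words):
            start = max(0, i - window)
            end = min(len(words), i + window + 1)
            for j in range(start, end):
                if j != i:
                    vectors[word][words[j]] += 1
    return vectors


def cosine(u, v):
    # Cosine similarity between two sparse count vectors.
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster(vectors, threshold=0.8):
    # Assumption: greedy clustering. A word joins the first cluster whose
    # first member is at least `threshold` similar; otherwise it starts
    # a new cluster.
    clusters = []
    for word in sorted(vectors):
        for group in clusters:
            if cosine(vectors[word], vectors[group[0]]) >= threshold:
                group.append(word)
                break
        else:
            clusters.append([word])
    return clusters


corpus = [
    "run the program now",
    "run the code now",
    "run the procedure now",
]
for group in cluster(word_vectors(corpus)):
    print(group)
```

On a toy corpus this small, any words that share identical contexts are grouped together, whether or not they are true synonyms, so the sketch shows only the shape of the pipeline. A realistic setup would need a large corpus, a proper segmenter, and trained word vectors.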

Description

Technical field

[0001] The present invention relates to the technical field of information processing, and in particular to a method and device for mining synonyms.

Background

[0002] Synonymy (several words sharing one meaning) and polysemy (one word having several meanings) are widespread phenomena in language. For example, "program" can be a synonym of both "procedure" and "code" (in the computer field), which makes natural language processing considerably more difficult. For example, an intelligent question-answering knowledge base contains multiple knowledge points. When knowledge points must be filtered by feature words, how comprehensive the input feature words are plays a very important role in the accuracy and completeness of the filtering results. When a feature word has synonyms, entering only the feature word without considering its synonyms will inevitably affect the filtering results. Therefore, how to mine synonyms so as to apply the mined synonyms to ...
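The filtering problem described in the background can be illustrated with a small sketch: a knowledge point that uses a synonym of the feature word is missed unless the feature words are first expanded with a synonym set. The function names `expand` and `filter_knowledge_points`, the sample knowledge points, and the whitespace-based matching are hypothetical choices for illustration, not the patent's procedure.

```python
def expand(feature_words, synonym_sets):
    # Add every member of any synonym set that contains a feature word.
    base = set(feature_words)
    expanded = set(base)
    for group in synonym_sets:
        if base & set(group):
            expanded |= set(group)
    return expanded


def filter_knowledge_points(points, feature_words, synonym_sets=()):
    # Assumption: a knowledge point matches if it contains any of the
    # (optionally expanded) feature words as a whitespace-separated token.
    terms = expand(feature_words, synonym_sets)
    return [p for p in points if terms & set(p.lower().split())]


points = [
    "how to install the program",
    "where is the code stored",
    "reset my password",
]
synonym_sets = [["program", "code", "procedure"]]

# Without synonyms, only the knowledge point containing "program" matches.
print(filter_knowledge_points(points, ["program"]))
# With the synonym set, the knowledge point containing "code" matches too.
print(filter_knowledge_points(points, ["program"], synonym_sets))
```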


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
CPC: G06F16/3344, G06F16/35, G06F2216/03, G06F40/30
Inventor: 谢瑜, 张昊, 朱频频
Owner: SHANGHAI XIAOI ROBOT TECH CO LTD