Synonym identification method and device

A recognition method and a technology of synonyms, applied in the Internet field, to achieve good generalization ability, improve comprehensiveness and accuracy, and improve the effect of accuracy

Active Publication Date: 2015-10-14
ALIBABA GRP HLDG LTD
View PDF6 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The main purpose of this application is to provide a synonym recognition technology for the above-mentioned defects, to solve the problem of synonym recognition in the prior art relying on edit distance and knowledge base, to improve the comprehensiveness and accuracy of synonym recognition, thereby improving the accuracy of retrieval results. accuracy and efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Synonym identification method and device
  • Synonym identification method and device
  • Synonym identification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The main idea of ​​this application is that by obtaining the attribute words of the description text of the data object and the types corresponding to the attribute words, and combining user behavior logs and text features, a synonym recognition model can be obtained, and the same type can be determined according to the model Whether any two attribute words of are synonyms. This solution can identify synonyms based on user behavior logs, thereby effectively identifying synonyms with large differences in text. Moreover, dividing each description text into different types of attribute words, and judging synonyms based on different types of attribute words can better improve the accuracy of the judgment result. The scheme of this application does not depend on the knowledge base and edit distance, has good generalization ability, and can identify whether words that do not appear in the knowledge base are synonyms, thereby improving the comprehensiveness and accuracy of syn...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a synonym identification method, which comprises the following steps: according to a description text to be tested, using an attribute word identification model to obtain the attribute words of the description text to be tested and types corresponding to the attribute words; according to the attribute words and the types corresponding to the attribute words, combining with a user behavior log to calculate relevance characteristics among attribute words; according to the relevance characteristics among sample attribute words selected from the attribute words and textual characteristics among the sample attribute words, training a synonym identification model to obtain the synonym identification model; and according to the relevance characteristics among the attribute words to be tested and textual characteristics among the attribute words to be tested, using the synonym identification model to identify whether all attribute words to be tested are synonyms so as to carry out subsequent processing. According to the technical scheme of the synonym identification method, the comprehensiveness and the accuracy of synonym identification can be improved so as to improve the accuracy and the efficiency of a retrieval result.

Description

technical field [0001] The present application relates to the Internet field, and more specifically relates to a method and device for identifying synonyms. Background technique [0002] In the field of e-commerce, different types of attribute descriptors, namely attribute words, can be used to describe commodities. For example, "Chanel" is a brand attribute word of a commodity, "cotton" is a material attribute word of a commodity, "wallet" is a product attribute word, and "Galaxy" is a model attribute word. Due to the richness of natural language, there are a large number of synonymous and non-standard usages in the process of using attribute words. For example, the possible synonyms of the brand attribute word "Chanel" are "Chanel", "Chanel", "Chanel", "double C", "Xiaoxiang" etc.; the synonyms of the material attribute word "cotton" can be " Pure cotton", "100% cotton", "100% cotton" etc. In the commodity management in the field of e-commerce, in order to make the sold...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 陈俊波王力李红松庞昂博陈春明
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products