Synonym screening method and system

A screening method and technology of synonyms, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve the problems of good timeliness, inability to obtain synonyms, timeliness and poor coverage, etc., to improve the level of understanding sentences. Effect

Active Publication Date: 2017-12-08
GUANGZHOU DUOYI NETWORK TECH +2
View PDF7 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] 1. For rule methods that rely on ontology dictionaries or knowledge bases, since dictionaries and knowledge bases mostly rely on manual construction, their timeliness and coverage are relatively poor
[0009] 2. The method based on search log behavior needs to use the structural template of the synset, and the scalability and coverage are not good
[0010] 3. The cosine similarity expressed by the word vectorization of the neural network language model is used to measure the semantic similarity of words. This method has a certain effect, but the existing methods cannot obtain high-quality synonyms
The word vector of the neural network language model can reflect the semantic similarity to a certain extent, but some of the obtained similar words are not semantically similar, and these methods cannot effectively remove non-synonymous words to obtain high-quality synonyms
[0011] To sum up, the existing methods for obtaining synonyms cannot meet the requirements of wide coverage, good timeliness, and high quality at the same time, and cannot meet the needs of natural language processing, and it is also difficult to improve the level of sentence understanding of chat robots

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Synonym screening method and system
  • Synonym screening method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The present invention will be described in further detail below in conjunction with the examples and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0048] Please also see figure 1 , which is a flow chart of the steps of the synonym screening method of the present invention. The present invention provides a kind of synonym screening method, comprises the following steps:

[0049] S1: Training word vectors of large corpus words.

[0050] Further, the step S1 specifically includes:

[0051] S11: Grab raw data. Specifically, S11 is specifically: capture text data of various themes as a large corpus, including various types of data in various fields, for example: various types of news texts, novel texts of various themes, and encyclopedia texts of all entries.

[0052] S12: Preprocessing large corpus. The step S12 specifically includes: removing non-Chinese characters, and performing word segmentation through the searc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a synonym screening method which includes the steps: training the word vector of a big corpus word; mining synonyms of the big corpus word, and particularly, acquiring a candidate synonym set; updating synonym similarity; performing screening to obtain a synonym list. Compared with the prior art, the synonym screening method, the synonyms obtained by big corpus training are wide in coverage, synonyms with good timeliness can be found by adding new big corpuses, the synonyms obtained by screening according to the principle of the requirement of the synonyms for synonymy are higher in quality, and a forceful tool for natural language processing semantic comprehension is added. The synonym screening method is applied to a chatting robot, sentences expressing the same meaning with different words by a user can be more effectively recognized, and the sentence understanding level of the robot is improved.

Description

technical field [0001] The invention relates to the field of artificial intelligence, in particular to a method and system for screening synonyms. Background technique [0002] In the design of chat robots, it is often necessary for the computer to understand the same sentence of the user and use different expressions to improve the robot's recognition level of the sentence, among which the conversion of synonyms is the most common method. Synonyms play an important role in basic applications such as information extraction, question answering systems, and data mining. Existing methods for mining synonyms either have narrow coverage of words, or the acquired synonyms are relatively old, or the quality of synonyms is not high. These problems affect the application of synonyms in the field of natural language processing. [0003] The methods adopted in prior art for mining synonyms mainly include: [0004] 1. Rely on the rule method of ontology dictionary or knowledge base. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/247G06F40/284G06F40/289
Inventor 徐波
Owner GUANGZHOU DUOYI NETWORK TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products