Semantic disambiguation method and system based on related words topic

A semantic technology based on related words, applied in the field of natural language processing. It addresses the limited vocabulary coverage and weak new-word handling of dictionary-based approaches, whose update speed cannot keep up with rapid changes in language use, and it achieves high flexibility and good disambiguation.

Inactive Publication Date: 2013-10-23
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

[0003] With the development of language processing technology, existing word sense disambiguation methods are mainly based on semantic dictionaries: the semantic dictionary database is searched for the definition of each word, its set of synonyms, its extended definition, and its set of extended synonyms. Because a dictionary database is inherently limited, these methods have deficiencies in vocabulary coverage and new-word handling, and the dictionary's update speed cannot keep up with the rapid changes in actual language use.



Embodiment Construction

[0023] In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0024] According to one aspect of the present invention, a preferred embodiment provides a semantic disambiguation method based on related word topics. Referring to Figure 1, which is a flow chart of the method according to a preferred embodiment of the present invention, the method comprises the following steps:

[0025] Step 101, mining related words based on related word topics.

[0026] Specifically, a related word topic as described herein is represented as a series of related words that together represent a specific topic. For example, for the sentences "Fuji apples are sweet" and "apples are easy to use", the corresponding topics may be "fruit" and "mobile phone", respective...
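As a minimal sketch of the idea that a topic is a series of related words, the Python snippet below picks a topic for a sentence by counting how many context words fall into each topic's word set. The topic word lists, the `TOPICS` name, and the `guess_topic` function are hypothetical illustrations; the overlap count is an assumed stand-in and not the mining procedure described in the patent.

```python
# Hypothetical topics, each represented as a set of related words.
TOPICS = {
    "fruit": {"fuji", "sweet", "juicy", "orchard"},
    "mobile phone": {"use", "screen", "battery", "app"},
}


def guess_topic(tokens, topics=TOPICS):
    """Return the topic whose related words overlap the context most, or None."""
    context = set(tokens)
    scores = {name: len(context & words) for name, words in topics.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


fruit_sentence = ["fuji", "apples", "are", "sweet"]
phone_sentence = ["apples", "are", "easy", "to", "use"]
print(guess_topic(fruit_sentence))
print(guess_topic(phone_sentence))
```

The ambiguous word "apples" carries no signal by itself here; the surrounding words decide which topic applies.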


Abstract

The invention provides a semantic disambiguation method based on related word topics. The method comprises the following steps: mining related words on the basis of related word topics; numbering each word and establishing a corresponding frequency feature vector; calculating mutual information values between words and using the mutual information values as the feature vector; calculating the similarity between each word and its related words; and carrying out semantic disambiguation. The invention further provides a corresponding semantic disambiguation system based on related word topics. The method and system improve the accuracy of disambiguation.
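The mutual information and similarity steps can be sketched as follows. This is an illustration under stated assumptions, not the patent's implementation: co-occurrence is counted per sentence, pointwise mutual information (PMI) stands in for the "mutual information value", only positive values are kept, and cosine similarity is assumed as the similarity measure. The toy corpus and the names `pmi_vectors` and `cosine` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_vectors(sentences):
    """Map each word to a sparse vector of positive PMI values with other words."""
    word_count = Counter()
    pair_count = Counter()
    for tokens in sentences:
        unique = sorted(set(tokens))  # sorted so each pair has one canonical order
        word_count.update(unique)
        pair_count.update(combinations(unique, 2))
    n = len(sentences)
    vectors = {word: {} for word in word_count}
    for (a, b), c in pair_count.items():
        # PMI = log( p(a, b) / (p(a) * p(b)) ), with sentence-level probabilities.
        pmi = math.log((c * n) / (word_count[a] * word_count[b]))
        if pmi > 0:
            vectors[a][b] = pmi
            vectors[b][a] = pmi
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(x * v[k] for k, x in u.items() if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy corpus: two fruit-like sentences, two phone-like sentences.
corpus = [
    ["fuji", "apple", "sweet", "fruit"],
    ["apple", "phone", "screen"],
    ["pear", "sweet", "fruit"],
    ["phone", "screen", "battery"],
]
vectors = pmi_vectors(corpus)
print(cosine(vectors["pear"], vectors["fuji"]))
print(cosine(vectors["pear"], vectors["battery"]))
```

In this toy corpus "pear" and "fuji" share the context words "fruit" and "sweet", so their vectors are similar, while "pear" and "battery" share no context and score zero.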

Description

Technical field

[0001] The invention relates to the field of natural language processing, and in particular to a semantic disambiguation method and system based on related word topics.

Background technique

[0002] Because language takes on specific meanings in different contexts, the same text can be interpreted in different ways. For example, "makeup and clothing" can be segmented as "makeup", "and", "clothing" or as "makeup", "kimono", "dress". Without human knowledge to draw on, ambiguity arises easily, and it is difficult for a computer to know which segmentation is correct. The computer therefore needs to be guided to recognize the correct word meaning.

[0003] With the development of language processing technology, the existing word sense disambiguation methods are mainly based on semantic dictionaries, through the semantic dictionary database to find the definition of each word, the set of synonyms, the extended definition and the set of extended synonyms, but becaus...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
Inventor: 方高林
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD