Word semantic similarity solution method based on context window

A technology for semantic similarity and context, applied in the field of semantic networks, that achieves high accuracy, good linearity, and a good signal-to-noise ratio

Inactive Publication Date: 2017-05-03
SICHUAN YONGLIAN INFORMATION TECH CO LTD

AI Technical Summary

Problems solved by technology

The context of words is the resource and basis for acquiring natural language knowledge in corpus linguistics and for solving various practical application problems in natural language processing. In specific applications, an insufficient effective range of the context limits the correctness of the final result. To address this and to realize the quantitative calculation of the semantic similarity of words, the present invention provides a method for solving the semantic similarity of words based on the context window.



Examples


Embodiment Construction

[0021] In order to realize the quantitative calculation of the semantic similarity of words, the present invention is described in detail with reference to figure 1. Its specific implementation steps are as follows:

[0022] Step 1: Initialize the statistical method module.

[0023] Step 2: Input the words to be compared, C ∈ (c_1, c_2), into the initialized statistical method module.

[0024] Step 3: Determine the context word scope ("window") of the words to be compared, C ∈ (c_1, c_2). It is first necessary to obtain the location information J_sx and the context position weight value weight(C, C_ij), where the subscript ranges over (1, 2, …, 2n). The specific calculation process is as follows:

[0025] 3.1) First, let J_sx denote the location information of the context of the words to be compared, C ∈ (c_1, c_2).

[0026] For each word to be compared, C ∈ (c_1, c_2), extract the context words at the n positions to its left and the n positions to its right. These constitute its "to-be-compared word context matrix J_sx", whose matrix looks ...
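The window extraction in step 3.1 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name context_matrix, the padding token, and the choice of one matrix row per occurrence of the word are all assumptions, since the layout of J_sx is not shown in the text above.

```python
from typing import List

PAD = "<pad>"  # hypothetical filler for window positions outside the text


def context_matrix(tokens: List[str], target: str, n: int) -> List[List[str]]:
    """Return one row per occurrence of `target`.

    Each row holds the n words to the left followed by the n words to the
    right of that occurrence, so every row has exactly 2n entries.
    """
    rows = []
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        left = [tokens[j] if j >= 0 else PAD for j in range(i - n, i)]
        right = [
            tokens[j] if j < len(tokens) else PAD
            for j in range(i + 1, i + n + 1)
        ]
        rows.append(left + right)
    return rows


tokens = "the cat sat on the mat while the dog slept".split()
matrix = context_matrix(tokens, "the", 2)
for row in matrix:
    print(row)
```

With n = 2, each row has four entries; occurrences near the start or end of the text are padded rather than dropped, which is one possible design choice among several.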


Abstract

A word semantic similarity solution method based on a context window comprises the steps of: inputting the words to be compared into a statistical method module; determining the context range of the words to be compared; finding the two sentences with maximum weight within that range; calculating the similarity between the two sentences; and finally, solving the similarity between the words to be compared from the sentence similarity. The method provides a valuable quantitative description for determining the effective range of the context, overcoming the shortcoming of earlier subjective descriptions; the ability of a context position to describe the keyword decreases gradually from near to far, which conforms to common sense; the weight contribution value has better linearity and signal-to-noise ratio, which facilitates simple subsequent calculation; the normalization curve of the weight contribution value is more accurate; the influence of sentence constituent relations in the left and right windows of a keyword on defining the effective context window is considered; and word semantic similarity is solved by applying context window technology, with higher calculation precision and accuracy.
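The idea that a context position describes the keyword less well the farther away it is can be illustrated with a small sketch. The abstract does not give the weight formula, so the inverse-distance decay, the symmetric treatment of the left and right windows, and the function name position_weights below are assumptions chosen only to show a normalized, distance-decreasing weighting.

```python
from typing import List


def position_weights(n: int) -> List[float]:
    """Normalized weights for the 2n window positions, ordered left to right.

    Assumption: the raw weight of a position at distance d from the keyword
    is 1/d. The patent's actual weight function is not given in this text.
    """
    raw_left = [1.0 / d for d in range(n, 0, -1)]   # distances n, ..., 1
    raw_right = [1.0 / d for d in range(1, n + 1)]  # distances 1, ..., n
    raw = raw_left + raw_right
    total = sum(raw)
    return [w / total for w in raw]  # normalize so the weights sum to 1


weights = position_weights(3)
print(weights)
```

The positions adjacent to the keyword receive the largest weights and the weights sum to 1, so they can be used directly as contribution values in later steps.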

Description

Technical field

[0001] The invention relates to the technical field of semantic networks, in particular to a method for solving word semantic similarity based on a context window.

Background technique

[0002] Since the start of the 21st century, the global Internet industry has entered a new period of rapid development, and new technologies have emerged continuously. Natural language processing, an important technology connecting computers and people, has also made great progress. The methods for calculating the semantic similarity of words, at home and abroad, can be roughly divided into two categories: first, methods based on a semantic dictionary, which are simple, effective, and easy to understand, but rely on a relatively complete, large-scale semantic dictionary organized by structural hierarchy; second, corpus-based methods, which use a large-scale corpus and use the context i...
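As a generic illustration of the second, corpus-based category (not the method claimed in this patent), the sketch below counts the words that appear within n positions of each target word and compares the two count vectors by cosine similarity. The function names and the toy corpus are hypothetical.

```python
import math
from collections import Counter
from typing import List


def context_counts(tokens: List[str], target: str, n: int) -> Counter:
    """Count the words within n positions of every occurrence of `target`."""
    counts: Counter = Counter()
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        lo, hi = max(0, i - n), min(len(tokens), i + n + 1)
        counts.update(tokens[j] for j in range(lo, hi) if j != i)
    return counts


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity of two sparse count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


corpus = "the cat drinks milk the dog drinks water".split()
similarity = cosine(
    context_counts(corpus, "cat", 1),
    context_counts(corpus, "dog", 1),
)
print(similarity)
```

In this toy corpus "cat" and "dog" share exactly the same neighbours, so their similarity is close to 1; words with no shared neighbours score 0.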


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06F17/30
CPC: G06F16/3344, G06F40/30
Inventor: 金平艳
Owner: SICHUAN YONGLIAN INFORMATION TECH CO LTD