Spatial semantic similarity calculation method based on sliding window sampling

A technology of semantic similarity and sliding window, which is applied in computing, instrumentation, text database query, etc., and can solve problems such as insufficient accuracy

Active Publication Date: 2020-04-10
WUHAN UNIV
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In view of this, the present invention provides a method for calculating spatial semantic similarity based on sliding window sampling, which is used to solve or at least partly solve the problems in the prior art when dealing with information containing spatial relationship information and subject ambiguity. Insufficient technical issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Spatial semantic similarity calculation method based on sliding window sampling
  • Spatial semantic similarity calculation method based on sliding window sampling
  • Spatial semantic similarity calculation method based on sliding window sampling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention aims at the technical problem that the method in the prior art has insufficient accuracy when processing information containing spatial relationship information and subject fuzzy, and provides a spatial semantic similarity calculation method based on crowdsourced geographic big data and sliding window sampling, Mining the spatial semantic similarity relationship between words, constructing a model that can measure the spatial semantic similarity of words, as a new perspective to understand human natural language by integrating human spatial thinking and spatial perception, the traditional natural semantic similarity model is carried out It is an effective supplement to effectively improve the accuracy of intelligent geographic information retrieval and recommendation systems.

[0040] In order to achieve the above object, the main idea of ​​the present invention is as follows:

[0041]Based on the spatial semantic similarity calculation method of c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a spatial semantic similarity calculation method based on sliding window sampling. The method comprises the following steps: preprocessing corpus data containing spatial information, performing projection processing on coordinates in the preprocessed corpus data by adopting a preset equal-area projection method to obtain an actual space range, determining a context window,performing sliding sampling, and finally performing similarity calculation on every two words in a word set of the whole corpus. According to the method, a model capable of measuring the spatial semantic similarity of the words is constructed by mining the spatial semantic similarity relationship between the words. The method is superior to a traditional text similarity model and a geographic space similarity model in the aspect of comprehensively considering spatial correlation and text correlation. As a new angle of understanding human natural language by integrating human spatial thinking and spatial perception, a traditional natural semantic similarity model is effectively supplemented, and the accuracy of an intelligent geographic information retrieval and recommendation system is effectively improved.

Description

technical field [0001] The invention relates to the technical field of geographic information retrieval, in particular to a method for calculating spatial semantic similarity based on sliding window sampling. Background technique [0002] For natural language processing (NLP) problems under the current cross-discipline of computer and linguistics, calculating the similarity relationship between words in text is a key part of solving these problems. [0003] In the prior art, general word similarity models are obtained by using large text corpus and deep learning training methods, such as Google's Word2Vec (Mikolov, Chen et al.2013) model and Facebook's Fasttext (Joulin, Grave et al. .2016) model. [0004] In the process of implementing the present invention, the inventor of the present application found that the method of the prior art has at least the following technical problems: [0005] The above-mentioned models in the prior art perform well on common general texts, b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/9537G06F16/33
CPCG06F16/9537G06F16/3344G06F16/3346
Inventor 王博智费腾杜清运康雨豪李梦
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products