A Word Relevance Judgment Method Based on the Shortest Path

A shortest-path-based judgment method, applied in natural language data processing, instruments, calculations, etc., that addresses the insufficient accuracy, low efficiency, and poor flexibility of existing word correlation judgment, with the effect of increasing accuracy and flexibility.

Active Publication Date: 2021-10-22
KUNMING UNIV OF SCI & TECH
8 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a word correlation judgment method based on the shortest path that addresses the limitations and deficiencies of the prior art, namely insufficient accuracy, low efficiency, and poor flexibility, and to increase the accuracy and flexibility of computer-based word correlation judgment.

Examples

Embodiment 1

[0029] Embodiment 1: As shown in Figures 1-2, a word correlation judgment method based on the shortest path first establishes a word database and cleans the word data in it (removing punctuation marks, removing stop words, and performing word segmentation) to obtain the entry corresponding to each word. The user then enters two words; the corresponding entries are retrieved by searching the database and processed recursively to obtain the shortest distance between the two words. Finally, the correlation between the two words is output according to the matching definition.
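The lookup-and-distance stage described above can be sketched as follows. This is a minimal illustration, not the patented procedure: it assumes each word is linked to the tokens of its cleaned definition, it uses an iterative breadth-first search in place of the recursive calculation the text mentions, and the toy `ENTRIES` data, the function names, and the `max_depth` cap are all hypothetical. The mapping from distance to a correlation label (the "matching definition") is not given in the available text and is left out.

```python
from collections import deque

# Hypothetical toy database: word -> tokens of its cleaned, segmented definition.
ENTRIES = {
    "puppy": ["young", "dog"],
    "dog": ["domestic", "animal"],
    "cat": ["small", "domestic", "animal"],
    "animal": ["living", "organism"],
}


def shortest_distance(entries, source, target, max_depth=5):
    """Breadth-first search from source along definition links.

    Returns the number of definition hops needed to reach target,
    or None if target is not reached within max_depth hops.
    """
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        word, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand beyond the depth cap
        for token in entries.get(word, []):
            if token == target:
                return depth + 1
            if token not in seen:
                seen.add(token)
                queue.append((token, depth + 1))
    return None


def word_distance(entries, a, b, max_depth=5):
    """Symmetric distance: the shorter of the a->b and b->a searches (assumption)."""
    found = [
        d
        for d in (
            shortest_distance(entries, a, b, max_depth),
            shortest_distance(entries, b, a, max_depth),
        )
        if d is not None
    ]
    return min(found) if found else None
```

Breadth-first search returns the fewest hops on an unweighted graph, which is why it is used here; a depth cap keeps the search bounded on a large dictionary.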

[0030] Specifically, the method includes the following steps:

[0031] Step 1: Establish a word database. From authoritative dictionaries such as the "Chinese Dictionary" and the "Xinhua Dictionary", obtain all words $X_i$, $i \in [1, N]$, together with the corresponding part of speech $X_i'$, $i \in [1, N]$, and the corresponding interpretation, $i \in [1, N]$, and establish a word database.

[0032] Specifically: the following words exist in the wor...
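One possible shape for the Step 1 database and the cleaning operations (punctuation removal, stop-word removal, segmentation) is sketched below. Everything named here is an assumption for illustration: the `Entry` record, the `clean` and `build_database` helpers, the stop-word list, and the English sample row. In particular, whitespace splitting stands in for real Chinese word segmentation, which would need a dedicated segmenter.

```python
import string
from dataclasses import dataclass

# Hypothetical stop-word list; a real system would load a full list.
STOP_WORDS = {"a", "an", "the", "of", "or", "that", "is"}

# ASCII punctuation plus a few common full-width Chinese marks.
PUNCTUATION = string.punctuation + ",。、;:?!()“”"


@dataclass
class Entry:
    word: str            # the headword X_i
    part_of_speech: str  # its part of speech X_i'
    tokens: list         # cleaned, segmented interpretation


def clean(interpretation, stop_words=STOP_WORDS):
    """Strip punctuation, lowercase, split into tokens, drop stop words."""
    table = str.maketrans("", "", PUNCTUATION)
    text = interpretation.translate(table).lower()
    # Whitespace split is a stand-in for word segmentation (assumption).
    return [token for token in text.split() if token not in stop_words]


def build_database(rows):
    """Build a word -> Entry mapping from (word, part of speech, interpretation) rows."""
    return {word: Entry(word, pos, clean(interp)) for word, pos, interp in rows}


db = build_database([("puppy", "noun", "A young dog, or a pup.")])
```

Storing the cleaned tokens alongside each headword means the later lookup only needs a dictionary access per word.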

Abstract

The invention relates to a method for judging word correlation based on the shortest path, and belongs to the technical field of Chinese information processing. First, a word database is established and the word data in it is cleaned, including operations such as removing punctuation marks, removing stop words, and word segmentation, to obtain the entry corresponding to each word. The user then enters two words; the corresponding entries are obtained by searching the database and are processed recursively to obtain the shortest distance between the two words. Finally, the correlation between the two words is output according to the matching definition. Compared with the prior art, the present invention mainly addresses the insufficient accuracy, low efficiency, and poor flexibility of existing word correlation judgment, and increases the accuracy and flexibility of computer-based word correlation judgment.

Description

Technical field

[0001] The invention relates to a method for judging word correlation based on the shortest path, and belongs to the technical field of Chinese information processing.

Background technique

[0002] The determination of word relevance is widely used in Chinese information processing. For example, traditional information retrieval technology searches by keyword matching, but such retrieval cannot meet users' requirements in terms of efficiency. By judging the correlation between words, users can "intelligently" retrieve the information they need.

[0003] At present, the PMI (pointwise mutual information) algorithm is usually used to judge the correlation of words. However, this algorithm only counts co-occurrences and single occurrences and judges by an information-theoretic measure; it is relatively lacking in accuracy and requires a large number of documents as training data in advance.

Contents of t...
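For reference, the PMI baseline criticized in the background section can be sketched from raw counts as below. This uses the standard textbook definition of pointwise mutual information, not a formula taken from the patent; the function name `pmi`, its parameters, and the base-2 logarithm are illustrative choices.

```python
import math


def pmi(co_count, count_x, count_y, total):
    """Pointwise mutual information from counts.

    PMI(x, y) = log2( p(x, y) / (p(x) * p(y)) ), with probabilities
    estimated as count / total; this simplifies to the ratio below.
    """
    if min(co_count, count_x, count_y, total) <= 0:
        raise ValueError("all counts must be positive")
    return math.log2(co_count * total / (count_x * count_y))
```

A value of 0 means the two words co-occur exactly as often as independence would predict; the estimate depends entirely on corpus counts, which is the training-data requirement the background section points to.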

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F40/247
CPC: G06F40/247, G06F40/289
Inventor: 龙华, 祁俊辉, 杜庆治, 宋耀莲
Owner: KUNMING UNIV OF SCI & TECH