Unlock instant, AI-driven research and patent intelligence for your innovation.

A Keyword Search ksanew Method Combining Semantic Nodes and Edge Weights

A keyword search and keyword technology, applied in the field of massive data storage and retrieval, can solve the problems of ignoring additional weights, relying on the accuracy of semantic dictionaries, and keyword weight calculation methods that do not fully consider the timeliness characteristics, and achieve retrieval efficiency. improved effect

Active Publication Date: 2022-06-21
FUZHOU UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The biggest shortcoming of the TF-IDF weight calculation method is that keywords are regarded as independent individuals, ignoring the additional weights generated by the combination of keywords; although the semantic-based calculation method starts from the essential characteristics of keywords to calculate the weight , but depends on the accuracy of the semantic dictionary, the ischemic nature of the semantic dictionary will restrict the accuracy of this calculation
The calculation method based on the structural characteristics of the text defines the keyword weights based on the structural characteristics of the text. This method is more effective for texts with a relatively regular structure, but relatively ineffective for texts with chaotic structures.
[0003] Since the arrival of knowledge fragments will make the knowledge base dynamically change, the timeliness of the knowledge base will also become one of the key points of consideration. However, the existing keyword weight calculation methods do not fully consider the timeliness characteristics. Therefore, the present invention is based on the background of knowledge graphs. Next, a keyword search algorithm combining semantic nodes and edge weights is proposed. This algorithm combines the time-sensitive characteristics with the weight calculation formulas of semantic nodes and edges, and builds a query seed model with time-sensitive characteristics on the knowledge map model layer. , and then use the query seed as a guide to perform distributed keyword retrieval on the data layer to obtain query results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Keyword Search ksanew Method Combining Semantic Nodes and Edge Weights
  • A Keyword Search ksanew Method Combining Semantic Nodes and Edge Weights
  • A Keyword Search ksanew Method Combining Semantic Nodes and Edge Weights

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The technical solutions of the present invention will be described in detail below with reference to the accompanying drawings.

[0039] The present invention provides a keyword search KSANEW method combining semantic class nodes and edge weights, including two stages:

[0040] Data storage stage: As knowledge fragments are stored in the knowledge graph database, dynamically update the knowledge graph database including semantic class, entity and attribute data;

[0041] Keyword query stage: First, considering that the schema layer of the knowledge graph has a smaller amount of data than the data layer, a query seed model is proposed. The seed model maps query keywords to the schema layer. Then, through the node-based large weights The candidate seed model is generated by the direction expansion method and the edge-based large-weight direction expansion method. Then, the candidate seed model set is scored and sorted by the scoring function. Finally, the candidate seed m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a keyword search KSANEW algorithm combining semantic nodes and edge weights. Including: the data storage stage, which dynamically updates the data such as semantic classes, entities and attributes in the knowledge base with the arrival of knowledge fragments; the keyword query stage, which takes into account the model layer of the knowledge graph compared to the data layer data The amount is small, and a query seed model is proposed, which maps query keywords to the schema layer, and then generates candidates through two types of expansion methods, namely node-based expansion in the direction of large weights and edge-based expansion in the direction of large weights The seed model, and then use the scoring function to score and sort the candidate seed set, and finally use the high-scoring candidate seeds as query seeds, and use the query seeds as a guide to perform distributed searches on the data layer to obtain query results.

Description

technical field [0001] The invention belongs to the technical field of mass data storage and retrieval under a knowledge graph, and in particular relates to a keyword search KSANEW method combining semantic class nodes and edge weights. Background technique [0002] At present, the calculation methods of keyword weights are mainly divided into two categories: the calculation methods based on the characteristics of the keywords themselves and the calculation methods based on the text structure characteristics. The calculation methods based on the characteristics of keywords themselves mainly include: TF-IDF method, CHI method, IG method and semantic-based method. The biggest disadvantage of the TF-IDF weight calculation method is that the keyword is regarded as an independent individual, ignoring the additional weight generated by the combination of keywords; the semantic-based calculation method although the weight calculation is based on the essential characteristics of the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/22G06F16/242G06F16/28
Inventor 汪璟玢管健
Owner FUZHOU UNIV