Supercharge Your Innovation With Domain-Expert AI Agents!

ElasticSearch relevance algorithm optimization method and system

An optimization method and correlation technology, applied in computing, special data processing applications, instruments, etc., can solve problems such as inaccurate search and recommendation results, and achieve the effect of improving accuracy

Active Publication Date: 2017-11-07
哈尔滨工程大学科技园发展有限公司
View PDF5 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0013] The present invention proposes an ElasticSearch search correlation algorithm optimization system and method in order to solve the problem of inaccurate search and recommendation results of the ElasticSearch search server's correlation algorithm in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • ElasticSearch relevance algorithm optimization method and system
  • ElasticSearch relevance algorithm optimization method and system
  • ElasticSearch relevance algorithm optimization method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0096] Embodiment one, combine figure 1 Describe this embodiment in detail, a kind of ElasticSearch search relevance degree algorithm optimization system, the technical scheme adopted is as follows: described correlation degree algorithm optimization system comprises:

[0097] A search module for searching input text or characters;

[0098] A judging module for judging whether the input text or characters are Chinese characters;

[0099] A parsing module for parsing input Chinese characters into Pinyin;

[0100] A matching module for matching each Chinese pinyin, the first letter of the pinyin or English characters with the content in the index database and generating matching results;

[0101] A correlation degree optimization judgment module for judging whether to optimize the correlation degree algorithm for the matching result generated by the matching module;

[0102] A return-null module for determining the matching result as no query result and returning a null value...

Embodiment 2

[0105] Embodiment two, combine figure 1 Describe this embodiment in detail. This embodiment is a further limitation of the ElasticSearch search correlation algorithm optimization system described in Embodiment 1. The correlation algorithm optimization system also includes:

[0106] A search result sending module for sending the search result of the search module to the judgment module;

[0107] A Chinese character sending module for sending the Chinese character data judged by the judging module to the parsing module;

[0108] A non-Chinese character sending module for sending the non-Chinese character data judged by the judging module to the parsing module;

[0109] An analysis data sending module for sending the analysis data obtained by the analysis module to the matching module;

[0110] A matching data sending module for sending the matching result generated by the matching module to the correlation optimization judgment module;

[0111] After the correlation optimizat...

Embodiment 3

[0114] Embodiment three, combine figure 2 Describe this embodiment in detail. This embodiment is a further limitation of the ElasticSearch search correlation algorithm optimization system described in Embodiment 1. The correlation optimization module includes:

[0115]A document list module for traversing the document list recommended by the original algorithm;

[0116] A keyword splitting module for splitting keywords in the matching result into individual characters;

[0117] A character hit judging module for judging whether the characters split by the keyword splitting module hit in the documents of the document list;

[0118] A character scoring module for performing character scoring on the characters split by the keyword splitting module;

[0119] A keyword position weight calculation module for calculating the position weight of the hit character in the keyword judged by the character hit judgment module;

[0120] A document position weight calculation module for c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an ElasticSearch relevance algorithm optimization method and system, and belongs to the technical field of relevance algorithm optimization. The problem that an existing relevance algorithm is inaccurate is solved. According to the relevance algorithm optimization method and system, scores calculated through the relevance algorithm serve as one dimension in a new algorithm, then a character relevance scoring dimension is also used for scoring, after the scores are obtained, the two scores are scaled and added according to a multiple, and then search recommendation files are sequenced according to the scores to obtain characters which are matched most accurately. The relevance algorithm optimization method and system are suitable for being used in optimization of various search relevance algorithms.

Description

technical field [0001] The invention relates to a search correlation degree algorithm optimization system and method, and belongs to the technical field of correlation degree algorithm optimization. Background technique [0002] In this era when the Internet is ubiquitous, all kinds of data exist in our lives, such as our daily WeChat chat records, the endless status of Moments, and daily updated news information. Various internal emails, product information on e-commerce websites, etc. [0003] We want to find the target data quickly, and the traditional database like cannot match the target data well, so an Internet technology search is produced. The search is to score each document in the search according to the correlation algorithm, and the one with the highest score is Search recommends the best matching data. [0004] The existing correlation algorithm consists of the following parts: [0005] score(q,d)=queryNorm(q)*coord(q,d)*∑(tf(t in d)*idf(t) 2 *t.getBoost()*...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/3334G06F16/9535
Inventor 谭云峰
Owner 哈尔滨工程大学科技园发展有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More