Check patentability & draft patents in minutes with Patsnap Eureka AI!

Distributed multifunctional search engine of elasticsearch

A search engine and distributed technology, applied in the direction of network data indexing, network data retrieval, and other database retrieval, etc., can solve the problems of unsatisfactory keyword retrieval results and low operating efficiency, so as to enhance user experience, The effect of improving accuracy

Inactive Publication Date: 2020-03-17
HOHAI UNIV CHANGZHOU
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the operation efficiency of such a system is not high, and the results of keyword retrieval are not very satisfactory.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed multifunctional search engine of elasticsearch
  • Distributed multifunctional search engine of elasticsearch

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0036] like figure 1 As shown, a distributed multifunctional search engine based on elasticsearch specifically includes the following steps:

[0037] Step 1: Distributed crawlers build the original search data set

[0038] The main data sources of the search engine of the present invention are large-scale forums and existing common engines, and the distributed crawler used is based on the scrapy framework to realize fast and real-time acquisition and preservation of large-scale data, and simultaneously realize deduplication and correlation of data judge to classify. Crawling the real-time information of the website by multiple nodes can ensure that our data covers a wide range of data.

[0039] The websites crawled by the present invention include Zhihu, Jobseeker.com, technical blogs, etc., and the content involves texts, pictures, etc. Since some websites have good anti-crawler mechanisms, we use high-performance proxy agents to hide their own IP addresses, break through ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a distributed multifunctional search engine based on elasticsearch. The distributed multifunctional search engine comprises the following steps: S1, constructing an original search data set by a distributed crawler; S2, cleaning the crawled data by utilizing natural language and image processing, and inserting the cleaned data into a non-relational database; S3, synchronizing the data in the non-relational database to an elasticsearch distributed cluster and a node; and S4, utilizing a Django network framework to realize interconnection of the foreground and the elasticsearch so as to complete establishment of the search engine. The distributed multifunctional search engine provided by the invention greatly improves the search accuracy and reasonability, and enhances the user experience.

Description

technical field [0001] The invention relates to a distributed multifunctional search engine of elasticsearch, which belongs to the technical field of the Internet. Background technique [0002] The scientific research value of search engines is not only reflected in its high technical challenge, but also in the convenience and high-speed transmission of information it provides to the entire Internet and even people's livelihood, and its high economic promotion effect on the entire society. The research on search engines is just the beginning. How to find and display information that best meets user needs in web information is not only unprecedentedly large in scale, but also very uncertain in terms of normative conditions. And the system is often difficult to determine what information the user really needs, so the input obtained by the system is a general and vague concept. [0003] Generally speaking, ordinary search engines only use specific computer programs to collect ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/951G06F16/9535
CPCG06F16/951G06F16/9535
Inventor 刘旭宸姚潇王钟贤徐宁刘小峰
Owner HOHAI UNIV CHANGZHOU
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More