Information retrieval system and method based on big data

An information retrieval and big data technology, applied in the field of big data information retrieval systems, can solve problems such as increasing user burden, reducing search efficiency, and not fully considering user behavior, so as to achieve the effect of improving retrieval efficiency and reducing user burden

Inactive Publication Date: 2018-05-01
JINLING INST OF TECH
View PDF7 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But these two methods undoubtedly increase the burden on users and reduce the search efficiency
Moreover, the user's behavior is not fully considered when sorting the retrieved items.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information retrieval system and method based on big data
  • Information retrieval system and method based on big data
  • Information retrieval system and method based on big data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] Preferred embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0029] A traditional search engine generally consists of four parts: a data grabber, a parser, an indexer, and a retriever. The present invention adds two functional modules to the retriever of the traditional search engine framework, which are respectively a client user behavior collection module and a server-side big data intelligent analysis module. like figure 1 shown. The details are as follows:

[0030] 1) User behavior collection module

[0031] It mainly collects the user's behavior records from the time the user enters the search information to the time when the user closes the browser. User behavior information includes: the number of clicked URLs, the content of each URL, the time when the URL is clicked, and the time when the browser is closed. Wherein, the time when the URL is clicked refers to the time point when the user c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an information retrieval system and method based on big data. For improving accuracy of a search engine and a user satisfaction degree, and improving a traditional search engine framework, two function modules are added in the traditional search engine, and are respectively a user behavior collection module of a client and a big-data intelligent-analysis module of a server. The user behavior collection module mainly collects records and time stamps of selecting retrieval items in lists returned by the server and clicking corresponding URLs (links) by a user after information retrieval, namely selection and clicking situations of the user in a time period from starting of one time of retrieval until closing of a browser. Functions of the big-data intelligent-analysis module are to calculate and counting collected user behaviors, judge accuracy of retrieval items according to behavior information of clicking retrieval items by the user after a certain query, carry out reranking on the retrieval items, update a database, and provide more accurate retrieval results, which more satisfy the user, for next retrieval.

Description

technical field [0001] The invention relates to the field of information retrieval, and in particular relates to a big data information retrieval system and method. Background technique [0002] A search engine is a type of website that specifically provides retrieval services on the Internet. These websites use web search software (also known as web spiders) to collect pages from a large number of websites on the Internet locally, and build databases after processing, so as to be able to Respond to various queries from users. [0003] With the popularization of Internet applications and the advent of the era of big data, the number of global Internet pages increases by tens of millions every day. To retrieve needed information in the vast network, search engines have become an indispensable assistant for accessing the Internet. [0004] The working principle of traditional search engines can be used figure 1 For illustration, the shaded part is the module added after the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/9535G06F16/9566
Inventor 杨荣根龚乐君
Owner JINLING INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products