Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Reliable search method base on content trust

A credible and content-based technology, applied in the search field, can solve problems such as low recall and precision, expired information, and reduce the precision of search engines, so as to achieve the effect of improving the precision

Inactive Publication Date: 2012-12-05
TONGJI UNIV
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

According to the above evaluation criteria, there are seven types of problems in the existing search engines: (1) The recall and precision rates are low. Even the search engines with the most complete functions can only find about 1 / 3 of the web pages on the Web. rate cannot be guaranteed
On the other hand, due to the huge amount of network information, complicated and disorderly, invalid links on web pages, repeated query results, expired information, and distorted information, the accuracy rate of search has been greatly reduced.
(2) The problem of web page cheating. Since search engines have become a tool for network users to obtain information, there has been a phenomenon of cheating on search engine page rankings.
(3) Security issues, search engines are becoming more and more powerful, and have a tendency to penetrate into every corner of the Internet
Search Engine Security Flaw Inadvertently Opens Up for Hackers
(4) Retrieval function problems. The current search is mainly for full-text databases, bibliographic databases, and retrieval tool indexes, but there are too few retrieval points to achieve conditional linkage retrieval.
The search engine database is huge, it is not easy to update, and the quality of information is difficult to guarantee
Inferior and invalid information reduces the precision rate of search engines, and also affects the confidence of users to quickly obtain valuable information
(6) Standardization of search engines, including non-standardization of search terms, segmentation of English and Chinese characters, repeated appearance of the same result, and standardization of query interfaces
(7) Inaccurate expression of user search needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Reliable search method base on content trust
  • Reliable search method base on content trust
  • Reliable search method base on content trust

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] A. The user interaction module receives the user's search keywords and distributes them to each search engine that provides original search services;

[0027] B. Receive traditional search results provided by various search engines and submit them to the content trust detection module;

[0028] C. The content trust detection module performs deduplication, text normalization, trust semantic understanding, content credibility calculation and search result reordering operations on traditional search results, and submits trusted search results to the user interaction module;

[0029] D. The user interaction module presents trusted search results to the user.

[0030] The calculation of the content credibility includes:

[0031] Extraction of trust facts: Trust facts refer to declarative sentences that describe the concepts, attributes, and characteristics of something in varying degrees in a judgmental (affirmative or negative) manner in the content of the information text...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to a reliable search method base on content trust, comprising the following steps: A, receiving search keywords of users through a user interaction module, and distributing the search keywords to each search engine providing original search service; B, receiving traditional search results provided by each search engine, and submitting the traditional search results to a content trust detection module; C, performing the operations such as the elimination of repetition, the text normalization, the comprehension of trust semantemes, the calculation of content credibility and the rearrangement of search results, submitting the reliable search results to the user interaction module; D, presenting the reliable search results through the user interaction module to the users. The reliability of the essence of text information, text content is evaluated in the invention, And three text-content-trust assessment methods are proposed base on trust facts, trust evidences and trust characteristics, and are unified by utilizing Bayes network. The credibility of the text contents is applied to a sorting algorithm, which can improve the precision of the search results.

Description

technical field [0001] The invention relates to search technology, in particular to a credible search method based on content trust. Background technique [0002] In the vast Internet world, search technology is the key to provide users with accurate and fast information services. A typical search engine consists of three parts: a crawler program, an indexing system, and a user interface. The crawler program collects as many web pages as possible on various Internet sites and stores them in the local database; the index system builds an inverted index that can provide fast search for the information in the local database, and at the same time sorts the importance of the information. Currently, the most famous The sorting algorithm is the PageRank algorithm provided by Google search founders Larry Page and Sergey Brin; the user interface receives the user's search keywords and submits them to the search server, and returns the search results provided by the search server to ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 曾国荪王伟王晓君黄宇蒋昌俊苗夺谦
Owner TONGJI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products