Text searching method and device

A text and original text technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of large space consumption, high retrieval time complexity, poor accuracy, etc., and achieve the effect of low time complexity

Inactive Publication Date: 2008-03-12
HUAWEI TECH CO LTD +1
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0016] In view of this, the embodiment of the present invention provides a network text retrieval method and device, which overcomes the defects of high time complexity, large space consumption and poor accuracy of text retrieval on the existing network

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text searching method and device
  • Text searching method and device
  • Text searching method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions proposed in the embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0036] The text retrieval method of the embodiment of the present invention is based on the "adaptive mapping" algorithm. Referring to Figure 1, the "adaptive mapping" algorithm includes the following steps:

[0037] S102: Input the original data; the original data is in the form of a vector, and its dimension may be M;

[0038] S104: Determine the dimension of the target mapping space, that is, the dimension of the data to be reduced in dimension, which may be set as N;

[0039] S106: Determine the dimensionality reduction mapping relationship according to the dimensionality of the original data and the determined dimensionality of the target mapping space;

[0040] S108: According to the dimension reduction mapp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text retrieval method and a relevant device. The method comprises: Input primary text data; perform self-adapting mapping process with a method of descent for the primary text data; carry forth retrieval text data similar to the data after the self-adapting mapping process with the method of descent; output the retrieval text data. The invention execution example can fulfill effective compression for original data after decency for higher-dimensional text data; the invention is characterized in rather low time complexity, adapting to magnanimity data, and effectively maintaining similarity between all vectors of the text data. Utilization of the invention execution example method or the device can fulfill quick response to such requests as network text inquiry, search, retrieval and etc at rather low arithmetic cost, so as to resolve the problems in prior network text retrievals of high time complexity, large space consumption and rather low accuracy.

Description

technical field [0001] The invention relates to text retrieval technology, in particular to a method and device for text retrieval on the network. Background technique [0002] Every year more than 10 new data are added to the World Wide Web 18 bytes, and continues to grow exponentially every year. Some existing search engines can no longer adapt to such a growth scale. This scale of growth requires a new architecture that makes it possible to rapidly index and query content information such as HTML, plain text, music, and images. On the other hand, peer-to-peer networks have gained wide acceptance in recent years. Their scalable, fault-tolerant, and adaptive nature has sparked interest in building low-cost search engines on top of peer-to-peer networks. [0003] Although some search techniques based on peer-to-peer networks have been proposed recently, most of them are based on simple keyword matching, without using some more advanced ranking algorithms in the field of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 胡辛遥韩定一俞勇金洪波吕晓雨
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products