Method and system of data retrieval

A data retrieval and information retrieval technology, applied in electrical digital data processing, special data processing applications, instruments, etc. It addresses problems such as an unreasonable tree structure, uneven distribution of nodes, and an uncontrollable number of child nodes, and improves the accuracy and completeness of retrieval.

Inactive Publication Date: 2010-03-31
HUAWEI TECH CO LTD +1
Cites: 0; Cited by: 50

AI Technical Summary

Problems solved by technology

Existing ontology learning methods produce an unreasonable tree structure in the learned ontology: nodes are distributed extremely unevenly, and the number of child nodes of each node is uncontrollable.
The skew of the tree grows with the number of levels: the more levels, the more severe the skew. Data retrieval based on such a structure has low accuracy and completeness.
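
The skew described above can be made concrete with a small measurement. The following is a minimal sketch, not taken from the patent: the `tree_stats` function, the dictionary layout and the example tree are all hypothetical, chosen only to show how uneven branching and uneven leaf depth might be quantified.

```python
from collections import deque


def tree_stats(children, root):
    """Return (min_branching, max_branching, min_leaf_depth, max_leaf_depth).

    `children` maps a node name to the list of its child names
    (an assumed layout, not the patent's data structure).
    """
    branching = []     # child counts of internal nodes
    leaf_depths = []   # depth of every leaf
    queue = deque([(root, 0)])
    while queue:
        node, depth = queue.popleft()
        kids = children.get(node, [])
        if kids:
            branching.append(len(kids))
            for kid in kids:
                queue.append((kid, depth + 1))
        else:
            leaf_depths.append(depth)
    return (min(branching, default=0), max(branching, default=0),
            min(leaf_depths), max(leaf_depths))


# Hypothetical skewed ontology: one wide level and one long, thin chain.
skewed = {
    "root": ["a", "b", "c", "d", "e", "f"],
    "a": ["a1"],
    "a1": ["a2"],
    "a2": ["a3"],
}
print(tree_stats(skewed, "root"))  # (1, 6, 1, 4)
```

A large gap between the minimum and maximum values in either pair indicates the kind of uneven node distribution that the text identifies as harmful to retrieval.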

Examples

Embodiment 1

[0218] In another aspect, the first embodiment of the system of the present invention, as shown in Figure 8, includes:

[0219] Term acquisition module 1: acquires electronic documents through the network and extracts domain terms from them;

[0220] Similarity calculation module 2: calculates the similarity between the domain terms extracted by the term acquisition module 1;

[0221] Clustering module 3: clusters the similar domain terms determined by the similarity calculation module 2 layer by layer, top-down and with a limited number of branches, and builds an index list;

[0222] Storage module 5: stores the index list;

[0223] Information retrieval module 6: performs information retrieval using the index list.

[0224] The system may also include a merging module 4, which merges domain terms that have the same meaning.

[0225] The merging module 4 may be located after the clustering module 3 or between...
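
The roles of the similarity calculation module and the clustering module can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the excerpt does not say which similarity measure or splitting rule is used, so character-bigram Jaccard similarity and farthest-first seeding stand in for them, and every function name here is hypothetical. The only property taken from the text is that clustering proceeds top-down and no node receives more than a fixed number of branches.

```python
def bigrams(term):
    """Character bigrams of a padded, lower-cased term."""
    padded = f" {term.lower()} "
    return {padded[i:i + 2] for i in range(len(padded) - 1)}


def similarity(a, b):
    """Jaccard similarity of bigram sets (an assumed stand-in measure)."""
    x, y = bigrams(a), bigrams(b)
    return len(x & y) / len(x | y)


def pick_seeds(terms, k):
    """Farthest-first seeds: repeatedly add the term least similar to the seeds so far."""
    seeds = [terms[0]]
    while len(seeds) < min(k, len(terms)):
        rest = [t for t in terms if t not in seeds]
        seeds.append(min(rest, key=lambda t: max(similarity(t, s) for s in seeds)))
    return seeds


def cluster(terms, max_branches):
    """Top-down clustering; every node gets at most `max_branches` children.

    Returns a nested list: a leaf is a term string, a subtree is a list.
    Requires max_branches >= 2.
    """
    terms = list(dict.fromkeys(terms))  # drop duplicates, keep order
    if len(terms) <= max_branches:
        return terms
    seeds = pick_seeds(terms, max_branches)
    groups = {s: [s] for s in seeds}    # each seed anchors its own branch
    for t in terms:
        if t not in groups:
            best = max(seeds, key=lambda s: similarity(t, s))
            groups[best].append(t)
    # Recurse into each branch; single-term branches become leaves.
    return [cluster(g, max_branches) if len(g) > 1 else g[0]
            for g in groups.values()]


terms = ["data retrieval", "data mining", "data index",
         "term extraction", "term clustering",
         "ontology learning", "ontology tree"]
print(cluster(terms, 3))
```

Because each seed starts its own non-empty branch, every recursive call receives strictly fewer terms than its parent, so the recursion terminates and the branching limit holds at every level.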

Abstract

The invention discloses a method and a system of data retrieval, relating to the field of information acquisition and processing, and solves the problem of low accuracy and completeness in data retrieval. The method provided by the embodiment of the invention comprises the following steps: acquiring an electronic document through a network; extracting the domain terms in the electronic document; calculating the similarity among the extracted domain terms; clustering the similar domain terms layer by layer with a limited number of branches; establishing an index list; storing the index list; and carrying out information retrieval with the index list by means of an information retrieval module. The invention is suitable for data acquisition and information retrieval.
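
The last steps of the abstract (establishing an index list and retrieving with it) might look like the sketch below. The layout of the index list is not given in this excerpt, so the `(term, path)` rows, the `build_index` and `related_terms` functions, and the sample tree are all assumptions made for illustration.

```python
def build_index(tree, path=()):
    """Flatten a nested cluster tree into index rows of (term, path).

    A leaf is a term string; a subtree is a list. `path` records the
    child position taken at each level (an assumed index layout).
    """
    rows = []
    for i, child in enumerate(tree):
        child_path = path + (i,)
        if isinstance(child, str):
            rows.append((child, child_path))
        else:
            rows.extend(build_index(child, child_path))
    return rows


def related_terms(index, query):
    """Return the terms stored in the same lowest-level cluster as `query`."""
    paths = dict(index)
    if query not in paths:
        return []
    parent = paths[query][:-1]
    return [t for t, p in index if p[:-1] == parent and t != query]


# Hypothetical clustered terms, e.g. the output of a clustering step.
tree = [["data retrieval", "data index"],
        ["ontology learning", "ontology tree"],
        "term extraction"]
index = build_index(tree)
print(related_terms(index, "data index"))  # ['data retrieval']
```

Storing a path with each term keeps the tree recoverable from a flat list, which is one simple way a stored index could support lookups of related terms.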

Description

Technical field

[0001] The invention relates to the field of information collection and processing, and in particular to a data retrieval method and system.

Background

[0002] In existing applications such as the semantic Web, question answering systems, domain-specific vertical search, information extraction, library management and information retrieval, it is often necessary to extract data or words considered useful from a database and, based on the relationships between these data or words, build a corresponding tree-structured list index so that users can search for related information. An ontology is an explicit specification of a shared conceptualization of a domain of interest. In layman's terms, an ontology describes the concepts in a certain field, or even a wider range, and the relationships between them, so that these concepts and relationships have a common, clear and unique definition within the scope of sharing. The method of building ontology automa...

Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F17/30
Inventor: 徐惠, 高志强, 戴昌林, 朱望斌, 陈世宏
Owner: HUAWEI TECH CO LTD