A method and system for document retrieval based on subject database

A document retrieval and database technology, applied in the field of data processing, that addresses the low accuracy of document retrieval and achieves the effect of avoiding deterioration and improving accuracy.

Active Publication Date: 2021-09-03
NORTH CHINA ELECTRIC POWER UNIV (BAODING)

AI Technical Summary

Problems solved by technology

[0005] In view of this, the purpose of this application is to provide a method and system for document retrieval based on a subject database, so as to solve the technical problem in the prior art of low accuracy when retrieving documents across different languages.

Embodiment Construction

[0060] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0061] Referring to Figure 1, an implementation flowchart of the subject database-based document retrieval method provided in Embodiment 1 of the present application is shown. The method is applicable to cross-language document retrieval, for example, retrieving documents in a second language through keywords in a first language. It should be noted that the documents involved in this embodiment refer to all carriers that record knowledge, such as books, periodic...

Abstract

The application discloses a method and system for document retrieval based on a subject database. The method includes: obtaining at least one keyword, in a first language, of a document to be retrieved; searching the subject database for the thesaurus that belongs to the same subject category as the document; in the thesaurus, calculating the similarity between the keywords to be retrieved and each first-language subject word group, and obtaining the target first-language subject word group with the largest similarity; in the thesaurus, obtaining the target second-language subject word group associated with the target first-language subject word group, the document storage information corresponding to the target second-language subject word group, and the probability that each second-language document corresponding to the target second-language subject word group belongs to that word group; and obtaining the target second-language documents according to these probabilities and the document storage information.
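The steps in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `SubjectWordGroup` structure, the `retrieve` function, the sample data, and the file paths are all hypothetical, the subject category is passed in directly rather than looked up, and Jaccard overlap stands in for the similarity measure, which the abstract does not specify.

```python
from dataclasses import dataclass


@dataclass
class SubjectWordGroup:
    """Hypothetical thesaurus entry linking word groups in two languages."""
    first_lang_words: frozenset   # subject word group in the first language
    second_lang_words: frozenset  # associated group in the second language
    doc_probs: dict               # doc id -> probability of belonging to the group
    doc_storage: dict             # doc id -> document storage information


def jaccard(a, b):
    """Assumed similarity measure: set overlap divided by set union."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def retrieve(keywords, subject, subject_db, top_k=3):
    """Return up to top_k (doc id, storage info, probability) tuples."""
    # Step 2: pick the thesaurus for the document's subject category.
    thesaurus = subject_db[subject]
    # Step 3: find the first-language word group most similar to the keywords.
    target = max(thesaurus, key=lambda g: jaccard(keywords, g.first_lang_words))
    # Steps 4-5: rank the second-language documents of the associated group
    # by membership probability and attach their storage information.
    ranked = sorted(target.doc_probs.items(), key=lambda kv: kv[1], reverse=True)
    return [(doc_id, target.doc_storage[doc_id], p) for doc_id, p in ranked[:top_k]]


# Hypothetical subject database with one subject category and two word groups.
subject_db = {
    "power_systems": [
        SubjectWordGroup(
            first_lang_words=frozenset({"transformer", "winding", "insulation"}),
            second_lang_words=frozenset({"变压器", "绕组", "绝缘"}),
            doc_probs={"doc-001": 0.82, "doc-002": 0.35},
            doc_storage={"doc-001": "/docs/zh/001.pdf", "doc-002": "/docs/zh/002.pdf"},
        ),
        SubjectWordGroup(
            first_lang_words=frozenset({"grid", "load", "dispatch"}),
            second_lang_words=frozenset({"电网", "负荷", "调度"}),
            doc_probs={"doc-003": 0.91},
            doc_storage={"doc-003": "/docs/zh/003.pdf"},
        ),
    ]
}

results = retrieve(["transformer", "insulation"], "power_systems", subject_db, top_k=1)
print(results)
```

Because the query keywords are matched against first-language word groups rather than being translated, this sketch never translates the query itself; the second-language documents are reached only through the stored association between word groups.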

Description

Technical field

[0001] This application relates to the technical field of data processing, in particular to a method and system for document retrieval based on a subject database.

Background

[0002] With the process of globalization, more and more foreign researchers want to understand China. However, because the Chinese language is extremely complex, only the few researchers who have mastered Chinese after years of language learning can accurately understand the basic concepts and connotations of Chinese literature; other researchers find it difficult to accurately understand the semantics of Chinese literature. Because of the huge amount of document data, foreign researchers need to spend a lot of time on translation and filtering to retrieve the Chinese documents they need.

[0003] For this reason, existing schemes translate the English text into Chinese and then search with the translated Chinese to obtain Chinese documents.

[0004] However, due to the ambiguity of translation in this scheme,...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/33, G06F40/216, G06F16/35
CPC: G06F16/3334, G06F16/3346
Inventor: 王建红
Owner: NORTH CHINA ELECTRIC POWER UNIV (BAODING)