Document Boltzmann machine construction optimization method and device for document query

Boltzmann machine technology applied in the field of information retrieval, with the effect of improving retrieval accuracy and obtaining a more effective query likelihood

Pending Publication Date: 2020-05-19
CHINA PETROLEUM & CHEM CORP +1

AI Technical Summary

Problems solved by technology

Existing probabilistic language models for document query rely on the multinomial distribution assumption; no Boltzmann machine based model has yet been applied to document query



Examples


Embodiment Construction

[0048] In order to make the purpose, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0049] Figure 1 shows a flow chart of a document Boltzmann machine construction and optimization method for document query according to an embodiment of the present invention.

[0050] As shown in Figure 1, in step S101 the selected document is sampled to obtain multiple groups of text fragments, and the obtained groups of text fragments are gathered into a sample set. In one embodiment, a sliding window is used for the sampling so that the resulting text fragments overlap, wherein the size of the sliding window is a first preset value and the step size of the sliding window is a second preset value.
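
The sliding-window sampling of step S101 can be sketched as follows. This is a minimal illustration only, not the patent's implementation: the function name `sample_fragments`, the whitespace tokenization, and the values chosen for the two preset values (`window_size=4`, `step=2`) are all assumptions made for the example.

```python
from typing import List


def sample_fragments(tokens: List[str], window_size: int, step: int) -> List[List[str]]:
    """Slide a fixed-size window over the tokens and collect every window.

    window_size plays the role of the first preset value and step the role
    of the second preset value (both names are hypothetical).
    """
    if window_size <= 0 or step <= 0:
        raise ValueError("window_size and step must be positive")
    if len(tokens) <= window_size:
        # A document shorter than the window yields at most one fragment.
        return [list(tokens)] if tokens else []
    last_start = len(tokens) - window_size
    return [tokens[i:i + window_size] for i in range(0, last_start + 1, step)]


# Hypothetical document, tokenized on whitespace for simplicity.
document = "the boltzmann machine captures dependencies between terms in a document".split()
sample_set = sample_fragments(document, window_size=4, step=2)

for fragment in sample_set:
    print(" ".join(fragment))
```

Consecutive fragments overlap whenever the step is smaller than the window size, which is what lets neighbouring terms appear together in several samples.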

[0051] Then, in step S102, model learning processing is performed according to the sample set to obtain a document Boltzmann m...


Abstract

The invention provides a document Boltzmann machine construction and optimization method for document query, which comprises the following steps: sampling a selected document to obtain a plurality of groups of text fragments, and gathering the obtained groups of text fragments into a sample set; performing model learning on the sample set to obtain a document Boltzmann machine model corresponding to the selected document; and optimizing the generated document Boltzmann machine model by means of the Bayesian information criterion to obtain an optimized document Boltzmann machine model. With this method, the Boltzmann machine is applied to the field of document query, the dependency relationships between lexical items are captured naturally, and the distribution hypothesis used by traditional language models is generalized. A more effective query likelihood can be obtained, and retrieval accuracy is improved.
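
As a rough illustration of model selection with the Bayesian information criterion, the sketch below scores candidate models with the standard formula BIC = k ln(n) - 2 ln(L) and keeps the lowest score. The abstract does not state which parts of the document Boltzmann machine are pruned or compared, so the candidate names, log-likelihoods, and parameter counts here are invented purely for the example.

```python
import math
from typing import Dict, Tuple


def bic(log_likelihood: float, num_params: int, num_samples: int) -> float:
    """Standard BIC: k * ln(n) - 2 * ln(L). Lower is better."""
    return num_params * math.log(num_samples) - 2.0 * log_likelihood


def select_by_bic(candidates: Dict[str, Tuple[float, int]], num_samples: int) -> str:
    """Return the name of the candidate with the lowest BIC.

    candidates maps a model name to (log-likelihood, number of free parameters).
    """
    return min(
        candidates,
        key=lambda name: bic(candidates[name][0], candidates[name][1], num_samples),
    )


# Hypothetical candidates: denser models fit slightly better but pay a
# larger complexity penalty.
candidates = {
    "sparse": (-1250.0, 40),
    "medium": (-1200.0, 120),
    "dense": (-1190.0, 400),
}
best = select_by_bic(candidates, num_samples=500)
print(best)
```

The penalty term grows with the number of parameters, so a criterion of this kind favours a smaller model unless the extra connections improve the likelihood enough to pay for themselves.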

Description

Technical field

[0001] The invention relates to the field of information retrieval, in particular to a document Boltzmann machine construction and optimization method and device for document query.

Background

[0002] In recent years, with the rapid development of Internet technology, the information on the Internet has grown exponentially, and the information resources on the Internet have been greatly enriched. However, it is becoming increasingly difficult to filter out the information that users need from this mass of information. This concerns not only the speed of retrieval, but also the accuracy and effectiveness of the retrieval results, and whether they truly meet users' needs.

[0003] In the field of information retrieval, probabilistic language models have been widely used. The language model estimates the document model under the multinomial distribution assumption, and then uses the query likelihood to rank documents by relevance, wh...
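
The traditional approach described in [0003] can be sketched as a unigram query-likelihood ranker. This is a minimal sketch under stated assumptions: additive smoothing with a hypothetical parameter `alpha` is used only to avoid zero probabilities (the source does not name a smoothing method), and the two example documents are invented.

```python
import math
from collections import Counter
from typing import Dict, List


def query_log_likelihood(query: List[str], doc: List[str],
                         vocab_size: int, alpha: float = 1.0) -> float:
    """log P(query | doc) under a unigram (multinomial) document model.

    Query terms are treated as independent draws from the document's
    smoothed term distribution.
    """
    counts = Counter(doc)
    denom = len(doc) + alpha * vocab_size
    return sum(math.log((counts[term] + alpha) / denom) for term in query)


def rank(query: List[str], docs: Dict[str, List[str]]) -> List[str]:
    """Order document names by descending query log-likelihood."""
    vocab = {term for doc in docs.values() for term in doc} | set(query)
    scores = {
        name: query_log_likelihood(query, doc, len(vocab))
        for name, doc in docs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical two-document collection.
docs = {
    "d1": "boltzmann machine for document query".split(),
    "d2": "sliding window sampling of text fragments".split(),
}
ranking = rank(["document", "query"], docs)
print(ranking)
```

Because each query term is scored independently, a model of this kind cannot represent dependencies between terms, which is the limitation the abstract says the document Boltzmann machine is meant to address.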


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/332
Inventors: 黄历铭, 李昌盛, 杨传书, 何江
Owner: CHINA PETROLEUM & CHEM CORP