Theme digging method and equipment and query expansion method and equipment

A technology for mining equipment and topics, applied in the field of user query topics, can solve the problems of not knowing how to use, complex operations, etc., and achieve the effect of improving the recall rate

Active Publication Date: 2015-01-21
CANON KK
View PDF7 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These powerful devices greatly facilitate professional users, but often cause trouble for inexperienced users, because powerful devices often

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Theme digging method and equipment and query expansion method and equipment
  • Theme digging method and equipment and query expansion method and equipment
  • Theme digging method and equipment and query expansion method and equipment

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0046] image 3 is a flowchart showing the topic mining method according to the first embodiment of the present invention.

[0047] Such as image 3 As shown, in the obtaining step 310, a plurality of sentences are obtained from at least one knowledge base. Sentences can be obtained from the knowledge base in any manner known in the art for subsequent processing.

[0048] For example, in the case where the topic mining method is applied to the query expansion method, a query in a natural language form input by a user may be firstly received as an input, and then sentences matching the query may be retrieved in the at least one knowledge base. The retrieval method may be any method known in the art, such as a full-text retrieval method, a named entity recognition (Named Entity Recognition, NER) method or a relation extraction (Relation Extraction, RE) method.

[0049] In another embodiment, the acquiring step 310 may include: receiving the at least one knowledge base as inpu...

no. 2 example

[0083] Image 6 is a flowchart illustrating a topic mining method according to a second embodiment of the present invention.

[0084] As mentioned in the Summary of the Invention, the inventors of the present application found that in addition to the subject of user-visible distinguishing objects, there is another type of implicit subject with a large number, that is, the subject of premise assertion pairs.

[0085] Therefore, in order to further improve the recall rate of the topic, the premise assertion pair topic can be further mined on the basis of the first embodiment. That is to say, the difference between the second embodiment and the first embodiment is that, in addition to mining the subject of the user-visible distinctive object, the subject of the premise assertion is also mined. By combining user-visible discriminative object topics and premise assertion-to-topics, the recall rate of topics can be further improved, so that users can be provided with desired inform...

no. 3 example

[0158] Figure 8 is a flowchart illustrating a topic mining method according to a third embodiment of the present invention.

[0159] The difference between the third embodiment and the first embodiment and the second embodiment is that, in addition to mining user-visible distinguishing object topics (optionally, there are also premise assertion pairs), language-dependent topics are also mined . By combining linguistically dependent topics with user-visible distinguishing object topics, or combining language-dependent topics with user-visible distinguishing object topics and premise assertion-pair topics, the recall rate of topics can be further improved, which can further effectively serve users. Provide the desired information.

[0160] Figure 8 Steps 310-350 for generating user-visible distinguishing object themes and optional steps 620-660 for generating premise assertion-pair themes in are the same as in the second embodiment Image 6 The corresponding steps in are t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a theme digging method and equipment and a query expansion method and equipment. The theme digging method includes the step of acquisition, wherein a plurality of sentences are acquired from at least one knowledge base; the step of recognition, wherein for each sentence in the acquired multiple sentences, entities correlated with the sentences are recognized, and the entities indicate physical objects or physical object attributes; the step of generation, wherein one entity is extracted from the entities correlated with all the sentences respectively to generate one or more entity groups; the step of selection, wherein the entity group with the largest difference degree is selected from the one or more entity groups; the step of user visible distinctive object theme output, wherein user visible distinctive object themes corresponding to the acquired multiple sentences are output, and each user visible distinctive object theme is represented by one sentence in the acquired multiple sentences and the corresponding entity of the sentence in the selected entity group. By means of the theme digging method and equipment and the query expansion method and equipment, the hidden user query theme can be dug and thus the recall rate is increased.

Description

technical field [0001] The present invention relates to text mining technology, in particular to a topic mining method for mining hidden user query topics from larger text databases, that is, to dig out some user query topics that have no direct text records but may be used as answers to user queries. Background technique [0002] The functions of electromechanical devices used today are becoming more and more, and these electromechanical devices can usually support many individual functions. Take the multifunction printer (MFP), for example, which combines copying, printing, scanning, faxing, and remote operation functions in order to meet the needs of most people. These powerful devices greatly facilitate professional users, but often cause trouble for inexperienced users, because powerful devices often bring complicated operations, so that users do not know how to use the device or cannot find out. necessary information to operate the device. [0003] In view of this si...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/3325G06F2216/03
Inventor 张碧川黄耀海李荣军刘鹏
Owner CANON KK
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products