Information mining method and apparatus

An information mining technique for query statements, applied in the field of information retrieval. It addresses problems such as low parsing recall, poor parsing quality, and the impossibility of building large vocabularies by hand, and achieves high parsing accuracy and high recall.

Pending Publication Date: 2018-12-18
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] (1) Expressions are diverse. Different users phrase the same question in different ways and have different expression habits, so manual enrichment cannot cover every expression.
[0004] (2) Expressions are colloquial. Users phrase their requests very informally, and manually enriched templates cannot cover such phrasing.
[0005] (3) The vocabulary in each dimension is huge, and a vocabulary of that size cannot be built by hand.
[0006] Because of these characteristics of user expression, manually enriched rules and vocabularies incur high time and labor costs, are inefficient, and parse poorly, which leads to a weak user understanding module and a poor human-computer interaction experience.
In addition, a manually enriched vocabulary cannot reach large-scale, full coverage, so parsing recall is low.
Manually enriched expression templates likewise cannot cover the full range of expressions, including colloquial ones. As a result, parsing recall and accuracy are low, user expressions are not understood, accurate answers cannot be given, and user satisfaction is low.



Embodiment Construction

[0068] In the following, only some exemplary embodiments are briefly described. As those skilled in the art would realize, the described embodiments may be modified in various different ways, all without departing from the spirit or scope of the present invention. Accordingly, the drawings and descriptions are to be regarded as illustrative in nature and not restrictive.

[0069] Figure 1 is a flowchart of an information mining method according to an embodiment of the present invention. As shown in Figure 1, the information mining method may include the following steps:

[0070] Step 101: mine the query statements of each specific category from the search log;

[0071] Step 102: provide seed entities for the specific category;

[0072] Step 103: generate, from the seed entities of the specific category and the query statements, an expression template corresponding to each query statement of that category;

[0073] Step 104, according to each category ...
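
Steps 101 to 103 can be illustrated with a minimal sketch. The source does not say how templates are derived from seed entities, so the code below assumes the simplest approach: a seed entity found in a query is replaced by a slot marker. The function names, the "[CITY]" slot, and the sample weather queries are all hypothetical, and the queries are assumed to have already been assigned to a category in step 101. Step 104 is truncated in the source and is not covered here.

```python
from typing import Dict, List, Optional


def make_template(query: str, seeds: List[str], slot: str) -> Optional[str]:
    """Replace the first occurrence of a seed entity in the query with a slot marker.

    Assumption: simple substring matching; the patent text does not specify this.
    """
    # Try longer seeds first so a longer entity wins over one of its substrings.
    for seed in sorted(seeds, key=len, reverse=True):
        if seed in query:
            return query.replace(seed, slot, 1)
    return None  # no seed entity in this query, so no template is produced


def generate_templates(queries: List[str], seeds: List[str], slot: str) -> Dict[str, str]:
    """Map each query that contains a seed entity to its expression template."""
    templates: Dict[str, str] = {}
    for query in queries:
        template = make_template(query, seeds, slot)
        if template is not None:
            templates[query] = template
    return templates


# Hypothetical queries for a "weather" category (output of step 101).
queries = [
    "what is the weather in beijing",
    "will it rain in shanghai tomorrow",
    "play some music",
]
# Hypothetical seed entities for that category (step 102).
seeds = ["beijing", "shanghai"]

# Step 103: one expression template per matching query.
templates = generate_templates(queries, seeds, "[CITY]")
for query, template in templates.items():
    print(f"{query!r} -> {template!r}")
```

Queries that contain no seed entity, such as "play some music" here, yield no template under this assumption.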


Abstract

Embodiments of the present invention provide an information mining method and apparatus. The method comprises the following steps: mining the query statements of each specific category from a search log; providing seed entities for the specific category; generating, from the seed entities of the specific category and the query statements, an expression template corresponding to each query statement of that category; and extracting high-frequency query statements and high-frequency expression templates from the search log according to the query statements and their corresponding expression templates. Because the users' search log is the data source, the resulting high-frequency statements and expression templates are rich and cover users' varied expression habits, including content such as colloquial expressions that manually enriched templates cannot cover.
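
The high-frequency extraction mentioned in the abstract might look like the following sketch. It assumes that "high-frequency" means "occurs at least a chosen number of times in the log"; the source gives no threshold or ranking rule. The `high_frequency` helper, the `min_count` parameter, and the sample log and template mapping are hypothetical.

```python
from collections import Counter
from typing import Dict, Hashable, Iterable, List, Tuple


def high_frequency(items: Iterable[Hashable], min_count: int) -> List[Tuple[Hashable, int]]:
    """Return (item, count) pairs seen at least min_count times, most frequent first."""
    counts = Counter(items)
    return [(item, n) for item, n in counts.most_common() if n >= min_count]


# Hypothetical search log: one entry per issued query, repeats included.
log = [
    "weather in beijing",
    "weather in beijing",
    "weather in shanghai",
    "is it cold in beijing",
    "weather in beijing",
]

# Hypothetical query-to-template mapping, as produced by the template generation step.
template_of: Dict[str, str] = {
    "weather in beijing": "weather in [CITY]",
    "weather in shanghai": "weather in [CITY]",
    "is it cold in beijing": "is it cold in [CITY]",
}

# Count raw queries and their templates separately; threshold of 2 is an assumption.
hf_queries = high_frequency(log, min_count=2)
hf_templates = high_frequency((template_of[q] for q in log if q in template_of), min_count=2)

print(hf_queries)
print(hf_templates)
```

Counting templates separately from raw queries lets a template rank highly even when each individual query that instantiates it is rare.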

Description

Technical field

[0001] The invention relates to the technical field of information retrieval, and in particular to an information mining method and apparatus.

Background

[0002] In a human-computer interaction system, users express a variety of needs when interacting with a robot. Existing template-based parsing modules need the full set of user query statements (queries) in order to improve the recall and parsing accuracy of user understanding. These user expressions have the following characteristics, which cause many problems when traditional manually enriched rules and vocabularies are used.

[0003] (1) Expressions are diverse. Different users phrase the same question in different ways and have different expression habits, so manual enrichment cannot cover every expression.

[0004] (2) Expressions are colloquial. Users phrase their requests very informally, which cannot be covered by artific...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27, G06F17/30
CPC: G06F40/30
Inventors: 王文敏, 纪友升, 凌光, 徐威
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD