Automatic template mining system and method based on cross support degree evaluation

An automatic template mining technology based on cross-support evaluation, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses problems such as low confidence, the difficulty of finding a suitable confidence threshold, and the uneven distribution of itemset support, and it yields higher-quality results.

Active Publication Date: 2020-08-18
SOUTH CHINA UNIV OF TECH
2 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

During automatic template mining, the records that have passed intent recognition show an uneven distribution of itemset support: category words occur far more frequently than other words.
Traditional confidence considers only the influence of A on B and ignores how often B itself occurs. If the confidence threshold is set unreasonably low, a record is retained even when the two items A and B are in fact independent.
Because the support distribution of these records is unbalanced, mining quality depends on the confidence setting, and the most suitable confidence threshold is often hard to find.
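
The weakness of confidence described above can be illustrated with a small sketch. The patent's own formula for cross support is not shown in this excerpt, so the code assumes the cross-support ratio and h-confidence definitions commonly used in association analysis; the toy records and the `<food>` placeholder are invented for illustration.

```python
from typing import Iterable, List, Set


def support(records: List[Set[str]], items: Iterable[str]) -> float:
    """Fraction of records that contain every item in `items`."""
    wanted = set(items)
    return sum(wanted <= r for r in records) / len(records)


def confidence(records: List[Set[str]], a: str, b: str) -> float:
    """conf(A -> B) = supp({A, B}) / supp({A}); ignores how common B is."""
    return support(records, [a, b]) / support(records, [a])


def cross_support_ratio(records: List[Set[str]], items: Iterable[str]) -> float:
    """Smallest single-item support divided by the largest (assumed definition)."""
    supports = [support(records, [i]) for i in items]
    return min(supports) / max(supports)


def h_confidence(records: List[Set[str]], items: Iterable[str]) -> float:
    """supp(itemset) divided by the largest single-item support (assumed definition)."""
    items = list(items)
    return support(records, items) / max(support(records, [i]) for i in items)


# Toy data: the category word "<food>" is in every record, "cheap" in only one.
records = [{"<food>", "price"} for _ in range(9)] + [{"<food>", "cheap"}]

conf = confidence(records, "cheap", "<food>")
ratio = cross_support_ratio(records, ["cheap", "<food>"])
hconf = h_confidence(records, ["cheap", "<food>"])

print(conf)   # 1.0
print(ratio)  # 0.1
print(hconf)  # 0.1
```

Here the rule cheap -> `<food>` reaches the maximum confidence of 1.0 only because `<food>` appears in every record, so the two items are independent. The two support-based measures stay low, which is the kind of signal a cross-support evaluation could use to filter such itemsets.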

Examples

Embodiment

[0044] In this embodiment, template mining for the food category is taken as an example.

[0045] The automatic template mining system based on cross-support evaluation includes an intent recognition module, a category word replacement module, a frequent itemset mining module, and a template sorting module;

[0046] The intent recognition module performs intent recognition on the user's historical records and sends the recognized records to the category word replacement module;

[0047] In the intent recognition module, relevant records are used to train the intent recognition model. The relevant records are the user's search records, and the intent recognition model includes a fastText model. The trained intent recognition model is then used to perform intent recognition on the historical search records;

[0048] Training the intent model means inputting data with category labels; the output of the model is the corresponding category label, such as input: 'how much is the hotel', the labe...
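
As a rough illustration of labelled training data for a fastText classifier, the sketch below formats (text, label) pairs in the `__label__` convention that fastText's supervised mode reads. The helper name `to_fasttext_lines`, the sample queries, and the labels are all invented for this example and are not taken from the patent.

```python
from typing import Iterable, List, Tuple


def to_fasttext_lines(samples: Iterable[Tuple[str, str]]) -> List[str]:
    """Format (text, label) pairs as '__label__<label> <text>' lines."""
    lines = []
    for text, label in samples:
        clean = " ".join(text.split())  # collapse newlines and repeated spaces
        lines.append(f"__label__{label} {clean}")
    return lines


# Hypothetical labelled search records (invented for illustration).
samples = [
    ("cheap sushi  delivery", "food"),
    ("flights to tokyo", "travel"),
]
lines = to_fasttext_lines(samples)
print("\n".join(lines))
# __label__food cheap sushi delivery
# __label__travel flights to tokyo

# With the fasttext package installed and the lines written to a file,
# training and prediction could look like this (not executed here):
#   import fasttext
#   model = fasttext.train_supervised(input="train.txt")
#   labels, probs = model.predict("cheap sushi delivery")
```

Each historical search record would then be passed through the trained model, and only records predicted to belong to the target category would move on to category word replacement.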

Abstract

The invention discloses an automatic template mining system and method based on cross support degree evaluation. The system comprises an intention recognition module, a category word replacement module, a frequent item set mining module and a template sorting module. The intention recognition module performs intention recognition on the historical records of a user and sends the recognized records to the category word replacement module. The category word replacement module performs word segmentation on the recognized records, replaces category words, and sends the resulting records to the frequent item set mining module. The frequent item set mining module mines the records after category word replacement with an association rule mining algorithm and screens the frequent items to obtain preliminary templates. The template sorting module sorts the preliminary templates according to their entropy and their similarity to the existing word list. The method improves on frequent item set mining, and evaluation based on the cross support degree yields higher quality than evaluation based on confidence.
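
The replacement and mining steps in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented method: the lexicon, the whitespace tokenizer standing in for word segmentation, the brute-force support counting, and the 0.75 support threshold are all invented here, and the cross-support screening and template sorting steps are omitted.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, FrozenSet, List

# Hypothetical category lexicon (assumption for illustration).
LEXICON = {"pizza": "<food>", "sushi": "<food>", "noodles": "<food>"}


def replace_category_words(tokens: List[str]) -> List[str]:
    """Swap every token found in the lexicon for its category placeholder."""
    return [LEXICON.get(t, t) for t in tokens]


def frequent_itemsets(
    records: List[List[str]], min_support: float, max_size: int = 4
) -> Dict[FrozenSet[str], float]:
    """Brute-force support counting for itemsets of up to `max_size` items."""
    counts: Counter = Counter()
    for rec in records:
        items = sorted(set(rec))
        for k in range(1, max_size + 1):
            for combo in combinations(items, k):
                counts[frozenset(combo)] += 1
    n = len(records)
    return {s: c / n for s, c in counts.items() if c / n >= min_support}


queries = [
    "how much is pizza",
    "how much is sushi",
    "how much is noodles",
    "sushi near me",
]
# Whitespace split stands in for a real word segmenter.
records = [replace_category_words(q.split()) for q in queries]
freq = frequent_itemsets(records, min_support=0.75)

# The largest frequent itemset serves as a preliminary template candidate.
template = max(freq, key=len)
print(sorted(template))  # ['<food>', 'how', 'is', 'much']
```

Itemsets discard word order, so a real system would still need to recover the surface form of the template (for example "how much is `<food>`") from the original records.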

Description

Technical field

[0001] The invention relates to the field of automatic mining of search templates, in particular to an automatic template mining system and method based on cross-support evaluation.

Background technique

[0002] In vertical search, when the user's search keywords match the rule words in the database, the relevant data in the database is returned. In practical applications, users search for a wide variety of keywords, and it is difficult to configure all the matching words manually. As the number of search types grows, manual configuration becomes unrealistic. It is therefore necessary to design an algorithm that automatically mines the search-word templates that users frequently use. Current research mainly mines search templates from the user's historical data. A typical representative is Baidu's search technology patent "Automatic Mining Method for Requirement Recognition Template, Requirement Recognition Method and Corresponding D...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/9535
CPC: G06F16/9535, Y02D10/00
Inventors: 何立华, 贺小勇
Owner: SOUTH CHINA UNIV OF TECH