Short text template mining method and device, electronic equipment and readable storage medium

A short text and template technology, applied in the computer field, can solve problems such as insufficient statistics, no consideration of synonyms, and limited coverage of templates, and achieve the effects of reducing labor costs, improving accuracy, and improving accuracy and ease of use

Active Publication Date: 2018-10-09
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF10 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] On the one hand, the solutions in the prior art do not consider the situation of synonyms, which leads to a very limited coverage of templates. At the same time, when the corpus is small, due to insufficient statistics, templates cannot be generated; on the other hand, the grammatical structure is diverse, and different word orders The templates may represent the same meaning, and the solutions of the prior art cannot accurately identify such cases

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short text template mining method and device, electronic equipment and readable storage medium
  • Short text template mining method and device, electronic equipment and readable storage medium
  • Short text template mining method and device, electronic equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0035] The technical solution of the embodiment of the present invention clusters keywords with similar meanings to form keyword clusters through word meaning clustering, and then selects the optimal arrangement of the keyword clusters as a short text template, thereby solving the problem that the prior art cannot cover synonyms judgment The problem. In addition, the tech...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a short text template mining method and device, electronic equipment and a readable storage medium. Through the method, problems concerning synonyms and a wordorder in a template can be effectively solved, and an accurate short text template easy to use can be generated. The method comprises the steps that keywords are extracted from a problem text to forma segmented word sequence; the keywords are clustered according to meanings to obtain keyword clusters; the keywords in the segmented word sequence are replaced with the keyword clusters containing the keywords to obtain a word cluster sequence; and the word cluster sequence in an optimal arrangement mode is selected to serve as the short text template.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a short text template mining method, device, electronic equipment and readable storage medium. Background technique [0002] In the field of natural language processing, whether it is clustering models, classification models, search rank algorithms, etc., there are generally weak feature expression capabilities and insufficient information content. The features here often refer to the word features in the text; due to the popularity of big data, not all words can correspond to enough samples. [0003] In the existing technology, the most common way to solve the above problems is to perform feature mining and expand the existing features; the mainstream idea is to mine frequent compound words to obtain combined features to improve the expressiveness of features for text. For example, convert the text into an ordered word set, use the Fp-Growth algorithm to compress data records...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F40/186G06F40/247
Inventor 李开宇
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products