Topic word-based language generation method

A topic word-based language generation technique, applied in natural language translation, special-purpose data processing applications, and related fields. It addresses problems such as the difficulty of deciding which text changes count as attribute features, the conservative character of generated online comments, and generated language that conveys no information, and it achieves more diverse and more fluent text.

Active Publication Date: 2017-09-05
RENMIN UNIVERSITY OF CHINA
Cites: 8; Cited by: 34

AI Technical Summary

Problems solved by technology

[0010] (1) Template-based generation is simple and feasible, but the generated text is incomplete and of low quality.
[0011] (2) Pattern-based generation gives the generated text a hierarchical structure, but the method only suits articles with a fixed structure and lacks flexibility.
However, because the semantic and grammatical relationships between sentences are complex, it is not easy to construct a text rule base.
[0013] (4) The attribute feature-based method is conceptually simple and generates relatively flexible text, but the content relationships between attributes are complicated and the workload is heavy; that is, it is difficult to determine which kinds of text changes can be added to the collection as attribute features.
[0029] Although the attention-based Seq2Seq model can generate better text, the model usually generates by imitating the language of the training set, and the training text contains many generic phrases such as "very good" and "unclear". As a result, the generated online reviews tend to be "conservative" and lack diversity: the model produces one-size-fits-all language that carries little information, so the generated language fails to reflect any information.


Embodiment Construction

[0035] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0036] The method of the invention involves technologies such as intelligent analysis and language generation. It can be used to generate online comments automatically, and it makes the generated text more fluent and more diverse.

[0037] The present invention proposes a topic word-based language generation model (T-Seq2Seq) built on an attention model.

[0038] Th...

Abstract

The invention provides a topic word-based language generation method. A traditional Seq2Seq model is trained on the context and the topic words (descriptors). In the encoding part of the model, intermediate-layer information ci is calculated from the hidden-layer information of the input Xi; the word vectors of the topic words are passed through an attention mechanism to generate intermediate-layer information oi; and a structure combined with the attention mechanism lets ci and oi jointly influence generation, so that the final sequence relates to both the context and the topic words. The method makes the generated text more fluent and more diverse, brings convenience to users, and helps guide potential users in their purchase decisions.
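
The joint use of ci and oi described in the abstract can be illustrated with a minimal sketch of one decoding step. The abstract does not give the scoring function or the way the two vectors are merged, so everything specific here is an assumption made for illustration: dot-product attention, a previous decoder state `s_prev` as the query, concatenation of `s_prev`, `c_i` and `o_i`, and a hypothetical output matrix `w_out`. It should not be read as the patented T-Seq2Seq structure itself.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def attend(query, keys):
    """Dot-product attention (an assumed scoring function).

    Returns the attention-weighted sum of `keys` and the weights themselves.
    """
    weights = softmax([dot(k, query) for k in keys])
    dim = len(keys[0])
    summary = [sum(w * k[j] for w, k in zip(weights, keys)) for j in range(dim)]
    return summary, weights


def joint_step(s_prev, enc_hidden, topic_vecs, w_out):
    """One decoding step influenced by both context and topic words."""
    # c_i: context information computed from the encoder hidden states.
    c_i, alpha = attend(s_prev, enc_hidden)
    # o_i: topic information computed from the topic-word (descriptor) vectors.
    o_i, beta = attend(s_prev, topic_vecs)
    # Assumption: merge by concatenation, then project to vocabulary scores.
    features = s_prev + c_i + o_i
    probs = softmax([dot(row, features) for row in w_out])
    return probs, alpha, beta


def rand_matrix(rows, cols):
    return [[random.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]


random.seed(0)
d, T, K, V = 4, 5, 3, 10  # hidden size, input length, topic words, vocabulary
enc_hidden = rand_matrix(T, d)   # stand-in for encoder hidden states
topic_vecs = rand_matrix(K, d)   # stand-in for topic-word vectors
s_prev = [random.gauss(0, 1) for _ in range(d)]
w_out = rand_matrix(V, 3 * d)    # hypothetical output projection

probs, alpha, beta = joint_step(s_prev, enc_hidden, topic_vecs, w_out)
print(len(probs), len(alpha), len(beta))
```

In this sketch `alpha` shows how strongly each input position contributes to `c_i`, and `beta` shows how strongly each topic word contributes to `o_i`; both feed the same output distribution, which is one plausible reading of "jointly influence".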

Description

Technical field

[0001] The invention relates to a language generation method, in particular to a method for topic word-based language generation using deep learning and an attention mechanism.

Background technique

[0002] With the development of Internet technology, online user reviews have a great impact on e-commerce and consumers. Studies have shown that most consumers collect product- and service-related information online before making a purchase decision, and share their consumption experience and purchase evaluation online after the purchase. In addition, a large number of Internet users read user reviews before purchasing products or services and are influenced by the content of those reviews. Massive online user reviews are therefore an important source of information that helps consumers discover product quality and make corresponding purchase decisions. However, because the review process is cumbersome at this stage, users are unwilling to spend more time evaluating ...

Application Information

IPC(8): G06F17/28
CPC: G06F40/56
Inventor: 赵鑫, 窦洪健, 文继荣
Owner: RENMIN UNIVERSITY OF CHINA