HSK composition generation method based on a topic model

Topic-model and composition-generation technology, classified under instruments, computing, and electrical digital data processing. It addresses the many typos, poor coherence and logic, and frequent grammatical errors of automatically generated text, yielding compositions with fewer typos, fewer grammatical errors, and better overall quality.

Inactive Publication Date: 2019-02-22
BEIJING INFORMATION SCI & TECH UNIV

AI Technical Summary

Problems solved by technology

However, the automatic generation of text in the existing technology is not effective in terms of cohe...


Examples


Detailed Description of the Embodiments

[0064] To make the objectives, technical solutions, and advantages of the present invention clearer, the invention is further described below with reference to the accompanying drawings and specific embodiments. The specific embodiments described here serve only to explain the present invention, not to limit it. All other embodiments that persons of ordinary skill in the art obtain from these embodiments without creative effort fall within the protection scope of the present invention.

[0065] As shown in Figure 1, a topic-model-based HSK composition generation method includes the following steps: select a training data set and train an LDA topic model; obtain the sentence-text and word-text distributions; calculate the cross entropy to select the sentences closest to the topic keywords; and generate the text.
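The sentence-selection step can be illustrated with a minimal sketch. It assumes that a trained LDA model has already produced a topic distribution for the keywords and for each candidate sentence. The toy distributions, the function names `cross_entropy` and `rank_sentences`, the direction of the cross entropy (keywords as the reference distribution), and the `eps` smoothing are all assumptions made for illustration; the source does not specify them.

```python
import math


def cross_entropy(p, q, eps=1e-12):
    """Cross entropy H(p, q) = -sum_k p_k * log(q_k).

    p is the reference distribution (here: the topic keywords),
    q is the candidate distribution (here: one sentence).
    eps is an assumed guard against log(0).
    """
    if len(p) != len(q):
        raise ValueError("distributions must cover the same topics")
    return -sum(pk * math.log(max(qk, eps)) for pk, qk in zip(p, q))


def rank_sentences(keyword_dist, sentence_dists):
    """Sort candidate sentences by ascending cross entropy against the keywords."""
    scored = [
        (cross_entropy(keyword_dist, dist), sentence)
        for sentence, dist in sentence_dists.items()
    ]
    scored.sort(key=lambda pair: pair[0])
    return [sentence for _, sentence in scored]


# Hypothetical topic distributions over three topics; a real system
# would take these from the trained LDA model.
keyword_dist = [0.8, 0.1, 0.1]
sentence_dists = {
    "sentence_a": [0.7, 0.2, 0.1],
    "sentence_b": [0.1, 0.8, 0.1],
    "sentence_c": [0.3, 0.3, 0.4],
}

ranked = rank_sentences(keyword_dist, sentence_dists)
print(ranked)
```

In this sketch a lower cross entropy means a sentence's topic mix is closer to that of the keywords, so the first entries of `ranked` would be the candidates passed on to text generation.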

[0066] The first writing task of HSK5 is to write a short...


Abstract

The invention relates to an HSK composition generation method based on a topic model, comprising the following steps: training an LDA model to obtain the sentence-text and word-text distributions, calculating the cross entropy, selecting the sentences closest to the topic keywords, and then generating a text. By training the LDA topic model, the method obtains the sentence-text and word-text distributions, selects the sentences closest to the topic keywords by calculating the cross entropy, and then generates the text. The automatically generated texts have good coherence and logic, with fewer grammatical errors and fewer wrongly written characters, so they fulfill the writing task well and meet the needs of practical application.

Description

Technical field

[0001] The invention belongs to the technical field of text information processing, and in particular relates to an HSK composition generation method based on a topic model.

Background

[0002] In an era of rapid development of the IT industry and the Internet, people dream of making natural language computable, so that hidden information and knowledge can be discovered in large-scale unstructured text. Artificial intelligence (AI) technology is growing rapidly. In 1997, Deep Blue, developed by IBM, defeated world chess champion Garry Kasparov. In March 2016, AlphaGo, which uses a Monte Carlo tree search algorithm, defeated Lee Sedol, a major milestone in artificial intelligence research.

[0003] On the other hand, the combination of AI and big data has brought unprecedented development to natural language processing technology. Because the working principle of artificial intelligence robots is logical reasoning based o...


Application Information

IPC(8): G06F17/24, G06F17/27
CPC: G06F40/166, G06F40/205
Inventors: 吕学强, 游新冬, 董志安
Owner: BEIJING INFORMATION SCI & TECH UNIV