Unlock instant, AI-driven research and patent intelligence for your innovation.

Ancient poem automatic generation method based on pre-training model

A technology for automatic generation of ancient poems, applied in neural learning methods, biological neural network models, natural language data processing, etc., can solve problems such as poor model generation effects, achieve lower threshold, improve generation quality, and improve quality

Pending Publication Date: 2021-01-29
沈阳雅译网络技术有限公司
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the disadvantages of the automatic generation system of archaic poems in the prior art that there is not enough bilingual corpus and the generation effect of the model is not good, the technical problem to be solved by the present invention is to provide a method for automatically generating archaic poems based on pre-training, which can be used in limited training of ancient poems. Under the data, make full use of large-scale monolingual corpus pre-training language model tasks, and then transfer the information extracted by the pre-training model to the ancient poetry generation model, which significantly improves the generation quality of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Ancient poem automatic generation method based on pre-training model
  • Ancient poem automatic generation method based on pre-training model
  • Ancient poem automatic generation method based on pre-training model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The present invention will be further elaborated below in conjunction with the accompanying drawings of the description.

[0041] The present invention provides a kind of method based on pre-training ancient style poem automatic generation, comprises the following steps:

[0042] 1) Collect a large amount of monolingual corpus, including existing ancient poems and classical Chinese, and then perform word segmentation data preprocessing, use monolingual corpus to train the language model to obtain the pre-training model;

[0043] 2) Utilize the collected existing ancient style poems to construct training corpus, obtain the training data and keywords of ancient style poems;

[0044] 3) Use the keywords extracted in step 2) to train a language model as a keyword expansion module (such as figure 1 shown);

[0045] 4) Fine-tuning is carried out on the pre-trained model trained in step 1) using the ancient poetry training corpus;

[0046] 5) The keyword input by the user i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an ancient poem automatic generation method based on pre-training. The method comprises the steps of 1) collecting massive monolingual corpora including existing ancient poemsand classical Chinese, performing word segmentation data preprocessing, and training a language model by using the monolingual corpora to obtain a pre-training model; 2) constructing a training corpusby utilizing the existing ancient poem data to obtain training data of the ancient poem; 3) training a language model as a keyword extension module by utilizing keywords; 4) performing fine adjustment on the pre-training model trained in the step 1) by utilizing the constructed ancient poem training corpus; and 5) expanding keywords inputted by a user by utilizing the keyword expansion module inthe step 3), constructing a model by combining a format controller, inputting the model, and sending the model into the trained model to generate the ancient poetry. According to the method, knowledgein the Chinese monolingual data is utilized, no manual rule setting is needed, the model can automatically learn the ancient poetry mode, and the model has better generalization ability by using thepre-training model.

Description

technical field [0001] The invention relates to a natural language processing technology, in particular to a method for automatically generating ancient poetry based on a pre-trained model. Background technique [0002] Nowadays, deep learning technology has been greatly developed, and major breakthroughs have been made in image recognition, language recognition, machine translation, text generation and other tasks. Compared with the previous automatic generation method of ancient poetry based on rules and templates, based on deep learning The automatic generation method of ancient style poems can generate better quality of ancient style poems, and the generated results are more clear and smooth. However, most of the current automatic methods for ancient poetry based on deep learning are usually composed of several models responsible for generating a certain sentence, which is relatively complicated. For example, the classic method based on the RNN Encoder-decode framework i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/284G06F40/263G06N3/04G06N3/08
CPCG06F40/284G06F40/263G06N3/08G06N3/044G06N3/045
Inventor 杜权
Owner 沈阳雅译网络技术有限公司