Paper title generation method capable of utilizing distributed semantic information

Semantic information and distributed representation technology, applied in the fields of semantic analysis, special data processing applications, and natural language data processing. It addresses the problems that generated titles carry little information and struggle to conform to semantic rules, and it achieves titles that are rich in information.

Active Publication Date: 2017-02-08
BEIJING INSTITUTE OF TECHNOLOGY

AI Technical Summary

Problems solved by technology

[0018] Extractive methods produce titles that carry little information, and abstractive methods based on statistical learning produce titles that often fail to conform to semantic rules. To solve these problems, the present invention proposes a paper title generation method that uses distributed semantic information.


Examples


Embodiment Construction

[0041] In order to better illustrate the purpose and advantages of the present invention, the implementation of the method of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples.

[0042] With abstracts as input, one test was designed and run: generating titles for 500 paper abstracts.

[0043] In the experiment, 100,000 papers from HowNet, each containing a title, an abstract, and body text, were used as the training corpus for the GloVe model; 20,000 of the titles were selected as the training corpus for the title generation model; and 500 of the abstracts were selected as the test corpus for title generation.

[0044] The experiment uses the ROUGE value and manual evaluation criteria as evaluation indicators:

[0045] 1. Evaluation of ROUGE value

[0046] The ROUGE method judges the quality of candidate titles by calculating the overlap of word units between the generated titles and the standard (reference) titles. R...
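The word-unit overlap described in [0046] can be illustrated with a minimal ROUGE-N sketch. The function names `ngrams` and `rouge_n`, whitespace tokenization, and the sample titles are assumptions made for illustration; the patent text available here does not specify which ROUGE variant or tokenizer was used.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(candidate, reference, n=1):
    """Return ROUGE-N recall, precision, and F1 for two token lists.

    Overlap is the clipped n-gram match count: each n-gram counts at most
    as many times as it appears in both the candidate and the reference.
    """
    cand = ngrams(candidate, n)
    ref = ngrams(reference, n)
    overlap = sum((cand & ref).values())
    ref_total = sum(ref.values())
    cand_total = sum(cand.values())
    recall = overlap / ref_total if ref_total else 0.0
    precision = overlap / cand_total if cand_total else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"recall": recall, "precision": precision, "f1": f1}


# Hypothetical generated title and reference title, split on whitespace.
generated = "paper title generation with semantic information".split()
standard = "paper title generation using distributed semantic information".split()
scores = rouge_n(generated, standard, n=1)
```

A higher recall means the generated title covers more of the word units in the standard title; calling `rouge_n` with `n=2` applies the same comparison to word pairs.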


Abstract

The invention relates to a paper title generation method that uses distributed semantic information, and belongs to the field of natural language processing. The method comprises the following steps: first, the TextRank algorithm is used to obtain the top k keywords of a paper abstract, a GloVe (Global Vectors for Word Representation) model is trained to obtain word vectors, and the extracted keywords are initialized with these vectors; next, a recurrent neural network based on long short-term memory (LSTM) units is used to obtain the title; finally, the title is constructed. A deep learning method is used to mine the deep semantic information of the title, so the generated title is highly readable and conforms to the semantic rules of titles.
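The keyword extraction step can be sketched as follows. This is a minimal, generic TextRank over a word co-occurrence graph, not the patent's own implementation: the function name `textrank_keywords`, the window size, the damping factor, the iteration count, and the sample tokens are all assumptions, and stop-word or part-of-speech filtering is omitted.

```python
from collections import defaultdict


def textrank_keywords(tokens, k=5, window=2, damping=0.85, iterations=50):
    """Return the k highest-ranked words from a token list using TextRank."""
    # Build an undirected co-occurrence graph: two distinct words are linked
    # if they appear within `window` positions of each other.
    neighbors = defaultdict(set)
    for i, word in enumerate(tokens):
        for j in range(i + 1, min(i + window + 1, len(tokens))):
            other = tokens[j]
            if other != word:
                neighbors[word].add(other)
                neighbors[other].add(word)

    # Iterate the PageRank-style update: each word shares its score
    # equally among its neighbors.
    scores = {word: 1.0 for word in neighbors}
    for _ in range(iterations):
        updated = {}
        for word in neighbors:
            rank = sum(scores[u] / len(neighbors[u]) for u in neighbors[word])
            updated[word] = (1 - damping) + damping * rank
        scores = updated

    # Sort by descending score, breaking ties alphabetically.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:k]


# Hypothetical tokenized abstract fragment.
sample = "title generation model uses title keywords and title vectors".split()
top_words = textrank_keywords(sample, k=3)
```

In the method described above, the top k words returned by this step would then be mapped to their GloVe vectors before being passed to the LSTM-based network.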

Description

Technical field

[0001] The invention relates to a method for generating paper titles using distributed semantic information, and belongs to the field of natural language processing.

Background technique

[0002] The title conveys the main idea of an article. Because processing large volumes of articles is tedious and time-consuming, technology that generates titles automatically lets people grasp information quickly, so research on it has significant practical value.

[0003] There are two main types of title generation methods: extractive and abstractive.

[0004] 1. Extractive title generation:

[0005] The extractive headline generation method selects a group of salient sentences from the candidate set and then uses sentence compression to produce the headline.

[0006] (1) For example, Dorr et al. proposed a headline generation method combining sema...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F17/27
CPC: G06F40/289, G06F40/30
Inventor: 罗森林, 潘丽敏, 王睿怡, 吴舟婷
Owner: BEIJING INSTITUTE OF TECHNOLOGY