A document keyword extraction method and device based on LDA and word vectors
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- HEFEI INSTITUTES OF PHYSICAL SCIENCE - CHINESE ACAD OF SCI
- Publication Date
- 2019-05-17
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The present invention relates to the technical fields of natural language processing and deep learning, in particular to a document keyword extraction method and device based on LDA and word vectors. Background technique
[0002] Keywords can concisely and accurately describe the content of the text, and generally consist of several words and phrases. Keyword extraction, also known as keyword tagging, refers to extracting a number of representative words or phrases from text or text collections to reflect the main semantic information of the text. An important channel for information of interest. The advent of the Internet era puts forward new requirements for keyword extraction. The extracted keywords should have the following three characteristics: significance, readability and comprehensiveness. Significance means that the extracted keywords should reflect the core content of the document. For example, "machine translation" is extracted from the d...