Keyword extraction method and keyword extraction apparatus

An extraction method and technology of an extraction device, which are applied in the field of text processing, can solve problems such as low keyword extraction accuracy, and achieve the effects of solving the low keyword extraction accuracy, improving calculation accuracy, and improving extraction accuracy.

Pending Publication Date: 2018-07-24
TENCENT TECH (SHENZHEN) CO LTD
View PDF3 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Embodiments of the present invention provide a keyword extraction method and a keyword extraction device with high keyword extraction accuracy to solve the problem of low keyword extraction accuracy in existing keyword extraction methods and keyword extraction devices question

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Keyword extraction method and keyword extraction apparatus
  • Keyword extraction method and keyword extraction apparatus
  • Keyword extraction method and keyword extraction apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Referring to the drawings, wherein like reference numerals represent like components, the principles of the present invention are exemplified when implemented in a suitable computing environment. The following description is based on illustrated specific embodiments of the invention, which should not be construed as limiting other specific embodiments of the invention not described in detail herein.

[0034] In the following description, specific embodiments of the present invention are described with reference to steps and symbols for operations performed by one or more computers, unless otherwise stated. Accordingly, it will be understood that the steps and operations, which at times are referred to as being performed by a computer, include manipulation by a computer processing unit of electronic signals representing data in a structured form. This manipulation transforms the data or maintains it at a location in the computer's memory system that can reconfigure or ot...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a keyword extraction method. The method comprises the steps of performing word segmentation operation on all analysis statements of an extracted text to obtain word units of theanalysis statements; determining dependency association degrees between candidate words in the analysis statements and other candidate words in the analysis statements; determining word attraction between the candidate words and other candidate words in the extracted text; calculating related weights between the candidate words in the analysis statements and other candidate words in the analysisstatements; creating weighted undirected graphs of the analysis statements; based on a text sorting algorithm, calculating vertex scores of the vertexes of the weighted undirected graphs; and according to the vertex scores, sorting the candidate words corresponding to the vertexes, and extracting keywords in the candidate words. The invention furthermore provides a keyword extraction apparatus. The related weight between the two candidate words serves as a weight edge in the text sorting algorithm, so that the calculation accuracy of the text sorting algorithm is improved and the keyword extraction accuracy is improved.

Description

technical field [0001] The invention relates to the field of text processing, in particular to a keyword extraction method and a keyword extraction device. Background technique [0002] In order to effectively process massive text data, researchers have done a lot of research in the directions of text classification, text clustering, automatic summarization and information retrieval, and these studies all involve a key and basic problem, that is, how to obtain keywords in the text. Therefore, in tasks such as natural language processing and information retrieval, keyword extraction technology has gradually become a hot research issue. Among the existing research results, keyword extraction technology has been widely used in news service, query service and other fields, and it has been proved that it can play an important role in tasks such as information retrieval, automatic summarization, and text classification. At the same time, massive information processing also poses...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/313G06F16/3344G06F40/211G06F40/289
Inventor 王煦祥尹庆宇
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products