Keyword extraction method and device and storage medium

A keyword extraction method, applied in the field of text processing, addresses problems such as low accuracy, dependence on word segmentation, and poor keyword extraction performance, and achieves the effects of reducing labor costs and improving accuracy and recall rate.

Pending Publication Date: 2019-10-22
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

However, the effectiveness of this method depends heavily on the accuracy of word segmentation: when word segmentation is inaccurate, keyword extraction accuracy is low. In addition, the candidate-word features extracted by this method are not comprehensive enough, so it may perform poorly in keyword extraction for domains...

Examples


Embodiment Construction

[0032] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0033] It should be noted that the terms "first" and "second" in the description and claims of the present invention and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances, such that the embodiments of the invention described herein can be practiced in sequences other than those illustrate...

Abstract

The invention relates to the technical field of text processing, and in particular to a keyword extraction method, a device, and a storage medium. The method comprises: obtaining a corpus text in a target field from which keywords are to be extracted, and preprocessing it; traversing each analysis sentence of the preprocessed corpus text and sequentially combining multiple consecutive characters in the sentence into word units; obtaining lexical features of the word units; obtaining statistical features of the word units; and, using a machine learning model built with a machine learning algorithm, extracting keywords from the corpus text based on the lexical features and statistical features of the word units. The method can improve the accuracy and recall rate of keyword extraction, the extracted keywords are highly correlated with the target field, and more accurate resource data can be provided for related text analysis.
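The word-unit step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the `max_len` cap on unit length, the punctuation set used for splitting, and the choice of count and relative frequency as "statistical features" are all assumptions made for illustration, since the text available here does not specify them. Lexical features and the machine learning model are omitted.

```python
import re
from collections import Counter


def preprocess(text):
    """Split raw text into analysis sentences on whitespace and punctuation.

    The punctuation set is an assumption; the source does not list one.
    """
    parts = re.split(r"[\s,.;:!?，。；：！？、]+", text)
    return [p for p in parts if p]


def extract_word_units(sentence, max_len=4):
    """Combine every run of 1..max_len consecutive characters into a word unit.

    max_len is a hypothetical parameter, not taken from the source.
    """
    units = []
    for start in range(len(sentence)):
        for length in range(1, max_len + 1):
            end = start + length
            if end > len(sentence):
                break
            units.append(sentence[start:end])
    return units


def statistical_features(text, max_len=4):
    """Compute simple per-unit statistics over the whole text.

    Count, relative frequency, and length are stand-in features chosen
    for this sketch; the source does not enumerate its features here.
    """
    counts = Counter()
    for sentence in preprocess(text):
        counts.update(extract_word_units(sentence, max_len))
    total = sum(counts.values())
    return {
        unit: {"count": c, "freq": c / total, "length": len(unit)}
        for unit, c in counts.items()
    }


if __name__ == "__main__":
    feats = statistical_features("关键词提取，关键词", max_len=3)
    print(feats["关键词"])
```

Because the units are built directly from consecutive characters, this step does not rely on a word segmenter, which is the dependence the summary above identifies as a weakness of earlier methods. In a full pipeline, feature dictionaries like these would be passed to a trained classifier that decides which units are keywords.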

Description

Technical field

[0001] The present invention relates to the technical field of text processing, and in particular to a keyword extraction method, device, and storage medium.

Background art

[0002] With the development of the Internet, the amount of online text information has exploded, and it is increasingly difficult to obtain the required text information manually. Therefore, how to quickly and effectively summarize the key information of texts in a certain field or topic has become an important issue.

[0003] To deal effectively with massive text data, researchers have done a great deal of work on text classification, text clustering, automatic summarization, and information retrieval, and all of this work involves the problem of how to obtain the keywords in a text. Keywords are a distillation of the text's topic information; they summarize the main content of the text and can help users quickly understand its gist; in addit...

Application Information

IPC(8): G06F17/27
CPC: G06F40/205, G06F40/289
Inventor: 何一涛, 智绪浩
Owner: TENCENT TECH (SHENZHEN) CO LTD