Keyword extraction method and device and electronic equipment

A keyword extraction technology, applied in the computer field, that addresses the problems of manual feature construction: poor generalization ability, heavy time cost, and a direct effect on the quality of the trained network.

Active Publication Date: 2018-06-05
BEIJING QIYI CENTURY SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, in the process of implementing the present invention, the inventors found that the prior art has at least the following problem: supervised methods require a large number of manually constructed features in order to train a network with good performance. Manual feature construction requires different features for different fields, so it generalizes poorly and takes a lot of time and effort, and the quality of the constructed features directly affects the quality of the trained network.

Examples

Embodiment Construction

[0072] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0073] The keyword extraction method provided by the embodiment of the present invention can be used to extract keywords from any text, for example the text of a paper or a news report.

[0074] Figure 1 is a schematic flow chart of a keyword extraction method provided in an embodiment of the present invention. The method includes the following steps:

[0075] S101. Perform word segmentation on the text to be processed, o...

Abstract

An embodiment of the invention provides a keyword extraction method, a device, and electronic equipment. The method includes: performing word segmentation on a to-be-processed text to obtain a plurality of word segments, and determining a word vector for each word segment; determining a label probability vector for each word segment according to its word vector and a trained BLSTM (bidirectional long short-term memory) network; for each sentence of the to-be-processed text, performing CRF (conditional random field) decoding on the sentence according to the label probability vectors of its word segments to determine a classification label for each word segment; determining the word segments in each sentence whose classification labels are preset classification labels as the keywords of that sentence; and taking the keywords of each sentence in the to-be-processed text as the keywords of the to-be-processed text. Because the neural network is built from the BLSTM network and CRF decoding and then trained, the manual feature construction of traditional methods is avoided and the generalization capability of keyword extraction is improved.
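
The decoding and keyword selection steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the label set (`O`, `KEY`), the hand-written label probability vectors (standing in for the output of a trained BLSTM), the transition scores (which a trained CRF would learn), and the function names are all assumptions made for illustration. The CRF decoding step is shown here as standard Viterbi decoding in log-space.

```python
import math

# Hypothetical label set; the source only says "preset classification labels".
LABELS = ["O", "KEY"]


def viterbi_decode(label_probs, transitions):
    """Return the highest-scoring label index sequence for one sentence.

    label_probs: one probability vector per word segment (assumed BLSTM output).
    transitions: transitions[i][j] is the log-space score of label i followed
                 by label j (assumed values; a trained CRF would learn them).
    """
    n_labels = len(label_probs[0])
    # Convert to log-space so that scores add; clamp to avoid log(0).
    emissions = [[math.log(max(p, 1e-12)) for p in vec] for vec in label_probs]
    scores = list(emissions[0])
    backpointers = []
    for emit in emissions[1:]:
        step_scores, step_back = [], []
        for j in range(n_labels):
            # Best previous label for arriving at label j.
            best_i = max(range(n_labels),
                         key=lambda i: scores[i] + transitions[i][j])
            step_scores.append(scores[best_i] + transitions[best_i][j] + emit[j])
            step_back.append(best_i)
        scores = step_scores
        backpointers.append(step_back)
    # Trace the best path backwards from the best final label.
    best = max(range(n_labels), key=lambda j: scores[j])
    path = [best]
    for step_back in reversed(backpointers):
        best = step_back[best]
        path.append(best)
    path.reverse()
    return path


def extract_keywords(sentences, transitions, preset_labels=("KEY",)):
    """Decode each sentence and collect segments tagged with a preset label.

    sentences: list of (word_segments, label_probs) pairs, one per sentence.
    """
    keywords = []
    for words, label_probs in sentences:
        path = viterbi_decode(label_probs, transitions)
        keywords.extend(w for w, k in zip(words, path)
                        if LABELS[k] in preset_labels)
    return keywords


# Toy input: one sentence of three word segments with assumed probabilities.
words = ["neural", "network", "works"]
probs = [[0.2, 0.8], [0.55, 0.45], [0.9, 0.1]]
# Assumed transition scores that favour staying in KEY and penalise switching.
transitions = [[0.0, -1.0], [-1.0, 0.5]]

print(extract_keywords([(words, probs)], transitions))  # ['neural', 'network']
```

With all transition scores set to zero, the second segment would be labelled `O` because its per-word probability slightly favours `O`; the transition scores are what let the decoder keep "network" attached to the preceding keyword. This is the reason for decoding per sentence rather than taking the most probable label of each word independently.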

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a keyword extraction method, device and electronic equipment.

Background

[0002] Keywords are words or phrases that reflect the theme of a text, and are an important basis for quickly understanding its content and grasping its theme. For example, in a news report, keywords convey the theme and key content of the report; in a thesis, keywords make clear the field and research topic. At present, keyword extraction technology is widely used in information retrieval, text classification and other fields. The Internet has entered the era of Web 2.0. Many websites recommend objects of interest to users, such as videos, news and books, and they also need keyword extraction technology to satisfy users and deliver content in a more fine-grained and scientific manner, to achieve a...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27
CPC: G06F40/284, G06F40/289
Inventors: 陈伟, 王亮, 吴友政
Owner: BEIJING QIYI CENTURY SCI & TECH CO LTD