Automatic keyword extracting method

An automatic keyword extraction and keyword technology, applied in natural language data processing, instruments, computing, etc., can solve problems such as poor recognition effect, poor recognition effect of low-frequency keywords, lack of semantic features, etc.

Active Publication Date: 2018-11-30
BEIJING INFORMATION SCI & TECH UNIV
View PDF4 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The current mainstream methods usually combine the advantages of different methods to extract keywords for specific problems. The existing defects include: lack of consideration of semantic features, poor recognition effect, poor low-frequency keyword recognition effect, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic keyword extracting method
  • Automatic keyword extracting method
  • Automatic keyword extracting method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] In order to make the purpose, technical solutions and advantages of the present invention clearer, the present invention will be further described below in conjunction with the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0068] A method for automatically extracting keywords proposed by the present invention firstly proposes a method based on word frequency-document distribution entropy to extract common words in the 3GPP technical standard, then proposes an algorithm based on dependency parsing tree to extract candidate keywords, and filters out candidate keywords After the common words, the position featur...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an automatic keyword extracting method. The method includes: extracting general words in a technical standard, extracting candidate keywords, filtering the general words aiming at the candidate keywords, calculating weight scores of the candidate keywords according to position features, word co-occurrence features and context semantic features, calculating a dynamic threshold according to a range of the weight scores of the candidate keywords, and determining a result keyword according to the dynamic threshold. The automatic keyword extracting method has advantages that keyword extraction is realized according to the position features, the word co-occurrence features and the context semantic features, influences of document interior positions and context semantic features on keyword weights are considered comprehensively, high accuracy and high recall rate are realized, 3GPP technical standard retrieval quality is improved, labor cost is reduced, and practicalapplication demands can be well met.

Description

technical field [0001] The invention belongs to the technical field of keyword automatic extraction, in particular to a method for automatic keyword extraction oriented to 3GPP technical standards. Background technique [0002] The vigorous development of mobile communication technology has brought epoch-making changes to human society. As the standard maker of cutting-edge technologies in the communication field, The 3rd Generation Partnership Project (3GPP) is committed to promoting the evolution-based global mobile communication (GSM) core network (including WCDMA, TD-SCDMAE, EDGE, etc. ) of the 3G standard. [0003] In recent years, there have been many cases of patent infringement litigation disputes between large communication technology companies, and the stability of invention patent rights has been challenged unprecedentedly. 3GPP technical standards play an irreplaceable and important role in telecommunications patent examination. [0004] 3GPP technical standar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/211G06F40/216
Inventor 吕学强董志安
Owner BEIJING INFORMATION SCI & TECH UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products