Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Key term extraction method, device and equipment and computer readable storage medium

A computer program and terminology technology, applied in computing, unstructured text data retrieval, instruments, etc., can solve problems such as focusing on word frequency, ignoring, and key terms not expressing the meaning of the article, so as to improve accuracy and avoid improper segmentation. Effect

Active Publication Date: 2019-06-14
GCI SCI & TECH +1
View PDF9 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the traditional Chinese key term extraction method has the following defects: (1) Using a general segmentation dictionary, it is easy to segment the key terms in a specific field into words that are not related to the subject of the article, resulting in the final key term extraction cannot express the real meaning of the article. (2) Pay too much attention to the frequency of words, ignoring some low-frequency key terms

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Key term extraction method, device and equipment and computer readable storage medium
  • Key term extraction method, device and equipment and computer readable storage medium
  • Key term extraction method, device and equipment and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0044] Please refer to figure 1 , the first embodiment of the present invention provides a key term extraction method, which can be executed by a key term extraction device, and includes the following steps:

[0045] S11: Segment the text according to the pre-built dictionary of specific domain terms;

[0046] In the embodiment of the present invention, the key term extraction device may be a computing device such as a computer, a mobile phone, a tablet compu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a key term extraction method, device and equipment and a computer readable storage medium, and the method comprises the steps of carrying out the segmentation of a text according to a pre-constructed specific domain term dictionary; traversing the text by using a preset first extraction window, extracting the words obtained after segmentation processing to obtain candidateterms of a specific field, and extracting words obtained after segmentation processing according to a pre-constructed term dictionary of the specific field to obtain candidate terms of the specific field; performing the topic clustering on the candidate terms through a pre-constructed probability topic model to obtain a plurality of topic-associated candidate terms and association probabilities thereof; and determining the key terms according to the candidate terms associated with each topic and the association probability of the candidate terms. According to the present invention, the text isdivided on the basis of the term dictionary in the specific field, and the probability topic model is adopted for extracting the key terms, so that the key terms in the specific field are effectivelyextracted, and the key term extraction accuracy is improved.

Description

technical field [0001] The present invention relates to the technical field of word extraction, in particular to a key term extraction method, device, equipment and computer-readable storage medium. Background technique [0002] The traditional Chinese key term extraction method is generally to first segment the Chinese text, and then based on the word segmentation results, the frequency method is used to extract key terms. However, the traditional Chinese key term extraction method has the following defects: (1) Using a general segmentation dictionary, it is easy to segment the key terms in a specific field into words that are not related to the subject of the article, resulting in the final key term extraction cannot express the real meaning of the article. (2) Pay too much attention to the frequency of words, ignoring some low-frequency key terms. Contents of the invention [0003] In view of the above problems, the object of the present invention is to provide a key t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F16/35
CPCY02A90/10
Inventor 杜翠凤蒋仕宝
Owner GCI SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products