Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text topic processing method and device, electronic equipment and computer storage medium

A processing method and topic technology, applied in computing, digital data processing, special data processing applications, etc., can solve problems such as low text clustering efficiency, unsatisfactory clustering performance, and slower clustering speed than supervised learning

Active Publication Date: 2020-02-07
XINHUANET CO LTD
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, hot topics are mainly extracted through text clustering. However, text clustering belongs to unsupervised learning, and the clustering speed is far slower than supervised learning. Even the most efficient clustering algorithm, its text clustering efficiency is very low, especially in In the face of massive text data, the clustering performance is even more unsatisfactory. Therefore, an efficient text clustering method is urgently needed to extract hot topics

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text topic processing method and device, electronic equipment and computer storage medium
  • Text topic processing method and device, electronic equipment and computer storage medium
  • Text topic processing method and device, electronic equipment and computer storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0083] Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present application, and are not construed as limiting the present application.

[0084] Those skilled in the art will understand that unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the specification of the present application refers to the presence of the features, integers, steps, operations, elements and / or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, elements, components, and / or groups thereof. It will be under...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention relates to the technical field of computer data processing. The invention discloses a text topic processing method and device, electronic equipment and a computer storage medium. The method comprises the following steps: determining N K values included in a predetermined K value range according to a predetermined step length, subjecting the obtained first text dataset to topic clustering processing and text filtering processing in sequence through a K-means clustering algorithm based on the N K values, obtaining a text data set obtained after the Nth text filtering processing and contour coefficients corresponding to the N K values, wherein N is a positive integer not smaller than 2; determining a target K value from the N K values according to the contourcoefficients respectively corresponding to the N K values; and performing topic clustering processing on the text data set after the Nth text filtering processing through K-Means according to the determined target K value to obtain a second topic clustering result, and taking each topic included in the second topic clustering result as each topic of the first text data set.

Description

technical field [0001] The embodiments of the present application relate to the technical field of computer data processing, and specifically, the present application relates to a text topic processing method, device, electronic equipment, and computer storage medium. Background technique [0002] With the rapid development of the era of Internet big data, among the massive news information and information from multiple sources, hot topics in various fields can be quickly and automatically extracted, instead of artificially retrieving the current most concerned hot topics from the huge amount of information Information has become an inevitable trend of new media platforms. [0003] At present, hot topics are mainly extracted through text clustering. However, text clustering belongs to unsupervised learning, and the clustering speed is far slower than supervised learning. Even the most efficient clustering algorithm, its text clustering efficiency is very low, especially in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/9535G06F16/35
CPCG06F16/9535G06F16/35
Inventor 李丹赵立永吴新丽韩勇刘启明代继涛
Owner XINHUANET CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products