Automatic Chinese text topic exploration method and system

A Chinese text technology, applied in the field of automatic Chinese text topic exploration, that addresses the increased time consumption, increased computation, and inconvenient induction and extraction of text topics, and makes text topics easier to extract.

Pending Publication Date: 2021-03-26
珠海横琴博易数据技术有限公司

AI Technical Summary

Problems solved by technology

[0003] 1. The goal of K-Means judgment is to minimize the sum of the squared distances from the cluster members to the actual centroid containing the member. As the analyzed data set continues to increase, it is necessary to calculate the distance from all data points to the centroid each time, and the amount of cal




Example Embodiment

[0022] Specific embodiments of the present invention are described in detail in this section, and preferred embodiments are shown in the drawings. The drawings supplement the written description graphically, so that each technical feature and the overall technical solution of the invention can be understood intuitively; they are not to be understood as limiting the scope of the invention.

[0023] In the description of the invention, "a plurality" means two or more; "greater than", "less than", "exceeding", and the like are understood as excluding the stated number, while "above", "below", "within", and the like are understood as including it. Where "first" and "second" are used, they serve only to distinguish technical features, and are not to be understood as indicating or implying relative importance, the number of the indicated technical features, or the order of the indicated technical features.

[0024]In the description of the present invention, ...



Abstract

The invention discloses an automatic Chinese text topic exploration method and system. The system comprises a word vector construction module, a text clustering module, and a visualization module. The automatic Chinese text topic exploration method runs in this system and addresses the long calculation time of the K-Means clustering method; it also provides more classification feature information, so that text topics can be extracted manually and quickly.

Description

Technical field

[0001] The invention relates to the field of text topic exploration, and in particular to a method and system for automatic Chinese text topic exploration.

Background technique

[0002] There are many methods of topic exploration, such as topic extraction based on LDA and K-Means text clustering based on unsupervised learning. The LDA topic model infers topics from the perspective of probability and statistics, using Bayesian reasoning. The K-Means clustering model clusters scattered points according to the distances between vectors in space, and ultimately divides the texts into different clusters or classes. On this basis, text topics are finally extracted through further manual information extraction and induction. Against this background, K-Means has the following disadvantages:

[0003] 1. The goal of K-Means is to minimize the sum of the squared distances from each cluster member to the centroid of the cluster containing that member. ...
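The K-Means objective described in [0003], and the reason its cost grows with the data set, can be illustrated with a minimal sketch. This is a generic textbook K-Means step written in plain Python, not the method claimed by the patent; the function names `squared_distance`, `assign_and_score`, and `update_centroids` are hypothetical and chosen only for this illustration.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def squared_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two vectors of equal length."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign_and_score(
    points: Sequence[Vector], centroids: Sequence[Vector]
) -> Tuple[List[int], float, int]:
    """Assign each point to its nearest centroid.

    Returns the labels, the K-Means objective (sum of squared distances
    from each point to its assigned centroid), and the number of distance
    evaluations, which is len(points) * len(centroids) on every pass.
    """
    labels: List[int] = []
    objective = 0.0
    evaluations = 0
    for p in points:
        dists = [squared_distance(p, c) for c in centroids]
        evaluations += len(centroids)
        best = min(range(len(centroids)), key=dists.__getitem__)
        labels.append(best)
        objective += dists[best]
    return labels, objective, evaluations


def update_centroids(
    points: Sequence[Vector], labels: Sequence[int], centroids: Sequence[Vector]
) -> List[List[float]]:
    """Move each centroid to the mean of its members (empty clusters stay put)."""
    updated: List[List[float]] = []
    for j, old in enumerate(centroids):
        members = [p for p, label in zip(points, labels) if label == j]
        if not members:
            updated.append(list(old))
            continue
        dim = len(old)
        updated.append([sum(m[d] for m in members) / len(members) for d in range(dim)])
    return updated


# Toy data: two obvious groups of two points each.
points = [[0.0, 0.0], [0.0, 2.0], [10.0, 0.0], [10.0, 2.0]]
centroids = [[0.0, 0.0], [10.0, 0.0]]

labels, objective, evaluations = assign_and_score(points, centroids)
centroids = update_centroids(points, labels, centroids)
labels2, objective2, evaluations2 = assign_and_score(points, centroids)
```

Each pass evaluates one distance per point per centroid, so the work per iteration scales with the number of texts multiplied by the number of clusters, which is the growth in calculation that the background section refers to.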


Application Information

IPC(8): G06F40/284, G06K9/62, G06F40/258, G06F40/216, G06F40/49
CPC: G06F40/284, G06F40/258, G06F40/216, G06F40/49, G06F18/23213, Y02D10/00
Inventor: 张荣显
Owner: 珠海横琴博易数据技术有限公司