A method and device for automatically generating Chinese text tag clouds

A technology for automatically generating text tags, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc. It addresses two problems: existing tag cloud generation methods cannot perform accurate lexical analysis, and the resulting tag clouds are poorly adapted to the structure of Chinese text.

Active Publication Date: 2016-11-30
SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI
Cites: 5, Cited by: 0

AI Technical Summary

Problems solved by technology

[0003] The present invention provides a method and device for automatically generating Chinese text tag clouds. It aims to solve two technical problems: existing tag cloud generation methods cannot accurately perform lexical analysis on a piece of text data, and the tag clouds they generate mainly target English text and do not adapt well to the structure of Chinese characters.


Examples


Detailed Description of the Embodiments

[0026] To make the objectives, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here only explain the present invention and do not limit it.

[0027] Figure 1 is a flowchart of a method for automatically generating a Chinese text tag cloud according to an embodiment of the present invention. The method comprises:

[0028] Step 100: performing word segmentation and part-of-speech tagging on the text data to be analyzed using Chinese lexical analysis;

[0029] In step 100, the text data to be analyzed includes data such as news, web, and newspaper text. Please also refer to figure 2, which is a flowchart of the Chinese lexical analysis algorit...
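Step 100 (word segmentation plus part-of-speech tagging) can be illustrated with a minimal sketch. The lexical analysis algorithm of the invention is not reproduced in this excerpt, so the sketch swaps in a different, much simpler technique: forward maximum matching against a small dictionary. The names `LEXICON` and `segment_and_tag`, the dictionary entries, and the tag letters are all hypothetical and chosen only for illustration.

```python
# Toy lexicon mapping word -> part-of-speech tag.
# Entries and tag letters are hypothetical, not taken from the patent.
LEXICON = {
    "标签云": "n",  # tag cloud (noun)
    "标签": "n",    # tag (noun)
    "生成": "v",    # generate (verb)
    "中文": "n",    # Chinese (noun)
    "文本": "n",    # text (noun)
    "自动": "d",    # automatically (adverb)
    "的": "u",      # structural particle
}
MAX_WORD_LEN = max(len(word) for word in LEXICON)


def segment_and_tag(text):
    """Segment text by forward maximum matching and tag each token.

    At every position the longest lexicon word starting there is taken.
    Characters that start no lexicon word are emitted alone with tag "x".
    """
    tokens = []
    i = 0
    while i < len(text):
        longest = min(MAX_WORD_LEN, len(text) - i)
        for length in range(longest, 0, -1):
            candidate = text[i:i + length]
            if candidate in LEXICON:
                tokens.append((candidate, LEXICON[candidate]))
                i += length
                break
        else:
            # No match of any length: fall back to a single unknown character.
            tokens.append((text[i], "x"))
            i += 1
    return tokens


if __name__ == "__main__":
    for word, tag in segment_and_tag("自动生成中文文本的标签云"):
        print(word, tag)
```

Greedy longest-match is easy to follow but resolves ambiguity poorly, which is one reason a real system would rely on a statistical or dictionary-plus-model lexical analyzer instead.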


Abstract

The invention belongs to the technical field of tag extraction, and in particular relates to a method and device for automatically generating a Chinese text tag cloud. The method comprises: step a: performing word segmentation and part-of-speech tagging on the text data to be analyzed using Chinese lexical analysis; step b: extracting the keywords of the text data and their word frequencies from the segmentation and part-of-speech tagging results; step c: taking the extracted keywords and their word frequencies as input data and generating a tag cloud with a tag cloud generation algorithm. The method and device combine and optimize Chinese word segmentation and tag cloud algorithms, fill the gap in tag cloud generation algorithms for Chinese, and provide a useful tool for tasks such as extracting key points from news and analyzing public opinion.
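Steps b and c can be sketched as follows, assuming step a has already produced a list of `(word, tag)` pairs. This is an illustration under stated assumptions, not the method claimed in the patent: the choice of nouns and verbs as keyword candidates, the linear frequency-to-font-size mapping, and the names `CONTENT_TAGS`, `extract_keywords`, and `font_sizes` are all hypothetical. The layout part of a tag cloud generation algorithm (placing words on the canvas) is not covered.

```python
from collections import Counter

# Assumption: only nouns ("n") and verbs ("v") are kept as keyword candidates.
CONTENT_TAGS = {"n", "v"}


def extract_keywords(tagged_tokens, top_k=10):
    """Step b: count content words and return the top_k (word, frequency) pairs."""
    counts = Counter(word for word, tag in tagged_tokens if tag in CONTENT_TAGS)
    return counts.most_common(top_k)


def font_sizes(keywords, min_size=12, max_size=48):
    """Step c (input preparation): map each frequency linearly to a font size."""
    if not keywords:
        return {}
    freqs = [freq for _, freq in keywords]
    lowest, highest = min(freqs), max(freqs)
    span = highest - lowest
    sizes = {}
    for word, freq in keywords:
        if span == 0:
            # All keywords are equally frequent: give them the same size.
            sizes[word] = max_size
        else:
            scaled = (freq - lowest) * (max_size - min_size) / span
            sizes[word] = round(min_size + scaled)
    return sizes


if __name__ == "__main__":
    tagged = [
        ("标签云", "n"), ("生成", "v"), ("的", "u"), ("标签云", "n"),
        ("文本", "n"), ("标签云", "n"), ("生成", "v"),
    ]
    keywords = extract_keywords(tagged)
    print(keywords)
    print(font_sizes(keywords))
```

A layout routine would then consume the word-to-size mapping; frequent words receive larger type, which is the basic visual convention of a tag cloud.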

Description

Technical field

[0001] The invention belongs to the technical field of tag extraction, and in particular relates to a method and device for automatically generating a Chinese text tag cloud.

Background

[0002] With the development of science and technology, especially the rapid development of computer technology, the human capacity to generate and obtain data has increased by orders of magnitude. News, the Internet, and newspapers in particular generate a large amount of new information. The collection, analysis, and mining of this Chinese text data has long been a focus of researchers' work. Tags are usually used to mark text data and flag keywords, which makes the data easier to find or locate. A tag cloud is a visual description of keywords, used to summarize user-generated tags or the textual content of a website. The existing tag cloud generation method for Chinese text extracts keywords through word segmentation technology, and generate...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30, G06F17/27
Inventor: 汪云海, 华博, 丹尼尔·科恩, 陈宝权
Owner: SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI