
Method and device for automatically generating Chinese text label cloud

A text label and automatic generation technology, applied in electronic digital data processing, special data processing applications, instruments, and related fields. It addresses two problems: existing tag clouds do not adapt well to the structure of Chinese text, and existing tag cloud generation methods cannot perform lexical analysis.

Active Publication Date: 2013-12-11
SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

[0003] The present invention provides a method and device for automatically generating Chinese text tag clouds. It aims to solve two technical problems: existing tag cloud generation methods cannot accurately perform lexical analysis on a piece of text data, and the tag clouds they generate mainly target English text and do not adapt well to the structure of Chinese characters.



Examples


Embodiment Construction

[0026] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0027] Refer to Figure 1, which is a flowchart of a method for automatically generating a Chinese text tag cloud according to an embodiment of the present invention. The method for automatically generating a Chinese text tag cloud in this embodiment comprises:

[0028] Step 100: performing word segmentation and part-of-speech tagging on the text data to be analyzed using Chinese lexical analysis;

[0029] In step 100, the text data to be analyzed includes news, web, and newspaper data. Please also refer to Figure 2, which is a flowchart of the Chinese lexical analysis algorit...
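The lexical analysis algorithm itself is cut off in this excerpt, so the following is only a minimal sketch of what step 100 produces: a list of (word, part-of-speech) pairs. It swaps in forward maximum matching over a tiny dictionary, which is a common textbook segmentation technique and not necessarily the one used by the invention. The lexicon entries, the tag names, and the function name `segment_and_tag` are all assumptions made for illustration.

```python
# Toy lexicon mapping words to part-of-speech tags.
# Both the entries and the tag names are illustrative assumptions.
LEXICON = {
    "深圳": "ns",    # place name
    "发布": "v",     # verb
    "新": "a",       # adjective
    "的": "u",       # particle
    "标签": "n",     # noun
    "云": "n",
    "标签云": "n",
}
MAX_WORD_LEN = max(len(word) for word in LEXICON)


def segment_and_tag(text):
    """Segment text by forward maximum matching and attach a POS tag.

    At each position the longest lexicon entry that matches is taken.
    Characters not covered by the lexicon are emitted alone with tag "x".
    """
    tokens = []
    i = 0
    while i < len(text):
        longest = min(MAX_WORD_LEN, len(text) - i)
        for size in range(longest, 0, -1):
            candidate = text[i:i + size]
            if candidate in LEXICON:
                tokens.append((candidate, LEXICON[candidate]))
                i += size
                break
        else:
            # No lexicon entry starts here: fall back to a single character.
            tokens.append((text[i], "x"))
            i += 1
    return tokens
```

Because the longest match wins, "标签云" is kept as one word instead of being split into "标签" and "云". A production system would more likely use a statistical lexical analyzer, since dictionary matching alone cannot resolve ambiguous segmentations.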


Abstract

The invention belongs to the technical field of label extraction and particularly relates to a method and a device for automatically generating a Chinese text tag cloud. The method comprises the following steps: (a) performing word segmentation and part-of-speech tagging on the text data to be analyzed using Chinese lexical analysis; (b) extracting the keywords of the text data and their word frequencies from the segmentation and tagging result; (c) taking the extracted keywords and word frequencies as input data and generating a tag cloud with a tag cloud generation algorithm. The method and device combine and optimize Chinese word segmentation and the tag cloud algorithm, fill the gap left by the lack of a tag cloud generation algorithm for Chinese text, and provide a useful tool for tasks such as extracting the key points of news and analyzing public opinion.
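Steps (b) and (c) can be sketched as follows, assuming step (a) has already produced (word, tag) pairs. The abstract does not say which parts of speech count as keywords or how frequency maps to visual weight, so the `KEYWORD_TAGS` set, the linear font-size scaling, and the names `extract_keywords` and `font_sizes` are hypothetical choices for illustration only. Layout of the words on a canvas is left out.

```python
from collections import Counter

# Assumption: only nouns, place names and verbs are treated as keywords.
KEYWORD_TAGS = {"n", "ns", "v"}


def extract_keywords(tagged_tokens, top_k=10):
    """Step (b): count words whose tag is in KEYWORD_TAGS.

    Returns up to top_k (word, frequency) pairs, most frequent first.
    """
    counts = Counter(word for word, tag in tagged_tokens if tag in KEYWORD_TAGS)
    return counts.most_common(top_k)


def font_sizes(keyword_counts, min_pt=12, max_pt=48):
    """Input preparation for step (c): map frequency linearly to a font size."""
    if not keyword_counts:
        return {}
    freqs = [count for _, count in keyword_counts]
    lowest, highest = min(freqs), max(freqs)
    span = highest - lowest
    sizes = {}
    for word, count in keyword_counts:
        # When every keyword has the same frequency, use the largest size.
        scale = (count - lowest) / span if span else 1.0
        sizes[word] = round(min_pt + scale * (max_pt - min_pt))
    return sizes
```

The resulting word-to-size mapping is the kind of input a tag cloud layout routine would consume. Logarithmic scaling is a common alternative when a few words dominate the frequency distribution.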

Description

Technical field

[0001] The invention belongs to the technical field of label extraction, and in particular relates to a method and device for automatically generating a Chinese text tag cloud.

Background

[0002] With the development of science and technology, especially the rapid development of computer technology, the ability of human beings to generate and obtain data has increased by orders of magnitude. News, the Internet, and newspapers in particular generate a large amount of new information. The collection, analysis, and mining of this Chinese text data has long been a focus of researchers' work. Labels are usually used to mark text data and flag key words, which makes the data easier to find or locate. A tag cloud is a visual description of keywords, used to summarize user-generated tags or the textual content of a website. The existing tag cloud generation method for Chinese text extracts keywords through the word segmentation technology, and generate...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
Inventor: 汪云海, 华博, 丹尼尔·科恩, 陈宝权
Owner: SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI