Text label construction method and device, computer equipment and storage medium

A text label and construction method technology, applied in the computer field, can solve the problems that are not suitable for label creation, cannot accurately realize the label creation in big data text, and achieve the effect of accurate clustering

Pending Publication Date: 2020-06-23
卓尔智联(武汉)研究院有限公司
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the conventional label construction methods are based on keywords and other methods. This kind of label implementation is generally applicable to situations such as sites, article regions, etc. It is not suitable for the creation of labels in big data texts, and cannot accurately realize the label creation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text label construction method and device, computer equipment and storage medium
  • Text label construction method and device, computer equipment and storage medium
  • Text label construction method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0046] The text label construction method provided by this application can be applied to such as figure 1 shown in the application environment. Wherein, the terminal 102 communicates with the server 104 through the network. The terminal 102 uploads the text data to be processed to the server 104, the server 104 obtains the text data to be processed, performs word segmentation processing on the text data to be processed, and obtains a word segmentation set; trains the word segmentation set through word2vec, obtains the similarity between words in the word segmentation set,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a text label construction method and device, computer equipment and a storage medium. The method comprises the steps of obtaining to-be-processed text data, performing word segmentation processing on the to-be-processed text data to obtain a word segmentation set, training the word segmentation set through word2vec to obtain similarity among words in the word segmentationset, performing word clustering based on the similarity among the words, and constructing a text label according to a word clustering result. Whole process, the similarity between words in the text data is accurately obtained through word2vec training, clustering is carried out based on the similarity between the words, accurate clustering can be achieved in an iterative clustering mode in the clustering process, and labels of the text data can be reasonably and accurately constructed based on the clustering result of accurate clustering.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a method, device, computer equipment and storage medium for constructing tags in text tags. Background technique [0002] With the rapid development of computers and the Internet, text document data has grown exponentially. However, in the face of such a huge amount of data, how to dig out useful information and how to quickly retrieve data has always been an important problem faced by people. [0003] Among them, the use of tag construction plays an important role in the utilization of document data. For example, UGC (User Generated Content, User Generated Content) classification and clustering, indexing, topic search, topic crawler, and recommendation systems can be performed based on tags. In addition to the above applications, another common use of tag building is news or blogging. By extracting keywords from news or blogs, readers can understand the content of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/216G06F16/35
CPCG06F16/355
Inventor 周鑫
Owner 卓尔智联(武汉)研究院有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products