Unlock instant, AI-driven research and patent intelligence for your innovation.

Text classification method and device, electronic equipment and computer readable storage medium

A text classification and text technology, applied in text database clustering/classification, computing, unstructured text data retrieval, etc., can solve problems such as poor clustering accuracy, and achieve high accuracy and high classification efficiency

Pending Publication Date: 2020-10-09
ALIBABA GRP HLDG LTD
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Unsupervised models (for example, LDA (Latent Dirichlet Allocation)), the accuracy of clustering is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method and device, electronic equipment and computer readable storage medium
  • Text classification method and device, electronic equipment and computer readable storage medium
  • Text classification method and device, electronic equipment and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] Various exemplary embodiments of the present invention will now be described in detail with reference to the accompanying drawings. It should be noted that the relative arrangements of components and steps, numerical expressions and numerical values ​​set forth in these embodiments do not limit the scope of the present invention unless specifically stated otherwise.

[0057] The following description of at least one exemplary embodiment is merely illustrative in nature and in no way taken as limiting the invention, its application or uses.

[0058] Techniques, methods and devices known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate, such techniques, methods and devices should be considered part of the description.

[0059] In all examples shown and discussed herein, any specific values ​​should be construed as exemplary only, and not as limitations. Therefore, other instances of the exemplary embodiment may have dif...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text classification method and device, electronic equipment and a computer readable storage medium, and the method comprises the steps: obtaining a vector of each text type through employing a word vector corresponding to a seed word of each text type; obtaining a to-be-classified text vector by utilizing the word vector corresponding to the keyword of the to-be-classified text; calculating a similarity value of the to-be-classified text and each text type according to the to-be-classified text vector and each text type vector; and comparing each similarity value witha corresponding similarity threshold value, and taking the text type corresponding to the similarity value exceeding the similarity threshold value as the type of the to-be-classified text.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, and more specifically, to a text classification method, a text classification device, an electronic device, and a computer-readable storage medium. Background technique [0002] At present, the methods for news classification mainly include the following two methods: [0003] The first method: manual labeling and classification, this classification method is inefficient. [0004] The second way: use the classification model to classify the news. Classification models mainly include supervised models and unsupervised models. A supervised model requires a large number of samples, and the training and tuning cycle is long. For unsupervised models (for example, LDA (Latent Dirichlet Allocation)), the accuracy of clustering is poor. [0005] Currently, a new classification method needs to be provided. Contents of the invention [0006] An object of the present invent...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F40/284
CPCG06F16/35
Inventor 孙昌青姚文清
Owner ALIBABA GRP HLDG LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More