Supercharge Your Innovation With Domain-Expert AI Agents!

Corpus classification method and device, computer equipment and storage medium

A classification method and corpus technology, applied in computer parts, computing, neural learning methods, etc., can solve problems such as error rate and keyword classification, and achieve the effect of improving efficiency, improving accuracy, and avoiding low classification accuracy.

Pending Publication Date: 2019-06-18
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the prior art, another classification method adopted is to classify and mark the corpus by extracting keywords in the corpus, and to classify the corpus by detecting whether the corpus has the same or similar keywords as the set classification items labeling, but each word has different meanings in different real user corpus contexts, therefore, there is a high error rate in keyword classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Corpus classification method and device, computer equipment and storage medium
  • Corpus classification method and device, computer equipment and storage medium
  • Corpus classification method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0076] In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention.

[0077] In some processes described in the specification and claims of the present invention and the above-mentioned drawings, a plurality of operations appearing in a specific order are contained, but it should be clearly understood that these operations may not be performed in the order in which they appear herein Execution or parallel execution, the serial numbers of the operations, such as 101, 102, etc., are only used to distinguish different operations, and the serial numbers themselves do not represent any execution order. Additionally, these processes can include more or fewer operations, and these operations can be performed sequentially or in parallel. It should be n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a corpus classification method and device, computer equipment and a storage medium. The method comprises: acquring to-be-classified corpus information; Performing word segmentation processing on the corpus information to generate a word segmentation set, and converting the word segmentation set into a word vector matrix; Inputting the word vector matrix into a corpus classification model; And reading a classification result of the word vector matrix output by the corpus classification model, and recording a classification target represented by the classification result as a classification mark of the corpus information. Due to the fact that the classification corpus model is a neural network model trained to be in a convergence state, information expressed by corpus information expressed by the word vector matrix can be accurately analyzed, then the word vector matrix is classified to obtain a classification result, and the corpus information is classified and marked according to the classification result. And the corpus information is classified through the neural network model, so that the corpus information classification efficiency canbe improved.

Description

technical field [0001] The embodiments of the present invention relate to the field of language processing, in particular, a corpus classification method, device, computer equipment and storage medium. Background technique [0002] Corpus, that is, language material, is the content of linguistic research, and corpus is the basic unit that constitutes a corpus. The corpus stores the language materials that have actually appeared in the actual use of the language; the corpus is the basic resource of language knowledge carried by the computer; the real corpus needs to be processed (analyzed and processed) before it can become a useful resource. [0003] In the existing technology, the classification of corpus has become an important basic research of language processing or AI language interaction. Accurate classification of corpus can accurately grasp the semantics expressed by the corpus, and then execute the instructions represented by the corpus or reply. In the prior art,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06F16/35G06N3/04G06N3/08G06K9/62
CPCY02D10/00
Inventor 谭莹胡小露许开河王少军
Owner PING AN TECH (SHENZHEN) CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More