Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text feature extraction method and device, computer equipment and storage medium

An extraction method and feature extraction technology, applied in the field of data analysis, can solve the problems of lowering the accuracy of the classification results of text features, ignoring the integrity of the text content, etc., and achieve the effect of improving the accuracy.

Pending Publication Date: 2021-09-28
PINGAN INT SMART CITY TECH CO LTD
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] This application provides a text feature extraction method, device, computer equipment, and storage medium, which solves the problem that the BERT model performs multi-dimensional feature extraction for each character in the text feature extraction process, ignoring the integrity of the text content, resulting in The technical problem that the accuracy rate of the classification result of the output text feature is reduced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text feature extraction method and device, computer equipment and storage medium
  • Text feature extraction method and device, computer equipment and storage medium
  • Text feature extraction method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.

[0025] The text feature extraction method provided by the embodiment of the present application can be applied in such as figure 1 shown in the application environment. Such as figure 1 As shown, the client (computer device) communicates with the server through the network. Wherein, the client (computer device) includes but is not limited to various personal computers, notebook computers, smart phones, tablet computers, cameras and portable wearable devices. The server ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of data analysis, and discloses a text feature extraction method and device, computer equipment and a storage medium, and the method comprises the steps: obtaining a text file, segmenting the text content of the text file through a word segmentation device, and obtaining a character sequence; performing feature extraction on the character sequence through N feature extraction layers of a pre-trained BERT model to obtain N feature vectors, wherein N is an integer greater than 1; and fusing the N feature vectors based on an attention mechanism to obtain a fusion vector, wherein the fusion vector is used for describing the text features of the text file. According to the method, the obtained N feature vectors are fused, and the N feature vectors can perform feature description on the text file from different angles, so that the obtained fusion vector can more comprehensively expropriate the text features of the text file, and the accuracy of the BERT model in a classification task is improved.

Description

technical field [0001] The present application relates to the technical field of data analysis, and in particular to a text feature extraction method, device, computer equipment and storage medium. Background technique [0002] At present, the application of natural language processing (NLP) has been popularized in various fields, and one of the main reasons for the popularity of natural language processing is due to the use of pre-trained models in natural language processing. To solve the problem that the natural language processing model needs a lot of training before use, the pre-trained model can adapt to perform different natural language processing operations on different data sets, without the need to build a model from scratch. [0003] Among the common pre-training models, the appearance of the BERT model significantly improves the accuracy of classification tasks in NLP, and the BERT model can be pre-trained through unlabeled sample files. However, in the existin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/35G06F40/126G06F40/279G06F40/30G06K9/20G06K9/34G06K9/62
CPCG06F16/3344G06F16/35G06F40/126G06F40/279G06F40/30G06F18/214G06F18/253
Inventor 吴晓东
Owner PINGAN INT SMART CITY TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products