Check patentability & draft patents in minutes with Patsnap Eureka AI!

Text processing method and device, text feature extraction method and device, equipment and medium

A feature extraction and text processing technology, applied in the fields of devices, equipment, text processing methods, media and program products, and text feature extraction methods, can solve the problem of losing regularization and strong coding ability, limited generalization performance, and unsuitable for processing text. Serialized data structure and other issues to achieve the effect of accurate similarity prediction results

Pending Publication Date: 2022-07-29
INDUSTRIAL AND COMMERCIAL BANK OF CHINA
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the problem of gradient dispersion, the structure of the recurrent neural network cannot be too deep in the vertical deep stack, resulting in the loss of the regularization and strong coding capabilities brought by the deep network, and the upper limit of generalization performance is limited, which cannot be improved with the help of large data sets.
While convolutional neural networks are often used for images, they are not suitable for processing serialized data structures of text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method and device, text feature extraction method and device, equipment and medium
  • Text processing method and device, text feature extraction method and device, equipment and medium
  • Text processing method and device, text feature extraction method and device, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] Hereinafter, embodiments of the present disclosure will be described with reference to the accompanying drawings. It should be understood, however, that these descriptions are exemplary only, and are not intended to limit the scope of the present disclosure. In the following detailed description, for convenience of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the present disclosure. It will be apparent, however, that one or more embodiments may be practiced without these specific details. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concepts of the present disclosure.

[0039] In the related art, in addition to text processing based on a recurrent neural network or a convolutional neural network, text processing can be implemented based on a pre-trained model, but there is a limit to the number of input ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a text processing method, and relates to the field of artificial intelligence. The method comprises the following steps: inputting a first text into a pre-trained feature extractor to obtain a first text feature; inputting a second text into the pre-trained feature extractor to obtain a second text feature; obtaining a similarity prediction result between the first text feature and the second text feature; wherein the feature extractor comprises N cyclic encoders, each cyclic encoder is constructed and obtained according to a cyclic memory mechanism and an encoder in a Transform model, and the cyclic memory mechanism is used for processing data according to state information at the previous moment and input information at the current moment. The invention also provides a text feature extraction method, which comprises the following steps of: inputting a third text into the pre-trained feature extractor to obtain a third text feature. The invention further provides a text processing device, a text feature extraction device, equipment, a storage medium and a program product.

Description

technical field [0001] The present disclosure relates to the field of artificial intelligence, and more particularly, to a text processing method, text feature extraction method, apparatus, device, medium and program product. Background technique [0002] Using deep learning methods to process text can be implemented based on a recurrent neural network model or a convolutional neural network model. However, due to the problem of gradient dispersion, the structure of the recurrent neural network cannot be too deep in the vertical deep stack, resulting in the loss of the regularization and strong coding ability brought by the deep network, and the upper limit of the generalization performance is limited. . While convolutional neural networks are often used for images, they are not suitable for processing serialized data structures of text. SUMMARY OF THE INVENTION [0003] In view of the above problems, the present disclosure provides a text processing method, text feature...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/205G06K9/62G06F16/33G06F16/35G06N3/04G06N3/08
CPCG06F40/205G06F16/355G06F16/33G06N3/08G06N3/048G06N3/044G06F18/22
Inventor 林文杰陆杨芳温锐明袁炜尧
Owner INDUSTRIAL AND COMMERCIAL BANK OF CHINA
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More