Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text processing method and device, electronic equipment and readable storage medium

A text processing and text technology, applied in the fields of devices, electronic equipment, readable storage media, and text processing methods, can solve problems such as unsatisfactory pre-training effects, improve retraining efficiency, save manpower and time costs, and reduce labeling Quantity effect

Active Publication Date: 2021-12-07
TENCENT TECH (SHENZHEN) CO LTD
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although there are already some solutions to improve the expression ability of the pre-training model through pre-training, the current methods all require a large amount of labeled data to achieve the pre-training task, and the existing pre-training task and downstream tasks usually have problems. The large difference leads to the unsatisfactory pre-training effect, and the existing pre-training methods still need to be improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method and device, electronic equipment and readable storage medium
  • Text processing method and device, electronic equipment and readable storage medium
  • Text processing method and device, electronic equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present application, and are not construed as limiting the present application.

[0038] Those skilled in the art will understand that unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the specification of the present application refers to the presence of the features, integers, steps, operations, elements and / or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, elements, components, and / or groups thereof. It will be under...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a text processing method and device, electronic equipment and a readable storage medium, and relates to the field of artificial intelligence, and the method comprises the steps: obtaining a plurality of initial texts, and constructing a training data set based on the plurality of initial texts; training the initial neural network model based on the training data set until a value of a target function of the model meets a training ending condition; using the feature extraction module after training is finished as a text feature extraction module of a text similarity judgment model, training the text similarity judgment model again, obtaining a final text similarity judgment model, and the text similarity judgment model comprises a text feature extraction module and a second text similarity prediction module which are cascaded. The training data set constructed in the embodiment of the invention is weak supervision text similar corpora, and label data does not need to be manually labeled, so that the manpower and time cost can be effectively saved.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, specifically, the present application relates to a text processing method, device, electronic equipment and readable storage medium. Background technique [0002] With the development of natural language processing technology, pre-trained language models are widely used in downstream tasks such as text matching, information extraction, and intelligent question answering due to their effectiveness. Although there are already some solutions to improve the expression ability of the pre-training model through pre-training, the current methods all require a large amount of labeled data to achieve the pre-training task, and the existing pre-training task and downstream tasks usually have problems. The large difference leads to the unsatisfactory effect of pre-training, and the existing pre-training methods still need to be improved. Contents of the invention [0003] The pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/194G06F40/30G06F16/35G06N3/04G06N3/08
CPCG06F40/194G06F40/30G06F16/35G06N3/04G06N3/08Y02D10/00
Inventor 杨威霍腾飞刘志强张金超
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products