Check patentability & draft patents in minutes with Patsnap Eureka AI!

Model training method and device and text similarity determination method and device

A text similarity and model training technology, which is applied in character and pattern recognition, instruments, electrical digital data processing, etc., can solve the problem of model accuracy reduction

Active Publication Date: 2021-02-05
HUNDSUN TECH
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] With the rapid development of the Internet and artificial intelligence technology, similar search and question and answer based on natural language have become necessary skills for major websites, APP (Application, application program), intelligent customer service system, etc. The text similarity model is the key model in the process. The text similarity model is used to judge whether two texts are similar and the degree of similarity between the two texts. However, the current text diversification reduces the model accuracy of the text similarity model.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and device and text similarity determination method and device
  • Model training method and device and text similarity determination method and device
  • Model training method and device and text similarity determination method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Through research on the current text similarity model, the applicant found that the training difficulty of the text similarity model mainly focuses on two aspects, one is the model training and model parameter adjustment for multiple text similarity annotations, the other is similar The robustness problem of degree metric definition.

[0038] Among them, regarding the model training and model parameter adjustment for multiple text similarity data annotations, the current text similarity annotation includes two text annotation methods, one is triple annotation form (text A, text P, text N), which satisfies The similarity between text A and text P is greater than that between text A and text N. Generally, the text similarity model is trained by constructing a triplet loss function. The other is 0-1 labeling form (text A, text B, label 0 / 1), 0 means that text A is not similar to text B, 1 means that text A is similar to text B, and generally trains text similarity based on...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a model training method and device and a text similarity determination method and device; and the model training method comprises the steps: obtaining a text set for training atext similarity model, enabling the text set to comprise a first text group and a second text group, and enabling all texts in the first text group to be labeled in a first text labeling mode, each text in the second text group being labeled in a second text labeling form; labeling each text in a target text labeling form to unify the labeling forms of each text in the first text group and the second text group; obtaining a loss value of the first text group based on a first loss function, and obtaining a loss value of the second text group based on a second loss function; and determining a loss value of the text group based on the loss value of the first text group and the loss value of the second text group. According to the method, the model parameters of the text similarity model are adjusted by utilizing the loss value of the text group to train the text similarity model, so that the training method of the text similarity model has higher compatibility, and the accuracy of the model is improved.

Description

technical field [0001] The present application belongs to the technical field of artificial intelligence, and in particular relates to a model training method, a method and a device for determining text similarity. Background technique [0002] With the rapid development of the Internet and artificial intelligence technology, similar search and question and answer based on natural language have become necessary skills for major websites, APP (Application, application program), intelligent customer service system, etc. The text similarity model is the key model in the process. The text similarity model is used to judge whether two texts are similar and the degree of similarity between the two texts. However, the current text diversification reduces the model accuracy of the text similarity model. . Contents of the invention [0003] In view of this, the purpose of the present application is to provide a model training method, text similarity determination method and device...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/194G06K9/62
CPCG06F40/194G06F18/22G06F18/214
Inventor 王炯亮娄东方林金曙高峰陈哲许浩
Owner HUNDSUN TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More