Model training method and device based on word embedding, electronic equipment and storage medium

A model training and training model technology, which is applied in the fields of electrical digital data processing, character and pattern recognition, instruments, etc., can solve problems such as poor accuracy of matching results, far-flung core ideas, and inability to meet text matching requirements, etc.

Pending Publication Date: 2021-04-16
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present application provides a word embedding-based model training method, device, electronic equipment, and storage medium to solve the problem that only literal matching can be achieved between query items and matching content in the prior art, but the core ideas are far from each other. The accuracy of the matching results is not good enough to meet the text matching requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and device based on word embedding, electronic equipment and storage medium
  • Model training method and device based on word embedding, electronic equipment and storage medium
  • Model training method and device based on word embedding, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] In the following description, specific details such as specific system structures and technologies are presented for the purpose of illustration rather than limitation, so as to thoroughly understand the embodiments of the present application. It will be apparent, however, to one skilled in the art that the present application may be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary detail.

[0027] It should be understood that when used in this specification and the appended claims, the term "comprising" indicates the presence of described features, integers, steps, operations, elements and / or components, but does not exclude one or more other features. , whole, step, operation, element, component and / or the presence or addition of a collection thereof.

[0028] It shou...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is suitable for the technical field of artificial intelligence, and provides a model training method and device based on word embedding, electronic equipment and a storage medium, and the method comprises the steps: carrying out the word segmentation based on a query item and a data matching item matched with the query item, and obtaining a model training word list; inputting a target word in the model training word list into a to-be-trained model, and respectively obtaining vector representations associated with the target word from an initialized word vector matrix, a query item vector matrix and a data matching item vector matrix in the to-be-trained model; splicing the vector representations, and performing feature cross fusion through a to-be-trained model to obtain a target word vector representation; and based on the target word vector representation, obtaining vector representation parameters in the query item vector matrix and the data matching item vector matrix in the to-be-trained model, and obtaining a trained model containing the vector representation parameters. According to the scheme, the accuracy of text matching results can be improved, and text matching requirements are met.

Description

technical field [0001] The present application belongs to the technical field of artificial intelligence, and in particular relates to a word embedding-based model training method, device, electronic equipment and storage medium. Background technique [0002] Text matching has always been one of the hot tasks in the field of information retrieval such as search engines and recommendation systems. It is mainly aimed at query items given by users, using models to match documents with high similarity from the database and returning them to the user. The whole process includes two modes, recall and sort, which are used to match and filter the text and sort the recall results. [0003] In the recall mode, the existing word embedding-based recall method mainly uses the word2vec model to obtain the vector representation of each word, and then sums all the word vectors in the query and document to obtain the average value, and then obtains the vector representation of the query and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/284G06F40/289G06F40/237G06K9/62
CPCG06F40/237G06F40/289G06F40/284G06F18/00
Inventor 陈浩谯轶轩高鹏
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products