Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Sentence vector model-based sentence vector generation method and apparatus, and computer device

A sentence vector and model technology, applied in the sentence vector generation method based on the sentence vector model, computer equipment and storage media, and the field of devices, can solve the problems of affecting the accuracy of the sentence vector, loss of sentence semantic information, and difficulty in obtaining it, so as to reduce the Semantic loss, the effect of avoiding semantic loss

Pending Publication Date: 2021-11-30
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Some of the existing sentence vector generation technologies simply separate the words in the sentence, convert the words into word vectors, and then calculate the average value to obtain the sentence vector. However, this will lose the semantic information in the sentence and affect the accuracy of the sentence vector. characteristics; some generate sentence vectors through supervised learning, but in reality it is difficult to obtain a large amount of labeled text corpus for supervised learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sentence vector model-based sentence vector generation method and apparatus, and computer device
  • Sentence vector model-based sentence vector generation method and apparatus, and computer device
  • Sentence vector model-based sentence vector generation method and apparatus, and computer device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by those skilled in the technical field of the application; the terms used herein in the description of the application are only to describe specific embodiments The purpose is not to limit the present application; the terms "comprising" and "having" and any variations thereof in the specification and claims of the present application and the description of the above drawings are intended to cover non-exclusive inclusion. The terms "first", "second" and the like in the description and claims of the present application or the above drawings are used to distinguish different objects, rather than to describe a specific order.

[0044] Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present application. The occurrenc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention belongs to the field of artificial intelligence, and relates to a sentence vector model-based sentence vector generation method and device, computer equipment and a storage medium, and the method comprises the steps of obtaining TF-IDF information of each initial text in an initial text set to determine a target adjustment word, and adjusting the initial text based on the target adjustment word to generate a similar text; inputting the initial text into an initial sentence vector model to obtain an initial sentence vector, and inputting the similar text into the initial sentence vector model to obtain a similar sentence vector; taking the similar sentence vector as a positive sample of the current initial sentence vector, taking other initial sentence vectors and the similar sentence vector as a negative sample of the current initial sentence vector, and performing comparative learning on the initial sentence vector model to obtain a sentence vector model; and inputting the to-be-processed text into the sentence vector model to obtain a sentence vector of the to-be-processed text. In addition, the invention also relates to a block chain technology, and the initial text set can be stored in a block chain. According to the invention, the accurate and available sentence vectors can be efficiently generated.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, in particular to a sentence vector generation method, device, computer equipment and storage medium based on a sentence vector model. Background technique [0002] Sentence embedding is one of the hot research fields in natural language processing in recent years. Sentence vectors can be obtained by mapping the words, tokens, and semantic information between words in a sentence to a quantifiable space. The generated sentence vectors are usually provided to downstream tasks for further processing, such as similarity calculation, classification and clustering based on sentence vectors. [0003] Some of the existing sentence vector generation technologies simply separate the words in the sentence, convert the words into word vectors, and then calculate the average value to obtain the sentence vector. However, this will lose the semantic information in the sentence and affe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/33G06F40/211G06F40/289G06F40/30G06K9/62
CPCG06F40/30G06F40/289G06F40/211G06F16/3344G06F18/214Y02D10/00
Inventor 陈浩谯轶轩
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products