Multi-language text fine-granularity accurate alignment method based on depth semanteme

A fine-grained, multi-language technology, applied in semantic analysis, text database query, unstructured text data retrieval, etc., can solve the impact of fine-grained alignment accuracy, impact on fine-grained alignment accuracy, and low efficiency of student comprehension, etc. problems, to achieve the effect of speeding up understanding, improving accuracy, and improving accuracy

Active Publication Date: 2021-08-06
LUDONG UNIVERSITY
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] When performing fine-grained alignment between two or more texts in multiple languages, due to the differences in grammar and expression habits between languages, there are certain deviations in the calculation of semantics after mutual translation, and then alignment, resulting in The accuracy of fine-grained alignment is affected. When performing fine-grained alignment between two or more texts in the same language, due to differences in polysemous words or emotional words with strong subjectivity, when word embedding features are used for semantic calculation, Some of the results of fine-grained alignment are difficult to align, which also affects the accuracy of fine-grained alignment. In addition, in the prior art, the results of fine-grained alignment are usually only displayed through data. In special occasions, especially in colleges and universities, When teaching this aspect, the results are relatively abstract, resulting in low efficiency for students to understand

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-language text fine-granularity accurate alignment method based on depth semanteme
  • Multi-language text fine-granularity accurate alignment method based on depth semanteme
  • Multi-language text fine-granularity accurate alignment method based on depth semanteme

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0051] see figure 1 , a method for fine-grained precise alignment of multilingual texts based on deep semantics, characterized in that it includes the following steps:

[0052] S1. First extract the word embedding features in two or more texts of the target, construct a corresponding fine-grained light array according to the word embedding features, and input them into the neural network to calculate the semantic similarity of words, and for words with the same or high similarity Semantics are aligned, and the light balls in the fine-grained light array corresponding to the aligned word embedding features are lit;

[0053] S2. Extract the sentence where the unaligned word embedding feature is located, obtain the sentence embedding feature, and input it into the neural network to calculate the semantic similarity of the sentence, align the semantics of the same or high similarity sentence, and match the corresponding word embedding feature The light balls in the fine-grained l...

Embodiment 2

[0057] A fine-grained and precise alignment method for multilingual texts based on deep semantics, characterized in that it includes the following steps:

[0058] S1. According to the title, extract the article embedding features in two or more texts of the target, and input them into the neural network to calculate the article semantic similarity, align the article semantics with the same or high similarity, and align the unaligned articles Marking in fine-grained light arrays;

[0059] S2. Extract paragraph embedding features from unaligned chapters, and input them into the neural network to calculate the semantic similarity of paragraphs, align the semantics of paragraphs with the same or high similarity, and mark unaligned paragraphs in the fine-grained light array ;

[0060] S3. Segment the unaligned paragraphs according to the punctuation marks, extract the sentence embedding features, and input them into the neural network to calculate the semantic similarity of the se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-language text fine-granularity accurate alignment method based on depth semanteme, and belongs to the field of text alignment, the multi-language text fine-granularity accurate alignment method based on depth semanteme comprises the following steps: extracting word embedding features, gradually calculating words, statements, paragraphs and semantic similarity of the paragraphs, consequently, gradually improving accuracy of text fine-grained alignment. Meanwhile, in the alignment process, through construction of a fine-grained lamp array, after each step, an inflatable tube is expanded and stretched in the radial direction and the longitudinal direction through inflation, then fluorescent liquid enters a photomask, lamp balls corresponding to aligned word embedding characteristics are lightened, and when the light ball is lighted for the second time or the last several times, the inflation time is gradually prolonged, the downward extension amount of the lighted light ball is increased, and the brightness is increased, so that the improvement of the accuracy of fine-grained alignment is more obvious in visual representation after each step, and the students are further assisted in accelerating the understanding speed of the content.

Description

technical field [0001] The invention relates to the field of text alignment, and more specifically, to a method for fine-grained and precise alignment of multilingual texts based on deep semantics. Background technique [0002] Entity linking is the process of mapping entity references in natural language to correct candidate entities in a knowledge base. In layman's terms, the fine-grained model is to subdivide the objects in the business model to obtain a more scientific and reasonable object model. Intuitively speaking, it is to divide many objects. [0003] The memory space in modern computers is divided by byte. In theory, it seems that access to any type of variable can start from any address, but the actual situation is that when accessing a specific variable, it is often accessed at a specific memory address, which is All types of data need to be arranged in space according to certain rules, rather than sequentially arranged one by one, which is alignment. [0004]...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F40/30G09F19/10
CPCG06F16/3344G06F16/3335G06F40/30G09F19/10
Inventor 刘伍颖
Owner LUDONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products