Semantic checking method and system for multi-language mixed text

A technology of mixing text and language text, applied in the computer field, can solve problems such as unreliable audit results, and achieve the effect of overcoming translation difficulties

Pending Publication Date: 2021-07-23
SHANGHAI JILIAN NETWORK TECH CO LTD
View PDF15 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the deficiencies of the prior art, the present invention provides a semantic review method and system for multilingual mixed texts, which has the advantage of improving the accuracy of the review results and solves the problem of unreliable review results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic checking method and system for multi-language mixed text
  • Semantic checking method and system for multi-language mixed text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative work all belong to the protection scope of the present invention.

[0035] see Figure 1-2 , a semantic review method for multilingual mixed texts, including the following steps:

[0036] S1. Identify the primary and secondary languages, and extract the translation secondary language;

[0037] S2. MLM model prediction;

[0038] S3, preferred replacement of translation candidate results;

[0039] S4. Main language semantic review.

[0040] In this embodiment, specifically, the S1 includes:

[0041] S11. Input the tex...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of computers, and discloses a semantic checking method and system for multi-language mixed text. The method comprises the following steps: S1, identifying a main language and an auxiliary language, and extracting a translation auxiliary language; S2, conducting MLM model prediction; S3, preferably replacing translation candidate results; and S4, performing semantic checking on the main language. By designing an MLM model prediction module and a translation candidate result optimization module and using part-of-speech tagging screening and word vector similarity comparison technologies, a result really conforming to the current context can be accurately screened from a plurality of translation results of polysemy words, correct expression of the translation results to original meaning is ensured, and the accuracy of the translation results is improved; and therefore, reliable input is provided for a semantic checking model.

Description

technical field [0001] The present invention relates to the technical field of computers, in particular to a semantic review method and system for multilingual mixed texts. Background technique [0002] The information age has spawned many online social platforms such as Weibo, chat forums, video barrage, etc. These online platforms continue to generate a large amount of user interaction data such as text data and video data every day, which enriches people's spiritual life. At the same time, it also brings difficulties to effective information review and supervision. [0003] In this situation, the need to use algorithms to automatically and accurately review data is increasingly urgent. Taking the semantic review of text data as an example, it is usually necessary to use a trained model (such as a deep learning model obtained after fine-tuning a downstream task using a pre-trained model such as BERT) to judge whether the text as a whole violates the rules. However, the ne...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06F40/30
CPCG06F40/58G06F40/30
Inventor 王晓平
Owner SHANGHAI JILIAN NETWORK TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products