Error correction dictionary creation method and device, terminal and computer storage medium

A computer storage and construction method technology, applied in the field of devices, error correction dictionary creation methods, equipment and computer storage media, can solve the problems of few vocabulary and low accuracy, and achieve the effect of increasing the number and improving construction efficiency

Pending Publication Date: 2020-01-31
TENCENT MUSIC ENTERTAINMENT TECH SHENZHEN CO LTD
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the embodiments of the present invention is to provide a method for constructing an error corr...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Error correction dictionary creation method and device, terminal and computer storage medium
  • Error correction dictionary creation method and device, terminal and computer storage medium
  • Error correction dictionary creation method and device, terminal and computer storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to enable those skilled in the art to better understand the solutions of the present invention, the following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is an embodiment of a part of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts shall fall within the protection scope of the present invention.

[0034] The terms "first", "second" and "third" (if any) and the like in the description and claims of the present invention and the above drawings are used to distinguish different objects, rather than to describe a specific order. Furthermore, the term "comprise", as well as any variations thereof, is intended to cover a non-exclus...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides an error correction dictionary construction method and device, a terminal and a computer storage medium, and the method comprises the steps: obtaining retrieval data and parallel corpora corresponding to the retrieval data; expanding the retrieval data and the parallel corpus to obtain a retrieval vocabulary set and a corpus vocabulary set; querying vocabulary pairs with a mapping relationship from the retrieval vocabulary set and the corpus vocabulary set; and constructing an error correction dictionary corresponding to the retrieval data according tothe vocabulary pairs. By adopting the embodiment of the invention, the problems of fewer vocabulary pairs, lower construction efficiency and the like in an error correction dictionary in the prior artcan be solved.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a method, device, equipment and computer storage medium for creating an error correction dictionary. Background technique [0002] The corpus is widely used in the field of linguistics research. It is the basic resource of linguistics research and the foundation of modern machine speech translation, semantic understanding and other research. In particular, the parallel corpus is a typical application of the corpus, which is mainly composed of the original text and its parallel corresponding target language text. At present, the neural machine translation model (mostly based on the encoder-decoder model) is usually used to compile the source sentence input by the user into a fixed-length target sentence, so as to construct a parallel corpus (also called an error correction dictionary). The error correction dictionary includes sentence pairs (or word pairs) with a mapping rel...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/232G06F40/242G06F16/332G06F16/35
CPCG06F16/332G06F16/35
Inventor 吴上权周蓝珺潘树燊
Owner TENCENT MUSIC ENTERTAINMENT TECH SHENZHEN CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products