Method and device for generating spelling error correction model and method and device for spelling error correction

An error correction model and model technology, applied in character and pattern recognition, instruments, calculations, etc., can solve problems such as dependence, high labor cost, and lack of generalization ability, so as to improve understanding ability, reduce error correction phenomenon, and improve generalization effect

Pending Publication Date: 2022-08-05
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional spelling error correction technology based on rule matching combined with sorting models relies heavily

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for generating spelling error correction model and method and device for spelling error correction
  • Method and device for generating spelling error correction model and method and device for spelling error correction
  • Method and device for generating spelling error correction model and method and device for spelling error correction

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0020] Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding and should be considered as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.

[0021] In this embodiment, "first" and "second" are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance or implying the number of indicated technical features. Thus, a feature delimited with "first", "second" may expressly or implicitly include at least one of that feature.

[0022] figure 1 A process 100 of a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a spelling error correction model generation method and device, relates to the technical field of artificial intelligence, in particular to the technical fields of deep learning, natural language processing and the like, and can be applied to scenes such as OCR (Optical Character Recognition) and the like. According to the specific implementation scheme, an error correction sample set comprising at least one error correction sample is obtained; based on the error correction sample set, performing spelling error correction training on a pre-trained text recognition model to obtain a to-be-corrected error correction model; selecting low-frequency samples including low-frequency vocabularies from the error correction sample set to obtain a low-frequency sample set; and based on the low-frequency sample set, performing spelling error correction training on the to-be-tuned error correction model to obtain a spelling error correction model. According to the embodiment, the generalization ability of the spelling error correction model to spelling errors is improved.

Description

technical field [0001] The present disclosure relates to the technical field of artificial intelligence, in particular to the technical fields of deep learning and natural language processing, and can be applied to scenarios such as OCR, and in particular to a method and device for generating a spelling error correction model, a spelling error correction method and device, electronic equipment, Computer-readable media and computer program products. Background technique [0002] The spelling error correction system is designed to automatically identify misspelled words in text based on natural language processing technology, and give corresponding correction suggestions. The traditional spelling error correction system mostly adopts the technical route of rule matching combined with sorting model: rule matching is recalled based on dictionary resources and edit distance, and the recalled candidates are scored through feature extraction and input sorting model to form error co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/232G06F40/216G06F40/30G06K9/62
CPCG06F40/232G06F40/216G06F40/30G06F18/22
Inventor 马芸桂睿曹宇慧黄硕陈永锋
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products