Method for automatically identifying word repetition errors
A technology for automatic recognition and word recognition, which is applied in the fields of electrical digital data processing, natural language data processing, instruments, etc., and can solve the problem of repeated words and words that are not dealt with separately.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0043] The present invention will be described in further detail below in conjunction with the examples and accompanying drawings, and the following examples do not limit the present invention.
[0044] A method for automatic recognition of word repetition errors provided by the present invention, the method comprises the following steps:
[0045] After segmenting the large-scale training corpus, statistically obtain the binary and triplet structures of repeated words in the training corpus, as well as the degree of repetition combination, the information entropy of the adjacent words in the left upper context and the information entropy of the adjacent words in the right lower context. ;
[0046] The steps of counting and collecting words containing repeated characters in the Chinese dictionary and establishing a Chinese dictionary repeated word thesaurus;
[0047] The step of judging the repeated words appearing in the text to be checked based on the repeated words in the C...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



