Text error detection method and device
A text and error detection technology, applied in the field of text processing, can solve problems that affect semantic understanding or intent classification accuracy, low error detection accuracy, and low error detection accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0154] This embodiment provides a kind of text error detection method, and this method utilizes the corpus that stores correct text, detects and obtains the error character in the text to be detected (namely following target error character), compared with prior art, can effectively improve Detection accuracy and adaptability of text error detection. Specifically, such as figure 1 As shown, the text error detection method of this implementation includes:
[0155] S110. Obtain information about a client that generates the text to be detected.
[0156] Here, the client information includes the category of the client generating the text to be detected, the identifier of the client generating the text to be detected, the category of the client associated with the client generating the text to be detected, the type of client associated with the client generating the text to be detected The user's identifier and other information.
[0157] S120. Select a corpus that matches the c...
Embodiment 2
[0172] This embodiment provides a text error detection method. Based on the previous embodiment, the method proposes a specific implementation manner of screening target error characters from the target suspected words. Such as figure 2 As shown, the text error detection method in this implementation includes as follows:
[0173] S210. Based on the corpus storing the correct text, screen suspected wrong words and suspected wrong characters from the text to be detected.
[0174] S220. Obtain the vocabulary to which each suspected wrong character belongs from the text to be detected, and filter the vocabulary belonging to the suspected wrong vocabulary from the acquired vocabulary to obtain a target suspected vocabulary.
[0175] S230. Based on the probability of each target suspected word appearing in the current position of the text to be detected, screen target wrong words from the target suspected words.
[0176] Here, the target wrong vocabulary is obtained by rationally...
Embodiment 3
[0190] This embodiment provides a text error detection method. On the basis of any of the above embodiments, this embodiment proposes a specific implementation manner of screening suspected wrong words and suspected wrong characters from the text to be detected. Such as image 3 As shown, the text error detection method of the present embodiment includes:
[0191] S310. Acquire the corpus and the text to be detected.
[0192] S320. Based on the co-occurrence probability of every two characters and the co-occurrence probability of every two words in the corpus, screen suspected wrong words and suspected wrong characters from the text to be detected.
[0193] Here, the correct text is stored in the corpus, so using the probability of co-occurrence of every two characters and the co-occurrence probability of every two words in the corpus, the co-occurrence probability of every two characters or every two words in the text to be detected can be calculated. The probability of occ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com