Information processing method and device
An information processing method and algorithm technology, applied in the computer field, can solve problems such as error-prone, and achieve the effect of improving accuracy and eliminating ambiguous words
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0058] figure 1 It is a flowchart of the information processing method provided by the first embodiment of the present invention, specifically including the following steps:
[0059] Step 101, judging whether there are ambiguous words in the text to be processed.
[0060] For example, according to the database of ambiguous words, it is judged whether there are ambiguous words in the text to be processed. The database of ambiguous words can include ambiguous words and word segmentation rules corresponding to the ambiguous words. Text in English letters.
[0061] Step 102, when there are ambiguous words in the text to be processed, split the ambiguous words from the text to be processed.
[0062] For example, when the ambiguous word is located in the middle of the text to be processed, the text to be processed can be split into three parts, the ambiguous word, the part on the left side of the ambiguous word and the part on the right side of the ambiguous word, when the When t...
no. 2 example
[0069] This embodiment adds the following on the basis of the above embodiments figure 2 steps shown.
[0070] Step 201, split the received information according to the character code, punctuation mark, and name database to obtain the text to be processed.
[0071] For example, the received information may be in Chinese, or a combination of at least one of Chinese and English, numbers and punctuation marks. The text to be processed is the text split from the received information.
[0072] After receiving the information to be processed, the received information can be split into Chinese clauses and / or English words and / or number strings according to the character code and punctuation marks, for example, the received information is "hello Zhang San, Li Si Where did you go?”, after this step, it can be split into “hello”, “Zhang San”, and “Where did Li Si go”. Then, according to the name database, the name of the person in the split Chinese clause is recognized. The recognit...
no. 3 example
[0092] On the basis of the above-mentioned embodiments, this embodiment adds the following Figure 4 steps shown.
[0093] Step 401. Combine the split results and the results obtained after splitting the split ambiguous words to obtain a word segment set, and the word segments in the word segment set are arranged according to their positions in the text to be processed .
[0094] Merge the split result obtained by the forward or reverse maximum matching algorithm and the split result obtained by the first embodiment, the way of merging can be the ambiguous word part in the split result obtained by the forward or reverse maximum matching algorithm Split according to the method provided in the first embodiment, and keep other parts unchanged.
[0095] Step 402, when the word segmentation set contains continuous words, judge whether the continuous words contain low-probability words according to the low-probability word database, and if so, synthesize the continuous words to the ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com