Chinese character string similarity calculation method and device based on phonetic and morphological codes
A technology of similarity calculation and Chinese characters, applied in other database retrieval, other database query and other directions, can solve the problems of reduced practicability, weakened differences, influence of similarity accuracy, etc., to achieve high conversion efficiency, accurate calculation, more comprehensive Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Benefits of technology
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0042] This embodiment discloses a method for calculating the similarity of Chinese character strings based on phonetic shape codes. The phonetic shape codes include phonetic codes and shape codes. The phonetic codes are composed of digital codes of initials and vowels, and the shape codes are composed of Chinese characters. The four-corner code, the structure code and the number of strokes are composed; the mapping rules of the tone shape code are pre-stored in the database;
[0043] Such as figure 1 As shown, the method includes the following steps:
[0044] Step 1: Receive two strings A and B to be compared;
[0045] Step 2: Read the phonetic shape code mapping rule from the database, and convert each Chinese character in the two character strings into phonetic shape code representation according to the mapping rule;
[0046] Step 3: Calculate the edit distance between the corresponding substrings of the two strings based on the edit distance;
[0047] Step 4: Calculate the similar...
Embodiment 2
[0105] The purpose of this embodiment is to provide a computing device.
[0106] A computing device includes a memory, a processor, and a computer program stored in the memory and capable of running on the processor, the memory prestores the mapping rule of the phonetic shape code, and the processor implements the following steps when the program is executed :
[0107] Receive two strings to be compared;
[0108] Read the phonetic code mapping rules, and according to the mapping rules, each Chinese character in the two character strings is transformed into phonetic code representation; the phonetic code includes phonetic code and shape code, wherein the phonetic code is represented by The initials and vowels are composed of digital codes, and the shape codes are composed of the four-corner code, structure code and number of strokes of Chinese characters;
[0109] Calculate the edit distance between the corresponding substrings of two strings based on the edit distance;
[0110] Calcul...
Embodiment 3
[0112] The purpose of this embodiment is to provide a computer-readable storage medium.
[0113] A computer-readable storage medium, which stores the mapping rule of the phonetic shape code and a computer program for calculating text similarity in advance, and when the program is executed by a processor, the following steps are performed:
[0114] Receive two strings to be compared;
[0115] Read the phonetic code mapping rules, and according to the mapping rules, each Chinese character in the two character strings is transformed into phonetic code representation; the phonetic code includes phonetic code and shape code, wherein the phonetic code is represented by The initials and vowels are composed of digital codes, and the shape codes are composed of the four-corner code, structure code and number of strokes of Chinese characters;
[0116] Calculate the edit distance between the corresponding substrings of two strings based on the edit distance;
[0117] Calculate the similarity of t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com