String similarity determination method, apparatus, program product, and related device
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- TENCENT CLOUD COMPUTING (CHANGSHA) CO LTD
- Filing Date
- 2023-01-12
- Publication Date
- 2026-06-19
AI Technical Summary
Existing edit distance algorithms treat all components in a string equally when calculating string similarity, resulting in strings with significant semantic differences receiving high similarity scores, which affects the accuracy of the calculation.
By introducing semantic comparison into string similarity calculation, words with opposite meanings are identified and processed to ensure that the similarity is zero. Edit distance similarity is calculated after all words have been traversed, thus improving accuracy.
This effectively avoids misjudging strings with completely opposite meanings as similar, improves the accuracy of edit distance similarity calculation, and ensures that the results are more consistent with actual semantic differences.
Smart Images

Figure CN116029304B_ABST