A character string matching method and device
A string matching and string technology, which is applied in the fields of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of inoperability, low efficiency of obtaining similar strings, and complicated implementation.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0090] see figure 1 , is a schematic flowchart of a string matching method provided in this embodiment, the string matching method includes the following steps:
[0091] S101: Obtain a target character string to be matched.
[0092] In this embodiment, it is necessary to match one or more similar character strings based on an existing character string. For the existing character string that needs to be matched, this embodiment defines the existing character string is the target string.
[0093] S102: Determine a target candidate set, where the target candidate set includes a plurality of first candidate character strings.
[0094] In practical applications, in order to achieve string matching, it is necessary to pre-build a string candidate set Cad, which usually includes a large number of candidate strings, so that when it is necessary to perform string matching on the target string, One or more candidate character strings similar to the target character string can be matc...
no. 2 example
[0111] It should be noted that this embodiment will introduce an implementation manner of "determining target candidate sets" in S102 in the first embodiment.
[0112] Step S102 may determine a target candidate set based on a preset edit distance threshold. Edit distance is the cost of completely transforming a string into another string through three operations of insertion, deletion, and replacement. Generally speaking, the smaller the edit distance, the greater the similarity between two strings. In this embodiment, an edit distance threshold can be preset The edit distance threshold can be the maximum number of edits required to convert the target string into a similar string, the edit distance threshold It can be set by the user, or the system default value can be used. The edit distance threshold It is the key parameter to realize the matching operation. Understandably, the edit distance threshold The larger the , the more similar strings are matched from the t...
no. 3 example
[0133] It should be noted that this embodiment will introduce the implementation manner of "determining the character string filtering threshold" in S103 in the first embodiment.
[0134] Step S103 can determine the character string filtering threshold MergThreshold based on the preset edit distance threshold, where the edit distance threshold MergThreshold is the maximum number of edits required to convert the target character string into a similar character string. For the relevant introduction of the edit distance threshold MergThreshold, please refer to the above The second embodiment will not be repeated here.
[0135] In an implementation manner of this embodiment, step S103 may specifically determine a character string filtering threshold according to the length of the target character string and a preset edit distance threshold.
[0136] It should be noted that when the number of slices to be matched in the target character string is smaller, then the filtering conditi...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com