A method and system for correctness verification of suffix array and longest common prefix
A correctness verification and longest common technology, which is applied in electrical digital data processing, natural language data processing, instruments, etc., can solve the problem that the correctness verification of the suffix array and the longest common prefix cannot be performed at the same time, so as to reduce the time and space overhead Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0026] The basic idea of this embodiment is: first, after SA and LCPA are constructed, scan SA from left to right to find out all LMS suffixes and calculate the LCP value between adjacent LMS suffixes at the same time; Secondly, according to the definition of LCP, use The fingerprint function calculates the fingerprint value of the longest common prefix of adjacent LMS suffixes, and saves the first character on the right of each longest common prefix, and verifies the correctness of the LMS suffix and its LCP value according to the same fingerprint value and different saved characters. Then, use the LMS suffix and its LCP value to inductively sort the L-type suffix and its LCP, and then use the L-type suffix and its LCP value to inductively sort the S-type suffix and its LCP value. Finally, compare the known SA with the newly calculated SA1 and the known LCPA with the newly calculated LCPA1. If the comparisons of the two groups are exactly the same, it means that SA and LCPA ...
Embodiment 2
[0057] The embodiment of the present invention also provides a suffix array and the longest common prefix correctness verification system, such as figure 2 Schematic diagram of the structure, including:
[0058] File reading and writing module 1, used for reading and writing character strings, SA, SA1, LCPA and LCPA1 files;
[0059] The L / S suffix identification module 2 is used to identify whether the character string suffix type is L type or S type;
[0060] LMS suffix identification module 3, for identifying the LMS suffix of character string;
[0061] LMS suffix and its LCP value correctness verification module 4, the main function of this module is: scan the SA from left to right, obtain the LMS suffix in it and use RMQ to calculate its LCP value, and then use the fingerprint function to calculate the common values of adjacent LMS suffixes The fingerprint value of the prefix, and save the first character to the right of their common prefix. If the fingerprint functi...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com