Similar text calibration method
A calibration method and text technology, applied in the fields of instruments, electrical digital data processing, character and pattern recognition, etc., can solve the problems of disordered order and poor processing effect, and achieve the effect of improving accuracy and efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0047] The present invention will be further described below in conjunction with specific examples.
[0048] Such as figure 1 As shown, the similar text marking method provided in this embodiment includes the following steps:
[0049] Step 1, denoise the document and generate the original fingerprint vector, as follows:
[0050] Remove symbols that do not affect semantics in documents through regular expressions;
[0051] Case conversion, which uniformly converts the letters contained in the document to lowercase;
[0052] Replace the variables involved in the document with meaningless variable names;
[0053] Record the position information of each character before preprocessing after document preprocessing;
[0054] Use the k-gram method to generate original fingerprint vectors from the processed documents, and record the text position represented by each original fingerprint.
[0055] Step 2. Sampling the original fingerprint vector formed in step 1 to form a new sampl...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com

