Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

62 results about "Longest common subsequence problem" patented technology

The longest common subsequence (LCS) problem is the problem of finding the longest subsequence common to all sequences in a set of sequences (often just two sequences). It differs from the longest common substring problem: unlike substrings, subsequences are not required to occupy consecutive positions within the original sequences. The longest common subsequence problem is a classic computer science problem, the basis of data comparison programs such as the diff utility, and has applications in computational linguistics and bioinformatics. It is also widely used by revision control systems such as Git for reconciling multiple changes made to a revision-controlled collection of files.

Measuring system for similarity between different tracks and measuring method for measuring system

The invention discloses a measuring system for similarity between different tracks. The system comprises a track data file uploading module and a calculation result visualization module, wherein the track data file uploading module passes a data preprocessing module and obtains a most similar section construction module, and lastly, the most similar section construction module is transmitted to a user through the calculation result visualization module. A measuring method for the measuring system for similarity between the different tracks comprises the following steps of 1.uploading data; 2.judging whether a track data file is legal, if yes, carrying out preprocessing, if no, carrying out error display; 3.evaluating similarity between the tracks through a module for calculating the similarity between the tracks; 4.searching two most similar tracks through the most similar section construction module; and 5.displaying a calculation result through the calculation result visualization module. According to the system and the method, the result is more accurate, and meanwhile, the most similar sections between the tracks are constructed by utilizing a longest common subsequence method.
Owner:HOHAI UNIV

Text paragraph identification comparison method and system based on longest common subsequence

ActiveCN108734110ATo achieve the purpose of comparisonSolve the problem of comparisonCharacter and pattern recognitionLongest common subsequence problemTheoretical computer science
The invention discloses a text paragraph identification comparison method and a text paragraph identification comparison system based on a longest common subsequence. The text paragraph identificationcomparison method comprises the steps of acquiring a first text character string and a second text character string; performing paragraph identification on the first text character string and the second text character string; performing paragraph order adjustment on the first text character string and the second text character string; and comparing the first text character string and the second text character string which are subjected to paragraph order adjustment to obtain a difference item. The text paragraph identification comparison system comprises a front end, a conversion module, a paragraph identification module and a comparison module. With the text paragraph identification comparison method and the text paragraph identification comparison system based on the longest common subsequence, the problems that texts whose paragraph information cannot be acquired cannot be compared and the paragraph adjustment situation cannot be processed well in an existing text comparison tool are solved.
Owner:DATAGRAND TECH INC

Method and system for automatically extracting virus characteristics based on family samples

The invention provides a method and a system for automatically extracting virus characteristics based on family samples. According to the method and the system, a longest public subsequence algorithm is modified, a sequence A and a sequence B are established by using samples in the family samples, Hash values of subsequences with lengths equal to preset values in the sequence A and the sequence B are calculated respectively through preset feature code lengths, and the Hash values of the subsequences in the sequence A and the sequence B are matched through a red black tree manner, if the Hash values are same, the subsequences corresponding to the Hash values are public subsequences of the sequence A and the sequence B, and the public subsequences are feature codes of the family samples; and when surplus samples are taken as the sequence B and searched in a red black tree, feature codes of all family samples are obtained and combined into a feature set of the family samples, a weighting model is evaluated according to qualities of the established feature codes, the qualities of the established feature codes are judged, and the feature codes of the family samples are determined. According to the method, the time complexity of the algorithm is simplified, and the extraction efficiency and the accuracy of the feature codes are improved.
Owner:HARBIN ANTIY TECH

Analysis and integration method and device for sequencing of medium-short gene segment

The present invention provides an analysis and integration method and device for sequencing of a medium-short gene segment. The method comprises: checking a read sequence and removing gene sequences comprising errors and unreliable information; reading processed read data, analyzing the data and constructing a k-mer structure and a quad-tree structure; constructing an integration storage table and recording the progress condition of the integration process and read information which currently participates in integration; after selecting initial k-mer to start to carry out integration, continuously selecting subsequent k-mer according to an integration scoring formula, and updating the information in the integration storage table structure in real time so as to obtain contig sequences; and combining the contig sequences on the basis of a longest common subsequence method by utilizing read-pair information and generating and outputting super-contig. Aiming at the special requirements of the integration method for performance, the device provided by the present invention is of an embedded handheld structure; and by utilizing the method and the device which are provided by the present invention, analysis and integration on sequencing of the medium-short gene segment can be rapidly and accurately implemented.
Owner:XI AN JIAOTONG UNIV

Method and apparatus for obtaining similar trademarks, computer device and storage medium

The present application relates to a method and apparatus for obtaining similar trademarks, a computer device and a storage medium. The method is based on the longest common subsequence, the longest common string, and the edit distance to determine the similarity of the character string integrity of the word mark and the prior trademark, thereby screening out similar trademarks with higher similarity and treating them again. The similarity between the word and the glyph is determined by the difference characters between the character trademark to be detected and the similarly preceding trademark. the comprehensive similarity is calculated by the overall character string judgment result and the character independent judgment result, and the prior trademark having high similarity degree is fed back to the user, the quick acquisition of the approximate trademark of the trademark to be detected is realized, and the efficiency of the approximate trademark search is improved. At the same time, the approximation judgment between the detected word mark and the prior trademark is synthesized by the overall judgment result of the string and the independent judgment result of the character. The results are determined to effectively improve the accuracy of the approximate trademark search.
Owner:PING AN TECH (SHENZHEN) CO LTD

Literature classification method and system based on trie and LCS algorithm

The invention discloses a document classification method based on trie and LCS algorithm, comprising the following steps: step 1, pre-compiling an initial classification dictionary and an initial exclusion dictionary; 2, extending each character string in the initial classification dictionary to obtain an extended character string, filtering the obtained extended character string according to theinitial exclusion dictionary, and constructing a dictionary tree; 3, calling that dictionary tree to look up all the strings appear in each sentence in the literature to be classified, taking the longest character string in the initial classification dictionary as the longest common subsequence, and taking the longest common subsequence and its corresponding class as the final character string andfinal class of the sentence, and taking the final class which appears most frequently in a document as the class to which it belongs. The invention also discloses a document classification system based on trie and LCS algorithm. The invention omits the word segmentation process, takes the stable character string as the characteristic, has high accuracy, and reduces the dependence on the context.
Owner:CHINA PETROLEUM & CHEM EXPLORATION & PRODION RES INST +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products