High average utility sequence pattern mining method under non-overlapping condition
A non-overlapping, pattern-free technology, applied in data mining, special data processing applications, instruments, etc., can solve problems such as large length, no decision-making significance for stores, and high cost of time and space, and achieve the effect of satisfying practical problems
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0079] Given a piece of DNA sequence S=s 1 the s 2 the s 3 the s 4 the s 5 the s 6 the s 7 the s 8 the s 9 the s 10 the s 11 the s 12 the s 13 = ATTCATCACATCA, the cycle gap is [0, 3], given the minimum average utility threshold minun = 25, the utility value of each item in the character set is as follows image 3 shown.
[0080] The first step is to read the sequence database SDB, the minimum gap min, the maximum gap max and the minimum average utility threshold minun:
[0081] Read into the given sequence database SDB, which contains 1 sequence S=s 1 the s 2 the s 3 the s 4 the s 5 the s 6 the s 7 the s 8 the s 9 the s 10 the s 11 the s 12 the s13 = ATTCATCACATCA, character set is {A, T, C}, minimum gap min=0, maximum gap max=3 and minimum average utility threshold minun=25.
[0082] The second step is to generate a high average utility pattern set and a high upper bound pattern set with a length of 1:
[0083] Calculate the average utility value an...
Embodiment 2
[0138] Given a piece of DNA sequence S=s 1 the s 2 the s 3 the s 4 the s 5 the s 6 the s 7 the s 8 the s 9 the s 10 the s 11 the s 12 =ATTCATCACATC, the cycle gap is [0, 3], given the minimum average utility threshold minun=25, the utility value of each item in the character set is as follows image 3 shown.
[0139] "The sixth step, when the high upper bound pattern set of length i+1 is empty, the high average utility sequential pattern mining ends, and the seventh step is executed.
[0140] Because in the fifth step, the pattern set with a high upper bound of length 5 is empty, so the high average utility sequential pattern mining ends. "
[0141] Except above-mentioned difference, other is with embodiment 1.
[0142] In the foregoing embodiment, the programming software used is VC++6.0, and the drawing tool is Visio2015, and the processor used is Pentium (R) Dual-Core 32Processor+, and the operating system is Windows7 and above versions, and the above software...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com