Self-adaptive high-average utility sequence pattern mining method under one-time condition
A self-adaptive technique under the one-time condition, applied in data mining, special-purpose data processing, instruments and similar fields. It addresses problems such as the limited generality, accuracy and flexibility of existing methods and their inability to mine valuable information.
Examples
Embodiment 1
[0098] Given a DNA sequence S = s_1 s_2 s_3 s_4 s_5 s_6 s_7 s_8 s_9 s_10 = ACGACGACGG, the average utility threshold minunity = 20, and the utility values of the items are U(A) = 10, U(C) = 4, U(G) = 7.
[0099] The first step is to read the sequence database SDB, the average utility threshold minunity, and the utility value U(P) of each item:
[0100] Read in the given sequence database SDB and determine its size N. Denote the sequences in SDB as S_1, S_2, ..., S_k, ..., S_N, where 1≤k≤N, and denote the characters of sequence S_k as s_1, s_2, ..., s_i, ..., s_n. The average utility threshold minunity and the utility value U(P) of each item are given;
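The inputs of this first step can be sketched in Python as follows. This is only an illustrative sketch: the names `MiningInput`, `read_input` and `item_utilities` are hypothetical and do not come from the patent, and the sketch covers only reading and checking the inputs. It does not compute average utility or apply the one-time condition, since those definitions are not given in this excerpt.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class MiningInput:
    """Hypothetical container for the three inputs named in the first step."""
    sdb: List[str]              # sequence database SDB, one string per sequence S_k
    minunity: float             # average utility threshold
    utility: Dict[str, float]   # utility value U(x) of each item x

    @property
    def size(self) -> int:
        """N, the number of sequences in SDB."""
        return len(self.sdb)


def read_input(sequences, minunity, utility) -> MiningInput:
    """Collect the inputs and check that every character has a utility value."""
    for k, seq in enumerate(sequences, start=1):
        missing = sorted(set(seq) - set(utility))
        if missing:
            raise ValueError(f"sequence S_{k} uses items without a utility: {missing}")
    return MiningInput(list(sequences), minunity, dict(utility))


def item_utilities(seq: str, utility: Dict[str, float]) -> List[float]:
    """Utility of each character s_1 ... s_n of one sequence, in order."""
    return [utility[ch] for ch in seq]


# Values taken from Embodiment 1.
inp = read_input(["ACGACGACGG"], 20, {"A": 10, "C": 4, "G": 7})
print(inp.size)                                   # 1
print(item_utilities(inp.sdb[0], inp.utility))    # [10, 4, 7, 10, 4, 7, 10, 4, 7, 7]
```

Validating the alphabet at read time is a design choice of this sketch, so that a missing utility value is reported before any mining starts.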
[0101] The specific operation in this embodiment is as follows:
[0102] Read in the given sequence database SDB, which contains a sequence S=ACGACGACGG, the averag...
Embodiment 2
[0319] Given a DNA sequence S = s_1 s_2 s_3 s_4 s_5 s_6 = ACACAG, the average utility threshold minunity = 20, and the utility values of the items are U(A) = 10, U(C) = 4, U(G) = 7.
[0320] This embodiment is the same as Embodiment 1 except for the ninth step, which reads: "when the high lower bound pattern set Hcand_{m+1} of pattern length m+1 obtained in the sixth step is empty, the mining of high average utility patterns is complete".
[0321] Because the high lower bound pattern set of pattern length 3 generated in the sixth step is empty, the mining of high average utility patterns is complete.
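The stopping rule described above (stop as soon as the set for pattern length m+1 is empty) can be sketched as a generic level-wise loop. This is a minimal sketch under stated assumptions, not the patent's procedure: `mine_levelwise` and `toy_keep` are hypothetical names, candidates are formed here by simply appending one item to each kept pattern, and `toy_keep` is a stand-in predicate (non-overlapping substring count of at least 2) used only so the loop has something to test. It is not the patent's lower-bound check.

```python
def mine_levelwise(items, keep):
    """Grow patterns one item at a time until a level comes back empty.

    items: the alphabet of single items.
    keep:  predicate deciding whether a pattern stays in the current level
           (a stand-in for whatever check builds the set Hcand).
    Returns the kept patterns and the pattern length whose set was empty.
    """
    level = {item for item in items if keep(item)}
    kept = []
    m = 1
    while level:
        kept.extend(sorted(level))
        # Candidates of length m+1: every kept pattern extended by one item.
        level = {p + item for p in level for item in items if keep(p + item)}
        m += 1
    return kept, m


# Toy predicate, an assumption for illustration only.
S = "ACACAG"

def toy_keep(pattern):
    return S.count(pattern) >= 2


kept, empty_length = mine_levelwise(("A", "C", "G"), toy_keep)
print(kept)          # ['A', 'C', 'AC', 'CA']
print(empty_length)  # 3
```

With this stand-in predicate the loop happens to stop at length 3 on the sequence ACACAG; swapping in a different predicate changes which patterns survive but not the termination logic.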
[0322] In the foregoing embodiments, the programming software is VC++ 6.0, the drawing tool is Visio 2013, the processor is an Intel(R) Core(TM) i5-5200U CPU @ 2.2 GHz, and the operating system is Windows 7 or later. These software and hardware environments are well known to those skilled in the art.


