Clustering method, device and system
A clustering and cohesion technology, applied in the network field, can solve the problems that users cannot obtain resources, cannot be searched, and wrong search results, etc., and achieve the effect of objective and accurate processing methods, improved user experience, and accurate search results.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0029] refer to figure 1 Shown, be embodiment one of the method of the present invention, comprise steps:
[0030] Step 101, obtaining part of the text content of the media file;
[0031] Step 102. Calculate clustering information of the media file according to part of the text content of the media file.
[0032] Embodiments of the present invention have the following advantages:
[0033] First, the embodiment of the present invention can calculate the clustering information of the media file according to the partial text content of the media file by acquiring part of the text content;
[0034] Secondly, since the embodiment of the present invention obtains the clustering information of the media file by calculating part of the text content of the media file, it does not depend on the description information of the media file, and avoids the wrong clustering caused by artificially modifying the description information. Class, the processing method is objective and accurate....
Embodiment 2
[0073] refer to figure 2 As shown, it is the second embodiment of the method of the present invention, and the present embodiment takes an audio file as an example to illustrate, including steps:
[0074] Step 201, obtaining the contents of the header and tail of the audio file;
[0075] For MP3 and WMA files, a large amount of meta (metadata, source data) information will be stored in the header of the file to identify various attributes of the file itself, ID3V1 (the first generation of tags, for details, see http: / / www.id3.org / ID3v1 The MP3 (Moving PictureExperts Group Audio Layer III, Audio Compression Technology and Audio Coding Technology) file in ) format has 128 bytes of meta information at the end. Usually the header gets no more than 50k bytes of content; the tail gets 5k of content.
[0076] Step 202, analyzing the contents of the head and tail of the audio file;
[0077] Regarding MP3 and WMA head and tail files, referring to the MP3 file specification, there ...
example 1
[0100] Example 1: Suppose a link of an audio file in WMA format is:
[0101] http: / / oursim.whu.edu.cn / houtai / edit / UploadFile / 2006112073350103.wma For the audio file, the process of calculating its MD5 signature includes:
[0102] 1. Acquiring the header and tail contents of the WMA file in the link, the header and tail contents of the file are usually expressed in the form of a music URL link list;
[0103] Head: 2006112073350103_head 50k
[0104] Tail: 2006112073350103_tail 5k
[0105] 2. Analyze the header file and tail file of the WMA file obtained in the link:
[0106] a) First analyze the content of the header file.
[0107] The first 16 bytes of the header are 0x30 0x26 0xB2 0x75 0x8E 0x66 0xCF0x11 0xA6 0xD9 0x00 0xAA 0x00 0x62 0xCE 0x6C, so it can be judged that the file where the header file is located is a WMA format file.
[0108] The analyzer looks for audio content start identifiers 0x36 0x26 0xB2 0x75 0x8E 0x660xCF 0x11 0xA6 0xD9 0x00 0xAA 0x00 0x62 0xCE 0x6C,...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com