Audio data processing method, device and mobile terminal
A technology of audio data and processing methods, applied in the computer field, can solve problems such as poor user experience and high time and cost consumption, and achieve the effect of improving processing efficiency and user experience
Active Publication Date: 2019-01-01
VIVO MOBILE COMM CO LTD
9 Cites 2 Cited by
AI-Extracted Technical Summary
Problems solved by technology
[0005] The purpose of the embodiments of the present application is to provide an audio data processing method, device, and mobile terminal to solve...
Method used
[0163] The embodiment of the present application provides a mobile terminal. By acquiring the audio data to be set as a ringtone, if the duration of the audio data is greater than a predetermined duration threshold and there is no historical information for setting a ringtone based on the audio data, then Setting the audio data to an editable state, and ...
Abstract
A method for processing audio data, a device and a mobile terminal are disclosed in an embodiment of that present application. The method comprises: getting audio data to be set to a ringtone, if thetime of the target audio data is longer than a predetermined time threshold and there is no historical information of setting a ring tone based on the audio data, the audio data being set to an editable state, and audio data to be recommended being intercepted from the audio data based on an editing operation on the audio data. Through the method, the processing efficiency of the audio data can beimproved, and the user experience can be improved.
Application Domain
Electronic editing digitised analogue information signals
Technology Topic
Data processingComputer hardware +2
Image
Examples
- Experimental program(6)
Example Embodiment
[0057] Example one
[0058] Such as figure 1 As shown, an embodiment of the present application provides a method for processing audio data. The execution body of the method may be a mobile terminal, where the mobile terminal may be a mobile phone, a tablet computer, etc., and the mobile terminal may be a mobile terminal used by a user. This method can automatically extract the audio peak area according to the volume information and audio information of the audio data after acquiring the audio data, and recommend it to the user. The method may specifically include the following steps:
[0059] In step S102, the audio data to be set as a ringtone is acquired.
[0060] Among them, the audio data to be intercepted and set as a ringtone can be any audio data, such as audio data stored by the user in the mobile terminal, or audio data obtained by the user online, or alternatively, it can be stored or stored in the mobile terminal by the user. Audio data in the video data obtained online.
[0061] In the implementation, the development of the mobile Internet has promoted the popularization of mobile terminals. Mobile terminals based on mobile phones have become necessities for people’s lives and work. However, in the mobile terminal market, the brands and models of mobile phones with higher usage rates are more common. At the same time, when the mobile phone leaves the factory, the manufacturer usually stores a certain number of ringtones in the mobile phone for users to choose from. This results in fewer ringtones for users of the same brand and model, and there are many ringtones for the same ringtone. People use it, so when the bell rings in crowded places, it may cause "chaotic" scenes. In addition to the alternative ringtones in the mobile phone, the user can intercept part of the audio data in the audio data and use it as the ringtone of the mobile phone to distinguish it from other users. However, during the interception process, the user needs to browse all the audio data in order to accurately locate the climax part to be intercepted. In this process, sometimes it takes many repeated operations to accurately find the required part. This brings a high time cost to the user, resulting in a poor user experience. To this end, the embodiments of the present invention provide a technical solution that can solve the above-mentioned problems. For details, refer to the following content.
[0062] Taking audio data obtained online as a mobile phone ringtone as an example, the user can select the audio data they like through the audio playback software installed on the mobile terminal, download it to the mobile terminal, and use it as the audio to be set as the ringtone data. In addition, the user can also edit the audio data, and use the edited audio data as the acquired audio data to be set as a ringtone.
[0063] In practical applications, some audio data in the downloaded audio data can be directly used as mobile phone ringtones. For this reason, you can also set a selection condition. For example, you can set a ringtone duration requirement, for example, the ringtone duration ranges from 40 seconds to 1 Based on this, if the duration of the downloaded audio data does not meet the requirement of the duration of the ringtone, it can be directly used as the acquired audio data to be set as the ringtone. If the duration of the downloaded audio data meets the requirement of the duration of the ringtone, the audio data can be directly used as the ringtone.
[0064] In the above embodiment, the user obtains the audio data to be set as a ringtone through online download. In actual applications, it may also include other situations, such as the audio data stored by the user in a mobile terminal such as a mobile phone, or the user moves The audio data recorded by the terminal, etc., the audio data acquired above can all be used as the acquired audio data to be set as the ringtone.
[0065] In step S104, if the duration of the audio data is greater than the predetermined duration threshold and there is no history information for setting a ringtone based on the audio data, the audio data is set to an editable state.
[0066] Wherein, the predetermined duration threshold may be any duration, such as 20 seconds, 30 seconds. The editable state can be brought out by the color of the edit button. For example, if the audio data is editable, the edit button corresponding to the audio data can be set to a highlighted state. If it is not editable, the color of the edit button can be set to gray .
[0067] If the duration of the audio data to be set as a ringtone is greater than the predetermined duration threshold, the historical information of the ringtone setting is searched. If there is no historical information of the audio data set as a ringtone, the audio data can be set to an editable state . For example, the predetermined duration threshold is 30 seconds, the duration of the audio data to be set as a ringtone is 1 minute, the duration of the audio data is greater than the predetermined duration threshold, and the audio data has not been a ringtone, nor does it exist based on the audio The data sets the history information of the ringtone, so the audio data can be set to an editable state, and an edit button can be added next to the audio data, and it can be highlighted.
[0068] In step S106, based on the editing operation on the audio data, the audio data to be recommended is intercepted from the audio data.
[0069] Among them, the editing operation can be the interception of part of the data in the audio data, or the selection of part of the combined data in the audio data, for example, intercepting the climax part of a certain segment of the audio data, or the selection of two parts in the audio data. The combination of the climax part, the specific editing operation can be different according to the actual application.
[0070] In implementation, the audio data can be edited, and the middle part of the audio data can be cut out as the audio data to be recommended. For example, if a piece of audio data is 3 minutes, the middle 30 seconds can be changed from 1 minute to 15 minutes. The part from the second to 1 minute and 45 seconds is cut out as the audio data to be recommended.
[0071] In addition, the audio data can also be edited according to preset selection rules. For example, the preset selection rule is to obtain data for different periods of time according to the type of audio data. If the audio data belongs to a song with a duration of 1 minute to 3 minutes Class, the middle 30 seconds of the audio data is obtained as the audio data to be recommended. If the audio data belongs to short music with a duration of 30 seconds to 1 minute, the short music is directly used as the audio data to be recommended. If the audio data is greater than For 3 minutes of long audio data, the data between 3 minutes and 3 minutes and 30 seconds can be selected as the audio data to be recommended, and the user's adjustment operation of the length of the audio data to be recommended can be received.
[0072] The foregoing embodiment provides optional and achievable editing operations on audio data. The specific editing operation methods may be various, which is not limited in this embodiment.
[0073] The embodiment of the application provides a method for processing audio data. By acquiring audio data to be set as a ringtone, if the duration of the audio data is greater than a predetermined duration threshold and there is no historical information for setting the ringtone based on the audio data, then The audio data is set to an editable state, and then based on an editing operation on the audio data, the audio data to be recommended is intercepted from the audio data. In this way, in the audio data processing process, the user does not need to use manual continuous adjustment to select the desired segment while browsing the audio data, which improves the processing efficiency of the audio data and improves the user experience.
Example Embodiment
[0074] Example two
[0075] Such as figure 2 As shown, an embodiment of the present application provides a method for processing audio data. The execution body of the method may be a mobile terminal, where the mobile terminal may be a mobile phone, a tablet computer, etc., and the mobile terminal may be a mobile terminal used by a user. This method can automatically extract the audio peak area according to the volume information and audio information of the audio data after acquiring the audio data, and recommend it to the user. The method may specifically include the following steps:
[0076] In step S202, the audio data to be set as a ringtone is acquired.
[0077] For the specific processing process of the foregoing S202, reference may be made to the related content of S102 in the foregoing embodiment 1, which will not be repeated here.
[0078] In step S204, if the duration of the audio data is greater than the predetermined duration threshold and there is no historical information for setting the ringtone based on the audio data, the audio data is set to an editable state.
[0079] For the specific processing process of the foregoing S204, refer to the related content of S104 in the foregoing embodiment 1, which will not be repeated here.
[0080] In step S206, an editing instruction for audio data is received.
[0081] Wherein, the editing instruction may be a click operation on the edit button by the user, or a click operation on the audio data.
[0082] In implementation, after setting the audio data to editable state in step S204, an edit button can be set after the audio data, and the user's click operation on the edit button can be received, which is the editing of the audio data instruction. It can also receive a long-press, double-click or click operation on the audio data as an editing instruction for the audio data.
[0083] In practical applications, if the mobile terminal stores a preset commonly used audio data interception program, the audio data can be edited according to the commonly used audio data interception program, and finally the audio data is intercepted For the audio data to be recommended, please refer to the common processing methods for details, which will not be repeated here. If the mobile terminal does not store the pre-set common audio data interception scheme, the mobile terminal can extract the audio peak area from the audio data according to the volume information and audio information of the audio data, and according to the audio peak area To intercept the audio data to be recommended from the audio data, the specific processing methods may include various methods. An optional processing method is provided below. For details, please refer to the following related content.
[0084] In step S208, the audio data is divided into multiple pieces to obtain multiple pieces of sub-audio data.
[0085] In implementation, the audio data can be divided into multiple segments according to a predetermined time interval, where the predetermined time interval can be any size, or can be a time length related to the duration of the ringtone, where the duration of the ringtone can be the time when the mobile terminal is receiving When an incoming call or a text message is prompted, the time to play the audio is usually 20 seconds or 30-45 seconds. The specific application situation is different due to different application scenarios, and this application does not limit this. According to the time interval, the audio data is divided into multiple sub audio data of the same duration. The predetermined time interval may be the same as the duration of the ringtone, or less than the duration of the ringtone, which is not limited in the embodiment of the present application. According to the predetermined time interval, segment the acquired audio data. For example, the agreed time interval can be set to 15 seconds. If the total length of the acquired audio data is 3 minutes, it can be divided into 12 segments of equal duration. Audio area segment. The time interval can also be determined according to the duration of the audio data. For example, if the total length of the acquired audio data is 3 minutes and 05 seconds, the divided time interval can be set to 5 seconds or 37 seconds to obtain the equivalent time length. The multi-segment audio data.
[0086] In step S210, a candidate peak area is determined from each piece of sub-audio data. The volume value contained in the candidate peak area and the audio value corresponding to the audio information are both greater than a predetermined selection threshold, and the duration of the candidate peak area is greater than the predetermined duration threshold.
[0087] Among them, the volume information and audio information in the audio data may include parameter information such as the pitch, tone intensity, and timbre of the audio. The audio peak area may be an area in the audio data where the volume information and audio information have a relatively large value (such as an area exceeding a predetermined threshold).
[0088] The volume value and the audio value corresponding to the audio information may be data for recording audio characteristics, for example, it may be an amplitude value that determines the volume value, and a frequency value that may determine the audio information. The predetermined selection threshold can be determined according to the total volume value and audio value of the audio data. For example, the total volume value of a piece of audio data with a duration of 3 minutes is 600 decibels, and the audio data is divided into 12 segments of sub-audio data of equal duration. Then the average volume value of each piece of sub-audio data should be 50 decibels, then the predetermined selection threshold may be 50 decibels, or 60 decibels higher than the average value. The predetermined duration threshold may be any duration less than the duration of each piece of sub audio data, such as 3 seconds or 5 seconds.
[0089] In implementation, the divided sub-audio data is obtained in step S208, and the volume value contained in each sub-audio data and the audio value corresponding to the audio information can be calculated separately. If the obtained values are all greater than the predetermined selection threshold, the If the duration of the sub audio data is also greater than the predetermined duration threshold, it can be determined as a candidate peak area.
[0090] For example, if the predetermined selection threshold is set to a volume value of 50 decibels, the audio value corresponding to the audio information is 100 Hz, that is, the volume value of the determined candidate peak area and the audio value corresponding to the audio information should be greater than 50 decibels and 100 respectively hertz. If a piece of audio data with a duration of 3 minutes is divided into 12 pieces of sub-audio data, and the duration of each piece of sub-audio data is 15 seconds, among the 12 pieces of sub-audio data, there is a piece of self-audio data that contains a piece of 11-second duration. An audio data segment, where the volume value of all points in the audio data segment is greater than 50 decibels, the audio value of all points is greater than 100 Hz, and the duration of the audio data segment is also greater than a predetermined duration threshold of 10 seconds, then the audio data segment Can be determined as a candidate peak area. If among the 12 pieces of sub-audio data, multiple pieces of sub-audio data contain audio data segments that meet the above conditions, these audio data segments that meet the predetermined selection threshold and the predetermined duration threshold can be determined as candidate peak regions.
[0091] In step S212, a candidate peak area that satisfies a predetermined duration range condition is extracted from the candidate peak area as an audio peak area.
[0092] Wherein, the predetermined duration range condition can be any time range that meets the requirement of the mobile phone ringtone duration, for example, it can be greater than 10 seconds and less than 20 seconds, or the closest to 20 seconds. The specific situation can be adjusted according to the application scenario, and the embodiment of this application does not make specific requirements for this.
[0093] In implementation, among the candidate peak areas obtained in step S206, a candidate peak area that satisfies the predetermined duration range condition is found, and the candidate peak area is extracted as an audio peak area. For example, the predetermined duration range condition can be set to an audio area greater than 10 seconds and closest to 20 seconds. If there are multiple candidate peak areas with different durations, the candidate peak areas greater than 10 seconds can be extracted from the above candidate peak areas. During the comparison, one or more candidate peak areas closest to 20 seconds are found, and the candidate peak areas are extracted as audio peak areas.
[0094] In step S214, if the duration of each candidate peak area in the candidate peak area is within the predetermined duration range, from the candidate peak area, a candidate peak area whose duration is within the predetermined duration range is acquired as the audio peak area.
[0095] In implementation, if the predetermined duration range is set to be greater than 10 seconds and less than 20 seconds, then one or more candidate peak regions that meet the predetermined duration range can be extracted as audio peak regions.
[0096] In step S216, if the duration of each candidate peak area in the candidate peak area is not within the predetermined duration range, then from the candidate peak area, obtain multiple adjacent candidate peak areas whose total duration is within the predetermined duration range as Audio peak area.
[0097] In implementation, if the duration of the candidate peak area does not satisfy the predetermined duration range, multiple adjacent candidate peak areas whose total duration is within the predetermined duration range can be acquired as the audio peak area. For example, if the predetermined duration range is set to 10-20 seconds, the duration of the candidate peak area is less than 10 seconds, and if the duration of a candidate peak area is 8 seconds, the duration of its adjacent candidate peak area is 9 seconds, and two segments The interval between the candidate peak areas is 2 seconds, then the combined duration of the two candidate peak areas is 19 seconds, which meets the predetermined time range, namely 10-20 seconds, then the combination of the two candidate peak areas can be used as audio Peak area. If there are multiple adjacent candidate peak areas, and the combined area can meet the predetermined duration range, it can also be used as an audio peak area.
[0098] In step S218, according to the audio peak area, the audio data to be recommended is intercepted from the above audio data.
[0099] In implementation, after the audio peak area is obtained in step S212, the corresponding time point can be found in the audio data according to the start and end positions of the audio peak area, and then the audio data can be performed based on the found time point. Intercept, the intercepted result can be displayed in the mobile terminal for users to view.
[0100] In addition, according to the audio peak area, the time period corresponding to the audio peak area can be highlighted on the time progress bar of the audio data playback.
[0101] For example, for a piece of audio data with a duration of 3 minutes, the start time point of the audio peak area to be intercepted is 1 minute 50 seconds, and the end time point is 2 minutes 03 seconds, you can highlight the audio playback time progress bar Display the time zone from 1 minute 50 seconds to 2 minutes 03 seconds, which can be displayed in different colors or displayed in bold, and the positions of 1 minute 50 seconds and 2 minutes 03 seconds are marked at the same time. If there are multiple audio peak areas, they can be marked with different colors on the time progress bar, or the time period corresponding to each audio peak area can be labeled to distinguish. For the acquired audio peak area, the user can adjust it according to their needs. For example, if the audio data is a song, the intercepted audio peak area can be the climax of the song, if the intercepted audio peak area is at the starting point Previously included individual lyrics of the climax part, the user can drag the starting point of the time period corresponding to the audio peak area to the left on the time progress bar to intercept the clipped lyrics into the audio peak area, or to the right audio peak The end point of the time period corresponding to the area, the cut lyrics are intercepted into the audio peak area, such as image 3 Shown. The embodiment of the application provides a method for processing audio data. By acquiring audio data to be set as a ringtone, if the duration of the audio data is greater than a predetermined duration threshold and there is no historical information for setting the ringtone based on the audio data, then The audio data is set to an editable state, and then based on an editing operation on the audio data, the audio data to be recommended is intercepted from the audio data. In this way, in the audio data processing process, the user does not need to use manual continuous adjustment to select the desired segment while browsing the audio data, which improves the processing efficiency of the audio data and improves the user experience.
Example Embodiment
[0102] Example three
[0103] Such as Figure 4 As shown, an embodiment of the present application provides a method for processing audio data. The execution body of the method may be a mobile terminal, where the mobile terminal may be a mobile phone, a tablet computer, etc., and the mobile terminal may be a mobile terminal used by a user. This method can automatically extract the audio peak area according to the volume information and audio information of the audio data after acquiring the audio data, and recommend it to the user. The method may specifically include the following steps:
[0104] In step S402, the audio data to be set as a ringtone is acquired.
[0105] For the specific processing process of the foregoing S402, reference may be made to the related content of S102 in the foregoing embodiment 1, which will not be repeated here.
[0106] In step S404, if the duration of the audio data is greater than the predetermined duration threshold and there is no history information for setting the ringtone based on the audio data, the audio data is set to an editable state.
[0107] For the specific processing process of the foregoing S404, reference may be made to the related content of S104 in the foregoing embodiment 1, which is not repeated here.
[0108] In step S406, an editing instruction for audio data is received.
[0109] For the specific processing process of the foregoing S406, reference may be made to the related content of S206 in the foregoing embodiment 2, which will not be repeated here.
[0110] In practical applications, if the mobile terminal stores a preset commonly used audio data interception program, the audio data can be edited according to the commonly used audio data interception program, and finally the audio data is intercepted For the audio data to be recommended, please refer to the common processing methods for details, which will not be repeated here. If the mobile terminal does not store the pre-set common audio data interception scheme, the mobile terminal can extract the audio peak area from the audio data according to the volume information and audio information of the audio data, and according to the audio peak area To intercept the audio data to be recommended from the audio data, the specific processing methods may include various methods. An optional processing method is provided below. For details, please refer to the following related content.
[0111] In step S408, the audio data is divided into multiple pieces to obtain multiple pieces of sub-audio data.
[0112] For the specific processing process of the foregoing S408, reference may be made to the related content of S208 in the foregoing embodiment 2, which is not repeated here.
[0113] In step S410, the volume value of each piece of sub audio data and the sum of the audio value corresponding to the audio information are obtained respectively.
[0114] Wherein, the volume value corresponding to the volume information and the audio data corresponding to the audio information may be the sum of the volume value (such as the number of decibels) and the audio value.
[0115] In implementation, the volume value in each piece of sub-audio data and the audio value corresponding to the audio information are obtained and summarized. For example, the sum of the volume values of all points in a piece of sub-audio data can be 500 decibels, and all audio values The sum can be 600 Hz.
[0116] In step S412, according to the value of the sum corresponding to each segment of sub audio data, sub audio data satisfying a predetermined number of segments are selected from the multiple segments of sub audio data as the candidate peak area.
[0117] The predetermined number of segments can be one or more segments, and the specific setting can be determined according to the length of time of the audio data or the number of times the climax part of the audio data appears.
[0118] In implementation, according to step S406, the total volume value and audio value of each piece of sub-audio data are obtained respectively, and the sub-audio data is comprehensively sorted on this basis. The sorting can be performed according to the size of the volume value and the sum of the audio value. If the predetermined number of segments is set to 2 segments, the top two sub audio data of the sum of the volume value and the audio value can be used as the candidate peak area.
[0119] In step S414, a candidate peak area that satisfies a predetermined duration range condition is extracted from the candidate peak area as an audio peak area.
[0120] For the specific processing process of the foregoing S414, reference may be made to the related content of S212 in the foregoing embodiment 2, which is not repeated here.
[0121] In step S416, if the duration of each candidate peak area in the candidate peak area is within the predetermined duration range, from the candidate peak area, a candidate peak area whose duration is within the predetermined duration range is acquired as the audio peak area.
[0122] For the specific processing process of the foregoing S416, reference may be made to the related content of S214 in the foregoing embodiment 2, which will not be repeated here.
[0123] In step S418, if the duration of each candidate peak area in the candidate peak area is not within the predetermined duration range, then from the candidate peak area, obtain multiple adjacent candidate peak areas whose total duration is within the predetermined duration range as Audio peak area.
[0124] For the specific processing process of the foregoing S418, reference may be made to the related content of S216 in the foregoing embodiment 2, which is not repeated here.
[0125] In step S420, the audio data to be recommended is intercepted from the audio data according to the audio peak area.
[0126] For the specific processing process of the foregoing S420, reference may be made to the related content of S218 in the foregoing embodiment 2, which will not be repeated here.
[0127] The embodiment of the present application provides a method for processing audio data. By acquiring audio data to be set as a ringtone, if the duration of the audio data is greater than a predetermined duration threshold and there is no historical information for setting the ringtone based on the audio data, Then, the audio data is set to an editable state, and then based on an editing operation on the audio data, the audio data to be recommended is intercepted from the audio data. In this way, in the audio data processing process, the user does not need to use manual continuous adjustment to select the desired segment while browsing the audio data, which improves the processing efficiency of the audio data and improves the user experience.
PUM


Description & Claims & Application Information
We can also present the details of the Description, Claims and Application information to help users get a comprehensive understanding of the technical details of the patent, such as background art, summary of invention, brief description of drawings, description of embodiments, and other original content. On the other hand, users can also determine the specific scope of protection of the technology through the list of claims; as well as understand the changes in the life cycle of the technology with the presentation of the patent timeline. Login to view more.
Similar technology patents
Drilling device for wheel insert
Owner:CITIC DICASTAL
Neural network-based grinding machining working condition detection method
Owner:NANJING UNIV +1
Shaft gear composite processing machine tool
Owner:CHONGQING MACHINE TOOL GROUP
Device and method for magnetically grinding inner surfaces of large-diameter long and straight pipes
Owner:UNIV OF SCI & TECH LIAONING
Classification and recommendation of technical efficacy words
- Improve processing efficiency
- Improve user experience
Method for automatically acquiring new words from Chinese webpages
Owner:TSINGHUA UNIV
Full-automatic cloth paving and cutting integrated machine
Owner:CHONGQING UNIV OF TECH
Method for Managing Local IP Access Connection
Owner:ZTE CORP
Method and device for processing concurrent data and electronic accounting system
Owner:ADVANCED NEW TECH CO LTD
Ferritic steel barrel butt-junction circular-seam hot-wire TIG multi-layer multipath welding method
Owner:HARBIN JIANCHENG GRP
Method and device of pairing connection of equipment
Owner:XIAOMI INC
Automatic charging system for electric automobile
Owner:WUHAN INSTITUTE OF TECHNOLOGY
Touch processing method and portable mobile terminal
Owner:LENOVO (BEIJING) CO LTD
Method, device and equipment for displaying multiple application windows on mobile terminal
InactiveCN102968243AImprove information acquisition efficiency and ease of operationImprove user experience
Owner:SHENZHEN YUNYINGGU TECH CO LTD
Content recommending method and device based on user characteristics
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD