Audio playback method, system, electronic device and storage medium

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
By playing personalized audio data in the TWS earphone case, the problem of the earphone case having limited functionality is solved, the user experience and ease of operation are improved, and the diverse needs of users are met.

WO2026129898A1PCT designated stage Publication Date: 2026-06-25HUAWEI TECH CO LTD

View PDF 0 Cites 0 Cited by

Patent Information

Authority / Receiving Office: WO · WO
Patent Type: Applications
Current Assignee / Owner: HUAWEI TECH CO LTD
Filing Date: 2025-10-31
Publication Date: 2026-06-25

Application Information

Patent Timeline

31 Oct 2025

Application

25 Jun 2026

Publication

WO2026129898A1

IPC: H04R1/10

AI Tagging

Application Domain

Earpiece/earphone attachments

Technology Topics

Engineering Headphones

Explore More Agents

Novelty Search
Search existing technologies and assess novelty
↗
FTO
Analyze whether a product may infringe others' patents
↗
Design FTO
Check prior-design risk for exterior design
↗
Drafting
Draft patent application text based on a technical solution
↗
Find Solutions with TRIZ
Generate feasible solution to solve your technical challenge
↗

Similar Technology Patents

Texitile light ageing test instrument
CN1588059Acompact structure Easy to assemble and disassemble Material analysis by optical meansTextile testingEngineering Light filter
Multi-dimensional training method and device of support vector machine
CN114186620AImprove linear separabilityimprove classificationKernel methods Character and pattern recognition Data setDescent algorithm
Loop structure of cold heat flows
CN1916533AImprove efficiencySimple configurationFluid circulation arrangementHeating and refrigeration combinationsHeat flowWorking fluid
Environment-friendly mobile collecting box for decoration cutting dust
CN108636005AThe dragging process is smoothavoid secondary flyingUsing liquid separation agent Working accessories EngineeringSediment
Credit text analysis method, credit object auditing method and credit object auditing device
CN114386430AReduce labor costs Improve efficiency Finance Semantic analysisCredit cardEngineering

Get free access to AI patent search and analysis

Check patentability, review prior art and ask IP Agent with full patent context.

AI Technical Summary

Technical Problem

Existing TWS earphone cases have simple functional designs and cannot meet the increasingly diverse and personalized needs of users.

Method used

When the headphone case is opened or closed, specific audio data is played through the speaker, including user-defined ringtones, music clips, or welcome messages. Personalized audio content is generated based on the user's emotional characteristics, location, and other information to enhance user interactivity and operation confirmation.

Benefits of technology

It enriches the functionality of the headphone case, enhances the user experience and ease of operation, meets users' needs for product novelty, ease of use and high interactivity, helps users confirm the status of the headphone case, and enhances the safety and enjoyment of use.

✦ Generated by Eureka AI based on patent content.

Smart Images

Figure CN2025131885_25062026_PF_FP_ABST

Patent Text Reader

Abstract

The present application provides an audio playback method, a system, an electronic device, and a storage medium, and relates to the technical field of terminals. The method is applied to a TWS earbud. The TWS earbud comprises an earbud box having a speaker. The method comprises: acquiring first audio data; and, when the earbud box is opened or closed, the speaker playing back the first audio data. In this way, the first audio data can be played back when the earbud box is opened or closed, thereby further enriching functions of the earbud box and meeting increasingly diversified and individualized requirements of users.

Need to check novelty before this filing date? Find Prior Art

Description

An audio playback method, system, electronic device, and storage medium

[0001] This application claims priority to Chinese Patent Application No. 2024118765872, filed on December 18, 2024, entitled "An Audio Playback Method, System, Electronic Device and Storage Medium", the entire contents of which are incorporated herein by reference. Technical Field

[0002] This application relates to the field of terminal technology, and in particular to an audio playback method, system, electronic device and storage medium. Background Technology

[0003] Currently, true wireless stereo (TWS) earbuds are widely used in people's daily lives due to their convenience and portability. However, the charging cases for existing TWS earbuds are relatively simple in design and cannot meet the increasingly diverse and personalized needs of users. Summary of the Invention

[0004] This application provides an audio playback method, system, electronic device, and storage medium, which can further enrich the functions of headphone boxes and meet the increasingly diverse and personalized needs of users.

[0005] To achieve the above objectives, this application adopts the following technical solution:

[0006] In a first aspect, embodiments of this application provide an audio playback method applied to TWS earphones, the TWS earphones including an earphone case with a speaker, the method comprising:

[0007] Obtain the first audio data;

[0008] When the headphone case is opened or closed, the speaker plays the first audio data.

[0009] The audio playback method provided in this application allows for the playback of first audio data during the opening or closing of the headphone case, enriching its functionality. This design increases the interactivity and enjoyment of using the headphone case, meeting users' personalized needs for novelty, ease of use, and high interactivity in the product experience, thus enhancing the user experience.

[0010] In addition, users can configure the audio content and sound characteristics of the first audio data to provide emotional value through the first audio data played when the headphone case is opened or closed. For example, playing a family member's voice or a snippet of a favorite song when the headphone case is opened can effectively improve the user's mood, help the user maintain a good mindset, and enhance the user's psychological comfort.

[0011] In practical applications, users can also set corresponding first audio data for when the earphone case is opened or closed. By playing the first audio data, the user is directly and effectively informed that the earphone case has been opened or closed through sound feedback. This meets the user's need to confirm the status of the earphone case when they cannot directly observe it (such as at night or in poor visibility), ensuring the accuracy of the operation. At the same time, this audio feedback can also help users accurately identify the open or closed state of the earphone case, enhancing the user's operation recognition and interactive experience, and improving the convenience and safety of using TWS earphones.

[0012] It should be understood that the first audio data can be a user-defined ringtone, music clip, or welcome message; it can also be audio guidance data corresponding to the user manual of the TWS earbuds and / or charging case. This allows users to experience personalized settings during use, further enhancing the user experience.

[0013] In one possible implementation of the first aspect, the first audio data is determined based on one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first matter.

[0014] Based on the above possible implementation methods, since the first audio data is generated based on at least one of a variety of information, the playback of the first audio data can accurately adjust the user's emotional state, bring a personalized listening experience, help relieve stress, improve mood, and enhance the user's sense of pleasure.

[0015] Secondly, embodiments of this application provide an audio playback method applied to an electronic device that has a communication connection with a TWS earphone. The TWS earphone includes an earphone case with a speaker. The method includes: determining first audio data, wherein the first audio data is audio data played by the speaker when the earphone case is opened or closed; and sending the first audio data to the earphone case.

[0016] In one possible implementation of the second aspect, determining the first audio data includes: determining the first audio data in response to a user operation.

[0017] In other words, the first audio data can be determined directly in response to user actions.

[0018] Optionally, determining the first audio data in response to a user operation includes: displaying an audio configuration interface; and determining the first audio data in response to a configuration operation in the audio configuration interface.

[0019] Optionally, the audio configuration interface includes first audio data, which is determined based on one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

[0020] Optionally, the audio configuration interface further includes a first image, which is generated based on one or more of the following information: the user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and the user's progress information on the first task.

[0021] Optionally, determining the first audio data in response to a user operation includes: displaying a first control on a first interface; displaying an audio selection interface in response to a trigger operation on the first control, the audio selection interface including a list of selection controls, the list of selection controls including at least one audio selection control; displaying an audio configuration interface corresponding to the first audio selection control in response to a trigger operation on the first audio selection control among the at least one audio selection controls; and determining the first audio data in response to a configuration operation in the audio configuration interface.

[0022] Optionally, the first audio selection control is an AIGC control. The step of displaying an audio configuration interface corresponding to the first audio selection control in response to a trigger operation on at least one of the audio selection controls includes: displaying an audio configuration interface corresponding to the AIGC control in response to a trigger operation on the AIGC control, wherein the audio configuration interface includes the first audio data, and the first audio data is determined based on the user's emotional characteristic information and / or location information.

[0023] Optionally, the audio configuration interface further includes a second image, which is generated based on one or more of the following information: the user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and the user's progress information on the first task.

[0024] In one possible implementation of the second aspect, determining the first audio data includes: acquiring first information, the first information including one or more of the following: user's emotional characteristics information, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task; and generating the first audio data based on the first information.

[0025] It should be understood that in practical applications, the electronic device can, in response to the communication connection operation with the TWS earphones, acquire first information and determine the first audio data based on one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task. Alternatively, the electronic device can acquire the first information at preset intervals and generate the first audio data based on the first information.

[0026] In one possible implementation of the second aspect, determining the first audio data includes: acquiring second audio data; and determining the first audio data based on the second audio data.

[0027] Optionally, the second audio data may be determined in response to a user action.

[0028] In one possible implementation of the second aspect, the second audio data is recorded audio;

[0029] Alternatively, the second audio data is generated based on the first information, which includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

[0030] Optionally, when the second audio data is recorded audio, the first audio data can be obtained as follows: the audio configuration interface may include a sound recording control; the electronic device may obtain the second audio data in response to a trigger operation on the sound recording control.

[0031] In one possible implementation of the second aspect, determining the first audio data includes: acquiring multiple audio data; and determining the first audio data from the multiple audio data according to a preset rule.

[0032] In one possible implementation of the second aspect, the preset rule includes one or more of the following information: the number of times the headphone box is opened and / or closed, the user-preset playback order, first information, and the playback volume of the plurality of audio data, wherein the first information includes one or more of the following: user's emotional characteristics information, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

[0033] In one possible implementation of the second aspect, acquiring multiple audio data includes: acquiring first information, the first information including one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task; and acquiring the multiple audio data based on the first information.

[0034] It should be understood that the information used to determine multiple audio data may be the same as or different from the preset rules.

[0035] In one possible implementation of the second aspect, determining the first audio data from the plurality of audio data according to a preset rule includes: determining second audio data from the plurality of audio data according to the preset rule; and determining the first audio data based on the second audio data.

[0036] In one possible implementation of the second aspect, determining the first audio data based on the second audio data includes: generating the first audio data based on preset sound features and the second audio data, wherein the first audio data is audio data corresponding to the sound features.

[0037] In one possible implementation of the second aspect, determining the first audio data based on the second audio data includes: acquiring the sound features of the second audio data; and generating the first audio data based on preset first text information and the sound features.

[0038] In one possible implementation of the second aspect, determining the first audio data based on the second audio data includes: editing the second audio data to obtain the first audio data.

[0039] In one possible implementation of the second aspect, the method further includes:

[0040] When the headphone box is opened or closed, a first image is displayed on the first interface. The first image is determined based on first information, which includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of a first task.

[0041] In one possible implementation of the second aspect, the first image is used to display the battery information of the earphone case and / or the battery information of the TWS earphones in a thematic manner.

[0042] Thirdly, embodiments of this application provide a TWS earphone, the TWS earphone including: one or more processors; one or more memories; and one or more computer programs; wherein the one or more computer programs are stored in one or more memories, and the one or more computer programs include instructions that, when executed by the TWS earphone, cause the TWS earphone to perform the methods as described in the first aspect and any possible implementation thereof.

[0043] Fourthly, embodiments of this application provide an electronic device comprising: one or more processors; one or more memories; and one or more computer programs; wherein the one or more computer programs are stored in the one or more memories, and the one or more computer programs include instructions that, when executed by the electronic device, cause the electronic device to perform the methods as described in the second aspect above and any possible implementation thereof.

[0044] Fifthly, embodiments of this application provide an audio playback system, the audio playback system including a TWS earphone as provided in the third aspect and an electronic device as provided in the fourth aspect, the electronic device and the TWS earphone having a communication connection, the TWS earphone including an earphone case with a speaker, and the audio playback system being used to implement the methods described in any of the above aspects and any possible implementations of any of the above aspects.

[0045] In one possible implementation of the fifth aspect, the electronic device is configured to: determine first audio data, the first audio data being audio data played by the speaker when the headphone case is opened or closed; and send the first audio data to the headphone case;

[0046] The earphone case is used to: acquire first audio data; and control the speaker to play the first audio data when the earphone case is opened or closed.

[0047] In one possible implementation of the fifth aspect, the audio playback system may further include TWS earphones adapted to the earphone case.

[0048] It should be understood that TWS earbuds can have a communication connection with the earbud case. The earbud case can be used to provide power to the TWS earbuds, provide storage space for the TWS earbuds, and play audio data.

[0049] Optionally, the TWS earbuds and the earphone case can be connected via metal contacts, and the communication connection between the TWS earbuds and the earphone case can be achieved through the aforementioned metal contacts.

[0050] Optionally, the audio playback system may also include other electronic devices, such as watches, wristbands, smart glasses, smart helmets, and cloud servers. It should be noted that the cloud server can communicate with the electronic devices to provide them with functions such as data storage, data processing, and data forwarding.

[0051] Sixthly, a computer-readable storage medium is provided, the computer-readable medium storing a computer program (also referred to as code or instructions) that, when the computer program code is run on a computer, causes the computer to perform the method of any of the above aspects or any possible implementation of any of the above aspects.

[0052] In a seventh aspect, a computer program product comprising instructions is provided, the computer program product including: a computer program (also referred to as code or instructions) that, when the computer program is run on a computer, causes the computer to perform the method in any of the above aspects or any possible implementation of any of the above aspects.

[0053] It should be noted that the above-mentioned computer program code can be stored in whole or in part on the first storage medium, wherein the first storage medium can be packaged together with the processor or packaged separately from the processor, and this application does not make specific limitations in this regard.

[0054] Eighthly, a chip system is provided, comprising: a processor and a memory, the memory for storing a computer program (also referred to as code or instructions), the processor for calling and running the computer program stored in the memory, causing a device or apparatus equipped with the chip system to perform the method in any of the above aspects or any possible implementations of any of the above aspects.

[0055] It is understood that the beneficial effects of aspects two through eight above can be found in the relevant descriptions in aspect one above, and will not be repeated here. Attached Figure Description

[0056] Figure 1 is a schematic diagram of an audio playback system provided in an embodiment of this application.

[0057] Figure 2 is a schematic diagram of the structure of an electronic device provided in an embodiment of this application.

[0058] Figure 3 is a schematic diagram of the structure of an earphone box in an audio output device provided in an embodiment of this application.

[0059] Figures 4 to 20 are schematic diagrams of application scenarios corresponding to the audio playback method provided in the embodiments of this application.

[0060] Figure 21 is a flowchart illustrating an audio playback method provided in an embodiment of this application.

[0061] Figure 22 is a flowchart illustrating another audio playback method provided in an embodiment of this application.

[0062] Figure 23 is a schematic block diagram of an audio playback device provided in an embodiment of this application.

[0063] Figure 24 is a schematic block diagram of another audio playback device provided in an embodiment of this application. Detailed Implementation

[0064] The technical solutions of the embodiments of this application are described below with reference to the accompanying drawings and related embodiments. In the description of the embodiments of this application, unless otherwise specified or logically conflicting, the terminology and / or descriptions between different embodiments are consistent and can be mutually referenced. Technical features in different embodiments can be combined to form new embodiments based on their inherent logical relationships. In the embodiments of this application, "at least one" refers to one or more, and "more than one" refers to two or more. "And / or" describes the association relationship between related objects, indicating that three relationships can exist. For example, A and / or B can represent: A existing alone, A and B existing simultaneously, or B existing alone, where A and B can be singular or plural. In the textual description of the embodiments of this application, the character " / " generally indicates that the preceding and following related objects are in an "or" relationship; for example, A / B can represent A or B. In this application, "first," "second," and various numerical designations are only for ease of description and are not used to limit the scope of the embodiments of this application. For example, they are used to distinguish different messages, rather than to describe a specific order or sequence.

[0065] The audio playback method provided in this application embodiment can be applied to an audio playback system. Figure 1 shows a schematic diagram of an audio playback system provided in this application embodiment. Referring to Figure 1, the audio playback system 10 may include an electronic device 100 and an audio output device 200. The electronic device 100 can establish a communication connection with the audio output device 200, and the audio output device 200 can be used to play audio data (including music, voice, and calls, etc.) sent by the electronic device 100.

[0066] In this embodiment, the audio output device 200 may include a first audio output device 210 and a second audio output device 220, both of which may have audio playback functionality. For example, the first audio output device 210 may be a TWS earphone, and the second audio output device 220 may be an earphone case.

[0067] In some embodiments, the first audio output device 210 and the second audio output device 220 may each have a communication connection with the electronic device 100. For example, the first audio output device 210 may communicate with the electronic device 100 via Bluetooth or other means to play audio data (such as music, voice, and calls) sent by the electronic device 100; the second audio output device 220 may also communicate with the electronic device 100 via Bluetooth or other means to play audio data (such as opening and closing sounds) sent by the electronic device 100.

[0068] In this embodiment, the way in which the first audio output device 210 establishes a communication connection with the electronic device 100 can be the same as or different from the way in which the second audio output device 220 establishes a communication connection with the electronic device 100.

[0069] In other embodiments, in addition to establishing a communication connection with the electronic device 100 to receive audio data sent by the electronic device 100, the first audio output device 210 can also have a communication connection with the second audio output device 220. The second audio output device 220 can be used to play audio data sent by the electronic device 100 through the first audio output device 210. For example, the second audio output device 220 can be used to play the opening or closing sound transmitted by the first audio output device 210.

[0070] In other embodiments, in addition to establishing a communication connection with the electronic device 100 to receive audio data sent by the electronic device 100, the second audio output device 220 can also communicate with the first audio output device 210 to directly send audio data to the first audio output device 210, or forward the received audio data sent by the electronic device 100 to the first audio output device 210 so that the first audio output device 210 can play the received audio data.

[0071] It should be noted that, in addition to Bluetooth, NFC and other methods, the communication connection between the first audio output device 210 and the second audio output device can also be achieved by direct electrical connection via metal contacts.

[0072] For example, when the first audio output device 210 is a TWS earphone and the second audio output device 220 is an earphone case, metal contacts can be provided for the TWS earphone and the earphone case respectively, so that when the TWS earphone is placed in the receiving cavity of the earphone case, the TWS earphone and the earphone case can achieve communication and charging connection functions through the metal contacts. It should be understood that in this example, the communication and charging connection functions between the TWS earphone and the earphone case can be time-division multiplexed through technologies such as carrier aggregation.

[0073] In some embodiments, electronic device 100 may include, but is not limited to, personal computers (PCs), mobile phones, tablets, wearable devices (e.g., watches, bracelets, helmets, smart glasses, etc.), augmented reality (AR) / virtual reality (VR) devices, mixed reality (MR) devices, laptops, ultra-mobile personal computers (UMPCs), netbooks, personal digital assistants (PDAs), in-vehicle devices, smart screens, smart home devices (e.g., smart TVs, smart speakers, smart cameras, and electronic door locks, etc.), servers, and motion-sensing game consoles in human-computer interaction scenarios, etc. This application embodiment does not impose any limitations on the specific type of electronic device 100.

[0074] Referring to Figure 2, this is a schematic diagram of the hardware structure of an electronic device 100 provided in this application. The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 131, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, antenna 1, antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, a headphone jack 170D, a sensor module 180, buttons 190, a motor 191, an indicator 192, a camera 193, a display screen 194, and a subscriber identification module (SIM) card interface 195, etc. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, a barometric pressure sensor 180C, a magnetic sensor 180D, an accelerometer sensor 180E, a distance sensor 180F, a proximity sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, etc.

[0075] It is understood that the structures illustrated in the embodiments of this application do not constitute a specific limitation on the electronic device 100. In other embodiments of this application, the electronic device 100 may include more or fewer components than illustrated, or combine some components, or split some components, or have different component arrangements. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

[0076] For example, when the electronic device 100 is a mobile phone or a tablet computer, it may include all the components shown in the figure, or it may include only some of the components shown in the figure.

[0077] Processor 110 may include one or more processing units, such as: application processor (AP), modem processor, graphics processing unit (GPU), image signal processor (ISP), controller, memory, video codec, digital signal processor (DSP), baseband processor, and / or neural network processing unit (NPU), etc. Different processing units may be independent devices or integrated into one or more processors.

[0078] The processor 110 may also include a memory for storing instructions and data. In some embodiments, the memory in the processor 110 is a cache memory. This memory can store instructions or data that the processor 110 has just used or that are used repeatedly. If the processor 110 needs to use the instruction or data again, it can retrieve it directly from the memory. This avoids repeated accesses, reduces the waiting time of the processor 110, and thus improves the efficiency of the system.

[0079] The charging management module 140 is used to receive charging input from the charger. The power management module 141 is used to connect the battery 142, and the charging management module 140 is connected to the processor 110.

[0080] The wireless communication function of electronic device 100 can be realized through antenna 1, antenna 2, mobile communication module 150, wireless communication module 160, modem processor and baseband processor, etc.

[0081] The wireless communication module 160 can provide solutions for wireless communication applications on the electronic device 100, including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technologies. The wireless communication module 160 can be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via antenna 2, performs frequency modulation and filtering of the electromagnetic wave signals, and sends the processed signal to processor 110. The wireless communication module 160 can also receive signals to be transmitted from processor 110, perform frequency modulation and amplification, and convert them into electromagnetic waves for radiation via antenna 2.

[0082] Electronic device 100 implements display functions through a GPU, a display screen 194, and an application processor. The GPU is a microprocessor for image processing, connected to the display screen 194 and the application processor. The GPU is used to perform mathematical and geometric calculations and for graphics rendering. Processor 110 may include one or more GPUs, which execute program instructions to generate or modify display information.

[0083] The display screen 194 is used to display images, videos, etc., such as various display interfaces in the embodiments of this application.

[0084] Electronic device 100 can implement audio functions through audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone jack 170D, and application processor.

[0085] The audio module 170 can be used to convert digital audio signals into analog audio signals for output, and also to convert analog audio inputs into digital audio signals. The audio module 170 can also be used for encoding and decoding audio signals. In some embodiments, the audio module 170 can be located in the processor 110, or some functional modules of the audio module 170 can be located in the processor 110.

[0086] The speaker 170A, also known as a "loudspeaker," is used to convert audio electrical signals into sound signals. The electronic device 100 can listen to music or hands-free calls through the speaker 170A. For example, the speaker can play the comparison analysis results provided in the embodiments of this application.

[0087] The receiver 170B, also known as the "earpiece," is used to convert audio electrical signals into sound signals. When the electronic device 100 answers a telephone call or voice message, the receiver 170B can be brought close to the ear to listen to the voice.

[0088] Microphone 170C, also known as a "microphone" or "voice transducer," is used to convert sound signals into electrical signals. When making a phone call or sending a voice message, the user can speak by bringing their mouth close to microphone 170C, inputting the sound signal into microphone 170C. Electronic device 100 may have at least one microphone 170C. In some embodiments, electronic device 100 may have two microphones 170C, which, in addition to collecting sound signals, can also perform noise reduction. In other embodiments, electronic device 100 may also have three, four, or more microphones 170C, which can collect sound signals, reduce noise, identify the sound source, and perform directional recording, etc.

[0089] The 170D headphone jack is used to connect wired headphones. The 170D headphone jack can be a USB 130 interface or a 3.5mm Open Mobile Terminal Platform (OMTP) standard interface, a CTIA (Cellular Telecommunications Industry Association of the USA) standard interface.

[0090] In some embodiments, the audio output device 200 may be a wireless earphone. Correspondingly, the first audio output device 210 may be a TWS earphone, and the second audio output device 220 may be an earphone case adapted to the TWS earphone. Specifically, the TWS earphone may have a communication connection with the earphone case, and the earphone case may be used to store the TWS earphone and charge it; the TWS earphone and the earphone case may also have a communication connection with the aforementioned electronic device 100.

[0091] Figure 3 is a schematic diagram of the structure of an audio output device 200 provided in an embodiment of this application. Referring to Figure 3, the audio output device 200 may include: a processor 201, a power module 202, a memory 203, a communication module 204, a speaker 205, and a charging module 206.

[0092] It is understood that the structures illustrated in the embodiments of this application do not constitute a specific limitation on the structure of the audio output device 200. In other embodiments, the audio output device 200 may include more or fewer components than illustrated, or combine some components, or split some components, or have different component arrangements. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

[0093] The processor 201 may include a memory for storing instructions and data.

[0094] In some embodiments, the memory in the processor 201 can be a cache memory. This memory can store instructions or data that the processor 201 has just used or that are used repeatedly. If the processor 201 needs to use the instruction or data again, it can directly retrieve it from the memory. This avoids repeated accesses, reduces the waiting time of the processor 201, and thus improves system efficiency.

[0095] The power module 202 can be used to supply power to the various components included in the audio output device 200.

[0096] In some embodiments, the power module 202 may be a battery, such as a rechargeable battery.

[0097] The memory 203 can be used to store applications. When the processor 201 executes the application code stored in the memory 203, it can cause the audio output device 200 to execute the audio playback method provided in the embodiments of this application.

[0098] The communication module 204 may include a wireless communication module, such as a Bluetooth module or a Wi-Fi module. The communication module 204 can be used not only to establish a communication connection between the audio output device 200 and the electronic device 100, but also to establish a communication connection between the first audio output device 210 and the second audio output device 220 in the audio output device 200.

[0099] As an example and not a limitation, when the communication module 204 is a Bluetooth module, a Bluetooth connection can be established between the audio output device 200 and the electronic device 100; or, a Bluetooth connection can be established between the first audio output device 210 and the second audio output device 220 in the audio output device 200.

[0100] In some embodiments, a communication module 204 may be provided in the first audio output device 210 and the second audio output device 220 of the audio output device 200, respectively. Based on this, the first audio output device 210 can receive audio data sent by the electronic device 100 through the communication module 204, or forward the received audio data to the second audio output device 220. The second audio output device 220 can also receive audio data sent by the electronic device 100 through the communication module 204, or forward the received audio data to the first audio output device 210.

[0101] In some embodiments, the audio output device 200 may further include a speaker 205. The speaker 205 can be used to convert electrical signals into sound signals. The audio output device 200 can play audio data through the speaker 205.

[0102] In some embodiments, the audio output device 200 may further include a charging module 206. The charging module 206 may be a wireless charging module or a wired charging module. The charging module 206 can be connected to an external power source to charge the audio output device 200.

[0103] In some embodiments, the audio output device 200 may further include a sensor module 207, which may include a touch sensor, a fingerprint sensor, a distance sensor, a proximity sensor, an ambient light sensor, etc.

[0104] In this embodiment, a Hall sensor may also be provided in the audio output device 200. The Hall sensor can be used to detect the state of various components in the audio output device 200.

[0105] For example, if the first audio output device 210 in the audio output device 200 is a TWS earphone and the second audio output device 220 is an earphone case, assuming that the Hall sensor is set inside the earphone case, then the Hall sensor can be used to detect the open and closed states of the earphone case.

[0106] In the above example, the earphone case may include a lid and a body, which are rotatably connected. When the lid and body are closed, they form a cavity for accommodating TWS earphones. A magnetic element (e.g., a magnet) may be placed in the lid, and a Hall sensor may be placed in the body. When the lid is closed, the magnetic element in the lid gradually moves closer to the Hall sensor in the body. Based on the Hall effect, the voltage inside the Hall sensor increases, thus determining that the lid is closed. Conversely, when the lid is opened, the magnetic element in the lid gradually moves away from the Hall sensor in the body. Based on the Hall effect, the voltage inside the Hall sensor decreases, thus determining that the lid is open.

[0107] In another example, a drive component (such as a sector gear) can be installed on the hinge of the lid, and a moving component (such as a rack) that cooperates with the drive component for transmission, as well as a detection component for detecting the movement of the moving component, can be installed inside the box. When the lid is closed, the moving component is in a specific position (e.g., a first position), and the detection component can determine that the lid is in the closed state; when the lid is opened, the moving component moves to another position (e.g., a second position), and based on this, the detection component can determine that the lid is in the open state.

[0108] In other examples, capacitor plates can be disposed on both the lid and the body of the headphone case. As the lid closes or opens, the distance and relative area between the capacitor plates change, causing a change in capacitance. Therefore, the open and closed states of the headphone case can also be detected by detecting changes in capacitance. This application does not limit the specific method for detecting whether the headphone case is open or closed.

[0109] Optionally, the audio output device 200 may also include a display module 208, which can be used to display information such as images, text, and video.

[0110] Optionally, the audio output device 200 may also include a microphone module (also called a microphone or transducer) 209, which can be used to convert sound signals into electrical signals.

[0111] In some embodiments, the structure of the first audio output device 210 or the second audio output device 220 in the audio output device 200 may be exactly the same as or not exactly the same as the structure of the audio output device 200 shown in FIG3 above. The specific structure of the first audio output device 210 or the second audio output device 220 is not limited in the embodiments of this application.

[0112] The following embodiments of this application will use the electronic device shown in Figure 2 as a mobile terminal, the first audio output device as a TWS earphone, and the second audio output device as an earphone case as an example to illustrate the application scenarios corresponding to the audio playback method provided in the embodiments of this application in conjunction with the user interface.

[0113] Figure 4 shows a schematic diagram of a scenario after a mobile terminal and an audio output device establish a communication connection, as provided in an embodiment of this application. Referring to Figure 4, after the earphone case (or TWS earphones) establishes a communication connection with the mobile terminal, a pop-up interface 400 can be displayed on the desktop shown in Figure 4.

[0114] The pop-up interface 400 may display a control 410 for changing the on / off sound of the headphone box. The control 410 for changing the on / off sound of the headphone box can be used to display an interface related to audio configuration, so as to set the audio played when the headphone box is opened or closed.

[0115] In some embodiments, the pop-up interface 400 may also display a series of image interface elements 411 and / or text elements 412. The image interface elements 411 may include at least one of the following: an image of the earphone case connected to the mobile terminal (i.e., an image of the earphone case corresponding to the earphone case), an earphone case battery icon corresponding to the actual battery level of the earphone case, an image of the earphones connected to the mobile terminal (i.e., an image of the earphones corresponding to the earphones), and an earphone battery icon corresponding to the actual battery level of the earphones. The text elements 412 may include the actual battery level of the earphone case and / or the actual battery level of the TWS earphones.

[0116] The mobile terminal can respond to the user's trigger operation to change the on / off box tone control 410 and display the user interface 500 shown in Figure 5.

[0117] In some embodiments, the mobile terminal may also directly respond to the communication connection with the earphone box (or TWS earphones) and directly display the user interface 500 shown in Figure 5.

[0118] Referring to Figure 5, the user interface 500 can also be called an on / off box sound settings interface or an audio selection interface. The user interface 500 may include an audio option list 510, which may include options such as a recorded sound option 5101, a local music option 5102, a system sound option 5103, and an artificial intelligence generated content (AIGC) option 5104. It is easy to understand that the audio option list 510 may also include other options, such as adaptive options or custom options, etc., and this embodiment does not impose any limitations on this.

[0119] It is easy to understand that the recording sound option 5101 can be used to display an interface corresponding to the recorded sound, so as to configure the audio played when the headphone case is opened or closed using the recorded sound. The local music option 5102 can be used to display an interface corresponding to music (such as music) stored locally (i.e., on the mobile terminal), so as to configure the audio played when the headphone case is opened or closed using the locally stored music; the system sound option 5103 can be used to display an interface corresponding to the preset system sound, so as to configure the preset system sound as the audio played when the headphone case is opened or closed; the AIGC option 5104 can be used to display an interface related to intelligent technologies such as AIGC technology, so as to configure the audio played when the headphone case is opened or closed using audio generated by intelligent technology. If the audio option list 510 may also include other options (such as adaptive options or custom options), these other options can also be used to configure the audio played when the headphone case is opened or closed through related interfaces such as adaptive or custom.

[0120] In some embodiments, the user interface 500 may also include image interface elements and / or text elements, wherein the image interface elements and / or text elements may be part (e.g., device status information, etc.) or all of the content in the image interface element 411 shown in FIG4. This application embodiment does not specifically limit this.

[0121] In some embodiments, the mobile terminal can display the user interface 600 shown in FIG6 in response to a user's trigger operation on the recording sound option 5101 in the audio option list 510. The user interface 600 may include a start recording button 610, an open-box sound control 620, and a close-box sound control 630. The start recording button 610 can start the recording function to obtain recorded audio in response to a user's trigger operation; the open-box sound control 620 can be used to display a first setting content in the user interface 600 corresponding to the audio played when the headphone box is opened; the close-box sound control 630 can be used to display a second setting content in the user interface 600 corresponding to the audio played when the headphone box is closed.

[0122] In this embodiment, the first setting content and the second setting content may include audio and an audio selection control, etc. The first setting content may be the same as or different from the second setting content. That is, the first setting content displayed in the user interface 600 after the user triggers the opening sound control 620 may be the same as or different from the second setting content displayed in the user interface 600 after the user triggers the closing sound control 630.

[0123] In some examples, if an audio file already exists, the user interface 600 may also include a search box 640 for searching for audio files and / or at least one audio file 650, wherein at least one audio file 650 may refer to audio file 1, audio file 2 and audio file 3 as shown in FIG. 6.

[0124] In some embodiments, the user interface 600 may further include a selection control 6501 corresponding to at least one recording file 650. The mobile terminal may respond to the user's trigger operation on the selection control 6501 to set the recording file 650 corresponding to the selection control 6501 as the audio to be played when the headphone box is turned on (or off).

[0125] It should be understood that the mobile terminal can respond to the user's trigger operation on a selection control 6501 and directly set the recorded audio in a recording file 650 corresponding to the selection control 6501 as the audio played when the headphone box is turned on (or off).

[0126] In other embodiments, the mobile terminal may also respond to the user's triggering operation on multiple selection controls 6501, determine a recording audio from multiple recording audios according to preset rules, and set the determined recording audio as the audio played when the headphone box is opened (or closed). The preset rules may include one or more of the following information: the number of times the headphone box has been opened and / or closed, the user's preset playback order, the number of times multiple recording audios have been played, and first information; wherein the first information includes one or more of the user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and the user's progress information on the first item. This application embodiment does not specifically limit the specific content of the preset rules. In the above embodiments, the process of determining a recording audio from multiple recording audios according to preset rules can be understood as an operation automatically performed by the electronic device after the user selects multiple selection controls 6501, a process that does not require manual operation by the user.

[0127] It's easy to understand that, in one example, the audio played when the headphone case is opened (or closed) can be determined from multiple audio recordings based on the number of times the headphone case is opened and / or closed. For instance, the first audio recording with the most (or fewest) openings can be determined based on the number of times the headphone case is opened (and / or closed), and based on this, the first audio recording is determined from multiple audio recordings to be the audio played when the headphone case is opened (or closed). For example, assuming that audio recording A is the most frequently opened audio recording when the headphone case is opened 50 times, then audio recording A can be determined from multiple audio recordings to be the audio played when the headphone case is opened (or closed).

[0128] In another example, the audio to be played when the headphone case is opened (or closed) can be determined from multiple recorded audios based on a user-preset playback order. Furthermore, the mobile terminal can also respond to user triggering operations on multiple selection controls 6501, determining the playback order of the multiple recorded audios based on the order in which the user triggers the multiple selection controls 6501. The playback order of the multiple recorded audios can then be determined as the user-preset playback order. Based on this, the corresponding recorded audios can be played sequentially according to the number of times the headphone case is opened and / or closed.

[0129] For example, suppose there are multiple audio recordings, including audio A, audio B, and audio C. The user-preset playback order corresponds to the user's trigger operation on the selection control 6501 corresponding to the multiple audio recordings. That is, the user-preset playback order is audio A, audio B, and audio C in sequence. Then, after setting, audio A can be played the first time the headphone case is opened (or closed); audio B can be played the second time the headphone case is opened (or closed); and audio C can be played the third time the headphone case is opened (or closed). Alternatively, audio A can be played the first time the headphone case is opened (or closed); then audio B can be played the next time the headphone case is closed (or opened); and audio C can be played the next time the headphone case is opened (or closed).

[0130] In yet another example, the audio played when the headphone case is opened (or closed) can be determined based on the playback counts of multiple recorded audio files. These playback counts can include the number of times the recorded audio files are played on the electronic device, and / or the number of times the recorded audio files are set to play when the headphone case is opened (or closed).

[0131] It should be understood that the user's emotional characteristics in the first piece of information can be used to identify the user's mood, such as happy, unhappy, calm, etc. Location information can refer to the user's geographical location, such as home, company, school, park, hospital, etc. The user's personal information can be a self-portrait, used to identify the corresponding user. For example, personal information can include at least one of the following: gender, age, occupation, hobbies, habits, social interactions, personality, etc. Seasonal information can be used to identify different seasons, such as spring, summer, autumn, or winter. Solar term information can indicate any of the 24 solar terms. Time information can be used to represent different points in time within a day, such as morning, forenoon, noon, afternoon, and evening. Festival information can be used to represent different festivals, such as Spring Festival, Mid-Autumn Festival, New Year's Day, etc. Weather information can be used to represent specific weather conditions, such as cloudy, sunny, rainy, snowy, windy, etc. Temperature and humidity information can include specific temperature and humidity values. The first task can consist of one or more things (such as fitness, exercise, study, listening to music, watching videos, etc.). The user's progress information for the first task can include whether the first task has not started, whether the first task is being executed, or whether the first task has been completed.

[0132] It is easy to understand that the aforementioned first piece of information can be obtained from the user's historical operations on the mobile terminal; it can also be obtained by recognizing the user's voice data; or it can be determined from information obtained from other electronic devices connected to the mobile terminal. This application does not limit this.

[0133] For example, a user's emotional characteristics can be determined through pulse information obtained from a watch connected to a mobile phone, or through information about the user's interests, hobbies, and social media posts. Similarly, location information can be obtained using a positioning module in a mobile phone, earphone case, or other device that communicates with the mobile phone or earphone case. For example, the positioning module can include, but is not limited to, a Global Positioning System (GPS), a Wi-Fi module, or a Bluetooth module.

[0134] Based on the above embodiments, the first information may be obtained by the mobile terminal in response to the communication connection with the headphone box (or headphones); it may also be obtained by the mobile terminal when it detects that the headphone box is opened or closed; or it may be obtained by the mobile terminal at a preset time interval.

[0135] Taking the user's emotional characteristics as an example, in other examples, the audio played when the headphone case is opened (or closed) can be determined from multiple audio recordings based on the user's emotional characteristics. For instance, assuming the electronic device determines that the user's mood is happy based on the user's emotional characteristics, then the electronic device can determine from multiple audio recordings the audio recording corresponding to the user's happy mood as the audio played when the headphone case is opened (or closed).

[0136] In some embodiments, the mobile terminal can also respond to the user's triggering operation of the record button 610 to start the recording function and obtain the recorded audio. After obtaining the recorded audio, the mobile terminal can form the obtained recorded audio into a corresponding recording file and display it in the user interface 600.

[0137] As an example and not a limitation, the acquired audio recording may correspond to audio file 4 shown in Figure 7. The mobile terminal may display the user interface 700 shown in Figure 7 in response to the user's trigger operation on audio file 4.

[0138] The user interface 700 can display audio information 710 corresponding to the recorded audio. The audio information 710 may include the audio track information 7101 corresponding to the recorded audio, the total duration of the recorded audio 7102, the time corresponding to the current playback position of the recorded audio 7103, and the audio file size 7104, etc.

[0139] In some examples, the user interface 700 shown in Figure 7 may also include an editing control 720 and a start / pause playback control 730. The editing control 720 can be used to edit the recorded audio to obtain the edited audio. It should be understood that the operations for editing the recorded audio may specifically include trimming, splicing (or adding), mixing and overlaying, cloning, and conversion. Trimming can be understood as the process of deleting (or retaining) a portion of the sound from the recorded audio. Splicing can be understood as the process of adding new audio before or after the recorded audio. Mixing and overlaying can be understood as the process of mixing the recorded audio with other audio. Cloning can be understood as the process of generating audio with the same or similar sound characteristics as the recorded audio. Conversion can be understood as the process of processing the sound characteristics of the recorded audio into audio with other sound characteristics. The start / pause playback control 730 can be used to start or pause playback of the recorded audio or the edited recorded audio.

[0140] In other examples, the user interface 700 shown in Figure 7 may also include a confirmation control 740. The mobile terminal may also respond to the user's triggering operation on the confirmation control 740 by directly setting the acquired recording audio as the audio played when the headphone box is opened (or closed).

[0141] In some embodiments, the mobile terminal may also respond to the user's trigger operation on the confirmation control 740, extract the sound features of the recorded audio, generate speech corresponding to the preset text based on the sound features of the recorded audio, and set the generated speech as the audio played when the headphone box is opened (or closed). In other words, the text content of the generated speech corresponds to the preset text.

[0142] For example, assuming the preset text corresponding to the opening sound is "Dad, good morning," the recorded audio can be obtained through recordings of the child's daily speech, such as "Dad, I'm very happy today." The mobile terminal can respond to the user's triggering of the confirmation control 740, extracting the child's voice features from the recorded audio of "Dad, I'm very happy today" using deep learning, artificial intelligence, and other models. Then, based on the extracted voice features and the preset text, it generates a speech with the text "Dad, good morning," that is similar to or identical to the child's voice in at least one of the voice features, such as timbre, intonation, or speech rate. Thus, when the headphone case is opened, the speech "Dad, good morning," which is similar to or identical to the child's voice, will play.

[0143] In some embodiments, the mobile terminal may also respond to the user's triggering operation on the confirmation control 740, extract text from the recorded audio, and generate corresponding speech based on the extracted text and preset sound features (such as timbre, tone, speech rate, etc.), and set the generated speech as the audio played when the headphone box is opened (or closed). In this embodiment, the generated speech may be audio data corresponding to preset sound features.

[0144] In some embodiments, the mobile terminal may also respond to the user's trigger operation on the confirmation control 740, edit the recorded audio according to the obtained first information to obtain the corresponding voice, and set the obtained voice as the audio played when the headphone box is opened (or closed). The first information may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task. In this embodiment, the editing process can be understood as the mobile terminal automatically performing editing operations on the recorded audio.

[0145] In some embodiments, the editing control 720 shown in FIG. 7 may also have the same function as the aforementioned confirmation control 740. The mobile terminal may also respond to a user's trigger operation on the editing control 720, generate speech corresponding to preset text based on the sound characteristics of the recorded audio, and set the generated speech as the audio played when the headphone case is opened (or closed).

[0146] Based on the above possible embodiments, the mobile terminal can also update the recorded audio in the user interface 700 with the generated voice after the user triggers the editing control 720. Then, the mobile terminal can set the generated voice as the audio played when the headphone box is opened (or closed) in response to the user triggering the confirmation control 740 in the user interface 700.

[0147] Optionally, the preset text or preset sound features in the above embodiments can be pre-configured or user-defined. It is easy to understand that in the above examples, in the scenario of opening or closing the headphone case, the preset text corresponding to the opening sound can be the same as the preset text corresponding to the closing sound. For example, the preset text corresponding to both the opening and closing sounds is "ding". Of course, the preset text corresponding to the opening sound and the closing sound can also be different. For example, the preset text corresponding to the opening sound could be "Welcome to the music feast!"; the preset text corresponding to the closing sound could be "Looking forward to your next use!". This application embodiment does not limit the above content in any way.

[0148] Taking the editing operation corresponding to the editing control 720 as an example, which is a trimming operation, the mobile terminal can respond to the user's triggering operation on the editing control 720 and display the user interface 800 shown in Figure 8. For example, the user interface 800 can display a positioning indicator 810, a trimming control 820, a return control 830, and a confirmation control 840. The positioning indicator 810 can be used to move along the timeline corresponding to the recorded audio to extract a certain length of audio segment. The trimming control 820 can be used to retain or delete the audio segment extracted by the positioning indicator 810. The return control 830 can be used to return to the user interface 700 shown in Figure 7. The confirmation control 840 can be used to confirm the trimming operation and set the trimmed audio as the audio played when the headphone box is opened or closed.

[0149] The mobile terminal can respond to the user's trigger operation on the trimming control 820 to obtain the trimmed audio recording; then, the mobile terminal can respond to the user's trigger operation on the confirmation control 840 to set the trimmed audio recording as the audio played when the headphone case is opened or closed. For example, referring to Figure 9, if the trimmed audio recording is "Dad, good morning", the speaker inside the headphone case can play the audio "Dad, good morning" when the user opens the headphone case.

[0150] In some embodiments, the user interface 800 shown in FIG8 may also include other content for editing and processing the recorded audio, such as controls for adding background music, splicing controls, etc. The content in the user interface 800 shown in FIG8 and the interface content corresponding to other specific processing operations can be flexibly set, and this application embodiment does not limit it in this way.

[0151] Based on the user interface 500 shown in Figure 5, in some embodiments, the mobile terminal may also display the user interface 1000 shown in Figure 10 in response to the user's triggering operation on the local music option 5102.

[0152] For example, the user interface 1000 may display an open-box sound control 1010 and a close-box sound control 1020. The open-box sound control 1010 can be used to display third settings corresponding to the audio played when the headphone case is opened in the user interface 1000; the close-box sound control 1020 can be used to display fourth settings corresponding to the audio played when the headphone case is closed in the user interface 1000. The third settings may include song information and recommended song information, and the fourth settings may also include song information and recommended song information. It is easy to understand that the third settings may be the same as or different from the fourth settings.

[0153] For example, in response to a user's trigger operation on the headphone box control 1010, the mobile terminal can display a song information list 1030 in the user interface 1000. The song information list 1030 may include a play / pause button 1030a, song information 1030b, a selection control 1030c, and a trimming control 1030d. The play / pause button 1030a can be used to play or pause the audio of a song; the song information 1030b may include the song name, author, and duration; the selection control 1030c can be used to set the corresponding song as the audio played when the headphone box is opened, triggered by the user; and the trimming control 1030d can be used to further trim the complete song or a segment of the song.

[0154] In some embodiments, the user interface 1000 shown in FIG10 may further include a recommended song list 1040, which may include a play / pause button, song information, selection control, and cropping control for each song.

[0155] Referring to Figure 10, the mobile terminal can respond to the user's trigger operation on the selection control 1030c and set the song 1 corresponding to the selection control 1030c as the audio played when the headphone box is opened or closed.

[0156] In some embodiments, the mobile terminal may also respond to a user's trigger operation on multiple selection controls 1030c and determine the audio to be played when the headphone box is opened or closed from multiple local music tracks corresponding to the multiple selection controls 1030c according to preset rules. The preset rules may include one or more of the following information: the number of times the headphone box has been opened and / or closed, a user-preset playback order, first information, and the playback count of multiple local music tracks. The first information includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on a first task. The specific process and related content of determining the audio to be played when the headphone box is opened or closed according to the preset rules can be understood by referring to the embodiment described in Figure 6 above, and will not be repeated here.

[0157] In the above embodiments, the user interface displayed after the user triggers the closing sound control 1020 and the subsequent operations can be understood by referring to the relevant content of the user triggering the opening sound control 1010, and will not be repeated here. This application embodiment does not limit the specific content displayed in the user interface 1000.

[0158] In other embodiments, the mobile terminal may also respond to a user's trigger operation on the system sound option 5103 in the user interface 500 shown in FIG. 5, displaying the user interface 1100 shown in FIG. 11. The user interface 1100 may display an open-box sound control 1110 and an close-box sound control 1120. The open-box sound control 1110 may be used to display a fifth setting content in the user interface 1100 corresponding to the audio played when the headphone case is opened. The fifth setting content may include information corresponding to one or more system sounds, such as the name of the system sound and a selection control corresponding to each system sound. The close-box sound control 1120 may be used to display a sixth setting content in the user interface 1100 corresponding to the audio played when the headphone case is closed. The sixth setting content may also include information corresponding to one or more system sounds, such as the name of the system sound and a selection control corresponding to each system sound. It is easy to understand that the fifth setting content (including but not limited to the order and number of system sounds) may be the same as or different from the sixth setting content.

[0159] In the above example, the user interface 1100 may also display a system sound list 1130, which may include multiple system sounds, a system sound name 1131 corresponding to each system sound, and a corresponding selection control 1132. It should be noted that the system sound list 1130 may be an interface element displayed after the user triggers the opening sound control 1110 or the closing sound control 1120; or it may be an interface element displayed by default in the user interface 1100 in response to the user's triggering operation on the system sound option 5104 in the user interface 500, i.e., an interface element that is displayed by default for the opening or closing sound.

[0160] For example, assuming the system sound list 1130 is an interface element corresponding to the opening sound control 1110, the mobile terminal can respond to the user's trigger operation on the selection control corresponding to system sound 1 and set system sound 1 to the opening sound, that is, system sound 1 is played when the headphone case is opened. If the system sound list 1130 is an interface element corresponding to the closing sound control 1120, the mobile terminal can respond to the user's trigger operation on the selection control corresponding to system sound 1 and set system sound 1 to the audio played when the headphone case is closed.

[0161] In some embodiments, the mobile terminal may also respond to a user's triggering operation on the selection control 1132 corresponding to multiple system sounds in the system sound list 1130, and determine the audio to be played when the headphone box is opened or closed from the multiple system sounds according to preset rules. The preset rules may include one or more of the following information: the number of times the headphone box has been opened and / or closed, the user's preset playback order, first information, and the playback count of multiple system sounds. The first information includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on a first task. The specific process and related content of determining the audio to be played when the headphone box is opened or closed according to the preset rules can be understood by referring to the embodiment described in Figure 6 above, and will not be repeated here.

[0162] In other embodiments, the mobile terminal may also respond to a user's triggering operation on the AIGC option 5104 in the user interface 500 shown in FIG. 5, and display the user interface 1200 shown in FIG. 12. The user interface 1200 may display audio information 1210, play / pause control 1220, return control 1230, and confirmation control 1240. The audio information 1210 may include: audio track information corresponding to the audio (hereinafter referred to as source audio) determined by technologies such as AIGC, total audio duration, audio file size, current playback position, and the time corresponding to the current playback position. The play / pause control 1220 can be used to play or pause the source audio. The return control 1230 can be used to return to the displayed user interface 500. The confirmation control 1240 can be used to directly set the source audio as the audio played when the headphone box is opened or closed.

[0163] In the above embodiments, the source audio may also be determined by a mobile terminal or other device (such as a cloud server) based on the first information, wherein the first information includes one or more of the following: user's emotional characteristics information, location information, user's personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

[0164] In one example, the mobile terminal can respond to the user's trigger operation on the confirmation control 1240 by simultaneously setting the source audio to the audio played when the headphone case is opened or closed. That is, the audio played when the headphone case is opened is the same as the audio played when the headphone case is closed. Referring to Figure 13, assuming the source audio is "Cheer up," the headphone case can respond to the user's operation of opening or closing the headphone case by playing the aforementioned source audio "Cheer up."

[0165] In another example, referring to Figure 14, the mobile terminal can also respond to the user's trigger operation on the confirmation control 1240 by displaying the opening sound control 1241 and the closing sound control 1242 in the user interface 1200; after detecting the user's trigger operation on the opening sound control 1241 (or closing sound control 1242), the mobile terminal sets the source audio to the audio played when the headphone box is opened (or closed).

[0166] In other possible examples, the mobile terminal can also respond to the user's triggering operation on the confirmation control 1240 by setting the audio played when the headphone case is opened or closed to AIGC mode, i.e., intelligent mode. That is, in one case, the mobile terminal can update the source audio based on the first information obtained at preset intervals, and play the updated source audio when the headphone case is opened or closed.

[0167] In another scenario, the mobile terminal can also determine multiple audio tracks based on the first information. Then, it can determine the audio played when the headphone case is opened or closed from among the multiple audio tracks according to preset rules. The preset rules may include one or more of the following: the number of times the headphone case has been opened and / or closed, the user-preset playback order, the first information, and the playback count of the multiple audio tracks. The first information may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and the user's progress information regarding the first task.

[0168] It's easy to understand that in this case, the first information used to determine multiple audio files and to further determine the audio played when the headphone case is opened or closed from among the multiple audio files can be the same. Based on this, the specific time at which the mobile terminal obtains the same first information can be the same or different. Of course, the first information used to determine multiple audio files and to further determine the audio played when the headphone case is opened or closed from among the multiple audio files can also be different.

[0169] In some examples, the user interface 1200 may also include an editing control 1250, which can be used to edit the source audio to obtain the edited source audio. Correspondingly, the confirmation control 1240 can also be used to set the edited source audio as the audio played when the headphone box is opened or closed. In this embodiment, the specific usage method of the editing control 1250 and the related user interface can be found in the relevant description in the embodiment shown in Figure 7 above, and will not be repeated here.

[0170] In the above example, the user interface 1200 may also display an image 1260. Image 1260 may also be generated using first information. This first information may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task. Image 1260 may also be an image generated by combining information from the source audio (e.g., timbre, pitch, etc.). Image 1260 may also be a randomly generated image. Of course, in other possible embodiments, image 1260 may also be an image generated based on other information (e.g., the number of times the headphone case has been opened and / or closed, etc.). This application embodiment does not limit this.

[0171] In some embodiments, image 1260 may be generated using one or more models, such as a diffusion model, a large language model (LLM), or a variational autoencoder.

[0172] Furthermore, image 1260 can be displayed in the user interface 1200 in full-screen, pop-up, or half-screen mode. When image 1260 is displayed in full-screen mode in the user interface 1200, it can be understood that image 1260 is displayed over the user interface 1200. This application embodiment does not limit the specific generation and display method of image 1260.

[0173] In practical applications, as shown in Figure 15, the mobile terminal can respond to the operation of the headphone box being opened (or closed) and display image 1260 on the desktop or any other interface of the mobile terminal.

[0174] In some embodiments, referring to FIG16, the mobile terminal may also display a user interface 1600 in response to the opening (or closing) of the earphone case, and the image 1260 may be set as the background image of the user interface 1600. That is, the image 1260 may be used to display in the user interface 1600 in a thematic manner. For example, the user interface 1600 may display the current battery information of the earphone case and the battery information of the earphones corresponding to the earphone case. It may also display information such as the usage time of the TWS earphones and the volume of the audio played in the TWS earphones.

[0175] In other embodiments, the user interface 1600 shown in FIG16 may also display other interface elements, such as an image of the headphone case corresponding to the headphone case, a headphone case battery icon corresponding to the actual battery value of the headphone case, an image of the headphone corresponding to the headphone, a headphone battery icon corresponding to the actual battery value of the headphone, and so on.

[0176] In some embodiments, referring to FIG17, image 1260 can also be used to display on the desktop of an electronic device in a thematic manner; that is, the generated image 1260 can also be used to change the background image of the electronic device.

[0177] In some embodiments, if the headphone case has a display screen, the headphone case may also display image 1260 on the display screen in response to the operation of the headphone case being opened or closed; or, the mobile terminal may send image 1260 to the headphone case in response to the operation of the headphone case being opened or closed, so that image 1260 is displayed on the display screen of the headphone case.

[0178] Referring to the user interface 600 shown in Figure 6, the user interface 1000 shown in Figure 10, the user interface 1100 shown in Figure 11, and the user interface 1200 shown in Figure 14, each of these user interfaces is provided with an opening tone control and an closing tone control, for a total of two controls. In some other embodiments, the opening tone control and the closing tone control can also be set as a switching control. For example, the opening tone and closing tone setting order of the headphone case can be pre-configured. Assuming that the pre-configured opening tone and closing tone setting order of the headphone case is opening tone, closing tone, then a switching control can be set in the user interface. After the mobile terminal detects the user's first operation of the switching control, it can display the setting content corresponding to the opening tone in the user interface; after the mobile terminal detects the user's second operation of the switching control, it can display the setting content corresponding to the closing tone in the user interface. The embodiments of this application do not limit the specific setting form of the switching control.

[0179] To meet the need to separately set the opening and closing sounds of the headphone case, in other possible embodiments, based on the pop-up interface 400 shown in Figure 4 above, in addition to displaying a control 410 for changing the opening / closing sound in the pop-up interface 400, two controls can also be displayed in the pop-up interface 400. One control can be used to display the settings corresponding to the opening sound; the other control can be used to display the settings corresponding to the closing sound.

[0180] As an example and not a limitation, Figure 18 shows another scenario diagram of a mobile terminal and an audio output device after communication connection according to an embodiment of this application. Referring to Figure 18, after the headphone case establishes a communication connection with the mobile terminal, a pop-up interface 1800 can be displayed on the desktop shown in Figure 18. The pop-up interface 1800 can display a control for changing the opening sound 1810 and a control for changing the closing sound 1820. The control for changing the opening sound 1810 can be used to set the audio played when the headphone case is opened; the control for changing the closing sound 1820 can be used to set the audio played when the headphone case is closed. Therefore, users can directly select the opening sound to set separately in the pop-up interface 1800 without affecting the closing sound setting; or directly select the closing sound to set separately in the pop-up interface 1800 without affecting the opening sound setting, thus meeting the user's need for differentiated settings for the opening and closing sounds.

[0181] In some examples, a series of image interface elements (and / or text elements) may also be displayed in the pop-up interface 1800. The image interface elements (and / or text elements) may be exactly the same as or partially the same as the image interface elements 411 (and / or text elements 412) displayed in the pop-up interface 400 shown in Figure 4.

[0182] Based on the above example, the mobile terminal can respond to a user's trigger operation to change the opening sound control 1810 and display the user interface 1900 shown in Figure 19. Alternatively, the mobile terminal can respond to a user's trigger operation to change the closing sound control 1820 and display the user interface 2000 shown in Figure 20. It should be understood that the interface content displayed in user interface 1900 is mainly used to manage the audio played when the headphone case is opened; the interface content displayed in user interface 2000 is mainly used to manage the audio played when the headphone case is closed. In some embodiments, the interface content displayed in user interface 1900 and the interface content displayed in user interface 2000 may be the same or different.

[0183] Specifically, the interface content displayed in user interface 1900 or user interface 2000 and subsequent user interfaces can be understood by referring to the embodiments described in Figures 5 to 17 above; only the differences are described here. Compared with the user interfaces shown in Figures 6, 10, 11, and 14, in this embodiment, the user can directly use the opening sound setting interface shown in Figure 19 to set the audio played when the headphone case is opened, and use the closing sound setting interface shown in Figure 20 to set the audio played when the headphone case is closed.

[0184] It should be noted that, in the foregoing embodiments of this application, the triggering operation detected by the mobile terminal can be a click operation, a long press operation, a long press and click operation, a double click operation, a touch operation, or a swipe operation, etc.; the triggering operation can also be a selection operation input by the user through a physical button on the electronic device; the triggering operation can also be an air gesture operation, a face recognition operation, a voice command operation, etc., detected by the user through the camera, microphone, or other sensors of the electronic device. This application does not impose any limitations on this.

[0185] The above describes the application scenarios corresponding to the audio playback method provided in the embodiments of this application. The following describes the audio playback method provided in the embodiments of this application with reference to the accompanying drawings.

[0186] Figure 21 shows a flowchart of an audio playback method 2100 provided in an embodiment of this application. This method can be applied to the headphone case in the above example; that is, the audio playback method 2100 can be executed by the headphone case. Specifically, the audio playback method 2100 can be executed by a module (e.g., a processor, chip, or chip system) in the headphone case; it can also be implemented by a logic module or software capable of implementing all or part of the headphone case's functions. Referring to Figure 21, the audio playback method 2100 may include the following steps:

[0187] S2101, the headphone box acquires the first audio data.

[0188] It should be understood that the first audio data can refer to any audio played when the headphone case is opened or closed. For example, the first audio data can be a user-defined ringtone, music clip, or welcome message; it can also be the guide audio corresponding to the user manual of the headphones and / or headphone case, etc.

[0189] In some embodiments, the first audio data may be audio generated based on first information, wherein the first information may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task. The specific content of the first information can be found in the relevant content of the foregoing embodiments, and will not be repeated here. This application does not limit the specific content or generation method of the first audio data.

[0190] In this embodiment, the first audio data may be different from the audio data played inside the TWS earphones. That is, the audio data played inside the TWS earphones and the first audio data may not be the same audio.

[0191] In some embodiments, the first audio data can be configured by an electronic device. The process of configuring the first audio data by the electronic device can be found in the relevant descriptions of the embodiments shown in Figures 4 to 20 above, and will not be repeated here. Furthermore, the user can also set the first audio data as the audio played when the headphone case is opened by sending a voice command to the electronic device. For example, assuming the first audio data is A, the user can send a voice command to the electronic device: "Set A as the audio played when the headphone case is opened." The electronic device can directly obtain A based on the received voice command and set A as the audio played when the headphone case is opened.

[0192] In other examples, the first audio data can also be configured via the headphone case. For instance, a user can send a voice command to the headphone case, which, upon receiving the command, retrieves the first audio data and sets it as the audio to be played when the headphone case is opened.

[0193] In some embodiments, the first audio data may be downloaded directly from the earphone box; it may also be sent directly to the earphone box by the electronic device; or it may be sent indirectly to the earphone box by the electronic device through the TWS earphone.

[0194] Of course, in other embodiments, the earphone case can also directly obtain the first audio data from its memory. For example, after the earphone case is manufactured, the first audio data is pre-stored in its memory. Alternatively, the earphone case can also receive the first audio data shared by other devices via NFC or other means.

[0195] In some embodiments, the audio duration of the first audio data can be from 2 seconds (s) to 5 seconds. For example, the audio duration of the first audio data can be 2 seconds, 2.5 seconds, 3 seconds, 4 seconds, or 5 seconds, etc. Of course, the audio duration of the first audio data can also be shorter or longer, and this application embodiment does not limit this.

[0196] S2102, when the headphone case is opened or closed, the speaker of the headphone case plays the first audio data.

[0197] For example, suppose the headphone case includes a body and a lid, which are rotatably connected. When the body and lid are closed, they form a cavity for accommodating the headphones. The process of switching from a closed state to an open state between the lid and body can be understood as the headphone case being opened, and the process of switching from an open state to a closed state can be understood as the headphone case being closed. If the headphone case detects a switch from a closed state to an open state, it can respond to the opening operation by playing first audio data through the speaker. If the headphone case detects a switch from an open state to a closed state, it can respond to the closing operation by playing first audio data through the speaker. The specific detection method for the headphone case can be found in the relevant description in the embodiment described in Figure 3 above, and will not be repeated here.

[0198] In some embodiments, when the headphone case is in an open (or closed) state, the speaker of the headphone case may also play the first audio data mentioned above at preset intervals.

[0199] In other embodiments, when the headphone case is in the open state, the speaker of the headphone case can play the first audio data in a loop until the headphone case is closed, that is, the playback of the first audio data stops after the headphone case is closed.

[0200] Based on the above possible implementation methods, the continuous playback of the first audio data can effectively inform the user that the headphone case is currently in the open state, reminding the user to close the headphone case in a timely manner and preventing dust and other debris from entering the headphone case's cavity. In certain scenarios (such as when the headphone case is in the open state, certain circuits may remain connected), the continuous playback of the first audio data can remind the user to close the headphone case in a timely manner, thereby effectively reducing the headphone case's power consumption.

[0201] In some embodiments, the first audio data may further include first sub-audio data and second sub-audio data. The first sub-audio data may correspond to the audio played when the headphone case is opened, that is, when the headphone case is opened, the speaker of the headphone case can play the first sub-audio data; the second sub-audio data may correspond to the audio played when the headphone case is closed, that is, when the headphone case is closed, the speaker of the headphone case can play the second sub-audio data. The configuration process and acquisition method of the first sub-audio data may be the same as or different from the configuration process and acquisition method of the second sub-audio data.

[0202] In the above embodiments, the second sub-audio data can be different from the first sub-audio data. This allows users to directly determine the current state of the headphone case by playing different audio data, enhancing user operation clarity and interactive experience. It is understood that the second sub-audio data can also be the same as the first sub-audio data. Based on this, both the opening and closing sounds can be configured simultaneously through the same interface, allowing for configuration of the closing sound while configuring the opening sound, further simplifying the configuration process, meeting user needs for product usability, and improving the user experience.

[0203] The following will use the process of configuring the first audio data of an electronic device as an example to illustrate the specific process of configuring the first audio data of an electronic device.

[0204] Corresponding to the audio playback method 2100 described in Figure 21, Figure 22 shows a schematic flowchart of another audio playback method 2200 provided in this application embodiment. Referring to Figure 22, this audio playback method 2200 can be applied to an electronic device. In some embodiments, the electronic device can be a device directly connected to the headphone case, and the headphone case is equipped with a speaker that can be used to play the first audio data. Of course, the electronic device can also be a device connected to TWS earphones, so that the electronic device can send the first audio data to the headphone case through the TWS earphones.

[0205] The audio playback method 2200 described above can be executed by an electronic device. Specifically, the audio playback method 2200 can be executed by a module (e.g., a processor, chip, or chip system) in the electronic device; it can also be implemented by a logic module or software capable of implementing all or part of the functions of the electronic device. Referring to Figure 22, the audio playback method 2200 may include the following steps:

[0206] S2201, the electronic device determines the first audio data.

[0207] S2202, the electronic device sends the first audio data to the headphone box.

[0208] It should be understood that after determining the first audio data, the electronic device can send the first audio data to the headphone box immediately; or it can send the determined first audio data to the headphone box after a preset time; the electronic device can also send the first audio data to the headphone box in response to the communication connection between the electronic device and the headphone box (or headphones). In this way, the headphone box can play the first audio data based on the received first audio data when the headphone box is opened or closed. The specific content of the first audio data can be found in the relevant description in the aforementioned audio playback method 2100, and will not be repeated here.

[0209] In some embodiments, the electronic device may directly determine first audio data in response to a user operation. It should be understood that the user operation may include one or more configuration operations for determining the first audio data. The process of directly determining the first audio data based on a user operation can be understood by referring to the embodiments shown in Figures 4 to 20 above, and will not be repeated here.

[0210] In some embodiments, the electronic device may acquire first information and then generate first audio data based on the acquired first information. By sending the generated first audio data to the headphone case, the headphone case can play the first audio data when it is opened or closed.

[0211] It should be understood that the first information may include one or more of the following: the user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and the user's progress information on the first task. The specific content of the first information and the process of generating the first audio data based on the first information can be understood by referring to the relevant content in the embodiments shown in Figures 4 to 20 above, and will not be elaborated upon here.

[0212] In some embodiments, the electronic device may also acquire second audio data and then further determine first audio data based on the second audio data.

[0213] It is easy to understand that in this embodiment, the second audio data can be recorded audio; it can also be determined directly in response to user operation; or it can be generated based on the first information, which may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, user's progress information on the first task, etc.

[0214] It should be understood that, in one example, the electronic device can directly identify the second audio data as the first audio data. In this case, the second audio data can be understood as the same audio as the first audio data.

[0215] In another example, after acquiring the second audio data, the electronic device can generate first audio data based on preset sound features and the second audio data. Furthermore, the electronic device can acquire text information from the second audio data, and then, based on the acquired text information and preset sound features, obtain first audio data corresponding to the text information in the second audio data, wherein the obtained first audio data is audio data with preset sound features.

[0216] In other words, in this example, the sound features of the second audio data and the sound features of the first audio data can be different, and the text information of the second audio data and the text information of the first audio data can be the same. The electronic device can generate the first audio data corresponding to the preset sound features through the text information in the second audio data.

[0217] In this embodiment, preset voice features can be used to identify different users, and different users have different voice features. The preset voice features may include one or more features such as timbre, tone, emotion, and speech rate.

[0218] The preset sound features can be determined based on first information, which may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task. The preset sound features can also be determined based on other information (such as cultural background information, musical aesthetic information, emotional preference information, etc.); the preset sound features can also be pre-configured or user-defined.

[0219] In another example, after acquiring the second audio data, the electronic device can also generate the first audio data based on the preset first text information and the second audio data, so that the generated first audio data is the audio data corresponding to the first text information.

[0220] In other words, in this example, the sound features of the second audio data and the first audio data can be the same, while the text information of the second audio data and the text information of the first audio data can be different. After acquiring the second audio data, the electronic device can extract its sound features; based on preset first text information and the sound features of the second audio data, it can generate first audio data, so that the generated first audio data can not only be the audio corresponding to the first text information, but also audio that is the same as or similar to the second audio data in terms of sound features.

[0221] In this example, the first text message can be a prompt message set separately for when the headphone case is opened or closed. For example, when the headphone case is open, the first text message could be set to "Welcome to use it." When the headphone case is closed, the first text message could be set to "Welcome to use it next time." Of course, the first text message can also be a uniform audio message set for both when the headphone case is opened and closed, that is, the opening sound and closing sound are the same. For example, the first text message could be set to "Remember to have fun no matter what."

[0222] In the above embodiments, the first text information can be pre-configured or user-defined. Based on the examples shown in Figures 4 to 20, the first text information can correspond to preset text. The relevant content of the first text information can be found in the content corresponding to the preset text in the aforementioned embodiments, and will not be repeated here.

[0223] In other examples, after acquiring the second audio data, the electronic device can also determine the first audio data by editing the second audio data.

[0224] It should be understood that editing processing may include one or more of the following processes: cropping, splicing (or adding), mixing and overlaying, cloning, and conversion of the second audio data by electronic devices.

[0225] Trimming can refer to the operation of deleting (or retaining) portions of audio from the second audio data. For example, after acquiring the second audio data, an electronic device can remove redundant content (such as noise) to obtain the first audio data. Based on this, the influence of redundant content on the first audio data can be avoided, improving the consistency and adaptability of the first audio data in practical application scenarios.

[0226] Splicing can refer to the operation of adding new audio data before or after the second audio data. For example, assuming the acquired second audio data is a recorded audio about the wife, a recorded audio about the child can be added after the second audio data to obtain the first audio data, so that the first audio data contains both the recorded audio about the wife and the recorded audio about the child.

[0227] Hybrid overlay processing refers to the operation of mixing the newly added audio with the second audio data after it has been acquired. For example, after acquiring the second audio data, an electronic device allows the user to add background music to it, resulting in second audio data with background music, where the background music is the newly added audio. The first audio data corresponds to the resulting second audio data with background music.

[0228] Cloning can refer to the operation of generating first audio data that corresponds to and possesses the sound features of the second audio data, using the sound features of the second audio data. Conversion can refer to the operation of converting the sound features of the second audio data to obtain first audio data with other sound features.

[0229] In this embodiment, the specific application scenario of editing the second audio data to obtain the first audio data can also be found in the corresponding content of the examples shown in Figures 4 to 20 above, which will not be repeated here.

[0230] Based on the above example, in one possible implementation, the electronic device can also edit and process the second audio data according to the first information to determine the first audio data. The first information may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task, etc.

[0231] In other words, in this possible implementation, the specific editing operation to be performed on the second audio data can be determined by the first information, so that the obtained first audio data is the audio data corresponding to the first information.

[0232] It is not difficult to understand that when the second audio data is generated based on the first information, the first information used to obtain the second audio data and the first information used to edit and process the second audio data based on the first information to determine the first audio data can be the same; can be different; or can be partially the same and partially different. This application does not limit this aspect.

[0233] In some embodiments, the electronic device may also acquire multiple audio data and then directly determine the first audio data from the multiple audio data according to preset rules. The preset rules may include one or more of the following information: the number of times the headphone case has been opened and / or closed, the user-preset playback order, first information, and the playback count of the multiple audio data. The first information includes one or more of the following: user emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user progress information on a first task.

[0234] It should be understood that the multiple audio data acquired by the electronic device can be determined directly in response to user operations. The process of directly determining multiple audio data based on user operations can be understood by referring to the embodiments shown in Figures 4 to 20 above, and will not be repeated here.

[0235] In one example, multiple audio data may also be determined based on first information obtained by the electronic device, wherein the first information may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of the first task.

[0236] Based on the above example, after the electronic device determines multiple audio data based on the acquired first information, it can determine the first audio data from the multiple audio data according to preset rules. It is easy to understand that in this example, the same or different first information can be used to acquire multiple audio data, and the first audio data can be determined from the multiple audio data. That is, the first information used to acquire the multiple audio data and the first information used to determine the first audio data from the multiple audio data can be the same; can be different; or can be partially the same and partially different. This application does not limit this aspect.

[0237] In some embodiments, the electronic device may first acquire multiple audio data, then determine second audio data from the multiple audio data according to preset rules, and then determine first audio data based on the second audio data. The preset rules may include one or more of the following information: the number of times the headphone case has been opened and / or closed, the user-preset playback order, first information, and the playback count of the multiple audio data. The first information includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

[0238] It should be understood that after the electronic device determines the second audio data from multiple audio data according to preset rules, it can generate the first audio data based on preset sound features and text information in the second audio data. Alternatively, it can generate the first audio data based on preset first text information and sound features of the second audio data. Furthermore, the second audio data can be edited to obtain the first audio data. Further, the second audio data can be edited based on first information to obtain the first audio data. The specific process of determining the first audio data based on the second audio data in this embodiment may be the same as or different from the specific process in the aforementioned embodiment where the electronic device directly acquires the second audio data and then determines the first audio data based on the second audio data.

[0239] In some embodiments, the electronic device may first acquire first information, then acquire multiple audio data based on the first information; subsequently, determine second audio data from the multiple audio data according to a preset rule; and then determine first audio data based on the second audio data. It should be understood that the specific content of the first information, the process of determining multiple audio data based on the first information, the specific content of the preset rule, the process of determining second audio data from the multiple audio data, and the process of further determining first audio data based on the second audio data can be understood in conjunction with the relevant descriptions in the foregoing embodiments, and will not be repeated here.

[0240] Based on the above possible implementations, when the headphone box is opened or closed, a first image can also be displayed on the first interface of the electronic device. The first image can be determined based on first information, which may include one or more of the following: user's emotional characteristics information, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of the first matter.

[0241] It should be understood that the first interface can be any interface in an electronic device, such as the desktop, "minus one screen," or pop-up interface of a mobile terminal as shown in Figure 1. The first image can be referred to as image 1260 in the embodiments shown in Figures 4 to 20 above. The specific display scene and generation method of the first image can be referred to the corresponding description of image 1260 in the relevant embodiments above, and will not be repeated here.

[0242] It's easy to understand that the specific information used to determine the first image can be the same as the specific information used to determine the first audio data. In other words, the same first information can be used to determine both the first image and the first audio data. Of course, the specific content of the first information used to determine the first image can also be different from the specific content of the first information used to determine the first audio data. For example, if the first information includes location information and the user's emotional characteristic information, the user's location information can be used to determine the first image, and the user's emotional characteristic information can be used to determine the first audio data.

[0243] In some embodiments, a second piece of information may also be used to determine the first image, and the second information may include the first information. That is, the first image may be generated using the first information and other information. For example, the first information may be the user's emotional characteristics and location information, and the other information may be current time information and weather information.

[0244] In some embodiments, a third piece of information may also be used to determine the first image. This third piece of information differs from the first information; that is, the information used to generate the first image is different from the information used to generate the first audio data. In other words, the first image can be generated using information other than the first information.

[0245] In some embodiments, the first image can also be used to display the battery information of the earphone case and / or the TWS earphones in a thematic manner. That is, the first image can be used to change the current theme or desktop background image of the electronic device. For example, referring to Figure 17, the first image can be image 1260, which can be displayed as a background image on the desktop of the mobile terminal.

[0246] In some embodiments, where the headphone case includes a display screen, the first image can also be displayed on the display screen of the headphone case when the headphone case is opened or closed. Based on this, the first image can also be used to display a theme on the display screen of the headphone case.

[0247] Of course, even when the headphone case is open or closed, the first image can also be displayed on other devices (such as a watch). Optionally, the first image displayed on other devices can also be used to change the theme or background image on those other devices.

[0248] In the various possible embodiments described above, the first image can be used not only to display the battery information of the earphone case and / or the TWS earphones in a thematic manner, but also to display other information (including images and / or text, etc.), such as the battery information of other devices connected to the electronic device. It is easy to understand that the information displayed on the first image can be text information, and / or image information, and / or voice information, etc.

[0249] In some embodiments, the display method of the first image may include pop-up display, full-screen display, or half-screen display, etc.

[0250] The following description, in conjunction with the user interface shown in Figures 4 to 20, provides an exemplary explanation of the specific process for determining the first audio data in response to a user operation. It is readily understood that the specific process for determining the first audio data described below can also be applied to the specific process of determining the second audio data in response to a user operation, or to the specific process of determining multiple audio data in response to a user operation. The specific processes for determining the second audio data and the specific processes for determining multiple audio data can be understood by referring to the specific process for determining the first audio data, and will not be repeated here.

[0251] In some embodiments, a first control may be displayed on a second interface of the electronic device. The electronic device may display an audio selection interface in response to a user's triggering operation on the first control. The audio selection interface may include a list of selection controls, which may include at least one audio selection control. The electronic device may display an audio configuration interface corresponding to the first audio selection control in response to a user's triggering operation on the first audio selection control among the at least one audio selection controls. The electronic device may determine the first audio data in response to a user's configuration operation in the audio configuration interface.

[0252] It is understood that, based on the above embodiments, user operations may include: communication connection operation between the headphone box (or headphones) and the electronic device, user triggering operation of the first control, user triggering operation of the first audio control in at least one audio selection control, and user configuration operation in the audio configuration interface.

[0253] Specifically, the second interface can be any interface on the electronic device, such as the desktop, the "minus one screen," or a pop-up window. Referring to the examples in Figures 4 to 16, the second interface can be the desktop of the mobile terminal; or it can be a pop-up window 400 displayed on the desktop of the mobile terminal. That is to say, the first control can be displayed directly on the desktop of the mobile terminal, or it can be displayed in the pop-up window 400 through the desktop of the mobile terminal.

[0254] The first control can be used to display an audio selection interface in response to a user's trigger operation. For example, the first control can be the change on / off box tone control 410 shown in the pop-up interface 400 as shown in Figure 4; or it can be the change on box tone control 1810 or the change off box tone control 1820 shown in the pop-up interface 1800 as shown in Figure 18.

[0255] The audio selection interface can be used to provide audio from one or more different audio sources. A list of selection controls can be displayed in the audio selection interface. For example, the audio selection interface can be seen in user interface 500 shown in Figure 5, user interface 1900 shown in Figure 19, or user interface 2000 shown in Figure 20. The list of selection controls can be seen in audio option list 510 in user interface 500.

[0256] Optionally, the number of audio source types corresponding to the audio source selection controls in the selection control list can be the same as or different from the number of audio selection controls. This application does not impose any limitation on this.

[0257] In this embodiment, the number of audio selection controls in the selection control list is the same as the number of audio source types, and each audio selection control corresponds to one audio source. As an example, and not a limitation, referring to the examples shown in Figures 4 to 20, the selection control list may include four audio selection controls: a record sound option 5101, a local music option 5102, a system sound option 5103, and an AIGC option 5104. The record sound option 5101 corresponds to audio recordings obtained through recording; the local music option 5102 corresponds to music stored locally; the system sound option 5103 corresponds to preset system audio; and the AIGC option 5104 corresponds to sound generated based on at least one user-related piece of information (such as first information).

[0258] It is easy to understand that the first audio selection control can refer to any audio selection control that the user selects from the list of selection controls. Based on the foregoing example, the first audio selection control can be any one of the following controls: Record Sound option 5101, Local Music option 5102, System Sound option 5103, and AIGC option 5104.

[0259] The audio configuration interface can be used to acquire audio data and / or process the acquired audio data to obtain the processed audio data, thereby determining the first audio data. For example, the audio configuration interface can be the user interface 600 shown in Figure 6, the user interface 700 shown in Figure 7, and the user interface 800 shown in Figure 8; it can also be the user interface 1000 shown in Figure 10; it can also be the user interface 1100 shown in Figure 11; or it can also be the user interface 1200 shown in Figure 12 or Figure 14.

[0260] Taking the audio configuration interface as shown in Figure 12 (user interface 1200) as an example, the user's configuration operation in the audio configuration interface can be the user's trigger operation on the confirmation control 1240; or the user's trigger operation on the confirmation control 1240 and the user's trigger operation on the open box sound control 1241 (or close box sound control 1242), etc.

[0261] In some embodiments, when the first audio selection control is an AIGC control, the electronic device can respond to the user's triggering operation on the AIGC control and display an audio configuration interface corresponding to the AIGC control. The audio configuration interface may include first audio data, which may be determined based on first information. The first information may include one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of the first task.

[0262] It should be understood that when the first audio selection control is an AIGC control, the electronic device can respond to the user's triggering operation on the AIGC control and display the audio configuration interface corresponding to the AIGC control. This audio configuration interface may include the first audio data. Subsequently, the electronic device can respond to the user's configuration operation in the audio configuration interface to determine the first audio data. It is easy to understand that in this embodiment, the user's configuration operation in the audio configuration interface can be a confirmation operation of the first audio data; it can also be a confirmation operation of the first audio data or a user's selection operation of the open or closed box sound, etc.

[0263] In this embodiment, the AIGC control can be referred to as AIGC option 5104 in Figure 5 above. The audio configuration interface corresponding to the AIGC control can be referred to as user interface 1200 shown in Figure 12 above. The first audio data can correspond to audio generated by technologies such as AIGC. Based on this example, the electronic device can directly determine the audio generated by technologies such as AIGC as the first audio data in response to user operations such as confirmation of audio generated by AIGC technology in user interface 1200.

[0264] It is not difficult to understand that the first audio data in the audio configuration interface can also be determined based on other information, such as the user's educational background or the user's historical operation information on the electronic device. The specific content of the first information in this embodiment can be understood by referring to the embodiment described in Figure 12 above, and will not be repeated here.

[0265] In some embodiments, when the first audio selection control is an AIGC control, the electronic device can respond to the user's triggering operation on the AIGC control and display an audio configuration interface corresponding to the AIGC control. The audio configuration interface may include third audio data, which may be determined based on the user's first information. The first information may include one or more of the following: the user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and the user's progress information on the first task. The electronic device can respond to the user's configuration operation in the audio configuration interface and determine the first audio data based on the third audio data.

[0266] It is not difficult to understand that in this embodiment, the specific content of the first audio data determined based on the third audio data can be understood by referring to the process of editing audio generated based on AIGC and other technologies in the embodiment shown in Figure 14, and the relevant process of determining the first audio data based on the second audio data in the embodiment shown in Figure 22. It will not be repeated here.

[0267] In some embodiments, when the first audio selection control is an AIGC control, a first image may also be displayed in the audio configuration interface, and the first image may also be determined based on the first information.

[0268] It should be understood that the first image can be referred to as image 1260 in the embodiment shown in Figure 14 above. The specific display scene, generation method, and other details of the first image can be found in the description of image 1260 in the aforementioned related embodiments, as well as the relevant description in the embodiment corresponding to Figure 22 above, and will not be repeated here.

[0269] In some embodiments, the audio configuration interface may also include a sound recording control, and the electronic device may acquire first audio data in response to a user's triggering operation on the sound recording control.

[0270] It is easy to understand that a sound recording control can be used to acquire recorded sound in response to a user's trigger action. For example, the sound recording control can be the start recording button 610 shown in Figure 6. The first audio data can be directly acquired recorded audio, such as any recording file displayed in the user interface 600 shown in Figure 6.

[0271] In this embodiment, the first audio data may be audio further determined by the electronic device based on the acquired recorded audio. For details on determining the first audio data based on the recorded audio, please refer to the relevant description of determining the first audio data in the embodiment corresponding to Figure 22 above, which will not be repeated here.

[0272] In some embodiments, when the first control is a single functional control, the audio configuration interface may further include a display switching control. The display switching control can be used to switch the display content in the audio configuration interface corresponding to the first sub-audio data or the second sub-audio data. The first sub-audio data may be the audio data played by the speaker of the headphone box when the headphone box is opened; the second sub-audio data may be the audio data played by the speaker of the headphone box when the headphone box is closed.

[0273] It should be understood that when the first control is a single functional control (such as the change on / off box tone 410 in the pop-up interface 400), the first control can only be used to display the corresponding functional interface (user interface 500 as shown in Figure 5), which cannot meet the user's need to set the on and off box tones separately. Based on this, in order to meet the user's need to set the on and off box tones separately, a display switching control can also be set in the audio configuration interface, so that after the user triggers the display switching control, the content corresponding to the on or off box tone can be displayed in the audio configuration interface respectively, thereby realizing the setting of the first sub-audio data or the second sub-audio data.

[0274] In other words, the display switching control can be used to display content corresponding to the open or closed box sound in the audio configuration interface. For example, the display switching control can be the open box sound control 620 and the closed box sound control 630 in Figure 6; or the open box sound control 1010 and the closed box sound control 1020 in Figure 10; or the open box sound control 1110 and the closed box sound control 1120 in Figure 11; or the open box sound control 1241 and the closed box sound control 1242 in Figure 14.

[0275] It's easy to understand that in the example above, the display toggle control can consist of two controls. In actual design, the display toggle control can also be a single toggle control. Specific examples of a display toggle control as a single toggle control can be found in the aforementioned related embodiments, and will not be repeated here.

[0276] In some other embodiments, the electronic device determining the first audio data in response to a user operation may further include: the electronic device directly displaying a first control on a first interface; the electronic device may display an audio configuration interface in response to a user's triggering operation on the first control; and then, in response to a user's configuration operation in the audio configuration interface, determining the first audio data.

[0277] In this embodiment, the first interface, the first control, the audio configuration interface, the configuration operation in the audio configuration interface, and the specific content of determining the first audio data can be understood by referring to the corresponding content in the foregoing embodiments, and will not be repeated here.

[0278] Understandably, compared to the aforementioned embodiments, in this embodiment, after the electronic device is connected to the headphone box (or headphones), the pop-up interface 400 shown in Figure 4 can display the switch on / off control 410. In response to the user's trigger operation on the switch on / off control 410, the user interface 500 shown in Figure 5 is not displayed. Instead, any one of the following audio configuration interfaces is directly displayed: the user interface 600 shown in Figure 6, the user interface 1000 shown in Figure 10, the user interface 1100 shown in Figure 11, and the user interface 1200 shown in Figure 12. Afterward, the electronic device can determine the first audio data in response to the user's configuration operation in the audio configuration interface.

[0279] Based on the above embodiments, user operations may include: user triggering of the first control and user configuration operations in the audio configuration interface.

[0280] In other embodiments, the electronic device determining the first audio data in response to a user operation may further include: the electronic device directly displaying an audio selection interface, which may include at least one audio selection control; the electronic device displaying an audio configuration interface corresponding to the first audio selection control in response to a user's trigger operation on the first audio selection control; and then, the electronic device determining the first audio data in response to a user's configuration operation in the audio configuration interface. In this embodiment, the audio selection interface, the audio selection control displayed in the audio selection interface, the first audio selection control, the audio configuration interface, the configuration operation in the audio configuration interface, and the specific content of determining the first audio data can be understood by referring to the corresponding content in the foregoing embodiments, and will not be repeated here.

[0281] It should be noted that, compared to the previous embodiment, in this embodiment, after the electronic device is connected to the headphone box (or headphones), the pop-up interface 400 corresponding to Figure 4 may not be displayed. Instead, the user interface 500 shown in Figure 5 can be displayed directly. The user can select any audio selection control from the user interface 500 as the first audio selection control. Then, after the electronic device detects that the user has triggered the first audio selection control, it displays the audio configuration interface corresponding to the first audio selection control. In response to the user performing a series of configuration operations in the audio configuration interface, the first audio data is determined.

[0282] Based on the above embodiments, user operations may include: user triggering operation on the first audio selection control in at least one audio selection control and user configuration operation in the audio configuration interface.

[0283] In other embodiments, the electronic device determining the first audio data in response to a user operation may further include: the electronic device directly displays an audio configuration interface, and then, in response to the user's configuration operation in the audio configuration interface, determines the first audio data. In this embodiment, the specific content of the audio configuration interface, the configuration operation in the audio configuration interface, and the determination of the first audio data can be understood by referring to the corresponding content in the foregoing embodiments, and will not be repeated here.

[0284] It should be understood that the audio configuration interface can be displayed on any interface of an electronic device. For example, assuming the electronic device is currently displaying a video playback interface, the audio configuration interface can be displayed directly on the video playback interface.

[0285] Compared to the previous embodiment, in this embodiment, after the electronic device is connected to the headphone box (or headphones), the pop-up interface 400 corresponding to Figure 4 and the user interface 500 shown in Figure 5 are not displayed. Instead, any one of the audio configuration interfaces shown in Figure 6 (user interface 600), Figure 10 (user interface 1000), Figure 11 (user interface 1100), and Figure 12 (user interface 1200) can be directly displayed. Then, the electronic device can respond to the user's configuration operation in the displayed audio configuration interface and determine the first audio data.

[0286] Optionally, the audio configuration interface can be pre-configured or user-defined.

[0287] Optionally, the audio configuration interface can be displayed on the electronic device in full-screen, half-screen, or pop-up mode.

[0288] Based on the above embodiments, user operations may include configuration operations performed by the user in the audio configuration interface.

[0289] In the above embodiments, the audio configuration interface may also include a second image, which may be generated based on the first information. The details of the second image can be understood by referring to the descriptions of the first image in the preceding embodiments, and will not be repeated here.

[0290] In other possible embodiments, the electronic device's response to a user operation in determining the first audio data may further include: the electronic device directly determining the first audio data and setting the first audio data as the audio played when the headphone case is opened or closed. The specific content of the first information in this embodiment can be understood by referring to the embodiment described in Figure 12 above, and will not be repeated here.

[0291] In one example, the electronic device may, in response to a communication connection operation between the electronic device and the headphone box (or headphones), acquire first information; the electronic device may, based on the first information, determine first audio data.

[0292] Alternatively, in response to a communication connection between the electronic device and the headphone case (or headphones), the electronic device acquires first information at preset intervals. This first information may include one or more of the following: user emotional characteristics, location information, personal information, seasonal information, solar term information, time information, holiday information, weather information, temperature and humidity information, and user progress information on a first task. Based on the acquired first information, the electronic device can determine multiple first audio data. When the user opens or closes the headphone case, the headphone case's speaker can play the first audio data most recently accessed when the headphone case is opened or closed. In other words, when the headphone case is opened or closed, the time at which the first audio data played by the speaker is most recently accessed when the headphone case is opened or closed.

[0293] Based on this, user operation can be a communication connection operation between the electronic device and the headphone box (or headphones).

[0294] Based on the above embodiments, the first audio data can be used as both the audio played when the headphone case is opened and the audio played when the headphone case is closed. That is, the audio played when the headphone case is opened is the same as the audio played when the headphone case is closed. Of course, in other possible implementations, the first audio data can also be set only to the audio played when the headphone case is opened; or only to the audio played when the headphone case is closed.

[0295] In some embodiments, the electronic device may also generate a third image based on the first information. The display method, generation method, and uses of the third image can be found in the relevant content regarding the first image in the foregoing embodiments, and will not be repeated here.

[0296] It's easy to understand that, based on the aforementioned audio playback method, the first audio data can be played when the headphone case is opened or closed, enriching the functionality of the headphone case. By playing the first audio data, the user is directly and effectively informed that the headphone case has been opened, satisfying the user's need to confirm the status of the headphone case when direct observation is not possible (such as at night or in poor visibility), ensuring the accuracy of the operation. At the same time, this audio feedback can also help users accurately identify the open and closed state of the headphone case, improving the convenience and safety of use. This design also increases the interactivity and fun of using the headphone case, meeting users' needs for novelty, ease of use, and high interactivity in product experience, thus improving the user experience.

[0297] In addition, users can configure the audio content and sound characteristics of the first audio data to provide emotional value through the first audio data played when the headphone case is opened or closed. For example, playing a family member's voice or a snippet of a favorite song when the headphone case is opened can effectively improve the user's mood, help the user maintain a good mindset, and enhance the user's psychological comfort.

[0298] It should be understood that the sequence number of each step in the above embodiments does not imply the order of execution. The execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiments of this application.

[0299] Figure 23 is a schematic diagram of the structure of an audio playback device 2300 provided in an embodiment of this application. The audio playback device 2300 can be deployed in the headphone case of the aforementioned embodiment. Referring to Figure 23, the audio playback device 2300 may include an acquisition module 2301 and a playback module 2302.

[0300] Module 2301 is used to acquire the first audio data;

[0301] The playback module 2302 is used to allow the speaker to play the first audio data when the headphone box is opened or closed.

[0302] In some embodiments, the first audio data is determined based on one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

[0303] Figure 24 is a schematic diagram of the structure of an audio playback device 2400 provided in an embodiment of this application. The audio playback device 2400 can be deployed in the electronic device of the aforementioned embodiment. Referring to Figure 24, the audio playback device 2400 may include a determining module 2401 and a sending module 2402.

[0304] The determining module 2401 is used to determine first audio data, which is the audio data played by the speaker when the headphone box is opened or closed;

[0305] The sending module 2402 is used to send the first audio data to the headphone box.

[0306] In some embodiments, the determining module 2401 is further configured to: determine the first audio data in response to a user operation.

[0307] In some embodiments, the determining module 2401 is further configured to: acquire first information, the first information including one or more of the following: user's emotional characteristics information, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's execution progress information on the first matter; and generate the first audio data based on the first information.

[0308] In some embodiments, the determining module 2401 is further configured to: acquire second audio data; and determine the first audio data based on the second audio data.

[0309] In some embodiments, the second audio data is recorded audio; or, the second audio data is generated based on first information, the first information including one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

[0310] In some embodiments, the determining module 2401 is further configured to: acquire multiple audio data; and determine the first audio data from the multiple audio data according to a preset rule.

[0311] In some embodiments, the preset rules include one or more of the following information: the number of times the headphone box is opened and / or closed, the user-preset playback order, first information, and the playback volume of multiple audio data, wherein the first information includes one or more of the following: user's emotional characteristics information, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of the first task.

[0312] In some embodiments, acquiring multiple audio data includes: acquiring first information, the first information including one or more of the following: user emotional characteristics information, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's execution progress information for a first matter; and acquiring the multiple audio data based on the first information.

[0313] In some embodiments, determining the first audio data from the plurality of audio data according to a preset rule includes: determining second audio data from the plurality of audio data according to the preset rule; and determining the first audio data based on the second audio data.

[0314] In some embodiments, determining the first audio data based on the second audio data includes: generating the first audio data based on preset sound features and the second audio data, wherein the first audio data is audio data corresponding to the sound features.

[0315] In some embodiments, determining the first audio data based on the second audio data includes: acquiring the sound features of the second audio data; and generating the first audio data based on preset first text information and the sound features.

[0316] In some embodiments, determining the first audio data based on the second audio data includes: editing the second audio data to obtain the first audio data.

[0317] In some embodiments, the audio playback device 2400 further includes:

[0318] The display module is used to display a first image on a first interface when the headphone box is opened or closed. The first image is determined based on first information, which includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of a first task.

[0319] In some embodiments, the first image is used to display the battery information of the earphone case and / or the TWS earphones in a thematic manner.

[0320] It should be noted that the module division in the audio playback device provided in the above embodiments is illustrative and only represents a logical functional division. In actual implementation, other division methods may also be used. Furthermore, the functional modules in the various embodiments of this application can be integrated into a single processor, exist as separate physical entities, or be integrated into a single module. The integrated modules described above can be implemented in hardware or as software functional modules.

[0321] If the integrated module is implemented as a software functional module and sold or used as an independent product, it can be stored in a readable storage medium. Based on this understanding, the technical solutions of the embodiments of this application, in essence, or the parts that contribute to the prior art, or all or part of the technical solutions, can be embodied in the form of a software product. This software product is stored in a storage medium and includes several instructions to cause a device or processor to execute all or part of the steps of the methods in the various embodiments of this application. The aforementioned storage medium includes various media capable of storing program code, such as USB flash drives, portable hard drives, read-only memory (ROM), random access memory (RAM), magnetic disks, or optical disks.

[0322] Furthermore, the audio playback device and audio playback method embodiments provided in the above embodiments belong to the same concept, and their specific implementation process can be found in the method embodiments. For ease of reading, this device embodiment will not repeat the details of the foregoing method embodiments one by one, but it should be clear that the device in this embodiment can correspondingly implement all the contents of the foregoing method embodiments.

[0323] Based on the same inventive concept, embodiments of this application also provide an electronic device, including one or more processors; one or more memories and one or more computer programs; wherein the one or more computer programs are stored in one or more memories, and the one or more computer programs include instructions that, when executed by the electronic device, cause the electronic device to perform the methods shown in the above embodiments.

[0324] This application also provides a computer-readable storage medium storing a computer program that, when executed by a processor, implements the methods shown in the above embodiments.

[0325] This application also provides a computer program product storing a computer program that, when run by an electronic device, enables the electronic device to perform the methods shown in the above embodiments.

[0326] This application also provides a chip system including a processor and a memory, wherein the memory stores a computer program that, when executed by the processor, implements the methods shown in the above embodiments.

[0327] It should be understood that the processor mentioned in the embodiments of this application can be a central processing unit (CPU), or other general-purpose processors, digital signal processors (DSPs), application-specific integrated circuits (ASICs), field-programmable gate arrays (FPGAs), or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor can be a microprocessor or any conventional processor.

[0328] It should also be understood that the memory mentioned in the embodiments of this application can be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory. Non-volatile memory can be ROM, programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), or flash memory. Volatile memory can be RAM, which is used as an external cache. By way of example, but not limitation, many forms of RAM are available, such as static random access memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (SDRAM), double data rate synchronous dynamic random access memory (DDR SDRAM), enhanced synchronous dynamic random access memory (ESDRAM), synchronous linked dynamic random access memory (SLDRAM), and direct rambus RAM (DR RAM).

[0329] Those skilled in the art will understand that, for the sake of convenience and brevity, the specific working processes of the systems, devices, and units described above can be referred to the corresponding processes in the foregoing method embodiments, and will not be repeated here.

[0330] In the above embodiments, implementation can be achieved, in whole or in part, through software, hardware, firmware, or any combination thereof. When implemented in software, it can be implemented, in whole or in part, as a computer program product. The computer program product includes one or more computer instructions. When the computer instructions are loaded and executed on a computer, all or part of the processes or functions described in the embodiments of this application are generated. The computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device. The computer instructions can be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another. For example, the computer instructions can be transmitted from one website, computer, server, or data center to another via wired (e.g., coaxial cable, fiber optic, digital subscriber line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.) means. The computer-readable storage medium can be any available medium accessible to a computer or a data storage device such as a server or data center that integrates one or more available media. The available media can be magnetic media (e.g., floppy disks, hard disks, magnetic tapes), optical media (e.g., digital versatile discs (DVDs)), or semiconductor media (e.g., solid-state disks (SSDs)).

[0331] It is understood that the various numerical designations used in the embodiments of this application are merely for descriptive convenience and are not intended to limit the scope of the embodiments of this application. The order of the process numbers does not imply the order of execution; the execution order of each process should be determined by its function and internal logic.

[0332] References to "one embodiment" or "some embodiments" as described in this specification mean that one or more embodiments of this application include a specific feature, structure, or characteristic described in connection with that embodiment. Therefore, the phrases "in one embodiment," "in some embodiments," "in other embodiments," "in still other embodiments," etc., appearing in different parts of this specification do not necessarily refer to the same embodiment, but rather mean "one or more, but not all, embodiments," unless otherwise specifically emphasized. The terms "comprising," "including," "having," and variations thereof mean "including but not limited to," unless otherwise specifically emphasized.

[0333] In the embodiments of this application, the words "exemplarily" or "for example" are used to indicate examples, illustrations, or explanations. Any embodiment or design described as "exemplarily" or "for example" in the embodiments of this application should not be construed as being more preferred or advantageous than other embodiments or design solutions. Specifically, the use of the words "exemplarily" or "for example" is intended to present the relevant concepts in a specific manner.

[0334] Finally, it should be noted that the above-described embodiments are only used to illustrate the technical solutions of this application, and are not intended to limit them. Although this application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that modifications can still be made to the technical solutions described in the foregoing embodiments, or equivalent substitutions can be made to some of the technical features. Such modifications or substitutions do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of this application, and should all be included within the protection scope of this application.

Claims

1. An audio playback method, characterized in that, The method is applied to true wireless stereo (TWS) earbuds, the TWS earbuds including an earphone case with a speaker, the method comprising: Obtain the first audio data; When the headphone case is opened or closed, the speaker plays the first audio data.

2. The audio playback method according to claim 1, characterized in that, The first audio data is determined based on one or more of the following: the user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and the user's progress information on the first task.

3. An audio playback method, characterized in that, The method is applied to an electronic device that has a communication connection with TWS earphones, the TWS earphones including an earphone case with a speaker, and the method includes: Determine the first audio data, which is the audio data played by the speaker when the headphone box is opened or closed; The first audio data is sent to the earphone box.

4. The audio playback method according to claim 3, characterized in that, The determination of the first audio data includes: In response to a user action, the first audio data is determined.

5. The audio playback method according to claim 3 or 4, characterized in that, The determination of the first audio data includes: Obtain first information, which includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of the first task; The first audio data is generated based on the first information.

6. The audio playback method according to claim 3 or 4, characterized in that, The determination of the first audio data includes: Obtain the second audio data; The first audio data is determined based on the second audio data.

7. The audio playback method according to claim 6, characterized in that, The second audio data is the recorded audio; Alternatively, the second audio data is generated based on the first information, which includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

8. The audio playback method according to claim 3 or 4, characterized in that, The determination of the first audio data includes: Acquire multiple audio data; The first audio data is determined from the plurality of audio data according to preset rules.

9. The audio playback method according to claim 8, characterized in that, The preset rules include one or more of the following information: the number of times the headphone box is opened and / or closed, the user-preset playback order, first information, and the playback volume of the multiple audio data, wherein the first information includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the first task.

10. The audio playback method according to claim 8 or 9, characterized in that, The acquisition of multiple audio data includes: Obtain first information, which includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of the first task; The plurality of audio data are obtained based on the first information.

11. The audio playback method according to any one of claims 8-10, characterized in that, The step of determining the first audio data from the plurality of audio data according to a preset rule includes: The second audio data is determined from the plurality of audio data according to the preset rules; The first audio data is determined based on the second audio data.

12. The audio playback method according to claim 6, 7, or 11, characterized in that, Determining the first audio data based on the second audio data includes: The first audio data is generated based on the preset sound features and the second audio data, wherein the first audio data is the audio data corresponding to the sound features.

13. The audio playback method according to claim 6, 7, or 11, characterized in that, Determining the first audio data based on the second audio data includes: Obtain the sound features of the second audio data; The first audio data is generated based on the preset first text information and the sound features.

14. The audio playback method according to claim 6, 7 or 11, characterized in that, Determining the first audio data based on the second audio data includes: The second audio data is edited to obtain the first audio data.

15. The audio playback method according to any one of claims 3-14, characterized in that, The method further includes: When the headphone box is opened or closed, a first image is displayed on the first interface. The first image is determined based on first information, which includes one or more of the following: user's emotional characteristics, location information, personal information, seasonal information, solar term information, time information, festival information, weather information, temperature and humidity information, and user's progress information on the execution of a first task.

16. The audio playback method according to claim 15, characterized in that, The first image is used to display the battery information of the earphone case and / or the TWS earphones in a thematic manner.

17. An audio playback system, characterized in that, The audio playback system includes an electronic device and TWS earphones, the electronic device and the TWS earphones having a communication connection, the TWS earphones including an earphone case with a speaker, the electronic device for sending first audio data to the earphone case, and the speaker for playing the first audio data when the earphone case is opened or closed.

18. An electronic device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that, When the processor executes the computer program, it implements the method as described in any one of claims 1 to 16.

19. A computer-readable storage medium, characterized in that, The computer-readable storage medium stores a computer program that, when run on a computer, causes the computer to perform the method as described in any one of claims 1 to 16.

20. A chip system, characterized in that, include: A processor and a memory, the memory for storing a computer program, the processor for calling and running the computer program stored in the memory, causing a device or apparatus on which the chip system is mounted to perform the method as described in any one of claims 1 to 16.