Voice call data processing method and device, storage medium, and mobile terminal
A technology for voice calls and mobile terminals, applied in the fields of voice call data processing methods, storage media, mobile terminals, and devices, can solve the problems of complex sound, affecting users' use, and prone to whistling, and achieve the effect of reducing inconvenience
Active Publication Date: 2018-08-17
GUANGDONG OPPO MOBILE TELECOMM CORP LTD
5 Cites 4 Cited by
AI-Extracted Technical Summary
Problems solved by technology
However, in the voice call process, the voice call data contains many types of sounds, such as the voices of each player, the sound of the application program itself (such as the background sound or special ef...
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View moreMethod used
[0037] Exemplarily, playing a preset sound segment in a preset manner may include: playing a pre-recorded or pre-acquired sound segment through a speaker with a preset volume. Correspondingly, other mobile terminals can collect the sound signal corresponding to the preset sound segment through the microphone. The advantage of this setting is that the played sound can be heard by the user, allowing the user to know that it is currently in the distance detection stage, and allows the user to record personalized sound clips to enhance the fun. In addition, the speaker is an existing part of the mobile terminal, so there is no need to The detection of the distance can be realized by adding additional components in the mobile terminal, which saves the cost. Optionally, playing a preset sound segment in a preset manner may also include: playing an ultrasonic segment with a preset frequency and a preset intensity through an ultrasonic transmitter. Correspondingly, other mobile terminals can collect the sound signal corresponding to the preset sound segment through the ultrasonic receiver. The advantage of this setting is that during ultrasonic playback, the user will not hear the sound, and the distance detection can be realized without the user's awareness, so as to avoid interference to the user.
[0045] In some embodiments, the determining according to the feedback information that there is a target mobile terminal in the voice call group whose distance with the mobile terminal is less than a first preset distance value includes: when the When the feedback information includes information that the sound signal has been collected, it is determined that there is a target mobile terminal in the voice call group whose distance to the mobile terminal is smaller than a first preset distance value. The advantage of this setting is that the preset sound segment and the preset playback mode can be set according to the first preset distance value, so that whether the sound signal corresponding to the preset sound segment can be collected can be used as a critical value to directly determine the distance between the two mobile terminals. Whether the distance between them is less than the first preset distance value, there is no need to include information such as sound intensity in the feedback information, and there is no need to further calculate the actual distance based on information such as sound intensity, which can improve the efficiency of distance determination while reducing the amount of data transmission.
[0046] In some embodiments, the feedback information also includes the attribute information of the collected sound signal, and the anti-howling processing of the voice call data in the mobile terminal includes: according to the The attribute information determines the target distance between the target mobile terminal and the mobile terminal, and determines the target strength of the anti-howling processing according to the target distance; performs the voice call data in the mobile terminal according to the target strength Corresponding anti-howling treatment. The inventor found that the closer the distance between two mobile terminals, the more serious the howling usually occurs, so in the embodiment of the present application, the actual distance between the current mobile terminal and the target mobile terminal (i.e., the target distance) can be determined. Selecting the corresponding anti-howling processing strength (ie target strength) can achieve more targeted anti-howling processing and improve the anti-howling effect.
[0074] In some embodiments, performing howling suppression processing on the downlink voice communication data according to the howling points may also include: corresponding to frequencies of all howling points in the downlink voice communication data The audio signal is attenuated. Th...
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View moreAbstract
Embodiments of the application disclose a voice call data processing method, a voice call data processing device, a storage medium and a mobile terminal. The method comprises the following steps of: detecting that the preset sound play event is triggered after a voice call group in the preset application of the mobile terminal is successfully established; playing the preset sound clip in a presetmanner, and receiving the feedback information of other mobile terminals in the voice call group, wherein the feedback information contains results of other mobile terminals attempting to collect thesound signal corresponding to the preset sound clip; and performing anti-howling processing on the voice call data in the mobile terminal if it is determined that a target mobile terminal having the distance with the mobile terminal less than the first preset distance value exists in the voice call group according to the feedback information. By adopting the above technical solution, the application is capable of selecting an appropriate time for anti-howling processing after the voice call function of the application in the mobile terminal is enabled.
Application Domain
Technology Topic
Image
Examples
- Experimental program(1)
Example Embodiment
[0026] The technical solutions of the present application will be further described below in conjunction with the drawings and specific implementations. It can be understood that the specific embodiments described here are only used to explain the application, but not to limit the application. In addition, it should be noted that, for ease of description, the drawings only show a part of the structure related to the present application instead of all of the structure.
[0027] Before discussing the exemplary embodiments in more detail, it should be mentioned that some exemplary embodiments are described as processes or methods depicted as flowcharts. Although the flowchart describes the steps as sequential processing, many of the steps can be implemented in parallel, concurrently, or simultaneously. In addition, the order of the steps can be rearranged. The processing may be terminated when its operation is completed, but may also have additional steps not included in the drawings. The processing may correspond to methods, functions, procedures, subroutines, subroutines, and so on.
[0028] figure 1 This is a schematic flowchart of a voice call data processing method provided by an embodiment of this application. The method can be executed by a voice call data processing device, where the device can be implemented by software and/or hardware, and generally can be integrated in a mobile terminal. Such as figure 1 As shown, the method includes:
[0029] Step 101: After the voice call group in the preset application of the mobile terminal is successfully established, it is detected that the preset sound playing event is triggered.
[0030] Exemplarily, the mobile terminal in the embodiment of the present application may include mobile devices such as mobile phones and tablet computers. The preset application may be an application with a built-in voice group call function, such as an online game application, an online classroom application, a video conference application, or other applications that require multi-person collaboration, and so on.
[0031] Exemplarily, a voice call group may include two members, but in most cases, it generally includes three or more members, which can realize voice calls between three or more mobile terminals. The voice call group can be initiated and established by a user who uses a preset application on the mobile terminal. After the voice call group is successfully established, all mobile terminals included in the voice call group can communicate with each other. Generally, when the mobile terminal is not in the mute mode or the headset mode, it can be understood that the mobile terminal is in the speaker mode, and the voice of each user in the voice call group will be collected by the microphone of the mobile terminal they are using, and After network transmission and processing, it is played through the speakers of other users' mobile terminals. Take the game application as an example. If you need to team up to fight, you can turn on the team voice function. If there are 5 players in the team, then after the voice call group is established successfully, the 5 people can talk to each other, and any player can Hearing the words of the other 4 players at the same time, as if the other 4 players are talking next to him, it is convenient to communicate and play. The executive body of the technical solution of the present application, that is, the current mobile terminal, can be any mobile terminal in the voice call group, or one or several designated mobile terminals in the voice call group. That is to say, any mobile terminal in the voice call group can execute the method provided in the embodiment of this application, or one or more designated mobile terminals can execute the method provided in the embodiment of this application, or all mobile terminals can execute The method provided in the embodiment of this application.
[0032] Generally, when the mobile terminal is in the external playback mode, the sound collected by the mobile terminal’s microphone not only includes the user’s own speaking voice, but may also include the sound of the preset application played by the speaker, such as background music, etc. It includes the sound of the surrounding environment, and may also include the voice of other people in the voice call group played by the speaker. In this way, when multiple mobile terminals send data containing various sounds collected by each to the same mobile terminal via the network (for example, The voice call group contains 5 mobile terminals, then 4 of them will send their collected sounds to the server, and the server will send the sound data of the 4 mobile terminals to the fifth mobile terminal), and these sounds will be If the mobile terminal is mixed to play, howling may occur.
[0033] In the embodiments of the present application, in order to perform howling detection at an appropriate time and avoid useless howling detection at an inappropriate time to cause additional power consumption, it is possible to first detect whether there is a current movement in the voice conversation group. Other mobile terminals whose terminals are relatively close. In the application scenario of multi-person voice, the inventor found that when there is a relatively close distance between two mobile terminals, howling is prone to occur. Assuming that mobile terminal A and mobile terminal B in the voice call group are relatively close, the speaker of mobile terminal A will amplify and play the received sound collected by the microphone of mobile terminal B. Since the two mobile terminals are relatively close, this sound is It will be collected again by the microphone of mobile terminal B and sent to mobile terminal A, and the sound will continue to be amplified and played, which can easily form positive feedback amplification of the sound, thereby generating howling sound. Therefore, in the embodiment of the present application, it can be first determined whether there is another mobile terminal that is relatively close to the current mobile terminal during the voice call. If there is, the voice call data in the mobile terminal needs to be processed for howling prevention.
[0034] Specifically, in the embodiment of the present application, one mobile terminal can play a sound, and another mobile terminal collects the sound to determine the distance between two mobile terminals. The trigger condition of the preset sound playback event can be preset, and when the trigger condition is met, the preset sound playback event is triggered, so that it can be detected that the preset sound playback event is triggered. This application does not specifically limit the trigger condition. For example, the preset sound playback event can be triggered immediately when the voice call group is successfully established; it can also be triggered when the user clicks the trigger button; in addition, due to the voice A call group generally contains multiple mobile terminals. If each mobile terminal triggers a preset sound playback event at the same time, it may affect the distance detection result. Therefore, after the voice call group is successfully established, the server can Multiple mobile terminals in the voice call group respectively issue different triggering instructions. Correspondingly, when the mobile terminal receives the preset sound playing event triggering instruction issued by the server, the preset sound playing event is triggered.
[0035] Step 102: Play a preset sound clip in a preset manner, and receive feedback information from other mobile terminals in the voice call group, where the feedback information includes the other mobile terminal trying to collect the sound corresponding to the preset sound clip The result of the signal.
[0036] The purpose of playing the sound clip in the embodiments of this application is to allow other mobile terminals in the voice call group to try to collect the sound signal corresponding to the preset sound clip, and then determine the distance between the other mobile terminal and the current mobile terminal, in order to communicate with other voices. To distinguish and eliminate interference items, the preset sound segment can be a relatively special sound set in advance. For example, some uncommon sounds in life can be recorded as preset sound clips before the mobile terminal leaves the factory, such as the voices of animated characters, the sounds of animals such as dinosaurs, the whistling of trains, or the sounds written by a sound generator, etc.; and For example, users can also record their own voice and so on. The played preset sound segment can be regarded as a standard sound, and the mobile terminal learns the properties of the preset sound segment such as the sound intensity, time length, timbre tone, and voiceprint information in advance. In addition, the playback mode of the mobile terminal may also affect the collection effects of other mobile terminals. The playback mode may include playback volume, etc. If the playback volume is set to be low, the mobile terminal that is farther away may not be able to collect sound signals, so In the embodiment of this application, it is also necessary to preset the playback mode.
[0037] Exemplarily, playing a preset sound clip in a preset manner may include: playing a pre-recorded or pre-acquired sound clip at a preset volume through a speaker. Correspondingly, other mobile terminals can collect the sound signal corresponding to the preset sound segment through the microphone. The advantage of this setting is that the played sound can be heard by the user, which allows the user to understand that the user is currently in the distance detection stage, and allows the user to record personalized sound clips to enhance the interest. In addition, the speaker is an existing component in the mobile terminal and does not need to be Additional components can be added to the mobile terminal to realize distance detection and save costs. Optionally, playing a preset sound segment in a preset manner may also include: playing an ultrasonic segment with a preset frequency and a preset intensity through an ultrasonic transmitter. Correspondingly, other mobile terminals can collect the sound signal corresponding to the preset sound segment through the ultrasonic receiver. The advantage of this setting is that the user will not hear the sound during the ultrasonic playback, and the distance detection can be realized without the user noticing it, so as to avoid interference to the user.
[0038] Exemplarily, the aforementioned preset volume, or preset frequency and preset intensity can be set according to the first preset distance value below. The mobile terminal can play preset sound clips in different playback modes through theoretical calculations or simulation experiments. The debugging target is that other mobile terminals whose distance from the mobile terminal is less than the first preset distance value can be collected. For the corresponding sound signal, record the playback mode when the debugging target is reached as the preset playback mode. The first preset distance value can be set according to actual needs, for example, it can be 10 meters or 20 meters. When the distance between the two mobile terminals is less than the first preset distance value, the possibility of howling occurs is higher.
[0039] Exemplarily, before playing the preset sound clip in a preset manner, the mobile terminal may further include: sending start playback information to the server corresponding to the preset application, where the start playback information is used to instruct the server to notify the The other mobile terminals in the voice call group start to try to collect the sound signal corresponding to the preset sound segment and generate feedback information. Among them, the preset sound segment may be agreed in advance between the current mobile terminal and other mobile terminals in the voice call group, that is, other mobile terminals know in advance what kind of sound signal the preset sound segment corresponds to. The appointment process can be in the voice call After the group is successfully established, it can also be completed when a preset sound playback event is triggered. For example, the server can act as an intermediary to assist in the appointment process between mobile terminals, which is not limited in the embodiment of the present application. If users often talk in the form of voice talk group, you can record the agreed content and use it directly when you need it next time. There is no need to make an appointment again, saving time.
[0040] In the embodiment of the present application, the feedback information may be forwarded by the server corresponding to the preset application. Exemplarily, the result contained in the feedback information may refer to whether other mobile terminals can collect the sound signal. When other mobile terminals can collect the sound signal corresponding to the preset sound segment, it means that the distance between the two mobile terminals is less than the first preset distance value.
[0041] In addition, the feedback information may also include the attribute information of the collected sound signal, such as sound intensity. Since the intensity of the sound played by the mobile terminal is known, it will be attenuated as the sound spreads. The higher the attenuation degree, the distance between the other mobile terminal and the current mobile terminal can be determined according to the strength information of the sound signal in the feedback information, and it can be judged whether the distance is less than the first preset distance value.
[0042] Step 103: If it is determined according to the feedback information that there is a target mobile terminal in the voice call group whose distance from the mobile terminal is less than the first preset distance value, then anti-whistling is performed on the voice call data in the mobile terminal Called processing.
[0043] In the embodiment of the present application, when it is determined that there are other mobile terminals in the voice call group whose distance from the current mobile terminal is less than the first preset distance value, it is considered that howling is prone to occur at this time, so it is necessary to talk to the voice call The data is processed for howling prevention. The voice call data may include uplink voice call data and/or downlink voice call data, which is not specifically limited in this application. Among them, the uplink voice call data may include the voice data collected by the microphone of the mobile terminal; the downlink voice call data may be the server corresponding to the preset application after receiving the voice data of other mobile terminals in the voice call group, and then mixing, etc. For processing data sent to a mobile terminal, or data directly forwarded to a mobile terminal, this application does not limit the processing method of the server for processing voice call data. The embodiment of the present application does not limit the specific manner of howling prevention processing, and the specific implementation manner will be given as a schematic description below.
[0044] In the voice call data processing method provided in the embodiment of the present application, after the voice call group in the preset application of the mobile terminal is successfully established, it is detected that the preset sound playback event is triggered, and the preset sound clip is played in a preset manner. And receive feedback information from other mobile terminals in the voice call group. If it is determined according to the feedback information that there is a target mobile terminal in the voice call group whose distance from the mobile terminal is less than the first preset distance value, then the voice call in the mobile terminal The data is processed for howling prevention. By adopting the above technical solution, after the voice call group of the preset application in the mobile terminal is successfully established, when it is detected that there are other mobile terminals in the voice call group that are close to the current mobile terminal, the current mobile terminal The voice call data is processed for howling prevention to reduce the inconvenience caused by howling sounds to users.
[0045] In some embodiments, the determining according to the feedback information that there is a target mobile terminal in the voice talk group whose distance from the mobile terminal is less than a first preset distance value includes: when the feedback information is When the information including the collected sound signal is included, it is determined that there is a target mobile terminal whose distance to the mobile terminal is less than a first preset distance value in the voice conversation group. The advantage of this setting is that the preset sound clip and the preset playback mode can be set according to the first preset distance value, so that whether the sound signal corresponding to the preset sound clip can be collected as a critical value can directly determine the difference between the two mobile terminals. Whether the distance between the two is smaller than the first preset distance value, there is no need to include information such as sound intensity in the feedback information, and there is no need to further calculate the actual distance based on the information such as sound intensity, which reduces the amount of data transmission and improves the efficiency of distance determination.
[0046] In some embodiments, the feedback information further includes the collected attribute information of the sound signal, and the anti-howling processing on the voice call data in the mobile terminal includes: according to the attribute information Determine the target distance between the target mobile terminal and the mobile terminal, determine the target strength of anti-howling processing according to the target distance; perform corresponding defense on the voice call data in the mobile terminal according to the target strength Howling processing. The inventor found that the closer the distance between the two mobile terminals, the more severe the howling usually occurs. Therefore, in this embodiment of the application, the actual distance between the current mobile terminal and the target mobile terminal (ie, the target distance) can be used to determine Select the corresponding anti-howling processing intensity (ie target intensity) to achieve more targeted anti-howling processing and improve the anti-howling effect.
[0047] It is understandable that there may be more than one target mobile terminal. When there are more than two target mobile terminals, there are also more than two corresponding target distances. The target distance with the smallest value can be used as the standard to determine the anti-howling process Target strength.
[0048] In some embodiments, the determining the target strength of anti-howling processing according to the target distance includes: determining, according to the target distance, to prevent uplink voice call data and/or downlink voice call data in the mobile terminal. Howling processing. Wherein, when the target distance is greater than the second preset distance value, it is determined to perform anti-howling processing on the uplink voice call data in the mobile terminal; when the target distance is greater than the third preset distance value and less than or equal to When the second preset distance value, it is determined to perform anti-howling processing on the downlink voice call data in the mobile terminal; when the target distance is less than or equal to the third preset distance value, it is determined to The uplink voice call data and downlink voice call data in the mobile terminal are processed for howling prevention. Downlink voice call data generally has a relatively large amount of data, and the howling suppression processing is slightly more difficult and time-consuming. Therefore, in this embodiment of the present application, when the target distance is less than the first preset distance value but greater than the second preset distance, Explain that if howling is generated, the degree of howling is not serious, and the howling prevention processing can be performed only for the uplink voice call data. When the target distance is less than or equal to the second preset distance value but greater than the third distance value, it means that if howling occurs, the degree of howling will be more serious. You can perform anti-howling processing for the downlink voice call data to get better The anti-howling effect. When the target distance is very small, less than or equal to the third preset distance value, it means that if howling occurs, the degree of howling will be very serious. You can perform anti-howling processing for both downlink voice call data and downlink voice call data at the same time to obtain The best anti-howling effect. In this application, the values of the first preset distance, the second preset distance, and the third preset distance are sequentially reduced, but the specific values are not limited. For example, the first preset distance is 10 meters, and the second preset distance is Set the distance to 5 meters, and the third preset distance to 2 meters.
[0049] Exemplarily, when performing anti-howling processing on downlink voice call data, howling detection can be performed on the downlink voice call data first, and when it is determined that there is a howling sound, attenuation processing is performed on the howling point to achieve anti-howling Called effect.
[0050] In some embodiments, the following methods can be used to determine whether there is a howling sound in the downlink voice call data:
[0051] The first method is to perform block processing on the downlink voice call data; for each data block, use a preset analysis method to determine the suspected howling points in the current data block; when there are multiple suspected howling points with periodic characteristics When a point group is called, and the energy value corresponding to the suspected howling point is increasing according to the order of the data block to which it belongs, it is determined that there is a howling sound in the downlink voice call data; wherein the group of suspected howling points are consecutively adjacent A suspected howling point where the frequency difference in the data block is within a preset range, and the number of consecutive adjacent data blocks reaches a preset consecutive threshold.
[0052] The second method is to perform block processing on the downlink voice call data to obtain M data blocks; use a preset analysis method to sequentially analyze whether there is a suspected howling point in the current data block, and the data block for the suspected howling point will appear for the first time Determined as the starting data block; starting from the starting data block, n data blocks are successively used as the data segment to be analyzed, and the suspected howling points contained in the current data segment are analyzed using the preset analysis method, when When the frequency difference between the suspected howling points included in the N data segments is within the preset range, it is determined that there is a howling sound in the downlink voice call data; where n=2, 3,..., N; N is less than Or equal to M, greater than or equal to 2; the starting point of each data segment is the same as the starting point of the initial data block, and the initial data block is the first data segment.
[0053] Of course, other methods may also be used in the embodiment of the present application to determine whether there is a howling sound in the downlink voice call data, which is not limited in the present application. The following two methods are used as examples for detailed description.
[0054] For the first method, the block processing of the downlink voice call data may be block processing according to a preset unit length, and the preset unit length may be, for example, 40 milliseconds. Assuming that the preset time length is 1.2 seconds and the preset unit length is 40 milliseconds, it can be divided into 30 data blocks.
[0055] The embodiment of the present application does not specifically limit the preset analysis method. For example, the preset analysis method may include: obtaining in the frequency domain the to-be-determined frequency points whose energy value is higher than a preset energy threshold in the high-frequency region, and calculating the energy of a preset number of frequency points around the to-be-determined frequency point The difference value, when the energy difference value is greater than a preset difference threshold, the frequency point to be determined is determined to be a suspected howling point; the high-frequency area is a frequency range whose frequency is higher than the preset frequency threshold.
[0056] Specifically, for the current data block, it can be transformed from the time domain to the frequency domain first to facilitate spectrum analysis. The transformation method is not limited in the embodiment of the present application, and a Fourier transformation method, such as a fast Fourier transform (Fast Fourier Transformation, FFT) algorithm, may be used. Taking 40ms as an example, the size of 40ms audio data (16bit, 16k sampling rate) is 40*16*16/2=1280 bytes, which is suitable for the use of 1024 for FFT transformation for spectrum analysis. The frequency analysis after FFT processing The frequency range is 0~16K/2, the step size is (16K/2)/1024, and the step size is about 8Hz.
[0057] In the embodiment of the present application, a preset frequency threshold may be used as a boundary value to divide the high-frequency region and other regions. The preset frequency threshold can be set according to actual conditions. For example, it can be set according to the human voice frequency and the frequency characteristics of howling easily, for example, it can be 1KHz, 1.5KHz, or 2KHz. For example, the preset frequency threshold is 2KHz, that is, the part greater than 2KHz is the high frequency area. Generally, the frequency of the howling sound appears in the high-frequency region, and the sound is louder (that is, the energy value is higher). The embodiment of the present application can quickly determine the suspected howling point in a data block according to the characteristics of the energy value distribution.
[0058] Exemplarily, the energy value corresponding to each frequency point (frequency point for short) in the data block is obtained, and then the frequency point to be determined whose energy value is higher than the preset energy threshold is found from the high frequency area, and the prediction around the frequency point to be determined is calculated. Set the energy difference value of the number of frequency points. The preset energy threshold and the preset number can be set according to actual needs. For example, the preset energy threshold can be -10dB, and the preset number can be 8 (4 in front of the frequency to be determined and 4 in the back). The above step is about 8Hz as an example. Assuming the frequency value of the frequency to be determined is 3362Hz, the frequency value of the preset number of frequency points around it is about 3330Hz, 3338Hz, 3346Hz, 3354Hz, 3370Hz, 3378Hz, 3386Hz and 3394Hz . The energy difference value is used to measure the degree of difference between the frequency point to be determined and the preset number of frequency points around. Specifically, it can be the difference between the maximum energy value and the minimum energy value, or the energy variance value or the energy mean square deviation value. Wait, this application is not limited. The preset difference threshold value corresponds to the energy difference value. For example, when the energy difference value is the energy variance value, the preset difference threshold value is the preset variance threshold value. When the energy difference value is greater than the preset difference threshold, it indicates that the frequency point to be determined is more prominent, and it is very likely to be a howling point. Therefore, the frequency point to be determined is determined to be a suspected howling point. This setting can quickly and accurately identify the suspected howling point, laying a foundation for improving the efficiency of howling detection.
[0059] Exemplarily, there may be multiple frequency points to be determined in a data block, and this application can start the determination of suspected howling points from the frequency point to be determined with the highest corresponding energy.
[0060] In addition, the preset analysis method may further include: obtaining a first frequency point with the largest energy value in the high-frequency region and a second frequency point with the largest energy value in the low-frequency region in the frequency domain, when the first frequency point satisfies When the suspected howling condition is preset, it is determined that the first frequency point is a suspected howling point in the current data block, and the preset suspected howling condition includes that the energy value of the first frequency point is greater than a preset energy threshold; And the energy difference between the first frequency point and the second frequency point is greater than a preset difference threshold.
[0061] Specifically, for the current data block, it can be transformed from the time domain to the frequency domain first to facilitate spectrum analysis. Similarly, the division frequency can be preset as the boundary value to divide the high-frequency area and the low-frequency area. The preset division frequency can be set according to actual conditions. For example, it can be set according to the human voice frequency and the frequency characteristics that are prone to howling sound. For example, it can be 1KHz, 1.5KHz, or 2KHz. For example, the preset division frequency is 2KHz, that is, the part greater than 2KHz is the high frequency area, and the part less than or equal to 2KHz is the low frequency area.
[0062] Exemplarily, the energy value corresponding to each frequency point in the data block is obtained, and then the first frequency point with the largest energy value is found from the high frequency area, and the second frequency point with the largest energy value is found from the low frequency area. If the first frequency When the energy value of the point is greater than the preset energy threshold (such as -30dB), and the difference between the energy value of the first frequency point and the energy value of the second frequency point is greater than the preset difference threshold (such as 60), it can be considered as the first The frequency point is the suspected howling point in the current data block. This setting can quickly and accurately identify the suspected howling point, laying a foundation for improving the efficiency of howling detection.
[0063] Exemplarily, for each data block, the above preset analysis method is used to determine whether there is a suspected howling point, if there is, record the suspected howling point, and further determine whether the current downlink voice call data contains howling sound.
[0064] It is understandable that if there is a suspected howling sound in a certain data block, it cannot be considered that the entire downlink voice call audio contains howling sound. It may also be because some special sounds are misidentified as howling sounds, such as objects. The harsh sound produced during friction is generally higher in frequency and louder, and it is likely to be recognized as a suspected howling sound, but this sound is generally short and has a short duration. It is not a howling sound. Therefore, it needs to be increased. Further judgment.
[0065] In the embodiments of the present application, the distribution characteristics of the suspected howling sounds in each data block are analyzed. When there are suspected howling points with small frequency differences in multiple consecutive adjacent data blocks, these suspected howling points can be called a group of suspected howling points. That is, the suspected howling point group is the suspected howling point whose frequency difference in consecutive adjacent data blocks is within a preset range, and the number of consecutive adjacent data blocks reaches the preset continuous threshold. Among them, the preset continuous threshold may be determined according to actual conditions, for example, three; the preset range corresponding to the frequency difference may also be determined according to actual conditions, for example, 40 Hz. The inventor found that the howling sound generally shows continuous characteristics in a short period of time and appears periodically, and the sound gradually becomes louder. Therefore, in the embodiment of the present application, multiple (understandable as greater than or equal to 2) suspected howling point groups exhibit periodic characteristics, and the energy values corresponding to the suspected howling points show an upward trend in accordance with the order of the data blocks. Determine the conditions to identify whether there is a howling sound in the current downlink voice call data. If the above conditions are met, it is determined that there is a howling sound, so that the howling sound can be quickly and accurately identified.
[0066] Exemplarily, it is assumed that the downlink voice call data is divided into 30 data blocks. For example, if 1, 2, 3, 7, 8, 9, 13, 14, 15, 19, 20, 21, 25, 26, and 27 are detected in 15 data blocks, the frequency is detected in (A-40, A +40) The suspected howling points in the interval, the suspected howling points corresponding to every 3 data blocks become a suspected howling point group, the 5 suspected howling point groups are periodic, and the energy corresponding to the suspected howling point The values increase sequentially, so it is determined that howling sounds are included in the downlink voice call data. For another example, if only the suspected howling points with frequencies in the (B-40, B+40) interval are detected in the first, second, and third data blocks, the suspected howling points corresponding to these three data blocks become There is a group of suspected howling points, but only this one exists and does not show periodic characteristics. Therefore, it can be determined that the downlink voice call data does not contain howling sounds.
[0067] For the second method, the block processing method and the preset analysis method can refer to the related content in the first method, which will not be repeated in this embodiment of the application.
[0068] Specifically, the above-mentioned preset analysis method is used to analyze whether there is a suspected howling point in the first data block. If it exists, the suspected howling point appears for the first time, and the first data block is determined as the starting data block; if it does not exist , The next data block of the current data block is taken as the new current data block, and the aforementioned preset analysis method is used to analyze whether there is a suspected howling point in the new current data block. By analogy, until the data block where the suspected howling point appears for the first time is determined as the starting data block, if there is no suspected howling point in the M data blocks, it can be considered that the current downlink voice call data does not contain howling sound.
[0069] Taking the above block method as an example, M=30, 2≤N≤30. When performing spectrum analysis, the length of the data to be analyzed will have an impact on the analysis results, because the accuracy may not be too accurate when there are fewer data points. Therefore, using longer data to analyze again is equivalent to a correction process. , Can more accurately determine whether it is howling. This application does not limit the specific value of N. Assuming that N=4 and the length of a data block is 40ms, then the time range of the initial data block can be recorded as 0-40ms, because the initial data block has been analyzed, and As the first data segment, so starting from n=2, it is the second data segment. The time range of the second data segment can be recorded as 0 to 80ms, and so on, the time range of the third data segment can be recorded as 0 To 120ms, the time range of the third data segment can be recorded as 0 to 160ms.
[0070] Exemplarily, the preset range can be set according to actual conditions, for example, it can be 40 Hz (as in the above example, it can be considered as equivalent to 5 steps). Assuming that the frequencies of the suspected howling points analyzed by the 4 data segments are A, B, C, and D, and the differences between A, B, C, and D are all within 40 Hz, then the downlink voice call data can be determined There is a howling sound.
[0071] Optionally, if the frequency difference between the suspected howling point contained in the current data segment and the suspected howling point contained in the previous data segment is not within the preset range, the next data from the current data segment The block starts to acquire the downlink voice call data of the preset time length, and repeats the relevant operations of performing block processing on the downlink voice call data. The advantage of this setting is that when the frequency difference between the suspected howling points contained in any two data segments is large, it can indicate that the previous suspected howling points may not be the real howling points, and the detection needs to be continued, and no correction is required. The subsequent data segment detects suspected howling points to save power consumption and improve the efficiency and accuracy of howling sound detection. For example, when the difference between C and A or B exceeds 40 Hz, starting from 120 ms, the downlink voice call data of a preset time length in the mobile terminal is re-acquired, and the downlink voice call data is processed in blocks , Obtain M data blocks, then determine a new starting data block, and continue to use the above method to determine whether there is a howling tone in the downlink voice call data.
[0072] After determining that there is a howling sound in the downlink voice call data, the method further includes: determining the suspected howling point as a howling point; and performing howling suppression processing on the downlink voice call data according to the howling point. After it is determined that there is a howling sound in the downlink voice call data, it means that the suspected howling point that was previously identified that satisfies the howling sound determination condition is indeed a howling point. Then, howling suppression processing should be performed on the downlink voice according to the howling point. Prevent the howling sound from playing out from the speaker or earpiece, affecting the user's use. Further, after the howling suppression processing is performed, the downlink voice call data after the howling suppression processing is played through the speaker or the earpiece.
[0073] In some embodiments, the performing howling suppression processing on the downlink voice call data according to the howling point includes: selecting a preset number of frequencies of howling points corresponding to higher energy values as the target frequency, Perform attenuation processing on the audio signal corresponding to the target frequency in the downlink voice call data. The preset number can be set freely, such as 1, 3, or even more, and it can also be dynamically determined according to the number of howling points. The howling points can be sorted in the order of energy value from high to low, selecting the preset number of howling points in front, and determining the frequency of the selected howling points as the target frequency. The higher the energy value, the louder the howling sound, and the higher the impact on the user. The advantage of this setting is that it can more targeted the suppression of howling frequencies with higher energy values and improve the howling suppression efficiency , To ensure the timeliness of voice calls.
[0074] In some embodiments, the performing howling suppression processing on the downlink voice call data according to the howling point may also include: performing audio signals corresponding to the frequencies of all howling points in the downlink voice call data Perform attenuation processing. The advantage of this setting is that it can comprehensively suppress howling at all the howling points and prevent the playback of howling sounds.
[0075] Exemplarily, a notch filter may be used to attenuate the audio signal corresponding to the frequency of the howling point that needs to be suppressed (ie, the target frequency). The notch filter can quickly attenuate the input signal at a certain frequency point to achieve the filtering effect of blocking the passage of the frequency signal. This application does not limit the types of notch filters and specific parameter values. Generally, the target frequency is used as the center frequency of the notch filter, and the processing bandwidth and gain of the notch filter can be set according to actual needs.
[0076] Exemplarily, when performing anti-howling processing on uplink voice call data, the voice data collected by the mobile terminal can be obtained; the voice data is separated from the human voice and the background sound; the separated background sound is attenuated; After the attenuated background sound and the separated human voice are mixed, the processed uplink voice call data is sent to the server corresponding to the preset application. The advantage of this setting is that it can effectively attenuate howling caused by background sound. Exemplarily, when there is a microphone array in the mobile terminal (the number of microphones is greater than or equal to 2), the sound source position can be determined, and the sound that is far away from the mobile terminal (for example, greater than 1 meter) can be filtered out as the background sound according to the sound source position Or, the voiceprint information of the user of the mobile terminal may be obtained in advance, and the voice of the user's speech can be extracted from the voice data according to the voiceprint information as the human voice, and the remaining voice as the background sound. Exemplarily, the attenuation of the separated background sound may be to reduce the sound of the background sound by adjusting the gain, or to filter the background sound. After the background sound is attenuated, the volume is reduced, destroying the condition of increasing sound, and effectively attenuating the howling caused by the background sound.
[0077] In addition, the anti-howling processing for the uplink voice call data can also be performed according to the howling detection result of the downlink voice call data. If there is a howling sound in the downlink voice call data, when the downlink voice call data is played through the speaker or earpiece of the mobile terminal, the howling sound will be played and the user can hear it. In addition, the microphone of the mobile terminal can also collect the howling sound , That is, the uplink voice call data of the mobile terminal will also contain howling tones. In the embodiment of the application, the anti-howling processing is performed on the uplink voice call data to prevent the howling sound from being transmitted to the network again and being received by other mobile terminals, thereby destroying the condition that the howling sound becomes larger and larger, thereby achieving the anti-howling sound The purpose of the call. Specifically, a preset notch filter can be used to perform anti-howling processing on the uplink voice call data in the mobile terminal, where the center frequency of the preset notch filter is the frequency corresponding to the howling point in the downlink voice call data .
[0078] Optionally, the anti-howling processing for uplink voice call data may also be: obtaining the first uplink data currently to be uploaded and the buffered second uplink data uploaded at the last moment, and judging that the first uplink data is Whether the similarity of the second uplink data is higher than a preset similarity threshold, and if so, the first uplink data is weakened. The advantage of this setting is that it can quickly determine whether attenuation processing is needed, and improve the efficiency of anti-howling processing. Wherein, performing attenuation processing on the first uplink data may include: attenuating or filtering out audio data with the same characteristics as the second uplink data in the first uplink data. Attenuation here can include reducing sound energy. Further, the first uplink data and the second uplink data can be simulated superimposed to determine whether the superimposed data contains a howling feature, and if so, the first uplink data is attenuated. The howling characteristics may include energy concentration, periodicity, and frequency higher than a preset frequency threshold. It is also possible to determine whether the superimposed data contains the howling feature according to the above-mentioned manner of howling detection is performed on the downlink voice call data, which is not limited in the embodiment of the present application.
[0079] In some embodiments, the performing corresponding anti-howling processing on the voice call data in the mobile terminal according to the target strength includes: acquiring downlink voice call data of a preset time length in the mobile terminal; Howling points in the downlink voice call data; using a preset notch filter to perform howling suppression processing on the voice call data in the mobile terminal; wherein the center frequency of the preset notch filter is the howling The frequency, processing width and gain value corresponding to the call point are determined by the target intensity. The higher the target intensity, the wider the corresponding processing width or the smaller the corresponding gain value. A preset notch filter is used to perform howling suppression processing on the voice call data, where the voice call data may include uplink voice call data and/or downlink voice call data, which is not limited in this embodiment of the application. The advantage of this setting is that the processing width or gain value of the notch filter can be preset according to different target distances, so as to achieve different degrees of howling suppression processing.
[0080] figure 2 A schematic diagram of the original howling speech spectrum analysis provided by the embodiments of this application. The figure shows a howling segment intercepted from a certain audio containing howling sound. The energy value corresponding to 3362 Hz in the audio is the largest. It is -6.64dB. If it is played out, the sound will be loud, and howling suppression processing is required. image 3 This is a schematic diagram of a notch filter provided by an embodiment of this application. When performing howling suppression processing on downlink voice call data, the center frequency of the preset notch filter is the frequency corresponding to the howling point, such as figure 2 In 3362Hz, the corresponding processing width can be determined according to the target intensity. The processing width is the range that needs to be processed around the center frequency. Assuming that the processing width corresponding to -6.64dB is 50Hz, the range that needs to be processed is 3312Hz to 3412Hz, that is, the audio signal in this range is processed. The advantage of determining the processing width according to the target intensity is that there may be high-energy frequency points near the howling point, and by expanding the processing range, a better suppression effect can be obtained. In addition, such as figure 2 As shown, the notch filter is provided with a gain value, which is represented by g. The gmin in the figure represents the minimum value of the gain value, and the value range is generally 0 to 1. The larger the g value, the smaller the attenuation. Therefore, in this application, the gain value can be determined according to the target intensity. The higher the target intensity, the smaller the gain value, so that the higher the energy attenuation in the processing range, and the higher the degree of suppression of howling sounds. Figure 4 A schematic diagram of howling speech spectrum analysis after notch filter processing is provided in this embodiment of the application, such as Figure 4 As shown, the energy in the range of 3312 Hz to 3412 Hz is significantly attenuated after being processed by the notch filter, thereby achieving the suppression of howling sound.
[0081] Figure 5 A schematic flow diagram of another method for processing voice call data provided in an embodiment of this application. Taking the preset application as an online game application as an example, the method includes the following steps:
[0082] Step 501: After the voice call group in the game application is successfully established, it is detected that a preset sound playing event is triggered.
[0083] Exemplarily, take a team battle game as an example. For example, there are 5 players in each team, and the red and blue teams play against each other. The 5 players of each team need to communicate and discuss the battle strategy. Therefore, many players Will choose to enable the voice call function in the team. For example, after a player applies for the voice call function in the team, the voice call group is successfully established. After starting the formal voice call, any one of the 5 players in the same team can hear the voices of the other 4 players. Generally, players will set the mobile terminal to an external playback mode to facilitate the game. In the embodiment of the present application, after the voice call group is successfully established, the voice call will not start immediately, and the game server can follow the time sequence and interval (the time when the sound is played and the distance between the current mobile terminal and other mobile terminals can be detected OK, such as 50 seconds) Send a preset sound playback event trigger instruction to 5 mobile terminals in sequence, and trigger the preset sound playback event when the current mobile terminal receives the instruction.
[0084] Step 502: Play a pre-recorded or pre-acquired sound clip at a preset volume through a speaker.
[0085] Step 503: Receive feedback information from other mobile terminals in the voice call group.
[0086] Exemplarily, the feedback information is collected by the game server and sent to the current mobile terminal.
[0087] Step 504: Determine, according to the feedback information, whether there is a target mobile terminal with a distance less than X from the mobile terminal in the voice call group, if yes, perform step 505; otherwise, perform step 510.
[0088] If the mobile terminals of two of the five players are relatively close, such as two good friends playing together at home, and the mobile terminal is set to the outdoor mode at the same time, it is very easy to cause howling. Therefore, in the embodiment of the present application, it can be determined first whether there are other mobile terminals that are relatively close to the current mobile terminal in the voice call group, and if they exist, anti-howling processing is required.
[0089] Step 505: Determine the target distance D between the target mobile terminal and the mobile terminal according to the attribute information.
[0090] Step 506: Judge the magnitude relationship between D and Y and Z; D> When Y, go to step 507; Z When
[0091] Among them, the values of X, Y, and Z decrease sequentially.
[0092] Step 507: Start a voice call, and perform anti-howling processing on the uplink voice call data in the mobile terminal.
[0093] Step 508: Start a voice call, and perform anti-howling processing on the downlink voice call data in the mobile terminal.
[0094] Step 509: Start a voice call, and perform anti-howling processing on the uplink voice call data and the downlink voice call data in the mobile terminal.
[0095] Step 510: Start a voice call without performing anti-howling processing.
[0096] After the voice call group in the game application is successfully established in the embodiment of the application, the voice call will not be started immediately. Instead, the voice call group is first determined by playing a preset sound clip and receiving feedback information to determine whether there is a voice call group with the current mobile terminal. If there are other mobile terminals that are close to each other, the specific distance value is determined, and the uplink and/or downlink voice call data are selected according to the distance value to prevent howling. After the voice call is started, the corresponding voice call data Anti-howling processing is performed to weaken the interference of howling sound to the user's game process, reduce the pain points of game players, and make the function of the mobile terminal more perfect.
[0097] Image 6 This is a schematic flow diagram of another method for processing voice call data provided in this embodiment of the application, still taking an online game application as an example, the method includes:
[0098] Step 601: After the voice call group in the game application is successfully established, it is detected that a preset sound playing event is triggered.
[0099] Step 602: Play an ultrasonic segment with a preset frequency and a preset intensity through the ultrasonic transmitter.
[0100] Step 603: Receive feedback information from other mobile terminals in the voice call group.
[0101] Step 604: Determine, according to the feedback information, whether there is a target mobile terminal with a distance less than E from the mobile terminal in the voice call group, if yes, perform step 605; otherwise, perform step 608.
[0102] Step 605: Determine the target distance F between the target mobile terminal and the mobile terminal according to the attribute information.
[0103] Step 606: Determine the processing width and gain value of the preset notch filter according to F.
[0104] Among them, the smaller the F value, the larger the corresponding processing width, and the smaller the gain value.
[0105] Step 607: Start a voice call, and use a preset notch filter to perform howling suppression processing on the voice call data in the mobile terminal.
[0106] Exemplarily, the howling detection is performed on the downlink voice call data, the howling point is determined, and the center frequency of the preset wave trap is the frequency corresponding to the howling point.
[0107] Step 608: Start a voice call without performing anti-howling processing.
[0108] After the voice call group in the game application is successfully established in the embodiment of the application, the voice call will not be started immediately. Instead, the voice call group is first determined by playing a preset sound clip and receiving feedback information to determine whether there is a voice call group with the current mobile terminal. If there are other mobile terminals that are close to each other, the specific distance value is determined, and the parameters of the preset notch filter to be used for anti-howling processing are set according to the distance value. After the voice call is started, the parameters are used The set preset notch filter performs anti-howling processing on the corresponding voice call data, weakens the interference of howling sound to the user's game process, reduces the pain points of game players, and makes the function of the mobile terminal more perfect.
[0109] Figure 7 A structural block diagram of a voice call data processing device provided by an embodiment of this application. The device can be implemented by software and/or hardware, and is generally integrated in a mobile terminal. The voice call data processing method can be executed to prevent voice call data. Howling processing. Such as Figure 7 As shown, the device includes:
[0110] The trigger detection module 701 is configured to detect that a preset sound playback event is triggered after a voice call group in a preset application of the mobile terminal is successfully established;
[0111] The sound playing module 702 is configured to play a preset sound clip in a preset manner;
[0112] The feedback information receiving module 703 is configured to receive feedback information from other mobile terminals in the voice call group, where the feedback information includes the result of the other mobile terminals trying to collect sound signals corresponding to the preset sound segments;
[0113] The anti-howling processing module 704 is configured to: when it is determined according to the feedback information that there is a target mobile terminal whose distance to the mobile terminal is less than a first preset distance value in the voice call group, The voice call data in the system is processed for howling prevention.
[0114] The voice call data processing device provided in the embodiment of the present application can detect that there are other mobile terminals that are close to the current mobile terminal in the voice call group after the voice call group of the preset application in the mobile terminal is successfully established , The current mobile terminal voice call data is processed in time to prevent howling, reducing the inconvenience caused by howling to users.
[0115] Optionally, the playing a preset sound clip in a preset manner includes:
[0116] Play pre-recorded or pre-acquired sound clips through a speaker at a preset volume; or play ultrasound clips with a preset frequency and preset intensity through an ultrasonic transmitter.
[0117] Optionally, the determining according to the feedback information that there is a target mobile terminal in the voice talk group whose distance to the mobile terminal is less than a first preset distance value includes:
[0118] When the feedback information includes information that the sound signal has been collected, it is determined that there is a target mobile terminal whose distance to the mobile terminal is less than a first preset distance value in the voice call group.
[0119] Optionally, the feedback information further includes the collected attribute information of the sound signal, and the anti-howling processing on the voice call data in the mobile terminal includes:
[0120] Determine the target distance between the target mobile terminal and the mobile terminal according to the attribute information, and determine the target intensity of anti-howling processing according to the target distance; wherein, the smaller the target distance, the greater the corresponding target intensity;
[0121] According to the target intensity, corresponding anti-howling processing is performed on the voice call data in the mobile terminal.
[0122] Optionally, the determining the target intensity of the howling prevention processing according to the target distance includes:
[0123] Determine, according to the target distance, to perform anti-howling processing on uplink voice call data and/or downlink voice call data in the mobile terminal;
[0124] Wherein, when the target distance is greater than a second preset distance value, determining to perform anti-howling processing on the uplink voice call data in the mobile terminal;
[0125] When the target distance is greater than a third preset distance value and less than or equal to the second preset distance value, determining to perform anti-howling processing on the downlink voice call data in the mobile terminal;
[0126] When the target distance is less than or equal to the third preset distance value, it is determined to perform anti-howling processing on the uplink voice call data and the downlink voice call data in the mobile terminal.
[0127] Optionally, the performing corresponding anti-howling processing on the voice call data in the mobile terminal according to the target intensity includes:
[0128] Acquiring downlink voice call data of a preset time length in the mobile terminal;
[0129] Determining the howling point in the downlink voice call data;
[0130] Using a preset notch filter to perform howling suppression processing on the voice call data in the mobile terminal; wherein the center frequency of the preset notch filter is the frequency, processing width and gain value corresponding to the howling point Determined by the target intensity, the higher the target intensity, the wider the corresponding processing width or the smaller the corresponding gain value.
[0131] Optionally, the preset application is an online game application.
[0132] An embodiment of the present application also provides a storage medium containing computer-executable instructions, which are used to execute a voice call data processing method when the computer-executable instructions are executed by a computer processor, and the method includes:
[0133] After the voice call group in the preset application of the mobile terminal is successfully established, it is detected that the preset sound playback event is triggered;
[0134] Play a preset sound clip in a preset manner, and receive feedback information from other mobile terminals in the voice call group, where the feedback information includes the result of the other mobile terminal trying to collect a sound signal corresponding to the preset sound clip ;
[0135] If it is determined according to the feedback information that there is a target mobile terminal whose distance to the mobile terminal is less than the first preset distance value in the voice call group, then anti-howling is performed on the voice call data in the mobile terminal deal with.
[0136] Storage medium-any of various types of storage devices or storage devices. The term "storage medium" is intended to include: installation media, such as CD-ROM, floppy disk or tape device; computer system memory or random access memory, such as DRAM, DDRRAM, SRAM, EDORAM, Rambus RAM, etc.; Volatile memory, such as flash memory, magnetic media (such as hard disk or optical storage); registers or other similar types of memory elements, etc. The storage medium may further include other types of memory or a combination thereof. In addition, the storage medium may be located in the first computer system in which the program is executed, or may be located in a different second computer system connected to the first computer system through a network (such as the Internet). The second computer system may provide program instructions to the first computer for execution. The term "storage media" may include two or more storage media that may reside in different locations (for example, in different computer systems connected through a network). The storage medium may store program instructions executable by one or more processors (for example, embodied as a computer program).
[0137] Of course, the storage medium containing computer-executable instructions provided by the embodiments of the present application is not limited to the voice call data processing operations described above, and can also execute the voice call provided by any embodiment of the present application. Related operations in data processing methods.
[0138] The embodiment of the present application provides a mobile terminal in which the voice call data processing device provided in the embodiment of the present application can be integrated. Figure 8 This is a schematic structural diagram of a mobile terminal provided in an embodiment of this application. The mobile terminal 800 may include: a memory 801, a processor 802, and a computer program stored on the memory 801 and running on the processor 802. When the processor 802 executes the computer program, the voice as described in the embodiment of the present application is implemented. Call data processing method.
[0139] The mobile terminal provided by the embodiment of the present application can, after the voice call group of the preset application in the mobile terminal is successfully established, detect that there are other mobile terminals in the voice call group that are close to the current mobile terminal, and promptly respond to the current mobile terminal. The voice call data of the mobile terminal is processed for howling prevention, reducing the inconvenience of howling sound to users.
[0140] Picture 9 This is a schematic structural diagram of another mobile terminal provided in an embodiment of this application. The mobile terminal may include: a housing (not shown in the figure), a memory 901, a central processing unit (CPU) 902 (also called a processing unit). The processor, hereinafter referred to as CPU), circuit board (not shown in the figure) and power supply circuit (not shown in the figure). The circuit board is arranged inside the space enclosed by the housing; the CPU 902 and the memory 901 are arranged on the circuit board; the power supply circuit is used to supply power to various circuits or devices of the mobile terminal The memory 901 is used to store executable program code; the CPU 902 reads the executable program code stored in the memory 901 to run the computer program corresponding to the executable program code to implement the following steps:
[0141] After the voice call group in the preset application of the mobile terminal is successfully established, it is detected that the preset sound playback event is triggered;
[0142] Play a preset sound clip in a preset manner, and receive feedback information from other mobile terminals in the voice call group, where the feedback information includes the result of the other mobile terminal trying to collect a sound signal corresponding to the preset sound clip ;
[0143] If it is determined according to the feedback information that there is a target mobile terminal whose distance to the mobile terminal is less than the first preset distance value in the voice call group, then anti-howling is performed on the voice call data in the mobile terminal deal with.
[0144] The mobile terminal also includes: a peripheral interface 903, an RF (Radio Frequency, radio frequency) circuit 905, an audio circuit 906, a speaker 911, a power management chip 908, an input/output (I/O) subsystem 909, and other input/control The device 910, the touch screen 912, other input/control devices 910, and the external port 904 communicate through one or more communication buses or signal lines 907.
[0145] It should be understood that the illustrated mobile terminal 900 is only an example of a mobile terminal, and the mobile terminal 900 may have more or fewer components than those shown in the figure, and two or more components may be combined. Or it can have different component configurations. The various components shown in the figure may be implemented in hardware, software, or a combination of hardware and software including one or more signal processing and/or application specific integrated circuits.
[0146] The following is a detailed description of the mobile terminal for voice call data processing provided in this embodiment. The mobile terminal is taken as an example of a mobile phone.
[0147] The memory 901 can be accessed by the CPU 902, the peripheral interface 903, etc., the memory 901 can include a high-speed random access memory, and can also include a non-volatile memory, such as one or more disk storage devices, flash memory devices , Or other volatile solid-state storage devices.
[0148] A peripheral interface 903, which can connect the input and output peripherals of the device to the CPU 902 and the memory 901.
[0149] The I/O subsystem 909 can connect the input and output peripherals on the device, such as the touch screen 912 and other input/control devices 910, to the peripheral interface 903. The I/O subsystem 909 may include a display controller 9091 and one or more input controllers 9092 for controlling other input/control devices 910. Wherein, one or more input controllers 9092 receive electrical signals from other input/control devices 910 or send electrical signals to other input/control devices 910, and other input/control devices 910 may include physical buttons (press buttons, rocker buttons, etc.) ), dial, slide switch, joystick, click wheel. It is worth noting that the input controller 9092 can be connected to any of the following: a keyboard, an infrared port, a USB interface, and a pointing device such as a mouse.
[0150] The touch screen 912 is an input interface and an output interface between the user's mobile terminal and the user, and displays visual output to the user. The visual output may include graphics, text, icons, videos, and the like.
[0151] The display controller 9091 in the I/O subsystem 909 receives electrical signals from the touch screen 912 or sends electrical signals to the touch screen 912. The touch screen 912 detects the contact on the touch screen, and the display controller 9091 converts the detected contact into interaction with the user interface object displayed on the touch screen 912, that is, human-computer interaction is realized. The user interface object displayed on the touch screen 912 can be running The icon of the game, the icon of connecting to the corresponding network, etc. It is worth noting that the device may also include an optical mouse, which is a touch-sensitive surface that does not display visual output, or an extension of the touch-sensitive surface formed by a touch screen.
[0152] The RF circuit 905 is mainly used to establish communication between the mobile phone and the wireless network (that is, the network side), and realize the data reception and transmission between the mobile phone and the wireless network. For example, sending and receiving short messages, emails, etc. Specifically, the RF circuit 905 receives and transmits RF signals, which are also called electromagnetic signals. The RF circuit 905 converts electrical signals into electromagnetic signals or converts electromagnetic signals into electrical signals, and communicates with communication networks and other equipment through the electromagnetic signals. To communicate. The RF circuit 905 may include known circuits for performing these functions, including but not limited to antenna systems, RF transceivers, one or more amplifiers, tuners, one or more oscillators, digital signal processors, CODEC ( COder-DECoder, codec) chipset, subscriber identity module (Subscriber Identity Module, SIM), etc.
[0153] The audio circuit 906 is mainly used to receive audio data from the peripheral interface 903, convert the audio data into an electric signal, and send the electric signal to the speaker 911.
[0154] The speaker 911 is used to restore the voice signal received by the mobile phone from the wireless network through the RF circuit 905 into sound and play the sound to the user.
[0155] The power management chip 908 is used for power supply and power management for the hardware connected to the CPU 902, the I/O subsystem and the peripheral interface.
[0156] The voice call data processing device, storage medium, and mobile terminal provided in the above embodiments can execute the voice call data processing method provided in any embodiment of the present application, and have corresponding functional modules and beneficial effects for executing the method. For technical details that are not described in detail in the foregoing embodiment, refer to the voice call data processing method provided in any embodiment of this application.
[0157] Note that the above are only the preferred embodiments of the present application and the technical principles used. Those skilled in the art will understand that the present application is not limited to the specific embodiments described herein, and various obvious changes, readjustments and substitutions can be made to those skilled in the art without departing from the protection scope of the present application. Therefore, although this application has been described in more detail through the above embodiments, the application is not limited to the above embodiments, and can also include more other equivalent embodiments without departing from the concept of the application. The scope of is determined by the scope of the appended claims.
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more PUM


Description & Claims & Application Information
We can also present the details of the Description, Claims and Application information to help users get a comprehensive understanding of the technical details of the patent, such as background art, summary of invention, brief description of drawings, description of embodiments, and other original content. On the other hand, users can also determine the specific scope of protection of the technology through the list of claims; as well as understand the changes in the life cycle of the technology with the presentation of the patent timeline. Login to view more.
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more Similar technology patents
Mobile unit and method for efficiently establishing a multi-party call
InactiveUS20060199573A1Reduce inconvenienceSpecial service for subscribersWireless networksCommunication unitBase station
Owner:BENQ CORP
Online sipping detection system for integrity detection for fuel element cladding in nuclear power plant
InactiveCN103247359AReduce inconvenienceReduce mistakesNuclear energy generationNuclear monitoringNuclear power plantLoading unloading
Owner:NUCLEAR POWER INSTITUTE OF CHINA
Antimicrobial Self-Cleaning Toilet
ActiveUS20180008106A1Increase easeReduce inconvenienceWater closetsFlushing devicesSpray nozzleTitanium dioxide
Owner:MEDIC INC
Sweeping robot
PendingCN113303713AReduce inconvenienceSimple structureCarpet cleanersFloor cleanersWater tanksMechanical engineering
Owner:FOSHAN VIOMI ELECTRICAL TECH +1
Classification and recommendation of technical efficacy words
- Reduce inconvenience
Data backup services
ActiveUS20070136541A1Reduce inconvenienceGreat problemMemory loss protectionError detection/correctionBackupEvent trigger
Owner:NVIDIA CORP
Circuit for compensating passband flatness, apparatus and method for compensating passband flatness
InactiveUS20090264065A1Flatness defectReduce inconvenienceRepeater circuitsRadio relay systemsFrequency bandVaricap
Owner:ECTELECOM
Method for photographing control and electronic device thereof
InactiveUS20140375862A1Reduce inconvenienceShorten timeTelevision system detailsColor television detailsComputer graphics (images)Electronic equipment
Owner:SAMSUNG ELECTRONICS CO LTD
Detachable electrical power receptacle
InactiveUS20110215759A1Reduce inconvenienceEnergy efficient ICTBatteries circuit arrangementsElectronic equipmentPower cord
Owner:POWERTECH INDAL
Point of sale (POS) network-based non-near-field payment system and method
InactiveCN102419846AReduce inconvenienceLow costPoint-of-sale network systemsPayments involving neutral partyData compilationPayment system
Owner:快钱支付清算信息有限公司