A quantization encoding method, device, apparatus and storage medium
By determining the target quantization bit depth based on the sound cone information of the sound source object and the region of the listening object, and selecting an appropriate quantization encoding method, the problems of wasted encoding bit rate and insufficient precision in the existing technology are solved, and efficient audio signal rendering is achieved.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- BEIJING XIAOMI MOBILE SOFTWARE CO LTD
- Filing Date
- 2023-03-27
- Publication Date
- 2026-06-12
Smart Images

Figure CN116724351B_ABST
Abstract
Description
Technical Field
[0001] This disclosure relates to the field of communication technology, and in particular to a quantization coding method, apparatus, device and storage medium. Background Technology
[0002] Spatial audio, due to its ability to provide users with a realistic sense of space and direction, has led to the widespread application of spatial audio processing technology. One effective method for achieving spatial audio is object-based spatial audio technology. In this technology, in addition to transmitting its encoded audio signal to the decoding device, the encoding device also needs to determine the sound source orientation information (such as the horizontal angle and / or elevation angle of the sound source object relative to the listening object), quantize and encode this orientation information, and then transmit it to the decoding device. Furthermore, the decoding device can decode and inverse-quantize the received information to obtain the sound source orientation information and the audio signal. Based on this orientation information, the audio signal is rendered and then played back to the listening object, thereby recreating the spatial and directional feel of the audio signal. Summary of the Invention
[0003] This disclosure proposes a quantization coding method, apparatus, device, and storage medium.
[0004] In a first aspect, embodiments of this disclosure provide a quantization encoding method, including:
[0005] The target quantization bit depth of the sound source orientation information is determined based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object.
[0006] The sound source orientation information is quantized and encoded based on the target quantization bit depth to obtain a code stream signal.
[0007] Secondly, embodiments of this disclosure provide a quantization encoding method, including:
[0008] The receiver receives a bitstream signal sent by an encoding device; the bitstream signal is obtained by quantizing and encoding the sound source orientation information based on a target quantization bit depth; the target quantization bit depth is determined based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object.
[0009] The bitstream signal is decoded and inverse quantized to obtain the direction information of the sound source.
[0010] Thirdly, embodiments of this disclosure provide a communication device, including:
[0011] The processing module is used to determine the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object.
[0012] The processing module is further configured to quantize and encode the sound source orientation information based on the target quantization bit depth to obtain a code stream signal.
[0013] Fourthly, embodiments of this disclosure provide a communication device, including:
[0014] The transceiver module is used to receive the code stream signal sent by the encoding device; the code stream signal is obtained by quantizing and encoding the sound source orientation information based on a target quantization bit depth; the target quantization bit depth is determined based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object.
[0015] The processing module is used to decode and inverse quantize the bitstream signal to obtain the direction information of the sound source.
[0016] Fifthly, embodiments of this disclosure provide a communication device including a processor that, when the processor invokes a computer program in memory, executes the method described in the first or second aspect.
[0017] In a sixth aspect, embodiments of this disclosure provide a communication device including a processor and a memory, the memory storing a computer program; the processor executes the computer program stored in the memory to cause the communication device to perform the method described in the first or second aspect above.
[0018] In a seventh aspect, embodiments of this disclosure provide a communication device including a processor and an interface circuit. The interface circuit is configured to receive code instructions and transmit them to the processor, which is configured to execute the code instructions to cause the device to perform the methods described in the first or second aspect above.
[0019] Eighthly, embodiments of this disclosure provide a communication system that includes the communication devices described in the third to fourth aspects, or the communication devices described in the fifth aspect, or the communication devices described in the sixth aspect, or the communication devices described in the seventh aspect.
[0020] Ninthly, embodiments of this disclosure provide a computer-readable storage medium for storing instructions for use by the network device described above, which, when executed, cause the terminal to perform the method described in the first or second aspect.
[0021] In a tenth aspect, this disclosure also provides a computer program product including a computer program that, when run on a computer, causes the computer to perform the methods described in the first or second aspect above.
[0022] Eleventhly, this disclosure provides a chip system including at least one processor and an interface for supporting network devices in implementing the functions involved in the methods described in the first or second aspect, such as determining or processing at least one of the data and information involved in the aforementioned methods. In one possible design, the chip system further includes a memory for storing computer programs and data necessary for source and slave nodes. The chip system may be composed of chips or may include chips and other discrete devices.
[0023] In a twelfth aspect, this disclosure provides a computer program that, when run on a computer, causes the computer to perform the methods described in the first or second aspect above. Attached Figure Description
[0024] The above and / or additional aspects and advantages of this disclosure will become apparent and readily understood from the following description of the embodiments taken in conjunction with the accompanying drawings, in which:
[0025] Figure 1 This disclosure provides schematic diagrams of the architecture of some communication systems.
[0026] Figure 2a A flowchart illustrating a quantization encoding method provided in another embodiment of this disclosure;
[0027] Figure 2b This is a schematic diagram illustrating the inner angle of the acoustic cone, the outer angle of the acoustic cone, and the attenuation amplitude of the audio signal, provided in an embodiment of this disclosure.
[0028] Figure 3a A flowchart illustrating a quantization encoding method provided in yet another embodiment of this disclosure;
[0029] Figure 3b This is a schematic diagram of the structure of a sound source object relative to a listening object when they are in different regions according to an embodiment of the present disclosure.
[0030] Figure 4 A flowchart illustrating a quantization encoding method provided in yet another embodiment of this disclosure;
[0031] Figure 5 A flowchart illustrating a quantization encoding method provided in yet another embodiment of this disclosure;
[0032] Figure 6 A flowchart illustrating a quantization encoding method provided in yet another embodiment of this disclosure;
[0033] Figure 7 A flowchart illustrating a quantization encoding method provided in yet another embodiment of this disclosure;
[0034] Figure 8A flowchart illustrating a quantization encoding method provided in yet another embodiment of this disclosure;
[0035] Figure 9 The following is an interactive flowchart of a quantization encoding method provided in another embodiment of this disclosure;
[0036] Figure 10 This is a schematic diagram of the structure of a communication device provided in one embodiment of the present disclosure;
[0037] Figure 11 This is a schematic diagram of the structure of a communication device provided in one embodiment of the present disclosure;
[0038] Figure 12 This is a block diagram of a communication device provided in one embodiment of the present disclosure;
[0039] Figure 13 This is a schematic diagram of the structure of a chip provided in one embodiment of the present disclosure. Detailed Implementation
[0040] Exemplary embodiments will now be described in detail, examples of which are illustrated in the accompanying drawings. When the following description relates to the drawings, unless otherwise indicated, the same numerals in different drawings denote the same or similar elements. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with those of this disclosure. Rather, they are merely examples of apparatuses and methods consistent with some aspects of the embodiments of this disclosure as detailed in the appended claims.
[0041] The terminology used in this disclosure is for the purpose of describing particular embodiments only and is not intended to be limiting of the present disclosure. The singular forms “a” and “the” as used in this disclosure and the appended claims are also intended to include the plural forms unless the context clearly indicates otherwise. It should also be understood that the term “and / or” as used herein refers to and includes any and all possible combinations of one or more of the associated listed items.
[0042] It should be understood that although the terms first, second, third, etc., may be used to describe various information in embodiments of this disclosure, such information should not be limited to these terms. These terms are only used to distinguish information of the same type from one another. For example, first information may also be referred to as second information without departing from the scope of embodiments of this disclosure, and similarly, second information may also be referred to as first information. Depending on the context, the words “if” and “suppose” as used herein may be interpreted as “when”, “when”, or “in response to a determination”.
[0043] Embodiments of this disclosure are described in detail below, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals denote the same or similar elements throughout. The embodiments described below with reference to the accompanying drawings are exemplary and intended to explain this disclosure, and should not be construed as limiting this disclosure.
[0044] In related technologies, a unified quantization encoding method is used to uniformly quantize and encode the direction information of different sound sources at different times. However, these methods often result in wasted encoding bitrate or insufficient quantization encoding precision, affecting the rendering effect of subsequent audio signals. The main reasons are as follows:
[0045] Under different circumstances, the sensitivity of a listening object to audio signals varies depending on the orientation of the sound source. For example, in some cases, the listening object's sensitivity to changes in the orientation of the sound source is low, and it cannot easily perceive these changes. In other cases, the listening object's sensitivity to changes in the orientation of the sound source is high, and it can easily perceive these changes. Specifically, when the listening object's sensitivity to changes in the orientation of the sound source is low, since it cannot easily perceive these changes, the encoding device can perform coarse rendering based on the sound source orientation information, rather than fine rendering. Therefore, when quantizing the sound source orientation information, the encoding device can also use coarse quantization encoding, thereby reducing the bitrate without affecting the subsequent rendering effect. Conversely, when the listening object's sensitivity to changes in the orientation of the sound source is high, since it can easily perceive these changes, the encoding device should perform fine rendering based on the sound source orientation information. Therefore, when quantizing the direction information of the sound source, the encoding device also needs to use a fine quantization encoding method to ensure the subsequent rendering effect.
[0046] Therefore, when using methods in related technologies to quantize and encode sound source orientation information, a unified quantization encoding method is employed to uniformly quantize and encode the sound source orientation information of different sound source objects at different times. However, if this unified quantization encoding method is fine-grained, a situation may arise where "when the listening object's sensitivity to changes in the sound source object's orientation is low, fine-grained encoding is used to quantize the sound source orientation information," leading to unnecessary waste of encoding bitrate. Conversely, if the unified quantization encoding method is coarse-grained, a situation may arise where "when the listening object's sensitivity to changes in the sound source object's orientation is high, coarse-grained encoding is used to quantize the sound source orientation information," resulting in insufficient quantization encoding precision and affecting the rendering effect of subsequent audio signals.
[0047] Therefore, there is an urgent need for a quantization encoding method to selectively quantize and encode sound source orientation information based on the listening object's perceptual sensitivity to changes in the orientation of the sound source under different conditions. This disclosure provides such a quantization encoding method.
[0048] To better understand the quantization encoding method disclosed in this disclosure, the communication system to which this disclosure applies will be described first.
[0049] Please see Figure 1 , Figure 1 This is a schematic diagram of the architecture of a communication system provided in an embodiment of this disclosure. The communication system may include, but is not limited to, at least one encoding device and at least one decoding device, wherein the encoding device can be a network device or a terminal device, and the encoding device can also be a network device or a terminal device. Figure 1 The number and form of devices shown are for illustrative purposes only and do not constitute a limitation on the embodiments of this disclosure. The application may include one or more encoding devices, or one or more terminal devices. Figure 1 The communication system shown is exemplified by including an encoding device, which is a network device, and a decoding device, which is a terminal device.
[0050] It should be noted that the technical solutions of this disclosure can be applied to various communication systems. For example, Long Term Evolution (LTE) systems, 5th Generation (5G) mobile communication systems, 5G New Radio (NR) systems, or other future new mobile communication systems.
[0051] The network devices in this disclosure (such as the first or second network device described above) are entities on the network side used for transmitting or receiving signals. For example, the network device can be an evolved NodeB (eNB), a transmission reception point (TRP), a radio remote head (RRH), a next-generation NodeB (gNB) in an NR system, a base station in other future mobile communication systems, or an access node in a wireless fidelity (WiFi) system. This disclosure does not limit the specific technology or device form used in the base station. The base station provided in this disclosure can be composed of a central unit (CU) and a distributed unit (DU). The CU can also be called a control unit. Using a CU-DU structure, the base station, for example, can have its protocol layer separated. Some protocol layer functions are centrally controlled by the CU, while the remaining or all protocol layer functions are distributed in the DU, which is centrally controlled by the CU.
[0052] The terminal device in this disclosure can be a user-side entity used to receive or transmit signals, such as a mobile phone. The terminal device can also be referred to as a terminal, user equipment (UE), mobile station (MS), mobile terminal (MT), etc. The terminal device can be a car with communication capabilities, a smart car, a mobile phone, a wearable device, a tablet computer, a computer with wireless transceiver capabilities, a virtual reality (VR) terminal, an augmented reality (AR) terminal, a wireless terminal in industrial control, a wireless terminal in self-driving, a wireless terminal in remote medical surgery, a wireless terminal in a smart grid, a wireless terminal in transportation safety, a wireless terminal in a smart city, a wireless terminal in a smart home, etc. This disclosure does not limit the specific technology or device form used in the terminal device.
[0053] It is understood that the communication system described in the embodiments of this disclosure is for the purpose of more clearly illustrating the technical solutions of the embodiments of this disclosure, and does not constitute a limitation on the technical solutions provided in the embodiments of this disclosure. As those skilled in the art will know, with the evolution of system architecture and the emergence of new business scenarios, the technical solutions provided in the embodiments of this disclosure are also applicable to similar technical problems.
[0054] In addition, the following points are provided to facilitate understanding of the embodiments disclosed herein.
[0055] First, in this disclosure, unless contradictory, each step in any implementation or embodiment can be implemented as an independent embodiment, and the steps can be arbitrarily combined. For example, the solution after removing some steps in a certain implementation or embodiment can also be implemented as an independent embodiment, and the order of the steps in a certain implementation or embodiment can be arbitrarily interchanged. In addition, the optional methods or examples in a certain implementation or embodiment can be arbitrarily combined. Furthermore, the implementations or embodiments can be arbitrarily combined. For example, some or all steps of different implementations or embodiments can be arbitrarily combined, and a certain implementation or embodiment can be arbitrarily combined with the optional methods or examples of other implementations or embodiments.
[0056] Second, regarding the notation “A or B”, “A and / or B”, “at least one of A and B”, “A in one case, B in another case”, “in response to one case A, in response to another case B”, the following at least one scheme may be included depending on the situation: A is performed regardless of B, i.e., A in some embodiments; B is performed regardless of A, i.e., B in some embodiments; A and B are selectively performed, i.e., selected from A and B in some embodiments; A and B are both performed, i.e., A and B in some embodiments.
[0057] Third, each element, each row, or each column in the tables involved in this disclosure can be implemented as an independent embodiment, and any combination of any element, any row, or any column can also be implemented as an independent embodiment.
[0058] Fourth, in some implementations or embodiments, the terms "including A", "containing A", "for indicating A", and "carrying A" in this disclosure can be interpreted as directly carrying A or indirectly indicating A.
[0059] Fifth, in some implementations or embodiments, terms such as “in response to…”, “in the case of…”, “when…”, “when…”, “if…”, etc. in this disclosure can be replaced with each other.
[0060] Figure 2aThis is a flowchart illustrating a quantization encoding method provided in an embodiment of the present disclosure. The method is executed by an encoding device, such as... Figure 2a As shown, the quantization encoding method may include the following steps:
[0061] Step 201: Determine the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object.
[0062] Optionally, the quantization encoding method of this disclosure can be applied to quantize and encode "object-based audio signals". For example, the object-based audio signal may include at least one of the following: audio to be played on a terminal device, audio to be played in a home theater, or audio to be played on a virtual reality (VR) device. That is, the method of this disclosure can be applied to scenarios such as "a terminal device playing audio, a home theater playing movies, or using VR devices for gaming and audio-visual entertainment".
[0063] Optionally, in one embodiment of this disclosure, the acoustic cone information of the aforementioned acoustic source object may include at least one of the following:
[0064] The interior angle of the sound cone of the sound source object;
[0065] The outer angle of the sound cone of the sound source object;
[0066] The maximum attenuation of the audio signal emitted by the sound source object.
[0067] Optionally, the aforementioned inner angle of the sound cone means that audio signals emitted by the sound source object and located within the inner angle range of the sound cone of the sound source object are considered to have no attenuation due to orientation, that is, the intensity of audio signals within the inner angle range of the sound cone remains unchanged.
[0068] Optionally, the aforementioned outer angle of the sound cone means that the audio signal emitted by the sound source object and located between the inner angle edge and the outer angle edge of the sound cone of the sound source object is considered to have attenuation due to orientation. The intensity of the audio signal located between the inner angle edge and the outer angle edge of the sound cone decreases linearly with the emission angle of the audio signal, and the emission angle of the audio signal is the angle between the audio signal and the orientation (such as absolute orientation) of the sound source object.
[0069] Optionally, the maximum attenuation of the audio signal emitted by the aforementioned sound source object can be: the attenuation of the audio signal emitted by the sound source object that is located outside the angle range of the sound cone of the sound source object.
[0070] Optional, Figure 2bThis is a schematic diagram illustrating the inner angle of the acoustic cone, the outer angle of the acoustic cone, and the attenuation amplitude of the audio signal, provided in an embodiment of this disclosure. Figure 2a (1) is a structural diagram of the interior angle of the acoustic cone (i.e., the interior angle of the acoustic cone in the figure) and the exterior angle of the acoustic cone (i.e., the exterior angle of the acoustic cone in the figure). Figure 2a (2) is the emission angle of the audio signal (i.e., Figure 2a (2) The angle) and the attenuation amplitude of the audio signal (i.e. Figure 2a The correspondence between the gain values in (2). Figure 2b As shown in (1), the interior angle of the sound cone of the sound source object is 30°, and the exterior angle of the sound cone is 90°. Figure 2b As shown in (2), the maximum attenuation of the audio signal emitted by the sound source is 0.6. Optionally, in Figure 2b In (1), the intensity of the sound emitted by the sound source object remains unchanged within the region encompassed by the edge of the inner angle of the sound cone. Therefore Figure 2b (2) The attenuation amplitude of audio signals with an emission angle between [-15, 15] is 1. Figure 2b In (1), the intensity of the audio signal emitted by the sound source object in the region between the inner and outer angle edges of the sound cone exhibits linear attenuation. Figure 2b (2) The attenuation of audio signals with emission angles between [-45°, -15°] and [15°, 45°] varies linearly with the emission angle of the audio signal. Figure 2b In (1), the intensity of the audio signal emitted by the sound source object in the region outside the outer corner of the sound cone has attenuated to the maximum attenuation amplitude, and it is assumed that the sound intensity no longer changes. Figure 2b (2) The attenuation amplitude in the corresponding region is 0.6. It should be noted that... Figure 2b The data for the inner angle, outer angle, and maximum attenuation amplitude of the acoustic cone shown are for illustrative purposes only and are not fixed.
[0071] Optionally, in one embodiment of this disclosure, the sound cone information corresponding to different sound source objects may be the same or different. For example, the inner angle of the sound cone of different sound source objects may be the same or different, the outer angle of the sound cone of different sound source objects may be the same or different, and the maximum attenuation amplitude of different sound source objects may be the same or different.
[0072] Optionally, the acoustic cone information of the aforementioned sound source object can be predetermined by the encoding device. Alternatively, the encoding device can determine the acoustic cone information of the sound source object using existing methods, which will not be described in detail in this disclosure.
[0073] Optionally, in one embodiment of this disclosure, the region where the sound source object is located relative to the listening object can be determined by the encoding device based on at least one of the orientation of the listening object and the relative position of the sound source object relative to the listening object. Optionally, the method by which the encoding device determines the region where the sound source object is located relative to the listening object may include the following steps:
[0074] The first step is to determine the orientation of the object being listened to.
[0075] Optionally, in one embodiment of this disclosure, the orientation of the listening object can be the absolute orientation of the listening object. Optionally, the absolute orientation of the listening object can be, for example, due south. Alternatively, the orientation of the listening object can be the relative orientation of the listening object with respect to the sound source object. Optionally, the relative orientation of the listening object with respect to the sound source object can be, for example, 30° southeast of the sound source object.
[0076] Optionally, in one embodiment of this disclosure, the orientation of the listening object can be sent from the decoding device to the encoding device. Optionally, when the encoding device needs to determine the absolute orientation of the listening object, the decoding device can directly send the absolute orientation of the listening object to the encoding device. When the encoding device needs to determine the relative orientation of the listening object, the decoding device can directly send the relative orientation of the listening object to the sound source object; alternatively, the decoding device can send the absolute orientation of the listening object to the encoding device, and then the encoding device determines the relative orientation of the listening object based on the absolute orientation of the sound source object itself and the received absolute orientation of the listening object.
[0077] Optionally, in another embodiment of this disclosure, the orientation of the listening object can also be manually input into the encoding device.
[0078] The second step is to determine the relative position of the sound source object with respect to the listening object.
[0079] Optionally, in one embodiment of this disclosure, the encoding device may first determine the absolute position of the sound source object and the absolute position of the listening object, and then determine the relative position of the sound source object with respect to the listening object based on the absolute positions of the sound source object and the listening object. Optionally, the absolute position of the sound source object may be determined autonomously by the encoding device, or it may be manually input into the encoding device. The absolute position of the listening object may be sent from the decoding device to the encoding device, or it may be manually input into the encoding device.
[0080] Optionally, in another embodiment of this disclosure, the encoding device may not need to determine the absolute position of the sound source object and the absolute position of the listening object, but may directly receive the relative position of the sound source object relative to the listening object sent by the decoding device, or the relative position of the sound source object relative to the listening object may be manually input into the encoding device.
[0081] It should be noted that the methods for determining the "orientation of the listening object and the relative position of the sound source object relative to the listening object" mentioned above are merely exemplary descriptions of this disclosure. It should be understood that other methods for determining the orientation of the listening object and the relative position of the sound source object relative to the listening object are also within the scope of protection of this disclosure.
[0082] It is understood that the first and second steps in the embodiments of this disclosure only define different operations, and do not limit the order of the two operations. That is, the operation of determining the relative position of the sound source object with respect to the listening object can be performed first, and then the operation of determining the orientation of the listening object can be performed. The embodiments of this disclosure do not limit this.
[0083] The third step is to determine the area where the sound source object is located relative to the sound source object based on at least one of the orientation of the listening object and the relative position of the sound source object relative to the listening object.
[0084] Optionally, the area where the sound source object is located relative to the listening object can be: after the sound source object is projected onto the horizontal plane where the listening object is located, the projection position of the sound source object on the horizontal plane where the listening object is located relative to the area where the listening object is located; for example, the area where the sound source object is located relative to the listening object can be: the sound source object (or: the projection position of the sound source object on the horizontal plane where the listening object is located) is located in front of, behind, to the left of, or to the right of the listening object.
[0085] Optionally, in one embodiment of this disclosure, if the encoding device obtains the relative orientation of the listening object relative to the sound source object in the first step, then the second step can be skipped, and the area where the sound source object is located relative to the listening object can be determined directly based on the relative orientation of the listening object relative to the sound source object.
[0086] Optionally, in another embodiment of this disclosure, when the encoding device obtains the absolute orientation of the listening object in the first step, a second step is required to determine the relative position of the sound source object relative to the listening object. Then, the area where the sound source object is located relative to the listening object is determined based on the absolute orientation of the listening object and the relative position of the sound source object relative to the listening object.
[0087] Optionally, in one embodiment of this disclosure, the aforementioned sound source orientation information may include at least one of the following:
[0088] The horizontal angle between the sound source object and the listening object;
[0089] The elevation angle between the sound source object and the listening object.
[0090] Optionally, the horizontal angle between the sound source object and the listening object mentioned above can be understood as, for example, the relative deviation angle between the sound source object and the listening object on the horizontal plane, and the height angle between the sound source object and the listening object mentioned above can be understood as, for example, the relative deviation angle between the sound source object and the listening object on the vertical plane.
[0091] Optionally, the target quantization bit depth can be the number of bits for the quantized and encoded sound source orientation information. Optionally, when the target quantization bit depth is large, it indicates that the quantization encoding method for the sound source orientation information is fine quantization encoding; when the target quantization bit depth is small, it indicates that the quantization encoding method for the sound source orientation information is coarse quantization encoding.
[0092] The following example illustrates why, in step 201, the target quantization bits of the sound source orientation information are determined based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object.
[0093] From the above Figure 1 As described in the preceding embodiments, the method disclosed herein primarily aims to enable the encoding device to selectively choose an appropriate quantization encoding method (such as coarse quantization encoding (i.e., fewer target quantization bits) or fine quantization encoding (i.e., more target quantization bits) to quantize and encode the sound source orientation information based on the listening object's sensitivity to changes in the orientation of the sound source object under different conditions). For example, when the listening object's sensitivity to changes in the orientation of the sound source object is low, coarse quantization encoding can be used to reduce the encoded bitstream without affecting the subsequent rendering effect. When the listening object's sensitivity to changes in the orientation of the sound source object is high, fine quantization encoding is used to ensure the subsequent rendering effect.
[0094] Based on this, since the sound cone information of the sound source object and the area where the sound source object is located relative to the listening object can reflect the listening object's sensitivity to changes in the direction of the sound source object, the target quantization bit depth of the sound source direction information is determined based on at least one of the sound cone information of the sound source object and the area where the sound source object is located relative to the listening object in this embodiment. This enables the encoding device to selectively quantize the sound source direction information based on the listening object's sensitivity to changes in the direction of the sound source object under different conditions, thereby improving the flexibility of quantization encoding.
[0095] Optionally, the sound cone information of the sound source object can reflect the listening object's sensitivity to changes in the direction of the sound source object. Specifically, this can include: as described above, the sound cone information can reflect the rate of change of the sound intensity of the sound source object. This rate of change of sound intensity is the rate of change of the intensity of the audio signal emitted by the sound source object with respect to the emission angle of the audio signal. This rate of change of sound intensity can be understood as the sound attenuation amplitude. When the sound cone information of the sound source object indicates a large rate of change of sound intensity, it means that when the direction of the sound source object changes, both the intensity change and the attenuation amplitude of the emitted audio signal will be large. The listening object is more sensitive (i.e., more easily perceived) to audio signals with large intensity changes or large attenuation amplitudes. Therefore, when the sound cone information of the sound source object indicates a large rate of change of sound intensity, the listening object has a high sensitivity to changes in the direction of the sound source object. Furthermore, when the rate of change of sound intensity indicated by the sound cone information of the sound source object is small, it means that when the orientation of the sound source object changes, the intensity change and attenuation of the emitted audio signal will be small. Since the listener is not sensitive to audio signals with small intensity changes or attenuation, it means that when the rate of change of sound intensity indicated by the sound cone information of the sound source object is small, the listener's sensitivity to changes in the orientation of the sound source object is low.
[0096] Therefore, based on the aforementioned sound cone information, the rate of change of sound intensity of the sound source object can be determined, and thus the sensitivity of the listening object to changes in the orientation of the sound source object can be known.
[0097] Optionally, the location of the sound source object relative to the listening object can reflect the listening object's sensitivity to changes in the sound source object's orientation. This can include: when the sound source object is located in different areas of the listening object, the listening object's sensitivity to the audio signal emitted by the sound source object will differ, thus making the listening object's sensitivity to changes in the sound source object's orientation different in different areas. For example, when the sound source object is located to the left or right of the listening object, the listening object's sensitivity to changes in the sound source object's orientation is lower; when the sound source object is located in front of or behind the listening object, the listening object's sensitivity to changes in the sound source object's orientation is higher.
[0098] Optionally, in one embodiment of this disclosure, the method of "determining the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object" may include: determining the target quantization bit depth of the sound source orientation information based on the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object. Alternatively, in another embodiment of this disclosure, when the encoding device cannot know the region where the sound source object is located relative to the listening object, the encoding device may determine the target quantization bit depth of the sound source orientation information based on the sound cone information of the sound source object. The specific method for determining the target quantization bit depth of the sound source orientation information in step 201 will be described in subsequent embodiments.
[0099] Based on the above, it can be understood that in this embodiment, the target quantization bit depth of the sound source orientation information is determined based on at least one of the sound source cone information and the region where the sound source object is located relative to the listening object. The sound source cone information and the region where the sound source object is located relative to the listening object reflect the listening object's sensitivity to changes in the orientation of the sound source object. The target quantization bit depth is the number of bits in the quantized and encoded sound source orientation information. This target quantization bit depth reflects the specific quantization encoding method. For example, a higher target quantization bit depth indicates a finer quantization encoding method, while a lower target quantization bit depth indicates a coarser quantization encoding method. Therefore, in this disclosure, when determining the target quantization bit depth of the sound source orientation information based on at least one of the sound source object's cone information and the region where the sound source object is located relative to the listening object, a suitable quantization encoding method is specifically determined based on the listening object's current sensitivity to changes in the sound source object's orientation. This is because when at least one of the sound source object's cone information or the region where the sound source object is located relative to the listening object indicates that the listening object's sensitivity to changes in the sound source object's orientation is low, the determined target quantization bit depth is smaller, allowing for coarse quantization encoding of the sound source orientation information. This ensures that the encoding bitrate is reduced without affecting subsequent rendering effects, avoiding resource waste. Conversely, when at least one of the sound source object's cone information or the region where the sound source object is located relative to the listening object indicates that the listening object's sensitivity to changes in the sound source object's orientation is high, the determined target quantization bit depth is larger, allowing for fine quantization encoding of the sound source orientation information. This ensures the effectiveness of subsequent rendering.
[0100] Step 202: Quantize and encode the sound source orientation information based on the target quantization bit depth to obtain the bit stream signal.
[0101] Optionally, in one embodiment of this disclosure, the bitstream signal may include at least one of the following:
[0102] Encoded audio signal;
[0103] Quantized and encoded sound source orientation information;
[0104] Target quantization bits;
[0105] Sound cone information of the sound source object;
[0106] The index corresponding to the acoustic cone information;
[0107] The absolute position of the sound source object;
[0108] The absolute position of the object being listened to;
[0109] The relative position of the sound source object to the listening object.
[0110] The detailed method for "quantizing and encoding the sound source orientation information based on the target quantization bit depth to obtain the code stream signal" will be introduced in subsequent embodiments.
[0111] It should be noted that, in one embodiment of this disclosure, after the sound source orientation information of the sound source object is initially quantized and encoded, if the sound source orientation information of the sound source object does not change, it is not necessary to repeat the quantization and encoding of the sound source orientation information. Instead, an indication message indicating that the sound source orientation information has not changed can be sent to the decoding device. When the sound source orientation information of the sound source object changes compared to before, the changed sound source orientation information is then quantized and encoded. This avoids the encoding device from performing unnecessary repeated quantization and encoding of the same sound source orientation information, saving resources and improving quantization and encoding efficiency.
[0112] In summary, in the quantization encoding method provided in this disclosure, the encoding device determines the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object; and quantizes and encodes the sound source orientation information based on the target quantization bit depth to obtain a bitstream signal. The sound cone information of the sound source object and the region where the sound source object is located relative to the listening object can reflect the listening object's sensitivity to changes in the orientation of the sound source object, and the target quantization bit depth can reflect a specific quantization encoding method (such as fine quantization encoding or coarse quantization encoding). Therefore, it can be seen that the encoding device in this disclosure selects an appropriate quantization encoding method based on the listening object's current sensitivity to changes in the orientation of the sound source object. When the listening object's sensitivity to changes in the orientation of the sound source object is low, a coarse quantization encoding method is selected to quantize the sound source orientation information, ensuring that the encoding bitrate is reduced without affecting subsequent rendering effects, thus avoiding resource waste. When the listening object's sensitivity to changes in the orientation of the sound source object is high, a fine quantization encoding method is selected to quantize the sound source orientation information, ensuring subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0113] Figure 3a This is a flowchart illustrating a quantization encoding method provided in an embodiment of the present disclosure. The method is executed by an encoding device, such as... Figure 3a As shown, the quantization encoding method may include the following steps:
[0114] Step 301: Determine the first quantization bit depth based on the sound cone information of the sound source object.
[0115] Optional, Figure 3a The embodiments illustrate a method for determining the target quantization bits of sound source orientation information based on the sound cone information of a sound source object and the region where the sound source object is located relative to the listening object.
[0116] Optionally, in one embodiment of this disclosure, the first quantization bit depth can be calculated using the following formula; the formula may include:
[0117]
[0118]
[0119]
[0120] Among them, in angle Indicates the interior angle of the acoustic cone, out angle Represents the outer angle of the sound cone, A maxThe maximum attenuation is indicated by gap, which represents the rate of change of sound intensity between the interior and exterior angles of the sound cone. This rate of change is the rate of change of the intensity of the audio signal emitted by the sound source with respect to the emission angle of the audio signal. Percep represents the minimum rate of change of sound intensity perceptible to the human ear. int `quantization` indicates the quantization interval, and `range` indicates the quantization range, which is optional. The quantization range for horizontal angles can be [0°, 360°], and the quantization range for elevation angles can be [0°, 180°]. bit This indicates the first quantization bit depth, and ceil represents the round-up function.
[0121] Step 302: Determine the target quantization bit depth based on the region where the sound source object is located relative to the listening object and the first quantization bit depth.
[0122] Optionally, in one embodiment of this disclosure, the aforementioned "determining the target quantization bit depth based on the region where the sound source object is located relative to the listening object and the first quantization bit depth" may include: when the sound source object is located in a first region of the listening object, determining the first quantization bit depth as the target quantization bit depth; when the sound source object is located in a second region of the listening object, determining the difference between the first quantization bit depth and a preset value (the preset value may be, for example, 1) as the target quantization bit depth; optionally, the listening object's sensitivity to the audio signal in the first region is higher than its sensitivity to the audio signal in the second region. In other words, the first region may be a region where the listening object has a higher sensitivity to the audio signal, for example, the first region may be the front and rear regions of the listening object, and the second region may be a region where the listening object has a lower sensitivity to the audio signal, for example, the second region may be the left and right regions of the listening object.
[0123] Example, Figure 3b This is a structural diagram illustrating a sound source object in different regions relative to a listening object, as provided in an embodiment of this disclosure. Figure 3b As shown, if in step 301 above, based on sound source object 1 (i.e. Figure 3b The first quantization bit depth #1 is determined based on the sound cone information of sound source 1), if the above step 301 is based on sound source object 2 (i.e. Figure 3b The sound cone information of sound source 2) determines the first quantization bit depth #2, where, since sound source object 1 is located in the less sensitive right region of the listening object (such as the human ear) (i.e. Figure 3b Region 2), then the value after the first quantization bit depth minus 1 can be determined as the target quantization bit depth; and, since the sound source object 2 is located in a relatively sensitive front region of the listening object (such as the human ear) (i.e. Figure 3b If we consider region 3), then we can directly determine the first quantization bit depth as the target quantization bit depth.
[0124] As can be seen from the foregoing, in the embodiments of this disclosure, after the encoding device determines the first quantization bit depth based on the sound cone information of the sound source object, it can further update the first quantization bit depth based on whether the area where the sound source object is located relative to the listening object is a "sensitive area for the listening object to perceive the audio signal". For example, when the area where the sound source object is located relative to the listening object is not a sensitive area for the listening object to perceive the audio signal, the difference between the first quantization bit depth and a preset value can be determined as the target quantization bit depth to reduce the target quantization bit depth, thereby ensuring that the subsequent rendering effect is not affected while maximizing the saving of the encoding bitstream; when the area where the sound source object is located relative to the listening object is a sensitive area for the listening object to perceive the audio signal, the first quantization bit depth can be directly determined as the target quantization bit depth to ensure the subsequent rendering effect.
[0125] In summary, the quantization encoding method provided in this disclosure allows the encoding device to selectively choose an appropriate quantization encoding method based on the listening object's current sensitivity to changes in the orientation of the sound source object. When the listening object's sensitivity to changes in the sound source object's orientation is low, a coarse quantization encoding method is chosen to quantize the sound source orientation information, ensuring a lower bitrate and avoiding resource waste without affecting subsequent rendering effects. Conversely, when the listening object's sensitivity to changes in the sound source object's orientation is high, a fine quantization encoding method is chosen to quantize the sound source orientation information, ensuring optimal subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0126] Figure 4 This is a flowchart illustrating a quantization encoding method provided in an embodiment of the present disclosure. The method is executed by an encoding device, such as... Figure 4 As shown, the quantization encoding method may include the following steps:
[0127] Step 401: Determine the target quantization bit depth based on the acoustic cone information of the acoustic source object.
[0128] Optional, Figure 4 The embodiments are used to describe a method for determining the target quantization bits of the sound source orientation information based on the sound cone information of the sound source object when the encoding device cannot obtain the area where the sound source object is located relative to the listening object.
[0129] Optionally, in one embodiment of this disclosure, the first quantization bit depth can be calculated using the following formula; the formula may include:
[0130]
[0131]
[0132]
[0133] Among them, in angle Indicates the interior angle of the acoustic cone, out angle Represents the outer angle of the sound cone, A max The maximum attenuation is indicated by gap, which represents the rate of change of sound intensity between the interior and exterior angles of the sound cone. This rate of change is the rate of change of the intensity of the audio signal emitted by the sound source with respect to the emission angle of the audio signal. Percep represents the minimum rate of change of sound intensity perceptible to the human ear. int `quantization` indicates the quantization interval, and `range` indicates the quantization range, which is optional. The quantization range for horizontal angles can be [0°, 360°], and the quantization range for elevation angles can be [0°, 180°]. bit This indicates the target number of quantization bits, and ceil represents the round-up function.
[0134] In summary, the quantization encoding method provided in this disclosure allows the encoding device to selectively choose an appropriate quantization encoding method based on the listening object's current sensitivity to changes in the orientation of the sound source object. When the listening object's sensitivity to changes in the sound source object's orientation is low, a coarse quantization encoding method is chosen to quantize the sound source orientation information, ensuring a lower bitrate and avoiding resource waste without affecting subsequent rendering effects. Conversely, when the listening object's sensitivity to changes in the sound source object's orientation is high, a fine quantization encoding method is chosen to quantize the sound source orientation information, ensuring optimal subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0135] Figure 5 This is a flowchart illustrating a quantization encoding method provided in an embodiment of the present disclosure. The method is executed by an encoding device, such as... Figure 5 As shown, the quantization encoding method may include the following steps:
[0136] Step 501: Determine the corresponding codebook based on the target quantization bit depth.
[0137] Optional, Figure 5 The examples illustrate a method for "how an encoding device specifically quantizes and encodes sound source orientation information based on a target quantization bit depth".
[0138] Optionally, in one embodiment of this disclosure, different codebooks correspond to different quantization bit widths. Optionally, each codebook may include at least one codeword, and each codeword corresponds to one bit value.
[0139] Optionally, in one embodiment of this disclosure, the encoding device may determine a codebook corresponding to the target quantization bit depth so that the source orientation information can be quantized and encoded based on the codebook.
[0140] Step 502: Quantize and encode the sound source orientation information based on the codebook to obtain the quantized sound source orientation information.
[0141] Optionally, the above-mentioned quantization encoding of the sound source orientation information based on the codebook to obtain the quantized and encoded sound source orientation information may include at least one of the following methods:
[0142] The first method involves determining the quantization interval corresponding to the target quantization bit depth (i.e., quan in the aforementioned embodiments). ibt Based on the quantization interval, the quantization range of the sound source orientation information is quantized (e.g., uniform quantization) to obtain at least one quantization value. The quantization value that is closest to the value of the sound source orientation information among the at least one quantization values is determined as the target quantization value. Then, the bit value corresponding to the codeword that is closest to the target quantization value in the determined codebook is determined as the quantized and encoded sound source orientation information.
[0143] Optionally, the "closest distance" mentioned above can be, for example, the smallest absolute value of the difference.
[0144] For example, assuming the sound source orientation information is the height angle of the sound source object at 35°, and assuming the quantization interval determined in step 301 above is 20°, the codebook determined in step 501 above is: (0°, 45°, 90°, 135°, 180°), where the bit values corresponding to each codeword in the codebook are: 000, 001, 010, 011, 100 respectively. Then, the process for quantizing and encoding the height angle of the sound source object at 35° using the first method described above can be as follows: First, based on the quantization interval of 20°, the quantization range of the height angle [0°, 180°] (as mentioned above) is quantized to obtain nine quantization values, namely: 0°, 20°, 40°, 60°, 80°, 100°, 120°, 140°, 160°, 180°. Then, the quantization value closest to the height angle of the sound source object at 35° among these nine quantization values is: 40°, that is, the target quantization value is 40°. Furthermore, the codeword closest to the target quantization value 40° in the codebook (0°, 45°, 90°, 135°, 180°) is 45°. Therefore, the bit value 001 corresponding to the codeword 45° can be determined as the quantized sound source orientation information.
[0145] The second method involves determining the bit value corresponding to the codeword in the determined codebook that is closest to the value of the sound source orientation information as the quantized and encoded sound source orientation information.
[0146] For example, suppose the sound source orientation information is the elevation angle of the sound source object, which is 35°, and suppose the codebook determined in step 501 above is (0°, 45°, 90°, 135°, 180°), where the bit values corresponding to each codeword in the codebook are 000, 001, 010, 011, and 100, respectively. Then, the process of quantizing and encoding the elevation angle of the sound source object of 35° using the second method described above can be as follows: determine the codeword closest to the elevation angle of the sound source object of 35° from the codebook (0°, 45°, 90°, 135°, 180°), which is 45°. Therefore, the bit value 001 corresponding to the codeword 45° can be determined as the quantized and encoded sound source orientation information.
[0147] In summary, the quantization encoding method provided in this disclosure allows the encoding device to selectively choose an appropriate quantization encoding method based on the listening object's current sensitivity to changes in the orientation of the sound source object. When the listening object's sensitivity to changes in the sound source object's orientation is low, a coarse quantization encoding method is chosen to quantize the sound source orientation information, ensuring a lower bitrate and avoiding resource waste without affecting subsequent rendering effects. Conversely, when the listening object's sensitivity to changes in the sound source object's orientation is high, a fine quantization encoding method is chosen to quantize the sound source orientation information, ensuring optimal subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0148] Figure 6 This is a flowchart illustrating a quantization encoding method provided in an embodiment of the present disclosure. The method is executed by an encoding device, such as... Figure 6 As shown, the quantization encoding method may include the following steps:
[0149] Step 601: Send the bitstream signal to the decoding device.
[0150] Optionally, in one embodiment of this disclosure, the bitstream signal may include at least one of the following:
[0151] Encoded audio signal;
[0152] Quantized and encoded sound source orientation information;
[0153] Target quantization bits;
[0154] Sound cone information of the sound source object;
[0155] The index corresponding to the acoustic cone information;
[0156] The absolute position of the sound source object;
[0157] The absolute position of the object being listened to;
[0158] The relative position of the sound source object to the listening object.
[0159] Optionally, in one embodiment of this disclosure, after the encoding device obtains the bitstream signal, it can send the bitstream signal to the decoding device so that the decoding device can decode and inverse quantize the bitstream signal to obtain the sound source orientation information and the decoded audio signal, and further render and play the decoded audio signal to the listening object based on the sound source orientation information.
[0160] In summary, the quantization encoding method provided in this disclosure allows the encoding device to selectively choose an appropriate quantization encoding method based on the listening object's current sensitivity to changes in the orientation of the sound source object. When the listening object's sensitivity to changes in the sound source object's orientation is low, a coarse quantization encoding method is chosen to quantize the sound source orientation information, ensuring a lower bitrate and avoiding resource waste without affecting subsequent rendering effects. Conversely, when the listening object's sensitivity to changes in the sound source object's orientation is high, a fine quantization encoding method is chosen to quantize the sound source orientation information, ensuring optimal subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0161] Figure 7 This is a flowchart illustrating a quantization encoding method provided in an embodiment of the present disclosure. The method is executed by a decoding device, such as... Figure 7 As shown, the quantization encoding method may include the following steps:
[0162] Step 701: Receive the code stream signal sent by the encoding device.
[0163] Optionally, the bitstream signal can be obtained by quantizing and encoding the sound source orientation information based on a target quantization bit depth; the target quantization bit depth is determined based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object.
[0164] The bitstream signal may include at least one of the following:
[0165] Encoded audio signal;
[0166] Quantized and encoded sound source orientation information;
[0167] Target quantization bits;
[0168] Sound cone information of the sound source object;
[0169] The index corresponding to the acoustic cone information;
[0170] The absolute position of the sound source object;
[0171] The absolute position of the object being listened to;
[0172] The relative position of the sound source object to the listening object.
[0173] For a detailed description of step 701, please refer to the foregoing embodiments.
[0174] Step 702: Decode and inverse quantize the bitstream signal to obtain the direction information of the sound source.
[0175] Optionally, in one embodiment of this disclosure, the method described above for "decoding and inverse quantizing the bitstream signal to obtain sound source orientation information" may include at least one of the following:
[0176] Method 1: When the bitstream signal acquired by the decoding device includes the target quantization bit depth, the decoding device can directly decode and dequantize the quantized and encoded sound source orientation information in the bitstream signal based on the target quantization bit depth to restore the sound source orientation information.
[0177] Optionally, the above method for decoding and inverse quantizing the quantized source orientation information in the bitstream signal based on the target quantization bit depth may include: determining the corresponding codebook based on the target quantization bit depth, and decoding and inverse quantizing the quantized source orientation information based on the codebook.
[0178] It should be noted that the process of decoding and inverse quantizing the quantized and encoded sound source orientation information in the bitstream signal by the aforementioned decoding device based on the target quantization bit depth is essentially the inverse process of quantizing and encoding the sound source orientation information by the aforementioned encoding device based on the target quantization bit depth. For detailed information on this part, please refer to the description in the foregoing embodiments, which will not be repeated here.
[0179] Method 2: When the bitstream signal obtained by the decoding device does not include the target quantization bit depth, the decoding device can first determine the target quantization bit depth, and then decode and dequantize the quantized and encoded sound source orientation information in the bitstream signal based on the target quantization bit depth to restore the sound source orientation information.
[0180] Optionally, the method by which the decoding device determines the target quantization bit depth may include: first determining the sound cone information of the sound source object based on information included in the bitstream signal (such as the sound cone information of the sound source object and / or the indicator index corresponding to the sound cone information included in the bitstream signal); then determining the region where the sound source object is located relative to the listening object; and determining the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object.
[0181] Optionally, the method by which the decoding device determines the region where the sound source object is located relative to the listening object may include at least one of the following:
[0182] The decoding device determines the area where the sound source object is located relative to the listening object based on information included in the bit stream signal (such as at least one of the absolute position of the sound source object, the absolute position of the listening object, and the relative position of the sound source object to the listening object) and the orientation of the listening object.
[0183] The decoding device autonomously determines the region where the sound source object is located relative to the listening object. For example, the decoding device autonomously determines at least one of the absolute position of the sound source object, the absolute position of the listening object, and the relative position of the sound source object relative to the listening object, and determines the region where the sound source object is located relative to the listening object by combining the orientation of the listening object.
[0184] Optionally, the detailed process of "determining the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object" and "decoding and inverse quantizing the quantized sound source orientation information in the bitstream signal based on the target quantization bit depth to restore the sound source orientation information" can be referred to the description in the foregoing embodiments.
[0185] In summary, in the quantization encoding method provided in this disclosure, the decoding device receives a bitstream signal sent by the encoding device. This bitstream signal is obtained by quantizing and encoding the sound source orientation information based on a target quantization bit depth. The target quantization bit depth is determined based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object. Then, the decoding device decodes and inverse-quantizes the bitstream signal to obtain the sound source orientation information. Therefore, in this disclosure, the encoding device selects a suitable quantization encoding method based on the listening object's current sensitivity to changes in the sound source object's orientation. When the listening object's sensitivity to changes in the sound source object's orientation is low, a coarse quantization encoding method is selected to quantize and encode the sound source orientation information, ensuring that the encoded bitstream is reduced without affecting the subsequent rendering effect, thus avoiding resource waste. When the listening object's sensitivity to changes in the sound source object's orientation is high, a fine quantization encoding method is selected to quantize and encode the sound source orientation information, ensuring the subsequent rendering effect. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0186] Figure 8 This is a flowchart illustrating a quantization encoding method provided in an embodiment of the present disclosure. The method is executed by a decoding device, such as... Figure 8 As shown, the quantization encoding method may include the following steps:
[0187] Step 801: When the bitstream signal includes the target quantization bit depth, decode and inverse quantize the quantized and encoded sound source orientation information in the bitstream signal based on the target quantization bit depth.
[0188] A detailed description of step 801 can be found in the foregoing embodiments.
[0189] In summary, the quantization encoding method provided in this disclosure allows the encoding device to selectively choose an appropriate quantization encoding method based on the listening object's current sensitivity to changes in the orientation of the sound source object. When the listening object's sensitivity to changes in the sound source object's orientation is low, a coarse quantization encoding method is chosen to quantize the sound source orientation information, ensuring a lower bitrate and avoiding resource waste without affecting subsequent rendering effects. Conversely, when the listening object's sensitivity to changes in the sound source object's orientation is high, a fine quantization encoding method is chosen to quantize the sound source orientation information, ensuring optimal subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0190] Figure 9 This is a flowchart illustrating a quantization encoding method provided in an embodiment of the present disclosure. The method is executed by a decoding device, such as... Figure 9 As shown, the quantization encoding method may include the following steps:
[0191] Step 901: When the target quantization bit depth is not included in the bit stream signal, determine the sound cone information of the sound source object based on the information included in the bit stream signal, determine the region where the sound source object is located relative to the listening object, and determine the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object.
[0192] Step 902: Decode and inverse quantize the quantized source orientation information in the bitstream signal based on the target quantization bit depth.
[0193] The detailed description of steps 901-902 can be found in the foregoing embodiments.
[0194] In summary, the quantization encoding method provided in this disclosure allows the encoding device to selectively choose an appropriate quantization encoding method based on the listening object's current sensitivity to changes in the orientation of the sound source object. When the listening object's sensitivity to changes in the sound source object's orientation is low, a coarse quantization encoding method is chosen to quantize the sound source orientation information, ensuring a lower bitrate and avoiding resource waste without affecting subsequent rendering effects. Conversely, when the listening object's sensitivity to changes in the sound source object's orientation is high, a fine quantization encoding method is chosen to quantize the sound source orientation information, ensuring optimal subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0195] Figure 10This is a schematic diagram of the structure of a communication device provided in an embodiment of this disclosure, as shown below. Figure 10 As shown, the device may include:
[0196] The processing module is used to determine the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object.
[0197] The processing module is further configured to quantize and encode the sound source orientation information based on the target quantization bit depth to obtain a code stream signal.
[0198] In summary, in the communication device provided in this embodiment, the encoding device determines the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object; and quantizes and encodes the sound source orientation information based on the target quantization bit depth to obtain a bitstream signal. The sound cone information of the sound source object and the region where the sound source object is located relative to the listening object can reflect the listening object's sensitivity to changes in the orientation of the sound source object, and the target quantization bit depth can reflect a specific quantization encoding method (such as fine quantization encoding or coarse quantization encoding). Therefore, it can be seen that the encoding device in this disclosure selects an appropriate quantization encoding method based on the listening object's current sensitivity to changes in the orientation of the sound source object. When the listening object's sensitivity to changes in the orientation of the sound source object is low, a coarse quantization encoding method is selected to quantize the sound source orientation information, ensuring that the encoding bitrate is reduced without affecting subsequent rendering effects, thus avoiding resource waste. When the listening object's sensitivity to changes in the orientation of the sound source object is high, a fine quantization encoding method is selected to quantize the sound source orientation information, ensuring subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0199] Optionally, in one embodiment of this disclosure, the acoustic cone information of the acoustic source object includes at least one of the following:
[0200] The interior angle of the sound cone of the sound source object;
[0201] The outer angle of the sound cone of the sound source object;
[0202] The maximum attenuation of the audio signal emitted by the sound source object.
[0203] Optionally, in one embodiment of this disclosure, the sound source orientation information includes at least one of the following:
[0204] The horizontal angle between the sound source object and the listening object;
[0205] The elevation angle between the sound source object and the listening object.
[0206] Optionally, in one embodiment of this disclosure, the apparatus is further configured to:
[0207] Determine the orientation of the object being listened to;
[0208] Determine the relative position of the sound source object with respect to the listening object;
[0209] The region where the sound source object is located relative to the listening object is determined based on at least one of the orientation of the listening object and the relative position.
[0210] Optionally, in one embodiment of this disclosure, the processing module is further configured to:
[0211] The first quantization bit depth is determined based on the acoustic cone information of the sound source object;
[0212] The target quantization bit depth is determined based on the region where the sound source object is located relative to the listening object and the first quantization bit depth.
[0213] Optionally, in one embodiment of this disclosure, the processing module is further configured to:
[0214] The target quantization bit depth is determined based on the acoustic cone information of the sound source object.
[0215] Optionally, in one embodiment of this disclosure, the processing module is further configured to:
[0216] The first quantization bit depth or the target quantization bit depth is calculated using the following formula; the formula includes:
[0217]
[0218]
[0219]
[0220] Among them, in angle Indicates the interior angle of the acoustic cone, out angle Represents the outer angle of the sound cone, A max The maximum attenuation is represented by gap, which represents the rate of change of sound intensity between the interior and exterior angles of the sound cone. This rate of change of sound intensity is the rate of change of the intensity of the audio signal emitted by the sound source object with respect to the emission angle of the audio signal. percep represents the minimum rate of change of sound intensity perceptible to the human ear. int Indicates the quantization interval, range indicates the quantization range, quan bit This indicates the determined first quantization bit depth or target quantization bit depth, and ceil represents the round-up function.
[0221] Optionally, in one embodiment of this disclosure, the processing module is further configured to:
[0222] When the sound source object is located in the first region of the listening object, the first quantization bit depth is determined as the target quantization bit depth.
[0223] When the sound source object is located in the second region of the listening object, the difference between the first quantization bit depth and the preset value is determined as the target quantization bit depth.
[0224] The listening object has a higher sensitivity to the audio signal in the first region than it has to the audio signal in the second region.
[0225] Optionally, in one embodiment of this disclosure, the processing module is further configured to:
[0226] Determine the corresponding codebook based on the target quantization bit depth;
[0227] The sound source orientation information is quantized and encoded based on the codebook to obtain the quantized and encoded sound source orientation information.
[0228] Optionally, in one embodiment of this disclosure, the bitstream signal includes at least one of the following:
[0229] Quantized and encoded sound source orientation information;
[0230] The target quantization bit depth;
[0231] The acoustic cone information of the sound source object;
[0232] The indicator index corresponding to the acoustic cone information;
[0233] The absolute position of the sound source object;
[0234] The absolute position of the object being listened to;
[0235] The relative position of the sound source object with respect to the listening object.
[0236] Optionally, in one embodiment of this disclosure, the apparatus is further configured to:
[0237] The bitstream signal is sent to the decoding device.
[0238] Figure 11 This is a schematic diagram of the structure of a communication device provided in an embodiment of this disclosure, as shown below. Figure 11 As shown, the device may include:
[0239] The transceiver module is used to receive the code stream signal sent by the encoding device; the code stream signal is obtained by quantizing and encoding the sound source orientation information based on a target quantization bit depth; the target quantization bit depth is determined based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object.
[0240] The processing module is used to decode and inverse quantize the bitstream signal to obtain the direction information of the sound source.
[0241] In summary, in the communication device provided in this embodiment, the decoding device receives a bitstream signal sent by the encoding device. This bitstream signal is obtained by quantizing and encoding the sound source orientation information based on a target quantization bit depth. The target quantization bit depth is determined based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object. Then, the decoding device decodes and inverse-quantizes the bitstream signal to obtain the sound source orientation information. Therefore, in this disclosure, the encoding device selects a suitable quantization encoding method based on the listening object's current sensitivity to changes in the sound source object's orientation. When the listening object's sensitivity to changes in the sound source object's orientation is low, a coarse quantization encoding method is selected to quantize and encode the sound source orientation information, ensuring that the encoded bitstream is reduced without affecting subsequent rendering effects, thus avoiding resource waste. When the listening object's sensitivity to changes in the sound source object's orientation is high, a fine quantization encoding method is selected to quantize and encode the sound source orientation information, ensuring subsequent rendering effects. Furthermore, the quantization encoding method of this disclosure offers high flexibility.
[0242] Optionally, in one embodiment of this disclosure, the bitstream signal includes at least one of the following:
[0243] Quantized and encoded sound source orientation information;
[0244] The target quantization bit depth;
[0245] The acoustic cone information of the sound source object;
[0246] The indicator index corresponding to the acoustic cone information;
[0247] The absolute position of the sound source object;
[0248] The absolute position of the object being listened to;
[0249] The relative position of the sound source object with respect to the listening object.
[0250] Optionally, in one embodiment of this disclosure, when the bitstream signal includes the target quantization bit depth, the processing module is further configured to:
[0251] Based on the target quantization bit depth, the quantized source orientation information in the bitstream signal is decoded and inverse quantized.
[0252] Optionally, in one embodiment of this disclosure, when the bitstream signal does not include the target quantization bit depth, the processing module is further configured to:
[0253] The acoustic cone information of the sound source object is determined based on the information included in the code stream signal;
[0254] Determine the region where the sound source object is located relative to the listening object;
[0255] The target quantization bit depth of the sound source orientation information is determined based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object.
[0256] Based on the target quantization bit depth, the quantized source orientation information in the bitstream signal is decoded and inverse quantized.
[0257] Optionally, in one embodiment of this disclosure, the processing module is further configured to:
[0258] The region where the sound source object is located relative to the listening object is determined based on the information included in the bit stream signal and the orientation of the listening object;
[0259] The decoding device autonomously determines the region where the sound source object is located relative to the listening object.
[0260] Optionally, in one embodiment of this disclosure, the processing module is further configured to:
[0261] Determine the corresponding codebook based on the target quantization bit depth;
[0262] Based on the codebook, the quantized and encoded sound source orientation information is decoded and dequantized.
[0263] Please see Figure 12 , Figure 12 This is a schematic diagram of the structure of a communication device 1200 provided in an embodiment of this disclosure. The communication device 1200 can be a base station, a terminal, or a chip, chip system, or processor that supports the base station in implementing the above methods; it can also be a chip, chip system, or processor that supports the terminal in implementing the above methods. This device can be used to implement the methods described in the above method embodiments, and specific details can be found in the descriptions of the above method embodiments.
[0264] The communication device 1200 may include one or more processors 1201. The processor 1201 may be a general-purpose processor or a dedicated processor, such as a baseband processor or a central processing unit (CPU). The baseband processor can be used to process communication protocols and communication data, while the CPU can be used to control the communication device (e.g., base station, baseband chip, terminal, terminal chip, DU or CU, etc.), execute computer programs, and process data from the computer programs.
[0265] Optionally, the communication device 1200 may further include one or more memories 1202, which may store a computer program 1204. The processor 1201 executes the computer program 1204 to cause the communication device 1200 to perform the methods described in the above method embodiments. Optionally, the memory 1202 may also store data. The communication device 1200 and the memory 1202 may be provided separately or integrated together.
[0266] Optionally, the communication device 1200 may also include a transceiver 1205 and an antenna 1206. The transceiver 1205 may be referred to as a transceiver unit, transceiver, or transceiver circuit, etc., and is used to implement the transmission and reception functions. The transceiver 1205 may include a receiver and a transmitter. The receiver may be referred to as a receiver or receiving circuit, etc., and is used to implement the receiving function; the transmitter may be referred to as a transmitter or transmitting circuit, etc., and is used to implement the transmitting function.
[0267] Optionally, the communication device 1200 may further include one or more interface circuits 1207. The interface circuits 1207 are used to receive code instructions and transmit them to the processor 1201. The processor 1201 executes the code instructions to cause the communication device 1200 to perform the methods described in the above method embodiments.
[0268] In one implementation, the processor 1201 may include a transceiver for implementing receiving and transmitting functions. For example, the transceiver may be a transceiver circuit, an interface, or an interface circuit. The transceiver circuit, interface, or interface circuit for implementing receiving and transmitting functions may be separate or integrated. The aforementioned transceiver circuit, interface, or interface circuit can be used for reading and writing code / data, or it can be used for transmitting or relaying signals.
[0269] In one implementation, processor 1201 may store computer program 1203, which runs on processor 1201 and causes communication device 1200 to perform the methods described in the above method embodiments. Computer program 1203 may be embedded in processor 1201, in which case processor 1201 may be implemented in hardware.
[0270] In one implementation, the communication device 1200 may include circuitry capable of performing the functions of transmitting, receiving, or communicating as described in the foregoing method embodiments. The processor and transceiver described in this disclosure can be implemented on integrated circuits (ICs), analog ICs, radio frequency integrated circuits (RFICs), mixed-signal ICs, application-specific integrated circuits (ASICs), printed circuit boards (PCBs), electronic devices, etc. The processor and transceiver can also be manufactured using various IC process technologies, such as complementary metal oxide semiconductors (CMOS), n-metal-oxide-semiconductor (NMOS), positive-channel metal oxide semiconductors (PMOS), bipolar junction transistors (BJTs), bipolar CMOS (BiCMOS), silicon-germanium (SiGe), gallium arsenide (GaAs), etc.
[0271] The communication device described in the above embodiments may be a base station or a terminal, but the scope of the communication device described in this disclosure is not limited thereto, and the structure of the communication device may vary. Figure 12 The communication device may be a standalone device or part of a larger device. For example, the communication device may be:
[0272] (1) Independent integrated circuit IC, or chip, or chip system or subsystem;
[0273] (2) A collection of one or more ICs, optionally including storage components for storing data and computer programs;
[0274] (3) ASIC, such as modem;
[0275] (4) Modules that can be embedded in other devices;
[0276] (5) Receivers, terminals, smart terminals, cellular phones, wireless devices, handheld devices, mobile units, vehicle-mounted devices, base stations, cloud devices, artificial intelligence devices, etc.
[0277] (6) Others, etc.
[0278] For cases where the communication device can be a chip or a chip system, please refer to [link / reference]. Figure 13 The diagram shows the structure of the chip. Figure 13 The chip shown includes a processor 1301 and an interface 1302. There can be one or more processors 1301, and multiple interfaces 1302.
[0279] Optionally, the chip also includes a memory 1303, which is used to store necessary computer programs and data.
[0280] Those skilled in the art will also understand that the various illustrative logical blocks and steps listed in the embodiments of this disclosure can be implemented by electronic hardware, computer software, or a combination of both. Whether such functionality is implemented in hardware or software depends on the specific application and the overall system design requirements. Those skilled in the art can implement the described functionality using various methods for each specific application, but such implementation should not be construed as exceeding the scope of protection of the embodiments of this disclosure.
[0281] This disclosure also provides a readable storage medium having instructions stored thereon that, when executed by a computer, implement the functions of any of the above method embodiments.
[0282] This disclosure also provides a computer program product that, when executed by a computer, implements the functions of any of the above method embodiments.
[0283] In the above embodiments, implementation can be achieved, in whole or in part, through software, hardware, firmware, or any combination thereof. When implemented in software, it can be implemented, in whole or in part, as a computer program product. The computer program product includes one or more computer programs. When the computer program is loaded and executed on a computer, all or part of the processes or functions described in the embodiments of this disclosure are generated. The computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device. The computer program can be stored in a computer-readable storage medium or transferred from one computer-readable storage medium to another. For example, the computer program can be transferred from one website, computer, server, or data center to another via wired (e.g., coaxial cable, fiber optic, digital subscriber line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.) means. The computer-readable storage medium can be any available medium accessible to a computer or a data storage device such as a server or data center that integrates one or more available media. The available media may be magnetic media (e.g., floppy disks, hard disks, magnetic tapes), optical media (e.g., high-density digital video discs (DVDs)), or semiconductor media (e.g., solid-state disks (SSDs)).
[0284] Those skilled in the art will understand that the various numerical designations such as "first," "second," etc., used in this disclosure are merely for the convenience of description and are not intended to limit the scope of the embodiments of this disclosure, nor do they indicate the order of events.
[0285] At least one of the features described in this disclosure can also be described as one or more, and multiple features can be two, three, four or more, and this disclosure does not impose any limitations. In the embodiments of this disclosure, for a technical feature, the technical features in that technical feature are distinguished by "first", "second", "third", "A", "B", "C" and "D", etc., and there is no sequential order or size order among the technical features described by "first", "second", "third", "A", "B", "C" and "D".
[0286] The correspondences shown in the tables of this disclosure can be configured or predefined. The values of the information in each table are merely examples and can be configured to other values; this disclosure is not limiting. When configuring the correspondences between information and parameters, it is not necessarily required to configure all the correspondences shown in each table. For example, the correspondences shown in some rows of the tables in this disclosure may not be configured. Furthermore, appropriate modifications and adjustments can be made based on the above tables, such as splitting, merging, etc. The names of the parameters shown in the headers of the above tables can also use other names that the communication device can understand, and the values or representations of the parameters can also be other values or representations that the communication device can understand. In the implementation of the above tables, other data structures can also be used, such as arrays, queues, containers, stacks, linear lists, pointers, linked lists, trees, graphs, structures, classes, heaps, hash tables, or hash tables, etc.
[0287] The predefined terms in this disclosure can be understood as defined, predefined, stored, pre-stored, pre-negotiated, pre-configured, solidified, or pre-burned.
[0288] Those skilled in the art will recognize that the units and algorithm steps of the various examples described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are implemented in hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application, but such implementation should not be considered beyond the scope of this disclosure.
[0289] Those skilled in the art will understand that, for the sake of convenience and brevity, the specific working processes of the systems, devices, and units described above can be referred to the corresponding processes in the foregoing method embodiments, and will not be repeated here.
[0290] The above description is merely a specific embodiment of this disclosure, but the scope of protection of this disclosure is not limited thereto. Any variations or substitutions that can be easily conceived by those skilled in the art within the scope of the technology disclosed in this disclosure should be included within the scope of protection of this disclosure. Therefore, the scope of protection of this disclosure should be determined by the scope of the claims.
Claims
1. A quantization encoding method, characterized in that, The method is executed by an encoding device, and the method includes: The target quantization bit depth of the sound source orientation information is determined based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object. The sound source orientation information is quantized and encoded based on the target quantization bit depth to obtain a code stream signal; The step of determining the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object includes: The first quantization bit depth is determined based on the acoustic cone information of the sound source object; The target quantization bit depth is determined based on the region where the sound source object is located relative to the listening object and the first quantization bit depth.
2. The method as described in claim 1, characterized in that, The acoustic cone information of the sound source object includes at least one of the following: The interior angle of the sound cone of the sound source object; The outer angle of the sound cone of the sound source object; The maximum attenuation of the audio signal emitted by the sound source object.
3. The method as described in claim 1, characterized in that, The sound source orientation information includes at least one of the following: The horizontal angle between the sound source object and the listening object; The elevation angle between the sound source object and the listening object.
4. The method as described in claim 1, characterized in that, The method further includes: Determine the orientation of the object being listened to; Determine the relative position of the sound source object with respect to the listening object; The region where the sound source object is located relative to the listening object is determined based on at least one of the orientation of the listening object and the relative position.
5. The method as described in claim 1, characterized in that, The determination of the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object includes: The target quantization bit depth is determined based on the acoustic cone information of the sound source object.
6. The method as described in claim 1 or 5, characterized in that, Determining the first quantization bit depth or the target quantization bit depth based on the acoustic cone information of the acoustic source object includes: The first quantization bit depth or the target quantization bit depth is calculated using the following formula; the formula includes: ; in, Represents the internal angle of the sound cone, Represents the outer angle of the sound cone, Indicates the maximum attenuation. This represents the rate of change of sound intensity between the interior and exterior angles of the sound cone. The rate of change of sound intensity is the rate of change of the intensity of the audio signal emitted by the sound source object with respect to the emission angle of the audio signal. It represents the minimum rate of change in sound intensity that the human ear can perceive. Indicates the quantization interval. Indicates the quantization range, This indicates the determined first quantization bit depth or target quantization bit depth. This represents the function for rounding up.
7. The method as described in claim 1, characterized in that, Determining the target quantization bit depth based on the region of the sound source object relative to the listening object and the first quantization bit depth includes: When the sound source object is located in the first region of the listening object, the first quantization bit depth is determined as the target quantization bit depth. When the sound source object is located in the second region of the listening object, the difference between the first quantization bit depth and the preset value is determined as the target quantization bit depth. The listening object has a higher sensitivity to the audio signal in the first region than it has to the audio signal in the second region.
8. The method according to any one of claims 1-5, characterized in that, The quantization encoding of the sound source orientation information based on the target quantization bit depth includes: Determine the corresponding codebook based on the target quantization bit depth; The sound source orientation information is quantized and encoded based on the codebook to obtain the quantized and encoded sound source orientation information.
9. The method according to any one of claims 1-5, characterized in that, The bitstream signal includes at least one of the following: Quantized and encoded sound source orientation information; The target quantization bit depth; The acoustic cone information of the sound source object; The indicator index corresponding to the acoustic cone information; The absolute position of the sound source object; The absolute position of the object being listened to; The relative position of the sound source object with respect to the listening object.
10. The method according to any one of claims 1-5, characterized in that, The method further includes: The bitstream signal is sent to the decoding device.
11. A quantization encoding method, characterized in that, The method is executed by a decoding device, and the method includes: The receiver receives a bitstream signal sent by an encoding device; the bitstream signal is obtained by quantizing and encoding the sound source orientation information based on a target quantization bit depth; the target quantization bit depth is determined based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object. The bitstream signal is decoded and inverse quantized to obtain the direction information of the sound source; Wherein, the sound cone information of the sound source object is used to determine the first quantization bit depth, and the region of the sound source object relative to the listening object and the first quantization bit depth are used to determine the target quantization bit depth.
12. The method as described in claim 11, characterized in that, The bitstream signal includes at least one of the following: Quantized and encoded sound source orientation information; The target quantization bit depth; The acoustic cone information of the sound source object; The indicator index corresponding to the acoustic cone information; The absolute position of the sound source object; The absolute position of the object being listened to; The relative position of the sound source object with respect to the listening object.
13. The method as described in claim 12, characterized in that, When the bitstream signal includes the target quantization bits, the decoding and inverse quantization of the bitstream signal includes: Based on the target quantization bit depth, the quantized source orientation information in the bitstream signal is decoded and inverse quantized.
14. The method as described in claim 12, characterized in that, When the bitstream signal does not include the target quantization bits, the decoding and inverse quantization of the bitstream signal includes: The acoustic cone information of the sound source object is determined based on the information included in the code stream signal; Determine the region where the sound source object is located relative to the listening object; The target quantization bit depth of the sound source orientation information is determined based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object. Based on the target quantization bit depth, the quantized source orientation information in the bitstream signal is decoded and inverse quantized.
15. The method as described in claim 14, characterized in that, Determining the region where the sound source object is located relative to the listening object includes at least one of the following: The region where the sound source object is located relative to the listening object is determined based on the information included in the bit stream signal and the orientation of the listening object; The decoding device autonomously determines the region where the sound source object is located relative to the listening object.
16. The method as described in claim 13 or 14, characterized in that, The step of decoding and inverse quantizing the quantized and encoded sound source orientation information in the bitstream signal based on the target quantization bit depth includes: Determine the corresponding codebook based on the target quantization bit depth; Based on the codebook, the quantized and encoded sound source orientation information is decoded and dequantized.
17. A communication device, comprising: The processing module is used to determine the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region in which the sound source object is located relative to the listening object. The processing module is further configured to quantize and encode the sound source orientation information based on the target quantization bit depth to obtain a code stream signal; The step of determining the target quantization bit depth of the sound source orientation information based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object includes: The first quantization bit depth is determined based on the acoustic cone information of the sound source object; The target quantization bit depth is determined based on the region where the sound source object is located relative to the listening object and the first quantization bit depth.
18. A communication device, comprising: The transceiver module is used to receive the code stream signal sent by the encoding device; The bitstream signal is obtained by quantizing and encoding the sound source orientation information based on the target quantization bit depth; The target quantization bit depth is determined based on at least one of the sound cone information of the sound source object and the region where the sound source object is located relative to the listening object; The processing module is used to decode and inverse quantize the bitstream signal to obtain the direction information of the sound source; Wherein, the sound cone information of the sound source object is used to determine the first quantization bit depth, and the region of the sound source object relative to the listening object and the first quantization bit depth are used to determine the target quantization bit depth.
19. A communication device, characterized in that, The device includes a processor and a memory, wherein the memory stores a computer program, and the processor executes the computer program stored in the memory to cause the device to perform the method as claimed in any one of claims 1 to 10, or the processor executes the computer program stored in the memory to cause the device to perform the method as claimed in any one of claims 11 to 16.
20. A communication device, characterized in that, include: Processor and interface circuitry, among which The interface circuit is used to receive code instructions and transmit them to the processor; The processor is configured to execute the code instructions to perform the method as claimed in any one of claims 1 to 10, or to execute the code instructions to perform the method as claimed in any one of claims 11 to 16.
21. A computer-readable storage medium for storing instructions that, when executed, cause the method of any one of claims 1 to 10 to be implemented, or, when executed, cause the method of any one of claims 11 to 16 to be implemented.