Audio transmission system and method

By using a low-latency codec to process speech signals in an audio transmission system and transmitting speech and non-speech signals through different communication channels, the problem of excessive latency and bandwidth requirements in existing technologies is solved, and low-latency, high-quality audio signal transmission is achieved.

CN122248320APending Publication Date: 2026-06-19HARMAN BECKER AUTOMOTIVE SYST GMBH

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
HARMAN BECKER AUTOMOTIVE SYST GMBH
Filing Date
2025-12-03
Publication Date
2026-06-19

AI Technical Summary

Technical Problem

Existing audio transmission systems struggle to provide good quality while maintaining low latency when transmitting both voice and non-voice signals, and require moderate computing power, especially when using different types of codecs, leading to excessive bandwidth demands or long latency.

Method used

Low-latency codecs are used to process speech signals, and speech and non-speech signals are transmitted through different communication channels. Codecs suitable for non-speech signals are used to process other audio signals, ensuring low latency and high-quality reproduction of speech signals while reducing bandwidth requirements.

Benefits of technology

It achieves high-quality reproduction of voice signals under low latency conditions, reduces bandwidth requirements, and improves the overall efficiency and user experience of audio transmission systems.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure CN122248320A_ABST
    Figure CN122248320A_ABST
Patent Text Reader

Abstract

An audio transmission system includes: one or more loudspeakers, each of the one or more loudspeakers configured to radiate sound corresponding to a sound signal to at least one of two listening positions associated with a user of the respective system, each of the two listening positions being located at the entrance of the ear canal of the respective user; and a microphone configured to capture speech originating from a user associated with the respective system, wherein the system is configured to: capture a first speech signal originating from the user associated with the system via the microphone; compress the first speech signal via a codec of a first type to obtain a first compressed speech signal; receive a first audio signal from a signal source, wherein the signal source is different from the microphone; transmit the first compressed audio signal via a first communication channel; and transmit the first audio signal or a compressed version of the first audio signal via the first communication channel or via a second communication channel.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This disclosure relates to an audio transmission system and method, and more particularly to a system for transmitting audio signals to another audio transmission system in a listening environment. Background Technology

[0002] An audio transmission system in a listening environment can transmit audio signals to another audio transmission system in the same listening environment. The audio signal may include both speech components captured by the microphones of the audio transmission system and other components (e.g., music). To avoid increasing latency, the signal sent from one audio transmission system to another system in the same listening environment can be transmitted uncompressed; however, uncompressed signals require significant bandwidth. Different types of codecs are known that allow signal compression to reduce the required bandwidth. Some codecs are generally better suited for compressing speech signals, while others are generally better suited for compressing other signals, such as, for example, music signals. For instance, codecs designed for compressing music signals provide high-quality reproduction of both music and speech, but require high computational power, operate on large buffers, and typically require look-ahead buffers. Such codecs generally introduce high latency. Codecs designed for compressing speech signals (which may also be called speech codecs) generally provide good-quality reproduction of speech signals but are generally less suitable for reproducing music signals. Speech codecs typically require moderate computational power and operate on small buffers or even individual samples. Small buffers typically make voice codecs well-suited for ultra-low latency applications.

[0003] There is a need for an audio transmission system and method that provides good quality for both speech and non-speech signals at a receiving device, the audio transmission system and method allowing speech signals to be transmitted with low latency and requiring moderate computing power. Summary of the Invention

[0004] An audio transmission system includes: one or more loudspeakers, each of the one or more loudspeakers configured to radiate sound corresponding to a sound signal to at least one of two listening positions associated with a user of the respective system, each of the two listening positions being located at the entrance of the ear canal of the respective user; and a microphone configured to capture speech originating from a user associated with the respective system, wherein the system is configured to: capture a first speech signal originating from a user associated with the system via the microphone; compress the first speech signal via a codec of a first type to obtain a first compressed speech signal; receive a first audio signal from a signal source, wherein the signal source is different from the microphone; transmit the first compressed speech signal via a first communication channel; and transmit the first audio signal or a compressed version of the first audio signal via the first communication channel or a second communication channel.

[0005] A method for operating an audio transmission system is disclosed. The system includes: one or more loudspeakers, each configured to radiate sound corresponding to a sound signal to at least one of two listening positions associated with a user of the corresponding system, each of the two listening positions being located at the entrance of the ear canal of the corresponding user; and a microphone configured to capture speech originating from a user associated with the corresponding system. The method includes: capturing a first speech signal originating from a user associated with the system via the microphone; compressing the first speech signal using a codec of a first type to obtain a first compressed speech signal; receiving a first audio signal from a signal source, wherein the signal source is different from the microphone; transmitting the first compressed speech signal via a first communication channel; and transmitting the first audio signal or a compressed version of the first audio signal via the first communication channel or via a second communication channel.

[0006] Other systems, features, and advantages of this disclosure will be or will become apparent to those skilled in the art upon review of the following detailed description and accompanying drawings. It is intended that all such additional systems, methods, features, and advantages are included in this specification, are within the scope of the invention, and are protected by the appended claims. Attached Figure Description

[0007] The arrangement and method can be better understood by referring to the following description and accompanying drawings. The components in the drawings are not necessarily drawn to scale, but rather to emphasize the principles of the invention. Furthermore, in the drawings, the same reference numerals indicate corresponding parts in all different views.

[0008] Figure 1 The arrangement of multiple audio transmission systems arranged in the listening environment is illustrated schematically.

[0009] Figure 2 The illustration schematically shows speech and non-speech signals transmitted between audio transmission systems.

[0010] Figure 3 An audio transmission system according to an embodiment of the present disclosure is illustrated schematically.

[0011] Figure 4 An audio transmission system according to a further embodiment of this disclosure is illustrated schematically.

[0012] Figure 5 An audio transmission system according to even further embodiments of this disclosure is schematically illustrated.

[0013] Figure 6 A method according to an embodiment of this disclosure is illustrated schematically. Detailed Implementation

[0014] The systems and related methods according to the various embodiments described herein allow the transmission of speech signals captured by the microphone of the system, as well as audio signals from other sources, thereby ensuring good quality reproduction of all signals at the receiving device. Speech signals are transmitted with low latency. The total bandwidth required for signal transmission in the systems according to the various embodiments is kept as low as possible. Figure 1 An arrangement 30 comprising multiple audio transmission systems 100 is schematically shown. Figure 1 In the example shown, the arrangement includes three audio transmission systems 100a, 100b, and 100c. However, in general, the arrangement may include only two, three, or even more than three individual systems 100. Each of the multiple systems 100 includes one or more speakers 200, each of which is configured to radiate sound corresponding to a sound signal to at least one of two listening positions associated with a user of the respective system 100, each of which is located at the entrance of the user's ear canal. According to one example, system 100 includes a single speaker 200 that radiates sound to both ears of the respective user. However, it is also possible that system 100 includes separate speakers 200 for different listening positions. That is, one or more speakers 200 may be configured to radiate sound to the first ear of the respective user, and one or more speakers 200 may be configured to radiate sound to the second ear of the respective user.

[0015] Each of the plurality of systems 100 further includes one or more microphones 204 configured to capture speech originating from a user associated with the respective system 100. For example, each of the plurality of systems 100 may include or be a headset including a microphone cantilever, and a first microphone 204 of system 100 may be disposed in the microphone cantilever. However, the first microphone 204 may be attached to the respective system 100 in any other suitable manner. According to another example, the first microphone 204 is disposed in or attached to a cable or wire attached to the respective system 100. However, it is also possible that the first microphone 204 is not attached to the respective system 100 directly or indirectly at all. According to other examples, the first microphone 204 may be attached to clothing or devices worn by the respective user (e.g., devices worn on the head, neck, or shoulder).

[0016] The first microphone 204 may be a single microphone or may include an array of microphones, such as a beamforming microphone array. It is even possible that the same first microphone 204, including a beamforming microphone array, is configured to pick up speech from different users arranged 30. That is, a first microphone 204 may be positioned appropriately and configured to pick up speech originating from a first user associated with the first system 100a, and speech originating from at least one other user associated with another system 100b, 100c of the plurality of systems 100.

[0017] According to one embodiment, each of the multiple systems 100 includes, or is not limited to, headphones. Headphones can typically be implemented in many different ways. Some headphones include earmuffs arranged to cover or surround the ears of a respective user. In this case, one or more speakers 200 may be arranged in the respective earmuffs. Other headphones include earplugs that can be at least partially inserted into the user's ear canal. In this case, one or more speakers 200 may be arranged in the earplugs. Other headphones are known to include different support structures that can be used to position one or more speakers 200 relative to the user's ears (e.g., headband or neckband). Alternatively, for example, it is possible that one or more speakers 200 are arranged in the headrest of a vehicle. The different systems 100 among the multiple systems 100 may be the same or may be different from each other. That is, it is generally possible that one system 100 includes earmuffs while another system 100 includes earplugs. Any other combination of different systems 100 is generally possible.

[0018] The multiple systems 100 of arrangement 30 are all arranged in the same listening environment. For example, the listening environment could be a passenger compartment, room, or lobby of a vehicle. According to one example, arrangement 30 as disclosed herein can be used for multiplayer game events, where multiple players of a video game are located in the same room and wish to communicate with each other. According to further embodiments, arrangement 30 including two or more systems 100 can be used in a vehicle. However, these are merely examples.

[0019] In a listening environment 30 comprising two or more systems 100, speech originating from a first user associated with a first system 100a among the two or more systems 100a, 100b, 100c is captured by a microphone 204a of the first system 100a, transmitted to a second and any further system 100b, 100c among the two or more systems 100a, 100b, 100c, and output by one or more speakers 200b, 200c of the respective second or further system 100b, 100c. In addition to the speech captured by the microphone 204a of the first system 100a, further audio signals may also be transmitted from the first system 100a to the second and any further system 100b, 100c among the two or more systems 100a, 100b, 100c, and may be output by one or more speakers 200b, 200c of the respective second or further system 100b, 100c. Such additional audio signals may include any audio signal originating from a signal source other than the microphone 204a of the first system 100a. For example, system 100a may include or be coupled to signal source 102 (see example...). Figure 2 The signal source is different from the microphone 204a. For example, the signal source 102 may be or may include a smartphone, media player, or tablet computer. The user of the first system 100a may want to share the audio signal (audio content, such as music, telephone conversation, etc.) with another user in the listening environment for any reason.

[0020] Signals transmitted from one system 100 to another system 100 can typically be transmitted uncompressed to avoid increasing latency in the audio signal. However, uncompressed signals require significant bandwidth. Different types of codecs are known that allow signal compression to reduce the required bandwidth. Some codecs are generally better suited for compressing speech signals, while others are generally better suited for compressing other signals, such as, for example, music signals. For instance, codecs designed for compressing music signals provide high-quality reproduction of both music and speech, but require high computational power, operate on large buffers, and typically require look-ahead buffers. Such codecs generally introduce high latency. Codecs designed for compressing speech signals (which may also be called voice codecs) generally provide good-quality reproduction of speech signals but are generally less suitable for reproducing music signals. Voice codecs typically require moderate computational power and operate on small buffers or even individual samples. Small buffers generally make voice codecs well-suited for ultra-low latency applications.

[0021] Now for reference Figure 3The diagram schematically illustrates an audio transmission system 100 according to an embodiment of the present disclosure. System 100 includes one or more speakers 200, each configured to radiate sound corresponding to a sound signal to at least one of two listening positions associated with a user of the respective system 100, each located at the entrance of the user's ear canal. System 100 further includes a microphone 204 configured to capture speech originating from a user associated with the respective system 100. System 100 is configured to: capture a first speech signal originating from a user associated with the system 100 via microphone 204; compress the first speech signal using a codec of a first type to obtain a first compressed speech signal; receive a first audio signal from a signal source 102, wherein the signal source 102 is different from microphone 204; transmit the first compressed speech signal via a first communication channel 50; and transmit the first audio signal or a compressed version of the first audio signal via the first communication channel 50 or via a second communication channel 52.

[0022] In the audio transmission system 100 according to various embodiments disclosed herein, speech originating from a user associated with system 100 and captured by microphone 204 is not mixed with additional audio signals before being transmitted to one or more other systems 100 in the arrangement 30 comprising a plurality of systems 100. Speech originating from a user associated with system 100 and captured by microphone 204 is transmitted via a first communication channel 50. Other audio signals besides the speech signal captured by microphone 204 may also be transmitted via the first communication channel 50. However, such audio signals or compressed versions of such audio signals are transmitted independently of the speech signal. It is also possible that audio signals other than the speech signal captured by microphone 204 are transmitted via a second communication channel 52, which is different from the first communication channel 50 used for transmitting the speech signal captured by microphone 204. Thus, speech originating from a user associated with system 100 and captured by microphone 204 can be compressed using a first type of codec. On the other hand, any other audio signal can be transmitted uncompressed or compressed using a second type of codec. For example, if a first compressed speech signal is transmitted via a first communication channel 50, and a first audio signal or a compressed version of the first audio signal is transmitted via a second communication channel 52 different from the first communication channel 50, then the first communication channel 50 may be a bandwidth-constrained ultra-low latency channel, while the second communication channel 52 is not bandwidth-constrained. If the same communication channel is used to transmit all signals, then a communication channel equally suitable for transmitting speech signals as well as transmitting any other kind of audio signal can be used. In this case, low latency of the speech signal can still be achieved by using different codecs to compress the corresponding signals. According to some examples, it is even possible that the speech signal and other audio signals captured by the microphone 204 are compressed using the same codec, but transmitted separately via different communication channels.

[0023] For speech originating from a user associated with system 100 and captured via microphone 204, latency is typically critical. That is, the speech signal requires low latency. Therefore, a first-type codec can be a low-latency codec. In other words, a first-type codec can be a codec specifically designed for signals and applications where low latency is critical. Known low-latency or ultra-low-latency speech codecs include, but are not limited to, G.722, G.726, and G.728.

[0024] Low-latency codecs provide high-quality reproduction of speech signals but are generally less suitable for reproducing other audio signals, such as, for example, music signals. Therefore, the claimed system does not mix the additional audio signal with speech originating from a user associated with system 100 and captured by microphone 204 before transmission to one or more other systems 100 in the arrangement 30 comprising multiple systems 100. The remaining audio signal can be transmitted uncompressed or compressed using a second type of codec, different from the first type. For example, the second type of codec could be a low-bandwidth codec. Latency is generally not critical for the additional audio signal. Rather, reducing the required bandwidth may be desirable. Different low-bandwidth or low-bitrate codecs are commonly known. Examples of low-bandwidth codecs include AAC, SBC (Low Complexity Subband Codec), LC3 (Low Complexity Communication Codec), MP3, and aptX. Many other low-bandwidth codecs are also known and can be used alternatively.

[0025] Thus, a codec specifically designed for speech signals is used for the speech signal, while another codec specifically designed for audio signals (e.g., music signals) is used for the remaining audio signals. Different signals are transmitted via the same communication channel 50 or via separate communication channels 50 and 52. The first communication channel 50 and the second communication channel 52 can be wireless communication channels. For example, the first communication channel 50 can be an ultra-wideband (UWB) communication channel, and / or the second communication channel 52 can be a Bluetooth communication channel. Other types of communication channels are also generally possible.

[0026] System 100 may include a first compression / decompression unit 404 configured to compress a first speech signal using a first type of codec to obtain a first compressed speech signal. The first compression / decompression unit 404 may be further configured to decompress any compressed speech signal received via a first communication channel 50. System 100 may further include a mixing element 402 configured to mix different audio signals to be reproduced by one or more speakers 200. That is, mixing element 402 may receive a decompressed speech signal received via the first communication channel 50 and decompressed by compression / decompression unit 404. Mixing element 402 may further receive a second audio signal received via a second communication channel 52. Mixing element 402 may then mix the second audio signal with the second speech signal to obtain a first mixed signal. Alternatively, in addition to, or in place of, the second audio signal, mixing element 402 may receive an audio signal from signal source 102 and mix the audio signal, the second speech signal, and (optionally) the second audio signal to obtain the first mixed signal. According to some examples, the audio signal received from signal source 102 may be mixed with the second speech signal only, for example, when the second audio signal has not been received by the second communication channel 52. The first mixed signal can be reproduced by one or more speakers 200.

[0027] As described above, the audio signal can be transmitted uncompressed or compressed via the second communication channel 52. That is, instead of receiving the second audio signal via the second communication channel 52, a compressed version of the second audio signal can be received via the second communication channel 52. The system 100 can be further configured to: decompress the processed version of the second audio signal using a second type of codec to obtain the second audio signal; mix the second audio signal and the second speech signal using a mixing element 402 to obtain a first mixed signal; and reproduce the first mixed signal using one or more speakers 200. Figure 3 The second compression / decompression unit is not explicitly shown. The second compression / decompression unit is configured to compress audio signals that will be transmitted via the second communication channel 52, or to decompress second compressed audio signals received via the second communication channel 52.

[0028] In an arrangement 30 comprising two or more audio transmission systems 100, each system 100 can transmit and receive signals from each of the other systems 100 in the arrangement. This is in Figure 2The following is an illustration. For example, a user associated with the first system 100a may want to converse with one or more other users associated with other systems 100b, 100c in arrangement 30. The user of system 100a in arrangement 30 may further want to share further audio content from signal source 102 with one or more other users associated with other systems 100b, 100c in the arrangement. For example, the user may want to share music with one or more other users and converse with one or more other users simultaneously. According to another example, the user may be having a telephone conversation with someone who is not a user of system 100 in arrangement 30. The user may want to include one or more other users associated with system 100 in arrangement 30 in the telephone conversation. The speech associated with such (external) telephone conversations may be transmitted uncompressed to other systems 100 in the arrangement, or may be compressed by a second type of codec and transmitted via the second communication channel 52. Low latency is generally only critical for speech captured by microphone 204, but not for speech received from different signal sources 102.

[0029] Two of the multiple systems 100a, 100b, and 100c can be directly connected to each other, or they can be connected to each other via connector unit 60 (star connection). Figure 4 A system 100a connected to connector unit 60 is illustrated as an example. That is, system 100a transmits a compressed audio signal to connector unit 60 via a first communication channel 50. Generally, connector unit 60 is configured to forward the compressed audio signal to another system 100 among the plurality of systems 100a, 100b, 100c arranged 30. Connector unit 60 receives the compressed audio signal from first system 100a via the first communication channel 50 and transmits the compressed audio signal to another system 100b, 100c among the plurality of systems 100a, 100b, 100c via the first communication channel 50. The compressed audio signal can be decompressed by the other system 100b, 100c and can be mixed with another audio signal, similar to the above description. Figure 3 As described.

[0030] For example, connector unit 60 may include a wireless audio unit WAU 620, a mixing unit 640, and an additional signal unit 660. WAU 620 may be configured to transmit and receive compressed audio signals from system 100 in two or more systems 100a, 100b, 100c via a first communication channel 50. That is, for example, WAU 620 may include one or more transmission units 622. WAU 620 may wirelessly communicate with each of the two or more systems 100a, 100b, 100c arranged 30 and one or more signal sources 102. WAU 620 may be coupled to mixing unit 640 via one or more wired connections, and mixing unit 640 may be coupled to additional signal unit 660 via one or more wired connections. In other words, any components of connector unit 60 itself are coupled to each other via wired connections, while connector unit 60 is wirelessly coupled to any external components, such as the two or more systems 100a, 100b, 100c and one or more signal sources 102.

[0031] For example, mixing unit 640 can be configured to mix and amplify different signals received from different systems 100 in arrangement 30. For example, different compressed speech signals received from different systems 100 among two or more systems 100a, 100b, 100c can be mixed to obtain a first compressed mixed speech signal. For example, if three users associated with different systems 100 among two or more systems 100a, 100b, 100c are having a conversation with each other, mixing unit 640 can mix a compressed speech signal received from a second system 100b with a compressed speech signal received from a third system 100c via voice mixer unit 644, and transmit the resulting compressed mixed speech signal to the first system 100a. That is, mixing unit 640 can include a plurality of voice mixer units 644, wherein the number of voice mixer units 644 corresponds to the number of systems 100 included in arrangement 30, and wherein each voice mixer unit 644 is configured to mix compressed speech signals from different systems to be transmitted to system 100.

[0032] The mixing unit 640 may further include a plurality of telephone mixer units 642. Each of the plurality of telephone mixer units 642 is configured to receive compressed voice signals from different systems in system 100. Such compressed voice signals may be mixed with one or more other audio signals, such as audio signals (e.g., music, voice, or telephone signals) received from signal source 102. The resulting mixed signal may be transmitted to signal source 102 and / or additional signal unit 660. For example, additional signal unit 660 may include telephone unit 662 and music source 664. That is, additional signal unit 660 may include further signal sources, such as audio systems and / or telephone devices. Therefore, additional signal unit 660 generally has similar functionality to signal source 102. According to some examples, signal source 102 may be or may include a smartphone or tablet computer capable of providing audio content and sending and receiving telephone signals, and additional signal unit may be or may include (e.g., a vehicle's) entertainment system, which is also capable of providing audio content and sending and receiving telephone signals independently of signal source 102.

[0033] According to some examples, connector unit 60 may be an entertainment system in a vehicle or may be part of an entertainment system in a vehicle. In this case, music source 664 may be the vehicle's audio system (e.g., a radio or any other audio reproduction unit). Audio output from music source 664 and output from telephone unit 662 may be provided to a universal mixer unit 646 included in mixing unit 640. Universal mixer unit 646 may mix the corresponding signals and provide a universal mixed audio signal to the corresponding system 100 via second communication channel 52. However, as regarding Figure 4 The different elements of the connector unit 60 described are merely examples. The connector unit 60 can generally be implemented in any suitable manner. The connector unit 60 can mix different signals and forward the corresponding mixed signals to the respective systems 100 in two or more systems 100.

[0034] exist Figure 4 Only one of the two or more systems 100a, 100b, 100c, system 100a, is shown in detail. Different components of the connector unit 60 associated with any further system 100 of the multiple systems 100a, 100b, 100c can generally be implemented in the same manner as described for the first system 100a. Connecting the different systems 100 of the multiple systems 100a, 100b, 100c via the connector unit 60 (star connection) may be advantageous, for example, in an arrangement 30 comprising three or more systems 100. By connecting the different systems via the connector unit 60, the total number of connections (first connection 50 and second connection 52) can be reduced.

[0035] Now for reference Figure 5 However, it is also possible that different systems 100 in two or more systems 100a, 100b, 100c in the arrangement 30 according to the embodiments of this disclosure are directly connected to each other. Figure 5 A first system 100a is schematically shown as one of two or more systems 100 directly connected to a second system 100b. Each of the first system 100a and the second system 100b is configured as described above. Figure 3 The implementation is as described. The first system 100a is connected to the second system 100b via a first communication channel 50 and a second communication channel 52. Similarly, individual direct connections via the first communication channel 50 and the second communication channel 52 will be established between the first system 100a and any other system 100 in two or more systems 100, and between the second system 100b and any other system 100 in two or more systems 100. For example, if the number of systems 100 is small (e.g., at most three), direct connections between individual systems 100 in two or more systems 100 may be advantageous. By directly connecting the individual systems 100 in two or more systems 100 to each other, the additional cost and complexity of the connector unit 60 can be avoided, and the overall cost and complexity of the arrangement 30 can be reduced.

[0036] Arrangement 30 according to the embodiments of this disclosure includes two or more audio transmission systems 100 arranged in a listening environment according to the above embodiments. (See also: Regarding...) Figure 5 According to some embodiments, the first communication channel 50 and the second communication channel 52 can directly connect the first system 100a of two or more audio transmission systems 100 to the second system 100b of two or more audio transmission systems 100. Alternatively, as per [reference to...] Figure 4 The arrangement 30 may further include a connector unit 60, wherein the first communication channel 50 and the second communication channel 52 directly connect the first system 100a of two or more audio transmission systems 100 to the connector unit 60. The second system 100b and any further systems 100 of the two or more systems 100 may be connected to the connector unit 60 in the same manner. For example, the listening environment may be the passenger compartment, room, or hall of a vehicle. That is, the different systems 100 of the two or more systems are arranged within a defined distance from each other. For example, the maximum distance between two systems 100 of the two or more systems 100 may be, for example, 10 m, 20 m, or 50 m.

[0037] Now for reference Figure 6The diagram schematically illustrates a method of operating an audio transmission system 100 according to an embodiment of the present disclosure. The audio transmission system 100 includes: one or more speakers 200, each of which is configured to radiate sound corresponding to a sound signal to at least one of two listening positions associated with a user of the respective system 100, each of the two listening positions being located at the entrance of the ear canal of the respective user; and a microphone 204 configured to capture speech originating from a user associated with the respective system 100. The method includes capturing a first speech signal originating from a user associated with the system 100 via the microphone 204 (step 601). At 602, the method includes compressing the first speech signal using a codec of a first type to obtain a first compressed speech signal. At 603, the method further includes receiving a first audio signal from a signal source 102, wherein the signal source 102 is different from the microphone 204. The method further includes: transmitting a first compressed voice signal via a first communication channel 50 (step 604), and transmitting a first audio signal or a compressed version of the first audio signal via the first communication channel 50 or via a second communication channel 52 (step 605).

[0038] The system and method described herein allow for the transmission of speech signals between different systems with low latency, and it is possible to transmit other audio signals simultaneously, resulting in a highly satisfactory listening experience for users of the receiving system. The signals are transmitted via different communication channels and mixed at the receiving system for reproduction. This allows for the use of optimal codecs for each different type of audio (speech received via the microphone 204 of the system, and audio signals received from other signal sources 102).

[0039] It is understood that the system shown is merely an example. While various embodiments of the invention have been described, it will be apparent to those skilled in the art that many more embodiments and implementations are possible within the scope of the invention. Specifically, those skilled will recognize the interchangeability of various features from different embodiments. Although these techniques and systems have been disclosed in the context of certain embodiments and examples, it should be understood that these techniques and systems can be extended beyond the specifically disclosed embodiments to other embodiments and / or uses and obvious modifications. Therefore, the invention is not limited except as provided in the appended claims and their equivalents.

[0040] The description of the embodiments has been presented for purposes of illustration and description. Suitable modifications and variations to the embodiments can be performed in light of the foregoing description or can be obtained through practical methods. The arrangements are exemplary in nature and may include additional and / or omitted elements. As used in this application, elements listed in the singular and preceded by the word "an" or "a" should be understood as not excluding a plurality of said elements unless such exclusion is indicated. Furthermore, references to "an embodiment" or "an example" in this disclosure are not intended to exclude the existence of additional embodiments that also include the listed features. The terms "first," "second," and "third," etc., are used only as labels and are not intended to impose numerical requirements or a particular order of position on their objects. The systems described are exemplary in nature and may include additional and / or omitted elements. The subject matter of this disclosure includes all novel and non-obvious combinations and sub-combinations of the various systems and configurations disclosed, as well as other features, functions, and / or properties. The appended claims specifically point to subject matter derived from the foregoing disclosure and considered novel and non-obvious.

Claims

1. An audio transmission system (100), comprising: One or more speakers (200), each of which is configured to radiate sound corresponding to a sound signal to at least one of two listening positions associated with a user of the corresponding system (100), each of which is located at the entrance of the ear canal of the corresponding user; as well as A microphone (204) configured to capture speech originating from a user associated with the corresponding system (100), wherein the system (100) is configured to: The microphone (204) captures a first voice signal originating from the user associated with the system (100). The first speech signal is obtained by compressing the first speech signal using a first type of codec. A first audio signal is received from a signal source (102), wherein the signal source (102) is different from the microphone (204). The first compressed voice signal is transmitted via the first communication channel (50), and The first audio signal or a compressed version of the first audio signal is transmitted via the first communication channel (50) or via the second communication channel (52).

2. The audio transmission system (100) of claim 1, wherein the codec of the first type is a low-latency codec.

3. The audio transmission system (100) of claim 1 or 2, wherein the system (100) is further configured to compress the first audio signal using a second type of codec, different from the first type, to obtain the compressed version of the first audio signal.

4. The audio transmission system (100) of claim 3, wherein the codec of the second type is a low-bandwidth codec.

5. The audio transmission system (100) as claimed in any of the preceding claims, wherein the system (100) is further configured to: The second compressed voice signal is received via the first communication channel (50). Receive a second audio signal via the first communication channel (50) or via the second communication channel (52). The second compressed speech signal is decompressed using the codec of the first type to obtain the second speech signal. The second audio signal is mixed with the second speech signal to obtain a first mixed signal, and The first mixed signal is reproduced by the one or more speakers (200).

6. The audio transmission system (100) as claimed in any one of claims 1 to 4, wherein the system (100) is further configured to: The second compressed voice signal is received via the first communication channel (50). A compressed version of the second audio signal is received via the first communication channel (50) or via the second communication channel (52). The second compressed speech signal is decompressed using the codec of the first type to obtain the second speech signal. The processed version of the second audio signal is decompressed using the codec of the second type to obtain the second audio information. The second audio signal is mixed with the second speech signal to obtain a first mixed signal, and The first mixed signal is reproduced by the one or more speakers (200).

7. The system as described in any of the preceding claims, wherein The first communication channel (50) is an ultra-wideband (UWB) communication channel, and / or The second communication channel (52) is a Bluetooth communication channel.

8. An arrangement (30) comprising two or more audio transmission systems (100) as described in any one of claims 1 to 7 arranged in a listening environment.

9. The arrangement (30) as claimed in claim 8, wherein the first communication channel (50) and the second communication channel (52) directly connect the first system (100a) of the two or more audio transmission systems (100) to the second system (100b) of the two or more audio transmission systems (100).

10. The arrangement (30) of claim 8 further includes a connector unit (60), wherein the first communication channel (50) and the second communication channel (52) directly connect the first system (100a) of the two or more audio transmission systems (100) to the connector unit (60).

11. The arrangement (30) as claimed in any one of claims 8 to 10, wherein the listening environment is the passenger cabin, room or hall of the vehicle.

12. A method for operating an audio transmission system (100), the system (100) comprising: One or more speakers (200), each of which is configured to radiate sound corresponding to a sound signal to at least one of two listening positions associated with a user of the corresponding system (100), each of which is located at the entrance of the ear canal of the corresponding user; and a microphone (204) configured to capture speech originating from a user associated with the corresponding system (100), wherein the method includes: The microphone (204) captures a first voice signal originating from a user associated with the system (100). The first speech signal is obtained by compressing the first speech signal using a first type of codec. A first audio signal is received from a signal source (102), wherein the signal source (102) is different from the microphone (204). The first compressed voice signal is transmitted via the first communication channel (50), and The first audio signal or a compressed version of the first audio signal is transmitted via the first communication channel (50) or via the second communication channel (52).