Call processing device and call processing method
The call processing apparatus addresses the issue of misinterpretation by selectively retaining specific driving sounds, ensuring the call partner understands the occupant is driving, thereby maintaining call quality.
Patent Information
- Authority / Receiving Office
- JP · JP
- Patent Type
- Patents
- Current Assignee / Owner
- DENSO TEN LTD
- Filing Date
- 2022-03-09
- Publication Date
- 2026-06-12
AI Technical Summary
Conventional call processing technologies using pseudo-generated environmental sounds fail to accurately convey to the call partner that the vehicle occupant is driving, leading to uncertainty and potential misinterpretation.
A call processing apparatus that selectively retains specific driving sounds, such as turn signal and tire noise, while removing other operating noises, ensuring the call partner understands the occupant is driving.
Accurately conveys the driving state to the call partner, preventing misinterpretation and maintaining call quality by retaining non-interfering driving sounds.
Abstract
Description
【Technical Field】
【0001】 The present invention relates to a call processing device and a call processing method.
【Background Art】
【0002】 Conventionally, technologies that allow a vehicle occupant to make hands-free calls are known. For this type of technology, it has been proposed to generate and transmit pseudo environmental sounds so that the call partner can tell that the vehicle occupant is driving and therefore unable to talk (see, for example, Patent Document 1).
【Prior Art Documents】
【Patent Documents】
【0003】 【Patent Document 1】 Japanese Patent Application Laid-Open No. 2000-332677
【Summary of the Invention】
【Problems to be Solved by the Invention】
【0004】 However, because the conventional technology transmits only pseudo-generated environmental sounds, the call partner will not necessarily understand that the occupant is driving and unable to talk.
【0005】 The present invention has been made in view of the above, and an object thereof is to provide a call processing device and a call processing method that allow the call partner to understand accurately that the vehicle is being driven.
【Means for Solving the Problems】
【0006】 To solve the above problems and achieve the object, a call processing device according to the present invention includes a control unit. When an occupant is on a call, the control unit performs, on the audio collected by a microphone in the vehicle interior, a removal process that removes other driving sounds while retaining specific driving sounds among the driving sounds associated with driving the vehicle, and transmits the resulting adjusted audio to the call partner.
【Effects of the Invention】
【0007】 According to the present invention, the call partner can be made to understand with high accuracy that the occupant is driving.
【Brief Description of the Drawings】
【0008】
[Figure 1] Figure 1 is a diagram showing an overview of the call processing method according to the embodiment.
[Figure 2] Figure 2 is a block diagram showing the functional configuration of the call processing device according to the embodiment.
[Figure 3] Figure 3 is a diagram showing an example of the sound feature information.
[Figure 4] Figure 4 is a diagram showing the frequency characteristics of the driving sounds.
[Figure 5] Figure 5 is a flowchart showing the overall processing procedure executed by the call processing device according to the embodiment.
【Modes for Carrying Out the Invention】
【0009】 Hereinafter, embodiments of the call processing device and call processing method disclosed in this application will be described in detail with reference to the attached drawings. The present invention is not limited to the embodiments described below.
【0010】 First, an overview of the call processing method according to the embodiment will be described using Figure 1. Figure 1 is a diagram showing an overview of the call processing method according to the embodiment, and it also shows an example of the configuration of the call system S according to the embodiment.
【0011】 The call system S shown in Figure 1 is installed, for example, in a vehicle and operates when a vehicle occupant makes a hands-free call to a call partner. As shown in Figure 1, the call system S includes a call processing device 1, a microphone 10, a speaker 11, and a call device 100. Of these components, the call processing device 1, the microphone 10, and the speaker 11 are installed in the vehicle in which the occupant making the call is riding, and the call device 100 is installed in a terminal device held by the call partner. The call processing method according to the embodiment is performed by the call processing device 1.
【0012】 The microphone 10 is installed inside the vehicle and collects audio from the vehicle interior. Specifically, the microphone 10 collects audio that includes noise, such as the vehicle's environmental sounds, and the occupants' speech. The environmental sounds of the vehicle include, for example, driving sounds that occur when the vehicle is driven. Driving sounds include, for example, running noise, engine noise, air conditioning noise, turn signal sound, wiper sound, and the sounds of various switches. The speaker 11 is an output device that outputs the voice of the call partner.
【0013】 The call processing device 1 performs a noise removal process on the audio collected by the microphone 10 and transmits the resulting adjusted audio to the call device 100.
【0014】 Here, an occupant who is the driver may want to end the call as soon as possible, because talking with the call partner becomes a burden when the driver wants to concentrate on driving. However, a conventional noise removal process removes every kind of noise so that the occupant's speech reaches the call partner clearly, which makes it difficult for the call partner to tell that the occupant is driving.
【0015】 In this regard, a conventional technology generates pseudo environmental sounds and transmits them along with the voice so that the call partner understands that the occupant is driving. However, because this technology transmits only pseudo-generated environmental sounds, the call partner will not necessarily understand that the occupant is driving and therefore unable to talk.
【0016】 Therefore, in the call processing method according to the embodiment, a removal process is performed that removes other driving sounds while retaining specific driving sounds among the driving sounds associated with driving the vehicle.
【0017】 As will be described in more detail later, in the call processing method according to the embodiment, driving sounds that are unlikely to interfere with the call and that indicate the vehicle is being driven, such as the turn signal sound, are selectively retained and transmitted to the call partner.
【0018】 That is, because part of the actual driving sound is transmitted to the call partner, the call partner can be made to understand that the occupant is driving more accurately than with pseudo-generated environmental sounds. Furthermore, because the retained driving sounds are ones that are unlikely to interfere with the call, a deterioration in call quality can also be prevented.
【0019】 Next, a configuration example of the call processing device 1 according to the embodiment will be described with reference to Figure 2. Figure 2 is a block diagram showing a functional configuration example of the call processing device 1 according to the embodiment.
【0020】 As shown in Figure 2, the call processing device 1 according to the embodiment includes a control unit 2 and a storage unit 3. The control unit 2 includes an acquisition unit 21, a detection unit 22, a removal unit 23, and a call control unit 24. The storage unit 3 stores sound feature information 31.
【0021】 Here, the call processing device 1 includes, for example, a computer having a CPU (Central Processing Unit), a ROM (Read Only Memory), a RAM (Random Access Memory), a flash memory, an input/output port, and the like, as well as various circuits.
【0022】 The CPU of the computer functions as the acquisition unit 21, the detection unit 22, the removal unit 23, and the call control unit 24 of the control unit 2 by, for example, reading and executing a program stored in the ROM.
【0023】 Some or all of the acquisition unit 21, the detection unit 22, the removal unit 23, and the call control unit 24 of the control unit 2 can also be implemented in hardware such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).
【0024】 The storage unit 3 corresponds to, for example, the RAM and the flash memory. The RAM and the flash memory can store the sound feature information 31 and information about various programs. The call processing device 1 may also acquire the above-mentioned programs and various information via another computer connected over a wired or wireless network, or via a portable recording medium.
【0025】 The sound feature information 31 contains information about the characteristics of the driving sounds. For example, the sound feature information 31 may be generated in advance through experiments or other means. Alternatively, it may be the result of collecting driving sounds with the microphone 10 during actual driving and analyzing them.
【0026】 Figure 3 shows an example of the sound feature information 31. As shown in Figure 3, the sound feature information 31 includes items such as "driving sound" and "feature information".
【0027】 "Driving sound" is information that identifies a driving sound, such as its name or its source. "Feature information" is information that describes the characteristics of that driving sound. In Figure 3 it is written as "feature information #1" and so on, but in practice it holds information such as frequency characteristics, amplitude characteristics, sound pressure level characteristics, and intermittent period. The intermittent period is a characteristic of driving sounds that occur at a predetermined interval in seconds, such as the turn signal sound and the wiper sound.
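As an illustration of how the sound feature information 31 described above might be held in memory, the following is a minimal sketch. The class name, the field names, and every numeric value are hypothetical placeholders chosen for the example; they are not taken from Figure 3 or Figure 4. Which sources count as specific driving sounds follows the examples given in the description.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class SoundFeature:
    """One row of a table modeled on the sound feature information 31.

    All field names are assumptions made for this sketch.
    """
    source: str                    # "driving sound" item: name of the sound source
    band_hz: Tuple[float, float]   # frequency characteristic (placeholder band)
    level_db: float                # sound pressure level characteristic (placeholder)
    period_s: Optional[float]      # intermittent period, None for continuous sounds
    specific: bool                 # True if retained by the removal process


# Placeholder rows; the numbers are invented for illustration only.
SOUND_FEATURE_INFO: Dict[str, SoundFeature] = {
    f.source: f
    for f in (
        SoundFeature("running_noise", (20.0, 300.0), 70.0, None, False),
        SoundFeature("engine", (50.0, 500.0), 65.0, None, False),
        SoundFeature("air_conditioner", (100.0, 800.0), 55.0, None, False),
        SoundFeature("turn_signal", (1000.0, 4000.0), 50.0, 0.7, True),
        SoundFeature("wiper", (1000.0, 3000.0), 50.0, 1.5, True),
        SoundFeature("tire_squeal", (2000.0, 6000.0), 60.0, None, True),
        SoundFeature("button_switch", (1500.0, 5000.0), 45.0, None, True),
    )
}


def specific_sources(table: Dict[str, SoundFeature]) -> List[str]:
    """Names of the sources that the removal process should retain."""
    return sorted(name for name, f in table.items() if f.specific)


def intermittent_sources(table: Dict[str, SoundFeature]) -> List[str]:
    """Names of the sources that repeat at a fixed interval."""
    return sorted(name for name, f in table.items() if f.period_s is not None)
```

A lookup table keyed by source keeps the "retain or remove" decision as plain data, so it could be regenerated from experiments or from recordings without changing the detection code.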
【0028】 Next, each function of the control unit 2 (the acquisition unit 21, the detection unit 22, the removal unit 23, and the call control unit 24) will be described in detail.
【0029】 The acquisition unit 21 acquires the audio collected by the microphone 10. For example, the acquisition unit 21 acquires the audio collected by the microphone 10 while the occupant is in a call (connected) with the call partner.
【0030】 The microphone 10 is placed where it can collect the voices of all occupants in the vehicle (for example, near the rearview mirror). Alternatively, a microphone 10 may be provided at a position corresponding to each seat so that the voice of each occupant is collected individually.
【0031】 The detection unit 22 detects driving sounds associated with driving the vehicle from the audio acquired by the acquisition unit 21. Specifically, the detection unit 22 refers to the sound feature information 31 stored in the storage unit 3 and detects the driving sounds listed in the sound feature information 31.
【0032】 In other words, the detection unit 22 stores the sound characteristics of each driving sound source in advance as the sound feature information 31, and detects driving sounds with specific characteristics as specific driving sounds.
【0033】 Furthermore, the detection unit 22 detects the specific driving sounds from among the detected driving sounds. As mentioned above, the specific driving sounds are the driving sounds that the removal process retains rather than removes. In other words, the detection unit 22 separates the specific driving sounds to be retained by the removal process from the other driving sounds to be removed by it.
【0034】 In this way, the sound characteristics of each driving sound source are stored in advance as the sound feature information 31, and driving sounds with specific characteristics are detected as specific driving sounds. This makes it possible to distinguish accurately between the specific driving sounds to be retained by the removal process and the other driving sounds to be removed by it.
【0035】 For example, the detection unit 22 detects driving sounds that are unlikely to interfere with the call as specific driving sounds. Among the driving sounds in the sound feature information 31, those that are unlikely to interfere with the call include, for example, tire noise when the steering wheel is turned, the turn signal sound, the wiper sound, and the sound of button switches. Tire noise when the steering wheel is turned refers to, for example, the squeal generated by friction between the tires and a painted road surface, such as in an indoor parking lot.
【0036】 Specifically, the detection unit 22 detects driving sounds in a specific frequency band as specific driving sounds. This point will be explained using Figure 4. Figure 4 is a diagram showing the frequency characteristics of the driving sounds.
【0037】 Figure 4 shows the frequency characteristics of road noise, wind noise when the windows are open, conversational speech, the turn signal sound (or wiper sound), and tire noise when the steering wheel is turned.
【0038】 In the example shown in Figure 4, the specific driving sounds are the turn signal sound and the tire noise when the steering wheel is turned. In other words, the removal unit 23 in the subsequent stage removes the road noise and the wind noise while retaining the turn signal sound and the tire noise when the steering wheel is turned.
【0039】 Based on the frequency characteristics of the driving sounds shown in Figure 4, the detection unit 22 detects driving sounds in a specific frequency band as specific driving sounds. For example, the detection unit 22 treats a frequency band that contains the turn signal sound and the tire noise when the steering wheel is turned as the specific frequency band.
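The band-based selection described above can be sketched as follows. This is a toy illustration under several assumptions: a time-domain estimate of the driving sounds is taken to be available from elsewhere, cancellation is modeled as a plain subtraction of the part of that estimate lying outside the retained band, and the band edges, sample rate, and test tones are placeholder values. The function names are hypothetical, and a naive DFT is used only to keep the example dependency-free.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform (fine for short illustrative frames)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Inverse of dft(); returns the real part of each sample."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                 for k in range(n)) / n).real
            for t in range(n)]


def cancel_outside_band(audio, driving_estimate, sample_rate, keep_low_hz, keep_high_hz):
    """Subtract the driving-sound estimate from the audio, except inside the kept band.

    audio and driving_estimate must have the same length.
    """
    n = len(audio)
    spectrum = dft(driving_estimate)
    for k in range(n):
        # Bin k and bin n - k describe the same physical frequency.
        freq = min(k, n - k) * sample_rate / n
        if keep_low_hz <= freq <= keep_high_hz:
            spectrum[k] = 0  # specific band: leave these components in the audio
    to_cancel = idft(spectrum)
    return [a - c for a, c in zip(audio, to_cancel)]


def tone(freq_hz, sample_rate, n):
    """Pure sine tone used as a stand-in for one sound source."""
    return [math.sin(2 * math.pi * freq_hz * t / sample_rate) for t in range(n)]


RATE, N = 8000, 64
speech = tone(500, RATE, N)     # stand-in for the occupant's voice
road = tone(125, RATE, N)       # stand-in for low-frequency road noise
blinker = tone(2000, RATE, N)   # stand-in for the turn signal sound
audio = [s + r + b for s, r, b in zip(speech, road, blinker)]
estimate = [r + b for r, b in zip(road, blinker)]

adjusted = cancel_outside_band(audio, estimate, RATE, 1000.0, 4000.0)
```

In this toy case the 125 Hz component is cancelled, while the voice and the 2 kHz component remain, because only the driving-sound estimate outside the retained band is subtracted.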
【0040】 In the example shown in Figure 4, the detection unit 22 treats, for example, the frequency band of 1 kHz and above as the specific frequency band. In other words, the detection unit 22 detects driving sounds above a predetermined frequency (1 kHz) as specific driving sounds.
【0041】 This allows driving sounds that are unlikely to interfere with the call to be retained without having to identify each individual driving sound.
【0042】 Furthermore, based on the frequency characteristics of the driving sounds shown in Figure 4, the detection unit 22 detects driving sounds in frequency bands other than the band corresponding to conversational speech as specific driving sounds. In the example shown in Figure 4, the detection unit 22 detects, for example, driving sounds in frequency bands other than 300 Hz to 3 kHz as specific driving sounds.
【0043】 As a result, driving sounds whose frequency characteristics overlap those of conversational speech are removed, which reliably prevents the speech from being masked by driving sounds.
【0044】 The removal unit 23 performs the removal process on the driving sounds detected by the detection unit 22, removing the other driving sounds while retaining the specific driving sounds. The removal process is performed, for example, by mixing a signal that has the frequency characteristics of the driving sound into the audio in reverse phase.
【0045】 For example, the removal unit 23 removes the frequency bands corresponding to the other driving sounds while retaining the specific frequency band corresponding to the specific driving sounds.
【0046】 Specifically, the removal unit 23 performs the removal process so as to retain the specific driving sounds above the predetermined frequency (1 kHz). The removal unit 23 also performs the removal process so as to retain the specific driving sounds in frequency bands other than the band corresponding to conversational speech.
【0047】 Furthermore, the removal unit 23 may perform the removal process according to the vehicle speed. For example, for the running noise and engine noise in the sound feature information 31, the removal unit 23 reduces the degree of removal as the vehicle speed increases. In other words, when the vehicle speed is above a predetermined value, the running noise and engine noise are treated as specific driving sounds.
【0048】 For example, when the removal unit 23 retains the running noise and engine noise in the removal process, it may raise the sound pressure level of that noise as the vehicle speed increases. When the vehicle speed is high and the driver needs to concentrate on driving, the call partner then hears the running noise and engine noise, which conveys that the occupant is driving.
【0049】 Furthermore, the removal unit 23 may determine whether the occupant on the call is the driver or someone other than the driver, and may vary the sound pressure level of the specific driving sounds retained by the removal process according to the result.
【0050】 For example, when the occupant on the call is not the driver, the removal unit 23 lowers the sound pressure level of the specific driving sounds retained by the removal process compared with when the occupant on the call is the driver.
【0051】 This prevents the call partner from mistakenly believing that the occupant on the call is driving when that occupant is not the driver. Whether the occupant on the call is the driver can be determined, for example, from the relationship between the sound pressure level of the audio collected by the microphone 10 and the relative positions of the microphone 10 and the occupants.
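The speed-dependent and driver-dependent adjustments described above could be combined into a single gain for the retained running noise and engine noise, as in the following minimal sketch. The function name, the linear ramp, and all thresholds and factors are hypothetical; the description only states the direction of each adjustment, not a formula.

```python
def retained_noise_gain(speed_kmh: float,
                        caller_is_driver: bool,
                        threshold_kmh: float = 60.0,
                        full_kmh: float = 120.0,
                        passenger_factor: float = 0.5) -> float:
    """Linear gain applied to running noise and engine noise in the adjusted audio.

    0.0 means the noise is fully removed; 1.0 means it is passed at full level.
    All default values are placeholders chosen for this sketch.
    """
    # At or below the threshold the noise is not a specific driving sound: remove it.
    if speed_kmh <= threshold_kmh:
        return 0.0

    # Above the threshold the noise is retained, and louder at higher speed.
    ratio = min(1.0, (speed_kmh - threshold_kmh) / (full_kmh - threshold_kmh))
    gain = 0.5 + 0.5 * ratio

    # When the occupant on the call is not the driver, retain the sound at a lower level.
    if not caller_is_driver:
        gain *= passenger_factor
    return gain
```

For instance, with these placeholder defaults a driver at 90 km/h gets a gain of 0.75, and a non-driver at the same speed gets half of that.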
【0052】 Furthermore, when the occupant on the call is the driver, for example, the removal unit 23 raises the sound pressure level of the specific driving sounds retained by the removal process as the call duration increases. This reliably keeps calls from running long when the driver is the one on the call.
【0053】 The call control unit 24 transmits the adjusted audio produced by the removal process of the removal unit 23 to the call device 100. The call control unit 24 also outputs the call partner's speech, received from the call device 100, through the speaker 11.
【0054】 Next, the processing procedure of the call processing device 1 according to the embodiment will be explained using Figure 5. Figure 5 is a flowchart showing the overall processing procedure of the call processing device 1 according to the embodiment.
【0055】 As shown in Figure 5, the acquisition unit 21 acquires the audio collected by the microphone 10 (step S101).
【0056】 Next, the detection unit 22 refers to the sound feature information 31 and detects the specific driving sounds in the audio (step S102).
【0057】 Next, the removal unit 23 performs the removal process, removing the other driving sounds while retaining the detected specific driving sounds (step S103).
【0058】 Next, the call control unit 24 transmits the adjusted audio, which is the audio after the removal process, to the call device 100 (step S104), and the process ends.
【0059】 As described above, the call processing device 1 according to the embodiment includes the control unit 2. When an occupant is on a call, the control unit 2 performs, on the audio collected by the microphone 10 in the vehicle interior, a removal process that removes the other driving sounds while retaining the specific driving sounds among the driving sounds associated with driving the vehicle, and transmits the resulting adjusted audio to the call partner (the call device 100). This allows the call partner to understand with high accuracy that the occupant is driving.
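The four steps S101 to S104 can be sketched as a small pipeline. This is a toy model, not the embodiment's signal processing: each audio frame is assumed to be already separated into named components with a level, which a real system would have to estimate from the microphone signal. The component names, the `Frame` type, and the function names are all hypothetical.

```python
from typing import Callable, Dict, List, Set

# Toy representation: component name -> level. Real audio is not pre-separated like this.
Frame = Dict[str, float]

# Hypothetical labels, following the examples of driving sounds in the description.
SPECIFIC: Set[str] = {"turn_signal", "wiper", "tire_squeal", "button_switch"}
DRIVING: Set[str] = SPECIFIC | {"running_noise", "engine", "air_conditioner", "wind"}


def acquire(mic: Callable[[], Frame]) -> Frame:
    """S101: acquire the audio collected by the microphone."""
    return dict(mic())


def detect(frame: Frame) -> Set[str]:
    """S102: detect which specific driving sounds are present in the frame."""
    return {name for name in frame if name in SPECIFIC}


def remove(frame: Frame, specific: Set[str]) -> Frame:
    """S103: drop other driving sounds, keep speech and the specific driving sounds."""
    return {name: level for name, level in frame.items()
            if name not in DRIVING or name in specific}


def transmit(frame: Frame, link: List[Frame]) -> None:
    """S104: send the adjusted audio to the call device (modeled as a list)."""
    link.append(frame)


def process_frame(mic: Callable[[], Frame], link: List[Frame]) -> Frame:
    """Run S101 to S104 once for a single frame."""
    frame = acquire(mic)
    specific = detect(frame)
    adjusted = remove(frame, specific)
    transmit(adjusted, link)
    return adjusted


sent: List[Frame] = []
result = process_frame(
    lambda: {"voice": 1.0, "running_noise": 0.8, "turn_signal": 0.3},
    sent,
)
```

Here the running noise is dropped while the voice and the turn signal sound reach the call partner, which mirrors the flow of Figure 5 at the level of which components survive.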
【0060】 Further effects and modifications can be readily derived by those skilled in the art. Therefore, the broader aspects of the present invention are not limited to the specific details and representative embodiments shown and described above. Accordingly, various modifications are possible without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents.
【Description of Reference Signs】
【0061】
1 Call processing device
2 Control unit
3 Storage unit
10 Microphone
11 Speaker
21 Acquisition unit
22 Detection unit
23 Removal unit
24 Call control unit
31 Sound feature information
100 Call device
S Call system
Claims
[Claim 1] A call processing device comprising a control unit that, when an occupant is on a call, transmits to the call partner adjusted audio generated by performing, on audio collected by a microphone in the vehicle interior, a removal process that removes other driving sounds while retaining specific driving sounds among the driving sounds associated with driving the vehicle, wherein the control unit determines whether the occupant on the call is the driver or someone other than the driver and, when the occupant is not the driver, lowers the sound pressure level of the specific driving sounds retained by the removal process compared with when the occupant is the driver.
[Claim 2] The call processing device according to claim 1, wherein the control unit stores sound characteristics in advance for each source of the driving sounds and performs the removal process so as to retain the driving sounds having specific characteristics.
[Claim 3] The call processing device according to claim 1 or 2, wherein the control unit performs the removal process so as to retain the driving sounds in a specific frequency band.
[Claim 4] The call processing device according to claim 3, wherein the control unit performs the removal process so as to retain the driving sounds above a predetermined frequency.
[Claim 5] The call processing device according to claim 3, wherein the control unit performs the removal process so as to retain the driving sounds in frequency bands other than the frequency band corresponding to conversational speech.
[Claim 6] The call processing device according to any one of claims 1 to 5, wherein the control unit raises the sound pressure level of the specific driving sounds retained by the removal process as the vehicle speed increases.
[Claim 7] The call processing device according to any one of claims 1 to 6, wherein the control unit raises the sound pressure level of the specific driving sounds retained by the removal process as the call duration increases.
[Claim 8] A call processing method performed by a computer, the method comprising a control step of, when an occupant is on a call, transmitting to the call partner adjusted audio generated by performing, on audio collected by a microphone in the vehicle interior, a removal process that removes other driving sounds while retaining specific driving sounds among the driving sounds associated with driving the vehicle, wherein the control step determines whether the occupant on the call is the driver or someone other than the driver and, when the occupant is not the driver, lowers the sound pressure level of the specific driving sounds retained by the removal process compared with when the occupant is the driver.