A vehicle-mounted intelligent interaction method, device, equipment and storage medium
By acquiring multimodal interaction commands from the in-vehicle intelligent interaction system and setting response priorities according to the operating mode, the problem of insufficient intelligence of existing systems under multiple commands is solved, thereby improving user experience and interaction efficiency.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- FAW JIEFANG AUTOMOTIVE CO
- Filing Date
- 2023-08-28
- Publication Date
- 2026-06-12
AI Technical Summary
Existing in-vehicle intelligent interaction systems lack effective response logic when faced with multiple complex interaction commands, resulting in insufficient intelligence and affecting user experience.
By acquiring multimodal interaction commands from users inside the vehicle, the interaction type is determined, and response priorities are set according to the current operating mode of the vehicle. High-priority interaction commands are processed first until all command responses are completed.
It improves the intelligent response capability and user experience under complex interactive commands, and ensures the intelligent processing of interactive commands in different scenarios.
Smart Images

Figure CN117125089B_ABST
Abstract
Description
Technical Field
[0001] This invention relates to the field of vehicle human-machine interaction technology, and in particular to an in-vehicle intelligent interaction method, device, equipment, and storage medium. Background Technology
[0002] As commercial vehicles evolve towards intelligence, comfort, and new energy, people's demands for the driving experience are constantly increasing, leading to the emergence of a variety of in-vehicle intelligent interactive devices. Users can interact with in-vehicle intelligent systems via voice to achieve functions such as navigation, music playback, and phone calls. The significance of in-vehicle intelligent interactive systems lies in improving driving safety, enhancing the driving experience, and strengthening the interaction between users and their vehicles.
[0003] Existing in-vehicle intelligent interaction systems can respond promptly to single user interaction commands, but when there are many interaction commands and they are of complex types, they lack effective response logic and their intelligence is insufficient, thus affecting the user experience. Summary of the Invention
[0004] This invention provides an in-vehicle intelligent interaction method, device, equipment, and storage medium to improve the intelligence level of complex interaction commands.
[0005] According to one aspect of the present invention, an in-vehicle intelligent interaction method is provided, comprising:
[0006] Acquire at least two multimodal interaction commands from the user inside the vehicle, and determine the interaction type of each multimodal interaction command;
[0007] Determine the response priority for each interaction type based on the vehicle's current operating mode;
[0008] The multimodal interaction commands are responded to according to their interaction type and response priority.
[0009] According to another aspect of the present invention, an in-vehicle intelligent interaction device is provided, comprising:
[0010] The interaction type determination module is used to acquire at least two multimodal interaction commands from the user in the vehicle and determine the interaction type of each multimodal interaction command.
[0011] The response priority determination module is used to determine the response priority of each interaction type based on the current operating mode of the vehicle.
[0012] The interactive instruction response module is used to respond to each multimodal interactive instruction according to the interaction type and response priority of each multimodal interactive instruction.
[0013] According to another aspect of the present invention, an electronic device is provided, the electronic device comprising:
[0014] At least one processor; and
[0015] A memory communicatively connected to the at least one processor; wherein,
[0016] The memory stores a computer program that can be executed by the at least one processor, which enables the at least one processor to perform the in-vehicle intelligent interaction method according to any embodiment of the present invention.
[0017] According to another aspect of the present invention, a computer-readable storage medium is provided, the computer-readable storage medium storing computer instructions for causing a processor to execute and implement the in-vehicle intelligent interaction method according to any embodiment of the present invention.
[0018] This invention, by setting different command response priorities for different vehicle operating modes, enables intelligent responses to interactive commands in various scenarios. Through effective response strategies and logic, it improves the intelligence of responding to interactive commands and enhances the user experience.
[0019] It should be understood that the description in this section is not intended to identify key or essential features of the embodiments of the present invention, nor is it intended to limit the scope of the invention. Other features of the invention will become readily apparent from the following description. Attached Figure Description
[0020] To more clearly illustrate the technical solutions in the embodiments of the present invention, the accompanying drawings used in the description of the embodiments will be briefly introduced below. Obviously, the accompanying drawings described below are only some embodiments of the present invention. For those skilled in the art, other drawings can be obtained based on these drawings without creative effort.
[0021] Figure 1A This is a flowchart of an in-vehicle intelligent interaction method according to an embodiment of the present invention;
[0022] Figure 1B This is a functional division diagram of an in-vehicle intelligent interaction device according to an embodiment of the present invention;
[0023] Figure 2A This is a flowchart of an in-vehicle intelligent interaction method according to another embodiment of the present invention;
[0024] Figure 2B This is a schematic diagram of a human-computer interaction process according to an embodiment of the present invention;
[0025] Figure 3This is a structural schematic diagram of an in-vehicle intelligent interaction device according to another embodiment of the present invention;
[0026] Figure 4 This is a schematic diagram of the structure of an electronic device that implements an embodiment of the present invention. Detailed Implementation
[0027] To enable those skilled in the art to better understand the present invention, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings of the embodiments of the present invention. Obviously, the described embodiments are only some embodiments of the present invention, and not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative effort should fall within the scope of protection of the present invention.
[0028] It should be noted that the terms "first," "second," etc., in the specification, claims, and accompanying drawings of this invention are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It should be understood that such data can be interchanged where appropriate so that the embodiments of the invention described herein can be implemented in orders other than those illustrated or described herein. Furthermore, the terms "comprising" and "having," and any variations thereof, are intended to cover a non-exclusive inclusion; for example, a process, method, system, product, or apparatus that comprises a series of steps or units is not necessarily limited to those steps or units explicitly listed, but may include other steps or units not explicitly listed or inherent to such processes, methods, products, or apparatus.
[0029] Figure 1A This is a flowchart illustrating an embodiment of an in-vehicle intelligent interaction method according to the present invention. This embodiment is applicable to situations where multiple interaction commands awaiting response appear simultaneously within a short period of time, requiring the intelligent interaction system to determine the response priority of each interaction command and respond sequentially. This method can be executed by an in-vehicle intelligent interaction device, which can be implemented in hardware and / or software. This device can be configured in an electronic device with corresponding processing capabilities, such as a vehicle's intelligent interaction system. Figure 1A As shown, the method includes:
[0030] S110: Obtain at least two multimodal interaction commands from the user inside the vehicle, and determine the interaction type of each multimodal interaction command.
[0031] S120. Determine the response priority of each interaction type based on the vehicle's current operating mode.
[0032] S130. Respond to each multimodal interaction instruction according to its interaction type and response priority.
[0033] The interaction types include vehicle control and device interaction, multimedia interaction, and driver assistance interaction. Operating modes include driving mode and parking mode. For example... Figure 1B As shown, the intelligent interaction system can be divided into four functional modules: vehicle control and design module, multimedia module, multimodal interaction module, and driver assistance module. Vehicle control and design module involves the setting of basic vehicle functions, including hardware devices such as windows, doors, air conditioning, seats, and lights; multimedia interaction involves local audio and video playback and online audio and video playback from third-party applications; multimodal interaction involves gesture interaction and voice interaction; and driver assistance module involves functions such as navigation, electronic rearview mirror / blind spot detection, and driver fatigue detection.
[0034] Specifically, users in the vehicle send interaction intentions to the intelligent interaction system through voice, gestures, and button presses. The intelligent interaction system uses deep learning and neural networks to perceive user behavior and interpret interaction intentions, generating multimodal interaction commands corresponding to the user behavior. For any generated multimodal interaction command, its interaction type is determined based on the controlled object of the command. The vehicle's response priority to interaction commands differs in different operating modes. For example, in driving mode, to ensure vehicle safety, driver assistance interaction commands should be responded to first; in parking mode, to ensure the user's multimedia experience, multimedia interaction commands should be responded to first. Therefore, this application sets corresponding interaction type response priorities for both operating modes. After determining the interaction type of the multimodal interaction command to be responded to and the response priority of each interaction type in the current operating mode, high-priority multimodal interaction commands are responded to first. After the high-priority interaction commands are responded to, low-priority interaction commands are responded to, until all interaction commands are responded to.
[0035] This invention sets different command response priorities for different vehicle operating modes, enabling intelligent responses to interactive commands in various scenarios. Through effective response strategies and logic, it improves the intelligence of interactive command responses and enhances the user experience.
[0036] Optionally, after obtaining at least two multimodal interaction commands from the user inside the vehicle, the method further includes:
[0037] Determine the multimodal interaction feedback associated with each multimodal interaction command; display the multimodal interaction feedback to the in-vehicle user through an intelligent interactive virtual avatar.
[0038] Specifically, after determining the multimodal interaction command, the associated multimodal interaction feedback is determined based on the correlation between locally pre-stored multimodal commands and multimodal interaction feedback. An intelligent virtual interactive avatar is displayed on the vehicle's central control screen, showcasing multimodal interaction feedback through sound, movement, and facial expressions.
[0039] Optionally, responding to each multimodal interaction command according to its interaction type and response priority includes:
[0040] If there are multimodal interaction commands with the same interaction type, they will be responded to sequentially according to the timing of each multimodal interaction command.
[0041] Specifically, interactive commands of different interaction types are responded to according to the priority of the interaction type, while different interactive commands of the same type are responded to according to the sequence of events.
[0042] Figure 2A This is a flowchart illustrating an in-vehicle intelligent interaction method according to another embodiment of the present invention. This embodiment is an optimization and improvement based on the above embodiments. Figure 2A As shown, the method includes:
[0043] S210: Obtain at least two multimodal interaction commands from the user inside the vehicle, and determine the interaction type of each multimodal interaction command.
[0044] S220. Obtain the current status information of the vehicle, and determine the current operating mode of the vehicle based on the status information, wherein the operating mode includes driving mode and;
[0045] S230. If the operating mode is driving mode, the response priority of each interaction type is determined from high to low as driving assistance interaction, vehicle control and vehicle equipment interaction, and multimedia interaction; if the operating mode is parking mode, the response priority of each interaction type is determined from high to low as vehicle control and vehicle equipment interaction, multimedia interaction, and driving assistance interaction.
[0046] The status information includes vehicle speed status, vehicle power status, and external environment information.
[0047] Specifically, the current operating mode of the vehicle is determined by the status information collected by the controller. When the vehicle is in driving mode, the response priority of interactive commands is, in order: driver assistance interaction, vehicle control and device interaction, and multimedia interaction. When the vehicle is in parking mode, the priority of interactive information is, in order: vehicle control and device interaction, multimedia interaction, and driver assistance interaction.
[0048] Optionally, the driving assistance interaction includes driving assistance interaction for driving and parking; when the vehicle's current operating mode is parking mode, no multimodal interaction commands related to driving assistance interaction are generated.
[0049] Among them, driving assistance interaction is the auxiliary driving interaction that ensures vehicle safety while driving; parking assistance interaction is the auxiliary driving interaction that ensures vehicle safety while parking.
[0050] Specifically, in driving mode, the intelligent interaction system can identify obstacles, pedestrians, and other vehicles near the vehicle through collected images, and provide warnings to occupants within a safe distance. In parking mode, the intelligent interaction system prioritizes processing vehicle control and device-related interaction intentions from occupants, and the driving assistance module only provides warnings for parking driving assistance in special circumstances (such as abnormal vehicle power supply voltage, fault alarms, etc.).
[0051] Optionally, the multimodal interaction command is generated by multimodal behavior perception and interaction intent parsing of the user's behavior in the vehicle, or by driving assistance functions, or by being triggered when environmental parameters meet set trigger conditions; the user behavior includes at least one of the following: voice commands, action commands, or hardware operation commands.
[0052] Optionally, the sources of multimodal interaction commands can be divided into three types: First, user-initiated actions within the vehicle, such as issuing a voice control command "Turn the air conditioning fan speed to maximum," which serves as a multimodal interaction command. Second, adjustments or reminders issued by driver assistance functions to ensure vehicle safety, such as when the vehicle is traveling too fast and there is an obstacle ahead, the driver assistance function generates a voice reminder "There is an obstacle ahead, please slow down and avoid it," which serves as a multimodal interaction command. Third, interaction commands with pre-set trigger conditions, such as pre-setting to automatically turn the air conditioning fan speed to maximum when the outside temperature is greater than 35°C, and subsequently automatically generating the interaction command "Turn the air conditioning fan speed to maximum" when the outside temperature is greater than 35°C.
[0053] Optionally, the multimodal interaction instructions include custom multimodal interaction instructions and non-custom multimodal interaction instructions; the custom multimodal interaction instructions correspond to the in-vehicle user-defined scenario modes and are decomposed into at least two associated non-custom multimodal interaction instructions based on preset association relationships.
[0054] Optionally, the intelligent interaction system of this application provides users with customizable scenario modes, allowing in-vehicle occupants to use simple gestures and voice commands instead of a set of complex voice commands, gesture operations, and UI operations. For the implementation of customizable scenario modes, pre-recording is required. The intelligent interaction system records multiple non-customizable multimodal commands given by the user during recording and binds them to the user-defined activation action of the customizable scenario mode; this activation action is the non-customizable multimodal interaction command. After recording is complete, if a non-customizable multimodal interaction command is detected from the user, it is decomposed into multiple non-customizable multimodal actions recorded during the recording period, enabling the linkage of multiple controllers.
[0055] For example, Figure 2B This is a schematic diagram of a human-computer interaction process according to another embodiment of the present invention. The in-vehicle intelligent interaction device consists of three hardware components: a detection and recognition module, a logic operation module, and an information display module.
[0056] The detection and recognition module includes at least two microphones, a driver's cab camera, a group of surround view cameras, and a bus signal transceiver. The microphones are used for sound field localization and voice interaction, the driver's cab camera is used for gesture recognition and driver fatigue monitoring, the surround view camera is used to monitor external environmental information, and the bus signal transceiver is used to receive status information from other controllers and send control commands to them to realize the interactive instructions of the people in the vehicle.
[0057] The logic processing module is primarily responsible for processing video, audio, and hardware operation information collected by the detection and recognition module. It uses deep learning and neural networks to perceive the behavior and interpret the interaction intentions of occupants inside the vehicle. The logic processing module is also responsible for prioritizing different interaction commands.
[0058] First, the logic processing module determines the current vehicle operating mode by receiving information from other controllers via the detection and recognition module. When the vehicle is in driving mode, the priority of interactive information is, in order: driver assistance interaction, vehicle control and device interaction, and multimedia interaction. In this state, the intelligent interaction system can identify obstacles, pedestrians, and other vehicles near the vehicle through images collected by the detection and recognition module, and issue warnings to occupants within a certain safe distance. When the vehicle is in parking mode, the priority of interactive information is, in order: vehicle control and device interaction, multimedia interaction, and driver assistance interaction. In this state, the intelligent interaction system prioritizes processing vehicle control and device-related interactive intentions from occupants, and the driver assistance interaction module only issues warnings in special circumstances.
[0059] The information display module is primarily responsible for displaying multimedia information and showcasing the virtual avatar of the intelligent interactive system. Once the logic processing module determines the interaction command, the information display module controls the virtual avatar to provide interactive feedback through sound, actions, and facial expressions, making the interaction more human-like.
[0060] Users can trigger custom scene modes of the intelligent interaction system via voice, gestures, etc. The logic processing module recognizes and stores the activation actions of occupants. When the vehicle status meets the trigger conditions, the logic processing module sends commands to the corresponding controllers via the bus, enabling the linkage of multiple controllers. On the other hand, the information display module also uses virtual avatar expressions, actions, and voice to provide feedback to occupants on the current scene mode of the vehicle.
[0061] S240. Respond to each multimodal interaction instruction according to its interaction type and response priority.
[0062] The embodiments of the present invention enable multi-controller linkage by storing user-defined scenario modes, making human-computer interaction more customized and intelligent.
[0063] Figure 3 This is a structural schematic diagram of an in-vehicle intelligent interactive device provided in another embodiment of the present invention. (See diagram below.) Figure 3 As shown, the device includes:
[0064] The interaction type determination module 310 is used to acquire at least two multimodal interaction commands from the user in the vehicle and determine the interaction type of each multimodal interaction command.
[0065] The response priority determination module 320 is used to determine the response priority of each interaction type based on the current operating mode of the vehicle.
[0066] The interactive instruction response module 330 is used to respond to each multimodal interactive instruction according to the interaction type and response priority of each multimodal interactive instruction.
[0067] The in-vehicle intelligent interaction device provided in the embodiments of the present invention can execute the in-vehicle intelligent interaction method provided in any embodiment of the present invention, and has the corresponding functional modules and beneficial effects of executing the method.
[0068] Optionally, the interaction types include vehicle control and device interaction, multimedia interaction, and driver assistance interaction, and the response priority determination module includes:
[0069] The operating mode determination unit acquires the current status information of the vehicle and determines the current operating mode of the vehicle based on the status information. The operating mode includes driving mode and driving mode.
[0070] The first response priority determination unit is used to determine the response priority of each interaction type from high to low as follows if the operating mode is driving mode: driving assistance interaction, vehicle control and vehicle device interaction, and multimedia interaction.
[0071] The second response priority determination unit is used to determine the response priority of each interaction type from high to low as follows if the operating mode is parking mode: vehicle control and vehicle device interaction, multimedia interaction, and driving assistance interaction.
[0072] Optionally, the driving assistance interaction includes driving assistance interaction for driving and parking; when the vehicle's current operating mode is parking mode, no multimodal interaction commands related to driving assistance interaction are generated.
[0073] Optionally, the multimodal interaction command is generated by multimodal behavior perception and interaction intent parsing of the user's behavior in the vehicle, or by driving assistance functions, or by being triggered when environmental parameters meet set trigger conditions; the user behavior includes at least one of the following: voice commands, action commands, or hardware operation commands.
[0074] Optionally, the multimodal interaction instructions include custom multimodal interaction instructions and non-custom multimodal interaction instructions; the custom multimodal interaction instructions correspond to the in-vehicle user-defined scenario modes and are decomposed into at least two associated non-custom multimodal interaction instructions based on preset association relationships.
[0075] Optionally, the device further includes:
[0076] The multimodal interaction feedback determination module is used to determine the multimodal interaction feedback associated with each multimodal interaction command;
[0077] The multimodal interactive feedback display module is used to display the multimodal interactive feedback to the in-vehicle user through intelligent interactive virtual avatars and / or multimedia information.
[0078] Optionally, the interactive command response module 330 includes:
[0079] The timing response unit is used to respond sequentially according to the timing of each multimodal interaction instruction if there are multimodal interaction instructions of the same interaction type.
[0080] The in-vehicle intelligent interaction device described in further detail can also execute the in-vehicle intelligent interaction method provided in any embodiment of the present invention, and has the corresponding functional modules and beneficial effects of executing the method.
[0081] Figure 4A schematic diagram of an electronic device 40 that can be used to implement embodiments of the present invention is shown. The electronic device is intended to represent various forms of digital computers, such as laptop computers, desktop computers, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. The electronic device can also represent various forms of mobile devices, such as personal digital processors, cellular phones, smartphones, wearable devices (e.g., helmets, glasses, watches, etc.), and other similar computing devices. The components shown herein, their connections and relationships, and their functions are merely illustrative and are not intended to limit the implementation of the invention described and / or claimed herein.
[0082] like Figure 4 As shown, the electronic device 40 includes at least one processor 41 and a memory, such as a read-only memory (ROM) 42 or a random access memory (RAM) 43, communicatively connected to the at least one processor 41. The memory stores computer programs executable by the at least one processor. The processor 41 can perform various appropriate actions and processes based on the computer program stored in the ROM 42 or loaded into the RAM 43 from storage unit 48. The RAM 43 may also store various programs and data required for the operation of the electronic device 40. The processor 41, ROM 42, and RAM 43 are interconnected via a bus 44. An input / output (I / O) interface 45 is also connected to the bus 44.
[0083] Multiple components in electronic device 40 are connected to I / O interface 45, including: input unit 46, such as keyboard, mouse, etc.; output unit 47, such as various types of monitors, speakers, etc.; storage unit 48, such as disk, optical disk, etc.; and communication unit 49, such as network card, modem, wireless transceiver, etc. Communication unit 49 allows electronic device 40 to exchange information / data with other devices through computer networks such as the Internet and / or various telecommunications networks.
[0084] Processor 41 can be a variety of general-purpose and / or special-purpose processing components with processing and computing capabilities. Some examples of processor 41 include, but are not limited to, a central processing unit (CPU), a graphics processing unit (GPU), various special-purpose artificial intelligence (AI) computing chips, various processors running machine learning model algorithms, a digital signal processor (DSP), and any suitable processor, controller, microcontroller, etc. Processor 41 performs the various methods and processes described above, such as in-vehicle intelligent interaction methods.
[0085] In some embodiments, the in-vehicle intelligent interaction method may be implemented as a computer program tangibly contained in a computer-readable storage medium, such as storage unit 48. In some embodiments, part or all of the computer program may be loaded into and / or installed on electronic device 40 via ROM 42 and / or communication unit 49. When the computer program is loaded into RAM 43 and executed by processor 41, one or more steps of the in-vehicle intelligent interaction method described above may be performed. Alternatively, in other embodiments, processor 41 may be configured to perform the in-vehicle intelligent interaction method by any other suitable means (e.g., by means of firmware).
[0086] Various embodiments of the systems and techniques described above herein can be implemented in digital electronic circuit systems, integrated circuit systems, field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), application-specific standard products (ASSPs), systems-on-a-chip (SoCs), payload-programmable logic devices (CPLDs), computer hardware, firmware, software, and / or combinations thereof. These various embodiments may include implementations in one or more computer programs that can be executed and / or interpreted on a programmable system including at least one programmable processor, which may be a dedicated or general-purpose programmable processor, capable of receiving data and instructions from a storage system, at least one input device, and at least one output device, and transmitting data and instructions to the storage system, the at least one input device, and the at least one output device.
[0087] Computer programs used to implement the methods of the present invention may be written in any combination of one or more programming languages. These computer programs may be provided to a processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device, such that when executed by the processor, the computer programs cause the functions / operations specified in the flowcharts and / or block diagrams to be performed. The computer programs may be executed entirely on a machine, partially on a machine, or as a standalone software package, partially on a machine and partially on a remote machine, or entirely on a remote machine or server.
[0088] In the context of this invention, a computer-readable storage medium can be a tangible medium that may contain or store a computer program for use by or in conjunction with an instruction execution system, apparatus, or device. A computer-readable storage medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination thereof. Alternatively, a computer-readable storage medium may be a machine-readable signal medium. More specific examples of machine-readable storage media include electrical connections based on one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fibers, portable compact disk read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination thereof.
[0089] To provide interaction with a user, the systems and techniques described herein can be implemented on an electronic device having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user; and a keyboard and pointing device (e.g., a mouse or trackball) through which the user provides input to the electronic device. Other types of devices can also be used to provide interaction with the user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user can be received in any form (including sound input, voice input, or tactile input).
[0090] The systems and technologies described herein can be implemented in computing systems that include backend components (e.g., as data servers), or computing systems that include middleware components (e.g., application servers), or computing systems that include frontend components (e.g., user computers with graphical user interfaces or web browsers through which users can interact with implementations of the systems and technologies described herein), or any combination of such backend, middleware, or frontend components. The components of the system can be interconnected via digital data communication of any form or medium (e.g., communication networks). Examples of communication networks include local area networks (LANs), wide area networks (WANs), blockchain networks, and the Internet.
[0091] A computing system can include clients and servers. Clients and servers are generally located far apart and typically interact through communication networks. The client-server relationship is created by computer programs running on the respective computers and having a client-server relationship with each other. The server can be a cloud server, also known as a cloud computing server or cloud host, which is a hosting product within the cloud computing service system to address the shortcomings of traditional physical hosts and VPS services, such as high management difficulty and weak business scalability.
[0092] It should be understood that the various forms of processes shown above can be used, with steps reordered, added, or deleted. For example, the steps described in this invention can be executed in parallel, sequentially, or in different orders, as long as the desired result of the technical solution of this invention can be achieved, and this is not limited herein.
[0093] The specific embodiments described above do not constitute a limitation on the scope of protection of this invention. Those skilled in the art should understand that various modifications, combinations, sub-combinations, and substitutions can be made according to design requirements and other factors. Any modifications, equivalent substitutions, and improvements made within the spirit and principles of this invention should be included within the scope of protection of this invention.
Claims
1. A vehicle-mounted intelligent interaction method, characterized in that, The method includes: Acquire at least two multimodal interaction commands from the user inside the vehicle, and determine the interaction type of each multimodal interaction command; Determine the response priority for each interaction type based on the vehicle's current operating mode; The multimodal interaction commands are responded to according to their interaction type and response priority. The interaction types include vehicle control and device interaction, multimedia interaction, and driver assistance interaction; determining the response priority of each interaction type based on the vehicle's current operating mode includes: Obtain the current status information of the vehicle and determine the current operating mode of the vehicle based on the status information. The operating mode includes driving mode and parking mode. If the operating mode is driving mode, then the response priority of each interaction type is determined from high to low as follows: driving assistance interaction, vehicle control and vehicle design interaction, and multimedia interaction. If the operating mode is parking mode, then the response priority of each interaction type is determined from high to low as follows: vehicle control and vehicle equipment interaction, multimedia interaction, and driving assistance interaction.
2. The method according to claim 1, wherein the driving assistance interaction includes driving assistance interaction for driving and parking assistance interaction; when the current operating mode of the vehicle is parking mode, no multimodal interaction commands related to driving assistance interaction are generated.
3. The method according to claim 1, characterized in that, The multimodal interaction commands are generated by multimodal behavior perception and interaction intent parsing of the user's behavior in the vehicle, or by driving assistance functions, or by being triggered when environmental parameters meet set trigger conditions; the user behavior includes at least one of the following: voice commands, action commands, or hardware operation commands.
4. The method according to claim 1, characterized in that, The multimodal interaction commands include custom multimodal interaction commands and non-custom multimodal interaction commands; the custom multimodal interaction commands correspond to the in-vehicle user-defined scenario modes and are decomposed into at least two associated non-custom multimodal interaction commands based on preset association relationships.
5. The method according to claim 1, characterized in that, After obtaining at least two multimodal interaction commands from the user inside the vehicle, the method further includes: Determine the multimodal interaction feedback associated with each multimodal interaction command; The multimodal interactive feedback is displayed to the in-vehicle user through intelligent interactive virtual avatars and / or multimedia information.
6. The method according to claim 1, wherein responding to each multimodal interaction instruction according to the interaction type and response priority of each multimodal interaction instruction comprises: If there are multimodal interaction commands with the same interaction type, they will be responded to sequentially according to the timing of each multimodal interaction command.
7. A vehicle-mounted intelligent interactive device, characterized in that, The device includes: The interaction type determination module is used to acquire at least two multimodal interaction commands from the user in the vehicle and determine the interaction type of each multimodal interaction command. The response priority determination module is used to determine the response priority of each interaction type based on the current operating mode of the vehicle. An interactive instruction response module is used to respond to each multimodal interactive instruction according to the interaction type and response priority of each multimodal interactive instruction; The interaction types include vehicle control and device interaction, multimedia interaction, and driver assistance interaction. The response priority determination module includes: The operation mode determination unit acquires the current status information of the vehicle and determines the current operation mode of the vehicle based on the status information. The operation mode includes driving mode and parking mode. The first response priority determination unit is used to determine the response priority of each interaction type from high to low as follows if the operating mode is driving mode: driving assistance interaction, vehicle control and vehicle device interaction, and multimedia interaction. The second response priority determination unit is used to determine the response priority of each interaction type from high to low as follows if the operating mode is parking mode: vehicle control and vehicle device interaction, multimedia interaction, and driving assistance interaction.
8. An electronic device, characterized in that, The electronic device includes: At least one processor; and A memory communicatively connected to the at least one processor; wherein, The memory stores a computer program that can be executed by the at least one processor, the computer program being executed by the at least one processor to enable the at least one processor to perform the in-vehicle intelligent interaction method according to any one of claims 1-6.
9. A computer-readable storage medium, characterized in that, The computer-readable storage medium stores computer instructions that cause a processor to execute the in-vehicle intelligent interaction method according to any one of claims 1-6.