Voice control method, voice control system, voice control device, and electronic device
By recognizing the expectation and feeling information in the user's voice data, comprehensive control commands are generated, which solves the problem of inaccurate intent recognition in the existing technology, realizes multi-device linkage control, and improves the user experience and the accuracy of device control.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- DAIKIN INDUSTRIES LTD
- Filing Date
- 2024-12-18
- Publication Date
- 2026-06-19
AI Technical Summary
In existing voice control technologies, it is difficult to accurately reflect the user's control intentions, leading to miscontrols and a reduced user experience. In particular, when the user is unfamiliar with the device or cannot express clear control commands, the recognition and control effects are poor.
By recognizing first-category information related to expectations and second-category information related to feelings in user voice data, control commands are generated. The interval between the two is judged based on preset conditions, thereby improving the accuracy of intent recognition and generating comprehensive control commands.
It enables users to control multiple devices in a coordinated manner through brief descriptions, improving control accuracy and user experience, reducing operational difficulty and usage threshold, and enhancing the intelligence and technological feel of voice control.
Smart Images

Figure CN122245301A_ABST
Abstract
Description
Technical Field
[0001] This application relates to the field of voice interaction, and particularly to a voice control method, voice control system, voice control device, and electronic device. Background Technology
[0002] With the development of science and technology and urban construction, voice recognition technology is gradually being applied to all aspects of people's lives. For example, in places such as office buildings, apartments, schools, and shopping malls, devices that can be controlled based on voice, such as voice-controlled home appliances (including air handling equipment), are becoming increasingly popular.
[0003] Currently, most existing voice control technologies control the corresponding devices by recognizing the control commands in the user's voice. For example, the air conditioner is controlled based on the user's voice commands such as "set the air conditioner to 16 degrees" or "turn the air conditioner fan down."
[0004] In recent years, voice control technology has also emerged that controls devices based on the user's description of their current physical state. For example, air conditioners can be controlled based on information such as "too hot" or "a little cold" in the user's voice indicating their current physical state. Therefore, the control is based on inferring the user's control intention from their voice and then controlling the device based on the inferred result.
[0005] It should be noted that the above introduction to the technical background is only for the purpose of providing a clear and complete explanation of the technical solutions of this application and facilitating understanding by those skilled in the art. It should not be assumed that these technical solutions are known to those skilled in the art simply because they have been described in the background section of this application. Summary of the Invention
[0006] The inventors discovered that in the aforementioned prior art of controlling devices by inferring the user's control intention based on the user's voice, the control intention inferred from the user's description may not accurately reflect the user's actual control intention. For example, although the user may feel hot, they may not actually want to turn on the air conditioner to lower the room temperature; or, for example, the phrase "it's too hot" uttered by the user may have other meanings, such as the user actually wanting to say that the tea they are drinking or the food they are eating is too hot, but uttering "it's too hot" will trigger mis-control of the air conditioner; thus, it can easily lead to a degraded user experience or even user complaints.
[0007] In the existing technologies that control devices based on voice commands, users need to speak explicit control commands, which places high demands on users. However, users sometimes do not know what devices are set up in the room and all the functions of these devices, or they cannot determine which devices to use, or they cannot determine the appropriate device operating mode or operating parameters. Therefore, the control commands spoken by the user may not reflect the user's actual control intention, or the user does not know how to express their actual control intention, resulting in poor effect and reduced user experience.
[0008] In addition, during voice control, users may use long and complex voice commands. In such cases, the system may not be able to accurately identify the user's actual control intentions, leading to misidentification and miscontrol, which can easily reduce the user experience.
[0009] To address one or more of the aforementioned problems, embodiments of this application provide a voice control method, a voice control system, a voice control device, and an electronic device. Controlling home appliances based on the user's abstract description of their "expected feelings" eliminates the need for the user to describe content directly related to device control, reducing operational difficulty and the user's learning curve. Furthermore, by judging whether the interval between the first category of information and the second category of information meets preset conditions, the accuracy of recognizing the user's actual control intention is improved, thereby enhancing the user experience.
[0010] According to a first aspect of the embodiments of this application, a voice control method is provided, the method comprising: acquiring voice data; identifying in the voice data a first category of information related to the user's expectations and a second category of information related to the user's feelings; generating a control command based on the first category of information and the second category of information when the interval between the first category of information and the second category of information meets a preset condition; and controlling home appliances in a target space according to the control command.
[0011] According to a second aspect of the embodiments of this application, a voice control system is provided, which includes a voice acquisition device, a server, a control device, and at least one home appliance. The voice acquisition device acquires the user's voice data; the server acquires the voice data, identifies a first category of information related to the user's expectations and a second category of information related to the user's feelings in the voice data, and generates a control command based on the first category of information and the second category of information when the interval between the first category of information and the second category of information meets a preset condition, and sends the control command to the control device; the control device controls the home appliance in the target space according to the control command.
[0012] According to a third aspect of the embodiments of this application, a voice control device is provided, the device comprising: an acquisition module for acquiring voice data; a recognition module for recognizing a first category of information related to the user's expectations and a second category of information related to the user's feelings in the voice data; a generation module for generating control instructions based on the first category of information and the second category of information when the interval between the first category of information and the second category of information meets a preset condition; and a control module for controlling home appliances in a target space according to the control instructions.
[0013] According to a fourth aspect of the present application, an electronic device is provided, comprising: a memory storing a computer program; and a processor that executes the computer program to implement the voice control method described in the first aspect of the present application.
[0014] According to a fifth aspect of the present application, a computer-readable storage medium is provided having a computer program stored thereon, which, when executed by a processor, implements the voice control method described in the first aspect of the present application.
[0015] According to a sixth aspect of the present application, a computer program product is provided, on which a computer program is stored, which, when executed by a processor, implements the voice control method described in the first aspect of the present application.
[0016] One of the beneficial effects of the embodiments of this application is that:
[0017] Simultaneously, based on the first category of information related to the user's expectations and the second category of information related to the user's feelings in the user's voice data, control commands for home appliances are generated. Thus, based on the user's abstract description of "expectations and feelings," the user's actual intention can be identified and automatically converted into control commands that need to be executed to achieve the actual intention of "expectations and feelings." These control commands may be comprehensive control commands for multiple devices. Therefore, users can achieve advanced control of multiple devices working together to achieve the actual intention of "expectations and feelings" through a brief description (such as a sentence), thus improving the user experience.
[0018] Compared to the traditional methods in which users describe control commands and their current physical state, the abstract description of "expected feelings" in this embodiment of the invention can express the user's comprehensive feelings and needs. Furthermore, this description contains more information, thus more fully reflecting the user's actual control intentions.
[0019] In addition, in the voice control mode based on user expectations, by analyzing and understanding the natural language expressed by the user, it is possible to identify the user's comprehensive and complex actual intentions. This solves the problem that users cannot fully describe their actual control intentions in the traditional voice control mode, making the control more precise and improving the user experience.
[0020] In addition, since the "expected feeling" represents the user's actual intention, the user's actual intention can be directly identified or detected from the voice data, rather than "inferring" the user's actual intention from the content of the voice data as in traditional methods, thus significantly improving the accuracy of control.
[0021] Furthermore, users only need to describe their "desired feeling" to control home appliances without needing to understand or master control commands or related information. Moreover, even if users do not know how to control the devices to meet their needs, they can still achieve their control objectives, thereby reducing the operational difficulty and usage threshold of home appliance control and improving the user experience.
[0022] Furthermore, by limiting the interval between the first category of information related to user expectations and the second category of information related to user feelings through preset conditions, the accuracy of identifying the user's actual control intention can be improved, thereby improving the accuracy of the generated control commands, reducing misoperations, and thus better meeting user needs and further improving the user experience.
[0023] Furthermore, the preset conditions include at least one of the following: the character spacing between the first category information and the second category information is less than or equal to a first threshold; the blank time interval between the first category information and the second category information is less than or equal to a second threshold; and the meaning understood in conjunction with the context of the first category information and the second category information matches the meaning understood solely based on the first category information and the second category information. This further improves the accuracy of recognizing the user's actual control intention, avoids misidentification and miscontrol of home appliances, and thus further enhances the user experience.
[0024] Furthermore, by using sound wave matching to identify first-category and second-category information in voice data, it is possible to identify information related to "expected feelings" without recognizing the content of the voice data, thereby reducing the risk of user privacy leakage.
[0025] Furthermore, the voice data is converted into text data, and first-category and second-category information is identified within the text data. This allows for the simple and rapid identification of information related to "expected feelings."
[0026] Furthermore, identifying first-category and second-category information in text data includes: matching text data with a preset data table; and, if a match is successful, determining the first-category and second-category information in the text data. This allows for the simple and quick identification of information related to "expected feelings".
[0027] Furthermore, identifying first-category and second-category information in the text data includes inputting the text data into a natural language processing-based model and outputting first-category and second-category information. This enables accurate identification of information related to "expected feelings" and prevents omissions, thereby further improving the precision of device control.
[0028] Furthermore, the text data is matched with a preset data table, including: extracting word information from the text data; matching the word information with a first category keyword set and a second category keyword set in a preset keyword table respectively; determining the word information matching the first category keyword set as the first descriptive information, and determining the word information matching the second category keyword set as the second descriptive information. Thus, by determining the first and second category information based on keyword matching, the relevant information of "expected feelings" can be easily and quickly identified, which helps reduce the cost of speech recognition, improves the accuracy of speech recognition, and further enhances the user experience.
[0029] Furthermore, the first category of information includes first descriptive information expressing the expected intention; the second category of information includes second descriptive information expressing the feeling, and does not include descriptive information related to home appliances or control commands. Thus, by using an abstract description of the "expected feeling," automatic control reflecting the user's actual control intention can be achieved, further enhancing the intelligence and technological feel of voice control and improving the user experience.
[0030] Furthermore, the second descriptive information representing feelings includes at least one of a third descriptive information representing bodily sensations and a fourth descriptive information representing emotional states. The third descriptive information representing bodily sensations includes at least one of scene, location, weather, season, and time, while the fourth descriptive information representing emotional states includes at least one of subjective experience and emotion. Thus, users can express their personalized needs from multiple dimensions, including bodily sensations and emotional states, thereby satisfying their comprehensive and flexible control requirements and further improving the user experience.
[0031] Furthermore, speech data can be recognized locally; or on the server; or the speech data can be converted into text locally and then recognized on the server; or the speech data can be matched for sound waves locally, and if a match is successful, the speech data can be recognized on the server. This allows for flexible speech recognition based on multiple methods, meeting the speech recognition needs of different scenarios and further improving the user experience.
[0032] Furthermore, acquiring voice data includes: continuously acquiring voice data for a first preset duration, and ceasing to acquire voice data when the first preset duration is reached; or, acquiring voice data when user voice data is detected, and ceasing to acquire voice data after a second preset duration has elapsed since user voice data has not been detected. This allows for stopping the acquisition of user voice data at appropriate times, reducing the risk of user privacy leaks.
[0033] Furthermore, generating control instructions based on the first category of information and the second category of information includes: determining the environmental parameters that need to be adjusted and the target values of the environmental parameters based on the first category of information and the second category of information; determining whether control based on the home appliances in the target space can adjust the environmental parameters in the target environment to the corresponding target values based on the capabilities of the home appliances; and, if it is determined that the environmental parameters in the target space can be adjusted to the target values, generating control instructions for the home appliances based on the environmental parameters, the target values of the environmental parameters, and the home appliances used to adjust the environmental parameters. Thus, it is possible to execute control that conforms to the actual situation, taking into account the capabilities of the equipment actually deployed on-site.
[0034] Furthermore, when it is determined that the environmental parameters within the target space cannot be adjusted to the target value, and there are multiple environmental parameters, it is determined whether the control based on the home device can adjust the necessary environmental parameters among the multiple environmental parameters to the corresponding target value. If the necessary environmental parameters can be adjusted to the target value, the corresponding control is executed. Thus, at least the necessary environmental parameters that achieve the user's expected experience can be adjusted to the target value, thereby maximizing the satisfaction of the user's main control intention or close to the user's control intention, and further improving the user experience.
[0035] Furthermore, when the necessary environmental parameter among multiple environmental parameters can be adjusted to the corresponding target value, a control command for the home appliance is generated based on the necessary environmental parameter, the target value of the necessary environmental parameter, and the home appliance used to adjust the necessary environmental parameter.
[0036] When the necessary environmental parameter among multiple environmental parameters cannot be adjusted to the corresponding target value, the generation of control commands is paused and the user is notified.
[0037] Therefore, it is possible to use the equipment actually deployed on-site to realize the user's main control intentions or close to the user's control intentions, thereby maximizing the satisfaction of the user's control needs and further improving the user experience.
[0038] Furthermore, the necessary environmental parameters include preset environmental parameters among a variety of environmental parameters that need to be adjusted, or environmental parameters with preset priorities among a variety of environmental parameters that need to be adjusted. Therefore, based on the type or priority of the various environmental parameters that need to be adjusted, corresponding controls can be executed, thereby enabling the use of equipment actually deployed on-site to achieve the user's main control intent or close to the user's control intent, further improving the user experience.
[0039] Furthermore, at least one of the preset environmental parameters or the priority of environmental parameters used to make the above determination is preset by the user, or is set through interaction with the user; or is automatically updated by learning based on at least one of the user's usage habits, environment, current season, user status and user feedback.
[0040] Therefore, the above ratios or priorities can be flexibly set in different scenarios, or determined through self-learning, which improves the accuracy and rationality of the above ratios or priorities, thereby better meeting the user's control needs and further improving the user experience.
[0041] Furthermore, the method also includes: when it is determined that the environmental parameters in the target space cannot be adjusted to the target value, adjusting at least one of the environmental parameters and the target value according to the capabilities of the home appliances in the target space; and generating control instructions for the home appliances based on the adjusted environmental parameters, the target values of the environmental parameters, and the home appliances used to adjust the environmental parameters.
[0042] Therefore, even when the desired experience cannot be fully achieved, the actual equipment deployed on-site can be used to optimize the parameters that need adjustment (e.g., changing them to similar adjustable environmental parameters) and the target values (e.g., changing them to achievable target values), so that the adjustment effect is closer to the desired experience. In other words, existing equipment is used to meet the user's personalized needs as much as possible, further enhancing the user experience.
[0043] Furthermore, based on control commands, multiple home appliances within the target space can be controlled, enabling users to control multiple devices or multiple devices in a coordinated manner with just a "sentence," achieving a control effect that better matches the user's actual intentions.
[0044] Specific embodiments of this application are disclosed in detail with reference to the following description and accompanying drawings, indicating how the principles of this application can be adopted. It should be understood that the embodiments of this application are not limited in scope. Within the spirit and scope of the appended claims, embodiments of this application include many changes, modifications, and equivalents.
[0045] The feature information described and illustrated for one embodiment may be used in the same or similar manner in one or more other embodiments, combined with feature information in other embodiments, or substituted for feature information in other embodiments.
[0046] It should be emphasized that the term "including / comprises" as used herein refers to the presence of a feature, whole, step, or component, but does not exclude the presence or addition of one or more other features, wholes, steps, or components. Attached Figure Description
[0047] Many aspects of this application can be better understood by referring to the following accompanying drawings. The components in the drawings are not drawn to scale, but are only intended to illustrate the principles of this application. Corresponding portions in the drawings may be enlarged or reduced for ease of illustration and description of certain parts of this application. Elements and features described in one drawing or embodiment of this application may be combined with elements and features shown in one or more other drawings or embodiments. Furthermore, similar reference numerals in the drawings denote corresponding components in several drawings and can be used to indicate corresponding components used in more than one embodiment.
[0048] In the attached diagram:
[0049] Figure 1 This is a schematic diagram of a voice control method according to an embodiment of this application;
[0050] Figure 2 This is a schematic diagram of the method for implementing step 102 of Embodiment 1 of this application;
[0051] Figure 3 This is a schematic diagram of the method for implementing step 103 of Embodiment 1 of this application;
[0052] Figure 4 This is a flowchart of one implementation of the voice control method according to an embodiment of this application;
[0053] Figure 5 This is a schematic block diagram of a voice control system according to an embodiment of this application;
[0054] Figure 6 This is a configuration diagram of the voice control system used in the voice control method of this application embodiment;
[0055] Figure 7This is a schematic block diagram of a voice control device according to an embodiment of this application;
[0056] Figure 8 This is a schematic block diagram of the system configuration of an electronic device according to an embodiment of this application. Detailed Implementation
[0057] The preferred embodiments of this application will now be described with reference to the accompanying drawings.
[0058] Example 1
[0059] Embodiment 1 of this application provides a voice control method.
[0060] Figure 1 This is a schematic diagram of a voice control method according to an embodiment of this application. Figure 1 As shown, the method includes:
[0061] Step 101: Acquire voice data;
[0062] Step 102: Identify first-category information related to the user's expectations and second-category information related to the user's feelings in the voice data;
[0063] Step 103: If the interval between the first category information and the second category information meets the preset conditions, generate control instructions based on the first category information and the second category information; and
[0064] Step 104: Control the home appliances in the target space according to the control instructions.
[0065] The voice control method of this application generates control commands for home appliances based on a first category of information related to the user's expectations and a second category of information related to the user's feelings in the user's voice data. Thus, based on the user's abstract description of "expectations and feelings", the method can identify the user's actual intention and automatically convert it into control commands that need to be executed to achieve the actual intention of "expectations and feelings". These control commands may be comprehensive control commands for multiple devices. Therefore, users can achieve advanced control of multiple devices working together to achieve the actual intention of "expectations and feelings" through a brief description (e.g., a sentence), which improves the user experience.
[0066] Compared to the traditional methods in which users describe control commands and their current physical state, the abstract description of "expected feelings" in this embodiment of the invention can express the user's comprehensive feelings and needs. Furthermore, this description contains more information, thus more fully reflecting the user's actual control intentions.
[0067] In addition, in the voice control mode based on user expectations, by analyzing and understanding the natural language expressed by the user, it is possible to identify the user's comprehensive and complex actual intentions. This solves the problem that users cannot fully describe their actual control intentions in the traditional voice control mode, making the control more precise and improving the user experience.
[0068] In addition, since the "expected feeling" represents the user's actual intention, the user's actual intention can be directly identified or detected from the voice data, rather than "inferring" the user's actual intention from the content of the voice data as in traditional methods, thus significantly improving the accuracy of control.
[0069] Furthermore, users only need to describe their "desired feeling" to control home appliances without needing to understand or master control commands or related information. Moreover, even if users do not know how to control the devices to meet their needs, they can still achieve their control objectives, thereby reducing the operational difficulty and usage threshold of home appliance control and improving the user experience.
[0070] Furthermore, by limiting the interval between the first category of information related to user expectations and the second category of information related to user feelings through preset conditions, the accuracy of identifying the user's actual control intention can be improved, thereby improving the accuracy of the generated control commands, reducing misoperations, and thus better meeting user needs and further improving the user experience.
[0071] The voice control method of this application is not only applicable to home scenarios, such as residences and apartments, but also to commercial or public scenarios such as office buildings, shopping malls, schools, and factories. This application does not limit the application scenarios of the voice control method.
[0072] The voice control method of this application embodiment is used to control home devices. The home devices in this application embodiment can be various devices deployed in various application scenarios such as home scenarios, commercial scenarios or public scenarios to adjust the indoor environment or meet user needs. This application embodiment does not limit the types of home devices.
[0073] The voice control method of this application embodiment can control home appliances in a target space, that is, the target space is an indoor space that serves as the target for environmental regulation.
[0074] The target space in this application embodiment may include one or more indoor spaces, such as a living room, bedroom, kitchen, bathroom, etc.
[0075] In some embodiments, home appliances may be smart home appliances, such as air handling equipment and smart devices.
[0076] For example, smart devices include at least one of smart speakers, smart display devices, smart switches, smart curtains, smart lighting devices, sensors, and smart pet devices;
[0077] For example, air handling equipment includes at least one of indoor units, humidifiers, dehumidifiers, fresh air systems, air purifiers, fragrance components, underfloor heating systems, and valve components.
[0078] In some embodiments, the air handling unit affects the indoor space within the target space.
[0079] For example, at least one air handling unit is installed in the target space. In some embodiments, indoor units are installed in each indoor space within the target space, or indoor units are installed in some indoor spaces within the target space.
[0080] For example, air handling equipment can be associated with indoor spaces in a target space. For instance, the correspondence between air handling equipment and indoor spaces can be determined through the binding information of the air handling equipment.
[0081] In some embodiments, the air handling unit may not be installed in the target space's interior space, but it may still affect the target space's interior space. For example, a fresh air unit with its main unit installed above the ceiling.
[0082] In some embodiments, the air handling equipment includes at least one of an indoor unit, a humidifier, a dehumidifier, a fresh air system, and a floor heating system.
[0083] In addition, air handling equipment can also be called air conditioning equipment.
[0084] In some embodiments, the indoor unit is an indoor unit in an air handling system that includes air handling equipment, and the air handling system may include a commercial air conditioning system or a residential air conditioning system.
[0085] In some embodiments, an air conditioning system may include at least one set of outdoor units and at least one indoor unit connected to each set of outdoor units. That is, an air conditioning system may include one or more sets of outdoor units, each set of outdoor units including at least one outdoor unit; for each set of outdoor units, the set of outdoor units is connected to at least one indoor unit.
[0086] For example, an air conditioning system includes an outdoor unit and an indoor unit connected to the outdoor unit.
[0087] For example, an air conditioning system includes an outdoor unit and at least two indoor units connected to the outdoor unit.
[0088] For example, an air conditioning system includes at least two outdoor units and at least two indoor units connected to the at least two outdoor units.
[0089] For example, a group of outdoor units and at least one indoor unit connected to the group of outdoor units constitute a refrigerant system, or multiple groups of outdoor units and at least one indoor unit connected to the multiple groups of outdoor units respectively constitute multiple refrigerant systems. Thus, an air conditioning system may include one refrigerant system or multiple refrigerant systems.
[0090] In some embodiments, the outdoor unit and the indoor unit can be air conditioning equipment of various models, types, forms, and capacities. For example, the indoor unit can be in the form of four-sided air outlet, two-sided air outlet, duct unit, floor air outlet, or skirting board air outlet, etc.; the outdoor unit can be in the form of single fan top air outlet, dual fan top air outlet, single fan front air outlet, or dual fan front air outlet, etc.
[0091] In some embodiments, the fresh air equipment includes at least one of a fresh air handling device, a total heat exchange device (e.g., with or without internal circulation function), and a ventilation device (e.g., with or without internal circulation function).
[0092] In some embodiments, at least one of the humidifying device and the dehumidifying device is disposed on the output side of the fresh air device to humidify or dehumidify the air output by the fresh air device, for example, to humidify or dehumidify the air output by the fresh air device, and to introduce the humidified or dehumidified air into the indoor space. For example, at least one of the humidifying device and the dehumidifying device has a heat exchanger that can operate as an evaporator.
[0093] In some embodiments, the underfloor heating system is connected to the outdoor unit via refrigerant piping, and to the indoor underfloor heating radiator piping via underfloor heating water piping. For example, the outdoor unit and the underfloor heating system form an air-source heat pump water heater, utilizing heat from the air and the operation of the compressor to heat water. The underfloor heating system may include a water-heat exchange unit, where refrigerant from the outdoor unit's compressor enters the unit via refrigerant piping to exchange heat with the water, thereby heating the water. The heated water is then introduced into the indoor underfloor heating radiator piping via the underfloor heating water piping to provide heating for the indoor space. In some embodiments, the outdoor unit and the underfloor heating system are separate units, or they are integrated.
[0094] In some embodiments, the valve assembly includes at least one of an air valve and a floor heating valve.
[0095] In some embodiments, the air valve for controlling the air volume of the indoor space is, for example, an air valve for controlling the amount of air supplied to the indoor space by the air supply device. For example, when the external circulation function of the total heat exchanger is turned on, the air valve for controlling the air volume of the indoor space can control the air volume of the outdoor air introduced into the indoor space; when the internal circulation function of the total heat exchanger is turned on, the air valve for controlling the air volume of the indoor space can control the air volume of the indoor air introduced into the indoor space.
[0096] In some embodiments, the floor heating valve is used to control the opening and closing of the inlet and outlet pipes in the floor heating system.
[0097] In step 101, the user's voice data is acquired. In this embodiment, the voice data can be obtained through a local device.
[0098] In some embodiments, the local device includes at least one of a controller, an indoor air conditioning unit, a smart device, and a user terminal.
[0099] In some embodiments, the controller includes at least one of a central controller, a wired controller, and a remote controller.
[0100] In some embodiments, the central controller and the wired controller are fixed controllers, for example, fixed to a wall in an indoor space.
[0101] In some embodiments, the central controller may be installed in multiple indoor spaces, that is, the central controller is configured to correspond to all or part of the indoor spaces.
[0102] This allows for the full utilization of various devices as local devices for voice data acquisition, improving the utilization rate of local devices, thereby reducing the cost of voice data acquisition and improving the efficiency and flexibility of voice data processing.
[0103] In some embodiments, smart devices include at least one of smart speakers, smart display devices, smart switches, smart curtains, smart lighting devices, sensors (e.g., air sensors, temperature sensors, humidity sensors, etc.), and smart pet devices.
[0104] In some embodiments, in step 101, voice data is acquired through a first local device, and the voice data may be an audio signal.
[0105] For example, the audio signal can be directly acquired by the first local device, or it can be acquired by the second local device and then sent to the first local device. For instance, the centralized controller can act as the first local device to directly acquire the audio signal, or an indoor air conditioning unit or smart device can act as the second local device to acquire the audio signal and then send the acquired audio data to the centralized controller, which is acting as the first local device. This application does not limit the method of acquiring voice data.
[0106] For example, after a user enables the voice control function of a first local device (e.g., a centralized controller), voice monitoring is activated to acquire the monitored voice data. Another example is that the first local device (e.g., the centralized controller) has a control interface that interacts with the user, allowing the user to perform related operations through the interface.
[0107] In some embodiments, the scope of the activated voice monitoring is a target space. Voice monitoring is activated after user confirmation before it can be performed on the target space. For example, the user can confirm or set the settings in the control interface of the central controller to activate the voice monitoring and voice control functions. Alternatively, the user can also set the scope of voice monitoring in the control interface.
[0108] In addition, the scope of the activated voice monitoring may not be the target space or may not be limited to the target space. For example, if the user is currently in the living room and wants to control the home appliances in the bedroom, then the target space is the bedroom, and the scope of the activated voice monitoring includes the living room, or includes both the living room and the bedroom.
[0109] In some embodiments, acquiring voice data includes:
[0110] Voice data is continuously acquired for a first preset duration, and acquisition of voice data stops when the first preset duration is reached. For example, voice data is continuously acquired for a first preset duration T1 after the start time t1, and acquisition of voice data stops when the time t1+T1 is reached.
[0111] Alternatively, voice data can be acquired when user voice data is detected, and acquisition can stop when a second preset duration has elapsed after no user voice data has been detected. For example, if user voice data is detected at time t2 and acquisition of voice data begins, but no user voice data is detected after time t3, acquisition of voice data can be stopped if no user voice data is detected within the second preset duration T2 after time t3, i.e., if no user voice data is detected between time t3 and time t3+T2.
[0112] This allows for the cessation of acquiring users' voice data at appropriate times, reducing the risk of user privacy leaks.
[0113] In some embodiments, the first preset duration T1 and the second preset duration T2 are preset or user-defined. The first preset duration T1 can be, for example, 30 seconds, 60 seconds, etc., and the second preset duration can be, for example, 10 seconds, 15 seconds, etc., and this application does not impose any restrictions on them.
[0114] In some embodiments, user voice data is acquired in a manner with a wake word, for example, when a specific wake word is detected, the acquisition of user voice data begins; or, user voice data is acquired in a manner without a wake word, for example, when user voice data is detected, the acquisition of user voice data begins.
[0115] In some embodiments, after acquiring voice data, speech recognition is performed on the acquired user's voice data. For example, speech recognition refers to converting voice data into text data and recognizing the content of the converted text data, such as word information contained in the text data.
[0116] In some embodiments, the voice data is recognized locally, that is, the acquired voice data is converted into text data locally, and the text data is recognized locally.
[0117] Alternatively, the voice data can be recognized on the server side. For example, the acquired voice data can be uploaded to the server, and the voice data can be converted into text data on the server side, and the text data can be recognized on the server side.
[0118] Alternatively, the voice data can be converted into text data locally, and then the text data can be recognized on the server side. For example, the acquired voice data can be converted into text data locally, then the text data can be uploaded to the server, and the text data can be recognized on the server side.
[0119] Therefore, speech recognition can be flexibly performed in multiple ways to meet the speech recognition needs in different scenarios and further improve the user experience.
[0120] In some embodiments, during step 102, first category information and second category information are identified based on the text data converted from the speech data. For example, the speech data is converted into text data, and the first category information and second category information are identified in the text data. This allows for the simple and rapid identification of information related to "expected feelings."
[0121] Figure 2 This is a schematic diagram of the method for implementing step 102 of Embodiment 1 of this application. For example... Figure 2 As shown, the method includes:
[0122] Step 201: Convert the voice data into text data;
[0123] Step 202: Identify the first category of information and the second category of information in the text data;
[0124] The first category of information is related to user expectations, while the second category of information is related to user feelings.
[0125] In this way, by simultaneously identifying the first category of information related to the user's expectations and the second category of information related to the user's feelings in the voice data, the combination of these two categories of information is taken as the user's expected feelings. This allows for the accurate capture of the "expected feelings" in the user's voice, enabling timely and accurate execution of corresponding controls and further improving the user experience.
[0126] In step 202, first-category information and second-category information are identified in the text data. For example, keywords in the first-category information and second-category information are identified.
[0127] In some embodiments, the first category information includes first descriptive information expressing an expectation; the first descriptive information includes, for example, first keywords, such as “I want”, “I expect”, “I hope”, “I want”, “I think”, “I need”, “give me”, “help me”, “I hope”, etc.; or, the first descriptive information includes, for example, sentences or phrases with the meaning of “expectation”, such as “If only I could…”, “… feels so good”, etc.
[0128] In some embodiments, "expected" means that the user is willing or wants to obtain it.
[0129] In some embodiments, the second category of information includes second descriptive information representing feelings, and does not include descriptive information related to home appliances or control commands. That is, the second descriptive information to be identified is an abstract description of the feelings, rather than control commands for the devices. The second descriptive information may include, for example, second keywords such as "April," "summer," or "sunny day," or it may include sentences or phrases with the meaning of "feeling," such as "after a rain in the empty mountains" or "the taste of love." Thus, by using an abstract description of the "expected feeling," automatic control that reflects the user's actual control intention can be achieved, further enhancing the intelligence and technological feel of voice control and improving the user experience.
[0130] In some embodiments, the second descriptive information representing a feeling includes at least one of a third descriptive information representing a bodily feeling and a fourth descriptive information representing a mood feeling.
[0131] In some embodiments, bodily sensation refers to the body's perception of the environment, which may be associated with at least one of scene, location, weather, season, and time. For example, "a gentle breeze in the shade of summer," "the West Lake in April," "the refreshing feeling after the rain," "a mountain after a fresh rain," etc.
[0132] In some embodiments, the third descriptive information representing a tactile sensation includes at least one of scene, location, weather, season, and time. For example, a user can express a tactile sensation by combining time and location, such as "West Lake in April," which includes the time keyword "April" and the location keyword "West Lake"; or, a user can express a tactile sensation by combining time and weather, such as "the refreshing air after the rain," which includes the time and weather keywords "after the rain." Furthermore, users can describe their desired tactile sensation using various expressions; for example, "the air after the rain" can also be described as "after the rain in the empty mountains, the weather turns autumnal in the evening," which includes the location keyword "mountain," the time and weather keyword "after the rain," and the season keyword "autumn."
[0133] In some embodiments, mood refers to at least one of the feelings and emotions evoked about things or people. For example, "the taste of love," "longing like the wind," or "the feeling of a sunny young man."
[0134] In some embodiments, the fourth descriptive information representing a mood includes at least one of subjective experience and emotion. For example, a user may describe a mood using the phrase "the taste of love," which includes the keywords "love" for both experience and emotion.
[0135] As a result, users can express their personalized needs from multiple dimensions such as physical sensation and emotional state, thereby satisfying their comprehensive and flexible control needs and further improving the user experience.
[0136] In some embodiments, when performing step 202, the text data can be matched against a preset data table. If a match is successful, the first category information and the second category information in the text data are determined. This allows for simple and quick identification of information related to "expected feelings."
[0137] Alternatively, during step 202, the text data can be input into a natural language processing-based model, which outputs first-category information and second-category information. This allows for accurate identification of information related to the "expected feeling" and prevents omissions, thereby further improving the precision of device control.
[0138] In some embodiments, during the process of recognizing voice data through "keyword matching," text data is matched against a preset data table, including:
[0139] Extract word information from text data;
[0140] Match the word information with the first and second category keyword sets in the preset keyword list respectively; and
[0141] Words and phrases that match the first set of keywords are identified as first keywords, and words and phrases that match the second set of keywords are identified as second keywords.
[0142] Therefore, by using keyword matching to determine the first and second categories of information, it is possible to easily and quickly identify information related to "expected feelings," which helps to reduce the cost of speech recognition, improve the accuracy of speech recognition, and thus further enhance the user experience.
[0143] In some embodiments, text data is divided into characters and words using techniques such as "word segmentation" to extract all character and word information from the text data.
[0144] In some embodiments, the first category keyword set includes a plurality of pre-set first keywords representing the user's expected intentions. Therefore, each word extracted from the text data of the voice data is sequentially matched with each of the first keywords included in the first keyword set. If a match is successful, the matched word is determined as the first category information in the voice data.
[0145] In some embodiments, the second category keyword set includes a plurality of pre-set second keywords representing user feelings, such as at least one of a third keyword representing physical sensation and a fourth keyword representing emotional state. Thus, each word extracted from the text data of the speech data is sequentially matched with each of the second, third, and fourth keywords included in the second keyword set. If a match is successful, the matched word is determined as the second category information in the speech data.
[0146] In some embodiments, the first category of keyword set and the second category of keyword set are preset or user-defined. The first category of keyword set and the second category of keyword set may be included in the same keyword table, or they may be included in different keyword tables, which is not limited in this application.
[0147] In some embodiments, the "natural language processing-based model" used in the process of recognizing speech data through "model recognition" is pre-trained.
[0148] In some embodiments, steps 201-202 can be executed locally, for example, on one or more local devices; or, step 201 is executed on a local device, which uploads the converted text data to a server, and step 202 is executed on the server side; or, the local device uploads the collected voice data to a server, and steps 201-202 are all executed on the server side.
[0149] In some embodiments, when performing step 102, a first category of information and a second category of information are identified in the speech data by sound wave matching. For example, sound wave matching is performed on the speech data, matching the sound waves of the speech data with preset sound wave data. If the match is successful, the first category of information and the second category of information in the speech data are determined, thereby enabling the identification of information related to "expected feelings" without recognizing the content of the speech data, reducing the risk of user privacy leakage.
[0150] In some embodiments, acoustic matching of the voice data is performed on a local device, or the local device uploads the voice data to a server and performs acoustic matching on the server side.
[0151] In some embodiments, the recognition of voice data in this application is applicable to scenarios with or without a wake word.
[0152] For example, in scenarios with a wake word, sound wave matching is used to identify whether the user's voice data includes a preset wake word. If the voice data includes a preset wake word, the voice data is converted into text data, and a first category of information and a second category of information are identified based on the text data; or, if the voice data includes a preset wake word, sound wave matching is used to identify the first category of information and the second category of information in the voice data.
[0153] For example, in a scenario without a wake word, when the user's voice data is detected, the first category of information and the second category of information are identified in the voice data. For example, the first category of information and the second category of information are identified based on the text data converted from the voice data, or the first category of information and the second category of information are identified in the voice data through sound wave matching.
[0154] This application determines whether a user has described an "expected feeling" by identifying whether the user's voice data simultaneously contains both "first category information" and "second category information." When both first and second category information are identified from the user's voice data, it is considered that the user has described an "expected feeling" through voice, and corresponding control commands are generated based on this "expected feeling." For example, if the user's voice data is "I want an April breeze," voice recognition will show that the voice data includes the first category information "I want" and the second category information "an April breeze." Control commands are then generated based on these first and second category information to create an environment where an "April breeze" exists for the user by controlling home appliances in the target space.
[0155] When only one of the first category of information or one of the second category of information is identified from the user's voice data, or when the user's voice data contains neither the first nor the second category of information, it is considered that the user has not described an "expected feeling." In this case, control commands can be generated in the existing manner, or control command generation can be paused and the user notified. For example, if the user's voice data is "I want to eat watermelon," voice recognition shows that this voice data only includes the first category of information "want," but does not include the second category of information related to the user's feeling. Furthermore, this voice data is unrelated to home appliances or control commands. Therefore, control command generation can be paused and the user notified. As another example, if the user's voice data is "I'm too hot," this voice data does not include the first category of information related to the user's expectation. Therefore, control commands can be generated using existing control methods, such as turning on the cooling system to lower the temperature in the target space, as feedback to the user's "I'm too hot," or control command generation can be paused and the user notified.
[0156] In some embodiments, the method further includes:
[0157] If the voice data contains first-category information related to the user's expectations and second-category information related to the user's feelings, determine whether the interval between the first-category information and the second-category information meets the preset conditions;
[0158] If the interval between the first category information and the second category information meets the preset conditions, a control command is generated based on the first category information and the second category information.
[0159] Therefore, when there are first-category information and second-category information in the voice data, it is further determined whether the interval between the two meets the preset conditions. If the preset conditions are met, control commands are generated based on the first-category information and the second-category information. This can avoid misidentification and miscontrol caused by interruption of voice data acquisition or excessively long interval between the two, improve the accuracy of the generated control commands, and further improve the user experience.
[0160] In some embodiments, the preset conditions include at least one of the following:
[0161] (1) The character spacing between the first category information and the second category information is less than or equal to the first threshold.
[0162] For example, suppose the first threshold is "2" characters, meaning that if the character spacing between the first category information and the second category information is less than or equal to 2, it is considered to meet the preset condition. Suppose the user's voice data is "I want the April breeze," which includes the first category information "want" and the second category information "April breeze," and the character spacing between the first and second categories is "0," less than the first threshold "2," then it is considered to meet the preset condition. Suppose the user's voice data is "I want to go out and feel the April breeze," which includes the first category information "want" and the second category information "April breeze," but the character spacing between the first and second categories is "6," greater than the first threshold "2," then it is considered not to meet the preset condition.
[0163] (2) The blank time interval between the first category information and the second category information is less than or equal to the second threshold.
[0164] For example, suppose the second threshold is "5" seconds, meaning that if the time interval between the first and second categories of information is less than or equal to 5 seconds, it is considered to meet the preset condition. Suppose the user's voice data is "I want... (ellipsis indicates a pause) the April breeze," which includes the first category information "want" and the second category information "the April breeze," and there are no characters between the first and second categories of information, i.e., there is a time interval (the user's pause interval). If this time interval is less than or equal to the second threshold of "5" seconds, it is considered to meet the preset condition; if the time interval is greater than the second threshold of "5" seconds, it is considered not to meet the preset condition.
[0165] (3) The meaning understood in combination with the context of the first category information and the second category information matches the meaning understood only based on the first category information and the second category information.
[0166] For example, suppose a user's voice data is "I want to eat first, and the watermelon can be cooled a bit more so I can eat it tonight." This includes the first category information "I want" and the second category information "cooler." Based solely on the first and second category information, the meaning understood is that the user wants to control home appliances to cool the environment. However, considering the context of the first and second category information, the meaning understood is that the user doesn't want to control home appliances to cool the environment, but rather wants the watermelon to cool down. Therefore, the meaning understood in conjunction with the context of the first and second category information does not match the meaning understood solely based on the first and second category information, and thus does not meet the preset condition. As another example, suppose a user's voice data is "I want to eat watermelon first, and it would be nice if the living room was cooler." This includes the first category information "I want" and the second category information "cooler." Based solely on the first and second category information, the meaning understood is that the user wants the living room to be cooler. Furthermore, considering the context of the first and second category information, the user wants the living room to be cooler. Therefore, the meaning understood in conjunction with the context of the first and second category information matches the meaning understood solely based on the first and second category information, and thus meets the preset condition.
[0167] Therefore, by limiting the interval between the first and second categories of information in the voice data, the accuracy of recognizing the user's actual control intention can be further improved, avoiding misidentification and miscontrol of home appliances, thereby further improving the user experience.
[0168] In some embodiments, if the interval between the first category information and the second category information does not meet a preset condition, control commands are not generated simultaneously based on the first category information and the second category information. This avoids mis-controls caused by mis-identification of voice data, improves the accuracy of generated control commands, and further enhances the user experience.
[0169] Figure 3 This is a schematic diagram of the method for implementing step 103 of Embodiment 1 of this application. For example... Figure 3 As shown, the method includes:
[0170] Step 301: Based on the first category information and the second category information, determine the environmental parameters that need to be adjusted and the target values of the environmental parameters;
[0171] Step 302: Based on the capabilities of the home appliances in the target space, determine whether the control based on the home appliances can adjust the environmental parameters in the target environment to the target value;
[0172] Step 303: If it is determined that the environmental parameters in the target space can be adjusted to the target value, then control instructions for the home appliances are generated based on the environmental parameters, the target values of the environmental parameters, and the home appliances used to adjust the environmental parameters.
[0173] In this way, when controlling the device based on the first category of information related to the user's expectations and the second category of information related to the user's feelings, the system first determines whether the environmental parameters in the target space can be adjusted to the target value based on the first information and the target value of the environmental parameters, and then performs the corresponding control if the environmental parameters in the target space can be adjusted to the target value based on the capabilities of the home appliances in the target space. This allows the system to take into account the capabilities of the devices actually deployed on site and perform control that conforms to the actual situation.
[0174] In some embodiments, environmental parameters may include at least one of the following parameters: air quality parameter, light parameter, sound parameter, and odor parameter. For example, environmental parameters may be adjusted by at least one home appliance deployed within the target space.
[0175] For example, air quality parameters include at least one of the following: temperature, humidity, carbon dioxide concentration, fine particulate matter (e.g., PM2.5) concentration, inhalable particulate matter (e.g., PM10) concentration, volatile organic compound (VOC) concentration, indoor total organic matter (TVOC) concentration, nitrogen oxide concentration, ozone concentration, wind speed, and wind direction. Air quality parameters can be regulated by air handling equipment (e.g., at least one of indoor units, humidifiers, dehumidifiers, fresh air systems, air purifiers, fragrance diffusers, and underfloor heating systems) and cleaning equipment (e.g., robotic vacuum cleaners) within the target space.
[0176] For example, lighting parameters include at least one of the following: light intensity, color, direction, and range. Lighting parameters can be adjusted by lighting equipment, curtains, etc., within the target space.
[0177] For example, sound parameters include volume, content, timbre, and tone. These parameters can be adjusted by audio-visual devices within the target space, such as smart speakers and smart TVs.
[0178] For example, odor parameters include the various types of odors in the air and their concentrations. These odor parameters can be adjusted, for example, by fragrance components within the target space.
[0179] In step 301, the environmental parameters that need to be adjusted and their target values are determined based on the first category information and the second category information. The environmental parameters that need to be adjusted and their target values can be determined through various methods.
[0180] In some embodiments, first category information and second category information, or a question based on first category information and second category information, can be input into a first model, which then outputs environmental parameters and their target values that match the first category information and second category information. For example, the first model is a big data model, an artificial intelligence question-answering model, or an artificial intelligence search engine.
[0181] For example, if the first category information and the second category information are "I want a cool breeze in the shade in summer", and the big data model is input with "a cool breeze in the shade in summer", the big data model will output "temperature 26 degrees, no direct sunlight, wind force 1".
[0182] For example, if the first category of information and the second category of information are "I want the feeling of West Lake in April", and the AI question answering model is input with "What are the air parameters of West Lake in April", the AI question answering model will output "Temperature 18-20 degrees, humidity 70-80%, wind force 1-2".
[0183] For example, if the first category of information and the second category of information are "I want the feeling of being in love", and the AI question-answering model is input with "how to create the feeling of being in love", the AI question-answering model will output "temperature 22-25 degrees, soft lighting, play light music".
[0184] In step 302, it can be determined whether the control based on the home appliances in the target space can adjust the environmental parameter in the target space to the target value, based on the capabilities of the home appliances in the target space.
[0185] For example, as mentioned above, in the case where the "first category information and the second category information" are "I want a cool breeze in the shade in summer", the environmental parameters output by the big data model are "temperature 26 degrees, no direct sunlight, wind force level 1". The target space is a bedroom, which is equipped with an air conditioner that can adjust the temperature and wind force, smart curtains and smart lighting devices. Therefore, it has the ability to adjust the environmental parameters in the target space to the target value.
[0186] For example, as mentioned above, in the case where the "first category information and second category information" are "I want the feeling of West Lake in April", the AI question answering model outputs the environmental parameters as "temperature 18-20 degrees, humidity 70-80%, wind force 1-2". The target space is the living room, which is equipped with an air conditioner that can adjust the temperature, humidity and wind force, and also has a humidifier that can adjust the humidity. Therefore, it has the ability to adjust the environmental parameters in the target space to the target value.
[0187] For example, as mentioned above, in the case where the "first category information and the second category information" are "I want the feeling of being in love", the environmental parameters output by the AI question answering model are "temperature 22-25 degrees, soft lighting, playing light music". The target space is the living room. However, the living room is only equipped with air conditioning and cannot automatically adjust the lighting and play light music. Therefore, it does not have the ability to adjust the environmental parameters in the target space to the target value.
[0188] In step 303, if it is determined that the environmental parameter within the target space can be adjusted to the corresponding target value, a control command for the home appliance is generated based on the environmental parameter, the target value of the environmental parameter, and the home appliance used to adjust the environmental parameter. The control command can be generated in various ways.
[0189] In some embodiments, environmental parameters, target values for those environmental parameters, and home appliances used to adjust those environmental parameters can be input into a second model, which then outputs control commands. For example, the second model is a trained deep learning model.
[0190] For example, as mentioned above, in the case where "first category information and second category information" are "I want a cool breeze in the shade in summer", the environmental parameters output by the big data model are "temperature 26 degrees, no direct sunlight, wind force level 1". The control command generated by the second model is "adjust the air conditioner temperature to 26 degrees, adjust the air volume to low, and close the sheer curtains in the curtains". In addition, based on the detection results of the user's location, the control command can also include instructions for adjusting the air conditioner's airflow direction.
[0191] For example, as mentioned above, for the case where the "first information" is "West Lake in April", the environmental parameters output by the AI question-answering model are "temperature 18-20 degrees, humidity 70-80%, wind force 1-2". The control command generated by the second model is "adjust the air conditioner temperature to 20 degrees, adjust the air volume to low, and adjust the target humidity of the humidifier to 70%". In addition, based on the detection results of the user's location, the control command can also include instructions for adjusting the air conditioner's airflow direction.
[0192] In some embodiments, a control command database may be established, which stores the first category information and the second category information, environmental parameters, target values, home appliances, and corresponding control commands; or, it may store the environmental parameters, target values, home appliances, and corresponding control commands.
[0193] Thus, in steps 301 and 303, environmental parameters, their target values, and control commands that match the first category information and the second category information can be obtained by looking up a table.
[0194] After the control command is generated, in step 104, the home appliances in the target space are controlled according to the control command.
[0195] In some embodiments, step 104 is performed by the control device, that is, the control corresponding to the control command is performed.
[0196] In some embodiments, the control device may include at least one of a central controller, a wired controller, and a remote controller, wherein the central controller and the wired controller are also referred to as fixed controllers, which are fixedly installed on the wall of an indoor space.
[0197] In some embodiments, the central controller may be located within the indoor space in various ways.
[0198] For example, centralized controllers are set up in various spaces and can control the home appliances in the space where the centralized controller is located. For example, a first centralized controller is set up in the living room, a second centralized controller is set up in the bedroom, and a third centralized controller is set up in the kitchen. The first centralized controller can control the home appliances in the living room, the second centralized controller can control the home appliances in the bedroom, and the third centralized controller can control the home appliances in the kitchen.
[0199] For example, centralized controllers are set up in each space and can control the home appliances in all spaces. For instance, a first centralized controller is set up in the living room, a second centralized controller is set up in the bedroom, and a third centralized controller is set up in the kitchen. The first, second, and third centralized controllers can all control the home appliances in the living room, bedroom, and kitchen.
[0200] For example, a centralized controller can be installed in a specific space and control all home appliances in that space. Additionally, wired controllers can be installed in other spaces outside the designated central controller to control the appliances within those spaces. For instance, a centralized controller in the living room can control all home appliances in the living room, bedrooms, and kitchen; wired controllers can be installed in the bedrooms and kitchen to control the appliances in those spaces respectively.
[0201] This application is not limited to this; the control device may also include a gateway device. In some embodiments, the gateway device can control at least one home appliance according to control commands. For example, the gateway device can control all the home appliances in the voice control system, thereby enabling centralized control of home appliances by connecting home appliances in different spaces through a single gateway device. This application is not limited to this; the gateway device can control some of the home appliances in the voice control system. For example, when there are many home appliances in the voice control system, multiple gateway devices can be set up to control all the home appliances in the voice control system through multiple gateway devices.
[0202] In some embodiments, home appliances can be jointly controlled by a central controller and a gateway device. This allows for the inclusion of multiple control devices (e.g., a gateway device and a central controller) within the voice control system, thereby improving the reliability and fault tolerance of voice control.
[0203] In some embodiments, the method further includes:
[0204] If it is determined that the multiple environmental parameters in the target space (i.e., the multiple environmental parameters that need to be adjusted as determined in step 301) cannot be adjusted to the corresponding target values, and there are multiple environmental parameters, it is determined whether the control based on the home device can adjust the necessary environmental parameter among the multiple environmental parameters to the corresponding target value.
[0205] When the necessary environmental parameter among multiple environmental parameters can be adjusted to the corresponding target value, control instructions for the home equipment are generated based on the necessary environmental parameter, the target value of the necessary environmental parameter, and the home equipment used to adjust the necessary environmental parameter.
[0206] In some embodiments, necessary environmental parameters refer to environmental parameters that must be adjusted to achieve the user's control intent. In other words, by adjusting all necessary environmental parameters, the user's main control intent or close to the user's control intent can be achieved.
[0207] In some embodiments, the necessary environmental parameters correspond to the user’s control intent, or to the “expected feeling” that embodies the control intent.
[0208] In this way, when it is determined that it is impossible to adjust the environmental parameters that need to be adjusted within the target space to the target value, it is determined whether the adjustable environmental parameters include the necessary environmental parameters. If the necessary environmental parameters are included, the corresponding control is executed. Thus, at least the necessary environmental parameters that achieve the user's expected experience can be adjusted to the target value, thereby maximizing the satisfaction of the user's main control intention or close to the user's control intention, and further improving the user experience.
[0209] For example, if it is determined that it is not possible to adjust all the above-mentioned environmental parameters in the target space to the corresponding target value by controlling the home appliances, it is determined whether the environmental parameters that can be adjusted to the corresponding target value include the corresponding necessary environmental parameters; if so, then control instructions for the home appliances are generated based on the environmental parameters that can be adjusted to the corresponding target value (including the necessary environmental parameters), the target values of the environmental parameters, and the home appliances used to adjust the environmental parameters.
[0210] For example, suppose the determined environmental parameters to be adjusted and their target values include: temperature and target temperature value, humidity and target humidity value within the target space, air quality and target air quality index, and the corresponding necessary environmental parameter includes temperature. If it is determined that the environmental parameters that can be adjusted to the corresponding target values include temperature and humidity, which include the necessary environmental parameters, then control instructions for the home appliances are generated.
[0211] Furthermore, the necessary environmental parameters include preset environmental parameters among a variety of environmental parameters that need to be adjusted, or environmental parameters with preset priorities among a variety of environmental parameters that need to be adjusted. Therefore, based on the type or priority of the various environmental parameters that need to be adjusted, corresponding controls can be executed, thereby enabling the use of the equipment actually deployed on-site to achieve the user's main control intent or close to the user's control intent, thus maximizing the satisfaction of the user's control needs and further improving the user experience.
[0212] For example, if it is impossible to adjust all the above-mentioned environmental parameters in the target space to the corresponding target values by controlling the home appliances, determine whether the preset environmental parameters among the environmental parameters that need to be adjusted can be adjusted to the corresponding target values; if so, generate control instructions for the home appliances based on the adjustable environmental parameters (including preset environmental parameters), the target values of the corresponding environmental parameters, and the corresponding home appliances used to adjust the environmental parameters.
[0213] For example, assuming a user desires a "romantic" experience, the determined environmental parameters to be adjusted and their target values include: temperature in the target space: 22-25 degrees Celsius, lighting: soft, and music: light music; and the preset environmental parameters for this situation include "lighting" and "music". If it is determined that the environmental parameters that can be adjusted to the corresponding target values include "lighting" and "music", then control commands for the home appliances are generated.
[0214] For example, if it is impossible to adjust all the above-mentioned environmental parameters in the target space to the corresponding target values by controlling the home appliances, it is determined whether the environmental parameters with preset priorities among the environmental parameters that need to be adjusted can be adjusted to the corresponding target values; if so, then control instructions for the home appliances are generated based on the adjustable environmental parameters (including environmental parameters with preset priorities), the target values of the corresponding environmental parameters, and the corresponding home appliances used to adjust the environmental parameters.
[0215] For example, assuming a user desires a "romantic" ambiance, the determined environmental parameters to be adjusted and their target values include: temperature in the target space: 22-25 degrees Celsius, lighting: soft, music: light music; and in this case, "lighting" has the first priority, "music" has the first priority, "temperature" has the second priority, and the preset priority is "first priority". If it is determined that the environmental parameters "music" and "lighting" with the first priority can be adjusted to their corresponding target values, then control commands for the home appliances are generated.
[0216] In some embodiments, at least one of the preset environmental parameters, the priority of the environmental parameters, and the preset priority is preset by the user; or, at least one of the preset environmental parameters, the priority of the environmental parameters, and the preset priority is set through interaction with the user; or, the preset environmental parameters, the priority of the environmental parameters, and the preset priority are automatically updated by learning based on at least one of the user's usage habits, the environment, the current season, the user's status, and user feedback.
[0217] Therefore, it is possible to flexibly set the necessary environmental parameters in different scenarios, or to determine the necessary environmental parameters through self-learning, thereby improving the accuracy and rationality of the necessary environmental parameters, better meeting the user's control needs, and further improving the user experience.
[0218] In some embodiments, the necessary environmental parameters may differ depending on the user's desired feeling. For example, if a user desires the "feeling of love," the necessary environmental parameters are "light" and "music"; if a user desires the "gentle breeze in the shade of summer," the necessary environmental parameters are "temperature," "wind speed," and "light." This allows for more flexible fulfillment of the user's actual control intentions, further improving the user experience.
[0219] In some embodiments, if it is determined that the multiple environmental parameters (i.e., the multiple environmental parameters that need to be adjusted as determined in step 301) in the target space cannot be adjusted to the corresponding target values, and it is determined that the control of the home appliance can adjust the necessary environmental parameter among the multiple environmental parameters to the corresponding target values, it is determined whether the ratio of the adjustable environmental parameter among the environmental parameters to the environmental parameter to be adjusted is greater than or equal to a preset ratio. If so, a control command for the home appliance is generated based on the adjustable environmental parameter (including the necessary environmental parameter), the target value of the corresponding environmental parameter, and the corresponding home appliance used to adjust the environmental parameter.
[0220] For example, suppose the determined environmental parameters to be adjusted and their target values include: temperature and target temperature value, humidity and target humidity value in the target space, air quality and target air quality index, and the necessary environmental parameter includes temperature, with the preset ratio being 60%. If it is determined that the environmental parameters that can be adjusted to the corresponding target value include temperature and humidity, including the necessary environmental parameter "temperature", and the ratio of the number of environmental parameters "2" that can be adjusted to the corresponding target value to the number of determined environmental parameters "3" that need to be adjusted is 66.6%, exceeding the preset ratio of 60%, then a control command for the home appliance is generated.
[0221] This approach maximizes the satisfaction of users' control needs, ensures better control performance, and further enhances the user experience.
[0222] In some embodiments, the method further includes:
[0223] If the necessary environmental parameter among the various environmental parameters (i.e., the various environmental parameters that need to be adjusted as determined in step 301) cannot be adjusted to the corresponding target value, the generation of control commands will be paused and the user will be notified.
[0224] This avoids situations where controlling home appliances according to generated control commands fails to achieve the user's expected results or meet the user's control needs, thus preventing a negative impact on the user experience.
[0225] In some embodiments, the method further includes:
[0226] If it is determined that the environmental parameters in the target space cannot be adjusted to the target value, at least one of the environmental parameters and the target value is adjusted according to the capabilities of the home appliances in the target space; based on the adjusted environmental parameters, the target values of the environmental parameters, and the home appliances used to adjust the environmental parameters, control instructions for the home appliances are generated.
[0227] For example, suppose the determined environmental parameters to be adjusted and their target values include: temperature and target temperature value in the target space, humidity and target humidity value, air quality and target air quality index. However, the target humidity value exceeds the adjustment range of the home appliance used to adjust humidity, for example, the humidity value is greater than the upper limit of the humidity adjustment range of the home appliance used to adjust humidity. In this case, at least one environmental parameter and target value can be adjusted, for example, adjusting the target humidity value to the upper limit of the humidity adjustment range of the home appliance used to adjust humidity, and lowering the target temperature value. Based on the adjusted target humidity value, the adjusted target temperature value, and the target air quality index, a control instruction for the home appliance is generated. This control instruction includes a first control instruction for adjusting the temperature in the target space to the adjusted target temperature value for the home appliance used to adjust temperature, a second control instruction for adjusting the humidity in the target space to the adjusted target humidity value for the home appliance used to adjust humidity, and a third control instruction for adjusting the air quality in the target space to the target air quality index for the home appliance used to adjust air quality.
[0228] Therefore, even when the desired experience cannot be fully achieved, the actual equipment deployed on-site can be used to optimize the parameters that need adjustment (e.g., changing them to similar adjustable environmental parameters) and the target values (e.g., changing them to achievable target values), so that the adjustment effect is closer to the desired experience. In other words, existing equipment is used to meet the user's personalized needs as much as possible, further enhancing the user experience.
[0229] In some embodiments, the determination of whether it is possible to adjust the environmental parameters in the target space to the target value is based not only on the capabilities of the home appliances in the target space, but also on at least one of comfort and energy efficiency. If it is determined that it is possible and the comfort and energy efficiency are satisfied, corresponding control is performed, thereby enabling the execution of control that conforms to the actual situation by taking into account the capabilities of the equipment actually deployed on site.
[0230] For example, the comfort is determined based on the matching of the control command with at least one of the following: current environmental parameters, current season, current time, geographic information, user attributes, user status, and user usage habits; the energy efficiency is determined based on at least one of the following: operating power, operating time, operating mode, number of devices, and device type of home appliances associated with the control command.
[0231] The first control commands selected in this way can be applied to more scenarios and meet the personalized needs of users.
[0232] In some embodiments, multiple home appliances within the target space can be controlled according to control commands, thereby enabling the coordinated control of multiple devices or multiple devices through a single command from the user, achieving a control effect that better matches the user's actual intentions.
[0233] For example, as mentioned above, in the case where the "first information" is "a gentle breeze in the shade during summer", the air conditioner and curtains are controlled in a coordinated manner through the first control command.
[0234] For example, as mentioned above, in the case where the "first information" is "the West Lake in April", the air conditioner and humidifier are controlled in a coordinated manner through the first control command.
[0235] This allows for a simpler way to achieve control effects that better match the user's actual intentions.
[0236] In some embodiments, the method may further include: notifying the user of the speech recognition results, for example, notifying the user of first information identified from the speech data that relates to the user's desired feeling. For example, notifying the user "You just mentioned wanting a cool breeze in the shade of summer."
[0237] In some embodiments, notifications may be sent to users through one or more of the following methods: display, broadcast, push notification, etc.
[0238] For example, the speech recognition results can be displayed through a local device with display capabilities, which can be displayed as text. The local device with display capabilities includes, but is not limited to, one or more of a central controller, a smart TV, and a smart screen.
[0239] For example, the speech recognition results can be broadcast through a local device with voice broadcasting capabilities. Such local devices include, but are not limited to, one or more of a central controller, a smart TV, and a smart speaker.
[0240] For example, the speech recognition results can be notified to the user by pushing a message to the user's mobile device.
[0241] In some embodiments, the method may further include: notifying the user of the control result, for example, notifying the user of information about the completion of control based on "first information related to the user's expected feelings", such as "the air conditioner temperature has been adjusted to 26 degrees, the air volume has been adjusted to low, and the curtains in the curtains have been closed, based on your request for a cool breeze in the shade during the summer".
[0242] In some embodiments, notifications may be sent to users through one or more of the following methods: display, broadcast, push notification, etc.
[0243] For example, the control results can be displayed through a local device with display capabilities, and these results can be displayed via text. Local devices with display capabilities include, but are not limited to, one or more of a central controller, a smart TV, and a smart screen.
[0244] For example, control results can be broadcast through a local device with voice broadcasting capabilities. Such local devices include, but are not limited to, one or more of a central controller, a smart TV, and a smart speaker.
[0245] For example, notifications of control results can be sent to users' mobile devices by pushing messages.
[0246] In some embodiments, the method may further include: engaging in voice interaction with a user, for example, engaging in voice interaction with a user when it is determined that the generated control commands are inaccurate or unreasonable.
[0247] For example, voice interaction with users can be achieved through local devices with voice broadcasting and data acquisition capabilities. These local devices include, but are not limited to, one or more of a central controller, a smart TV, and a smart speaker.
[0248] Figure 4 This is a flowchart illustrating one implementation of the voice control method according to an embodiment of this application. Figure 4 As shown, the voice control method includes:
[0249] Step 401: Acquire voice data;
[0250] Step 402: Identify first-category information related to the user's expectations and second-category information related to the user's feelings in the voice data;
[0251] Step 403: Determine whether the first category information and the second category information are identified simultaneously; if yes, proceed to step 404; if no, proceed to step 407.
[0252] Step 404: Determine whether the interval between the first category information and the second category information meets the preset conditions; if yes, proceed to step 405; if no, proceed to step 407.
[0253] Step 405: Generate control instructions based on the first category information and the second category information;
[0254] Step 406: Control the home appliances in the target space according to the control instructions, and then end.
[0255] Step 407: Pause the generation of control commands, notify the user, and end.
[0256] For detailed implementation of steps 401-407, please refer to... Figures 1 to 4 The relevant steps and the aforementioned embodiments will not be repeated here.
[0257] According to the above embodiments, control commands for home appliances are generated based on a first category of information related to the user's expectations and a second category of information related to the user's feelings in the user's voice data. Thus, based on the user's abstract description of "expectations and feelings," the user's actual intention can be identified and automatically converted into control commands that need to be executed to achieve the actual intention of "expectations and feelings." These control commands may be comprehensive control commands for multiple devices. Therefore, users can achieve advanced control of multiple devices working together to achieve the actual intention of "expectations and feelings" through a brief description (e.g., a sentence), thus improving the user experience.
[0258] Compared to the traditional methods in which users describe control commands and their current physical state, the abstract description of "expected feelings" in this embodiment of the invention can express the user's comprehensive feelings and needs. Furthermore, this description contains more information, thus more fully reflecting the user's actual control intentions.
[0259] In addition, in the voice control mode based on user expectations, by analyzing and understanding the natural language expressed by the user, it is possible to identify the user's comprehensive and complex actual intentions. This solves the problem that users cannot fully describe their actual control intentions in the traditional voice control mode, making the control more precise and improving the user experience.
[0260] In addition, since the "expected feeling" represents the user's actual intention, the user's actual intention can be directly identified or detected from the voice data, rather than "inferring" the user's actual intention from the content of the voice data as in traditional methods, thus significantly improving the accuracy of control.
[0261] Furthermore, users only need to describe their "desired feeling" to control home appliances without needing to understand or master control commands or related information. Moreover, even if users do not know how to control the devices to meet their needs, they can still achieve their control objectives, thereby reducing the operational difficulty and usage threshold of home appliance control and improving the user experience.
[0262] Furthermore, by limiting the interval between the first category of information related to user expectations and the second category of information related to user feelings through preset conditions, the accuracy of identifying the user's actual control intention can be improved, thereby improving the accuracy of the generated control commands, reducing misoperations, and thus better meeting user needs and further improving the user experience.
[0263] Example 2
[0264] Embodiment 2 of this application provides a voice control system, which includes a voice acquisition device, a server, a control device, and at least one home appliance. The voice acquisition device collects the user's voice data; the server acquires the voice data, identifies a first category of information related to the user's expectations and a second category of information related to the user's feelings within the voice data, and generates control commands based on the first and second category information when the interval between the first and second category information meets a preset condition, and sends the control commands to the control device; the control device controls the home appliance within the target space according to the control commands.
[0265] The voice device can acquire voice data in the manner described in Embodiment 1. For details, please refer to the description in Embodiment 1, which will not be repeated here.
[0266] Figure 5 This is a schematic block diagram of a voice control system according to an embodiment of this application. Figure 5 As shown, the voice control system 50 includes a voice acquisition device 51, a server 52, a control device 53, and at least one home appliance 54. The voice acquisition device 51 acquires the user's voice data. The server 52 acquires the voice data, identifies a first category of information related to the user's expectations and a second category of information related to the user's feelings in the voice data, and generates control instructions based on the first category of information and the second category of information when the interval between the first category of information and the second category of information meets a preset condition, and sends the control instructions to the control device 53. The control device 53 controls the home appliance 54 in the target space according to the control instructions.
[0267] In some embodiments, the voice acquisition device 51 includes, but is not limited to, at least one of a central controller, a wired controller, a remote controller, a user terminal, a smart speaker, and an air handling unit with voice acquisition function.
[0268] In some embodiments, some functions of server 52 may be implemented by a local speech recognition device. For example, the local speech recognition device acquires speech data and identifies a first category of information related to the user's expectations and a second category of information related to the user's feelings within the speech data. The remaining functions are still implemented by the server.
[0269] In some embodiments, the control device 53 includes, but is not limited to, at least one of a central controller and a gateway device.
[0270] In some embodiments, the voice acquisition device 51 and the control device 53 may be the same device, for example, both the voice acquisition device 51 and the control device 53 may be a centralized controller;
[0271] Alternatively, the voice acquisition device 51 and the control device 53 can be different devices. For example, the voice acquisition device 51 is a smart speaker, and the control device 53 is a central controller or gateway device.
[0272] In some embodiments, the home appliance 54 includes, but is not limited to, air handling equipment and smart devices, wherein the smart devices include, for example, at least one of smart speakers, smart display devices (e.g., smart screens, smart TVs, etc.), smart switches, smart curtains, smart lighting equipment, sensors (e.g., air sensors, temperature sensors, humidity sensors, etc.) and smart pet devices.
[0273] In some embodiments, the air handling equipment includes, but is not limited to, at least one of an indoor unit, a fresh air system, a humidity control system (e.g., a humidifier, a dehumidifier, etc.), and a floor heating system.
[0274] Figure 6 This is a structural diagram of the voice control system used in the voice control method of this application embodiment. For example... Figure 6 As shown, the voice control system 600 includes one or more home appliances 610, user terminal 620, central controller 631, gateway device 632, router 640, Internet 650, and first server 661, second server 662, etc.
[0275] In some embodiments, home appliance 610 may be a smart home appliance. For example, home appliances include air handling equipment and smart devices. The smart devices include at least one of smart speakers, smart display devices (such as smart screens, smart TVs, etc.), smart switches, smart curtains, smart lighting equipment, sensors, and smart pet devices. The air handling equipment includes at least one of indoor units, humidifiers, dehumidifiers, fresh air systems, and underfloor heating systems.
[0276] In some embodiments, home appliance 610 can connect to and communicate with central controller 631 or gateway device 632 in various ways.
[0277] To distinguish between different connection or communication methods, such as Figure 6 As shown, home appliances 610 include multiple sets of home appliances 610-1, 610-2, 610-3, 610-4, and 610-5.
[0278] In some embodiments, home appliance 610-1 is connected to central controller 631 and gateway device 632 via wired connection. For example, home appliance 610-1 includes at least one of an outdoor air conditioner unit, an indoor air conditioner unit, a fresh air system, and a floor heating system. In addition, the indoor air conditioner unit can be connected to an air quality sensor and a wired controller, and the floor heating system can be connected to the wired controller.
[0279] In some embodiments, home device 610-2 is connected to voice central controller 631 and gateway device 632 via Wi-Fi. For example, home device 610-2 includes at least one of a home wall-mounted air conditioner, a wall-mounted / cabinet-style display component (e.g., LSM), a humidification component, a dehumidification component, a smart pet device (e.g., a watchdog component), and a sleep sensor.
[0280] In some embodiments, home device 610-3 is connected to voice central controller 631 via Bluetooth or Zigbee. For example, home device 610-3 includes at least one of smart curtains (e.g., motorized curtains, roller blinds) and smart lighting devices (e.g., dimmer lights).
[0281] In some embodiments, home appliance 610-4 is connected to central controller 631 via a wired connection. For example, home appliance 610-4 includes at least one of smart curtains (e.g., motorized curtains, roller blinds), smart lighting devices (e.g., ceiling lights), and floor heating valves.
[0282] In some embodiments, home device 610-5 is connected to gateway device 632 via Wi-Fi. For example, home device 610-5 is an air sensor installed indoors.
[0283] In some embodiments, the home appliance may also include Figure 6 Devices not shown, such as smart speakers, smart TVs, etc., can be connected wirelessly or wired to at least one of the voice central controller 631 and gateway device 632.
[0284] In some embodiments, such as Figure 6 As shown, the user's voice data is collected by the central controller 631 or the user terminal 620.
[0285] In some embodiments, such as Figure 6 As shown, the home appliance 610 is controlled by a central controller 631 or a gateway device 632.
[0286] In some embodiments, the central controller has voice control capabilities, including a network (online) control mode and a local control mode.
[0287] The network (online) control mode of the centralized controller refers to the following: the centralized controller sends the collected voice data to the server through the router. The server recognizes the voice data and generates control commands. The centralized controller then controls the home appliances according to the control commands. In other words, the centralized controller controls the home appliances online through the Internet or other networks via voice control mode.
[0288] For example, the central controller 631 collects voice data and sends it to the first server 661 via the router 640 and the Internet 650. The first server 661 recognizes the voice data (e.g., through a large model) to obtain the corresponding text data and sends the text data to the second server 662. The second server 662 generates corresponding control commands based on the text data and sends them to the central controller 631 via the router 640 and the Internet 650. The central controller 631 then controls the relevant home appliances according to the control commands.
[0289] The local control mode of the centralized controller refers to the centralized controller recognizing the collected voice data locally and generating control commands, and then controlling home appliances according to the control commands. In other words, the centralized controller controls home appliances locally via voice control.
[0290] For example, the central controller 631 collects voice data, recognizes the voice data locally (at the central controller 631 end), and sends the text data to the second server 662 via the router 640 and the Internet 650. The second server 662 generates corresponding control commands based on the text data and sends them to the central controller 631 via the Internet 650 and the router 641. The central controller 631 then controls the relevant home appliances according to the control commands.
[0291] In this way, the network (online) control mode and local control mode can be flexibly switched according to different scenarios or network conditions, improving the effectiveness and efficiency of voice control and enhancing the user experience.
[0292] Alternatively, user terminal 620 can collect voice data, which is then transmitted to first server 661 via router 640 and Internet 650. First server 661 recognizes the voice data (e.g., through a large model) to obtain corresponding text data, and sends the text data to second server 662. Second server 662 generates corresponding control commands based on the text data and transmits them to gateway device 632 via Internet 650 and router 640. Gateway device 632 then controls the relevant home appliances according to the control commands.
[0293] In addition, the functions of the first server 661 and the second server 662 can also be performed by a single server, and this application embodiment does not limit this.
[0294] In some embodiments, the central controller 631 is a central controller with a display screen.
[0295] In some embodiments, gateway device 632 is an intelligent gateway with centralized control functions.
[0296] In some embodiments, in addition to the centralized controller, control operations can also be performed through other controllers with control functions, such as at least one of a wired controller and a remote controller. The centralized controller and the wired controller are also referred to as fixed controllers, which are fixedly installed on the wall of an indoor space. For details related to the centralized controller, please refer to the relevant content in Embodiment 1, which will not be repeated here.
[0297] According to the above embodiments, control commands for home appliances are generated based on a first category of information related to the user's expectations and a second category of information related to the user's feelings in the user's voice data. Thus, based on the user's abstract description of "expectations and feelings," the user's actual intention can be identified and automatically converted into control commands that need to be executed to achieve the actual intention of "expectations and feelings." These control commands may be comprehensive control commands for multiple devices. Therefore, users can achieve advanced control of multiple devices working together to achieve the actual intention of "expectations and feelings" through a brief description (e.g., a sentence), thus improving the user experience.
[0298] Compared to the traditional methods in which users describe control commands and their current physical state, the abstract description of "expected feelings" in this embodiment of the invention can express the user's comprehensive feelings and needs. Furthermore, this description contains more information, thus more fully reflecting the user's actual control intentions.
[0299] In addition, in the voice control mode based on user expectations, by analyzing and understanding the natural language expressed by the user, it is possible to identify the user's comprehensive and complex actual intentions. This solves the problem that users cannot fully describe their actual control intentions in the traditional voice control mode, making the control more precise and improving the user experience.
[0300] In addition, since the "expected feeling" represents the user's actual intention, the user's actual intention can be directly identified or detected from the voice data, rather than "inferring" the user's actual intention from the content of the voice data as in traditional methods, thus significantly improving the accuracy of control.
[0301] Furthermore, users only need to describe their "desired feeling" to control home appliances without needing to understand or master control commands or related information. Moreover, even if users do not know how to control the devices to meet their needs, they can still achieve their control objectives, thereby reducing the operational difficulty and usage threshold of home appliance control and improving the user experience.
[0302] Furthermore, by limiting the interval between the first category of information related to user expectations and the second category of information related to user feelings through preset conditions, the accuracy of identifying the user's actual control intention can be improved, thereby improving the accuracy of the generated control commands, reducing misoperations, and thus better meeting user needs and further improving the user experience.
[0303] Example 3
[0304] Embodiment 3 of this application provides a voice control device. This voice control device corresponds to the voice control method described in Embodiment 1, and the specific details can be found in Embodiment 1.
[0305] Figure 7 This is a schematic block diagram of a voice control device according to an embodiment of this application. Figure 7 As shown, the voice control device 70 includes:
[0306] Module 71 acquires voice data;
[0307] The recognition module 72 identifies first-category information related to the user's expectations and second-category information related to the user's feelings in the voice data;
[0308] The generation module 73, when the interval between the first category information and the second category information meets a preset condition, generates control instructions based on the first category information and the second category information; and
[0309] The control module 74 controls the home appliances in the target space according to the control instructions.
[0310] The specific functions of each of the above units can be found in the relevant steps in Embodiment 1, and will not be repeated here.
[0311] According to the above embodiments, control commands for home appliances are generated based on a first category of information related to the user's expectations and a second category of information related to the user's feelings in the user's voice data. Thus, based on the user's abstract description of "expectations and feelings," the user's actual intention can be identified and automatically converted into control commands that need to be executed to achieve the actual intention of "expectations and feelings." These control commands may be comprehensive control commands for multiple devices. Therefore, users can achieve advanced control of multiple devices working together to achieve the actual intention of "expectations and feelings" through a brief description (e.g., a sentence), thus improving the user experience.
[0312] Compared to the traditional methods in which users describe control commands and their current physical state, the abstract description of "expected feelings" in this embodiment of the invention can express the user's comprehensive feelings and needs. Furthermore, this description contains more information, thus more fully reflecting the user's actual control intentions.
[0313] In addition, in the voice control mode based on user expectations, by analyzing and understanding the natural language expressed by the user, it is possible to identify the user's comprehensive and complex actual intentions. This solves the problem that users cannot fully describe their actual control intentions in the traditional voice control mode, making the control more precise and improving the user experience.
[0314] In addition, since the "expected feeling" represents the user's actual intention, the user's actual intention can be directly identified or detected from the voice data, rather than "inferring" the user's actual intention from the content of the voice data as in traditional methods, thus significantly improving the accuracy of control.
[0315] Furthermore, users only need to describe their "desired feeling" to control home appliances without needing to understand or master control commands or related information. Moreover, even if users do not know how to control the devices to meet their needs, they can still achieve their control objectives, reducing the operational difficulty and usage threshold of home appliance control and improving the user experience.
[0316] Furthermore, by limiting the interval between the first category of information related to user expectations and the second category of information related to user feelings through preset conditions, the accuracy of identifying the user's actual control intention can be improved, thereby improving the accuracy of the generated control commands, reducing misoperations, and thus better meeting user needs and further improving the user experience.
[0317] Example 4
[0318] Embodiment 4 of this application provides an electronic device. The steps executed by the processor of this electronic device correspond to all or part of the steps of the voice control method described in Embodiment 1, and the specific details can be found in the description of Embodiment 1.
[0319] Figure 8 This is a schematic block diagram of the system configuration of an electronic device according to an embodiment of this application. Figure 8 As shown, electronic device 800 may include processor 810 and memory 820; memory 820 is coupled to processor 810. It is worth noting that this figure is exemplary; other types of structures may be used to supplement or replace this structure to achieve telecommunications functions or other functions.
[0320] In one embodiment, the processor 810 may be configured to: acquire voice data; identify in the voice data a first category of information related to the user's expectations and a second category of information related to the user's feelings; generate control instructions based on the first category of information and the second category of information if the interval between the first category of information and the second category of information meets a preset condition; and control home appliances in the target space according to the control instructions.
[0321] like Figure 8 As shown, the electronic device 800 may also include: a communication module 830, an input unit 840, a display 850, a speaker 860, a microphone 870, and a power supply 880. It is worth noting that the electronic device 800 does not necessarily need to include these components. Figure 8 All components shown; in addition, the electronic device 800 may also include Figure 8 For components not shown, please refer to relevant technologies.
[0322] like Figure 8 As shown, the processor 810, sometimes also referred to as a controller or operation control, may include a microprocessor or other processor device and / or logic device, which receives inputs and controls the operation of various components of the electronic device 800.
[0323] The memory 820 may be, for example, one or more of a cache, flash memory, hard drive, removable medium, volatile memory, non-volatile memory, or other suitable means. It can store various types of data, and also stores programs for executing related information. The processor 810 can execute the program stored in the memory 820 to perform information storage or processing, etc. The functions of other components are similar to those in existing systems and will not be described further here. The components of the electronic device 800 can be implemented using dedicated hardware, firmware, software, or a combination thereof without departing from the scope of the invention.
[0324] According to the above embodiments, control commands for home appliances are generated based on a first category of information related to the user's expectations and a second category of information related to the user's feelings in the user's voice data. Thus, based on the user's abstract description of "expectations and feelings," the user's actual intention can be identified and automatically converted into control commands that need to be executed to achieve the actual intention of "expectations and feelings." These control commands may be comprehensive control commands for multiple devices. Therefore, users can achieve advanced control of multiple devices working together to achieve the actual intention of "expectations and feelings" through a brief description (e.g., a sentence), thus improving the user experience.
[0325] Compared to the traditional methods in which users describe control commands and their current physical state, the abstract description of "expected feelings" in this embodiment of the invention can express the user's comprehensive feelings and needs. Furthermore, this description contains more information, thus more fully reflecting the user's actual control intentions.
[0326] In addition, in the voice control mode based on user expectations, by analyzing and understanding the natural language expressed by the user, it is possible to identify the user's comprehensive and complex actual intentions. This solves the problem that users cannot fully describe their actual control intentions in the traditional voice control mode, making the control more precise and improving the user experience.
[0327] In addition, since the "expected feeling" represents the user's actual intention, the user's actual intention can be directly identified or detected from the voice data, rather than "inferring" the user's actual intention from the content of the voice data as in traditional methods, thus significantly improving the accuracy of control.
[0328] Furthermore, users only need to describe their "desired feeling" to control home appliances without needing to understand or master control commands or related information. Moreover, even if users do not know how to control the devices to meet their needs, they can still achieve their control objectives, thereby reducing the operational difficulty and usage threshold of home appliance control and improving the user experience.
[0329] Furthermore, by limiting the interval between the first category of information related to user expectations and the second category of information related to user feelings through preset conditions, the accuracy of identifying the user's actual control intention can be improved, thereby improving the accuracy of the generated control commands, reducing misoperations, and thus better meeting user needs and further improving the user experience.
[0330] This application also provides a computer-readable storage medium storing a computer program that causes a computer to execute the voice control method described in embodiment 1 of this application.
[0331] This application also provides a computer program product, wherein the computer program product stores a computer program that enables a computer to execute the voice control method described in embodiment 1 of this application.
[0332] The home appliances and voice control systems of this application are not only applicable to home scenarios, such as residences and apartments, but also to commercial or public scenarios such as office buildings, shopping malls, schools, and factories. This application does not limit the application scenarios of home appliances and home systems.
[0333] The apparatus and methods described above in the embodiments of this application can be implemented in hardware or in combination with software. This application relates to a computer-readable program that, when executed by a logic component, enables the logic component to implement the apparatus or constituent parts described above, or to implement the various methods or steps described above.
[0334] This application also relates to storage media for storing the above programs, such as hard disks, magnetic disks, optical disks, DVDs, flash memory, etc.
[0335] It should be noted that the limitations on each step involved in this application are not considered as limiting the order of steps, provided that they do not affect the implementation of the specific solution. The steps listed first can be performed first, later, or even simultaneously. As long as the solution can be implemented, they should be considered to fall within the scope of protection of this application.
[0336] The present application has been described above with reference to specific embodiments. However, those skilled in the art should understand that these descriptions are exemplary and not intended to limit the scope of protection of the present application. Those skilled in the art can make various modifications and variations to the present application based on its spirit and principles, and these modifications and variations are also within the scope of the present application.
Claims
1. A voice control method, characterized by, The method includes: Acquire voice data; Identify a first category of information related to the user's expectations and a second category of information related to the user's feelings from the voice data; When the interval between the first category information and the second category information meets a preset condition, a control command is generated based on the first category information and the second category information; and The control commands are used to control the home appliances within the target space.
2. The method according to claim 1, characterized in that, The preset conditions include at least one of the following: The character spacing between the first category information and the second category information is less than or equal to a first threshold. The blank time interval between the first category information and the second category information is less than or equal to the second threshold; The meaning understood in the context of the first category information and the second category information matches the meaning understood solely based on the first category information and the second category information.
3. The method according to claim 1, characterized in that, Identifying a first category of information related to the user's expectations and a second category of information related to the user's feelings from the voice data, including: The speech data is converted into text data, and first category information and second category information are identified in the text data; or The first category of information and the second category of information are identified in the speech data by sound wave matching.
4. The method according to claim 3, characterized in that, Identifying a first category of information and a second category of information in the text data includes: The text data is matched against a preset data table. If a match is successful, the first category information and the second category information in the text data are determined; or, The text data is input into a natural language processing-based model, which outputs the first category information and the second category information.
5. The method according to claim 4, characterized in that, Matching the text data with a preset data table includes: Extract word information from the text data; The word information is matched with the first category keyword set and the second category keyword set in the preset keyword table respectively; The word information that matches the first category of keyword set is determined as the first description information, and the word information that matches the second category of keyword set is determined as the second description information.
6. The method according to claim 1 or 5, characterized in that, The first category of information includes first descriptive information representing the expected intention; The second category of information includes a second descriptive information representing feelings.
7. The method according to claim 6, characterized in that, The second descriptive information representing a feeling includes at least one of a third descriptive information representing bodily sensation and a fourth descriptive information representing mood. The third descriptive information representing bodily sensation includes at least one of scene, location, weather, season, and time, while the fourth descriptive information representing emotional state includes at least one of subjective experience and emotion.
8. The method according to claim 1, characterized in that, The voice data is recognized locally; or The voice data is recognized on the server side; or The voice data is converted into text data on the local end, and the text data is recognized on the server end; or The voice data is subjected to sound wave matching on the local end, and if the matching is successful, the voice data is recognized on the server end.
9. The method according to claim 1, characterized in that, Acquire voice data, including: Continuously acquire voice data for a first preset duration, and stop acquiring voice data when the first preset duration is reached; or The system acquires voice data when user voice data is detected, and stops acquiring voice data when a second preset duration has elapsed after no user voice data has been detected.
10. The method according to claim 1, characterized in that, Generate control instructions based on the first category information and the second category information, including: Based on the first category information and the second category information, determine the environmental parameters that need to be adjusted and the target values of the environmental parameters; Based on the capabilities of the home appliances within the target space, determine whether control of the home appliances can adjust the environmental parameters within the target space to the target value; and If it is determined that the environmental parameters within the target space can be adjusted to the target value, control instructions for the home appliances are generated based on the environmental parameters, the target values of the environmental parameters, and the home appliances used to adjust the environmental parameters.
11. The method according to claim 10, characterized in that, The method further includes: If it is determined that the environmental parameters within the target space cannot be adjusted to the target value, and the environmental parameters are multiple environmental parameters, it is determined whether the control of the home appliance can adjust the necessary environmental parameter among the multiple environmental parameters to the corresponding target value; When the necessary environmental parameter among the various environmental parameters can be adjusted to the corresponding target value, a control command for the home appliance is generated based on the necessary environmental parameter, the target value of the necessary environmental parameter, and the home appliance used to adjust the necessary environmental parameter. When the necessary environmental parameter among the various environmental parameters cannot be adjusted to the corresponding target value, the generation of control commands is paused and the user is notified.
12. The method according to claim 11, characterized in that, The necessary environmental parameters include preset environmental parameters among the multiple environmental parameters, or environmental parameters with preset priorities among the multiple environmental parameters.
13. The method according to claim 12, characterized in that, At least one of the preset environment parameters, the priority of the environment parameters, and the preset priority is preset by the user; or, At least one of the preset environment parameters, the priority of the environment parameters, and the preset priority is set through interaction with the user; or, The preset environment parameters, the priority of the environment parameters, and the preset priority are automatically updated based on at least one of the user's usage habits, environment, current season, user status, and user feedback.
14. The method according to claim 11, characterized in that, The method further includes: If it is determined that the environmental parameters within the target space cannot be adjusted to the target value, at least one of the environmental parameters and the target value shall be adjusted according to the capabilities of the home appliances within the target space; Based on the adjusted environmental parameters, the target values of the environmental parameters, and the home appliances used to adjust the environmental parameters, control instructions for the home appliances are generated.
15. The method according to claim 1, characterized in that, Controlling home appliances within the target space according to the control commands includes: According to the control instructions, multiple home appliances within the target space are controlled.
16. A voice control system, characterized in that, The voice control system includes voice acquisition equipment, a server, control equipment, and at least one home appliance. The voice acquisition device collects the user's voice data; The server acquires the voice data, identifies a first category of information related to the user's expectations and a second category of information related to the user's feelings in the voice data, and generates a control command based on the first category of information and the second category of information when the interval between the first category of information and the second category of information meets a preset condition, and sends the control command to the control device. The control device controls the home appliances in the target space according to the control instructions.
17. A voice control device, characterized in that, The device includes: The acquisition module acquires voice data; The recognition module identifies, in the voice data, a first category of information related to the user's expectations and a second category of information related to the user's feelings; The generation module, when the interval between the first category information and the second category information meets a preset condition, generates control instructions based on the first category information and the second category information; and The control module controls the home appliances in the target space according to the control commands.
18. An electronic device, characterized in that, The electronic device includes: Memory, which stores computer programs; and A processor that, when executing the computer program, implements the method described in any one of claims 1-15.
19. A computer-readable storage medium having a computer program stored thereon, characterized in that, When the computer program is executed by a processor, it implements the method described in any one of claims 1-15.
20. A computer program product having a computer program stored thereon, characterized in that, When the computer program is executed by a processor, it implements the method described in any one of claims 1-15.