280 results about "Speech interaction" patented technology

Speech synthesis method and related equipment

The present application provides a speech synthesis method and related equipment. The method includes the following steps: the identity of a user is determined according to the current input speech of the user; an acoustic model is obtained from an acoustic model library according to the current input speech; basic speech synthesis information is determined according to the identity of the user, wherein the basic speech synthesis information characterizes variable quantities in the preset sound speed, preset volume, and preset pitch of the acoustic model; a reply text is determined; enhanced speech synthesis information is determined according to the reply text and context information, wherein the enhanced speech synthesis information characterizes variable quantities in the preset timbre, tone and preset rhythm of the acoustic model; and speech synthesis is performed on the reply text through the acoustic model according to the basic speech synthesis information and the enhanced speech synthesis information, so that reply speech for the user can be obtained. With the speech synthesis method and related apparatus provided by the embodiments of the invention, a device can provide a personalized speech synthesis effect to the user during a man-machine interaction process, and therefore the speech interaction experience of the user can be improved.
Owner:HUAWEI TECH CO LTD

Multi-round dialogue intelligent voice interaction system and device

The invention discloses a multi-round dialogue intelligent voice interaction system and device. The system comprises a hybrid semantic understanding module, a semantic understanding adaptive module and an automatic dialogue management module. The voice input is converted into text by voice recognition and input to the hybrid semantic understanding module. The hybrid semantic understanding module is used for understanding user intention and extracting corresponding state information; the automatic dialogue management module is used for guiding the dialogue process, outputting dialogue texts and converting the dialogue texts into voice output based on the user intention to realize dialogue; and the semantic understanding adaptive module is used for optimized learning of the hybrid semantic understanding module. According to the invention, a plurality of modules such as speech recognition, natural language understanding, natural language generation, speech synthesis and dialogue management are integrated to form a complete multi-round dialogue intelligent speech interaction system which is easy to expand and configure and can be applied to any scene.
Owner:百融云创科技股份有限公司

Speech control method of home appliance system and home appliance control system

The present invention provides a speech control method and a home appliance control system. The home appliance system includes a plurality of electric devices, each of which is in data connection with a cloud controller; at least a part of the plurality of electric devices are configured with speech acquisition devices and serve as speech interaction devices. The speech control method of the home appliance system includes the following steps: a plurality of speech interaction devices acquire surrounding speech signals by using their respective speech acquisition devices; the acquired speech signals are identified, the signal parameters of the speech signals are extracted, and whether the speech signals match preset wake-up signals is judged; the speech interaction devices which receive speech signals matching the preset wake-up signals send the signal parameters of the received speech signals, so that the cloud controller can select a speech response device from the speech interaction devices according to the parameters of the speech signals; and the cloud controller issues to the speech response device a control instruction for enabling the speech response device to enter a speech response state.
Owner:QINGDAO HAIER SMART TECH R & D CO LTD +1
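
The selection step performed by the cloud controller could be sketched roughly as follows. This is only an illustration: the abstract does not say which signal parameters are reported or how they are compared, so the `WakeReport` type, its `signal_level` field, and the "pick the strongest signal" rule are all assumptions.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class WakeReport:
    """Hypothetical report sent by a device that heard the wake-up signal."""
    device_id: str       # hypothetical device identifier
    signal_level: float  # hypothetical parameter, e.g. wake-word signal energy


def select_response_device(reports: List[WakeReport]) -> Optional[str]:
    """Return the id of the device that should enter the speech response state.

    Assumed rule: the device reporting the highest signal level wins.
    Returns None when no device matched the wake-up signal.
    """
    if not reports:
        return None
    best = max(reports, key=lambda r: r.signal_level)
    return best.device_id


reports = [
    WakeReport("air_conditioner", 0.42),
    WakeReport("refrigerator", 0.77),
    WakeReport("washing_machine", 0.15),
]
chosen = select_response_device(reports)
```

Only the chosen device would then receive the control instruction; the others stay silent, which avoids several appliances answering the same wake-up word.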

Speech interaction method and speech interaction equipment

The application discloses a speech interaction method and speech interaction equipment. An association relationship between a to-be-distinguished speech and historical interaction data is analyzed, where the historical interaction data include a user speech instruction preceding the to-be-distinguished speech and the response result for that instruction; whether the to-be-distinguished speech is an instruction-type speech is then judged according to the association relationship between the two; when the speech is judged to be a non-instruction-type interference speech, no command response is carried out. An erroneous human-machine interaction is thus avoided, and user experience is improved.
Owner:IFLYTEK CO LTD

System and method for providing real-time and reliable multi-person speech interaction in network game

Inactive | CN101316301A | Speed up entry | Real-time multi-person voice interaction | Interconnection arrangements | Transmission | The Internet | Game server
The invention relates to a system and a method which provide real-time and reliable multi-person voice interaction in network games. The system comprises a telecommunication network, a telephone exchange, the Internet, a game server platform in the Internet and a game client platform in the Internet, wherein the game server platform is internally provided with a game server control module, a player information database, a voice server control module and a terminal information database; the game client platform is internally provided with a game client control module and a voice client control module. The method of the invention combines IP telephony with a telecommunication network that provides a QoS guarantee, integrates seamlessly with the game server control module and the game client control module through the voice server control module and the voice client control module, constructs two control platforms, and provides real-time, reliable, convenient and stable multi-person voice interaction. The operation method is simple, easy to master and relatively independent. Therefore, the system and the method allow game manufacturers to concentrate on the development of network games, and have good prospects for popularization and application.
Owner:杨海晨

Robot interaction method and system based on natural language

The invention belongs to the field of robots, and provides a robot interaction method and system based on a natural language to improve the intelligence of a robot and the convenience of human-computer interaction. The method comprises the steps that when the robot is awakened, a voice receiving module recognizes the received natural speech as the corresponding text information; if the text information belongs to a manipulation instruction, a speech interaction processing module issues the manipulation instruction to the robot body to execute the manipulation instruction, otherwise question and answer information is uploaded to a robot cloud server; the robot cloud server intelligently recognizes the question and answer information, and feeds the recognized question answer back to the speech interaction processing module; the speech interaction processing module converts the question answer to speech information corresponding to the question answer; and a speech broadcast module broadcasts speech information corresponding to the question answer to a user. According to the technical scheme provided by the invention, the robot can accurately recognize the meaning of the natural speech of the user, so that human-computer interaction is smooth and convenient.
Owner:SHENZHEN LANGKONG YIKE TECH CO LTD

User terminal for displaying gesture-speech interaction unified interface and display method thereof

The invention discloses a user terminal for displaying a gesture-speech interaction unified interface, which comprises an input device and a display device, wherein the input device is used for receiving at least one of speech input and gesture input of a user; and the display device is used for displaying at least two areas including a first area and a second area, the first area is used for presenting a state relevant to the input speech of the user, and the second area is used for receiving or displaying the gesture input of the user. The invention also discloses a method for displaying a gesture-speech interaction unified interface. According to the user terminal disclosed by the invention, the interaction between a user and the user terminal is more natural and convenient.
Owner:百纳(武汉)信息技术有限公司

Speech interaction method and device

The invention is applicable in the field of speech interaction and provides a speech interaction method and device. The speech interaction method comprises the following steps of: receiving speech data; recognizing the speech data to generate a semantic text; carrying out similarity matching on the semantic text and a generated history speech research record; taking the history speech research record with the similarity exceeding a specified threshold as a basic database; determining at least one text to be matched after carrying out screening treatment on the basic database; matching the semantic text with the determined at least one text to be matched; and executing corresponding operation according to a matching result. The embodiment of the invention can increase the accuracy rate and the success rate of speech interaction.
Owner:TCL CORPORATION
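
The matching pipeline in this abstract can be illustrated with a short sketch. It assumes, purely for illustration, that similarity is measured with the character-level ratio from Python's standard `difflib` module and that the "screening treatment" is simple de-duplication; the abstract specifies neither, and the function names and threshold value are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, Optional


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] (an assumed metric)."""
    return SequenceMatcher(None, a, b).ratio()


def build_basic_database(semantic_text: str, history: List[str],
                         threshold: float = 0.6) -> List[str]:
    """Keep history records whose similarity exceeds the threshold."""
    return [h for h in history if similarity(semantic_text, h) > threshold]


def screen(basic_database: List[str]) -> List[str]:
    """Hypothetical screening step: drop duplicates, keep first occurrences."""
    return list(dict.fromkeys(basic_database))


def best_match(semantic_text: str, candidates: List[str]) -> Optional[str]:
    """Return the candidate text most similar to the recognized text."""
    if not candidates:
        return None
    return max(candidates, key=lambda c: similarity(semantic_text, c))


history = ["play music", "play some music", "play music", "turn off the light"]
recognized = "play music now"
candidates = screen(build_basic_database(recognized, history))
match = best_match(recognized, candidates)
```

The operation bound to `match` would then be executed; when no history record passes the threshold, `best_match` returns `None` and the caller can fall back to ordinary recognition.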

Voice interaction method and device

The present invention discloses a speech interaction method and apparatus, and pertains to the field of speech processing technologies. The method includes: acquiring speech data of a user; performing user attribute recognition on the speech data to obtain a first user attribute recognition result; performing content recognition on the speech data to obtain a content recognition result of the speech data; and performing a corresponding operation according to at least the first user attribute recognition result and the content recognition result, so as to respond to the speech data. According to the present invention, after speech data is acquired, user attribute recognition and content recognition are separately performed on the speech data to obtain a first user attribute recognition result and a content recognition result, and a corresponding operation is performed according to at least the first user attribute recognition result and the content recognition result.
Owner:HUAWEI TECH CO LTD

Speech interaction method, speech interaction device and robot

The embodiment of the invention provides a speech interaction method, a speech interaction device and a robot. The method is applied to the robot, and comprises the steps of: obtaining an image when the sound source angle of a received sound signal is within a preset angle range of the robot, and recognizing the angles of one or more human faces in the image; selecting the person whose face angle is closest to the sound source angle as the speaker; and adjusting the angle of the robot so that the center of the speaker's face falls in the center of the front of the robot, so that the sound signal is responded to. According to the speech interaction method, the speech interaction device and the robot, the speech interaction function of the robot can be made more intelligent and human-like.
Owner:JIANGSU MUMENG INTELLIGENT TECH
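
The speaker-selection step can be sketched as below. The sketch assumes angles are given in degrees relative to the robot's front, and the preset range of -60 to 60 degrees is a made-up value; the abstract gives no concrete numbers or angle convention.

```python
from typing import List, Optional, Tuple


def angular_distance(a: float, b: float) -> float:
    """Smallest absolute difference between two angles, in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


def pick_speaker(sound_angle: float, face_angles: List[float],
                 preset_range: Tuple[float, float] = (-60.0, 60.0)) -> Optional[int]:
    """Return the index of the face closest to the sound source angle.

    Returns None if the sound source lies outside the (assumed) preset
    range or if no face was detected in the image.
    """
    low, high = preset_range
    if not (low <= sound_angle <= high) or not face_angles:
        return None
    return min(range(len(face_angles)),
               key=lambda i: angular_distance(face_angles[i], sound_angle))


faces = [-30.0, 15.0, 40.0]        # face angles recognized in the image
speaker = pick_speaker(20.0, faces)
# The robot would then rotate by faces[speaker] degrees to face the speaker.
```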

Intelligent speech interaction method and intelligent speech interaction system

The invention discloses an intelligent speech interaction method and an intelligent speech interaction system. The intelligent speech interaction method comprises steps that user interaction speech is received; the speech recognition and the semantic comprehension of the interaction speech are carried out to acquire an identified text and a semantic comprehension result; whether a current speech segment is a single person speech is determined; when yes, a response is provided according to the semantic comprehension result; when no, an instruction relation among the various characters of the current speech field is determined according to the current speech segment and the corresponding semantic comprehension result, and then the response is provided according to the instruction relation among the various characters. The accuracy of the response in man-machine interaction environment, in which a lot of people participate, is improved, and user experience is improved.
Owner:IFLYTEK CO LTD

Vehicle speech interaction method and system and computer readable storage medium

The present invention discloses a vehicle speech interaction method and system and a computer readable storage medium. The method comprises: when a speech interaction device receives a user speech instruction, sending the speech instruction to a preset cloud server; receiving, through the speech interaction device, an analysis result sent by the cloud server, wherein the analysis result is obtained by the cloud server through Internet-based data analysis of the speech instruction; and controlling, through the speech interaction device, the corresponding function devices of the vehicle based on the analysis result, so as to fulfil the function demands corresponding to the speech instruction. By drawing on the diversified function services of the Internet, the vehicle speech interaction method and system and the computer readable storage medium combine the speech interaction mode with the Internet to effectively satisfy users' various function requirements, improve the work efficiency of the vehicle system and improve the user experience.
Owner:陈世科

Speech interaction satisfaction determination method and device

The embodiment of the invention provides a speech interaction satisfaction determination method and a device. The method comprises steps: speech interaction features are acquired, wherein the speech interaction features comprise objective data of speech interaction and subjective data of speech interaction, and the objective data and the subjective data are data for the same theme; the objective data are evaluated and processed to obtain objective evaluation and the subjective data are evaluated and processed to obtain subjective evaluation; and the objective data and the subjective data are used as input of a satisfaction evaluation model, and the speech interaction satisfaction outputted by the satisfaction evaluation model is obtained. True and comprehensive evaluation can be provided for speech interaction.
Owner:BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD +1

Deep learning-based intelligent industrial robot speech interaction and control method

The invention discloses a deep learning-based intelligent industrial robot speech interaction and control method. The method comprises the following steps that: 1) speech is converted into a speech spectrum: original speech is converted into an image through FFT (Fast Fourier Transformation), wherein the image can be used as input; 2) modeling is performed on the whole speech sentence: the speech spectrum, adopted as input, is utilized to perform unsupervised training on a convolutional neural network; 3) the output sequence O of the convolutional neural network is compared with a tag T, and the convolutional neural network is adjusted in a supervised manner through the BP algorithm; and 4) specific text information is inputted into a robot as a control command. According to the deep learning-based intelligent industrial robot speech interaction and control method of the invention, the speech recognition technology and the industrial robot are combined together, and therefore, a traditional production mode is changed, the labor intensity of workers is decreased, labor productivity is enhanced, and the intelligentization development of industrial technologies can be promoted.
Owner:SOUTH CHINA UNIV OF TECH

Outdoor blind guidance service system and method oriented to visually impaired people

Inactive | CN101483806A | Precise spatial analysis capabilities | Make up for visual deficiencies | Instruments for road network navigation | Location information based service | Hand held | Crowds
The invention discloses an outdoor blind guiding service system oriented to visually impaired people, which comprises a handheld mobile terminal, a mobile communication system, a mobile positioning system (GPS, CORS), obstacle detecting equipment and a cane for the blind, an urban GIS data application server, etc. The handheld mobile terminal uses voice recognition to realize speech interaction with users; it also obtains information about the user and the surrounding environment through the mobile positioning system and the radar for the blind, and uses the mobile communication system to communicate with the urban GIS data application server so as to provide visually impaired people with a real-time voice guidance service along the moving route. The invention also discloses a method for providing an outdoor blind guiding service for visually impaired people with the system. The system makes full use of, and improves on, existing equipment and facilities, so only a specific handheld terminal is needed, and the requirements for other equipment and networks can be met with existing resources.
Owner:NANJING NORMAL UNIVERSITY

Speech interaction method and device

The present application provides a speech interaction method and device. The method comprises the following steps that: speech signals are received and are adopted as target speech signals; whether the target speech signals contain user speech is detected; if the target speech signals contain the user speech, noise volume in an environment is determined; and an interaction instruction corresponding to target user speech is responded according to the noise volume, wherein the target user speech is the user speech in the target speech signals. With the embodiments of the invention adopted, the fluency of a speech interaction process can be improved, and user experience can be improved.
Owner:易视星空科技无锡有限公司

Method for managing mixed initiative human-machine dialogues based on interactive speech

Method for managing mixed-initiative human-machine dialogues based on speech interaction, exploiting the separation between general dialogue knowledge, such as communicative acts, which can be used in multiple application domains, and particular linguistic knowledge, namely domain-specific parameters, to process the dialogue as a sequence of changes of status. Each status consists of a set of features linked both to the processed parameters and to the linguistic and pragmatic context, and describes a certain instant of the communicative situation between the user and the system so as to discriminate it from other situations that differ only slightly. The method employs three components. A first component, given the various parameters of the domain, defines the parameters on which to intervene to modify the status, with the intent of converging towards a situation in which all parameters are acquired with a certain value; in parallel, the component identifies the Communicative Act (CA) which, applied to these parameters, can make the status evolve in the required direction. A second component creates the sentences to be conveyed to the user, thereby obtaining a Communicative Act by instantiating said parameters. A third component analyses the user's reply to determine the new system status, given the parameters that were provided by the user, their mutual coherence, the previous status of these parameters and other correlated parameters.
Owner:NUANCE COMM INC

Speech interaction method and device, electronic equipment and storage medium

The embodiment of the invention discloses a speech interaction method and device, electronic equipment and a storage medium. The method includes: monitoring a speech of a user in a session process, and identifying current session intention of the user on the basis of the speech; and combining the current session intention and a current session state to execute a session task, wherein the session task includes personalized response and session contents corresponding to the current session intention. According to the solution of the embodiment of the invention, interaction between a robot and the user is enabled to be more personalized, and emotion distance to the user is shortened, at the same time, efficiency of customer service communication is also improved while user demand is satisfied, and user experience is improved.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Emotion recognition method and device, computer equipment and storage medium

The embodiment of the invention provides an emotion recognition method and device, computer equipment and a storage medium. The method comprises the following steps: determining the current conversation text of the current conversation speech by using speech recognition technology; matching the current conversation text with all the preset emotion recognition templates so as to obtain a first recognition result; recognizing the current conversation text by using the pre-trained emotion recognition model so as to obtain a second recognition result; and obtaining the emotional state of the current conversation text according to the first recognition result and the second recognition result. According to the emotion recognition method, the emotional state of the conversation text is recognized by combining the emotion recognition templates with the emotion recognition model, so that the accuracy of emotional state recognition can be enhanced, the dependence on human operation can be reduced, the labor cost can be reduced, and the defect that the speech interaction effect is difficult to control can be overcome.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD
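
The two-path recognition described here could look roughly like the sketch below. The templates, the stand-in model, and the rule for combining the two results (a template hit takes precedence, otherwise the model's label is used) are all assumptions made for illustration; the abstract does not disclose the actual templates, model, or fusion rule.

```python
import re
from typing import Callable, Dict, Optional

# Hypothetical emotion recognition templates: regex pattern -> emotion label.
TEMPLATES: Dict[str, str] = {
    r"\b(thank you|great|wonderful)\b": "positive",
    r"\b(terrible|useless|angry)\b": "negative",
}


def match_templates(text: str) -> Optional[str]:
    """First recognition result: label of the first matching template, if any."""
    for pattern, label in TEMPLATES.items():
        if re.search(pattern, text, re.IGNORECASE):
            return label
    return None


def recognize_emotion(text: str, model: Callable[[str], str]) -> str:
    """Combine the template result with the model result (assumed rule)."""
    first = match_templates(text)   # template-based result
    second = model(text)            # model-based result
    return first if first is not None else second


def stub_model(text: str) -> str:
    """Stand-in for a pre-trained emotion recognition model."""
    return "neutral"


state_a = recognize_emotion("This service is terrible", stub_model)
state_b = recognize_emotion("What time is it", stub_model)
```

Giving templates precedence is one simple design choice: hand-written patterns act as high-precision overrides, while the model covers text that no template anticipates.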

Interaction type intelligent household service system and interaction type intelligent household service method

The invention relates to an interaction type intelligent household service system and an interaction type intelligent household service method. The interaction type intelligent household service system comprises a video monitoring unit, a speech unit, an interaction terminal, and an analyzing server. A conventional intelligent household system is reconstructed to have an automatic sensing capability and a self-learning capability, and therefore intelligent interaction experience of equipment is improved. The analyzing server is used for intelligent analysis of user scenes, identities, and behaviors according to visual sense, auditory sense, and communication data, and is used for pattern class switching. The user demands in the different scenes are detected actively, and the interaction and the service are realized in an intelligent way. A distributed speech interaction system is used to integrate the functions of the different intelligent equipments, and therefore active interaction of information is realized, and structure connection complexity is reduced.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI

Speech interaction method and device

The present invention discloses a speech interaction method. The method comprises: when a first control instruction is received, enabling a speech collection function, outputting a first speech prompting, and starting a speech collection progress icon to allow the speech collection progress icon to move along a setting direction; when the speech collection progress icon is moved along the setting direction, performing collection of speech signals in a current environment; when speech signals are collected before the speech collection progress icon is moved to a limitation position along the setting direction, analyzing the speech signals, and obtaining speech data; matching the speech data and the instruction data in a local instruction bank; and when determining that the speech data and the instruction data in the local instruction bank are successfully matched, outputting a second speech prompting corresponding to the instruction data, and executing a speech instruction corresponding to the speech data. The present invention further discloses a speech interaction device.
Owner:刘平舟

Speech enhancement method and system, computer equipment and storage medium

The invention provides a speech enhancement method and system, computer equipment and a storage medium, and relates to the technical field of human-machine speech interaction. The method comprises the following steps: collecting multi-channel acoustic signals through an acoustic vector sensor, preprocessing the multi-channel acoustic signals and acquiring a time-frequency spectrum, filtering the time-frequency spectrum and outputting a signal atlas; performing masking processing on the signal atlas through a nonlinear mask, and outputting an enhanced single-channel speech spectrogram; inputting the single-channel spectrogram into a deep neural network mask estimation model and outputting a mask spectrogram; performing time-frequency masking enhancement on the signal atlas through the mask spectrogram to acquire an enhanced amplitude speech spectrogram; and reconstructing from the enhanced amplitude speech spectrogram so as to output an enhanced target speech signal. The technical problem that multi-channel speech enhancement has high hardware cost, a large collection system volume and high operation complexity is solved, and an excellent speech enhancement effect can be acquired under different interference noise types, strengths and room reverberation conditions.
Owner:PEKING UNIV SHENZHEN GRADUATE SCHOOL

Robot intelligent interaction method and intelligent robot

The invention discloses a robot intelligent interaction method and an intelligent robot. The method comprises the steps that an infrared sensor on the robot judges whether anyone is in a target range; if a human is present, a monocular vision positioning principle based on the coplanar P4P is used to position the human body target object; after the human body target object is positioned, face feature data are acquired based on a face identification technology; whether the human body target object is an interactive object is judged based on the face feature data; if the human body target object is an interactive object, the age range of the human body target object is identified based on the face feature data; scene mode data are constructed based on the age range of the human body target object; and a speech content corresponding to the scene mode data is output based on a speech interaction module. According to the embodiment of the invention, precise matching of interaction scene contents is realized, and the interaction scene mode is more interesting.
Owner:华南智能机器人创新研究院

Speech enhanced interaction method, system, storage medium and electronic device

The invention provides a speech enhanced interaction method, a speech enhanced interaction system, a storage medium and an electronic device. The method includes the following steps that: the time-domain signals of microphones in an annular microphone array are converted into the frequency-domain signals of the microphones, and reverberation suppression and stationary noise suppression are performed on the frequency-domain signals of the microphones; wake-up direction sound source positioning is performed based on the reverberation and stationary noise-removed frequency-domain signals of the microphones, so that a wake-up direction is obtained; main direction beam time-domain signals and wake-up direction beam time-domain signals are obtained in a main direction and the wake-up direction on the basis of the reverberation and stationary noise-removed frequency-domain signals of the microphones; speech recognition is performed on the main direction beam time-domain signals; and wake-up word recognition is performed on the wake-up direction beam time-domain signals, and if the signals are identified as wake-up words, the main direction is changed to the obtained wake-up direction. With the speech enhanced interaction method, the speech enhanced interaction system, the storage medium and the electronic device of the invention adopted, the stability and reliability of speech interaction can be improved effectively.
Owner:FUZHOU ROCKCHIP SEMICON

Speech interaction apparatus and speech interaction method

A speech interaction apparatus starts pushing information to a user and executes a speech interaction about the information in a case where (i) an interaction starting condition for starting pushing the information is set and (ii) the interaction starting condition is satisfied. The speech interaction apparatus includes an interaction policy setting unit and a speech interaction unit. The interaction policy setting unit sets, in consideration of a drive route intended by the user, an interaction policy of certain information which satisfies an interaction starting condition. The speech interaction unit pushes the certain information and executes a speech interaction about the certain information in accordance with the interaction policy set by the interaction policy setting unit. This enables execution of a user-friendly speech interaction while maintaining safety during a drive.
Owner:DENSO CORP

Intelligent speech interaction system based on cloud end

The invention discloses an intelligent speech interaction system based on a cloud end. The intelligent speech interaction system comprises a speech acquisition system, a semantic comprehension system, and a speech response playing system. An MIC audio acquisition device is used to transmit an acquired question audio signal to an audio acquisition module. The audio acquisition module is used to convert an audio input signal into a digital signal, which can be processed by a processor, by adopting an AD conversion way, and the audio digital signal is transmitted to a cloud end data processing module by a wireless network. The cloud end data processing module is used for the data processing and parsing of the acquired audio digital signal by a core control module, and is used for realizing specific function codes. An answer audio signal after being processed and parsed is converted into a playable answer audio signal by an audio playing module, and the answer audio signal is played by an audio playing device.
Owner:北京中科汇联科技股份有限公司

Method and Apparatus for Speech Interaction with Children

A method and apparatus for performing speech interaction with children is provided. The apparatus may be a computing device that includes at least one camera, at least one microphone, memory, and at least one processor for executing stored instructions. The at least one processor may be configured to determine an age range or an age or skill level of the child. The computing device may receive one or more inputs from the child. The at least one processor may perform analysis on the one or more inputs based at least in part on the determined age range or the age or the skill level of the child, and output a speech response to the child based on the performed analysis.
Owner:HELLO CLOVER LLC

Smart unmanned shared home service robot, shared system and business model

The invention discloses a smart unmanned shared home service robot. At least a base seat of an automatic travel system of unmanned driving is arranged. The smart unmanned shared home service robot also includes a main control module, a communication module, an environment sensor, a liquid crystal display screen, a collision sensor, a speech interaction system, a human body pyroelectric-sensor, a photoelectric sensor, a smoke sensor, a camera and a multi-function module, and is used for forming one or more items of a home security-monitoring function system, a home electrical-apparatus management function system, a family housemaid function system, a family doctor function system, a home entertainment accompanying function system and a home fire protection management control function system. The robot realizes multi-function integration, carries out omnibearing home service, can be effectively promoted, and can also reduce economic pressure of a user.

A method for implementing speech interaction application scene

The invention provides a method for realizing a voice interactive application, which comprises the steps of: defining a plurality of situations, each of which corresponds to a plurality of label combinations in VoiceXML (Voice Extensible Markup Language) that accomplish predetermined functions; integrating at least one of the said situations in accordance with demands; obtaining VoiceXML labels based on the combined situations; and producing the corresponding VoiceXML file based on the VoiceXML syntax. The invention realizes flexibility in skip judgment.
Owner:LENOVO (BEIJING) CO LTD
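
The idea of composing a VoiceXML file from predefined situations can be sketched with Python's standard `xml.etree.ElementTree` module. The two situations below (`greeting_situation`, `question_situation`) and the way they are combined into a single form are hypothetical; only the element names (`vxml`, `form`, `block`, `field`, `prompt`) come from the VoiceXML language itself.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Dict, List, Tuple

Situation = Tuple[Callable[..., None], Dict[str, str]]


def greeting_situation(form: ET.Element, text: str) -> None:
    """Hypothetical situation: play a prompt (a <block> with a <prompt>)."""
    block = ET.SubElement(form, "block")
    ET.SubElement(block, "prompt").text = text


def question_situation(form: ET.Element, name: str, prompt_text: str) -> None:
    """Hypothetical situation: ask the caller for a value (a <field>)."""
    field = ET.SubElement(form, "field", name=name)
    ET.SubElement(field, "prompt").text = prompt_text


def build_vxml(situations: List[Situation]) -> str:
    """Combine the selected situations into one VoiceXML document string."""
    root = ET.Element("vxml", {"version": "2.0",
                               "xmlns": "http://www.w3.org/2001/vxml"})
    form = ET.SubElement(root, "form", id="main")
    for add_labels, kwargs in situations:
        add_labels(form, **kwargs)   # each situation appends its own labels
    return ET.tostring(root, encoding="unicode")


document = build_vxml([
    (greeting_situation, {"text": "Welcome"}),
    (question_situation, {"name": "city", "prompt_text": "Which city?"}),
])
```

Because each situation is a self-contained function, an application can be reassembled by changing only the list passed to `build_vxml`.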

Speech interaction device, speech interaction method and speech interaction type LED asynchronous control system terminal

The invention relates to a speech interaction device, a speech interaction method and a speech interaction type LED asynchronous control system terminal with the speech interaction device. The speech interaction device comprises the components of a network transceiver module, an interaction mode identification module, a language type determination module, a speech information analyzing and processing module, a command executing and processing module, and an executing result processing module. The speech interaction device can identify a fact that the speech interaction type LED asynchronous control system terminal is in a short-distance speech interaction mode or a remote speech interaction mode according to speech commands from different transmission approaches. Afterwards the language type which is adopted in the speech command can be determined, and a control command which corresponds with the speech command is analyzed and executed. Finally, a corresponding prompting speech is output according to a command executing result. Therefore, the speech interaction device can realize intelligent speech interaction between a user and the LED asynchronous control system terminal and relatively high user experience and furthermore can satisfy an intelligent requirement of the user.
Owner:XIAN NOVASTAR TECH