1274 results about "Vocal sound" patented technology

The human voice consists of sound made by a human being using the vocal tract, such as talking, singing, laughing, crying, screaming, etc. The human voice frequency is specifically a part of human sound production in which the vocal folds (vocal cords) are the primary sound source.

Voice alert in dentistry

Dentistry equipment includes a voice alert device adapted to annunciate a status of the dentistry equipment or a process performed by the dentistry equipment. The voice alert device in various embodiments employs synthesized and recorded human voices.
Owner:DISCUS DENTAL LLC

Energy harvesting computer device in association with a communication device configured with apparatus for boosting signal reception

Active | US20130157729A1 | Improve consumer electronics hybrid consumer electronics performance | Low density | Material nanotechnology | Energy efficient ICT | Cellular telephone | Communication device
Disclosed embodiments comprise an energy harvesting computer device in association with a communication device comprising interactive user interface operatively configured with CMOS multiple antennas on chip for boosting signal receptions and for providing faster data transmission speed. Disclosed embodiment encompasses three modes of communications—the Cell phone, wireless Internet applications, and Global communication and media information. Embodiments provide communication apparatus operable to enhance mobile communication efficiency with touch sensitive display comprising energy harvesting platform in communication with a charging circuit board configured with memories, processors, sensors, and modules. Embodiments further provide a gaming device, a wireless media device configured with touch pads comprising sensors being embedded in silicon substrate and fused in nano-fiber / microfiber material having excellent electrical characteristics. Certain embodiments provide communication apparatus configured for voice enabled applications comprising human voice auditory operable to convert text into voice auditory and / or voice auditory into text applications.
Owner:TABE JOSEPH AKWO

Media delivery platform

An improved method for delivery and play back of sound and image files is provided. This new method includes the use of sound and / or image clips, which can be snippets or full files, as alerts for a variety of electronic devices or for playing on a handheld device, and for use as a promotion to sell items associated with the files. A collection or library of uniquely selected and / or edited clips may also be provided to the consumer in a manner far more conveniently on conventional telephone equipment than previously available. Algorithms are provided for the delivery, storage and playback of the sound files, including a delivery method algorithm (500), a parametric optimization and compression algorithm (1500), and an error correction algorithm. In contrast to the conventional ring tones or musical chimes used to ring cellular phones currently on the market, the current invention provides a method for ringing cellular phones and landline telephones with real sound recordings including real music, which may be songs lifted from copyright registered CD tracks, and may comprise human voice, various instrument sounds, and other sound effects of a high quality. A software based system for encoding the hardware of existing cellular phones at the time of manufacturing with delivery, storage, and playback capabilities in accordance with the present invention is provided, such that additional hardware is not required.
Owner:SKKY

Automated transcription system and method using two speech converting instances and computer-assisted correction

A system for automating transcription services for one or more users. This system receives a voice dictation file from a current user, which is automatically converted into a first written text based on a set of conversion variables. The same voice dictation file is automatically converted into a second written text based on a second set of conversion variables. The first and second sets of conversion variables have at least one difference, such as different speech recognition programs, different vocabularies, and the like. The system further includes a program for manually editing a copy of the first and second written text to create a verbatim text of the voice dictation file. This verbatim text can be delivered to the current user as transcribed text. The verbatim text can also be fed back into each speech recognition instance to improve the accuracy of each instance with respect to the human voice in the file.
Owner:CUSTOM SPEECH USA
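
The comparison step implied by this abstract, where two differently configured recognition passes produce two texts that an editor then reconciles, can be illustrated with a small sketch. This is not the patented method: the function name `find_disagreements` and the sample sentences are hypothetical, and Python's standard `difflib` stands in for whatever alignment the real system uses. The idea shown is simply that word spans where the two outputs disagree are the spots most worth a human's attention.

```python
import difflib


def find_disagreements(first_text: str, second_text: str):
    """Return (first_words, second_words) pairs where two transcripts differ.

    Hypothetical helper: aligns the two texts word by word and keeps
    only the spans that are not identical in both.
    """
    a = first_text.split()
    b = second_text.split()
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    disagreements = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            disagreements.append((" ".join(a[i1:i2]), " ".join(b[j1:j2])))
    return disagreements


# Made-up outputs of two recognition passes over the same dictation file.
first = "the patient has a history of high pretension"
second = "the patient has a history of hypertension"
print(find_disagreements(first, second))
```

An editing tool could present each returned pair next to the audio so that the editor confirms the verbatim wording for only those spans.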

Voice intelligibility enhancement system

Intelligibility of a human voice projected by a loudspeaker in an environment of high ambient noise is enhanced by processing a voice signal in accordance with the frequency response characteristics of the human hearing system. Intelligibility of the human voice is derived largely from the pattern of frequency distribution of voice sounds, such as formants, as perceived by the human hearing system. Intelligibility of speech in a voice signal is enhanced by filtering and expanding the voice signal with a transfer function that approximates an inverse of equal loudness contours for tones in a frontal sound field for humans of average hearing acuity.
Owner:DTS

Mega communication and media apparatus configured to provide faster data transmission speed and to generate electrical energy

Disclosed embodiments comprise communication apparatus operatively configured with CMOS multiple antennas disposed on a chip for boosting communication signals to and for enabling faster data transmission speed and to provide interactive user interface. The communication apparatus is further configured to convert sound waves, vibrations, solar energy, wind force and pressure force into electrical energy communicable to a battery cell. Disclosed embodiment encompasses three modes of communications—the Cell phone, wireless Internet applications, and Global communication and media information. Embodiments provide communication apparatus operable to enhance mobile communication efficiency with touch sensitive display and provide energy harvesting platform on at least the housing for the apparatus and / or the circuit board configured with memories, processors, and modules. Embodiments provide advanced computing and media applications, including in-vehicle interactive communications and wireless Internet applications. Embodiments further provide a gaming device, a wireless media device configured with touch pads comprising sensors being embedded in silicon substrate and fused in micro fiber material having excellent electrical characteristics. Certain embodiments provide communication apparatus configured for voice enabled applications comprising human voice auditory operable to convert text into voice auditory and / or voice auditory into text applications.
Owner:TABE JOSEPH AKWO

Mega communication and media apparatus configured to prevent brain cancerous deseases and to generate electrical energy

Disclosed embodiments comprise a computer device comprising communication apparatus operatively configured for boosting communication signals to prevent cancerous diseases and to provide interactive user interface. The communication apparatus is further configured to convert sound waves, vibrations, solar energy, wind force and pressure force into electrical energy communicable to a battery cell. Disclosed embodiment encompasses three modes of communications—the Cell phone, wireless Internet applications, and Global communication and information. Embodiments provide communication apparatus operable to enhance mobile communication efficiency with touch sensitive display and energy platform configured with memories, processors, modules, and including advanced vehicular computing and media applications for in-vehicle interactive communications and wireless Internet applications. Embodiments further provide a communication apparatus comprising a gaming device, a wireless media device with visor screen configured with touch pads comprising sensors being embedded in silicon substrate and fused in micro fiber material having excellent electrical characteristics. Certain embodiments provide communication apparatus configured for voice enabled applications comprising human voice auditory operable to convert text to voice auditory and / or voice auditory to text applications, providing signal amplification with better data and graphical transmission. Embodiments further provide a communication apparatus comprising a media device configured for various communications and Internet applications.
Owner:TABE JOSEPH AKWO

Communication system with distributed intelligence

A communications system with distributed intelligence thereby allowing easy expansion of the system while also providing a high degree of fault tolerance. The communications system allows for conversion, transmission and restoration of the human voice over a digital network. A telephone can be used as the input device for receiving the analog signal, or human voice. The telephone is also used to select a destination for the analog signal. The information is transmitted over the digital network and may also traverse through a private branch exchange and a wireless network before it arrives at its destination. At the destination the transmitted information is restored to an analog signal and played over one or more speakers. The present system is especially suited for use with an existing network.
Owner:HOOVER THOMAS R

Method, apparatus and computer code for selectively providing access to a service in accordance with spoken content received from a user

Apparatus, methods and computer-readable medium for authenticating a user and selectively providing access to a computer service are described herein. In some embodiments, a) a user input is solicited; b) a voice response to the solicitation is received on or from a client device; c) if a determination is made, in accordance with one or more speech delivery features of the voice response, that the voice response is a live human voice response, the client device is permitted to access a computer service; and d) otherwise, client device access to the computer service is denied. Optionally, the access may be permitted only to a pre-determined gender or a pre-determined age group.
Owner:PUDDING HLDG ISRAEL

Mega communication and media apparatus configured to provide faster data transmission speed and to generate electrical energy

Disclosed embodiments comprise energy harvesting device comprising a communication apparatus operatively configured with CMOS multiple antennas disposed on a chip for boosting communication signals and for enabling faster data transmission speed. The communication apparatus is configured to harvest energy within its environment and is further disposed with interactive user interface. The communication apparatus is further configured to convert sound waves, vibrations, solar energy, wind force and pressure force into electrical energy communicable to a battery cell. Disclosed embodiment encompasses three modes of communications—the Cell phone, wireless Internet applications, and Global communication and media information. Embodiments provide communication apparatus operable to enhance mobile communication efficiency with touch sensitive display and provide energy harvesting platform on at least the housing for the communication apparatus and / or the communication circuit board. The communication circuit board is configured with memories, processors, and modules. Embodiments provide advanced computing and media applications, including in-vehicle interactive communications and wireless Internet applications. Embodiments further provide a gaming device, a wireless media device configured with touch pads comprising sensors being embedded in silicon substrate and etched / fused in micro fiber material having excellent electrical characteristics. Certain embodiments provide the communication apparatus being configured for voice enabled applications comprising human voice auditory operable to convert text into voice auditory and / or voice auditory into text applications.
Owner:TABE JOSEPH AKWO

Method and apparatus for audio broadcast of enhanced musical instrument digital interface (MIDI) data formats for control of a sound generator to create music, lyrics, and speech

A method and apparatus for the transmission and reception of broadcasted instrumental music, vocal music, and speech using digital techniques. The data is structured in a manner similar to the current standards for MIDI data. Transmitters broadcast the data to receivers which contain internal sound generators or an interface to external sound generators that create sounds in response to the data. The invention includes transmission of multiple audio data signals for several languages on a conventional radio and television carrier through the use of low bandwidth data. Error detection and correction data is included within the transmitted data. The receiver has various error compensating mechanisms to overcome errors in data that cannot be corrected using the error correcting data that the transmitter sent. The data encodes for elemental vocal sounds and music.
Owner:ELAM CARL

Network teaching method and system with voice recognition function

The invention provides a network teaching method and system. Double identity verification of face recognition and voice recognition is realized, the voice signal collection accuracy in a teaching process, an oral training process, a test process and an examination process are scored and assessed by a plurality of models, so that the assessment accuracy is improved, the network teaching is more autonomous and effective, and particularly in reading and listening and recitation teaching, the authenticity and effectiveness of learning can be improved by these functions of the system. By adopting the method provided by the invention, the face recognition is combined with the voice recognition, the user identity is checked before an oral test or system login of the user, and the user can be better encouraged to carry out a human voice test in a use process.
Owner:SHENZHEN EAGLESOUL EDUCATION SERVICE CO LTD

Capture and application of sender voice dynamics to enhance communication in a speech-to-text environment

A method for providing voice dynamics of human utterances converted to and represented by text within a data processing system. A plurality of predetermined parameters for recognition and representation of dynamics in human utterances are selected. An enhanced human speech recognition software program is created implementing the predetermined parameters on a data processing system. The enhanced software program includes an ability to monitor and record human voice dynamics and provide speech-to-text recognition. The dynamics in a human utterance is captured utilizing the enhanced human speech recognition software. The human utterance is converted into a textual representation utilizing the speech-to-text ability of the software. Finally, the dynamics are merged along with the textual representation of the human utterance to produce a marked-up text document on the data processing system.
Owner:NUANCE COMM INC
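
The final merge step described above, combining captured voice dynamics with recognized text into a marked-up document, might look roughly like the following sketch. Everything specific here is an assumption made for illustration: the `Word` record, the two chosen parameters (loudness and pause length), their thresholds, and the `<loud>` and `<pause/>` tags are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """One recognized word plus the dynamics captured with it (hypothetical)."""
    text: str
    loudness_db: float
    pause_after_s: float


def mark_up(words: List[Word], loud_db: float = 70.0, long_pause_s: float = 0.5) -> str:
    """Merge dynamics into the text as simple inline tags."""
    parts = []
    for w in words:
        # Wrap words spoken at or above the loudness threshold.
        token = f"<loud>{w.text}</loud>" if w.loudness_db >= loud_db else w.text
        parts.append(token)
        # Record a noticeable pause following the word.
        if w.pause_after_s >= long_pause_s:
            parts.append("<pause/>")
    return " ".join(parts)


utterance = [Word("I", 60.0, 0.1), Word("said", 62.0, 0.6), Word("no", 78.0, 0.0)]
print(mark_up(utterance))
```

The thresholds play the role of the "predetermined parameters" in the abstract; a real system would presumably select and tune them per speaker or per application.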

Multifunctional home service robot

The invention relates to a multifunctional home service robot. The multifunctional home service robot is characterized by comprising an intelligent control system, a mechanical structure and a remote control terminal, wherein the intelligent control system is installed on the mechanical structure, the remote control terminal can regulate and control the intelligent control system, and thus the mechanical structure is driven to execute corresponding commands. The multifunctional home service robot can carry out voice-image feedback and limb feedback according to human voice commands and meanwhile has the functions of dust removing, disinfecting, organic-matter decomposing, gas detecting, air purifying, self-navigation traveling, self-action charging and the like. All-dimensional camera shooting and projection entertainment can be controlled through an intelligent microcomputer. The multifunctional home service robot can be used for information storage and voice, text and image intelligent interaction. The multifunctional home service robot also has the functions of indoor security inspection under remote control, emergency alarming and the like.
Owner:BEIJING EVOLVER ROBOTICS TECH CO LTD

Vehicle remote control method based on voice command, apparatus and system thereof

The invention, which relates to the vehicle voice control technology, discloses a vehicle remote control method based on a voice command, an apparatus and a system thereof. The method, the apparatus and the system are used to improve safety of the vehicle remote control. The vehicle remote control method based on the voice command comprises the following steps that: a communication terminal and a cloud computing platform server establish communication connection according to a sending connection control command of a user; the communication terminal receives a vehicle control voice command sent by the user and sends to the cloud computing platform server; the cloud computing platform server identifies, analyzes the vehicle control voice command by using an unspecified human voice identification technology so as to obtain the vehicle control information and return to the communication terminal; the communication terminal controls the vehicle which establishes a transmission relation with the communication terminal according to the vehicle control instruction. A scheme of the invention is suitable for carrying out remote voice intelligence control to the vehicle.
Owner:SHENZHEN VCYBER TECH

Automatic volume adjustment method and system

The invention provides an automatic volume adjustment method and system. The method comprises: monitoring voice in an environment where an intelligent terminal is located; judging whether the voice in the environment is human voice, and lowering the multimedia volume output from an earphone when primarily obtaining the human voice in the environment; judging whether the phonetic feature of the human voice in the environment is matched with a pre-stored user phonetic feature, if so, continuing to keep the lowered multimedia volume output from the earphone, and obtaining the human voice in the environment again through the output of the earphone when obtaining human voice with phonetic feature different from the user phonetic feature in the environment again, and if not, keeping the lowered multimedia volume output from the earphone. By adopting the automatic volume adjustment method provided by the invention, a user can clearly hear people speaking around when wearing the earphone, and the user can communicate with people nearby in language without taking off the earphone, thereby completely liberating the user and bringing better user experience for the user.
Owner:湖州帷幄知识产权运营有限公司

Method and system device for retrieving songs based on voice modes

The invention provides a method and system device for retrieving music based on voice modes. The invention aims at designing a method for retrieving music and songs, which can realize the interaction with a computer based on voice and ensure that the computer can actively recognize inflection information of the voice. In the invention, the technology is also implemented on the computer so as to generate a music retrieval system which can be used for KTV song selection, construction of entertainment websites and mobile terminals. The music retrieval system mainly comprises an interaction interface module, a background processing flow module, a music feature library creation module and a transmission channel module. A user can sing on site after clicking a button; the system can record the voice input in real time, can save a record file and process the file after the recording process is finished, and can ultimately sequence names of songs according to similarity; and one song can be played after being clicked, and relevant information of the song can be displayed. If the first retrieval fails, an additional retrieval can be performed, i.e. a cumulative retrieval can be performed on the basis of the previous retrieval by additionally humming / singing another rhythm of the song.
Owner:周明全 +1

Personalized Music Remixing

A personal music mixing system with an embodiment providing beats and vocals configured using a web browser and musical compositions generated from said beats and vocals. Said embodiment provides a plurality of beats and vocals that a user may suitably mix to create a new musical composition and make such composition available for future playback by the user or by others. In some embodiments, the user advantageously may hear a sample musical composition having beats and vocals with particular user-configured parameter settings and may adjust said settings until the user deems the musical composition complete.
Owner:FUNK MACHINE

Man-machine conversation system

The invention discloses a man-machine conversation system applied to an advanced service robot. The man-machine conversation system is characterized in that a voice recognition module, a natural language understanding module, a background service processing module, a natural language generating module and a voice generating module are included and are respectively and independently connected with a conversation management module and used for bidirectional data transmission. The man-machine conversation system has the advantages that the advanced service robot can directly communicate with the human language, the influence caused by a traditional communication way on communication efficiency is avoided, the convenience of man-machine communication is greatly improved, and the aim that the service robot directly listens to human voice instructions is achieved.
Owner:CHENGDU VONXAN AUTOMATION SCI & TECH

Voice trigger for a digital assistant

A method for operating a voice trigger is provided. In some implementations, the method is performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a sound input. The sound input may correspond to a spoken word or phrase, or a portion thereof. The method includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice. The method includes, upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content, such as a predetermined trigger word or phrase. The method also includes, upon a determination that the sound input includes the predetermined content, initiating a speech-based service, such as a voice-based digital assistant.
Owner:APPLE INC
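
The staged check in this abstract (first decide whether the sound is of the predetermined type, only then look for the trigger content, only then start the service) can be sketched as a small cascade. This is a minimal illustration, not Apple's implementation: the class `VoiceTrigger`, the energy-based `energy_gate`, and the length-based trigger check are hypothetical stand-ins for real detectors.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class VoiceTrigger:
    """Two-stage cascade: sound-type check, then trigger-content check."""
    is_voice: Callable[[Sequence[float]], bool]
    contains_trigger: Callable[[Sequence[float]], bool]
    start_service: Callable[[], None]

    def process(self, sound_input: Sequence[float]) -> str:
        # Stage 1: reject input that is not the predetermined sound type.
        if not self.is_voice(sound_input):
            return "ignored: not a voice-like sound"
        # Stage 2: runs only after stage 1 has passed.
        if not self.contains_trigger(sound_input):
            return "ignored: no trigger phrase"
        self.start_service()
        return "service started"


def energy_gate(samples: Sequence[float], threshold: float = 0.01) -> bool:
    """Placeholder for a voice detector: mean signal energy above a threshold."""
    if not samples:
        return False
    return sum(s * s for s in samples) / len(samples) >= threshold


started: List[str] = []
trigger = VoiceTrigger(
    is_voice=energy_gate,
    contains_trigger=lambda samples: len(samples) >= 4,  # placeholder for a phrase matcher
    start_service=lambda: started.append("assistant"),
)

print(trigger.process([0.0, 0.001, 0.0, 0.001]))  # too quiet for stage 1
print(trigger.process([0.5, -0.4]))               # passes stage 1, fails stage 2
print(trigger.process([0.5, -0.4, 0.3, -0.2]))    # passes both stages
```

Ordering the checks this way lets an inexpensive first stage filter out most input before a costlier phrase matcher runs, which is one plausible reason for the staged design.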

Intelligent placement of appliance response to voice command

Systems and methods for intelligent placement of appliance response to a voice command are provided. An exemplary system includes a plurality of appliances. An exemplary method includes connecting each of the plurality of appliances over a local area network and generating a location map providing a location of each of the plurality of appliances. The method includes receiving the human voice signal at a plurality of microphones respectively included in the plurality of appliances and determining an originating location of the human voice signal based at least in part on the location map. The method includes selecting one of the plurality of appliances to respond to the human voice signal based at least in part on the location map and the originating location.
Owner:HAIER US APPLIANCE SOLUTIONS INC
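
The selection logic in this abstract (estimate where the voice came from using the location map, then choose an appliance based on that origin) can be sketched as follows. The abstract does not say how the origin is computed or which appliance is preferred, so both choices here are assumptions for illustration: the origin is taken as the centroid of appliance positions weighted by the signal level each microphone received, and the responder is the appliance nearest to it. All names and coordinates are hypothetical.

```python
import math
from typing import Dict, Tuple

Point = Tuple[float, float]


def estimate_origin(location_map: Dict[str, Point], levels: Dict[str, float]) -> Point:
    """Assumed estimator: centroid of appliance positions weighted by received level."""
    total = sum(levels.values())
    x = sum(location_map[name][0] * level for name, level in levels.items()) / total
    y = sum(location_map[name][1] * level for name, level in levels.items()) / total
    return (x, y)


def select_responder(location_map: Dict[str, Point], origin: Point) -> str:
    """Assumed policy: the appliance closest to the estimated origin responds."""
    return min(location_map, key=lambda name: math.dist(location_map[name], origin))


# Hypothetical location map (metres) and per-microphone signal levels.
location_map = {"oven": (0.0, 0.0), "fridge": (4.0, 0.0), "washer": (0.0, 6.0)}
levels = {"oven": 0.2, "fridge": 0.7, "washer": 0.1}

origin = estimate_origin(location_map, levels)
print(origin, select_responder(location_map, origin))
```

A production system would more likely use time-of-arrival differences or a calibrated acoustic model; the weighted centroid is used here only because it keeps the example short.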

Electronic Medical Voice Instruction System

An audible medical information system for a patient or other lay user that is loaded with content by a company or the health care practitioner and can be played at will by the patient and a means for recording information in the audio file. In one embodiment, the system records information in a physical card that the user can have with them without the need for any electronic devices such as a computer or smart phone. Other embodiments may include electronic audio or video files delivered to a computer or smart phone. The audio file can contain up to several minutes of audible information, some of which may be patient specific and some of which may be disease or medication specific. The information may include pre- or post-surgical instructions, information about medications or basic use instructions for medical devices. The file(s), which can play when a card is opened (note card format) or a button is pressed (credit card or digital format), will repeat the information as the user desires. The audio is in the form of a computer generated and / or pre-recorded human voice, which may be customized for the patient's language of choice and speech patterns, and will be clear and understandable. The voice characteristics may be optimized for persuasive characteristics so this tool can help motivate patient adherence to medical instructions. This system has particular utility for patients who do not speak the same language as the health care practitioner or who do not have the ability to understand the written instructions provided, although the system will be available in convenient form factors for all patients and for those who help with their medical care. Easy access to this information will contribute to improved patient satisfaction and compliance with medical instructions which is expected to improve health outcomes.
Owner:MEDIVOCE

Voice remote command and control of a mapping security system

An invention that enables the use of human speech to remotely access, interrogate, control and obtain real time information from security devices in a facility or location. Wireless, or other network connectivity, mobile devices are used as the voice recognition system. These devices interface to a management system located at the facility or location under surveillance. The user is able to view the mobile display device and command the system using human voice. The system supports detecting and tracking security intrusions, controlling the security devices at the location, requesting changes to the display, obtaining status information of the system or any device, and communicating to others that may be accessing the system jointly. The invention also uses hierarchical maps to quickly identify security problems within an enterprise. The system uses real-time altered icons or element pictures that identify the status of that element at a quick glance. The organized use of hierarchical maps to quickly traverse to and identify particular security problems to include intrusions, alarms, failures, pending failures, etc. Intruder movement is also automatically tracked on or between maps.
Owner:FALLON KENNETH T

Download management of audio and visual content, product method and system

An improved method for delivery and play back of sound and image files is provided as exemplary embodiments. This method may include the use of sound and / or image clips, which can be snippets or full files, as alerts for a variety of electronic devices or for playing on a handheld device, and for use as a promotion to sell items associated with the files. A collection or library of uniquely selected and / or edited clips may also be provided to the consumer in a manner far more conveniently on conventional telephone equipment than previously available. Exemplary embodiments may provide algorithms for the delivery, storage and playback of the sound files, including a delivery software system (500), a parametric optimization and compression algorithm (1600), and an error correction algorithm. In contrast to the conventional ring tones or musical chimes used to ring cellular phones currently on the market, the current invention provides a method for ringing cellular phones, electronic devices, and landline telephones with real sound recordings including real music, which may be songs sampled from copyright registered CD tracks, and may comprise human voice, various instrument sounds, and other sound effects of a high quality. A software based system for encoding the hardware of existing cellular phones at the time of manufacturing with delivery, storage, and playback capabilities in accordance with the exemplary embodiments may be provided, such that additional hardware may not be required.
Owner:SKKY

Voice activation

A circuit and a method are given, to realize a very flexible voice activation system using a modular building block approach, that is adaptively tailored to handle certain relevant and case specific operational characteristics describing most of the possible acoustical differing environmental cases to be found in the field of speech recognition. Included are determinations of “Noise estimation” and “Speech estimation” values, done effectively without use of Fast Fourier Transform (FFT) methods or zero crossing algorithms, only by analyzing the modulation properties of human voice. Said circuit and method are designed in order to be implemented with a very economic number of components, capable of being realized with modern integrated circuit technologies.
Owner:DIALOG SEMICONDUCTOR GMBH

Vehicle accessory microphone

A microphone assembly includes one or more transducers (2210) that are positioned in one or more housings. A preprocessing circuit (2215) includes an inverted comb filter (2245) for eliminating predetermined frequencies between harmonics of the human voice in a predetermined frequency range. A processing circuit (2220) coupled to the preprocessing circuit (2215) is used for outputting an electrical signal such that the transducers (2210) used in combination with the processing circuit (2220) very effectively cancel noise. The microphone assembly can be employed in a vehicle accessory such as a vehicular mirror.
Owner:GENTEX CORP

Unspecific human voice and emotion recognition method and system

Active | CN102881284A | Overcoming the shortcoming of being easily disturbed by speaker changes | Improve robustness | Speech recognition | Speech sound | Vocal sound
The invention provides an unspecific human voice and emotion recognition method and system, wherein the method comprises the steps of extracting phonetic features used for recognizing the emotional paralanguage from the voice signal to be recognized, extracting acoustic voice emotional characteristics of the emotional voice signal to be recognized, and mixing recognition results of an emotion recognition channel based on emotional paralanguage and an emotion recognition channel based on acoustic voice emotional characteristics to obtain the emotional state contained in the emotional voice signal to be recognized. By utilizing the characteristics that the change of speakers has little influence on the emotional paralanguage, the emotional paralanguage reflecting the emotion information can be extracted from the emotional voice signal, and the emotion information contained in the emotional paralanguage can assist the auxiliary acoustic emotional characteristics for emotion recognition, so that the purposes of improving the robustness and recognition rate of the voice and emotion recognition can be achieved.
Owner:JIANGSU UNIV