Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

99results about How to "Improve Speech Recognition Efficiency" patented technology

Speech recognition method and device

The embodiments of the present invention disclose a speech recognition method and device. According to one specific embodiment of the invention, the method includes the following steps that: the identity information of a target user inputting speech input signals is determined in response to the received speech input signals; the commonly-used expression set of the target user is extracted from astored commonly-used expression database on the basis of the identity information of the target user, wherein the commonly-used expression set contains a plurality of commonly-used expressions; acoustic feature extraction is performed on the speech input signals, and the acoustic features of the speech input signals are inputted into an acoustic model, so that the acoustic model score of the speech input signals can be obtained; whether the content of the speech input signals is a commonly-used expression of the target user is determined on the basis of the acoustic model score of the speech input signals and the acoustic model scores of the commonly-used expressions in the stored commonly-used expression set of the target user; and if the content of the speech input signals is a commonly-used expression of the target user, a language model is constructed based on commonly-used expressions is adopted to decode the acoustic features of the speech input signals, so that a speech recognition result can be obtained. With the method and device provided by the embodiments adopted, speech recognition efficiency can be improved.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD

Speaker-dependent voice recognizing method

The invention discloses a speaker-dependent voice recognizing method. The speaker-dependent voice recognizing method is characterized in that a voice data cache and a benchmark voice database are defined preliminarily, and original voice signals are stored in the voice data cache and sent to a voice signal recognition engine to be processed and recognized to acquire recognition results with acoustic features and recognition factors; the benchmark voice database is judged whether to have benchmark voice data or not; if the benchmark voice database has no benchmark voice data, the benchmark voice data are initialized according to voice signal recognition engine results; if the benchmark voice database has the benchmark voice data, corresponding processing methods are selected to process according to different voice signal recognition engine results, so that the benchmark voice data are updated or recognition results of the voice signal recognition engine are corrected; and finally a speaker dependent pronounces the same recognition word repeatedly, and the previous steps are performed repeatedly to update the benchmark voice data continuously to make the benchmark voice data to be optimal. The speaker-dependent voice recognizing method has the advantages that speaker-dependent voice recognition efficiency is improved, and voice false recognition and recognition rejection are reduced; and with the increase of the number of use of users, the benchmark voice data in the benchmark voice database are reliable, recognition accuracy and efficiency are increased, and user experience effects are improved well.
Owner:SICHUAN CHANGHONG ELECTRIC CO LTD

Multiplayer collaborative recording system and identification method based on instant communication

The present invention discloses a multiplayer collaborative recording system and identification method based on instant communication. The system comprises a collaborative recording module which is used for collaborating a recording process and comprises room creating, member adding, session starting, voice identifying as a session record and realizing email sharing, and a message processing module for processing the real-time message generated by the system. The method comprises the steps of using the real-time communication technology to realize the real-time communication of different clients, parallelly uploading the voice by using a synchronous uploading method and reducing the network time delay of transmission, using a Xunfei voice identifier to support the voice identification of multiple languages, carrying out time division identification of different voices based on an asynchronous identification method, and organizing a voice identification result as the session record according to a given format. According to the system and the method, a traditional recording function is extended, the non-face-to-face multiplayer collaborative recording can be realized, and the instant messaging and voice noise reduction technology are used to improve the voice identification speed and accuracy of collaborative recording.
Owner:HOHAI UNIV

Computer voice control method and intelligent voice assistant system

The invention relates to a computer voice control method and an intelligent voice assistant system. The intelligent voice assistant system comprises a display interface used for receiving a first operation instruction which is inputted by a user and is used for starting the intelligent voice assistant system, a memory used for storing voice configuration files and mouse and keyboard configuration files, a voice acquisition device used for acquiring voice commands inputted by the user and transmitting the voice commands to a processor, and a processor used for converting the voice commands into corresponding voice command entries, calling the voice configuration files of the memory and matching the voice command entries with entries of the voice configuration files, if matching succeeds, program operation sequences of the mouse and keyboard configuration files of the memory corresponding to the voice command entries are called to control program operation, and the display interface is further used for displaying a performing result to be success. The intelligent voice assistant system is advantaged in that a computer is controlled for operation through the voice commands instead of mouse and keyboard operation, and thereby the computer is made to be more concise and convenient for use.
Owner:梅其珍

Voice recognition method and device, computer readable storage medium and computer equipment

The embodiment of the invention discloses a voice recognition method and device, a computer readable storage medium and computer equipment. The method comprises the steps: carrying out the feature extraction of to-be-recognized voice information, and obtaining a plurality of feature vectors; calculating the sparseness value of each feature vector, wherein the sparseness value is the relative entropy between the distribution of the self-attention score sequence of each feature vector and the uniform distribution of the self-attention score sequence; determining a first feature vector of which the sparseness value is greater than a preset threshold value and a second feature vector of which the sparseness value is not greater than the preset threshold value; determining a target matrix according to the self-attention calculation result of the first feature vector and the second feature vector; and inputting the target matrix and the feature matrix corresponding to the tag sequence into a classification network for classification processing to obtain a recognition result corresponding to the to-be-recognized voice information. Therefore, the deep learning method is adopted, the calculation amount of the self-attention mechanism in the speech recognition process is reduced, and the speech recognition efficiency is improved.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Voice language recognition method and system based on confidence degree

The invention provides a voice language recognition method and system based on the confidence degree, aiming to solve the problem that in the existing voice recognition, the language recognition efficiency is low. The method comprises the following steps: S1, extracting a voice segment from each voice segment as a preset voice segment, comparing with a preset language database, acquiring the language information matched with the preset voice segment; S2, acquiring the language confidence degree and the confidence degree average value of each voice segment according to the language information,judging whether the confidence degree mean value is larger than a preset credibility threshold value or not, if yes, the current language is used as the default language of the voice information; S3,if not, screening all the voice segments through a preset screening condition, until the mean value of the language confidence degree is larger than the preset threshold value, and acquiring the voice fragment obtained by the screening, and turning to the step S1. By adopting the voice language recognition method and device, the voice recognition efficiency is improved, and meanwhile, the recognition accuracy of the multi-language voice information is further improved.
Owner:HENGQIN INT INTPROP EXCHANGE CENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products