Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

65 results about "Speech perception" patented technology

Speech perception is the process by which the sounds of language are heard, interpreted and understood. The study of speech perception is closely linked to the fields of phonology and phonetics in linguistics and cognitive psychology and perception in psychology. Research in speech perception seeks to understand how human listeners recognize speech sounds and use this information to understand spoken language. Speech perception research has applications in building computer systems that can recognize speech, in improving speech recognition for hearing- and language-impaired listeners, and in foreign-language teaching.

Method and system for speech quality perception evaluation based on speech semantic recognition technology

ActiveCN108877839AGood repeatabilitySolve problems that cannot restore the thinking paradigm of the human brainNatural language data processingSpeech recognitionUser perceptionUsers perceptions
The invention discloses a method and system for speech quality perception evaluation based on the speech semantic recognition technology. According to a text of a sender after user speech conversion and a text of a receiver after user speech conversion, text similarity evaluation is carried out based on a text similarity fitting algorithm; network parameters and event information of communicationunit connection networks of the sender and the receiver are displayed in real time and are stored; a user speech perception evaluation model is established by using speech information according to a telecom psychology algorithm and speech perception evaluation is carried out on a user; and then user perception evaluation is formed by means of text similarity evaluation , network information and voice perception evaluation. With the method disclosed by the invention, problems of poor repeatability of the subjective evaluation method and the human brain thinking paradigm can not be restored by the objective issues can be solved; the method is close to the human brain thinking mode and the perception of the network conversation speech quality of the user; and on the basis of the time-positionmapping, a network issue can be located precisely by combining the network parameter information and events.
Owner:NANJING HOWSO TECH

Binaural speech reverberation eliminating method and device based on speech presence probability and consistency

ActiveCN108986832AReverb removalImprove the perceived quality of speechSpeech analysisLow frequency bandSpeech perception
The invention discloses a binaural speech reverberation eliminating method and device based on the speech presence probability and consistency. The method comprises the steps of 1) performing time delay compensation on speech signals received by two microphones to obtain speech signals aligned in time; 2) performing windowing and framing processing, and transforming the speech signals from the time domain to the frequency domain through Fourier transform; 3) estimating a reverberation power spectrum of a low frequency part based on the speech presence probability; 4) calculating the consistency of different signal components of the speech signals; 5) estimating a reverberation power spectrum of a high frequency part based on the consistency; 6) estimating a reverberation power spectrum combining high and low frequencies according to a division threshold of high and low frequency bands; 7) calculating a final reverberation power spectrum by using a recursive smoothing algorithm; 8) obtaining frequency domain signals with the reverberation being eliminated through a gain function; and 9) obtaining time domain signals with the reverberation being eliminated by using short-time inverseFourier transform. According to the invention, the reverberation on the whole frequency band can be effectively eliminated, and the quality of speech perception is improved.
Owner:PEKING UNIV SHENZHEN GRADUATE SCHOOL

Cipher text speech perception hashing and retrieving scheme based on time-frequency domain trend change

The invention discloses a cipher text speech perception hashing and retrieving scheme based on time-frequency domain trend change. A piece of speech is divided into a time domain part and a frequency domain part for extracting perception hash, the speech is encrypted by a high-efficiency chaotic XOR encryption algorithm adapting to large-scale data, and a perception hash sequence is embedded into the least significant bit of the cipher text speech by the digital watermarking technology to generate a cipher text speech library and a system perception hash table. The cipher text speech library and the perception hash table are uploaded to the cloud. During retrieval, a perception hash sequence is extracted from an index speech provided by a user, the abstract sequence is submitted to a cloud server as an index, and matching retrieval is carried out in the system hash table of the cloud. When the perception hash sequence is matched with a perception hash value in the system hash table, a cipher text speech corresponding to the hash abstract in the hash table is returned to the user, and retrieval succeeds. Rapid and accurate retrieval of an encrypted speech in the cloud is realized. According to the method, weight distinguishing is carried out, matching is carried out successively, and therefore, the matching efficiency in large-scale application is improved.
Owner:SOUTHWEST JIAOTONG UNIV

Symmetrical ternary string represented voice perception Hash sequence constructing and authenticating method

InactiveCN104134443AOvercome weaknessPerceptual hash digest is strongSpeech analysisAlgorithmVoice communication
The invention discloses a symmetrical ternary string represented voice perception Hash sequence constructing and authenticating method. The method comprises the steps that firstly, overall discrete wavelet transforming (DWT) is carried out on voice signals produced after preprocessing and intensity-loudness transformation (ILT); secondly, non-overlapping partitioning is carried out on the low-frequency part of the voice signals produced after DWT, and short-time logarithm energy of blocks is calculated to obtain the signal frequency-domain features; lastly, a final ternary perception Hash sequence is generated based on the time domain spectrum flux features (SFF) of the voice signals, and the voice frequency content is quickly authenticated through Hash matching. The symmetrical ternary string representation of the perception Hash abstract is superior to that of the binary form, the common voice content is operated between the robustness and the difference in a balanced mode, the time complexity of the algorithm is low, efficiency and the abstraction are high, precise manipulation detecting and positioning can be achieved, and the method can be used for authenticating a mobile voice communication terminal with bandwidth resources limited in real time.
Owner:LANZHOU UNIVERSITY OF TECHNOLOGY

Perceptual Hash feature extraction method and system of encrypted voice signal

The invention discloses a perceptual Hash feature extraction method and system of an encrypted voice signal. The method includes steps: performing signal framing on the encrypted voice signal, calculating a short-time cross correlation coefficient between each encrypted voice frame and an adjacent encrypted voice frame, and obtaining a cross correlation coefficient matrix; determining the previousshort-time cross correlation coefficients with large values in each row of the cross correlation coefficient matrix as elements of a feature coefficient matrix, and obtaining the feature coefficientmatrix; decomposing the feature coefficient matrix by employing a non-negative matrix decomposition method, and obtaining a feature parameter matrix; and performing binary Hash construction on the feature parameter matrix by employing a Hash function, and obtaining a perpetual Hash value of the encrypted voice signal. By employing the method or system, the short-time cross correlation coefficientsextracted from the encrypted voice signal can be regarded as perpetual features of the encrypted voice signal, and the perpetual Hash value of the encrypted voice signal is generated through Hash construction so that the robustness, the distinction and the abstractness of direct extraction of the voice perpetual features from the encrypted voice signal are improved.
Owner:LANZHOU UNIVERSITY OF TECHNOLOGY

Method for calculating telescopic resistance interval of music clip

The invention relates to a method for calculating the telescopic resistance interval of a music clip, belonging to the technical field of audio processing. The method comprises the following steps of: firstly, establishing a music telescopic resistance dataset to obtain a telescopic resistance distribution histogram; performing equal-area division to form a telescopic resistance type; extracting multiple audio content characteristics to form a characteristic vector of the music clip; performing generalization processing and solving to obtain a diagonal matrix; distinguishing the differing degree of the music clip by use of the music style; and with K neighbor judgment, calculating the telescopic resistance interval of the clip to be processed. The method provided by the invention puts forward a quantitative expression method for the music telescopic resistance for the first time, and calculates the music telescopic resistance interval by mainly focusing on the characteristics of the audio content and taking the music style as subsidiary as well as combining the machine learning strategy; and the method has relatively high accuracy, is easy to operate, and can be directly applied to the parameter estimation in a music reconstruction algorithm and the study on the characteristics of human perception of the music clip in music psychology and speech perception.
Owner:TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products