Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

91 results about "Vocal pitch" patented technology

Dialogue type voice recognition method and system, electronic equipment and storage medium

PendingCN111508498AAccurate identificationSolve the problem of not being able to cut accuratelySpeech recognitionNoiseEngineering
The invention relates to the technical field of voice recognition, and provides a dialogue type voice recognition method and system, electronic equipment and a storage medium. The dialogue type voicerecognition method comprises the steps of obtaining a dual-channel audio of dialogue type voice, and performing compression reduction and channel separation on the dual-channel audio to obtain a single-channel original audio; performing framing processing on the original audio to obtain a plurality of audio frames, and performing cutting processing on the original audio according to the energy ofeach audio frame to obtain a plurality of effective audio segments; extracting Mel cepstrum features and tone features of the effective audio segments and speaker features of channels where the effective audio segments are located, and inputting the Mel cepstrum features, tone features and speaker features into a speech recognition model to obtain recognition results of the effective audio segments; and generating a voice recognition result of the original audio according to the recognition result of each effective audio segment. According to the invention, accurate cutting of the double-channel dialogue type voice can be realized, and the dialogue type voice can be accurately recognized under the condition of shielding surrounding noise.
Owner:CTRIP COMP TECH SHANGHAI

Method for verifying time domain fine structure novel code of artificial cochlea tone language

ActiveCN109036569AReflect acceptanceMedical simulationElectrotherapyCochleaFine structure
The invention discloses a method for verifying a time domain fine structure novel code of an artificial cochlea tone language. The method mainly comprises the following steps of 1), selecting an experiment biosome; 2), establishing a nervus thalamicus response time-space quantified mode I under the condition of original voice induction; 3), establishing a nervus thalamicus response time-space modeII on the condition of novel coding tone voice stimulation induction; 4), establishing a nervus thalamicus response time-space mode III on the condition of novel coding voice electric stimulation; and 5), through adjusting an electric stimulation mode parameter which corresponds with a voice code, making the nervus thalamicus response time-space mode III on the condition of novel coding voice electric stimulation approach the nervus thalamicus response time-space quantified mode I under the condition of original voice induction. The method can really and objectively reflect the receiving degree of a biosome hearing channel to a certain sound coding strategy and can be used as an auxiliary evaluating method for an artificial cochlea novel voice code.
Owner:CHONGQING UNIV

Chinese tone recognition method based on time frequency crest line-Hough transformation

The invention provides a Chinese tone recognition method based on time frequency crest line-Hough transformation. Chinese tone recognition is converted into classification of the change trend of a line segment in a time frequency distribution diagram so that a new Chinese tone recognition method and technique can be acquired. The method includes the steps that firstly, final voice signals carrying Chinese tones are expressed through the SPWVD time frequency distribution diagram and tone information is shown through a group of similarly-parallel time frequency crest lines in the time frequency diagram; secondly, due to the fact that the main time frequency crest line is a region with larger energy in the diagram, the change trend of different tones is reflected, and in order to reduce the calculated amount, treatment such as binaryzation, thresholding and refining is conducted on the time frequency distribution diagram, and a center line segment of the main time frequency crest line reflecting the change trend of the tones is acquired; thirdly, Hough transformation is conducted on the time frequency distribution diagram containing the center line of the main crest line, so that the intercept and included angle parameters of the center line of the main crest line are acquired; finally, the tone type is judged according to the intercept and the included angle of the line segment and the coordinate values of a start point and an end point of the line segment.
Owner:JIANGNAN UNIV

Computer Chinese phonetic double-click rapid input method

The invention relates to a computer Chinese (consonant-vowel double- click) tone-participative coding quick input method, developed by deep research mainly for the requirements of Ministry Of Labour And Social Security and Ministry of Information Industry for professional skills of Chinese short-hand experts on compute: the primary input 140 Chinese characters per minute, the medium input 180 Chinese characters per minute and the senior input 220 Chinese characters per minute. And the invention overcomes the defect that the traditional phonetic English small letters correspond to on-keyboard English capital letters to input Chinese characters, and adopts a truly original design solution, and proceeds in all cases from raising Chinese character input speed and makes computer Chinese character input reach the limit. And its main principle and features: 1. fully considering English key frequency of the main region of a keyboard, where the middle row is higher than the top row, the top row is higher than the bottom row, and the middle is higher than the four sides; 2. fully considering human hand flexibility, where the right-hand is higher than the left-hand; 3. fully considering use frequencies of initial consonants and vowels and make integral optimized settings from high to low corresponding to the keyboard frequency and flexibility of left and right hands; 4. fully considering rules of daily spoken language and written language and giving punctuations to part of speech classification recognizing function. And it has also features of visual input, strong regularity, low memory quantity, large coding capacity, good word composing effect, low repeated code rate, high input speed, etc.
Owner:孙莹莹

Chinese tone dichotic listening testing system and testing method

The invention relates to a Chinese tone dichotic listening testing system. The system comprises a testee information managing module, a testing material selecting module, a testing parameter configuring module, a testee screening and training module, a dichotic listening testing module and a testing result storing module; the testee information managing module is used for inputting, inquiring, modifying and deleting basic information of a testee; the testing material selecting module is used for supplying and selecting a Chinese tone material; the testing parameter configuring module is used for configuring values of the signal-to-noise ratio and the response time which are adopted in testing; the testee screening and training module is used for screening the testee meeting the requirement and completing acquainting and training on testing content, testing processes and testing methods before testing; the dichotic listening testing module is used for simultaneously supplying testing signal pairs composed of same syllables and different tones to the left ear and the right ear of the testee respectively according to the dichotic listening normal form; the testing result storing module is used for storing the testing process and a testing result of the testee in a file.
Owner:INST OF ACOUSTICS CHINESE ACAD OF SCI

Statement error correction method and device after speech recognition, equipment and storage medium

The embodiment of the invention discloses a statement error correction method and device after speech recognition, equipment and a storage medium. According to the technical scheme provided by the embodiment of the invention, the method comprises the steps of: recognizing the first occurrence probability of each character in the to-be-corrected text through the language model, determining the recognized error word in the to-be-corrected text according to the first occurrence probability, determining the model candidate word by utilizing the language model, determining the homophone candidate word according to the pinyin and tone of the recognized error word, further determining a first sequence and a second sequence between a model candidate word and a homophone candidate word, determining a candidate sequence between the model candidate word and the homophone candidate word according to the first sequence and the second sequence, determining an error correction candidate word according to the candidate sequence,replacing a recognition error word in a to-be-corrected text with the error correction candidate word, and directly docking and modifying the voice recognition result in a non-intrusive manner, so that the training cost of voice recognition network learning is effectively reduced.
Owner:PCI TECH GRP CO LTD +2

Spoken language evaluation method and device

PendingCN112331180AImprove the accuracy of judgmentReduce the impact of large differences in judgment effectsSpeech recognitionEvaluation resultSpoken language
The invention provides a spoken language evaluation method and device. The spoken language evaluation method comprises the steps: obtaining a to-be-evaluated audio and an evaluation text correspondingto the to-be-evaluated audio; determining an attribute characteristic value of each phoneme in the evaluation text and a posterior probability corresponding to each phoneme based on the to-be-evaluated audio and the evaluation text; extracting a pronunciation characteristic value corresponding to the evaluation text based on the evaluation text and the posterior probability corresponding to eachphoneme; generating a characteristic vector corresponding to each phoneme according to the attribute feature value and the pronunciation feature value of each phoneme; and inputting the characteristicvector corresponding to each phoneme into a spoken language evaluation model to obtain an evaluation result output by the spoken language evaluation model. According to the spoken language evaluationmethod provided by the invention, the pronunciation characteristic value corresponding to each phoneme is introduced, and the potential error of the current pronunciation can be accurately explored.Multi-dimensional characteristic information is provided for a spoken language evaluation model, and the judgment accuracy of initial consonants, final consonants and tones is improved.
Owner:BEIJING YUANLI WEILAI SCI & TECH CO LTD

Evaluation system for Chinese tone coding strategy of artificial cochlea

The invention discloses an evaluation system for a Chinese speech coding strategy of an artificial cochlea. A main hardware system comprises a speech signal acquisition module, a fundamental frequencydetection module, a signal preprocessing module, a frequency channel division module, a harmonic selection module, a frequency shift processing module, a filtering module, a speech synthesis module and a playing module. According to the invention, by adoption of Lin's six tones as a basic speech material and cooperation of four tones, the Lin's six tones are processed by using a to-be-tested Chinese speech coding strategy and played to a subject, so the hearing task of tone recognition is completed, and the correct rate of tone recognition is counted to evaluate the advantages and disadvantages of the to-be-tested Chinese speech coding strategy. Compared with a traditional speech test library, the evaluation system provided by the invention has less test content; and the evaluation systemis short in detection time, high in relative detection efficiency and low in cost, and can improve the treatment experience of the subject, inspect a Chinese speech coding algorithm and train and detect the tone recognition ability of hearing-impaired children.
Owner:重庆大学科技园有限责任公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products