Check patentability & draft patents in minutes with Patsnap Eureka AI!

Speech feature mapping method and system based on Bhattacharyya distance

A technology based on Babbitt's distance and speech features, applied in the field of speech recognition/speaker recognition, can solve problems such as difficult to guarantee the direction of feature mapping, without more consideration, without adding prior information, etc., so as to improve the recognition accuracy , improve the accuracy rate, and enhance the effect of robustness

Active Publication Date: 2018-11-06
HUAZHONG NORMAL UNIV
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] (1) The existing technology only uses the real distribution of the features, and starts mapping without any assumptions or operations on the real distribution of the features. It is difficult to guarantee the mapping direction of the features during the mapping process under this mapping method
[0009] (2) The existing technology does not take more considerations when measuring the distribution of complex features and the distribution of reference features
The Bhattacharyian distance is a good choice for determining the distance between two distributions, which is not exploited by the prior art
[0010] (3) The existing technology does not add prior information when processing complex features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech feature mapping method and system based on Bhattacharyya distance
  • Speech feature mapping method and system based on Bhattacharyya distance
  • Speech feature mapping method and system based on Bhattacharyya distance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0063] The speech feature mapping method based on the Bhattacharyian distance provided by the embodiment of the present invention first extracts the features of the speech signal in a complex environment and the clean speech signal; then uses the complex feature and feature mapping formula to initialize the mapping feature, and respectively establishes the mapping feature and clean feature GMM model; then use the EM algorithm to iteratively estimate the minimum Bhatia distance between the two GMM models, and obtain the final mapping feature; finally, the mapping feature is pattern-matched with the speech signal model in a cl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention, which belongs to the technical field of speech recognition / speaker recognition, discloses a speech feature mapping method and system based on Bhattacharyya distance. Features of a speech signal in a complex environment and a clean speech signal are extracted respectively; mapping features are initialized by using a complex feature and a feature mapping formula and GMM models of themapping features and the clean features are established respectively; a minimum Bhattacharyya distance between the two GMM models is estimated iteratively by using an EM algorithm and a final mappingfeature is obtained; and mode matching and identification are carried out on the mapping features and the trained speech signal model in a clean environment. According to the invention, the complex feature mapping is carried out by minimizing the Bhattacharyya distance between the complex feature GMM and the clean feature GMM to obtain mapping features; and mode matching and identification are carried out on the mapping features and the clean model. The mapping features are used for replacing the complex features, so that the speech recognition accuracy is improved effectively.

Description

technical field [0001] The invention belongs to the technical field of speech recognition / speaker recognition, and in particular relates to a speech feature mapping method and system based on Bhattacharyachian distance. Background technique [0002] At present, the existing technologies commonly used in the industry are as follows: [0003] As computer technology promotes the development of human-computer interaction, voice interaction is more and more widely used in real-world scenarios. Voice interaction is the interaction between humans and machines in the form of voice, which is closer to the interaction between humans and more in line with human interaction habits. Voice interaction can enable users to interact with machines more comfortably, make the interaction proceed in a simpler, faster, and more efficient manner, and also make the human-computer interaction process more humanized, highlighting the dominant position of humans in the entire interaction process. V...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/22G10L15/06G10L25/24G10L15/02
CPCG10L15/02G10L15/06G10L15/22G10L25/24
Inventor 王志锋左明章宁国勤叶俊民闵秋莎田元夏丹陈迪罗恒姚璜
Owner HUAZHONG NORMAL UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More