Voice tone recognition method and system based on random forest
A technology of random forests and recognition methods, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of complex construction of convolutional neural networks, increase in computational complexity, and influence on recognition accuracy, and reduce parameter calculation and operation speed. Fast and ensure the effect of recognition accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0034] The present embodiment provides the speech tone recognition method based on random forest;
[0035] Speech tone recognition method based on random forest, including:
[0036] S101: Obtain a speech signal to be recognized, and perform preprocessing on the speech signal to be recognized;
[0037] S102: Extracting and selecting feature parameters of the preprocessed speech signal to be recognized;
[0038] S103: Input the extracted feature parameters into the pre-trained random forest model, and output the tone recognition result of the speech signal to be recognized.
[0039] As one or more embodiments, the preprocessing of the speech signal to be recognized includes: sequentially performing sampling, low-pass filtering, framing and voicing determination on the speech signal to be recognized.
[0040] As one or more embodiments, the extraction and selection of feature parameters are performed on the preprocessed speech signal to be recognized; the specific steps include...
Embodiment
[0063] 1. In the preprocessing part, the sampling rate of the voice signal is 16kHz, and the low-pass filtering is performed by a Chebyshev II low-pass filter with a passband frequency of 500Hz. In the short-time frame processing, the frame length is 30ms, and the frame shift is 10ms. The voicing judgment uses a double-threshold method based on short-term zero-crossing rate and short-term energy.
[0064] 2. Integrating the performance of the feature parameters that have been adopted at the present stage in tone recognition, the following three types of feature parameters are selected for optimization: the basic statistics of the fundamental frequency; , 1 / 3 point to 2 / 3 point, 2 / 3 point to the end), each section extracts four parameters related to the fundamental frequency and energy; parameters related to the fundamental frequency change trend. The above three types of characteristic parameters can form a parameter set, and the weight of each parameter in the parameter set i...
Embodiment 2
[0075] The present embodiment provides the speech tone recognition system based on random forest;
[0076] Speech tone recognition system based on random forest, including:
[0077] A preprocessing module, which is configured to: acquire a speech signal to be recognized, and preprocess the speech signal to be recognized;
[0078] A feature extraction module, which is configured to: extract and select feature parameters for the preprocessed speech signal to be recognized;
[0079] The tone recognition module is configured to: input the extracted feature parameters into a pre-trained random forest model, and output the tone recognition result of the speech signal to be recognized.
[0080] It should be noted here that the above-mentioned preprocessing module, feature extraction module, and tone recognition module correspond to steps S101 to S103 in Embodiment 1, and the examples and application scenarios implemented by the above-mentioned modules are the same as those of the co...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com