Method and device for voice signal processing according to frequency domain energy

A frequency-domain energy and speech signal technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problem of low accuracy of speech signal segmentation results

Active Publication Date: 2015-09-23
HUAWEI TECH CO LTD
View PDF9 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a method and device for processing speech signals according to frequency-domain energy, so as to solve the problems caused by the characteristics of the speech signal phoneme itself or the influence of strong noise when the speech signal is finely segmented. The problem of low accuracy of signal segmentation results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for voice signal processing according to frequency domain energy
  • Method and device for voice signal processing according to frequency domain energy
  • Method and device for voice signal processing according to frequency domain energy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0085] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0086] figure 1 It is a flowchart of a method for processing a speech signal according to frequency domain energy provided by Embodiment 1 of the present invention. Such as figure 1 As shown, the flow of the method for processing a speech signal according to frequency domain energy provided by this embodiment includes:

[0087] Step 101, r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Provided are a method and apparatus for processing speech signal according to frequency domain energy. The method for processing speech signal according to frequency domain energy comprises: receiving an original speech signal comprising a first speech frame and a second speech frame that are adjacent (101); performing Fourier transform on the first speech frame and the second speech frame to obtain a first frequency domain signal and a second frequency domain signal respectively (102); obtaining frequency domain energy distributions of the first speech frame and the second speech frame (103); obtaining a frequency domain energy relevance coefficient of the first speech frame and the second speech frame (104); and segmenting the original speech signal according to the frequency domain energy relevance coefficient (105). A problem of insufficiently high accuracy of a segmentation result of a speech signal caused by the influence of phonemic features of the speech signal or relatively strong noise during fine segmentation of the speech signal can be solved.

Description

technical field [0001] Embodiments of the present invention relate to speech signal processing technologies, and in particular to a method and device for processing speech signals according to frequency domain energy. Background technique [0002] When evaluating the quality of a speech signal or performing speech recognition, it is often necessary to finely segment the speech signal. [0003] In the prior art, the segmentation of the voice signal is mainly to analyze the sudden change of the time-domain energy in the voice signal, and segment the voice signal according to the time change point of the energy mutation; if there is no change, the voice signal is not to segment. [0004] However, when the speech signal changes, due to the characteristics of the phoneme itself or the influence of strong noise, the energy in the time domain does not necessarily change abruptly. Therefore, the accuracy of the voice signal segmentation result in the prior art is not high. Conte...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/04G10L21/0208G10L25/93
CPCG10L15/04G10L25/78G10L21/0308G10L25/06G10L25/18
Inventor 许丽净
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products