Speech spectrum color enhancement method for speech visualization

A spectrogram and color technology, applied in speech analysis, instruments, etc., can solve the problems of weak expressiveness, single visual effect, difficult to perceive speech, etc., and achieve the effect of easy identification, easy implementation, and simple extraction of parameters.

Inactive Publication Date: 2011-05-04
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in terms of its speech intelligibility, it is still difficult to achieve the ideal effect. Except for a very small number of experts, it is difficult for people to directly perceive speech accurately and effectively by observing the movement of the vocal organs.
In addition, the visual effect is relatively simple and the expressive power is not strong

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech spectrum color enhancement method for speech visualization
  • Speech spectrum color enhancement method for speech visualization
  • Speech spectrum color enhancement method for speech visualization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The technical solutions of the present invention will be further described below in conjunction with the drawings and embodiments.

[0028] Such as figure 1 As shown, it is a system block diagram of a spectrogram color enhancement method for speech visualization, which is mainly divided into three blocks: a feature parameter extraction module, a color generation module and a visualization effect graph generation module.

[0029] 1. Feature parameter extraction module:

[0030] Firstly, after the original speech signal is framed and windowed, the short-term energy value of each frame signal in each characteristic frequency band is extracted.

[0031] (1). Divide the effective frequency band of the speech signal into 12 characteristic frequency bands on average. For example, if the sampling rate is 16KHz and the effective frequency band is 0-8KHz, then the 12 characteristic frequency bands are as follows: 4000Hz-4666.67Hz, 4666.67-5333.33Hz, 5333.33-6000Hz, 6000Hz-6666...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech spectrum color enhancement method for speech visualization, which comprises the following steps of: performing frame division and windowing on original speech signals, and extracting a short time energy value of each frame of signal in each characteristic frequency band; equally dividing an effective frequency band of each speech signal into N characteristic frequency bands, and respectively calculating energy values of each frame of speech signal in the N characteristic frequency bands; correcting preset color saturation in a corresponding characteristic frequency band by taking an energy value in each characteristic frequency band as a parameter; normalizing the energy values of the N frequency bands; correcting the set color saturation of the N characteristic frequency bands by utilizing the normalized energy values; converting corrected hue, saturation and brightness of the N characteristic frequency bands of each speech signal into red, green and blue (RGB) three-primary color values by utilizing a chromatology conversion equation; and drawing a histogram. A speech signal color generating module reflects energy concentrated areas of speech signal frequency spectrums through different colors, so that the energy concentrated areas are easy to identify; and the interframe change of pronunciation is dynamically reflected and a pronunciation rule is met.

Description

technical field [0001] The invention relates to a spectrogram color enhancement method for speech visualization, which belongs to the field of speech visualization. Background technique [0002] Speech is the sound that people make when they speak, and it is indispensable in people's daily life. But for the hearing-impaired, they cannot perceive speech through hearing, causing pain that normal people cannot understand. Studies have shown that in the process of people's perception of the outside world, the most information is acquired by vision, followed by hearing, and the combination of vision and hearing is more than any single sense of perception. In addition, experience tells us that charts are the most convenient and intuitive way for people to express their thoughts and understand things, so people also try to perceive speech visually, or use the combination of vision and hearing to convey more useful information. The purpose of the present invention is to explore an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/06G10L21/10
Inventor 赵胜辉董欣玮王晶匡镜明
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products