Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for adjusting self-adaptive audio sensing loudness

An adjustment method and self-adaptive technology, applied in speech analysis, instruments, etc., can solve problems such as difficulty in ensuring the accuracy of audio perception loudness estimation, loudness deviation from the real auditory perception loudness range, and inability to eliminate adverse effects of audio well, and improve the Accuracy, good real-time performance, and accurate loudness estimation

Inactive Publication Date: 2012-07-25
TIANJIN UNIV
View PDF4 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the ReplayGain standard uses a fixed threshold (95% maximum energy) estimation method for the measurement of the perceived loudness of audio files, which is difficult to guarantee the estimation accuracy of the perceived loudness of different types of audio. The undesired effects of very low and very high loudness components caused, so that the estimated loudness deviates from the real auditory perception loudness interval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for adjusting self-adaptive audio sensing loudness
  • Method for adjusting self-adaptive audio sensing loudness
  • Method for adjusting self-adaptive audio sensing loudness

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The invention proposes an adaptive estimation method of audio perception loudness and a corresponding loudness fast normalization method. Different from the existing ReplayGain standard based on a fixed threshold method, the present invention first extracts the optimal stable decibel interval for the current audio file, and then calculates the perceived loudness of the audio file on this interval, and uses linear subsampling when the file is large Data dimensionality reduction by technology not only improves the estimation accuracy of perceived loudness, but also ensures the real-time performance of the algorithm.

[0024] The invention belongs to the field of multimedia information processing and audio analysis, and relates to a fast and practical new technology for perceptual normalization of audio loudness, which mainly includes two parts: acquisition of audio optimal and stable loudness value and loudness normalization, figure 1 and figure 2 The flow chart of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the fields of multimedia information processing and audio analyzing, and relates to a method for adjusting self-adaptive audio sensing loudness, which comprises that: the current audio file is filtered at the same loudness; the root mean square (RMS) energy value of an audio signal is calculated according to the special window size of the audio signal, and accordingly, the RMS energy sequence of the whole audio file is obtained; the RMS energy sequence is converted into a decibel value sequence, and the decibel value sequence is sorted in an ascending way; a difference method is utilized to calculate the second derivative of the decibel value sequence which is sorted in the ascending way, and a locale window average method is utilized to smooth the second derivative sequence; when the sequence is overlong, the original sequence is sub sampled; and the optimal stable decibel section of the current audio is searched on the smoothed second derivative sequence, and the average decibel of the section is calculated to be used as the optimal stable loudness of the current audio file. The loudness adjustment is carried out on the audio by adopting a linear mappingmethod. The method has the advantages of quick operating speed, accurate correction, satisfaction of acoustic sensing, stable performance, lossless audio frequency and tone quality and the like.

Description

technical field [0001] The invention belongs to the field of multimedia information processing and audio analysis, and relates to a new technology of adaptive audio perception loudness estimation and fast normalization, which can be used to automatically adjust audio files with different loudness perception standards to a unified perception loudness standard. Background technique [0002] The ReplayGain (replay gain) standard is a set of technical standards proposed by David Robinson in 2001 to measure the perceived loudness of MP3 music files and to normalize the audio loudness (see literature: D.Robinson, "ReplayGain specification discussion", www.replaygain.org , 2010). The specific steps are, first, conduct a psychoacoustic scan on the entire audio file to measure its perceived loudness and peak level; then calculate the difference gain value between the original loudness of the audio file and the target loudness (usually set as a sound pressure value of 89 decibels) ;...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L19/00G10L21/02G10L25/48
Inventor 冯伟万亮谭志羽江建民
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products