System for audio analysis and perception enhancement

An audio perception technology, applied in the field of audio analysis and perception systems, that addresses problems such as inaccurate use.

Inactive Publication Date: 2016-10-12
Donald James Derrick (唐纳德·詹姆士·德里克) and one other

AI Technical Summary

Problems solved by technology

Thus, current vibro-haptic devices use precisely the information from speech signals that air-haptic devices do not, and vice versa.



Examples


[0133] Example 1: Zero Crossing Rate Technique

[0134] Hissing-type utterances (unvoiced) exhibit a broad spectrum. Utterances with a stronger fundamental and associated harmonics, on the other hand, appear more periodic and thus present a spectrum with more clearly identifiable peaks. Although periodicity calculations can be used to distinguish voiced utterances from unvoiced ones, such calculations are computationally expensive and offer limited performance for the cost involved.

[0135] Figure 4 shows a system 300 for generating control signals for an air haptic device. Unless otherwise described, features described with reference to Figure 4 have the same or similar function as the corresponding features of Figure 3, whose reference numerals they share with 100 added.

[0136] By measuring the number of zero crossings of the input acoustic signal per unit time, the system 300 achieves a simple method with usable performance under contr...
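As an illustration of the zero-crossing measurement described above, the following is a minimal Python sketch, not the patent's implementation. The function names, the sign convention (zero counts as non-negative), and the `threshold` value are assumptions made for this example only; the source text does not give them.

```python
import math


def zero_crossing_rate(samples, sample_rate):
    """Return the number of sign changes per second in `samples`.

    Assumption: a sample equal to zero is treated as non-negative.
    """
    if len(samples) < 2:
        return 0.0
    crossings = 0
    previous = samples[0]
    for current in samples[1:]:
        # A crossing occurs when consecutive samples differ in sign.
        if (previous >= 0) != (current >= 0):
            crossings += 1
        previous = current
    duration_seconds = len(samples) / sample_rate
    return crossings / duration_seconds


def is_candidate_unvoiced(samples, sample_rate, threshold=3000.0):
    """Hypothetical decision rule: flag a segment whose zero-crossing
    rate exceeds `threshold` crossings per second. The threshold is an
    illustrative value, not one taken from the source."""
    return zero_crossing_rate(samples, sample_rate) > threshold


def tone(frequency, sample_rate=8000, count=8000, phase=0.3):
    """Test signal: a sine tone standing in for real speech."""
    return [math.sin(2 * math.pi * frequency * i / sample_rate + phase)
            for i in range(count)]


low = zero_crossing_rate(tone(100), 8000)    # roughly 2 crossings per cycle
high = zero_crossing_rate(tone(3000), 8000)  # far more crossings per second
```

A low-frequency tone yields about two crossings per cycle, while broadband or high-frequency content yields many more, which is why the rate can serve as a cheap first indicator of hiss-like segments.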

[0144] Example 2: Teager Energy / Discrete Energy Separation Technique

[0145] Since the zero-crossing rate method left considerable room for improvement, a better method was found that still takes into account the need to operate on limited hardware.

[0146] Just as the zero-crossing method is based on physical aspects of the signal, the method using Teager energy and discrete energy separation takes this reasoning a step further and seeks to use knowledge of the speech generation process.

[0147] When generating two signals of equal amplitude, physics dictates that more energy is required to generate a high-frequency signal than a low-frequency one. Unvoiced utterances are fundamentally broadband noise (though more correlated than noise), meaning that a lot of energy goes into generating them. In voiced utterances, most of the energy is concentrated at the relatively low fundamental frequency. Thus, a method that assigns different energies to each frequency band based on t...
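To make the energy argument above concrete, here is a hedged Python sketch of the standard discrete Teager-Kaiser energy operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], together with a DESA-2 style frequency estimate. The source names "Teager energy" and "discrete energy separation" but does not say which variant it uses, so the choice of DESA-2 and all function names here are assumptions. For a pure tone A*cos(W*n + p) the operator evaluates to A^2 * sin^2(W), so at equal amplitude a higher-frequency tone scores a higher energy.

```python
import math


def teager_energy(samples):
    """Discrete Teager-Kaiser energy: psi[n] = x[n]^2 - x[n-1]*x[n+1].

    The result has two fewer elements than the input (no edge values).
    """
    return [samples[n] ** 2 - samples[n - 1] * samples[n + 1]
            for n in range(1, len(samples) - 1)]


def mean_teager_energy(samples):
    psi = teager_energy(samples)
    return sum(psi) / len(psi) if psi else 0.0


def desa2_frequency(samples, sample_rate):
    """DESA-2 style instantaneous frequency estimates in Hz.

    Uses y[n] = x[n+1] - x[n-1] and
    W ~= 0.5 * acos(1 - psi[y] / (2 * psi[x])).
    Valid for frequencies up to a quarter of the sample rate.
    """
    diff = [samples[n + 1] - samples[n - 1]
            for n in range(1, len(samples) - 1)]
    psi_x = teager_energy(samples)  # psi_x[i] belongs to sample i + 1
    psi_y = teager_energy(diff)     # psi_y[j] belongs to sample j + 2
    estimates = []
    for j, energy_y in enumerate(psi_y):
        energy_x = psi_x[j + 1]     # align both energies on sample j + 2
        if energy_x <= 0:
            continue                # skip points where the ratio is undefined
        cosine = max(-1.0, min(1.0, 1.0 - energy_y / (2.0 * energy_x)))
        omega = 0.5 * math.acos(cosine)
        estimates.append(omega * sample_rate / (2.0 * math.pi))
    return estimates


def tone(frequency, sample_rate=8000, count=400, phase=0.3):
    """Test signal: a unit-amplitude sine tone."""
    return [math.sin(2 * math.pi * frequency * i / sample_rate + phase)
            for i in range(count)]


energy_low = mean_teager_energy(tone(200))    # equals sin^2(2*pi*200/8000)
energy_high = mean_teager_energy(tone(2000))  # equals sin^2(pi/2) = 1
frequencies = desa2_frequency(tone(500), 8000)
```

The operator needs only three neighbouring samples and a few multiplications per output value, which fits the stated need to run on limited hardware.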

[0149] Example 3: Combination of Zero Crossing Rate, Teager Energy and Discrete Energy Separation Techniques

[0150] Figure 5 shows a system 400 that combines the zero-crossing rate and Teager energy techniques described above to improve overall performance. Unless otherwise described, features described with reference to Figure 5 have the same or similar function as the corresponding features of Figure 3, whose reference numerals they share with 200 added.

[0151] The various functional blocks of system 400 interact with each other in many ways. The system 400 relies mainly on heuristics: to improve algorithm performance, the signal from the classification module 440 is fed back to the feature extraction post-processing module 430, where it is used as a noise gating function.

[0152] The system 400 includes a feature extraction module 420, a classification module 440 and a control module 460, wherein the feature extraction module 420 is used...
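The module chain described above (feature extraction, post-processing with a noise gate, classification, control) can be sketched roughly as follows. This is an illustrative Python sketch under stated assumptions, not the patented algorithm: the frame-based processing, the threshold values `zcr_min`, `gate_open`, and `gate_hold`, and the way the previous classification lowers the gate threshold are all hypothetical choices made to show one possible form of classifier feedback.

```python
import math


def frame_features(frame, sample_rate):
    """Feature extraction: zero-crossing rate (per second) and mean
    Teager energy of one frame (frame must hold at least 3 samples)."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    zcr = crossings * sample_rate / len(frame)
    psi = [frame[n] ** 2 - frame[n - 1] * frame[n + 1]
           for n in range(1, len(frame) - 1)]
    return zcr, sum(psi) / len(psi)


def classify_frames(frames, sample_rate,
                    zcr_min=2000.0, gate_open=1e-3, gate_hold=5e-4):
    """Return one boolean per frame: True for a candidate unvoiced frame.

    Assumed feedback rule: once a frame has been classified as unvoiced,
    the noise gate uses the lower `gate_hold` threshold for the next
    frame, so weak continuations of a detected hiss are kept.
    """
    decisions = []
    previous = False
    for frame in frames:
        zcr, energy = frame_features(frame, sample_rate)
        gate = gate_hold if previous else gate_open  # classifier feedback
        passes_gate = energy >= gate                 # post-processing
        previous = passes_gate and zcr >= zcr_min    # classification
        decisions.append(previous)
    return decisions


def control_signal(decisions):
    """Control stage: 1.0 requests actuator output, 0.0 requests none."""
    return [1.0 if decision else 0.0 for decision in decisions]


def tone(frequency, amplitude, sample_rate=8000, count=160, phase=0.3):
    """Test frame: a sine tone standing in for real speech."""
    return [amplitude * math.sin(2 * math.pi * frequency * i / sample_rate
                                 + phase) for i in range(count)]


hiss = tone(3000, 0.5)                 # high-frequency stand-in for a hiss
weak_hiss = tone(3000, math.sqrt(1.5e-3))
voiced = tone(150, 0.5)                # low-frequency stand-in for voicing
silence = [0.0] * 160

signal = control_signal(classify_frames([hiss, voiced, silence], 8000))
```

With these illustrative thresholds, a weak hiss frame is rejected when it arrives on its own but accepted when it directly follows a strong hiss frame, which is the effect the feedback path is meant to show.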

Abstract

An audio perception system is described, comprising a capture module configured to capture acoustic speech signal information; a feature extraction module configured to extract features that identify a candidate unvoiced portion in an acoustic signal; a classification module configured to identify if the acoustic signal is or contains an unvoiced portion based on the extracted features; and a control module configured to generate a control signal to a sensory stimulation actuator for generating an aero-tactile stimulation to be delivered to a listener, the control signal based at least in part on a signal representing the identified unvoiced portion. Related methods are also described.

Description

Technical field

[0001] The present invention relates to systems for audio analysis and perception.

[0002] Specifically, the present invention contemplates a system for converting auditory speech information into air-tactile stimuli that resemble airflow produced by natural speech. The present invention also relates to a system for delivering air haptic stimuli to a listener when the listener receives or hears the voice information to enhance the perception of the voice information.

Background technique

[0003] When people speak, they produce audible, visual and somatosensory (vibration and airflow) information that can potentially help a listener understand what he/she is hearing. While auditory information may be sufficient for speech perception, other information streams can enhance speech perception. For example, visual information from the speaker's face can enhance the perception of speech. Touching the speaker's face can also aid speech perception. For example, ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L21/0364, G10L25/93
CPC: G10L21/0364, G10L25/93, G10L25/03, G06F3/016, G10L21/0264, G10L21/06, G10L25/78, G10L2025/783
Inventor: Donald James Derrick (唐纳德·詹姆士·德里克), Tom Gerard De Rybel (汤姆·热拉尔·德莱贝尔)
Owner: Donald James Derrick (唐纳德·詹姆士·德里克)