Phoneme marking method and device based on audio fingerprint
A technology of audio fingerprinting and marking method, which is applied in speech analysis, instruments, etc., can solve the problems of low efficiency, time-consuming, and manpower, and achieve the effect of avoiding the influence of noise and reducing the comparison time
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0036] This embodiment provides a phoneme marking method based on audio fingerprints, which is applicable to phoneme marking application scenarios in the field of speech recognition, and can improve marking efficiency. The phoneme marking method based on audio fingerprints consists of a phoneme marking device based on audio fingerprints implemented by software and / or hardware.
[0037] figure 1 It is a flow chart of the audio fingerprint-based phoneme marking method provided in Embodiment 1.
[0038] see figure 1 , the phoneme marking method based on audio fingerprints comprises the steps:
[0039] S10. Perform pre-emphasis, framing and preprocessing of adding a Hamming window to the speech signal to obtain the speech to be marked.
[0040] Specifically, the purpose of the pre-emphasis of the speech signal is to emphasize the high-frequency part of the speech, remove the influence of lip radiation, and increase the high-frequency resolution of the speech. After the pre-emp...
Embodiment 2
[0048] The audio fingerprint-based phoneme marking device provided in this embodiment can be used to implement the audio fingerprint-based phoneme marking method provided in the embodiment of the present invention, and has corresponding functions and beneficial effects.
[0049] figure 2 A structural block diagram of an audio fingerprint-based phoneme marking device provided in Embodiment 2.
[0050] see figure 2 , a phoneme marking device based on audio fingerprints, comprising:
[0051] Preprocessing unit 1 is used to carry out pre-emphasis, framing and preprocessing of adding Hamming window to the speech signal to obtain the speech to be marked;
[0052] The pole acquisition unit 2 is used to extract the audio fingerprint of the voice to be marked, and obtains the voice spectrum pole information of the audio fingerprint of the voice to be marked; the extreme point acquisition unit 2 is specifically used to extract the audio fingerprint of the voice to be marked, and obt...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com