Object recognition method, apparatus, device and storage medium for audio data

A technology for object recognition on audio data, applied in speech analysis, instruments, etc. It addresses the problems that an extracted voiceprint may fail to match, that the object corresponding to the audio data then cannot be determined, and that existing recognition schemes therefore perform poorly; it aims to improve the recognition effect.

Active Publication Date: 2021-10-08
IFLYTEK CO LTD

AI Technical Summary

Problems solved by technology

However, the extracted voiceprint sometimes cannot be matched, in which case the object corresponding to the audio data cannot be determined; that is, the existing recognition scheme has a poor recognition effect.



Detailed Description of Embodiments

[0086] The technical solutions in the embodiments of the application are described clearly and completely below with reference to the drawings in the embodiments of the application. Obviously, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative effort fall within the scope of protection of this application.

[0087] In view of the poor recognition effect of the recognition scheme in the prior art, the inventors conducted in-depth research and finally proposed a recognition scheme with a better effect, which is introduced below.

[0088] Referring to FIG. 1, which shows a schematic flowchart of an object recognition method for audio data provided in an embodiment of the present application, the method may include:

[0089] Step S101: Obtain the audio data to be recognized in the target s...


Abstract

The present application provides an object recognition method, apparatus, device and storage medium for audio data. The method includes: acquiring audio data to be recognized in a target scene and a target voiceprint feature set adapted to the target scene; and identifying, based on the target voiceprint feature set, the object corresponding to the audio data to be recognized. Because the target voiceprint feature set is adapted to the target scene, the voiceprint features extracted from the audio data to be recognized in the target scene can be better matched against it, which improves the recognition effect for the object corresponding to the audio data to be recognized in the target scene.
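
The matching step described in the abstract can be illustrated with a minimal sketch. The source does not say how voiceprint features are extracted, how they are compared, or how the feature set is adapted to the scene, so everything concrete here is an assumption made for illustration: features are plain fixed-length vectors, the comparison is cosine similarity with a threshold, and the names `identify_object`, `meeting_feature_set`, and `threshold` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify_object(query_feature, target_feature_set, threshold=0.7):
    """Return the best-matching object in the scene-adapted set, or None.

    ASSUMPTION: cosine similarity and the 0.7 threshold are illustrative
    choices; the source does not specify the matching rule.
    """
    best_name, best_score = None, threshold
    for name, feature in target_feature_set.items():
        score = cosine_similarity(query_feature, feature)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Hypothetical toy feature set for one target scene (for example, one meeting).
meeting_feature_set = {
    "speaker_a": [1.0, 0.0, 0.0],
    "speaker_b": [0.0, 1.0, 0.0],
}

# A feature close to speaker_a's entry matches; an unrelated one matches nothing.
matched = identify_object([0.9, 0.1, 0.0], meeting_feature_set)
unmatched = identify_object([0.0, 0.0, 1.0], meeting_feature_set)
```

In this sketch `unmatched` is `None`, which corresponds to the failure case the patent describes for the existing scheme: a voiceprint that matches nothing leaves the object unknown.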

Description

Technical field

[0001] The present application relates to the technical field of audio data processing, and in particular to an object recognition method, apparatus, device and storage medium for audio data.

Background

[0002] In some scenarios (such as meetings, speeches, diplomatic events, etc.), when an object speaks, information about that object needs to be displayed so that other objects know who is speaking and can more easily understand the content of the speech.

[0003] It can be understood that, in order to display the information of the speaking object, the object corresponding to the audio data must be identified after the audio data of the speaking object is acquired.

[0004] The existing recognition scheme is: extract a voiceprint feature from the audio data to be recognized, and match the extracted voiceprint feature against the voiceprint features in a voiceprint library. However, at some p...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L17/08, G10L17/02, G10L17/04
CPC: G10L17/02, G10L17/04, G10L17/08
Inventor: 张享, 高建清, 王智国, 胡国平, 胡郁, 刘庆峰
Owner: IFLYTEK CO LTD