
Speaker positioning and tracking method and device based on audio and video features

A technology for positioning, tracking, and speaking

Inactive Publication Date: 2021-12-03
安徽创变信息科技有限公司

AI Technical Summary

Problems solved by technology

Audio and video information are complementary: audio covers all directions but gives poor positioning accuracy, whereas video is limited by the camera's viewing angle but can provide accurate positioning information.



Embodiment Construction

[0034] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0035] The present application provides a speaker positioning and tracking method based on audio and video features, including:

[0036] The coarse positioning calculation step is configured to i) receive position data of multiple microphones, each placed at a different position according to a preset microphone array distribution structure, and ii) receive the sound propagation data of the same speaker collected by the multiple microphones, together with the time data of that collection. The time data of the same speaker's voice collected by each microphone has associated delay data indicati...
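The coarse positioning idea can be illustrated with a time-difference-of-arrival (TDOA) search. The following is a minimal sketch under stated assumptions, not the method claimed in the application: the function name `locate_source`, the 2D room model, the grid-search strategy, the 5 m extent and the speed-of-sound constant are all hypothetical choices made for illustration. It takes known microphone positions and the time at which each microphone picked up the same sound, and returns the grid point whose predicted inter-microphone delays best match the measured ones.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature


def locate_source(mic_positions, arrival_times, extent=5.0, step=0.05):
    """Grid-search the 2D point whose predicted delays best match the measured ones.

    mic_positions: list of (x, y) microphone coordinates in meters.
    arrival_times: time in seconds at which each microphone heard the sound.
    extent, step:  size and resolution of the square search area (assumed).
    """
    ref_x, ref_y = mic_positions[0]
    # Measured delay of each microphone relative to microphone 0.
    # The unknown emission time cancels out in these differences.
    measured = [t - arrival_times[0] for t in arrival_times]

    best_point, best_error = None, math.inf
    steps = int(round(extent / step)) + 1
    for i in range(steps):
        for j in range(steps):
            x, y = i * step, j * step
            ref_dist = math.hypot(x - ref_x, y - ref_y)
            error = 0.0
            for (mx, my), delay in zip(mic_positions, measured):
                # Delay this candidate point would produce at this microphone.
                predicted = (math.hypot(x - mx, y - my) - ref_dist) / SPEED_OF_SOUND
                error += (predicted - delay) ** 2
            if error < best_error:
                best_point, best_error = (x, y), error
    return best_point
```

A grid search is used here only because it is easy to follow; a closed-form or iterative least-squares solver would be a more typical choice when speed matters. The estimate is only as fine as `step`, which fits the role of a coarse position that the video stage later refines.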


Abstract

The invention discloses a speaker positioning and tracking method and device based on audio and video features. The method comprises two steps. In the coarse positioning calculation step, the position data of a plurality of microphones is received, together with the sound propagation data of the same speaker collected by those microphones and the corresponding collection time data; the position of the sound source is then determined from the different positions of the microphones and the different time delays with which they receive the same sound source. In the precise positioning and tracking step, the camera is controlled to capture images of the speaker objects at the sound source position, and the center coordinate of the camera image is locked to the position of each of the speaker objects in the video stream, so that the center of the camera image stays synchronized with the positions of the speaker objects. According to the invention, the distribution structure of the microphones is optimized to achieve better sound pickup, which further improves the accuracy of the initial speaker position obtained from audio information, and the precise position of the speaker is obtained by exploiting the complementarity between audio and video information.
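Keeping the image center locked to a detected speaker can be sketched as computing the pan and tilt angles that would bring the speaker's bounding box to the middle of the frame. This is an illustrative sketch only, not the application's implementation: the function name `centering_angles`, the `(x, y, width, height)` box format, the pinhole camera model and the default field-of-view values are all assumptions.

```python
import math


def centering_angles(box, frame_width, frame_height, hfov_deg=60.0, vfov_deg=34.0):
    """Return (pan, tilt) in degrees that would center the box in the image.

    box: (x, y, width, height) of the detected speaker in pixels, with the
         origin at the top-left corner of the frame (assumed convention).
    hfov_deg, vfov_deg: horizontal and vertical field of view (assumed values).
    Positive pan means turn right; positive tilt means turn up.
    """
    x, y, w, h = box
    # Pixel offset of the box center from the image center.
    off_x = (x + w / 2.0) - frame_width / 2.0
    off_y = (y + h / 2.0) - frame_height / 2.0

    # Focal lengths in pixels under an ideal pinhole model (no lens distortion).
    fx = (frame_width / 2.0) / math.tan(math.radians(hfov_deg) / 2.0)
    fy = (frame_height / 2.0) / math.tan(math.radians(vfov_deg) / 2.0)

    pan = math.degrees(math.atan2(off_x, fx))
    # Image y grows downward, so the sign is flipped for tilt.
    tilt = -math.degrees(math.atan2(off_y, fy))
    return pan, tilt
```

In a tracking loop, these angles would be recomputed for every frame and sent to the camera's pan-tilt controller, with the coarse audio position supplying the initial pointing direction.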

Description

Technical field

[0001] The invention relates to the field of indoor positioning and tracking, in particular to a speaker positioning and tracking method and device based on audio and video features.

Background

[0002] In the multi-speaker tracking problem based on audio-video feature fusion in an intelligent environment, the speaker's voice signal and video signal are strongly complementary and strongly correlated. Complementarity is mainly reflected in the fact that audio information covers all directions but gives poor positioning accuracy, whereas video information is limited by the viewing angle of the camera but can provide accurate positioning information. In addition, video information is not affected by acoustic conditions such as background noise and room reverberation, while audio information is unaffected by the complexity of the visual scene. Correlation is reflected in the correlation between the speaker's voice and lip...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G01S5/22, G06K9/00, G06K9/32, G06K9/40
CPC: G01S5/22
Inventor: 戴李
Owner: 安徽创变信息科技有限公司