Sound-based portrait automatic tracking method and device, intelligent terminal and medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An automatic tracking and intelligent terminal technology, applied in image communication, color TV parts, TV system parts, etc., can solve the problem that the camera cannot rotate in time to capture the speaker, and achieve a good user experience effect

Inactive Publication Date: 2021-02-05

SHENZHEN SKYWORTH RGB ELECTRONICS CO LTD

View PDF3 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to provide a sound-based automatic portrait tracking method, device, intelligent terminal and storage medium for the above-mentioned defects of the prior art, aiming to solve the problems of users in the prior art when using a mobile phone with a camera function. On smart terminals, when there are many people, the camera cannot rotate in time to capture the speaker

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0048] Such as figure 1 As shown in , an embodiment of the present invention provides a sound-based automatic portrait tracking method, which can be applied to a smart terminal with a camera, and the camera can be raised and lowered and the shooting angle can be rotated. In an embodiment of the present invention, the method includes the following steps:

[0049] Step S100, real-time detection and acquisition of human speech;

[0050] When the present invention is actually implemented, it is necessary to pre-set the smart terminal with camera function to support far-field voice, and the camera structure supports up, down, left and right angle rotation or the camera has a super wide-angle function. For example, the method of the present invention is implemented on a smart TV equipped with a liftable and rotatable camera, or a liftable and rotatable camera is pre-installed on a smart terminal without a camera.

[0051] In the embodiment of the present invention, the smart termina...

Embodiment 2

[0065] Such as figure 2 As shown, a sound-based automatic portrait tracking method in this specific application embodiment includes the following steps;

[0066] This embodiment takes the smart TV system as an example. The smart TV system needs to meet the requirements that the smart device supports far-field voice and the smart device camera structure supports up, down, left and right angle rotation or the smart device camera has a super wide-angle function.

[0067] Step 10: When the smart TV detects a person speaking in the camera startup scene mode, it combines voice technology to obtain the location information of the current speaker.

[0068] For example, in the camera application, you can turn on / off the sound-based automatic portrait tracking function in the setting menu (such as the [Sound-transformed portrait automatic tracking] function), or there is no such menu, and the default [Sound-transformed portrait automatic tracking] function (voice-based automatic portr...

Embodiment 3

[0081] Such as figure 2 As shown, Embodiment 3 of the present invention provides a sound-based automatic portrait tracking method, comprising the following steps:

[0082] This embodiment realizes the automatic portrait tracking function of the camera (automatic portrait tracking based on sound), and the conditions that need to be met are that the TV chip SOC needs to be able to run the camera portrait automatic focus function algorithm or the performance of the TV SOC needs to be separately increased by at least 1 NPU (embedded neural network) Network processor) computing power, the camera portrait auto-focus algorithm runs on the NPU alone.

[0083] Step S101: When the camera is turned on in the scene mode, when a person is detected to speak, combined with voice technology, the location information of the current speaker is acquired.

[0084] For example, a smart TV opens a camera function application, obtains a currently captured picture of the camera, and displays it on ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an automatic portrait tracking method and device based on sound, an intelligent terminal and a medium. The method comprises the following steps: detecting and acquiring speaking sound of a person in real time; positioning the position information of the current speaker according to the acquired speaker speaking sound; and controlling the camera to adjust the shooting angleaccording to the position information of the current speaker, and shooting by aligning to the current speaker. The embodiment of the invention is applied to the intelligent terminal with the camera, and the camera can automatically turn to a speaking person during implementation, so that automatic C-bit can be provided for a corresponding main angle when the camera does not know the turning personin a multi-person scene, and better user experience is provided for a user.

Description

technical field [0001] The present invention relates to the technical field of intelligent terminals, in particular to a sound-based automatic portrait tracking method, device, intelligent terminal and storage medium. Background technique [0002] With the development of science and technology and the continuous improvement of people's living standards, there are more and more smart terminals with camera functions. For example, TVs are becoming more and more intelligent, and cameras have become the development trend of TVs. Most TVs have built-in cameras. When you are alone at home, do you wish that there will be a pair of smart eyes looking at you when you wake up? , waiting to be called at any time; when you are in a multi-person video call / multi-person conference / multi-person scene, who should the camera point to? Who is the center and who is displayed on the UI interface? This has become a big problem. Some gestures are assigned to someone, which is inconvenient to use...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): H04N5/232

CPCH04N23/61H04N23/611H04N23/695

Inventor 宁秋梅江润贾增利刘熙桐李凯

Owner SHENZHEN SKYWORTH RGB ELECTRONICS CO LTD

Sound-based portrait automatic tracking method and device, intelligent terminal and medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology