User-defined instruction recognition speech photographing system

A command recognition and self-definition technology, which is applied in speech analysis, speech recognition, TV system components, etc., can solve the problems that the Selfie effect cannot meet the requirements of each user at the same time, and the recognition of specified voice commands is unsuccessful, so as to achieve improvement Practicality, enhanced interactivity effect

Inactive Publication Date: 2016-09-07
JINLING INST OF TECH
View PDF10 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these voice commands are generally specified by the system, that is to say, the user can only realize voice photography through fixed voice commands.
This will inevitably cause certain limitations. First of all, everyone's different speaking styles, different pronunciations, and the existence of dialects may lead to the failure of the specified voice command recognition.
Secondly, when the user wants to take a selfie by voice, considering that everyone’s smile is not the same, therefore, the selfie effect achieved by using the same voice command may not meet the requirements of each user at the same time, for example: some people The best smiles come when you use the voice command “eggplant,” while others prefer “tomato,” “cheese,” or “Kimci” (the Korean word for “kimchi”), etc.
In the prior art, there are relatively few methods or systems in which users can customize voice commands to identify and control the camera to take pictures

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • User-defined instruction recognition speech photographing system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0017] Below in conjunction with accompanying drawing, technical scheme of the present invention is described in further detail:

[0018] The schematic diagram of the system structure of the present invention is as figure 1 As shown, the voice camera system for self-defining command recognition, the system includes a voice command collection module, an audio signal preprocessing module, an audio signal feature extraction module, a voice definition training module and a language recognition control module,

[0019] The voice command collect...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an user-defined instruction recognition speech photographing system, the system comprises a speech instruction collecting module, an audio signal preprocessing module, an audio signal feature extraction module, a speech definition training module and a speech recognition control module, the speech instruction collecting module is used for collecting audio signals of a speech instruction; preprocessing and feature extraction are performed on the collected audio signal through the audio signal preprocessing module and the audio signal feature extraction module in sequence; the speech definition training module is used for establishing a speech feature pattern library and logging the speech instruction corresponding to the processed and extracted audio signal in the feature pattern library; and the speech recognition control module searches a minimum matching error to obtain a recognition result and executes the corresponding speech instruction. The technical scheme disclosed by the invention can improve the practicability of speech photographing function and can realize user personalized customization, and the interactivity between the user and the device can be improved.

Description

technical field [0001] The invention discloses a voice photographing system capable of self-defining instruction recognition and relates to the technical field of audio signal processing. Background technique [0002] With the rapid development of the information industry, intelligent products have been widely favored by people. As a key technology of human-computer interaction, speech recognition has been applied in many aspects of our lives, such as car voice navigation, mobile phone voice-activated dialing, home appliance control and voice database retrieval services, etc. [0003] In the intelligent product market, mobile phones occupy an important position because of their lightness, dexterity and rich APP functions. Among them, various camera software has been favored by the majority of users, and their functions are constantly evolving and improving. It is not difficult to find that in many camera software, there is basically a voice camera function, which mainly con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/22G10L15/02G10L15/06G10L25/24H04N5/232
CPCG10L15/02G10L15/063G10L15/22G10L25/24G10L2015/223H04N23/60
Inventor 王丹丹臧娴
Owner JINLING INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products