Method for processing and acquiring human voice data

A data and human-voice technology, applied in voice analysis, voice recognition, instruments, and related fields. It addresses the difficulty of collecting voice data while ensuring voice recognition performance, the inability to reuse recognized voice, and the difficulty of recognizing voice commands. The claimed effects are an ingenious design, support for man-machine dialogue, and suitability for popularization and implementation.

Pending Publication Date: 2020-06-23
NANJING SILICON INTELLIGENCE TECH CO LTD
6 Cites, 1 Cited by

AI Technical Summary

Problems solved by technology

However, in this method, the pitch of the speaker who actually uses the speech recognition function often differs from the pitch of the collected speech data, so it is difficult both to collect a large amount of speech data and to ensure speech recognition performance.
Because an acoustic model is generally generated by learning the voice data of adult males, it is difficult to recognize voice commands from adult females, elderly people, or children, whose voice pitch differs. The recognized voice also cannot be reused, let alone support intelligent man-machine dialogue. For this reason, the present invention proposes a method for processing and acquiring human voice data.

Examples

Embodiment Construction

[0021] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments.

[0022] Referring to Figure 1, a method for processing and acquiring human voice data includes the following steps:

[0023] S1: Acquire the sound signal collected by the collector of the mobile terminal; pass the sound signal through the band-pass filter of the pre-processor to obtain sampled data within a predetermined frequency range;

[0024] S2: Collect the voice data of the sampled data from a voice-based device;

[0025] S3: Accumulate the voice data of the sampled data in the first memory;

[0026] S4: Learn from the accumulated voice data of the sampled data;

[0027] S5: Generate a personal acoustic model of the sampled data based ...
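The band-pass filtering in step S1 can be illustrated with a minimal sketch. The patent only says "a predetermined frequency range"; the 300 to 3400 Hz band, the 8 kHz sample rate, and the function names `bandpass_biquad`, `rms`, and `tone` below are assumptions chosen for illustration, and the single second-order section (audio EQ cookbook form) stands in for whatever filter the pre-processor actually uses.

```python
import math

def bandpass_biquad(samples, fs, low_hz, high_hz):
    """Apply one second-order band-pass section (unity gain at the center)."""
    f0 = math.sqrt(low_hz * high_hz)   # geometric center of the pass band
    q = f0 / (high_hz - low_hz)        # quality factor derived from bandwidth
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    b0, b1, b2 = alpha, 0.0, -alpha
    a0, a1, a2 = 1.0 + alpha, -2.0 * math.cos(w0), 1.0 - alpha
    out = []
    x1 = x2 = y1 = y2 = 0.0            # previous inputs and outputs
    for x in samples:
        y = (b0 * x + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2) / a0
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out

def rms(samples):
    """Root-mean-square level of a sample sequence."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def tone(freq_hz, fs, n):
    """Generate n samples of a unit-amplitude sine wave."""
    return [math.sin(2.0 * math.pi * freq_hz * i / fs) for i in range(n)]

FS = 8000  # assumed sample rate in Hz
in_band = bandpass_biquad(tone(1000, FS, FS), FS, 300, 3400)
out_band = bandpass_biquad(tone(50, FS, FS), FS, 300, 3400)
print(f"1000 Hz level: {rms(in_band):.3f}, 50 Hz level: {rms(out_band):.3f}")
```

A tone near the center of the band passes almost unchanged, while low-frequency hum well below the band is strongly attenuated, which is the effect S1 relies on before the data is accumulated.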

Abstract

The invention discloses a method for processing and acquiring human voice data. The method comprises the following steps: acquiring a voice signal collected by a collector of a mobile terminal; collecting voice data of the sampled data from a voice-based device; storing a general acoustic model and a personal acoustic model of the sampled data in a second memory, wherein the second memory is connected with a tone conversion unit for converting the tone of the sampled data into other required tones, the tones being selected from a database stored in the second memory; when a voice recognition request is received from the sampled data, extracting a feature vector from the voice data of the sampled data; selecting either the general acoustic model or the personal acoustic model of the sampled data based on the accumulated amount of voice data of the sampled data; and recognizing a voice command by using the extracted feature vector and the selected acoustic model. According to the applicant, the method is ingeniously designed, processes human voice reasonably, and is suitable for application and popularization.
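The choice between the general and the personal acoustic model, driven by how much voice data has accumulated, might look like the following sketch. The abstract does not state the selection rule, so the threshold comparison, the 600-second default, and the names `SpeakerStore`, `accumulate`, and `select_model` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class SpeakerStore:
    """Tracks accumulated voice data for one speaker (illustrative only)."""
    threshold_seconds: float = 600.0   # hypothetical cut-off, not from the patent
    utterances: List[float] = field(default_factory=list)

    def accumulate(self, duration_seconds: float) -> None:
        """Record one more utterance, identified here only by its duration."""
        self.utterances.append(duration_seconds)

    @property
    def accumulated_seconds(self) -> float:
        return sum(self.utterances)

    def select_model(self) -> str:
        """Assumed rule: use the personal model once enough data exists."""
        if self.accumulated_seconds >= self.threshold_seconds:
            return "personal"
        return "general"

store = SpeakerStore(threshold_seconds=10.0)
print(store.select_model())   # little data so far: falls back to the general model
store.accumulate(6.0)
store.accumulate(5.0)
print(store.select_model())   # 11 s accumulated, above the 10 s threshold
```

Falling back to the general model while data is scarce is one plausible reading of the abstract: a personal model trained on very little speech would likely recognize worse than a general one.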

Description

Technical field

[0001] The present invention relates to the technical field of voice data processing, and in particular to a method for processing and acquiring human voice data.

Background technique

[0002] According to the conventional voice recognition method, voice recognition is performed using an acoustic model that has been stored in a voice recognition device in advance. The acoustic model represents the attributes of the speaker's speech. For example, phonemes, diphones, triphones, pentaphones, syllables, and characters are used as the basic units of the acoustic model. If phonemes are used as the basic unit, the number of acoustic models is reduced; context-sensitive acoustic models such as diphones, triphones, or pentaphones are therefore widely used to reflect coarticulation, the phenomenon in which a phoneme changes under the influence of adjacent phonemes. Large amounts of data are needed to learn context-sensiti...
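To make the background concrete, the sketch below expands a phoneme sequence into triphones and counts how many context-dependent units are possible. The `left-center+right` label notation follows a common speech-toolkit convention; the boundary symbol `sil`, the 40-phoneme inventory, and the function names `to_triphones` and `context_unit_count` are assumptions for illustration, not part of the patent.

```python
def to_triphones(phonemes, boundary="sil"):
    """Label each phoneme with its left and right neighbor (left-center+right)."""
    padded = [boundary] + list(phonemes) + [boundary]
    return [
        f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
        for i in range(1, len(padded) - 1)
    ]

def context_unit_count(num_phonemes, context_width):
    """Upper bound on distinct units when each unit spans context_width phonemes."""
    return num_phonemes ** context_width

print(to_triphones(["k", "ae", "t"]))
for name, width in [("phoneme", 1), ("diphone", 2), ("triphone", 3)]:
    print(name, context_unit_count(40, width))
```

With an assumed inventory of 40 phonemes, the bound grows from 40 units to 1,600 diphones and 64,000 triphones, which is why context-sensitive models need far more training data than phoneme models.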

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/02, G10L15/06, G10L15/08, G10L15/26, G10L17/04, G10L17/22
CPC: G10L15/06, G10L15/02, G10L15/08, G10L17/04, G10L17/22
Inventor: 司马华鹏, 胡红燕, 陆放, 茅玥琪, 司马德一
Owner: NANJING SILICON INTELLIGENCE TECH CO LTD