Recording control method and apparatus
A recording control method based on voice data, applied in the field of data processing. It addresses three problems: degraded user experience, reduced accuracy of voice recognition results, and failure to detect the voice end point, and thereby improves user experience and recognition accuracy.
Examples
Embodiment 1
[0070] Referring to Figure 2, a flowchart of an embodiment of a recording control method provided by an exemplary embodiment of this specification, the method can be applied to the application scenario illustrated in Figure 1 and includes the following steps:
[0071] Step 201: Control the sound pickup device to start recording based on the specific voice data of the target speaker.
[0072] With reference to the description of the application scenario shown in Figure 1 above, in the embodiments of this specification the activation word mentioned there is referred to as specific voice data for convenience of description.
[0073] Step 202: Sample the voice data recorded by the sound pickup device according to a preset sampling rule to obtain sampled data.
[0074] In the embodiments of this specification, after the sound pickup device 140 starts recording, a VAD (voice activity detection) algorithm is first used to determine the voice start point; after the voice start point is deter...
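The text names VAD for finding the voice start point but does not say which algorithm is used. As an illustration only, the following is a minimal sketch that assumes a simple short-time-energy detector. The function names, frame length, energy threshold, and minimum run length are all hypothetical and are not taken from the specification.

```python
def frame_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in frame) / len(frame)


def find_voice_start(samples, frame_len=160, energy_threshold=0.01, min_frames=3):
    """Return the sample index where speech is assumed to start, or None.

    Assumption (not from the specification): a start point is declared at
    the first frame of a run of `min_frames` consecutive frames whose
    energy exceeds `energy_threshold`.
    """
    run = 0
    n_frames = len(samples) // frame_len
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        if frame_energy(frame) > energy_threshold:
            run += 1
            if run == min_frames:
                # Index of the first frame in the run, converted to samples.
                return (i - min_frames + 1) * frame_len
        else:
            run = 0  # A quiet frame breaks the run.
    return None
```

Requiring several consecutive loud frames is one common way to avoid triggering on a single click or pop; a production VAD would typically use more robust features than raw energy.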
Embodiment 2
[0094] Referring to Figure 3, a flowchart of another embodiment of a recording control method provided by an exemplary embodiment of this specification. Building on the application scenario shown in Figure 1 and the method shown in Figure 2, the method illustrated in Figure 3 focuses on the process of training the speaker model and may include the following steps:
[0095] Step 301: Obtain at least one piece of historical voice data.
[0096] In the embodiments of this specification, during daily use of the sound pickup device 140, the speech of all speakers can be collected and preprocessed (for example, pre-emphasis, framing, and windowing), and the resulting voice data is saved. For convenience of description, this voice data is referred to as historical voice data.
[0097] In an embodiment, a collection period can be set, for example, one month from the first use of the sound pickup device 140 is set as...
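The preprocessing steps named above (pre-emphasis, framing, windowing) can be sketched as follows. This is an illustrative sketch rather than the patent's implementation: the pre-emphasis coefficient of 0.97, the frame and hop lengths, the choice of a Hamming window, and all function names are assumptions.

```python
import math


def pre_emphasis(samples, coeff=0.97):
    """Apply y[n] = x[n] - coeff * x[n-1]; the first sample is passed through."""
    if not samples:
        return []
    return [samples[0]] + [
        samples[n] - coeff * samples[n - 1] for n in range(1, len(samples))
    ]


def split_frames(samples, frame_len=400, hop_len=160):
    """Split into overlapping frames; trailing samples that do not fill a frame are dropped."""
    return [
        samples[start:start + frame_len]
        for start in range(0, len(samples) - frame_len + 1, hop_len)
    ]


def hamming_window(frame):
    """Multiply a frame by a Hamming window of the same length."""
    n = len(frame)
    if n == 1:
        return list(frame)
    return [
        s * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
        for i, s in enumerate(frame)
    ]


def preprocess(samples, frame_len=400, hop_len=160, coeff=0.97):
    """Pre-emphasize, frame, and window a sequence of audio samples."""
    emphasized = pre_emphasis(samples, coeff)
    return [hamming_window(f) for f in split_frames(emphasized, frame_len, hop_len)]
```

With 16 kHz audio, the assumed defaults correspond to 25 ms frames with a 10 ms hop, a common choice in speech front ends.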
Embodiment 3
[0109] Referring to Figure 4, a flowchart of another embodiment of a recording control method provided by an exemplary embodiment of this specification. Building on the application scenario shown in Figure 1 and the method shown in Figure 2, the method illustrated in Figure 4 focuses on the process of determining whether a target speaker model corresponding to the target speaker 110 exists among the at least one speaker model obtained through training, and may include the following steps:
[0110] Step 401: Perform feature extraction on the specific voice data of the target speaker to obtain feature values of the specific voice data.
[0111] Step 402: For each speaker model in the at least one trained speaker model, apply a preset operation rule to the speaker model and the feature values of the specific voice data.
[0112] Step 403: Determine whether there is a speaker model whose calculation result is greater than the pr...
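Steps 401 to 403 describe scoring the extracted feature values against each trained speaker model and checking the results against a cutoff. The specification does not state the operation rule in the text shown here, so the sketch below substitutes assumptions throughout: each speaker model is represented as a plain feature vector, the operation is cosine similarity, and the cutoff of 0.8 is arbitrary. All names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_target_speaker_model(features, speaker_models, threshold=0.8):
    """Return (speaker_id, score) for the best model scoring above `threshold`, else None.

    Assumptions (not from the specification): `speaker_models` maps a
    speaker id to a feature vector, and the score is cosine similarity.
    """
    best = None
    for speaker_id, model_vector in speaker_models.items():
        score = cosine_similarity(features, model_vector)
        if score > threshold and (best is None or score > best[1]):
            best = (speaker_id, score)
    return best
```

Returning `None` when no model clears the cutoff lets the caller distinguish "no target speaker model exists" from a weak match, which is the decision this embodiment is concerned with.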