Speech collection method and device, computer equipment, and storage medium

A voice collection and corpus technology, applied in the computer field, can solve problems such as low collection efficiency, inability to clear recordings in time, troublesome operation, etc.

Pending Publication Date: 2018-11-16
PING AN TECH (SHENZHEN) CO LTD
View PDF8 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, voice collection mainly uses professional recording equipment for voice recording, obtains recording files, and then manually marks the speakers corresponding to the recording files. This manual method cannot remove unqualified recordings in time, and the operation is troublesome, which makes the collection efficiency low. At the same time, this manual collection method is not suitable for collecting voices from people who are far away. If you need to collect voices from people in different regions at the same time, you can only purchase multiple recording devices, which wastes a lot of collection costs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech collection method and device, computer equipment, and storage medium
  • Speech collection method and device, computer equipment, and storage medium
  • Speech collection method and device, computer equipment, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0032] see figure 1 , figure 1 The application environment of the voice collection method provided by the embodiment of the present invention is shown. The voice collection method is applied in a voice collection scene of an application account based on a communication application platform. The voice collection scenario includes a server, a client, and a communication application platform, wherein the server, the client, and the communication applica...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech collection method and device, computer equipment, and a storage medium. The method comprises the steps: selecting corresponding initial linguistic data when receivinga linguistic data obtaining request sent by a r through an application account of a communication application platform, and obtaining the identity label information of the user; calling a recording function of the application account of the communication application platform for recording after receiving a recording start request, and to obtain a record file; calling an offline voice recognition function of the application account of the communication application platform to convert the record file into a target text; carrying out the matching of the target text with the initial linguistic data through a text matching algorithm to obtain the text similarity; storing the record file, the identity label information and the mapping relation between the record file and the identity label information into a database if the text similarity is greater than or equal to a preset similarity threshold value, thereby achieving a purpose of quickly collecting the speech data through employing the application account of the communication application platform, and improving the collection efficiency of the speech data.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a voice collection method, device, computer equipment and storage medium. Background technique [0002] With the advancement of science and technology and the rapid development of computer network technology, voiceprint recognition technology and speech recognition technology are more and more popular among people. Among them, voiceprint recognition technology is used to quickly and To recognize and convert natural speech into text, both voiceprint recognition technology and speech recognition technology need to collect a large amount of speaker information and the corresponding voice information of the speaker for model training. [0003] At present, voice collection mainly uses professional recording equipment for voice recording, obtains recording files, and then manually marks the speakers corresponding to the recording files. This manual method cannot remove unqualified re...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/26G10L15/22H04L12/58G06F17/30
CPCH04L51/04G10L15/22G10L15/26
Inventor 黄锦伦
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products