Speech recognition method and device, electronic equipment and storage medium

A speech recognition and word technology, applied in the computer field, that addresses problems such as poor user experience and low recognition accuracy, and thereby improves speech recognition accuracy and user experience.

Active Publication Date: 2020-12-04

BEIJING BYTEDANCE NETWORK TECH CO LTD

Cites: 13; Cited by: 6

AI Technical Summary

Problems solved by technology

The existing ASR model uses a general-purpose lexicon. When performing speech recognition based on a general-purpose lexicon in a communication scenario containing specialized terms in a specific field, the recognition accuracy is low and the user experience is poor.

Examples

Embodiment 1

[0029] Figure 1 is a schematic flow chart of a speech recognition method provided by Embodiment 1 of the present disclosure. This embodiment is applicable to performing speech recognition in a communication scenario containing technical terms of a specific field. The method can be executed by a speech recognition device, which can be implemented in software and/or hardware and can be configured in an electronic device, for example, in a backend server of a communication application.

[0030] As shown in Figure 1, the speech recognition method provided in this embodiment includes:

[0031] S110. When the current condition satisfies the lexicon selection condition, acquire subtitle text information within the communication range, and extract keywords of the subtitle text information.

[0032] In the embodiments of the present disclosure, the communication range may be regarded as the range composed of cli...
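As an illustration of the keyword extraction in S110, here is a minimal sketch. The excerpt does not say how keywords are extracted, so the frequency-based ranking, the `STOP_WORDS` set, and the `extract_keywords` name are all assumptions made for this example only.

```python
import re
from collections import Counter

# Hypothetical stop-word list (an assumption); a real system would use a
# fuller, language-specific list.
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "in", "is", "for",
              "on", "we", "with", "this", "that"}

def extract_keywords(subtitles, top_k=5):
    """Return the top_k most frequent non-stop-word tokens across subtitle lines."""
    counts = Counter()
    for line in subtitles:
        # Lowercase and split into alphanumeric tokens.
        for token in re.findall(r"[a-z0-9]+", line.lower()):
            if token not in STOP_WORDS and len(token) > 1:
                counts[token] += 1
    return [word for word, _ in counts.most_common(top_k)]

subtitles = [
    "The tensor shape is wrong",
    "Reshape the tensor before the matmul",
    "tensor matmul",
]
print(extract_keywords(subtitles, top_k=2))
```

Frequency ranking is only one option; a weighting scheme such as TF-IDF could be substituted without changing the interface.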

Embodiment 2

[0062] The embodiments of the present disclosure may be combined with the various optional solutions of the speech recognition method provided in the foregoing embodiments. The speech recognition method provided in this embodiment can perform speech recognition on audio data in the communication range based on words similar to the keywords, which enriches the recognition scheme. It can also update the industry-specific lexicon and/or the similar words, so that speech recognition is performed based on the updated industry-specific lexicon and/or similar words, further improving the accuracy of speech recognition.

[0063] Figure 2 is a schematic flow chart of a speech recognition method provided in Embodiment 2 of the present disclosure. As shown in Figure 2, the speech recognition method provided in this embodiment includes:

[0064] S210. When the number of subtitle text information accumulated in the communication range reaches a first preset value for t...
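The trigger in S210 can be pictured as a simple counter over accumulated subtitles. The sketch below is hypothetical: the `SubtitleAccumulator` class and its `add` method are invented for illustration, and because the paragraph above is truncated, the sketch assumes the condition fires once, at the moment the count reaches the first preset value.

```python
class SubtitleAccumulator:
    """Collects subtitle text and signals when a preset count is reached.

    Assumption: the condition is met exactly when the number of stored
    subtitles equals first_preset_value, so it fires only once.
    """

    def __init__(self, first_preset_value):
        self.first_preset_value = first_preset_value
        self.subtitles = []

    def add(self, text):
        """Store one subtitle; return True if the threshold was just reached."""
        self.subtitles.append(text)
        return len(self.subtitles) == self.first_preset_value

acc = SubtitleAccumulator(first_preset_value=3)
for line in ["hello", "the tensor shape", "run the matmul", "thanks"]:
    if acc.add(line):
        print("lexicon selection condition met after", len(acc.subtitles), "subtitles")
```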

Embodiment 3

[0095] Figure 3 is a schematic structural diagram of a speech recognition device provided by Embodiment 3 of the present disclosure. The speech recognition device provided in this embodiment is suitable for performing speech recognition in a communication scenario containing technical terms of a specific field.

[0096] As shown in Figure 3, the speech recognition device includes:

[0097] The keyword extraction module 310 is used to obtain the subtitle text information within the communication range when the current condition meets the lexicon selection condition, and to extract the keywords of the subtitle text information;

[0098] The industry characterization vector selection module 320 is used to determine the characterization vector of the subtitle text information according to the word vector of the keyword, and select a target industry characterization vector similar to the characterization vector of the subtitle text information from the preset industry charact...
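The selection performed by module 320 can be sketched as follows. The excerpt does not specify how keyword word vectors are combined or how similarity is measured, so this example assumes a mean of the keyword vectors and cosine similarity. The function names and the toy two-dimensional vectors are hypothetical.

```python
import math

def mean_vector(vectors):
    """Component-wise mean of equal-length vectors."""
    n = len(vectors)
    return [sum(component) / n for component in zip(*vectors)]

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def select_industry(keywords, word_vectors, industry_vectors):
    """Return the industry whose vector is most similar to the keyword mean."""
    known = [word_vectors[w] for w in keywords if w in word_vectors]
    if not known:
        return None  # no keyword has a word vector, so nothing to compare
    text_vector = mean_vector(known)
    return max(industry_vectors,
               key=lambda name: cosine(text_vector, industry_vectors[name]))

# Toy data, invented for illustration only.
word_vectors = {"tensor": [0.9, 0.1], "matmul": [0.8, 0.2], "invoice": [0.1, 0.9]}
industry_vectors = {"machine_learning": [1.0, 0.0], "finance": [0.0, 1.0]}
print(select_industry(["tensor", "matmul"], word_vectors, industry_vectors))
```

The returned industry name would then be used to look up the corresponding industry-specific lexicon.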

Abstract

The embodiment of the invention discloses a voice recognition method and device, electronic equipment and a storage medium, wherein the method comprises the steps: obtaining subtitle text information in a communication range when a current condition meets a lexicon selection condition, and extracting a keyword of the subtitle text information; determining a representation vector of the subtitle text information according to the word vector of the keyword, and selecting a target industry representation vector similar to the representation vector of the subtitle text information from preset industry representation vectors; and performing voice recognition on pulled audio data in the communication range based on an industry exclusive lexicon corresponding to the target industry representation vector. According to the method, the industry exclusive lexicon matched with the professional terms in the specific field can be selected for voice recognition in a communication scene containing the professional terms in the specific field, so that the voice recognition precision is improved, and the user experience is improved.
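The abstract does not state how the industry exclusive lexicon enters the recognition step. One common way to bias recognition toward a lexicon is to rescore candidate transcriptions, and the sketch below shows that swapped-in technique, not the patent's own mechanism. The `rescore` function, the `bonus` weight, and the sample hypotheses are all assumptions.

```python
def rescore(hypotheses, lexicon, bonus=1.0):
    """Pick the best transcription after boosting lexicon matches.

    hypotheses: list of (text, score) pairs, higher score is better.
    lexicon: set of lowercase domain terms.
    Each distinct lexicon term found in a hypothesis adds `bonus` to its score.
    """
    def boosted(item):
        text, score = item
        words = set(text.lower().split())
        return score + bonus * len(words & lexicon)
    return max(hypotheses, key=boosted)[0]

# Hypothetical n-best list from a general-purpose recognizer.
hypotheses = [
    ("use a ten sir for input", -3.0),
    ("use a tensor for input", -3.5),
]
print(rescore(hypotheses, lexicon={"tensor", "matmul"}))
```

With an empty lexicon the general-purpose ranking is left unchanged, which makes the effect of the domain terms easy to compare.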

Description

Technical field

[0001] Embodiments of the present disclosure relate to the field of computer technology, and in particular, to a voice recognition method, system, device, electronic equipment, and storage medium.

Background

[0002] With the continuous development of the Internet and communication technologies, information communication through communication applications has become one of the important ways for users to exchange information. When communication between clients includes audio data, the server can transcribe the audio data into text through Automatic Speech Recognition (ASR) technology, and send the transcribed text to the corresponding client, so that the client can display the subtitles corresponding to the audio data. Existing ASR models usually use a general-purpose lexicon. When speech recognition is performed based on a general-purpose lexicon in a communication scenario that contains specialized terms in a specific field, the recognition accuracy is low and ...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L15/26, G10L15/30, G10L15/14

CPC: G10L15/30, G10L15/142

Inventors: 徐文铭, 郑翔, 杨晶生

Owner: BEIJING BYTEDANCE NETWORK TECH CO LTD
