Sound and text realignment and information presentation method and device, electronic equipment and storage medium

A forced alignment, text technology, applied in the field of information processing, can solve the problems of speech recognition accuracy not 100%, errors, text and original audio alignment, etc. Effect

Pending Publication Date: 2021-12-07
BEIJING ZITIAO NETWORK TECH CO LTD
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in practice, speech recognition is not 100% accurate and there may be errors
Therefore, there are some apps that pro

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sound and text realignment and information presentation method and device, electronic equipment and storage medium
  • Sound and text realignment and information presentation method and device, electronic equipment and storage medium
  • Sound and text realignment and information presentation method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present disclosure will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0036] It should be noted that, in the case of no conflict, the embodiments in the present disclosure and the features in the embodiments can be combined with each other. The present disclosure will be described in detail below with reference to the accompanying drawings and embodiments.

[0037] figure 1 An exemplary system architecture 100 to which embodiments of the audiotext realignment method, apparatus, electronic device and storage medium of the present disclosure can be applied is shown.

[0038] like figure 1 As shown, the system architecture 1...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a sound and text realignment and information presentation method and device, electronic equipment and a storage medium, and the method comprises the steps: obtaining a target audio, a pre-editing recognition text and a post-editing recognition text, wherein the pre-editing recognition text is a recognition text obtained through the automatic voice recognition of the target audio, and the edited text is a text obtained by editing the recognition text before editing; performing forced alignment on the target audio and the recognition text before editing to determine audio starting and ending time corresponding to each character in the recognition text before editing; performing text alignment on the pre-edited recognition text and the post-edited recognition text to determine a character corresponding to each character in the post-edited recognition text in the pre-edited recognition text; and for each character in the edited recognition text, determining the audio starting and ending time of the character corresponding to the character in the pre-edited recognition text as the audio starting and ending time of the character. According to the invention, high-precision sound and text re-alignment between the target audio and the edited recognition text is realized.

Description

technical field [0001] Embodiments of the present disclosure relate to the technical field of information processing, and in particular, to a voice-text realignment and information presentation method, device, electronic device, and storage medium. Background technique [0002] Voice-text alignment refers to aligning the audio and the speech recognition text corresponding to the audio to mark the pronunciation time of each word in the recognition text. After alignment, the word or word corresponding to the current audio playback content can be highlighted in the recognized text; or, the user can click on the word or word in the recognized text to quickly jump to the corresponding audio for playback. [0003] However, in practice, speech recognition is not 100% accurate and errors may occur. Therefore, there are some applications that provide users with the ability to edit speech recognition text, but the edited text and the original audio will also have alignment problems. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/189G06F40/284G10L15/26G10L25/87
CPCG06F40/189G06F40/284G10L15/26G10L25/87
Inventor 王斌杨晶生
Owner BEIJING ZITIAO NETWORK TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products