A method and device for creating audio abstract text based on speech recognition

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech recognition and audio technology, used in speech recognition, speech analysis, instruments, etc.

Active Publication Date: 2021-06-11

HAIER YOUJIA INTELLIGENT TECH (BEIJING) CO LTD

View PDF14 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The present invention aims to solve the problems described above, and provides a method and device for enabling users to conveniently and quickly create audio summary texts for audio files such as books

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

no. 1 example

[0057]The first embodiment of the present invention is described for the case of creating an audio summary once, please refer to figure 2 . The so-called one-time creation of an audio summary means that the summary creation command is only received once during the playback of an audio file.

[0058] First, there is a shortcut for the user to issue a create summary command for the audio file being played. The shortcut here includes using a shortcut key or a combination of multiple shortcut keys. The command to create a summary may be a command issued by a hardware component of the device, for example, a signal issued when the volume down key and a power key are pressed simultaneously on a mobile phone, or a signal issued when a certain key is pressed and held. Another example is the touch screen signal received by the playback software when playing audio, such as the signal of double-clicking the screen, the signal of drawing a preset graphic on the screen, and so on. Creat...

no. 2 example

[0064] The second embodiment of the present invention is described for the situation of creating audio summaries multiple times, please refer to figure 2 . The multiple creation of audio abstracts in this embodiment is relative to the single creation of audio abstracts in the first embodiment. Because in the process of actually listening to the audio file, the audience may not only be interested in a piece of content, but hear different exciting parts continuously. At this time, it is necessary to use the method of creating audio summaries multiple times to meet the audience's demand for simultaneous creation of summaries for multiple pieces of content. The specific content includes the following steps:

[0065] First, the first creation summary command is still received when playing an audio file;

[0066] Secondly, determine the playback position of the audio file when receiving the summary command for the first time; determine the position information of the target audi...

no. 3 example

[0080] The present invention also provides a device for creating audio summary text based on speech recognition, such as Figure 4 ,include:

[0081] Command receiving module: used to receive the creation summary command when playing the audio file;

[0082] Playing position determination module: used to determine the playing position of the audio file when receiving the creation summary command;

[0083] Audio segment position determination module: connected to the playback position determination module, used to determine the position information of the target audio segment according to the playback position;

[0084] Obtaining module: used to obtain the audio content in the target audio segment;

[0085] Recognition module: used to recognize audio content to obtain text information.

[0086] Wherein, the method for determining the location information of the target audio segment by the audio segment position determination module includes any one of the following:

[0087...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention provides a speech recognition-based audio summary text creation method and a creation device thereof, wherein the creation method includes the following steps: S1: when playing an audio file, receiving a command to create a summary; S2: determining the playback of the audio file when receiving the command to create a summary Position; S3: Determine the position information of the target audio segment according to the playback position; S4: Obtain the audio content in the target audio segment; S5: Identify the audio content to obtain text information. The invention can conveniently and quickly create audio summaries without using other editing software, and further convert the audio summaries into text summaries for storage. On the one hand, it is convenient for readers to quickly understand the outline of new books that have not been read, and on the other hand, it is also convenient for readers to save the audio clips they care about in real time, so that they can accurately recall the content of books they have listened to in the future, so as to facilitate targeted selection. or not.

Description

technical field [0001] The present invention relates to the technical field of voice information processing, in particular to a method and device capable of creating abstracts for real-time played voice files. Background technique [0002] With the rapid development of smart phones, people's reading methods have also undergone drastic changes. With the emergence of various "listening to books" software, more and more people have switched from traditional reading to listening to books. The benefits of listening to books are numerous, such as protecting eyesight, saving resources, being free from external influences anytime, anywhere, and so on. However, for current audiobook players, there are also some disadvantages. For example, when a user listens to a brand new book, if he wants to determine whether the audio book is suitable for him in a short period of time, he needs to browse the book quickly. And because the existing player can only realize the order complete playin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L15/22G10L15/26H04N21/854H04N21/439

CPCG10L15/22G10L15/26H04N21/439H04N21/854

Inventor王守峰李晓琳

OwnerHAIER YOUJIA INTELLIGENT TECH (BEIJING) CO LTD

A method and device for creating audio abstract text based on speech recognition

What is AI technical title? AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document. A speech recognition and audio technology, used in speech recognition, speech analysis, instruments, etc.

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

no. 1 example

no. 2 example

no. 3 example

PUM

Abstract

Description

Claims

Application Information

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech recognition and audio technology, used in speech recognition, speech analysis, instruments, etc.

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology