Speech processing method and related equipment thereof

A voice processing technology, applied in the field of data processing, that addresses problems such as user instructions not being fully heard, slow response to user instructions, and a poor human-computer interaction effect, and thereby improves human-computer interaction.

Pending Publication Date: 2022-07-05
UNIV OF SCI & TECH OF CHINA +1
0 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0004] However, due to defects in some human-computer interaction processes, the human-computer interaction effect is relatively poor (for example, the user's instruction is not fully heard, or the response to user instructions is relatively slow).


Examples

Embodiment 1

[0087] See Figure 4, which is a flowchart of a voice processing method provided by an embodiment of the present application.

[0088] The speech processing method provided by the embodiment of the present application, as applied to the speech processing module 203, includes S1-S3:

[0089] S1: After the speech processing module 203 obtains the current speech segment, the speech processing module 203 determines the user semantic integrity representation information according to the current speech segment.

[0090] The current speech segment refers to the audio segment that is sent by the VAD module 202 in real time and carries the user's voice information. For example, the current speech segment can be the audio clip shown in Figure 5 that carries the voice message "Ambient Light", or the "User Expression Fragment B" shown in Figure 6.

[0091] The above "user semantic integrity representation information" is used to indicate the possibility that the user has said that the con...

Embodiment approach

[0116] To further improve the effect and stability of the semantic integrity analysis, the embodiment of the present application also provides another possible implementation of S12, which may specifically include S121-S122:

[0117] S121: The speech processing module 203 performs semantic integrity recognition processing on the segment text to be processed according to a preset semantic integrity recognition rule, and obtains a semantic integrity recognition result.

[0118] The semantic integrity recognition rule refers to a preset rule for identifying the likelihood that a piece of text data has complete semantics. The embodiment of the present application does not limit the semantic integrity recognition rule; for example, it may include a full matching list, and at least one candidate grammar rule together with its corresponding semantic integrity characterization data. The full matching list is used to record a large number of candidate text segments and the semantic ...
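The rule structure described in [0118] can be illustrated with a minimal sketch. It assumes that each entry of the full matching list and each candidate grammar rule is paired with a completeness score between 0 and 1, that grammar rules are encoded as regular expressions, and that the full matching list is consulted first. The names `FULL_MATCH_LIST`, `GRAMMAR_RULES`, and `rule_based_integrity`, together with every phrase and score below, are hypothetical and are not taken from the application.

```python
import re
from typing import Optional

# Hypothetical full matching list: candidate text segment -> completeness score.
FULL_MATCH_LIST = {
    "turn on the ambient light": 1.0,
    "turn on the": 0.1,
}

# Hypothetical candidate grammar rules, each paired with a completeness score.
# The "dangling function word" rule comes first so that it takes priority.
GRAMMAR_RULES = [
    (re.compile(r".*\b(the|to|and|a)"), 0.1),                     # ends on a function word
    (re.compile(r"(turn|switch) (on|off) the \w+( \w+)*"), 0.9),  # complete control command
    (re.compile(r"navigate to \w+( \w+)*"), 0.9),                 # complete navigation command
]


def rule_based_integrity(text: str) -> Optional[float]:
    """Return a completeness score for the text, or None if no rule applies."""
    normalized = " ".join(text.lower().split())
    # 1) An exact hit in the full matching list wins.
    if normalized in FULL_MATCH_LIST:
        return FULL_MATCH_LIST[normalized]
    # 2) Otherwise, the first grammar rule that matches the whole text decides.
    for pattern, score in GRAMMAR_RULES:
        if pattern.fullmatch(normalized):
            return score
    # 3) No rule applies; a caller could fall back to another analysis here.
    return None
```

Returning `None` when nothing matches is a design choice of this sketch: it leaves room for a caller to combine the rule result with some other estimate, rather than forcing a score for text the rules do not cover.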

Embodiment 2

[0172] In fact, in some application fields (for example, human-computer interaction fields such as intelligent control and navigation), after the user voice text is acquired, corresponding feedback must also be provided to the user based on the user voice text (for example, turning on the ambient light).

[0173] Based on this, the embodiment of the present application also provides another possible implementation of the speech processing method. In this implementation, the speech processing method not only includes the above S1-S3, but may also include S4:

[0174] S4: The voice processing module 203 responds to the human-computer interaction request carried by the user voice text.

[0175] In this embodiment of the present application, after the voice processing module 203 obtains the user voice text, the voice processing module 203 may execute the human-computer interaction request carried by the user voice text (for example...
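As a rough illustration of S4, the sketch below maps a user voice text to a response action. The application does not say how the request is extracted from the text, so the phrase-lookup approach, the `HANDLERS` table, and the function `respond_to_request` are all assumptions made purely for illustration.

```python
from typing import Callable, Dict, Optional

# Hypothetical table: trigger phrase -> action that returns a status string.
HANDLERS: Dict[str, Callable[[], str]] = {
    "turn on the ambient light": lambda: "ambient light: on",
    "turn off the ambient light": lambda: "ambient light: off",
}


def respond_to_request(user_voice_text: str,
                       handlers: Dict[str, Callable[[], str]]) -> Optional[str]:
    """Run the first action whose trigger phrase occurs in the user voice text."""
    normalized = " ".join(user_voice_text.lower().split())
    for phrase, action in handlers.items():
        if phrase in normalized:
            return action()
    return None  # no known request was found in the text
```

A real system would more likely use an intent recognizer than substring matching; the table only stands in for whatever component interprets the request.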


Abstract

The invention discloses a voice processing method and related equipment. After a current voice segment is obtained, user semantic integrity representation information is determined according to the current voice segment, and the voice waiting duration to be used is then determined according to that information. When no next voice segment is obtained within this waiting duration after the voice ending moment of the current voice segment, it is determined that the user has finished speaking, and a user voice text representing the content spoken by the user is obtained, so that a corresponding response operation can subsequently be performed for the user based on the user voice text. In this way, the reception waiting duration is dynamically adjusted according to the semantic completeness of the content spoken by the user, which overcomes the defects of reception control based on a fixed reception waiting duration, improves the user experience, and effectively improves the human-computer interaction effect.
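The dynamic waiting behaviour summarized in the abstract can be sketched as follows. The abstract states only that the waiting duration depends on the semantic integrity information, so this sketch makes several assumptions: the linear mapping from a completeness score to a waiting duration, the 0.3 s and 1.5 s bounds, the `Segment` structure, and the function names are all hypothetical. The sketch also replays already timestamped segments offline rather than running in real time.

```python
from dataclasses import dataclass
from typing import List, Tuple


def waiting_duration(integrity: float,
                     min_wait: float = 0.3,
                     max_wait: float = 1.5) -> float:
    """Map a completeness score in [0, 1] to a wait in seconds.

    Assumed mapping: the more complete the utterance, the shorter the wait.
    """
    integrity = min(1.0, max(0.0, integrity))
    return max_wait - integrity * (max_wait - min_wait)


@dataclass
class Segment:
    text: str         # recognized text of this voice segment
    start: float      # segment start time, in seconds
    end: float        # voice ending moment of the segment, in seconds
    integrity: float  # completeness score of everything said so far


def collect_utterance(segments: List[Segment]) -> Tuple[str, float]:
    """Return the user voice text and the time at which speech is deemed ended."""
    texts: List[str] = []
    for i, seg in enumerate(segments):
        texts.append(seg.text)
        deadline = seg.end + waiting_duration(seg.integrity)
        nxt = segments[i + 1] if i + 1 < len(segments) else None
        # No next segment arrives before the deadline: the user has finished.
        if nxt is None or nxt.start > deadline:
            return " ".join(texts), deadline
    return "", 0.0  # no segments were supplied


demo = [
    Segment("turn on the", 0.0, 0.8, 0.1),       # incomplete, so wait longer
    Segment("ambient light", 1.9, 2.6, 1.0),     # complete, so wait briefly
    Segment("and play music", 3.2, 4.0, 1.0),    # arrives after the deadline
]
text, decided_at = collect_utterance(demo)
```

In this replay, the 1.1 s pause after "turn on the" is tolerated because the incomplete text earns a long wait, whereas a fixed wait of, say, 0.5 s would have cut the instruction off at that point. Once the text is complete, the short wait lets the system respond quickly, and the third segment is treated as arriving after the user has already finished.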

Description

Technical field

[0001] The present application relates to the technical field of data processing, and in particular, to a voice processing method and related equipment.

Background technique

[0002] With the development of human-computer interaction technology, it is being applied in more and more fields. For example, human-computer interaction technology can be applied to smart home, navigation, and other fields.

[0003] In fact, during a round of human-computer interaction, the human-computer interaction device usually first collects user voice data (for example, a voice command); then performs voice recognition processing on the voice data; and finally, with reference to the voice recognition result (for example, the text "Help me turn on the ambient light"), performs a corresponding response operation for the user (for example, turning on the ambient light), so as to realize the human-computer interaction process between ...


Application Information

IPC(8): G10L15/04, G10L15/18, G10L15/22, G10L15/26
CPC: G10L15/04, G10L15/1815, G10L15/22, G10L15/26
Inventor: 缪磊, 李亚, 刘权
Owner UNIV OF SCI & TECH OF CHINA