Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Optimal human-machine conversations using emotion-enhanced natural speech using hierarchical neural networks and reinforcement learning

Inactive Publication Date: 2018-03-22
NEW VOICE MEDIA LIMITED
View PDF1 Cites 37 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

The patent describes a system and method for generating emotion-enhanced natural speech using hierarchical neural networks and an optimization component for choosing content and emotion level in conversations. The system includes an audio processing server that receives a raw audio waveform from a neural network and produces an emotion-enhanced audio waveform by associating text-based emotion content markers with portions of the audio waveform. The server then provides the emotion-enhanced audio waveform as an input data set to a dilated convolutional artificial neural network for use in natural speech audio generation. The technical effect of this invention is a more advanced and efficient way to generate emotion-enhanced natural speech audio.

Problems solved by technology

Recognizing human emotion in transcribed or recorded speech is a difficult task for computer programs, and producing convincing emotion in text-to-speech is often a labor-intensive process involving manual configuration and emotion-tagging.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Optimal human-machine conversations using emotion-enhanced natural speech using hierarchical neural networks and reinforcement learning
  • Optimal human-machine conversations using emotion-enhanced natural speech using hierarchical neural networks and reinforcement learning
  • Optimal human-machine conversations using emotion-enhanced natural speech using hierarchical neural networks and reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030]The inventor has conceived, and reduced to practice, a system and method for emotion-enhanced natural speech using dilated convolutional neural networks.

[0031]One or more different aspects may be described in the present application. Further, for one or more of the aspects described herein, numerous alternative arrangements may be described; it should be appreciated that these are presented for illustrative purposes only and are not limiting of the aspects contained herein or the claims presented herein in any way. One or more of the arrangements may be widely applicable to numerous aspects, as may be readily apparent from the disclosure. In general, arrangements are described in sufficient detail to enable those skilled in the art to practice one or more of the aspects, and it should be appreciated that other arrangements may be utilized and that structural, logical, software, electrical and other changes may be made without departing from the scope of the particular aspects....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A system and method for emotion-enhanced natural speech using dilated convolutional neural networks, wherein an audio processing server receives a raw audio waveform from a dilated convolutional artificial neural network, associates text-based emotion content markers with portions of the raw audio waveform to produce an emotion-enhanced audio waveform, and provides the emotion-enhanced audio waveform to the dilated convolutional artificial neural network for use as a new input data set.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS[0001]This application claims priority to U.S. provisional patent application Ser. No. 62 / 516,672, titled “OPTIMAL HUMAN-MACHINE CONVERSATIONS USING EMOTION-ENHANCED NATURAL SPEECH USING HIERARCHICAL NEURAL NETWORKS AND REINFORCEMENT LEARNING”, which was filed on Jun. 8, 2016, and is also a continuation-in-part of U.S. patent application Ser. No. 15 / 442,667 titled “SYSTEM AND METHOD FOR OPTIMIZING COMMUNICATION OPERATIONS USING REINFORCEMENT LEARNING”, filed on Feb. 25, 2017, which claims priority to U.S. provisional patent application 62 / 441,538, titled “SYSTEM AND METHOD FOR OPTIMIZING COMMUNICATION OPERATIONS USING REINFORCEMENT LEARNING”, filed on Jan. 2, 2017, and is also a continuation-in-part of U.S. patent application Ser. No. 15 / 268,611, titled “SYSTEM AND METHOD FOR OPTIMIZING COMMUNICATIONS USING REINFORCEMENT LEARNING”, filed on Sep. 18, 2016, the entire specifications of each of which are incorporated herein by reference.BACKGROUND...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/18G10L15/16G10L15/14G06N3/04G06N3/08G10L25/63G10L13/047
CPCG10L13/10G06N3/04G10L13/047G10L25/63G06N3/08G10L13/033G06N3/045
Inventor MCCORD, ALANUNITT, ASHLEYGALVIN, BRIAN
Owner NEW VOICE MEDIA LIMITED
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products