Methods, apparatus, and products are disclosed for providing expressive user interaction with a multimodal application, the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to a speech engine through a VoiceXML interpreter, including: receiving, by the multimodal browser, user input from a user through a particular mode of user interaction; determining, by the multimodal browser, user output for the user in dependence upon the user input; determining, by the multimodal browser, a style for the user output in dependence upon the user input, the style specifying expressive output characteristics for at least one other mode of user interaction; and rendering, by the multimodal browser, the user output in dependence upon the style.
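
The four recited steps (receive input, determine output, determine a style, render the output according to that style) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the disclosure: the `UserInput` and `Style` types, the `determine_output`, `determine_style`, `render`, and `handle` functions, the rule that input ending in "!" is treated as emphatic, and the choice of prosody-like markup for the voice mode and inline styling for the visual mode. An actual multimodal browser would route voice output to the speech engine through the VoiceXML interpreter rather than build strings.

```python
from dataclasses import dataclass, field

# Hypothetical mode names; the disclosure only requires a voice mode
# and one or more non-voice modes.
VOICE = "voice"
VISUAL = "visual"


@dataclass
class UserInput:
    mode: str  # mode of user interaction through which the input arrived
    text: str  # recognized speech or typed text


@dataclass
class Style:
    # Expressive output characteristics, keyed by output mode.
    characteristics: dict = field(default_factory=dict)


def determine_output(user_input: UserInput) -> str:
    """Determine user output in dependence upon the user input (toy rule)."""
    if "help" in user_input.text.lower():
        return "Here is some help."
    return f"You said: {user_input.text}"


def determine_style(user_input: UserInput) -> Style:
    """Determine a style for modes other than the one the input came through."""
    # Assumption: a trailing "!" marks emphatic input.
    emphatic = user_input.text.rstrip().endswith("!")
    presets = {
        VOICE: {"rate": "fast", "pitch": "high"} if emphatic
        else {"rate": "medium", "pitch": "medium"},
        VISUAL: {"color": "red", "font-weight": "bold"} if emphatic
        else {"color": "black", "font-weight": "normal"},
    }
    # Keep only the modes that differ from the input mode.
    return Style({m: p for m, p in presets.items() if m != user_input.mode})


def render(output: str, style: Style) -> dict:
    """Render the output per mode, applying that mode's characteristics."""
    rendered = {}
    for mode, traits in style.characteristics.items():
        if mode == VOICE:
            attrs = " ".join(f'{k}="{v}"' for k, v in traits.items())
            rendered[mode] = f"<prosody {attrs}>{output}</prosody>"
        else:
            css = "; ".join(f"{k}: {v}" for k, v in traits.items())
            rendered[mode] = f'<span style="{css}">{output}</span>'
    return rendered


def handle(user_input: UserInput) -> dict:
    """Run the steps in order: output, then style, then rendering."""
    output = determine_output(user_input)
    style = determine_style(user_input)
    return render(output, style)
```

In this sketch, emphatic input typed through the visual mode yields spoken output marked as fast and high-pitched, while plain voice input yields visually neutral text. The style is computed only for modes other than the input mode, mirroring the phrase "at least one other mode of user interaction".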