The disclosure describes an overall
system / method for text-input using a multimodal interface with
speech recognition. Specifically, pluralities of
modes interact with the main speech mode to provide the speech-
recognition system with partial knowledge of the text corresponding to the spoken
utterance forming the input to the
speech recognition system. The knowledge from other
modes is used to dynamically change the ASR
system's active vocabulary thereby significantly increasing recognition accuracy and significantly reducing
processing requirements. Additionally, the
speech recognition system is configured using three different system configurations (always listening, partially listening, and push-to-speak) and for each one of those three different user-interfaces are proposed (speak-and-type, type-and-speak, and speak-while-
typing). Finally, the overall user-interface of the proposed system is designed such that it enhances existing standard text-input methods; thereby minimizing the behavior change for mobile users.