A voice processing system comprises a processing device that receives and processes a stream of voice input as a user is speaking. A software program executes program steps for determining a predetermined pattern of speech and silence during processing of the stream of voice input so as to play or present a predetermined backchannel response to the user. A method provides an audible
backchannel response between the voice processing system and the user while the user is speaking, in particular while recording a message. The method includes monitoring the message to determine a predetermined pattern of speech and silence based on the timing between the speech and silence periods. The method then produces the audible backchannel response based on the predetermined pattern. An audible
user interface includes a speech processor that processes or classifies an audio message in a telecommunication device as speech and silence frames while a calling party is speaking, in particular while recording the audio message to a called party. Control circuitry cooperates with the speech processor and responds to a predetermined pattern of the speech and silence segments so as to play a preset backchannel response in audible form to the calling party.
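
The frame classification and pattern detection described above can be illustrated with a minimal sketch. It is not the claimed implementation: the energy-based classifier, the names `is_speech`, `BackchannelDetector`, and `backchannel_points`, and every threshold (20 ms frames, 1000 ms of speech, a 400 ms pause) are assumptions chosen for illustration. The pattern assumed here is a sufficiently long run of speech followed by a sufficiently long pause.

```python
from dataclasses import dataclass, field
from typing import Iterable, List, Sequence


def is_speech(frame: Sequence[float], energy_threshold: float = 0.01) -> bool:
    """Classify one frame as speech when its mean squared amplitude
    exceeds a threshold (hypothetical, simplistic classifier)."""
    if not frame:
        return False
    energy = sum(s * s for s in frame) / len(frame)
    return energy > energy_threshold


@dataclass
class BackchannelDetector:
    """Tracks speech/silence timing and signals when to respond.

    All durations are assumed values, not taken from the source.
    """
    frame_ms: int = 20          # duration of one frame
    min_speech_ms: int = 1000   # speech needed before a response is allowed
    min_pause_ms: int = 400     # pause length that triggers the response
    _speech_ms: int = field(default=0, init=False)
    _silence_ms: int = field(default=0, init=False)

    def push(self, speech: bool) -> bool:
        """Feed one frame label; return True when a backchannel
        response should be played."""
        if speech:
            self._speech_ms += self.frame_ms
            self._silence_ms = 0
            return False
        self._silence_ms += self.frame_ms
        if self._silence_ms < self.min_pause_ms:
            return False  # pause still too short to count
        # The pause is long enough: respond only if enough speech preceded it.
        fire = self._speech_ms > 0 and self._speech_ms >= self.min_speech_ms
        self._speech_ms = 0  # the next response needs new speech
        return fire


def backchannel_points(labels: Iterable[bool],
                       detector: BackchannelDetector) -> List[int]:
    """Return the frame indices at which a response would be played."""
    return [i for i, label in enumerate(labels) if detector.push(label)]
```

In a live system the labels would come from calling `is_speech` on each incoming frame, and a `True` result from `push` would trigger playback of the preset response (for example a short "uh-huh") to the speaker.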