The invention discloses a method for handling audio stream playback in the jitter buffer of a network terminal, addressing pauses and stuttering of voice in network communication. During real-time audio (e.g. voice) communication over a packet-switched network (e.g. an IP network), a jitter buffer is arranged at the receiving end. After the receiving end receives an audio packet, it first decodes the packet in the normal sequence and then places the decoded audio data into the jitter buffer. When the jitter buffer is about to become full, the sampling rate of the audio data is lowered so that the audio stream plays back faster; when the jitter buffer is about to become empty, the sampling rate of the audio data is raised so that the audio stream plays back more slowly; when the amount of audio data in the jitter buffer is within the normal range, the audio stream is played back at its original sampling rate.
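
The buffer-driven rate selection described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: the watermark thresholds, the resampling ratios, and the names `choose_ratio` and `resample_linear` are all hypothetical, and linear interpolation stands in for whatever resampling method the invention actually uses. The sketch assumes the output device runs at a fixed rate, so producing fewer samples per frame shortens playback (fast) and producing more samples lengthens it (slow).

```python
from typing import List

# Hypothetical thresholds, as fractions of buffer capacity.
# The source does not specify numeric values.
HIGH_WATERMARK = 0.8  # buffer is "about to become full"
LOW_WATERMARK = 0.2   # buffer is "about to become empty"

# Hypothetical resampling ratios: output samples per input sample.
FAST_RATIO = 0.9  # fewer samples, so the frame plays back faster
SLOW_RATIO = 1.1  # more samples, so the frame plays back more slowly


def choose_ratio(fill_level: int, capacity: int) -> float:
    """Pick a resampling ratio from the current buffer occupancy."""
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    occupancy = fill_level / capacity
    if occupancy >= HIGH_WATERMARK:
        return FAST_RATIO
    if occupancy <= LOW_WATERMARK:
        return SLOW_RATIO
    return 1.0  # normal range: keep the original sampling rate


def resample_linear(samples: List[float], ratio: float) -> List[float]:
    """Resample a decoded frame by linear interpolation."""
    if not samples:
        return []
    if ratio == 1.0:
        return list(samples)
    out_len = max(1, round(len(samples) * ratio))
    if out_len == 1 or len(samples) == 1:
        return [samples[0]] * out_len
    # Map output index i onto a fractional position in the input frame.
    step = (len(samples) - 1) / (out_len - 1)
    out = []
    for i in range(out_len):
        pos = i * step
        left = int(pos)
        right = min(left + 1, len(samples) - 1)
        frac = pos - left
        out.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return out


if __name__ == "__main__":
    frame = [0.0] * 160  # e.g. one 20 ms frame at 8 kHz (illustrative)
    ratio = choose_ratio(fill_level=90, capacity=100)
    print(ratio, len(resample_linear(frame, ratio)))
```

Plain resampling of this kind also shifts the pitch of the audio, so in practice the ratios would likely be kept close to 1.0 to keep the change unobtrusive; the values above are chosen only to make the effect visible.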