Audio spatial localization is accomplished by utilizing input parameters representing the physical and geometrical aspects of a sound source to modify a monophonic representation of the sound or voice and generate a stereo signal which simulates the acoustical effect of the localized sound. The input parameters include location and velocity, and may also include directivity, reverberation, and other aspects. The input parameters are used to generate control parameters which control voice processing. Thus, each voice is Doppler shifted, separated into left and right channels, equalized, and one channel is delayed, according to the control parameters. In addition, the left and right channels may be separated into front and back channels, which are separately processed to simulate front and back location and motion. The stereo signals may be fed into headphones, or may be fed into a crosstalk cancellation device for use with loudspeakers.