Automatic control of media presentation parameters is provided by using one or more of real-time audio playback measurement data from microphones and audience facial and body expression interpretation from video and infrared cameras, in conjunction with artificial intelligence for interpretation and evaluation of facial and body expression and predetermined perceptual audio models. Media presentation parameters can include, for example, speaker volume, audio equalization, feedback elimination, play/pause, and other audio content-related aspects of presentation. In some embodiments, additional environmental parameters can be modified to enhance audience experience, such as, for example, temperature, lighting, and the like, in response to audience facial and body expression.