Approaches provide for using a voice communications device to control, refine, or otherwise manage the playback of media content in response to a spoken instruction. For example, the voice communications device can receive a request to refine and/or initiate the playback of
media content, such as music, news, audio books, audio broadcasts, and other such content. Audio input data that includes the request can be received by the voice communications device, and an application executing on the voice communications device, or otherwise in communication with it, can analyze the audio input data to determine how to carry out the request. The application can determine whether there is an active play queue of media content configured to play using the voice communications device. In the situation where no media content is being played using the voice communications device, the application can determine media content using information in the request. In the situation where there is an active play queue of media content, the information can be used to refine the play queue. Thereafter, the application can cause the media content associated with the active play queue to play using the voice communications device.
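
The branching described above can be illustrated with a minimal sketch. It assumes the spoken request has already been analyzed into a set of attribute criteria, and that refining a play queue means keeping only the items that match those criteria. The names `MediaItem`, `matches`, and `handle_request` are hypothetical and do not come from the description, and an empty queue is treated here as no active play queue.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class MediaItem:
    """Hypothetical media record; the fields are illustrative only."""
    title: str
    genre: str
    artist: str


def matches(item: MediaItem, criteria: Dict[str, str]) -> bool:
    """Return True when the item satisfies every criterion from the request."""
    return all(getattr(item, key, None) == value for key, value in criteria.items())


def handle_request(
    criteria: Dict[str, str],
    catalog: List[MediaItem],
    active_queue: Optional[List[MediaItem]] = None,
) -> List[MediaItem]:
    """Build a play queue when none is active, otherwise refine the active one."""
    if not active_queue:
        # No active play queue: determine media content from the request.
        return [item for item in catalog if matches(item, criteria)]
    # Active play queue: use the request information to refine it.
    return [item for item in active_queue if matches(item, criteria)]


catalog = [
    MediaItem("Track One", "jazz", "Artist A"),
    MediaItem("Track Two", "jazz", "Artist B"),
    MediaItem("Track Three", "news", "Station C"),
]

# First request, no active queue: "play jazz".
queue = handle_request({"genre": "jazz"}, catalog)

# Second request, queue is active: "only Artist A".
refined = handle_request({"artist": "Artist A"}, catalog, active_queue=queue)
```

In this sketch the same request information is interpreted differently depending on whether a play queue is active, which is the distinction the description draws between initiating and refining playback.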