An apparatus includes a monocular display with a wireless communications interface, a user input device, a transmitter, and a controller. The apparatus may provide a video link to a host device and to other devices, such as a cell phone, computer, laptop, or media player, and may control and manage those devices. The apparatus may receive speech and digitize it. It may then compare the digitized speech in a first language to a table of digitized speech in a second language to provide translation or, alternatively, compare the digitized speech to a table of control commands. The control commands allow the user to interact with the apparatus or with other remote devices both visually and audibly. The control signals control a “recognized persona,” or avatar, stored in a memory to provide simulated human attributes to the apparatus, a network, or a third-party communication device. The avatar may be changed or upgraded according to user choice.
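
The table comparison described above might be sketched as follows. This is only an illustration under stated assumptions: the digitized speech is stood in for by a text string, and the table entries, the command identifiers, and the function names (`normalize`, `translate`, `to_command`) are hypothetical and do not come from the source.

```python
from typing import Optional

# Hypothetical tables. The source does not specify their entries or the
# digitized-speech format, so plain strings stand in for digitized speech.
TRANSLATION_TABLE = {"hello": "hola", "thank you": "gracias"}
COMMAND_TABLE = {"answer call": "PHONE_ANSWER", "play music": "MEDIA_PLAY"}


def normalize(digitized: str) -> str:
    """Lowercase the input and collapse runs of whitespace before lookup."""
    return " ".join(digitized.lower().split())


def translate(digitized: str) -> Optional[str]:
    """Compare first-language speech to the second-language table."""
    return TRANSLATION_TABLE.get(normalize(digitized))


def to_command(digitized: str) -> Optional[str]:
    """Compare speech to the control-command table."""
    return COMMAND_TABLE.get(normalize(digitized))
```

Both lookups return `None` when no table entry matches, which leaves the caller free to ignore the utterance or prompt the user again.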