In a remote operation monitoring system and the like, it is a video processing apparatus capable of intuitively grasping an object operated by an operator and an operation result. The video processing apparatus includes a unit (310, 320, 2104, 2202) for storing information about at least one object displayed on a screen of a display unit; a unit (12, 2105) for designating information about the object; a unit (300, 2201) for searching the store unit based upon the designated information, and for obtaining information within the store unit corresponding to the designated information; and also a unit (20, 2103) for performing a process related to the object based on the obtained information. An operator can readily grasp an object to be operated and a result.