Method of and system for communication across virtual media

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
The method and system address the lack of natural eye contact in virtual communication by aligning gaze through dominant eye identification and manipulation of visual data, enhancing engagement and reducing misinterpretation.

WO2026139522A1PCT designated stage Publication Date: 2026-07-02IRIS LABS AS

View PDF 0 Cites 0 Cited by

Patent Information

Authority / Receiving Office: WO · WO
Patent Type: Applications
Current Assignee / Owner: IRIS LABS AS
Filing Date: 2025-12-22
Publication Date: 2026-07-02

Application Information

Patent Timeline

22 Dec 2025

Application

02 Jul 2026

Publication

WO2026139522A1

IPC: H04N23/90; H04N7/18

AI Tagging

Technology Topics

Medicine Human–computer interaction

Explore More Agents

Novelty Search
Search existing technologies and assess novelty
↗
FTO
Analyze whether a product may infringe others' patents
↗
Design FTO
Check prior-design risk for exterior design
↗
Drafting
Draft patent application text based on a technical solution
↗
Find Solutions with TRIZ
Generate feasible solution to solve your technical challenge
↗

Similar Technology Patents

Get free access to AI patent search and analysis

Check patentability, review prior art and ask IP Agent with full patent context.

AI Technical Summary

Technical Problem

Existing virtual communication technologies fail to provide natural eye contact between participants, leading to reduced engagement and misinterpretation of non-verbal cues due to misalignment of gaze, which is not addressed by current gaze alignment systems.

Method used

A method and system that captures visual data using multiple cameras to identify a participant's dominant eye, manipulates the data to align with the dominant eye's perspective, and transmits it to other participants, ensuring direct eye contact by positioning the camera at the dominant eye's location.

Benefits of technology

Facilitates natural gaze alignment, enhancing user engagement and reducing miscommunication by allowing participants to appear as if they are looking directly at each other, thus improving the realism and effectiveness of virtual interactions.

✦ Generated by Eureka AI based on patent content.

Smart Images

Figure EP2025088818_02072026_PF_FP_ABST

Patent Text Reader

Abstract

A method of facilitating communication between multiple participants across a virtual medium. The method includes, for each of the multiple participants, capturing visual data of the participant using two or more cameras and obtaining an identification of a dominant eye of the participant using at least one of the captured visual data and an input received from the participant. The method includes, for a first participant, displaying, on a display medium, a representation of a second participant based on the captured visual data of the second participant; determining a display position of the identified dominant eye of the second participant; manipulating the captured visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant; and transmitting the manipulated visual data of the first participant to the second participant.

Need to check novelty before this filing date? Find Prior Art

Description

[0001] 99.222.174254 / 01

[0002] Method of and System for Communication Across Virtual Media

[0003] The present invention relates to a method of and system for facilitating communication between two or more participants across a virtual medium, for example during video conferencing calls.

[0004] Virtual communication technologies, such as video conferencing systems, facilitate live digital meetings between multiple participants in different physical locations. Such technologies involve the exchange of video and audio data across a digital medium, and enable remote real-time interactions.

[0005] Typically, virtual communication technologies do not provide eye contact between participants in a digital meeting. Owing to each participant’s camera being in a different physical location to their display (where the other participants are shown), each participant’s gaze is usually not directed towards the camera. This can lead to reduced engagement and human connection between meeting participants. The absence of eye contact may also lead to misinterpretation of non-verbal cues, which are more easily interpreted in direct face-to-face interactions.

[0006] Recent efforts to develop systems and methods that facilitate eye contact in digital meetings have been limited in providing gaze alignment that appears natural to participants. This perceived artificial eye contact can create an increased sense of disconnection and discomfort during virtual interactions.

[0007] It is an aim of the present invention to provide an improved system and method of facilitating eye contact in digital meetings which overcomes at least the above limitations.

[0008] When viewed from a first aspect, the invention provides a method of facilitating communication between two or more participants across a virtual medium, wherein the method comprises, for each of the two or more participants:

[0009] capturing visual data of the participant using two or more cameras; and obtaining an identification of a dominant eye of the participant;wherein the identification is obtained using at least one of:

[0010] the captured visual data; and

[0011] an input received from the participant;

[0012] wherein the method comprises, for at least a first participant of the two or more participants:

[0013] displaying, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data of the second participant;

[0014] determining a display position of the identified dominant eye of the second participant;

[0015] manipulating the captured visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant; and

[0016] transmitting the manipulated visual data of the first participant to the second participant.

[0017] When viewed from a second aspect, the invention provides a system for facilitating communication between two or more participants across a virtual medium, the system comprising:

[0018] two or more cameras configured to capture visual data of each of the two or more participants; and

[0019] processing circuitry configured to:

[0020] obtain an identification of a dominant eye of each of the two or more participants;

[0021] wherein the processing circuitry is configured to obtain the identification using at least one of:

[0022] the captured visual data; and

[0023] an input received from the participant;

[0024] wherein the processing circuitry is further configured to, for at least a first participant of the two or more participants:

[0025] display, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data of the second participant;

[0026] determine a display position of the identified dominant eye of the second participant;manipulate the captured visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant; and

[0027] transmit the manipulated visual data of the first participant to the second participant.

[0028] Thus it will be seen that, in accordance with embodiments of the invention, the claimed method and system allow for direct eye contact to be provided, at a given time, between any two participants communicating across a virtual medium. In particular, the present invention uses dominant eye information to allow each participant to view another participant whilst simultaneously (e.g. appearing to be) looking directly at a camera. By placing a camera at the position of the dominant eye of the participant being displayed, the participants may view each other with no misalignment between the participant’s gaze and the visual data that is captured and transmitted to the other participant.

[0029] Multiple participants may communicate across a virtual medium, which may be a non-physical platform or environment that facilitates the transmission of information through digital means. The use of a virtual medium enables real-time interaction and communication between participants who may be in different locations. Virtual media may include any of video conferencing platforms, holographic communication platforms, virtual avatar platforms, virtual reality platforms and augmented reality platforms. The virtual medium may be implemented using appropriate hardware and software, e.g. a video conferencing system.

[0030] Visual data may be exchanged between participants across the virtual medium. The visual data may comprise a video feed or sequence of images. For example, a live video stream (e.g. together with live audio) may be transmitted, e.g. with minimal latency, such that real-time interactions can occur across the virtual medium.

[0031] It will be appreciated that while the aspects of the invention have been outlined with respect to first and second participants, any suitable and desired number of participants may be part of the communication (e.g. the video call). Furthermore, preferably the steps outlined for at least a first participant are carried out for each of the participants.Visual data of each participant may be captured using two or more physical cameras. Thus the system may comprise two or more physical cameras for (e.g. at the location of) each of the two or more participants. The two or more cameras may comprise any suitable and desired type of video cameras configured to capture visual data of a participant. It will be appreciated that a greater number of video cameras may help to provide a more accurate representation of the participant for whom the visual data is being captured and / or to determine the dominant eye of a participant more accurately.

[0032] The two or more physical cameras may each have a field of view defined by a spatial extent of the respective camera’s observable scene. The field of view may be based on the (e.g. approximate) distance between a participant and their respective display medium.

[0033] A dominant eye of a participant may be the single eye that is primarily responsible for providing a visual processing input to the brain. The dominant eye may send more accurate positional information to the brain. The dominant eye may be preferred by the participant in visual precision tasks, e.g. using a microscope or a camera viewfinder. Some participants may have a less pronounced (i.e. weaker) dominant eye or an equal dominance between their eyes.

[0034] An identification of a dominant eye relating to a participant may comprise an indication to signify either a left or right eye dominance, such as a binary input.

[0035] The input received from the participant may be a (e.g. manual) input to the system indicating the participant’s dominant eye. For example, a participant may determine their dominant eye using a manual test (e.g. using a known test such as a Miles test or a Porta test), or the participant may already be aware of their dominant eye preference.

[0036] The input received from the participant may be an instruction (e.g. an electrical signal generated by a user interface or button) to access a stored identification corresponding to the participant from (e.g. a database in) a storage medium (e.g. memory) of the system. The identification of the dominant eye may therefore byobtained by obtaining the stored identification. This may reduce redundancies and system set-up time, enabling the system to be configured more efficiently.

[0037] In some embodiments, the participant’s dominant eye is determined, (e.g. automatically) using the visual data captured by the two or more cameras.

[0038] In a preferred embodiment, the visual data may comprise a video or sequence of images of the participant that capture the participant’s eye movements and gaze alignment shifts. This visual data may be processed and / or analysed to track the movement of the participant’s eyes.

[0039] By obtaining an identification of a dominant eye of the participant using either a manual, stored or (e.g. automatic) determination method, the system’s configuration time may be reduced. In one example, a participant who is already aware of their dominant eye preference may provide a manual input without the need for an (e.g. automatic) determination. In another example, the processing circuitry may be able to (e.g. automatically) determine the preferred dominant eye for a participant who is otherwise unable or unsure how to determine their dominant eye preference using other (e.g. manual) methods.

[0040] By obtaining an identification of a dominant eye of the participant using manual and automatic methods, the system may be customised to user preferences. For example, the participant may disagree with the automatic determination carried out by the processing circuitry and override the automatically determined identification by inputting a manual input. For example, the participant may be presented with a plurality of images and input a manual input to select the one they prefer (e.g. that provides the participant with the best feeling of eye contact).

[0041] For participants with a weaker dominant eye or an equal dominance between their eyes, the processing circuitry may be configured to allow the participant to experience virtual communication with another participant using either left-eye dominant or right-eye dominant configurations in order to determine a preferred eye. The dominant eye identification may be changed based on which configuration the participant subjectively evaluates as feeling more natural, e.g. the participant’s preferred eye. This may allow the participant to adapt their dominant eyepreference should it change over time, e.g. between subsequent uses of the system.

[0042] For participants with a weaker dominant eye or an equal dominance between their eyes, the processing circuitry may be configured to provide a balanced configuration by combining the left-eye dominant and right-eye dominant configurations. The processing circuitry may be configured to provide a guided assessment (e.g. when coupled to a user interface) to assist a participant with identifying their type and / or degree of eye dominance.

[0043] The processing circuitry may be configured to obtain (e.g. retrieve) the dominant eye information from a storage medium. Thus the system may comprise a storage medium (e.g. memory). In one example, the dominant eye information may be manually input by the participant at a user interface, which may be connected to or comprise a storage medium. In one example, the dominant eye information may be determined by a processor (which may be comprised in the processing circuitry) and stored in a storage medium. The processing circuitry may comprise the storage medium, or the storage medium may be provided separately in the system from the processing circuitry and accessed by the processing circuitry, e.g., via wired or wireless means, to retrieve the dominant eye information.

[0044] Providing a storage medium to store dominant eye information may improve user personalisation and provide a more seamless user experience during subsequent uses of the system (e.g. subsequent video conferencing calls) by reducing the system configuration time.

[0045] The first participant and the second participant may be any combination of two participants of the two or more participants. The first participant may be at different location to the second participant. The first participant and the second participant may have the same or different identified dominant eyes.

[0046] The display medium may be a physical medium such as a screen, flat panel display, wall, hologram or projection screen.The representation of the second participant preferably includes a representation of at least the second participant’s head.

[0047] The display position of the identified dominant eye is determined with respect to the (e.g. plane of, e.g. centre of, the) display medium. The display position, e.g. at which a virtual camera is placed, may be (e.g. determined and) located in two-dimensional image coordinates (e.g. of the display medium) or in three-dimensional coordinates.

[0048] Although the dominant eye may have been identified, this may simply be a manual indication of whether it is the left eye or the right eye, as discussed herein. In addition, it may be necessary to locate the identified dominant eye in the image displayed on the display medium, in order to determine the display position of the identified dominant eye (of the second participant).

[0049] In some embodiments, the representation of the second participant (e.g. the visual data as displayed on the display medium to the first participant) may be used to determine the display position of the identified dominant eye (of the second participant). This may be done using image recognition techniques, e.g. to detect the eyes of the second participant, which may then be combined with the identification of the dominant eye to determine the display position of the identified dominant eye.

[0050] Thus the visual (image) data of the second participant may be used to determine a two-dimensional position (in the visual data, e.g. in the frame of image data), which may then be used to determine a two-dimensional position in the (coordinates of the) display medium.

[0051] Thus, in some embodiments the method comprises (and the processing circuitry is configured to) detecting the eyes of the second participant in the captured visual data of the second participant, and determining the display position of the identified dominant eye of the second participant using the detection of the eyes of the second participant.The manipulated visual data may be transmitted between the first participant and the second participant (who may be situated in different locations) using an internet and / or local area network (LAN) connection. The connection may be wired or wireless.

[0052] The processing circuitry may be local to each participant, e.g. using each participant’s local computer. The processing circuitry may be partially or fully remote from at least one participant, e.g. as part of a cloud-based processor. This may reduce local resource contention and ensure smoother operation of local applications.

[0053] It will be appreciated that by aligning the camera and the position of the dominant eye of the displayed participant, direct eye contact may be achieved. The direct eye contact may provide a natural gaze alignment between participants akin to a face-to-face meeting. Thus, the participants may experience more realistic virtual interactions compared to conventional virtual communication technologies. This may improve user experience and reduce the risk of misinterpretation and miscommunication between participants.

[0054] The two or more cameras may be located outside an area of the display medium.

[0055] In some embodiments, e.g. when the two or more cameras are located outside of an area of the display medium, the manipulating the captured visual data of the first participant comprises combining the visual data captured using each of the two or more cameras to form a single view of the first participant.

[0056] Manipulating the captured visual data may comprise combining visual data captured using each of the two or more cameras (to form a single view of the first participant).

[0057] The two or more cameras (e.g. located outside of an area of the display medium) may comprise physical cameras. The two or more cameras may be arranged in any suitable and desired configuration. In a preferred embodiment, the two or more cameras are located in different positions around (e.g. at least part of) a perimeter of the display medium.The single view of the first participant may correspond to the perspective originating from the display position of the identified dominant eye of the second participant. The single view of the first participant may be a view from a virtual camera that is constructed by combining the visual data captured by (e.g. each of) the two or more cameras. In one embodiment, the visual data from each of the two or more cameras may be combined and / or merged by creating a three-dimensional model of the visual data from the two or more cameras (e.g. using triangulation based on the displacement between the two or more cameras) and then generating (e.g. rendering) the visual data from the perspective of the virtual camera (at the display position of the identified dominant eye of the second participant).

[0058] In some embodiments, a neural network (e.g. implemented on the processing circuitry) may be used to process the visual data captured by (e.g. each of) the two or more cameras to generate and / or refine the three-dimensional model of the visual data from the two or more cameras. In some embodiments, a neural network (e.g. implemented on the processing circuitry) may be used to process the visual data captured by (e.g. each of) the two or more cameras to generate and / or refine the single view of the first participant from the perspective originating from the display position of the identified dominant eye of the second participant (e.g. at the location of the virtual camera), e.g. using the three-dimensional model (which may or may not have been generated and / or refined by a neural network).

[0059] Thus, the neural network may be used to generate and / or refine the three-dimensional model, to generate and / or refine the single view of the first participant from the perspective originating from the display position of the identified dominant eye of the second participant (with or without using the three-dimensional model). Thus, the neural network may perform a second pass of refining the single view of the first participant from the perspective originating from the display position of the identified dominant eye of the second participant. The neural network(s) may be trained using a diffusion-based (or similar) model.

[0060] The processing circuitry is preferably arranged to combine the visual data captured by (e.g. each of) the two or more cameras to construct the view of the firstparticipant from the perspective of the virtual camera, e.g. in one of the ways outlined herein.

[0061] The single view may be located at a position where it may not be otherwise possible to place a physical camera, e.g. inside the area of the display medium, e.g. from the perspective of a participant displayed on the display medium. By combining the visual data from multiple (e.g. physical) cameras, a more accurate representation of the first participant from a perspective that is not available to any one of the (e.g. physical) cameras may be transmitted to the second participant.

[0062] It will be appreciated that a greater number of video cameras may help to create more accurate video image data from the perspective of the virtual camera, e.g. improving the resolution and / or image quality of the manipulated visual data transmitted to the other participants involved in the video conference call. A greater number of video cameras may also enable greater flexibility for the range of positions that may be available for positioning the virtual camera.

[0063] The two or more cameras may be located inside an area of the display medium.

[0064] In some embodiments, the two or more cameras are located inside an area of the display medium. In some embodiments, e.g. when the two or more cameras are located inside an area of the display medium, the two or more cameras comprise a plurality of stationary physical cameras and / or a plurality of movable physical cameras.

[0065] Manipulating the captured visual data may comprise selecting the visual data captured by a selected camera. A selected camera may be a camera of the plurality of stationary physical cameras and / or the plurality of moveable physical cameras positioned nearest to the display position of the identified dominant eye of the second participant.

[0066] In some embodiments, when the two or more cameras are located inside an area of the display medium, and when the two or more cameras comprise a plurality of stationary physical cameras and / or a plurality of movable physical cameras, the manipulating the visual data of the first participant comprises selecting the visual data captured by a selected camera, wherein the selected camera is the camera ofthe plurality of stationary physical cameras and / or the plurality of movable physical cameras (that is) positioned nearest to the display position of the identified dominant eye of the second participant.

[0067] The two or more cameras located inside an area of the display medium may comprise under-display and / or embedded (e.g. in-display) physical cameras. An under-display camera may be a physical camera located behind a display medium that is arranged to receive light through the display medium. An embedded camera may be a physical camera that comprises camera pixels that are interlaced in the same plane as the display medium (e.g. interlaced with the display pixels of a screen).

[0068] The plurality of stationary physical cameras and / or a plurality of movable physical cameras may comprise any suitable and desired type and number of video cameras configured to capture visual data of a participant. It will be appreciated that a greater number of video cameras may help to determine the dominant eye of a participant more accurately. A greater number of video cameras may also enable greater flexibility for the range of positions that may be chosen for the selected camera.

[0069] By selecting a single camera of the plurality of stationary physical cameras and / or the plurality of movable physical cameras, the manipulation of the visual data may be simplified by, at any given time, selecting the visual data captured by the selected camera in order to be transmitted to the second participant, and ignoring (e.g. discarding) the visual data captured by the other cameras in the plurality of stationary physical cameras and / or the plurality of movable physical cameras.

[0070] In some embodiments, when the two or more cameras comprises a plurality of stationary physical cameras, the method comprises adjusting the display position (e.g. of the identified dominant eye) of the second participant such that the identified dominant eye of the second participant coincides with the position of the selected camera.

[0071] The plurality of stationary physical cameras may comprise two or more cameras each with a fixed position relative to the display medium.In some embodiments, when the two or more cameras comprises a plurality of movable physical cameras, the method comprises adjusting the display position (e.g. of the identified dominant eye) of the second participant and / or the position of the selected camera such that the identified dominant eye of the second participant and the position of the selected camera coincide.

[0072] The plurality of movable physical cameras may comprise two or more cameras each with an adjustable position relative to the display medium.

[0073] When the two or more cameras comprises movable physical cameras or stationary physical cameras, the selected camera capturing the first participant may correspond to the perspective originating from the display position of the identified dominant eye of the second participant.

[0074] By aligning the position of the selected camera with the position of the identified dominant eye of the second participant, direct eye contact may be achieved with minimal processing of the captured visual data since only a small positional adjustment may be required.

[0075] In some embodiments, the method comprises reassigning the selected camera to another of the plurality of stationary physical cameras and / or of the plurality of movable physical cameras based on a dynamic display position of the identified dominant eye of the second participant.

[0076] The display position of the identified dominant eye of the second participant may change as a function of time, e.g. as the second participant moves relative to the two or more cameras. Thus, it will be appreciated that by (e.g. dynamically) reassigning the selected camera to another (i.e. a different one) of the plurality of stationary physical cameras and / or of the plurality of movable physical cameras, the selected camera may continuously be positioned nearest to the display position of the identified dominant eye of the second participant. Continuous eye contact may then be sustained even as the second participant moves relative to the display medium.Reassigning the selected camera may also reduce the computational costs required to adjust the display position of the second participant and / or the position of the selected camera such that the identified dominant eye of the second participant and the position of the selected camera coincide.

[0077] In some embodiments, the system may comprise a camera (e.g. physical or virtual) that is not in (i.e. displaced from) the (e.g. plane of the) display medium and from whose perspective the first participant appears, when displayed to the second participant. For example, the camera may be behind or in front of the display medium. Thus the method may comprise (and the processing circuitry may be configured to) manipulate the captured visual data of the first participant to correspond to a perspective passing through (or at least close to) the display position of the identified dominant eye of the second participant.

[0078] When the camera is displaced from the display medium, the location of the camera (e.g. physical or virtual) from whose perspective the first participant appears, the display position of the identified dominant eye of the second participant and the (e.g. dominant eye of the) first participant, are three points in the space of the first participant. Preferably at least two (e.g. all three) of these points lie on a straight line. Thus, the line between the camera (e.g. physical or virtual) from whose perspective the first participant appears and the (e.g. dominant eye of the) first participant preferably passes (or extends) through the display position of the identified dominant eye of the second participant.

[0079] In some embodiments, the method comprises, for at least the first participant of the two or more participants, determining a three-dimensional position of the identified dominant eye of the first participant, and orienting the field of view of the perspective originating from the display position of the identified dominant eye of the second participant such that a central axis of the perspective is directed towards the three-dimensional position of the identified dominant eye of the first participant.

[0080] The three-dimensional position of first participant’s identified dominant eye may be determined by the processing circuitry, e.g. using stereo vision techniques such as stereoscopy. Stereo vision techniques may be used to combine the visual data captured by each of the two or more cameras of the first participant, which arelocated in different positions relative to the first participant, to determine the three-dimensional position (e.g. including depth information).

[0081] The field of view of the perspective may be defined by (e.g. a solid angle of) a spatial extent of the perspective’s observable scene. The first participant may be captured within the field of view of the perspective. The central axis of the perspective may extend through the (e.g. three-dimensional) geometric centre of the field of view of the perspective. The central axis of the perspective may define the primary direction in which the perspective is oriented, and thus may correspond to the primary direction in which the perspective captures visual data.

[0082] Orienting the field of view of the perspective may comprise tilting the virtual camera or the selected camera to adjust the camera’s field of view.

[0083] The perspective may be defined by the field of view.

[0084] A field of view may at least be partly determined by using the distance between the participant and their respective display medium.

[0085] The field of view of the perspective may be defined by one or more (e.g. all) of: an origin, corresponding to a position of the identified eye of the second participant; a central axis about which the field of view may be centred; an orientation, corresponding to the central axis of the perspective; and a solid angle (e.g. centred on the central axis).

[0086] The origin of the field of view of the perspective may be defined by the position of the identified eye of the second participant displayed on the display medium.

[0087] The orientation of the central axis may be defined by the position of the first participant relative to the plane of the display medium.

[0088] The orientation of the field of view of the perspective may be defined orthogonal to the plane of the display medium. An orientation of the field of view of the perspective may be partly determined using the position of the identified eye of the first participant.The perspective may be determined, using one or more (e.g. all) of: a three-dimensional position of the identified eye of the first participant, an orientation of the field of view of the perspective, and a central axis directed towards the three-dimensional position of the identified dominant eye of the first participant.

[0089] The perspective may be any suitable and desired perspective, for example, any perspective with an orientation outward from the display medium, e.g. orthogonally, such that the first participant may be captured within the field of view of the perspective.

[0090] In some embodiments, the method comprises orientating a field of view of the perspective originating from the display position of the identified dominant eye of the second participant, such that a central axis of the perspective is directed orthogonally from the plane of the display medium.

[0091] The method may comprise determining the three-dimensional position of the identified dominant eye of the first participant.

[0092] The method may comprise adjusting the display position of the second participant (e.g. of the identified dominant eye) and, preferably, adjusting the position of the virtual camera to correspond to the adjusted display position of the identified dominant eye of the second participant.

[0093] In some embodiments, the method comprises adjusting the position of the second participant on the display medium (and preferably adjusting the position of the virtual camera to correspond to the adjusted display position of the identified dominant eye of the second participant), such that a direction between the identified dominant eye of the first participant and the identified dominant eye of the second participant is orthogonal to the plane of the display medium.

[0094] The method may comprise adjusting the position of the virtual camera such that a direction between the identified dominant eye of the first participant and the identified dominant eye of the second participant is orthogonal to the plane of the display medium.Thus, the perspective originating from a virtual camera located at the position of the dominant eye of the second participant may be directed orthogonally from the plane of the display medium.

[0095] In some embodiments, the method comprises determining a three-dimensional position of the identified eye of the first participant; and orienting a field of view of the perspective originating from the display position of the identified dominant eye of the second participant, such that a central axis of the perspective is directed towards the three-dimensional position of the identified dominant eye of the first participant.

[0096] It may be preferable to have the first participant captured within the field of view displayed in the middle of the display medium (e.g. the first participant having a position corresponding with the central axis of the field of view).

[0097] In some embodiments, the method comprises (and the processing circuitry is configured to) transforming the visual data from the two or more cameras (e.g. to adjust the camera’s field of view) to form a three-dimensional model (e.g. using triangulation based on the displacement between the two or more cameras). In some embodiments the method comprises (and the processing circuitry is configured to) generating (e.g. rendering) the visual data from the perspective of the virtual camera, e.g. at the location of the display position of the identified dominant eye of the second participant. The visual data may be generated from the perspective of the virtual camera using the three-dimensional model.

[0098] As outlined above, the three-dimensional model and / or the visual data from the perspective of the virtual camera may be generated and / or refined using a neural network.

[0099] In some embodiments, the method comprises, for at least the first participant of the two or more participants, determining a two-dimensional or three-dimensional position of the identified dominant eye of the first participant, and adjusting the vertical component of the display position of the second participant on the displaymedium based on the two-dimensional or three-dimensional position of the identified dominant eye of the first participant.

[0100] By adjusting the vertical component of the display position of the second participant, physical height differences between the first participant and the second participant may be equalised. This may reduce or eliminate the need for the first participant to tilt their head in order to view the second participant, thus ensuring user comfort.

[0101] In some embodiments, the method comprises, for at least the first participant of the two or more participants, determining a physical size of the second participant (e.g. in two or three dimensions), and adjusting the displayed representation of the second participant based on the physical size to approximate the physical size of the second participant.

[0102] The physical size of the second participant may be determined by the processing circuitry, e.g. using stereo vision techniques such as stereoscopy, together with known information about the geometry of the two or more cameras (e.g. distances and angles with respect to each other). The physical size of the second participant may be determined in two-dimensions from the visual data captured by the two or more cameras.

[0103] The processing circuitry may be configured to adjust a depth component of the display position of the second participant in order to control the displayed size of the second participant on the display medium. By providing an enlargement or reduction of the displayed size of the second participant, a scaled representation of the second participant may be provided according to the preferences of the first participant. For example, the displayed size being equivalent to the physical size of the second participant may provide a more realistic and natural representation.

[0104] In some embodiments, communication is facilitated between three or more participants and the method further comprises, for at least a first participant of the three or more participants, displaying, on a display medium, a representation of a third participant of the three or more participants based on the captured visual data of the third participant, determining a display position of the identified dominant eyeof the third participant, manipulating the captured visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the third participant and transmitting the manipulated visual data of the first participant to the third participant.

[0105] Thus, it will be apparent to the skilled person that direct eye contact may only be achieved between two participants (e.g. the first and second participants, e.g. the first and third participants) when the gazes of both participants are aligned towards each other on their respective display media. This is achieved by separately manipulating and transmitting the visual data between any two participants.

[0106] For example, since the first participant can only look at the displayed representation of the second participant or the third participant at a given time, their gaze may only be aligned with either the display position of the second or the third participant’s identified dominant eye. Thus, the second and third participants will only perceive (via the transmitted manipulated visual data of the first participant) eye contact with the first participant when the first participant directs their gaze towards the perspective corresponding to the display position of the second and third participant’s identified dominant eye respectively. As a result, for group interactions of three or more participants, communicational clarity and participant engagement may be enhanced.

[0107] In some embodiments, at least the second participant and the third participant of the three or more participants are displayed on the same display medium.

[0108] In some embodiments, at least the second participant and the third participant of the three or more participants are each displayed on a respective display medium.

[0109] The processing circuitry may be configured to process the visual data captured by the two or more cameras in order to provide dynamic gaze tracking of the first participant’s identified dominant eye over time.

[0110] When viewed from a further aspect, the invention provides a method of facilitating communication between two or more participants across a virtual medium, wherein the method comprises, for each of the two or more participants:capturing visual data of the participant using a single camera; and obtaining an identification of a dominant eye of the participant, wherein the identification is obtained using at least one of:

[0111] the captured visual data; and

[0112] an input received from the participant;

[0113] wherein the method further comprises, for at least a first participant of the two or more participants:

[0114] displaying, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data of the second participant;

[0115] determining a display position of the identified dominant eye of the second participant;

[0116] adjusting the display position of the second participant and / or the position of the single camera such that the identified dominant eye of the second participant and the position of the single camera coincide; and transmitting the visual data captured by the adjusted single camera of the first participant to the second participant.

[0117] The step of adjusting may comprise manipulating the captured visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant.

[0118] When viewed from a further aspect, the invention provides a system for facilitating communication between two or more participants across a virtual medium, the system comprising:

[0119] a single camera configured to capture visual data of each of the two or more participants; and

[0120] processing circuitry configured to:

[0121] obtain an identification of a dominant eye of the participant, wherein the processing circuitry obtains the identification using at least one of:

[0122] the captured visual data; and

[0123] an input received from the participant;

[0124] wherein the processing circuitry is further configured to, for at least a first participant of the two or more participants:display, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data of the second participant;

[0125] determine a display position of the identified dominant eye of the second participant;

[0126] adjust the display position of the second participant and / or the position of the single camera such that the identified dominant eye of the second participant and the position of the single camera coincide; and transmit the visual data captured by the adjusted single camera of the first participant to the second participant.

[0127] The processing circuitry configured to adjust the display position may comprise processing circuitry to manipulate the captured data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant.

[0128] When viewed from a further aspect, the invention provides a display medium arranged to display information based on instructions received from the processing circuitry as described above.

[0129] When viewed from a further aspect, the invention provides a computer readable storage medium storing computer software code which when executing on a processing system performs a method of facilitating communication between two or more participants across a virtual medium, wherein the method comprises, for each of the two or more participants:

[0130] capturing visual data of the participant using two or more cameras; and

[0131] obtaining an identification of a dominant eye of the participant, wherein the identification is obtained using at least one of:

[0132] the captured visual data; and

[0133] an input received from the participant;

[0134] wherein the method further comprises, for at least a first participant of the two or more participants:

[0135] displaying, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data;determining a display position of the identified dominant eye of the second participant;

[0136] manipulating the visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant; and

[0137] transmitting the manipulated visual data of the first participant to the second participant.

[0138] It will be appreciated that all of the features described herein relating to other aspects and embodiments apply equally, as appropriate, to these aspects. For example, in the aspects in which there is a single camera, preferably the system comprises a single camera for each of the two or more participants. Also, for example, features referred to in relation to two or more cameras (e.g. that capture visual data of a participant) may apply equally, where appropriate, for the single camera of these latter aspects.

[0139] It will be appreciated that all the features of the systems described herein may equally apply to the methods described herein, and vice versa.

[0140] Features of any aspect or example described herein may, wherever appropriate, be applied to any other aspect or example described herein. Where reference is made to different examples or sets of examples, it should be understood that these are not necessarily distinct but may overlap.

[0141] One or more non-limiting examples will now be described, by way of example only, and with reference to the accompanying figures in which:

[0142] Figure 1 shows part of a video conferencing system from the perspective of a local participant;

[0143] Figure 2 shows two participants communicating using a video conferencing system;

[0144] Figures 3A, 3B and 3C show a part of a video conferencing system from the perspective of a local participant;

[0145] Figures 4A and 4B show a part of a video conferencing system from the perspective of a local participant;Figure 5 shows a part of a video conferencing system from the perspective of a local participant;

[0146] Figures 6A and 6B show a part of an interconnected video conferencing system;

[0147] Figure 7 shows a flow chart outlining the main steps of a method of facilitating communication between two or more participants across a virtual medium;

[0148] Figure 8 shows a flow chart outlining the main steps of a method of facilitating communication between two or more participants across a virtual medium using virtual cameras;

[0149] Figure 9 shows a flow chart outlining the main steps of a method of facilitating communication between two or more participants across a virtual medium using stationary physical cameras;

[0150] Figure 10 shows a flow chart outlining the main steps of a method of facilitating communication between two or more participants across a virtual medium using movable physical cameras; and

[0151] Figure 11 shows part of a video conferencing system from the perspective of a local participant.

[0152] The examples described herein, in accordance with embodiments of the present invention, are used for video conferencing systems. However, other applications are also envisaged and the examples are not limited to this use.

[0153] Figure 1 shows a system 100 in which a local participant A views a remote participant B on a display medium 101 during a video conferencing call between the two participants. The display medium 101 is arranged to display a visual representation, such as an image or a video feed, of the remote participant B. The visual representation of the remote participant B is based on visual data captured by at least one physical camera of the remote participant B (not shown).

[0154] A (e.g. virtual) camera 110 captures visual data of the local participant A, which is transmitted to a remote location for viewing by the remote participant B at their remote location. The visual data, from the perspective of the (e.g. virtual) camera 110 may be formed by combining visual data captured of the local participant A by multiple physical cameras 102 around the perimeter of the display medium 101.The multiple physical cameras 102 are in communication (e.g. via wired or wireless means) with a processing circuitry 130 of the system 100. The processing circuitry 130 may be partially or fully remote from the local participant A.

[0155] The camera 110 is positioned coincident with the location of the dominant eye 120 of the remote participant on the display medium 101. This allows the local participant A to direct their gaze towards the dominant eye of the remote participant B as well as towards the position of the camera. The local participant A therefore looks directly at the camera 110 and at the remote participant B at the same time.

[0156] The camera 110 has a field of view originating from the location of the dominant eye 120 of the remote participant. As exemplarily illustrated in Figure 1, the extent of the field of view is defined by the spatial volume enclosed by a pyramidal form. An apex of a pyramidal geometry corresponds to the location of the dominant eye 120 of the remote participant. The pyramidal form extends from the apex, located at the plane of the display medium 101, towards the local participant A. The field of view of the camera 110 may have a range that extends beyond the physical position of the local participant A. The local participant A may therefore be captured within the field of view of the camera 110 defined by the bounds of the pyramidal form.

[0157] Figure 2 is an exemplary illustration of a system 200 in which a first participant 210 views a second participant 220 in a first display medium 201 , and the second participant 220 views the first participant 210 in a second display medium 202.

[0158] A first camera 211 captures visual data of the first participant 210 which is then displayed to the second participant 220. The first camera 211 is positioned coincident with the location of the dominant eye of the second participant 220 on the first display medium 201. Similarly, a second camera 212 captures visual data of the second participant 220 which is then displayed to the first participant 210. The second camera 212 is positioned coincident with the location of the dominant eye of the first participant 210 on the second display medium 202.The first camera 211 and the second camera 212 each have a respective field of view similar to the field of view described as above with reference to the camera 110 shown in Figure 1. The first participant 210 is captured within the field of view of the first camera 211 , and the second participant 220 is captured within the field of view of the second camera 212.

[0159] The perspective 231 shows the displayed visual representation of the second participant 220 on the first display medium 201 , from the point of view of the first participant 210. The perspective 232 shows the displayed visual representation of the first participant 210 on the second display medium 202, from the point of view of the second participant 220.

[0160] The first participant 210 and the second participant 220 are illustrated in Figure 2 as being local to each other (e.g. in the same room), but the same configuration may be applied in a remote setting, where the first participant 210 and the second participant 220 are in different physical locations.

[0161] The camera 110, 211, 212 is either a physical camera or a virtual camera. When the camera 110, 211, 212 is a physical camera, it may be embedded in or underneath the display medium 101, 201. When the camera 110, 211, 212 is a virtual camera, it may be constructed using multiple physical cameras that are located in, underneath and / or around the display medium 101, 201.

[0162] Figures 3A and 3B show a system 300 similar to the system 100, in which the local participant A is captured using an array of physical cameras 302. The system is arranged to use the captured visual data of the local participant A of a camera 310 in the array of physical cameras 302. The selected camera 310 is the camera in the array of physical camera that is positioned nearest to the location of the dominant eye 320 of the remote participant B in the display medium.

[0163] In the system 300 shown in Figure 3A, the position of the dominant eye 320 of the remote participant B on the display medium 301 does not coincide with the position of the selected camera 310. From the perspective of the remote participant B, the displayed visual representation of the local participant A appears tilted away and the local participant A does not appear to look directly at the remote participant B.In the system 300 shown in Figure 3B, the displayed image of the remote participant B has been translated such that the position of the dominant eye 320 on the display medium 301 coincides with the position of the selected camera 310. From the perspective of the remote participant B, the displayed visual representation of the local participant A appears forward-facing and the local participant A appears to look directly at the remote participant B.

[0164] Figure 3C shows a system 300’ similar to the system 100, in which the local participant A is captured using a virtual camera 311. The displayed image of the remote participant B and / or the position of the virtual camera 311 have been translated relative to each other, such that they are coincident (e.g. located at the same pixel or group of pixels) on the display medium 301.

[0165] Figures 4A and 4B show a system 400 similar to the systems 100, 300’. The systems 400A, 400B show the orientation of the camera 410 relative to the location of the dominant eye 420A of the local participant A.

[0166] In the system 400A shown in Figure 4A, an angle P is defined between the plane of a display medium 401 and the line of sight of the camera. Since the line of sight of the camera is perpendicular to the plane of the display medium, the angle P is 90°. As the participant A is not positioned along the line of sight of the camera, the camera 410 is not oriented directly towards the location of the dominant eye 420A of the local participant A.

[0167] In the system 400B shown in Figure 4B, the camera 410 has been oriented (e.g. tilted) to align its line of sight with the location of the (obtained and / or determined) dominant eye 420A of the local participant A. An angle Q is defined between the plane of the display medium and the line of sight of the camera. The angle Q is a non-right angle. The location of the dominant eye 420A of the local participant A is positioned along the line of sight of the camera.

[0168] Figure 5 shows a system 500 in which a local participant A views a remote participant B on a first display medium 501 B and a remote participant C on a second display medium 501 C, e.g. during a video conferencing call. The first andsecond display mediums 501 B, 501C are arranged to display a visual representation of the remote participant B as captured by a camera of the remote participant B, and of the remote participant C as captured by a camera of the remote participant C, respectively.

[0169] Cameras 510B, 510C each capture visual data of the local participant A, which is transmitted to the remote locations for viewing by the remote participant B and the remote participant C respectively. The camera 510B is positioned coincident with the location of the dominant eye 520B of the remote participant B on the display medium 501 B. The camera 510C is positioned coincident with the location of the dominant eye 520C of the remote participant C on the display medium 501 C.

[0170] The perspective 530B shows the displayed visual representation of the local participant A from the point of view of the remote participant B. The perspective 530C shows the displayed visual representation of the local participant A from the point of view of the remote participant C. The cameras 510B, 510C each have a respective field of view similar to the field of view described as above with reference to the camera 110 shown in Figure 1. The local participant A is captured within the field of view of the cameras 510B, 510C.

[0171] When the local participant A looks at the dominant eye of the remote participant B (and therefore directs their gaze along the line of sight of the camera 510B), the local participant A appears to look directly at the remote participant B. Meanwhile, the local participant A does not look at the dominant eye of the remote participant C (and therefore does not direct their gaze along the line of sight of the camera 510C), so the local participant A does not appear to look directly at the remote participant C.

[0172] Figure 6A shows a system 600A in which a first participant 601 and a second participant 602 are participating in a video conferencing call at different physical locations. Processing for the video conferencing system 600A, including feature (e.g. eye) tracking and virtual camera generation, is carried out locally at the computers 610A, 610B respectively (e.g. one or both of the computers 610A, 610B comprises processing circuitry for the system 600A). An internet or local areanetwork (LAN) connection 620 is used to transmit and receive data, such as visual and audio data, between the computers 610A, 61 OB at different physical locations.

[0173] Figure 6B shows a system 600B similar to the system 600A, in which the computers 610A, 61 OB both transmit and receive data via a cloud-based processor 630 via an internet or local area network (LAN) connection 620. The cloud-based processor 630 is able to carry out partial or full processing for the video conferencing system 600B (e.g. the cloud-based processor 630 comprises processing circuitry for the system 600B). The degree to which the cloud-based processor 630 carries out processing for the video conferencing system 600B depends on network capabilities and available resources.

[0174] Operation of the video conferencing system shown in Figure 1 (e.g. in the configurations shown in Figures 2-6B) will now be described with reference to the flow charts of Figures 7-10, as well as Figures 1 -6B.

[0175] Figure 7 shows a flow chart outlining the main step of a method 700 of facilitating communication between two or more participants across a virtual medium.

[0176] First, visual data of each the two or more participants is captured using two or more cameras, e.g. the cameras 211, 212 (step 710). An identification of the dominant eye of each participant is then obtained using processing circuitry (step 720), e.g. from information that is stored from a previous manual input into the system. The identification can be determined based on the captured visual data, or determined via a (e.g. manual) input that has been received from each participant.

[0177] For a local participant, a representation of each the other (e.g. remote) participants is displayed by the processing circuitry on a display medium, e.g. the display medium 201, 202 (step 730). Each of the other (e.g. remote) participants may be individually displayed on separate display mediums (as shown in Figure 5), or multiple of the other (e.g. remote) participants may be displayed together or in various combinations on one or more display media.

[0178] For a local participant, a display position of the identified dominant eye for each of the other (e.g. remote) participants (step 740) is determined by the processingcircuitry. This display position is relative to the display medium on which each of the other (e.g. remote) participants is displayed.

[0179] The visual data of the local participant is manipulated by the processing circuitry to correspond to a perspective originating from the display position of the identified dominant eye of the other (e.g. remote) participant (step 750). This provides a direct line of sight between the local participant and the other participant.

[0180] Last, the manipulated visual data of the local participant is transmitted by the processing circuitry to the other participant (step 760). This manipulated data may then be displayed on the display medium that is local to the other participant (e.g. display medium 201, 202).

[0181] Figure 8 shows a flow chart outlining the main steps of a method 800 similar to the method 700 shown in Figure 7. A virtual camera is constructed for each participant using multiple physical cameras which are located in, underneath and / or around a display medium.

[0182] First, before the video conferencing call is initialised, information about the physical set-up of the video conferencing system is retrieved (step 810), e.g. by processing circuitry. The information includes the size and resolution of the display medium (e.g. of the display medium 501 B, 501 C), the position and orientation of each physical camera relative to each other (e.g. of physical cameras positioned around the edges of the display medium), and the position and orientation of each physical camera relative to the display medium. This information is used to construct and position a virtual camera, as well as to generate (e.g. render) the displayed image at a predetermined size.

[0183] The video conferencing call is initialised between two or more participants (step 820). Visual data is captured using physical cameras local to each participant. For a local participant (e.g. local participant A), one or more local display mediums (e.g. display medium 101) are used to display each of the remote participants (e.g. remote participant B). In one embodiment, one or more remote participants are arranged on a single display medium at different locations (e.g. top right quadrant, top left quadrant, bottom right quadrant or bottom left quadrant). In anotherembodiment, and with reference to Figure 5, two or more remote participants B, C are each displayed on a different display medium 501 B, 501 C.

[0184] Information about the dominant eye is retrieved for each participant in the video conferencing call (step 830), e.g. by the processing circuitry. This information can be automatically determined by analysing the visual data using the processing circuitry and / or obtained via a user interface which receives a manual input from each participant.

[0185] The eyes of each participant are detected and the position of each participant’s dominant eye is determined (step 840), e.g. by the processing circuitry. The detection of each participant’s eyes employs image processing techniques to identify features in the visual data captured by the physical cameras. Stereoscopy may be used between the physical cameras to determine the position of each participant’s dominant eye, which may then be transformed into the coordinate system of the screen on which the participant is displayed.

[0186] In some examples, the eyes of the local participant are identified in the visual data that is transmitted to the remote participant for displaying on the display medium. The dominant eye (in the coordinates of the display medium) may then be located using the identification of the eyes and the obtained dominant eye information that was retrieved previously (in step 830).

[0187] A three-dimensional model of each participant (e.g. each participant’s head) is created by the processing circuitry by combining the visual data captured by the physical cameras (step 850). Corresponding features, such as a set of eyes of each participant, are identified and matched between the visual data captured by each physical camera.

[0188] The three-dimensional model is used to generate, using the processing circuitry, an image of each participant from the perspective of a virtual camera. For each local participant, a virtual camera is placed in the position of the displayed (i.e. remote) participant’s dominant eye. The generated image is rendered at the position of the displayed participant’s dominant eye (step 860). Each local participant thereforeappears to look directly at the virtual camera and at the displayed (i.e. remote) participant at the same time.

[0189] For each local participant, the image captured from the perspective of the virtual camera is transmitted to and displayed on the display medium of each remote participant in the video conferencing call (step 870).

[0190] Figure 9 shows a flow chart outlining the main steps of a method 900, which is similar to method 800 outlined in Figure 8 except for steps 950, 960 and 970. An array of physical cameras (such as the array 302 of Figures 3A-B) is used to capture the visual data of each participant. The array of physical cameras according to the embodiment of Figure 9 are stationary relative to a display medium.

[0191] After the eyes of each participant are detected and the position of each participant’s dominant eye is determined, a selected physical camera is selected from the array of physical cameras (step 950). The selected physical camera is the camera which is positioned nearest to the location of the displayed participant’s dominant eye in the display medium.

[0192] Since the array of physical cameras cannot be moved relative to the display medium and thus relative to the local participant, the image of the displayed participant is adjusted (e.g. translated) by the processing circuitry to coincide with the position of the displayed participant’s dominant eye with the (fixed) position of the selected physical camera (step 960). Each local participant therefore appears to look directly at the stationary selected physical camera and at the displayed (i.e. remote) participant at the same time.

[0193] For each local participant, the image captured from the perspective of the selected physical camera is transmitted to and displayed on the display medium of each remote participant in the video conferencing call (step 970).

[0194] Figure 10 shows a flow chart outlining the main steps of a method 1000, which is similar to method 900 outlined in Figure 9 except for step 1060. The array of physical cameras according to the embodiment of Figure 10 are movable relative to a display medium.After the selected physical camera is selected from the array of physical cameras, the position of the displayed participant’s dominant eye and / or the position of the movable selected physical camera is adjusted by the processing circuitry to coincide at the same position on the display medium (step 1060). Each local participant therefore appears to look directly at the movable selected physical camera and at the displayed (i.e. remote) participant at the same time.

[0195] In each of methods 700, 800, 900, 1000, since each participant is streamed from the position of another participant’s dominant eye, a straight line (such as line E of Figure 2) can be formed between any two participants to ensure direct eye contact.

[0196] Figure 11 shows a system 150, for example similar to the system 100 shown in Figure 1. A local participant A views a remote participant B on a display medium 151 during a video conferencing call between the two participants. The visual data of the local participant A is captured by a (e.g. virtual) camera 160 that is displaced from the plane of the display medium 151. As shown in Figure 11, the camera 160 is in front of the display medium 151. In other examples, the camera 160 may be behind the display medium 151 (e.g. when the display medium 151 is transparent or semi-transparent).

[0197] The camera 160 may be a physical camera or, for example, may be a virtual camera that is formed by combining visual data captured of the local participant A by multiple physical cameras, e.g. around the perimeter of the display medium 151.

[0198] The camera 160 is positioned on (or close to) a line that passes through (or close to) the dominant eye of the local participant A and the dominant eye of the remote participant B. This may thus require the dominant eyes of both participants to be obtained and / or determined.

[0199] It will be appreciated by those skilled in the art that the disclosure has been illustrated by describing one or more specific examples thereof, but is not limited to these examples; many variations and modifications are possible, within the scope of the accompanying claims.

Claims

- 32 -CLAIMS1. A method of facilitating communication between two or more participants across a virtual medium, wherein the method comprises, for each of the two or more participants:capturing visual data of the participant using two or more cameras; and obtaining an identification of a dominant eye of the participant;wherein the identification is obtained using at least one of:the captured visual data; andan input received from the participant;wherein the method comprises, for at least a first participant of the two or more participants:displaying, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data of the second participant;determining a display position of the identified dominant eye of the second participant;manipulating the captured visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant; andtransmitting the manipulated visual data of the first participant to the second participant.

2. The method as claimed in claim 1, wherein when the two or more cameras are located outside of an area of the display medium, the manipulating the captured visual data of the first participant comprises combining the visual data captured using each of the two or more cameras to form a single view of the first participant.

3. The method as claimed in claim 1, wherein when the two or more cameras are located inside an area of the display medium, and when the two or more cameras comprises a plurality of stationary physical cameras and / or a plurality of movable physical cameras, the manipulating the visual data of the first participant comprises:- 33 -selecting the visual data captured by a selected camera, wherein the selected camera is the camera of the plurality of stationary physical cameras and / or the plurality of movable physical cameras positioned nearest to the display position of the identified dominant eye of the second participant.

4. The method as claimed in claim 3, wherein when the two or more cameras comprises a plurality of stationary physical cameras, the method comprises adjusting the display position of the second participant such that the identified dominant eye of the second participant coincides with the position of the selected camera.

5. The method as claimed in claim 3, wherein when the two or more cameras comprises a plurality of movable physical cameras, the method comprises adjusting the display position of the second participant and / or the position of the selected camera such that the identified dominant eye of the second participant and the position of the selected camera coincide.

6. The method as claimed in claim 3, 4 or 5, wherein the method comprises reassigning the selected camera to another of the plurality of stationary physical cameras and / or of the plurality of movable physical cameras based on a dynamic display position of the identified dominant eye of the second participant.

7. The method as claimed in any one of the preceding claims, wherein the method comprises, for at least the first participant of the two or more participants:orienting a field of view of the perspective originating from the display position of the identified dominant eye of the second participant, such that a central axis of the perspective is directed orthogonally from the plane of the display medium.

8. The method as claimed in any one of the preceding claims, wherein the method comprises, for at least the first participant of the two or more participants:adjusting the position of the second participant on the display medium, such that a direction between the identified dominant eye of the first participant and the identified dominant eye of the second participant is orthogonal to the plane of the display medium.

9. The method as claimed in any one of the preceding claims, wherein the method comprises, for at least the first participant of the two or more participants:determining a three-dimensional position of the identified dominant eye of the first participant; andorienting a field of view of the perspective originating from the display position of the identified dominant eye of the second participant such that a central axis of the perspective is directed towards the three- dimensional position of the identified dominant eye of the first participant.

10. The method as claimed in any one of the preceding claims, wherein the method comprises transforming the visual data from the two or more cameras to form a three-dimensional model and then generating the visual data from the perspective of a virtual camera at the display position of the identified dominant eye of the second participant.

11. The method as claimed in claim 10, wherein the method comprises using a neural network to:form and / or refine the three-dimensional model; and / orgenerate the visual data from the perspective of a virtual camera at the display position of the identified dominant eye of the second participant.

12. The method as claimed in any one of the preceding claims, wherein the method comprises, for at least the first participant of the two or more participants:determining a two-dimensional position or a three-dimensional position of the identified dominant eye of the first participant; andadjusting the vertical component of the display position of the second participant on the display medium based on the two-dimensional position or the three-dimensional position of the identified dominant eye of the first participant.

13. The method as claimed in any one of the preceding claims, wherein the method comprises, for at least the first participant of the two or more participants:determining a physical size of the second participant; andadjusting the displayed representation of the second participant based on the physical size to approximate the physical size of the second participant.

14. The method as claimed in any one of the preceding claims, wherein communication is facilitated between three or more participants and wherein the method further comprises, for the first participant of the three or more participants:displaying, on a display medium, a representation of a third participant of the three or more participants based on the captured visual data of the third participant;determining a display position of the identified dominant eye of the third participant;manipulating the captured visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the third participant; andtransmitting the manipulated visual data of the first participant to the third participant.

15. The method of claim 14, wherein at least the second participant and the third participant of the three or more participants are displayed on the same display medium.

16. The method of claim 14, wherein at least the second participant and the third participant of the three or more participants are each displayed on a respective display medium.

17. A system for facilitating communication between two or more participants across a virtual medium, the system comprising:two or more cameras configured to capture visual data of each of the two or more participants; andprocessing circuitry configured to:obtain an identification of a dominant eye of each of the two or more participants;wherein the processing circuitry is configured to obtain the identification using at least one of:- 36 -the captured visual data; andan input received from the participant;wherein the processing circuitry is further configured to, for at least a first participant of the two or more participants:display, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data of the second participant;determine a display position of the identified dominant eye of the second participant;manipulate the captured visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant; andtransmit the manipulated visual data of the first participant to the second participant.

18. A method of facilitating communication between two or more participants across a virtual medium, wherein the method comprises, for each of the two or more participants:capturing visual data of the participant using a single camera; obtaining an identification of a dominant eye of the participant, wherein the identification is obtained using at least one of:the captured visual data; andan input received from the participant;wherein the method further comprises, for at least a first participant of the two or more participants:displaying, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data of the second participant;determining a display position of the identified dominant eye of the second participant;adjusting the display position of the second participant and / or the position of the single camera such that the identified dominant eye of the second participant and the position of the single camera coincide; and transmitting the visual data captured by the adjusted single camera of the first participant to the second participant.- 37 -19. A system for facilitating communication between two or more participants across a virtual medium, the system comprising:a single camera configured to capture visual data of each of the two or more participants;processing circuitry configured to:obtain an identification of a dominant eye of the participant, wherein the processing circuitry obtains the identification using at least one of:the captured visual data; andan input received from the participant;wherein the processing circuitry is further configured to, for at least a first participant of the two or more participants:display, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data of the second participant;determine a display position of the identified dominant eye of the second participant;adjust the display position of the second participant and / or the position of the single camera such that the identified dominant eye of the second participant and the position of the single camera coincide; and transmit the visual data captured by the adjusted single camera of the first participant to the second participant.

20. A display medium arranged to display information based on instructions received from the processing circuitry as claimed in claim 17 or 19.

21. A computer readable storage medium storing computer software code which when executing on a processing system performs a method of facilitating communication between two or more participants across a virtual medium, wherein the method comprises, for each of the two or more participants:capturing visual data of the participant using two or more cameras; obtaining an identification of a dominant eye of the participant, wherein the identification is obtained using at least one of:the captured visual data; andan input received from the participant;wherein the method further comprises, for at least a first participant of the two or more participants:displaying, on a display medium, a representation of a second participant of the two or more participants based on the captured visual data;determining a display position of the identified dominant eye of the second participant;manipulating the visual data of the first participant to correspond to a perspective originating from the display position of the identified dominant eye of the second participant; andtransmitting the manipulated visual data of the first participant to the second participant.