Video display control device and video display control method
The video display control device automatically hides outsider faces in real-time to protect personal information, addressing the inefficiencies of manual monitoring in live broadcasts by using a face recognition system to switch display states based on pre-registered data.
Patent Information
- Authority / Receiving Office
- WO · WO
- Patent Type
- Applications
- Current Assignee / Owner
- JVC KENWOOD CORP
- Filing Date
- 2025-12-18
- Publication Date
- 2026-07-02
AI Technical Summary
Existing video distribution systems fail to effectively protect the personal information of outsiders appearing in captured footage, as constant monitoring by administrators is time-consuming and prone to errors, especially during live broadcasts.
A video display control device and method that includes a face data storage unit and a control unit to recognize pre-registered faces, automatically determining and switching the display state of video information to hidden if an outsider's face does not match stored data, ensuring personal information protection by hiding faces under specified conditions.
Automated protection of personal information by hiding outsider faces in video content, reducing the risk of identification and minimizing operational errors, while allowing continuous live streaming with minimal disruption.
Abstract
Description
Video display control device and video display control method
[0001] The present disclosure relates to a video display control device and a video display control method.
[0002] When distributing video content shot at a sports venue or the like, an outsider, that is, a person other than those involved in the content such as the participating players, may appear in the video. If such a video is viewed by an unspecified number of viewers, a problem may arise from the viewpoint of personal information protection.
[0003] Patent Document 1 describes a system that performs mask processing or the like on an image portion of a person who is not registered in advance in the shot video information, and then distributes the video so that the individual cannot be identified.
[0004] Japanese Patent Application Laid-Open No. 2009-225398
[0005] When a sports game is distributed, outsiders such as spectators often appear in the video. However, if an outsider's face appears on the screen so small that viewers cannot identify the individual, no personal information problem arises. Moreover, even if viewers can identify an individual from the face, there is most likely no problem when the outsider appears in an area where outsiders are expected, such as designated spectator seating. Therefore, measures to prevent identification are needed only when an outsider appears in an area where outsiders should not be.
[0006] Accordingly, when video of a game is archived and distributed later, the administrator checks the video content, processes the video information for any period in which an outsider appears prominently in an area where outsiders should not be so that the individual cannot be identified, and then distributes it.
[0007] However, performing such processing during live distribution requires the administrator to monitor the video information constantly, which is time-consuming and carries a risk of oversights and operating errors.
[0008] This disclosure aims to provide a video display control device and a video display control method that appropriately protect the personal information of outsiders other than those involved in the video content who appear in the captured video footage.
[0009] The video display control device according to this embodiment includes a face data storage unit that stores pre-registered face data for recognizing the faces of persons involved in the video content to be displayed, and a control unit. The control unit acquires video information captured by a camera device, and if an outsider whose face does not match the face data stored in the face data storage unit is captured in the acquired video information under predetermined conditions, it determines that the display state of the video information should be hidden, and switches the display state of the video information according to the result of the determination.
[0010] The video display control method according to this embodiment includes storing pre-registered face data for recognizing the faces of persons involved in the video content to be displayed in a face data storage unit; acquiring video information captured by a camera device using a control unit; determining, using the control unit, that if an outsider whose face does not match the face data stored in the face data storage unit is captured in the acquired video information under predetermined conditions, the display state of the video information should be hidden; and switching the display state of the video information according to the result of the determination using the control unit.
[0011] According to the above configuration, the personal information of outsiders other than those involved in the video content who appear in the filmed footage can be appropriately protected.
[0012] Figure 1 is an overall diagram showing the configuration of a video distribution system using a video distribution device as a video display control device according to an embodiment.
Figure 2A is a flowchart showing the display state determination process executed by each determination unit of the video distribution device according to an embodiment.
Figure 2B is a flowchart showing the display state determination process executed by each determination unit of the video distribution device according to an embodiment.
Figure 3 is a screen configuration diagram showing an example of composite video information distributed by the video distribution device according to an embodiment and displayed on a viewer terminal.
Figure 4 is a screen configuration diagram showing an example of a state in which the video information in the composite video information of Figure 3 has been switched to hidden.
Figure 5 is a screen configuration diagram showing an example of composite video information distributed by the video distribution device according to an embodiment and displayed on a viewer terminal.
Figure 6 is a screen configuration diagram showing an example of a state in which the video information in the composite video information of Figure 5 has been switched to hidden.
Figure 7 is a screen configuration diagram showing an example of composite video information displayed on a viewer terminal according to Modification 1.
Figure 8 is a flowchart showing the display state determination process executed by each determination unit of the video distribution device according to Modification 2.
[0013] Hereinafter, a video distribution system using a video distribution device as a video display control device according to this embodiment will be described with reference to the drawings. In this embodiment, the case in which the video distribution system live streams the status of a golf tournament will be described as an example.
[0014] (Configuration of the video distribution system 1 using the video distribution device 20 according to this embodiment) Figure 1 is an overall diagram showing the configuration of the video distribution system 1 using the video distribution device 20 according to this embodiment. The video distribution system 1 distributes video content that captures the situation of play taking place at multiple holes located at multiple locations in a golf tournament.
[0015] The video distribution system 1 includes a first camera device 10a and a second camera device 10b, which are fixedly installed in a first area AR1 near the first hole of the golf course and are used to film the first hole. The video distribution system 1 also includes a third camera device 10c and a fourth camera device 10d, which are fixedly installed in a second area AR2 near the tenth hole, located in a different location from the first hole, and are used to film the tenth hole. In the following description, camera devices 10a, 10b, 10c, and 10d may be simply referred to as camera device 10. The video distribution system 1 also includes a video distribution device 20 and a viewer terminal 30.
[0016] The camera devices 10a, 10b, 10c, and 10d are connected to the video distribution device 20 via the communication network NW1. The video distribution device 20 and the viewer terminal 30 are connected via the communication network NW2.
[0017] The areas to be filmed within the video distribution system 1 are not limited to two; there may be three or more areas, or only one area. The camera devices installed corresponding to one area are not limited to two; there may be three or more, or only one. In Figure 1, only one viewer terminal 30 is shown for the sake of simplicity, but multiple viewer terminals 30 may be communicably connected to the video distribution device 20.
[0018] The first camera device 10a captures the surroundings and generates video information Pa, encodes the generated video information Pa, and transmits it to the video distribution device 20 as encoded video information Pax. The second camera device 10b captures the surroundings and generates video information Pb, encodes the generated video information Pb, and transmits it to the video distribution device 20 as encoded video information Pbx. The third camera device 10c captures the surroundings and generates video information Pc, encodes the generated video information Pc, and transmits it to the video distribution device 20 as encoded video information Pcx. The fourth camera device 10d captures the surroundings and generates video information Pd, encodes the generated video information Pd, and transmits it to the video distribution device 20 as encoded video information Pdx.
[0019] The first camera device 10a and the second camera device 10b photograph a predetermined area including the first hole of the golf course from different angles. The third camera device 10c and the fourth camera device 10d photograph a predetermined area including the tenth hole from different angles.
[0020] The video distribution device 20 includes a first receiving unit 21a, a second receiving unit 21b, a third receiving unit 21c, a fourth receiving unit 21d, a face data storage unit 22, a control unit 23, a composite video distribution unit 24, and an instruction receiving unit 25. In the following description, the first receiving unit 21a, the second receiving unit 21b, the third receiving unit 21c, and the fourth receiving unit 21d may be simply referred to as the receiving unit 21.
[0021] The video distribution device 20 may be built on a communication network using cloud computing technology. The functional units within the video distribution device 20 do not need to be located in a single device, but may be distributed across multiple devices that are communicably connected to each other.
[0022] The first receiving unit 21a receives encoded video information Pax from the first camera device 10a. The second receiving unit 21b receives encoded video information Pbx from the second camera device 10b. The third receiving unit 21c receives encoded video information Pcx from the third camera device 10c. The fourth receiving unit 21d receives encoded video information Pdx from the fourth camera device 10d.
[0023] The face data storage unit 22 stores pre-registered face data for recognizing the faces of individuals involved in the video content to be displayed. Face data is registered to the face data storage unit 22 of the video distribution device 20, for example, by a registration terminal 40. The configuration for registering face data is not limited to this, and the video distribution device 20 may also be equipped with a function and input unit for registering face data.
[0024] The control unit 23 is, for example, a general-purpose microcomputer and includes a CPU (Central Processing Unit), ROM (Read Only Memory), RAM (Random Access Memory), and an input/output (I/O) interface. The control unit 23 realizes the functions described below by having the CPU read a predetermined video distribution program from the ROM, etc., expand it into RAM, and execute the various expanded programs. Functionally, the control unit 23 has a display state determination unit 231 and a video synthesis processing unit 232.
[0025] The display state determination unit 231 includes a first determination unit 231a, a second determination unit 231b, a third determination unit 231c, and a fourth determination unit 231d. The first determination unit 231a decodes the encoded video information Pax received by the first receiving unit 21a. The first determination unit 231a determines whether an outsider, whose face does not correspond to the face data stored in the face data storage unit 22, is continuously visible in the decoded video information Pa for a predetermined time or longer under predetermined conditions set based on the main subject determined according to the video content. Specifically, the predetermined condition is that the outsider's face is larger than or equal to a predetermined value determined based on the size of the main subject's face that can be captured. However, the predetermined condition is not limited to face size; it may also be that the outsider's face is located within a predetermined area set based on the location where the main subject should be, or further, that the outsider's face is larger than or equal to a predetermined value and located within the predetermined area. In the following explanation, the predetermined conditions used by the display state determination unit 231 for determination will be described as the size of the outsider's face being greater than or equal to a predetermined value. The first determination unit 231a determines that if an outsider is continuously visible in the video information Pa for a predetermined period of time or longer under the predetermined conditions, it will decide to hide the display state of the video information Pa.
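The three variants of the predetermined condition described above (face size, face position, or both) can be sketched as a single predicate. This is only an illustrative sketch: the names `Box`, `Condition`, and `meets_condition` are hypothetical, face size is assumed to be measured as bounding-box height in pixels, and "located within the predetermined area" is assumed to mean that the face box lies entirely inside the area.

```python
from dataclasses import dataclass
from enum import Enum


class Condition(Enum):
    SIZE = "size"                    # face is at least a minimum size
    AREA = "area"                    # face lies inside the predetermined area
    SIZE_AND_AREA = "size_and_area"  # both of the above


@dataclass(frozen=True)
class Box:
    x: int  # left edge in pixels
    y: int  # top edge in pixels
    w: int  # width in pixels
    h: int  # height in pixels

    def contains(self, other: "Box") -> bool:
        """True if `other` lies entirely inside this box (an assumed definition)."""
        return (self.x <= other.x and self.y <= other.y
                and other.x + other.w <= self.x + self.w
                and other.y + other.h <= self.y + self.h)


def meets_condition(face: Box, condition: Condition,
                    min_face_height: int, area: Box) -> bool:
    """Check whether a detected face satisfies the predetermined condition."""
    big_enough = face.h >= min_face_height  # "larger than or equal to" the value
    inside = area.contains(face)
    if condition is Condition.SIZE:
        return big_enough
    if condition is Condition.AREA:
        return inside
    return big_enough and inside
```

Only faces that satisfy this predicate and do not match the registered face data would count toward hiding the video.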
[0026] Similar to the processing of the first determination unit 231a, the second determination unit 231b determines whether or not to hide the display state of the video information Pb. Similar to the processing of the first determination unit 231a, the third determination unit 231c determines whether or not to hide the display state of the video information Pc. Similar to the processing of the first determination unit 231a, the fourth determination unit 231d determines whether or not to hide the display state of the video information Pd.
[0027] The video synthesis processing unit 232 acquires encoded video information Pax from the first receiving unit 21a, encoded video information Pbx from the second receiving unit 21b, encoded video information Pcx from the third receiving unit 21c, and encoded video information Pdx from the fourth receiving unit 21d. The video synthesis processing unit 232 switches each of the acquired encoded video information Pax, Pbx, Pcx, and Pdx between displayed and hidden according to the determination results of the first determination unit 231a, the second determination unit 231b, the third determination unit 231c, and the fourth determination unit 231d, and arranges them within the same screen frame to generate synthesized video information.
[0028] The composite video distribution unit 24 distributes the composite video information generated by the video synthesis processing unit 232 to the viewer terminal 30.
[0029] The instruction receiving unit 25 receives a switching instruction regarding the display of video information within the composite video information transmitted from the viewer terminal 30 and sends it to the video synthesis processing unit 232.
[0030] The viewer terminal 30 is a personal computer or smart device operated by a user viewing the streamed video, and includes a terminal communication unit 31, a decoder 32, a display unit 33, and an input unit 34. The viewer terminal 30 may also include an audio decoder and an audio output unit (not shown).
[0031] The terminal communication unit 31 communicates with the video distribution device 20. The decoder 32 acquires the composite video information distributed from the video distribution device 20 via the terminal communication unit 31 and decodes the encoded video information. The display unit 33 plays the video by displaying the information decoded by the decoder 32. The input unit 34 receives information about user operations.
[0032] Configured as described above, the video distribution system 1 performs multi-angle distribution, in which multiple pieces of video information shot from different angles are displayed and distributed on a single screen.
[0033] (Operation of the video distribution system 1 according to this embodiment) In the video distribution system 1 according to this embodiment, the video distribution device 20 combines video information captured by multiple camera devices 10a, 10b, 10c, and 10d to perform multi-angle distribution.
[0034] While the video distribution system 1 is performing multi-angle distribution, the first camera device 10a films the golfer playing on the first hole at a predetermined angle. The first camera device 10a encodes the captured video information Pa to generate encoded video information Pax, which is then transmitted sequentially to the video distribution device 20.
[0035] While the video distribution system 1 is performing multi-angle distribution, the second camera device 10b films the golfer playing on the first hole from a different angle than the first camera device 10a. The second camera device 10b encodes the captured video information Pb to generate encoded video information Pbx and transmits it sequentially to the video distribution device 20.
[0036] While the video distribution system 1 is performing multi-angle distribution, the third camera device 10c films golfers playing on the 10th hole, which is located in a different location from the 1st hole, at a predetermined angle. The third camera device 10c encodes the captured video information Pc to generate encoded video information Pcx, which is then transmitted sequentially to the video distribution device 20.
[0037] While the video distribution system 1 is performing multi-angle distribution, the fourth camera device 10d films the golfer playing on the 10th hole from a different angle than the third camera device 10c. The fourth camera device 10d encodes the captured video information Pd to generate encoded video information Pdx and transmits it sequentially to the video distribution device 20.
[0038] In the video distribution device 20, the first receiving unit 21a receives encoded video information Pax transmitted from the first camera device 10a. The second receiving unit 21b receives encoded video information Pbx transmitted from the second camera device 10b. The third receiving unit 21c receives encoded video information Pcx transmitted from the third camera device 10c. The fourth receiving unit 21d receives encoded video information Pdx transmitted from the fourth camera device 10d.
[0039] The video synthesis processing unit 232 generates synthesized video information by arranging the acquired encoded video information Pax, Pbx, Pcx, and Pdx on a single screen. The composite video distribution unit 24 distributes the synthesized video information generated by the video synthesis processing unit 232 to the viewer terminal 30.
[0040] When multi-angle distribution is performed in this manner, the first determination unit 231a, second determination unit 231b, third determination unit 231c, and fourth determination unit 231d of the video distribution device 20 each perform a display state determination process.
[0041] Figures 2A and 2B are flowcharts showing the display state determination process performed by the first determination unit 231a, the second determination unit 231b, the third determination unit 231c, and the fourth determination unit 231d. Each of the first determination unit 231a, the second determination unit 231b, the third determination unit 231c, and the fourth determination unit 231d holds a counter (not shown) that counts the number of times an outsider unrelated to the content being distributed is detected from the video information being processed.
[0042] The display state determination process executed by the first determination unit 231a will be explained with reference to the flowcharts in Figures 2A and 2B. When the display state determination process starts, the first determination unit 231a resets the counter value to 0 (step S1) and acquires the encoded video information Pax received by the first receiving unit 21a as the target for processing. The first determination unit 231a decodes the encoded video information Pax to acquire video information Pa and starts a face detection loop that sequentially detects human faces from the video information Pa.
[0043] When the first determination unit 231a detects a human face with a size equal to or greater than a predetermined value from the video information Pa (step S2: YES), it determines whether the image information of the detected face portion matches any of the face data registered in the face data storage unit 22 (step S3).
[0044] Here, the "predetermined value" of the "human face with a size equal to or greater than a predetermined value" detected by the first determination unit 231a is, for example, a size at which face authentication processing is possible, determined according to the number of pixels of the video, etc., and is set based on the size of the human face recognized as the main subject determined according to the video content. This "main subject" is, for example, a person performing a golf play in the content of a golf tournament. That is, the "predetermined value" used by the first determination unit 231a for determination is determined based on the size of the face that can show the main subject. In the content of a golf tournament, a golf play is performed at the tee ground, which is a position closer to the first camera device 10a than the spectators. Therefore, an appropriate face size of a person located at the distance from the first camera device 10a to the tee ground can be set as the "predetermined value".
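One way to turn "the face size of a person at the tee ground" into a pixel value is a simple pinhole-camera estimate. This is an assumption made for illustration, not something the text specifies: the function name, the pinhole model, and all numeric inputs below are hypothetical.

```python
def expected_face_height_px(focal_length_px: float,
                            face_height_m: float,
                            distance_m: float) -> float:
    """Estimate the on-screen face height under a pinhole-camera model.

    Assumed model: height_px = focal_length_px * face_height_m / distance_m.
    """
    if distance_m <= 0:
        raise ValueError("distance_m must be positive")
    return focal_length_px * face_height_m / distance_m


# Hypothetical numbers: focal length of 2000 px, a face about 0.25 m tall,
# and a tee ground 8 m from the camera.
predetermined_value = expected_face_height_px(2000.0, 0.25, 8.0)
```

Under this model a spectator standing farther away than the tee ground yields a smaller face and therefore falls below the predetermined value.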
[0045] The following describes step S2 for the case where the predetermined condition used by the display state determination unit 231 is that the outsider's face is located within a predetermined area. The display state determination unit 231 can set the predetermined area based on the place where the main subject should be. As described above, in golf tournament content the main subject often plays from a fixed position such as the tee ground. Therefore, the first determination unit 231a sets, as the predetermined area, the range within the angle of view of the first camera device 10a in which the face of a person standing at the tee ground is captured.
[0046] When the image information of the detected face matches the registered face data (step S3: YES), the process returns to the face detection loop and proceeds to step S2. The first determination unit 231a repeats the processing of steps S2 and S3 until a face that does not match the registered face data is detected, or until it is determined that all the human faces in the video information Pa match the registered face data.
[0047] When the first determination unit 231a detects a face that does not match the registered face data from the video information Pa (step S3: NO), if the current display state of this video information Pa is "display" (step S4: YES), the counter value is incremented by "1" (step S5). If the current display state of this video information Pa is "hidden" (step S4: NO), the first determination unit 231a does not increment the counter value. The details of the cases where the current display state is "display" or "hidden" will be described later.
[0048] When the first determination unit 231a determines that the faces of all persons in the video information Pa match the registered face data (step S2: NO), if the counter value is 1 or more (step S6: YES), the counter value is decremented by "1" (step S7). If the counter value is 0 (step S6: NO), the first determination unit 231a does not decrement the counter value.
[0049] After the processing of steps S4 to S7, the process moves to the flowchart of FIG. 2B. If the current display state of this video information Pa is "display" (step S8: YES), the first determination unit 231a determines whether the current counter value is equal to or greater than a predetermined threshold value (for example, "3") (step S9).
[0050] If the current counter value is less than the threshold (step S9: NO), the first determination unit 231a determines that, although an outsider appears in the video information Pa at a size equal to or greater than the predetermined value, the outsider has been visible only briefly and may soon disappear from the video information Pa. The threshold can be determined from how long an outsider must be continuously visible before the video is hidden, in relation to the predetermined time processed in step S14, which will be described later. The period for which an outsider must be continuously visible is called the continuous visibility period. Here, the continuous visibility period is set to 30 seconds, but it is not limited to this. For example, if the continuous visibility period is 30 seconds and the predetermined time processed in step S14 is 10 seconds, the threshold is set to 3. In other words, the threshold can be the value obtained by dividing the continuous visibility period by the predetermined time processed in step S14. In this case, the first determination unit 231a determines to maintain the display state of this video information Pa as "displayed" (step S10).
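The threshold arithmetic can be written out as a small helper. This is a sketch: the function name is hypothetical, and rounding up when the period is not an exact multiple of the interval is an assumption the text does not address.

```python
import math


def counter_threshold(continuous_visibility_s: float,
                      check_interval_s: float) -> int:
    """Threshold = continuous visibility period / step S14 interval.

    Rounds up for non-integer quotients (an assumption, not from the text).
    """
    if continuous_visibility_s <= 0 or check_interval_s <= 0:
        raise ValueError("both durations must be positive")
    return math.ceil(continuous_visibility_s / check_interval_s)


# Example from the text: 30-second period, 10-second interval.
threshold = counter_threshold(30, 10)  # 3
```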
[0051] If the current counter value is equal to or greater than the threshold (step S9: YES), the first determination unit 231a determines that the video information Pa contains an outsider who has been visible for the predetermined time or longer at a size equal to or greater than the predetermined value. Based on this, the first determination unit 231a determines to switch the display state of this video information Pa to "hidden" (step S11).
[0052] If the current display state in step S8 is "hidden" (step S8: NO) and the current counter value is not "0" (step S12: NO), the first determination unit 231a determines that the outsider either is still visible in the video information Pa or has only recently disappeared from it or become smaller than the predetermined value, so that the outsider may soon become visible again. Based on this, the first determination unit 231a determines to maintain the display state of the video information Pa as "hidden" (step S11).
[0053] If the current display state is "hidden" (step S8: NO) and the current counter value is "0" (step S12: YES), the first determination unit 231a determines that the outsider who was captured in the video information Pa has either moved out of the video information Pa or has a face portion that has become smaller than the predetermined value, and that a predetermined time has elapsed since then. Based on this, the first determination unit 231a determines to switch the display state of the video information Pa to "display" (step S10).
[0054] The first determination unit 231a notifies the video synthesis processing unit 232 of the determination result (step S13). After a predetermined time has elapsed (for example, a few seconds) (step S14: YES), the process returns to step S2 in Figure 2A, where the encoded video information Pax newly received by the first receiving unit 21a is acquired as the processing target, decoded, and the display state determination process is executed.
[0055] Similarly, the second determination unit 231b also performs a display state determination process at predetermined time intervals to determine the display state of the video information Pb. Similarly, the third determination unit 231c also performs a display state determination process at predetermined time intervals to determine the display state of the video information Pc. Similarly, the fourth determination unit 231d also performs a display state determination process at predetermined time intervals to determine the display state of the video information Pd.
[0056] In this way, the process from steps S2 to S14 is repeated for each piece of video information at predetermined time intervals to determine whether to display or hide the information. By switching between these steps as appropriate, video information containing an outsider that is larger than a predetermined value and appears for a predetermined amount of time or longer is hidden.
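The counter logic of steps S1 to S13 can be sketched as a small state machine that is called once per predetermined interval (step S14). This is a minimal sketch under stated assumptions: the class name is hypothetical, the initial state is assumed to be "displayed", and the single boolean input stands for the combined outcome of steps S2 and S3 (a face meeting the predetermined condition that matches no registered face data).

```python
from dataclasses import dataclass


@dataclass
class DisplayStateDeterminer:
    threshold: int = 3      # example value from step S9
    counter: int = 0        # step S1: counter reset to 0
    displayed: bool = True  # assumed initial display state

    def update(self, outsider_detected: bool) -> bool:
        """Run one pass of steps S2 to S13.

        Returns True for "displayed" and False for "hidden".
        """
        if outsider_detected:
            # Steps S4/S5: count up only while the video is displayed.
            if self.displayed:
                self.counter += 1
        elif self.counter >= 1:
            # Steps S6/S7: count down when no outsider is detected.
            self.counter -= 1

        if self.displayed:
            # Step S8 YES, then step S9: hide once the threshold is reached.
            if self.counter >= self.threshold:
                self.displayed = False  # step S11
        elif self.counter == 0:
            # Step S8 NO, then step S12 YES: show again.
            self.displayed = True       # step S10
        return self.displayed


determiner = DisplayStateDeterminer(threshold=3)
observations = [True, True, True, True, False, False, False, False]
states = [determiner.update(seen) for seen in observations]
# states == [True, True, False, False, False, False, True, True]
```

In this run the video is hidden on the third consecutive detection and returns to "displayed" only after the counter has counted back down to 0, so brief appearances and brief disappearances do not flip the state.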
[0057] In this embodiment, the example was given where the threshold value set to switch the display state from "displayed" to "hidden" and the threshold value set to switch from "hidden" to "displayed" are the same. However, these can be set to different values. For example, the threshold value set to switch the display state from "displayed" to "hidden" may be set to be large, and the threshold value set to switch from "hidden" to "displayed" may be set to be small. By setting them in this way, the period during which the display is hidden can be shortened. This is because there may be a demand to minimize the period during which the video is hidden in a broadcast.
[0058] The video synthesis processing unit 232 acquires encoded video information Pax from the first receiving unit 21a, encoded video information Pbx from the second receiving unit 21b, encoded video information Pcx from the third receiving unit 21c, and encoded video information Pdx from the fourth receiving unit 21d. The video synthesis processing unit 232 switches each of the acquired encoded video information Pax, Pbx, Pcx, and Pdx between displayed and hidden according to the determination results of the first determination unit 231a, the second determination unit 231b, the third determination unit 231c, and the fourth determination unit 231d, and arranges them within the same screen frame to generate synthesized video information.
[0059] Here, the video synthesis processing unit 232 generates the synthesized video information V1 by arranging, as seen by the viewer, the video information Pa, Pb, Pc, and Pd vertically as small screens on the left side of the screen frame, and one of these pieces of video information, in this case the video information Pa, as a screen larger than the small screens on the right side of the screen frame.
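The screen arrangement described above can be illustrated with a small layout calculation. This is only a sketch: the function and type names are hypothetical, and the 25 % width of the left column and the equal-height tiles are assumptions, since the text does not give proportions.

```python
from typing import Dict, List, NamedTuple, Tuple


class Rect(NamedTuple):
    x: int
    y: int
    w: int
    h: int


def layout(frame_w: int, frame_h: int, sources: List[str], main: str,
           left_ratio: float = 0.25) -> Tuple[Dict[str, Rect], Tuple[str, Rect]]:
    """Stack small screens on the left and place one large screen on the right."""
    if main not in sources:
        raise ValueError("main must be one of the sources")
    left_w = int(frame_w * left_ratio)  # assumed width of the left column
    tile_h = frame_h // len(sources)    # equal-height small screens
    small = {name: Rect(0, i * tile_h, left_w, tile_h)
             for i, name in enumerate(sources)}
    large = Rect(left_w, 0, frame_w - left_w, frame_h)
    return small, (main, large)


small, (main_name, large) = layout(1920, 1080, ["Pa", "Pb", "Pc", "Pd"], "Pa")
```

A switching instruction from the viewer terminal would correspond to calling `layout` again with a different `main` value.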
[0060] The composite video distribution unit 24 sequentially distributes the composite video information V1 generated by the video synthesis processing unit 232 to the viewer terminal 30.
[0061] On the viewer terminal 30, the decoder 32 receives the composite video information V1 transmitted from the video distribution device 20 via the terminal communication unit 31 and decodes the encoded video information within the composite video information V1. The display unit 33 plays the composite video by displaying the information decoded by the decoder 32. The decoding by the decoder 32 and the display by the display unit 33 can be performed using a web browser on the viewer terminal 30.
[0062] Figure 3 is a screen configuration diagram showing an example of the composite video information V1 displayed on the display unit 33 when the display status of all video information Pa, Pb, Pc, and Pd is determined to be "displayed". In this screen configuration diagram, the video information Pa, Pb, Pc, and Pd are displayed vertically in small screens on the left side of the display unit 33, and the video information Pa is displayed in a large screen on the right side of the display unit 33. This allows multiple holes to be viewed from different angles.
[0063] During the distribution of the composite video information V1, if, for example, an outsider other than those involved in the golf tournament (such as a member of the public with a reservation) starts playing during a break in the players' play on the first hole, or if a spectator intrudes, this outsider will appear in the video information Pa and Pb. Distributing video information Pa and Pb in which outsiders appear in this way may raise issues from the perspective of protecting personal information.
[0064] Therefore, in this embodiment, if an outsider appears in the video information Pa and Pb for a predetermined duration or longer at a size exceeding a predetermined value based on the processing described above, the video information Pa and Pb are switched to hidden. This prevents the distribution of video in which the faces of outsiders are clearly visible, thereby appropriately protecting personal information.
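The duration-and-size condition described above can be sketched as a small per-camera state machine. This is a minimal sketch under stated assumptions, not the processing of Figures 2A and 2B: the class name `DisplayStateJudge`, its parameters, the use of pixel sizes and seconds, and the behaviour of returning to "displayed" as soon as no large unmatched face is visible are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class DisplayStateJudge:
    """Tracks how long a large unmatched (outsider) face has been visible."""
    min_face_size: int    # assumed size threshold, e.g. face height in pixels
    min_duration: float   # assumed duration threshold in seconds
    _since: Optional[float] = None  # time the current run of large faces began

    def update(self, t: float, unmatched_face_sizes: Sequence[int]) -> str:
        """Return "hidden" or "displayed" for the frame captured at time t.

        unmatched_face_sizes holds the sizes of detected faces that did not
        match any registered face data in this frame.
        """
        large_outsider = any(s > self.min_face_size for s in unmatched_face_sizes)
        if not large_outsider:
            # Assumption: the run is reset and display resumes immediately.
            self._since = None
            return "displayed"
        if self._since is None:
            self._since = t
        elapsed = t - self._since
        return "hidden" if elapsed >= self.min_duration else "displayed"


judge = DisplayStateJudge(min_face_size=80, min_duration=2.0)
states = [judge.update(t, [100]) for t in (0.0, 1.0, 2.0)]
```

A brief appearance therefore does not change the display state; only a continuous run that reaches `min_duration` does.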
[0065] Figure 4 is a screen configuration diagram showing an example of a state where the video information Pa and Pb in the composite video information V1 are switched to hidden. The hidden area where the video information Pa and Pb in the composite video information V1 are located may be filled with black or display the text information "Please wait a moment."
[0066] Here, in situations where the same subject is photographed from different angles, such as in video information Pa and Pb, the display state determination result may differ between the pieces of video information depending on whether or not the outsider's face is visible in each. In such cases, for multiple pieces of video information that are recognized as photographing the same subject, if the display state of any one of them is switched to hidden, the others may also be switched to hidden. Alternatively, even among pieces of video information that photograph the same subject, only the video information that is itself determined to be hidden may be switched to hidden.
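The first of the two policies above, in which hiding one piece of video information also hides the others that photograph the same subject, can be sketched as follows. The function name `propagate_hidden` and the representation of subject groups as sets of feed names are assumptions for illustration; how the device recognizes that feeds share a subject is not covered by this sketch.

```python
def propagate_hidden(states, subject_groups):
    """Return a new state mapping in which every feed of a group is hidden
    when at least one feed of that group was determined to be hidden.

    states: dict mapping feed name -> "displayed" or "hidden"
    subject_groups: iterable of sets of feed names covering the same subject
    """
    result = dict(states)  # leave the caller's per-feed results untouched
    for group in subject_groups:
        if any(states[name] == "hidden" for name in group):
            for name in group:
                result[name] = "hidden"
    return result


per_feed = {"Pa": "hidden", "Pb": "displayed", "Pc": "displayed", "Pd": "displayed"}
grouped = propagate_hidden(per_feed, [{"Pa", "Pb"}])
```

The alternative policy corresponds to using `per_feed` unchanged, so that only the feed that was itself determined to be hidden is switched.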
[0067] During the distribution of the composite video information V1, if the user performs an operation in the input unit 34 to select a small screen other than the video information Pa among the small screens displayed on the left side of the display unit 33, for example, a small screen of video information Pc that is not hidden, a switching instruction to switch the video information to be displayed on the large screen on the right side of the display unit 33 to the selected video information Pc is transmitted to the video distribution device 20 via the terminal communication unit 31.
[0068] In the video distribution device 20, the instruction receiving unit 25 receives the switching instruction transmitted from the viewer terminal 30 and sends it to the video synthesis processing unit 232. Based on the instruction received from the instruction receiving unit 25, the video synthesis processing unit 232 switches to position the video information Pc on the right side of the screen frame and updates the synthesized video information V1.
[0069] The composite video distribution unit 24 sequentially distributes the composite video information V1 updated by the video synthesis processing unit 232 to the viewer terminal 30, where the distributed composite video information V1 is displayed on the display unit 33. As a result, the video information displayed on the large screen on the right side of the display unit 33 is switched to the video information Pc of the selected screen.
[0070] The placement of video information Pa, Pb, Pc, and Pd in the composite video information generated by the video synthesis processing unit 232 is not limited to the placement used in composite video information V1. For example, when the user performs a screen switching operation, the video synthesis processing unit 232 may divide the screen frame into four equal areas and place video information Pa, Pb, Pc, and Pd in the respective areas to generate composite video information V2. When the display state of video information Pa, Pb, Pc, and Pd is determined to be "displayed," and composite video information V2 is generated in this way, the display unit 33 of the viewer terminal 30 will display screen information as shown in Figure 5. Figure 6 is a screen configuration diagram showing an example of a state in which video information Pa and Pb in composite video information V2 are switched to hidden.
[0071] (Modification 1) As modification 1 of this embodiment, the video synthesis processing unit 232 may generate synthesized video information using only video information that has not been determined to be hidden by the display state determination unit 231. Figure 7 is an example of synthesized screen information V3 composed only of video information Pc and Pd, which have not been determined to be hidden, out of the video information Pa, Pb, Pc, and Pd.
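Modification 1 can be illustrated by filtering the feeds before layout. In this sketch the function name `compose_v3` and the side-by-side tiling with equal widths are assumptions; the actual arrangement of Figure 7 is not described in the text.

```python
def compose_v3(frame_w, frame_h, states):
    """Lay out only the feeds whose display state is "displayed".

    Returns a dict mapping feed name -> (x, y, width, height).
    Equal-width side-by-side tiles are an assumed layout, not Figure 7 itself.
    """
    visible = [name for name, state in states.items() if state == "displayed"]
    if not visible:
        return {}  # nothing may be shown; the caller decides what to display
    tile_w = frame_w // len(visible)
    return {
        name: (i * tile_w, 0, tile_w, frame_h)
        for i, name in enumerate(visible)
    }


v3 = compose_v3(
    1920, 1080,
    {"Pa": "hidden", "Pb": "hidden", "Pc": "displayed", "Pd": "displayed"},
)
```

Because hidden feeds take no space, the remaining feeds share the whole frame.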
[0072] (Modification 2) As a second modification of this embodiment, each determination unit 231a, 231b, 231c, and 231d of the display state determination unit 231 may determine whether a face in the video information that is smaller than a predetermined value matches any of the registered face data, and if they do not match, it may be decided to perform concealment processing on that face portion.
[0073] Figure 8 is a flowchart showing the processes performed by each determination unit 231a, 231b, 231c, and 231d in Modification 2, and replaces the flowchart shown in Figure 2A. In step S2, each determination unit 231a, 231b, 231c, and 231d detects all human faces from the video information, regardless of the size of the faces.
[0074] If each determination unit 231a, 231b, 231c, and 231d detects a face from the video information that does not match the registered face data (step S3: NO), and the detected face is larger than a predetermined value (step S15: YES), the unit proceeds to step S4.
[0075] If the detected face is smaller than a predetermined value (step S15: NO), it is decided to apply concealment processing such as mosaic processing to the detected face portion (S16), and the process returns to the face detection loop. The other processes are the same as those described in the flowcharts shown in Figures 2A and 2B, so their explanation is omitted.
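The per-face branching of Modification 2 can be sketched as a three-way classification. This sketch reads steps S3 and S15 as sequential checks (match first, then size), which is an interpretation of the text rather than a reproduction of Figure 8. The names `FaceAction` and `classify_face` are hypothetical, and treating a face exactly equal to the threshold as small is an assumption.

```python
from enum import Enum


class FaceAction(Enum):
    NONE = "none"              # registered face: nothing to do
    MASK = "mask"              # small outsider face: conceal the face portion (S16)
    CHECK_HIDE = "check_hide"  # large outsider face: continue to step S4


def classify_face(matches_registered, face_size, size_threshold):
    """Decide what to do with one detected face.

    matches_registered: result of the comparison with registered face data (S3)
    face_size, size_threshold: same unit, e.g. pixels (assumed)
    """
    if matches_registered:
        return FaceAction.NONE
    if face_size > size_threshold:  # S15: YES
        return FaceAction.CHECK_HIDE
    return FaceAction.MASK          # S15: NO


actions = [
    classify_face(True, 200, 80),
    classify_face(False, 200, 80),
    classify_face(False, 40, 80),
]
```

Faces classified as `MASK` leave the display state of the video information unchanged and only mark the face portion for concealment during synthesis.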
[0076] After the display state determination processing by each determination unit 231a, 231b, 231c, and 231d is completed, the video synthesis processing unit 232 generates synthesized video information in which a concealment process, such as mosaic processing, is applied to each face portion that was marked for concealment within video information whose display state was determined to be "displayed". The video synthesis processing unit 232 may also apply a concealment process such as superimposing an illustration representing a face onto the marked face portion.
[0077] (Effects of the Embodiment) According to the above embodiment, the video display control device 20 includes a face data storage unit 22 that stores face data for recognizing the faces of persons involved in the video content to be displayed, which has been registered in advance, and a control unit 23. The control unit 23 acquires video information captured by the camera device 10, and if an outsider whose face does not match the face data stored in the face data storage unit 22 is captured in the acquired video information under predetermined conditions, it determines that the display state of the video information should be hidden, and switches the display state of the video information according to the result of the determination.
[0078] This allows the system to automatically hide any outsiders who appear in the video content under specified conditions, thereby appropriately protecting their personal information.
[0079] Furthermore, the control unit 23 may determine that the display state of the video information is to be hidden if the face of the outsider is continuously visible in the acquired video information at a size greater than a predetermined value for a predetermined period of time or longer. As a result, the display state is not switched when the outsider's face is visible at a size greater than the predetermined value only very briefly, which avoids frequent switching of the display state that would make the video difficult to view.
[0080] Furthermore, the video display control device 20 may further include a camera device 10, and the camera device 10 may include multiple camera devices that photograph any of the subjects located in multiple locations. The control unit 23 may acquire video information captured by the multiple camera devices, determine the display state for each acquired video information, hide the video information that is determined to be hidden, and generate screen information in which at least one of the video information captured by the multiple camera devices is placed on the same screen.
[0081] This allows for the display of a composite screen with multiple video clips of different subjects on the same screen, where only the video clips containing an outsider under certain conditions can be hidden, while the other video clips continue to be displayed.
[0082] Furthermore, the control unit 23 may generate screen information using video information that has not been determined to be hidden. This allows displayable video information to be displayed on the largest possible screen.
[0083] Up to this point, video distribution system 1 has been envisioned as a live broadcast of a golf tournament, but the content to be broadcast is not limited to this. For example, video distribution system 1 can be used for video content such as gymnastics competitions or music concerts. In the case of a gymnastics competition, the athletes will be the main subjects, and in the case of a music concert, the performers will be the main subjects. The main subjects are not limited to one person; multiple people may be the subjects. In other words, video distribution system 1 is suitable for video content where there is a distinction between areas where outsiders are expected to be present and areas where they should not be. Furthermore, video distribution system 1 does not need to synthesize and broadcast images from multiple camera devices; it can also be used when broadcasting a single image from a single camera device.
[0084] The entire contents of Japanese Patent Application No. 2024-226886 (Filing Date: December 24, 2024) are incorporated herein by reference.
[0085] This invention is a technology that can be used when distributing video content.
Claims
1. A video display control device comprising: a face data storage unit that stores pre-registered face data for recognizing the faces of persons involved in the video content to be displayed; and a control unit, wherein the control unit acquires video information captured by a camera device, determines that the display state of the video information should be hidden if an outsider whose face does not match the face data stored in the face data storage unit is captured in the acquired video information under predetermined conditions, and switches the display state of the video information according to the result of the determination.
2. The video display control device according to claim 1, wherein the control unit determines that the display state of the video information will be hidden if the face of the outsider is continuously reflected in the acquired video information at a size of a predetermined value or larger for a predetermined time or longer.
3. The video display control device according to claim 1, further comprising the camera device, wherein the camera device includes a plurality of camera devices that photograph any of subjects located at a plurality of locations, the control unit acquires video information captured by the plurality of camera devices, determines the display state for each acquired video information, hides the video information that has been determined to be hidden, and generates screen information in which at least one of the video information captured by the plurality of camera devices is placed on the same screen.
4. The video display control device according to claim 3, wherein the control unit generates screen information using video information that has not been determined to be hidden.
5. A video display control method comprising: storing pre-registered face data for recognizing the faces of persons involved in the video content to be displayed in a face data storage unit; acquiring video information captured by a camera device using a control unit; determining, using the control unit, that the video information should be hidden if an outsider whose face does not match the face data stored in the face data storage unit is captured in the acquired video information under predetermined conditions; and switching the display state of the video information according to the result of the determination using the control unit.