A robot and screen interaction AI picture display method and system

By capturing and preprocessing real-time interactive images using robots, and combining CFG and PuLID technologies, efficient image queue management and parallel processing were achieved. This solved the problems of high cost and slow real-time response in the production of outdoor large-screen CG content, improved user experience and interactive effects, and reduced production costs.

CN122240872APending Publication Date: 2026-06-19TOTEM VISION (GUANGZHOU) DIGITAL TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
TOTEM VISION (GUANGZHOU) DIGITAL TECH CO LTD
Filing Date
2026-02-25
Publication Date
2026-06-19

AI Technical Summary

Technical Problem

In existing technologies, the production cost and time of high-quality computer graphics (CG) content for outdoor large screens are high, making it difficult to meet the needs of rapid iteration and large-scale deployment. Artificial intelligence generated content (AIGC) technology has problems with slow response speed and inability to capture user interaction behavior in real-time interactive video generation, resulting in a poor user experience.

Method used

By capturing real-time interactive images with robots, preprocessing and queuing management are performed. CFG and PuLID technologies are used to ensure the consistency of human features, generate scannable QR codes, and achieve parallel processing and resource optimization of image queues. This breaks through the real-time bottleneck and improves response speed and image quality.

🎯Benefits of technology

It significantly improves the production efficiency and display effect of interactive content on outdoor large screens, reduces production costs, enhances user immersion and interactive experience, supports rapid scene deployment and high-quality interaction, and enhances the application potential of AIGC technology in the field of real-time interaction.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure CN122240872A_ABST
    Figure CN122240872A_ABST
Patent Text Reader

Abstract

This invention provides a robot and screen interactive AI image display method and system. The method includes capturing real-time interactive images based on a start command and starting an interactive program; preprocessing the interactive images by merging a scannable QR code with a QR code in a frame of a preset image to update the frame to be displayed; creating a queue of images to be displayed, adding the frames to be displayed as basic elements to the queue, and combining the task results with the frames to be displayed to obtain the final image result; storing the processed images in the queue of images to be displayed in an image storage container; and generating a QR code for the final result based on the frames to be displayed and the images stored in the image storage container. This invention overcomes the technical bottleneck of traditional solutions in terms of real-time performance, achieves efficient coordination and resource optimization for complex image processing tasks, and significantly improves the production efficiency and display effect of interactive content on outdoor large screens.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This invention relates to the field of robot screen interaction technology, and in particular to a method and system for displaying AI images through robot-screen interaction. Background Technology

[0002] Currently, the production of media video content generally faces the challenges of simplification and a lack of depth, leading to homogenization of video advertisements and making it difficult to effectively stimulate user interest and interaction. This is especially true in the outdoor large-screen sector, where content presentation generally lacks originality, failing not only to capture users' attention in an increasingly fragmented information environment but also to encourage effective user interaction, thus preventing the full realization of the enormous potential of digital signage.

[0003] While high-quality computer graphics (CG) content boasts unparalleled advantages in visual presentation, enabling the creation of captivating and immersive experiences, its practical application faces significant challenges. This is primarily due to the specific resolution requirements of large outdoor screens and the variable ambient lighting conditions, necessitating extremely high costs and lengthy production times for CG content creation. From detailed modeling and texture painting to complex lighting and shadow rendering, each step demands substantial human and material resources, severely limiting its widespread adoption in scenarios requiring rapid iteration and large-scale deployment. Therefore, despite strong demand, the high production threshold remains a key bottleneck hindering the widespread adoption of CG content. While AI-generated content (AIGC) technology is considered the future of content production and has shown great potential in many fields, it still faces a series of pressing technical challenges in real-time interactive video generation. Currently, AIGC content is produced relatively slowly, a significant gap compared to the instantaneous response speed required for real-time video rendering. This latency makes it difficult for AIGC to capture and respond to user interactions in real time, resulting in a disjointed and unsmooth user experience. In existing technologies, there is an asynchrony between content production and display. This means that real-time actions or inputs from users on the front end cannot be synchronized to the AIGC backend for processing and content generation in a timely and accurate manner. For example, when users interact with gestures or give voice commands, the AIGC system struggles to complete recognition, understanding, decision-making, and generate corresponding visual feedback within milliseconds. This lag in information flow severely limits the potential of AIGC in dynamic, highly interactive video applications, making it difficult to build truly immersive and personalized interactive experiences. Summary of the Invention

[0004] Therefore, the purpose of this invention is to provide a robot and screen interactive AI image display method and system to overcome the shortcomings of the prior art.

[0005] In a first aspect, the present invention provides a method for displaying AI images through robot-screen interaction, the method comprising: Receive a start command, capture real-time interactive footage based on the start command, and start the corresponding interactive program; The interactive screen is preprocessed, an image storage address is preset, a scannable QR code is generated according to the storage address, and the scannable QR code is merged with the QR code in the frame of the preset image to update the frame to be displayed. The length of the image queue to be displayed is calculated based on the preprocessed interactive screen. An image queue to be displayed is created. The image frame to be displayed is added to the image queue as a basic element and the image is processed to obtain the task result. The task result is combined with the image frame to be displayed to obtain the final image result. The final image result is then updated to the image queue to be displayed. The processed images in the queue of images to be displayed are stored in the image storage container; A QR code for the final result is generated based on the picture frame to be displayed and the image stored in the image memory.

[0006] Furthermore, the step of capturing real-time interactive footage based on the launch command and launching the corresponding interactive program includes: The robot captures real-time interactive footage and issues commands to initiate corresponding interactive programs.

[0007] Furthermore, the step of updating the frame to be displayed includes: The interactive screen is cropped and adjusted sequentially; The cropped and adjusted interactive images are combined to obtain a display frame, wherein the display frame includes a unique number and the current play time.

[0008] Furthermore, the step of image processing includes: The image is processed to identify the characteristics of the person and to segment the person and background in the image. Repair and extract the human feature map from the image, integrate commercial image information and human ID to perform facial fusion, and perform color correction on the image for output.

[0009] Furthermore, the step of repairing and extracting the human feature map from the image, and integrating commercial image information with the human ID for face fusion includes: CFG technology is used to regulate the non-diffusion model Flux in order to guide the generation of characters with consistent features; PuLID technology is used to customize personal images to ensure consistency of identity characteristics.

[0010] Furthermore, after the step of generating a QR code for the final result based on the picture frame to be displayed and the image stored in the image memory, the method further includes: Scan the QR code of the final result to obtain the final result, which may include an interactive video or image. The QR code of the final result includes a fixed URL for an image display page and a parameter pointing to a specific image address in the image storage. Secondly, the present invention also provides a robot and screen interactive AI image display system, the system comprising: A receiving and capturing module is used to receive a start command, capture real-time interactive footage based on the start command, and start the corresponding interactive program. The processing and compositing module is used to preprocess the interactive screen, preset the image storage address, generate a scannable QR code according to the storage address, and merge the scannable QR code with the QR code in the frame of the preset image to update the frame to be displayed. A creation update module is used to calculate the length of the image queue to be displayed based on the preprocessed interactive screen, create an image queue to be displayed, add the image frame to be displayed as a basic element to the image queue to be displayed, and perform image processing to obtain a task result. The task result is combined with the image frame to be displayed to obtain a final image result, and the final image result is updated to the image queue to be displayed. The storage module is used to store the processed images in the image storage container in the image queue to be displayed; The generation module is used to generate a QR code for the final result based on the picture frame to be displayed and the image stored in the image memory.

[0011] Furthermore, the receiving and capturing module includes: The capture unit is used to capture real-time interactive images based on the robot and issue commands through the robot to start the corresponding interactive program.

[0012] Thirdly, the present invention also provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor executes the computer program to implement the robot and screen interactive AI image display method as described above.

[0013] Fourthly, the present invention also provides a storage medium storing a computer program thereon, which, when executed by a processor, implements the above-described method for displaying AI images of a robot interacting with a screen.

[0014] Compared with existing technologies, the beneficial effects of this invention are: by creating a queue of images to be displayed and through queued management and parallel processing architecture, it breaks through the technical bottleneck of real-time performance in traditional solutions. By preprocessing interactive images and updating the processed images to the queue of images to be displayed, it achieves efficient coordination and optimized resource allocation for complex image processing tasks, significantly improving the production efficiency and display effect of interactive content on outdoor large screens. It also lays a solid technical foundation for the widespread application of AIGC (Artificial Intelligence Generated Content) technology in the field of real-time interaction. Attached Figure Description

[0015] Figure 1 This is a flowchart of the robot and screen interactive AI image display method in the first embodiment of the present invention; Figure 2 This is a structural block diagram of the robot and screen interactive AI image display system in the second embodiment of the present invention; Figure 3 This is a schematic diagram of the structure of the electronic device in the third embodiment of the present invention.

[0016] Explanation of key component symbols: 10. Receive and capture module; 20. Processing and compositing module; 30. Creation and update module; 40. Storage module; 50. Generation module; 60. Bus; 61. Processor; 62. Memory; 63. Communication interface.

[0017] The following detailed description, in conjunction with the accompanying drawings, will further illustrate the present invention. Detailed Implementation

[0018] To facilitate understanding of the present invention, a more complete description will be given below with reference to the accompanying drawings. Several embodiments of the invention are illustrated in the drawings. However, the invention can be implemented in many different forms and is not limited to the embodiments described herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete.

[0019] It should be noted that when a component is said to be "fixed to" another component, it can be directly on the other component or there may be an intervening component. When a component is said to be "connected to" another component, it can be directly connected to the other component or there may be an intervening component. The terms "vertical," "horizontal," "left," "right," and similar expressions used in this document are for illustrative purposes only.

[0020] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. The terminology used herein in the description of the invention is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. The term "and / or" as used herein includes any and all combinations of one or more of the associated listed items.

[0021] Example 1 Please see Figure 1 The image shows a robot large-screen interactive multi-terminal interaction method according to the first embodiment of the present invention, the method including steps S1 to S5: S1, Receive a start command, capture real-time interactive footage based on the start command, and start the corresponding interactive program; Specifically, step S1 includes step S11: S11, based on the robot capturing real-time interactive images, and issuing commands through the robot to start the corresponding interactive program; Understandably, once a start command is issued and received, robot A will capture real-time interactive footage and issue commands to open the corresponding interactive program.

[0022] It should be noted that near real-time synchronization of interactive input and visual output is achieved through a collaborative mechanism of real-time image capture and image queue management.

[0023] S2, preprocess the interactive screen, preset the image storage address, generate a scannable QR code according to the storage address, and merge the scannable QR code with the QR code in the frame of the preset image to update the frame to be displayed; Specifically, step S2 includes steps S21 to S22: S21, the interactive screen is cropped and adjusted sequentially; S22, the cropped and adjusted interactive screen is combined to obtain a frame to be displayed, wherein the frame to be displayed includes a unique number and the current play time; Understandably, the captured images are processed, such as cropping and adjusting. Based on preset parameters, the frames are composited, including unique identifiers and current playtime, to create the image to be displayed.

[0024] S3. Calculate the length of the image queue to be displayed based on the preprocessed interactive screen, create an image queue to be displayed, add the image frame to be displayed as a basic element to the image queue to be displayed, and perform image processing to obtain the task result. Combine the task result with the image frame to be displayed to obtain the final image result, and update the final image result to the image queue to be displayed. It should be noted that traditional AIGC technology suffers from processing latency when generating high-quality visual content, failing to meet the millisecond-level response requirements of real-time interaction on outdoor large screens. By employing a queuing processing mechanism and parallel workflow design, the trade-off between content generation speed and visual quality is effectively resolved.

[0025] Specifically, step S3 includes steps S31 to S32: S31, Recognize the image to identify the identity features of the person and segment the person and background in the image; S32, Repair and extract the human feature map in the image, integrate commercial image information and human ID to perform face fusion, and perform color correction on the image and output it; Understandably, a queue is created to store images to be displayed, the image to be processed is read from the queue, further image processing is performed on the read image, and the processed image is updated in the image result queue.

[0026] Specifically, step S32 includes steps S321 to S322: S321 uses CFG technology to regulate the non-diffusion model Flux in order to guide the generation of consistent character features; S322 uses PuLID technology to customize personal images to ensure consistency of identity characteristics.

[0027] It is worth noting that the process involves image recognition, segmentation of the subject and background, extraction and identification of the subject's identity features, and restoration of the extracted feature map to achieve subject ID in-situ fusion. Commercial image information is integrated with the subject ID for facial fusion to achieve subject ID resampling fusion. In this embodiment, CFG technology is used to regulate the non-diffusion model Flux, guiding the generation of consistent subject features. PuLID technology is used to customize the subject image to ensure consistency of identity features. Finally, color correction is applied to ensure that the final output subject's skin tone matches the subject ID. The processed images are managed and prepared through an image display mini-program.

[0028] It should be noted that maintaining a high degree of consistency in identity features is difficult during the image processing of people, and problems such as facial feature distortion or mismatch can easily occur. By using CFG technology to regulate the non-diffusion model Flux in combination with PuLID technology, the consistency of human features is ensured throughout the entire processing flow.

[0029] S4, store the processed images in the image storage container in the image storage queue; Understandably, the processed image can be stored in an image storage container for later use or display.

[0030] S5, Generate a QR code for the final result based on the picture frame to be displayed and the image stored in the image memory; Specifically, step S5 further includes: Scan the QR code of the final result to obtain the final result, which includes an interactive video or image. The QR code of the final result includes a fixed URL of the image display page and a parameter pointing to a specific image address in the image storage.

[0031] Understandably, a QR code for the final result will be automatically generated after the interaction is completed. Users can obtain the final result by scanning the QR code (for example, by downloading the generated interactive video or image).

[0032] It should be noted that outdoor large screens face complex ambient lighting conditions and resolution requirements, making it difficult for existing content production solutions to quickly adapt to different application scenarios. A modular processing architecture and adaptive image processing provide flexible scene adaptation capabilities.

[0033] In summary, the robot and screen interactive AI image display method in the above embodiments of the present invention, by creating a queue of images to be displayed and through queued management and parallel processing architecture, breaks through the technical bottleneck of real-time performance in traditional solutions. By preprocessing the interactive screen and updating the processed images to the queue of images to be displayed, it achieves efficient coordination and resource optimization of complex image processing tasks, significantly improving the production efficiency and display effect of interactive content on outdoor large screens. It also lays a solid technical foundation for the widespread application of AIGC (Artificial Intelligence Generated Content) technology in the field of real-time interaction. The system response time is reduced from several seconds in traditional solutions to milliseconds, improving response speed by over 90%. Through queue management and parallel process design, the overall processing efficiency is improved by over 300%, and the optimized processing flow increases the utilization rate of computing resources to over 85%. The method also utilizes PuLID. The technology ensures that the consistency of character identity features reaches over 95%. Through a dedicated color correction module, it ensures that the color matching degree between the output image and the original character features exceeds 90%. The quality fluctuation range of the generated content under the control of CFG technology is kept within 5%. Real-time interaction latency is reduced, significantly enhancing user immersion. Through character ID recognition and resampling fusion, highly personalized content generation is achieved. The QR code mechanism allows users to easily access and share generated content, improving user satisfaction. Compared with traditional CG production, the overall cost is reduced by more than 90%. The standardized process shortens the deployment time of new scenes from months to weeks. The modular architecture reduces system maintenance complexity and cost by about 50%. The system can flexibly adapt to display devices of different sizes and resolutions. The high-quality interactive experience significantly improves advertising effectiveness and user engagement. The open architecture supports convenient integration and upgrades of subsequent functional modules.

[0034] Example 2 This invention also provides an AI image display system for robot-screen interaction; please refer to [link / reference]. Figure 2 The image shown is a robot and screen interactive AI image display system according to a second embodiment of the present invention. The system includes: The receiving and capturing module 10 is used to receive a start command, capture real-time interactive images based on the start command, and start the corresponding interactive program; The processing and compositing module 20 is used to preprocess the interactive screen, preset the image storage address, generate a scannable QR code according to the storage address, and merge the scannable QR code with the QR code in the frame of the preset image to update the frame to be displayed. The creation update module 30 is used to calculate the length of the display image queue based on the preprocessed interactive screen, create a display image queue, add the display frame as a basic element to the display queue, perform image processing to obtain the task result, combine the task result with the display frame to obtain the final image result, and update the display image queue with the final image result. Storage module 40 is used to store the processed images in the image storage container in the image storage queue to be displayed; The generation module 50 is used to generate a QR code for the final result based on the picture frame to be displayed and the image stored in the image memory.

[0035] In some alternative embodiments, the receiving and capturing module 10 includes: The capture unit is used to capture real-time interactive images based on the robot and issue commands through the robot to start the corresponding interactive program.

[0036] In some alternative embodiments, the steps of the processing synthesis module 20 include: The cropping unit is used to crop and adjust the interactive screen sequentially; The compositing unit is used to composite the cropped and adjusted interactive images to obtain a display frame, wherein the display frame includes a unique number and the current play time.

[0037] In some alternative embodiments, the step of creating the update module 30 includes: The recognition and segmentation unit is used to recognize images to identify the identity features of people and segment the main body of the person and the background in the image; The repair and extraction unit is used to repair and extract the human feature map in the image, integrate commercial image information and human ID to perform facial fusion, and perform color correction on the image for output.

[0038] In some alternative embodiments, the steps of the repair extraction unit include: The guiding unit is used to regulate the non-diffusion model Flux using CFG technology to guide the generation of characters with consistent features. Customization unit, used to customize personal images using PuLID technology to ensure consistency of identity features.

[0039] In some alternative embodiments, the steps of the generation module 50 include: A scanning unit is used to scan the QR code of the final result to obtain the final result, which includes an interactive video or image. The QR code of the final result includes a fixed image display page URL and parameters pointing to a specific image address in the image storage.

[0040] The functions or operation steps implemented by the above modules and units are largely the same as those in the above method embodiments, and will not be repeated here.

[0041] The robot and screen interactive AI image display system provided in this embodiment of the invention has the same implementation principle and technical effect as the aforementioned method embodiment. For the sake of brevity, any parts not mentioned in the system embodiment can be referred to the corresponding content in the aforementioned method embodiment.

[0042] Example 3 The present invention also proposes an electronic device, please refer to [link to relevant documentation]. Figure 3 The image shows an electronic device according to a third embodiment of the present invention.

[0043] The electronic device may include a processor 61 and a memory 62 storing computer program instructions.

[0044] Specifically, the processor 61 may include a central processing unit (CPU), an application specific integrated circuit (ASIC), or one or more integrated circuits that can be configured to implement the present application.

[0045] The memory 62 may include a large-capacity storage device for data or instructions. For example, and not limitingly, the memory 62 may include a hard disk drive (HDD), a floppy disk drive, a solid-state drive (SSD), flash memory, an optical disk drive, a magneto-optical disk drive, magnetic tape, or a Universal Serial Bus (USB) drive, or a combination of two or more of these. Where appropriate, the memory 62 may include removable or non-removable (or fixed) media. Where appropriate, the memory 62 may be internal or external to a data processing device. In a particular embodiment, the memory 62 is non-volatile memory. In a particular embodiment, the memory 62 includes read-only memory (ROM) and random access memory (RAM). Where appropriate, the ROM may be a mask-programmed ROM, a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), an electrically alterable read-only memory (EAROM), or flash memory, or a combination of two or more of these. Where appropriate, the RAM can be Static Random-Access Memory (SRAM) or Dynamic Random-Access Memory (DRAM). DRAM can be Fast Page Mode Dynamic Random-Access Memory (FPMDRAM), Extended Data Out Dynamic Random-Access Memory (EDODRAM), Synchronous Dynamic Random-Access Memory (SDRAM), etc.

[0046] The memory 62 can be used to store or cache various data files that need to be processed and / or communicated, as well as possible computer program instructions executed by the processor 61.

[0047] The processor 61 reads and executes the computer program instructions stored in the memory 62 to implement the robot and screen interactive AI image display method of Embodiment 1 described above.

[0048] In some embodiments, the electronic device may further include a communication interface 63 and a bus 60. For example, Figure 3 As shown, the processor 61, memory 62, and communication interface 63 are connected through bus 60 and complete communication with each other.

[0049] The communication interface 63 is used to enable communication between the various modules, devices, units, and / or equipment in this application. The communication interface 63 can also enable data communication with other components such as external devices, image / data acquisition devices, databases, external storage, and image / data processing workstations.

[0050] Bus 60 includes hardware, software, or both, that couples components of a device together. Bus 60 includes, but is not limited to, at least one of the following: data bus, address bus, control bus, expansion bus, and local bus. For example, and not as a limitation, bus 60 may include an Accelerated Graphics Port (AGP) or other graphics bus, an Extended Industry Standard Architecture (EISA) bus, a Front Side Bus (FSB), a Hyper Transport (HT) interconnect, an Industry Standard Architecture (ISA) bus, an InfiniBand interconnect, a Low Pin Count (LPC) bus, a memory bus, a Micro Channel Architecture (MCA) bus, a Peripheral Component Interconnect (PCI) bus, a PCI-Express (PCI-X) bus, a Serial Advanced Technology Attachment (SATA) bus, a Video Electronics Standards Association Local Bus (VLB) bus, or other suitable buses, or a combination of two or more of these. Where appropriate, bus 60 may include one or more buses. Although this application describes and illustrates a specific bus, this application considers any suitable bus or interconnection.

[0051] The electronic device can acquire the robot and screen interactive AI image display system and execute the robot and screen interactive AI image display method of this embodiment.

[0052] Furthermore, in conjunction with the robot and screen interactive AI image display method in Embodiment 1 above, this application can provide a storage medium for implementation. This storage medium stores computer program instructions; when these computer program instructions are executed by a processor, they implement the robot and screen interactive AI image display method of Embodiment 1 above.

[0053] In the description of this specification, references to terms such as "one embodiment," "some embodiments," "example," "specific example," or "some examples," etc., indicate that a specific feature, structure, material, or characteristic described in connection with that embodiment or example is included in at least one embodiment or example of the invention. In this specification, the illustrative expressions of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the specific features, structures, materials, or characteristics described may be combined in any suitable manner in one or more embodiments or examples.

[0054] The embodiments described above are merely illustrative of several implementations of the present invention, and while the descriptions are specific and detailed, they should not be construed as limiting the scope of the present invention. It should be noted that those skilled in the art can make various modifications and improvements without departing from the concept of the present invention, and these modifications and improvements all fall within the scope of protection of the present invention. Therefore, the scope of protection of this patent should be determined by the appended claims.

Claims

1. A robot and screen interactive AI picture presentation method, characterized by, The method includes: Receive a start command, capture real-time interactive footage based on the start command, and start the corresponding interactive program; The interactive screen is preprocessed, an image storage address is preset, a scannable QR code is generated according to the storage address, and the scannable QR code is merged with the QR code in the frame of the preset image to update the frame to be displayed. The length of the image queue to be displayed is calculated based on the preprocessed interactive screen. An image queue to be displayed is created. The image frame to be displayed is added to the image queue as a basic element and the image is processed to obtain the task result. The task result is combined with the image frame to be displayed to obtain the final image result. The final image result is then updated to the image queue to be displayed. The processed images in the queue of images to be displayed are stored in the image storage container; A QR code for the final result is generated based on the picture frame to be displayed and the image stored in the image memory. 2.The robot and screen interaction AI picture exhibition method according to claim 1, characterized in that, The steps of capturing real-time interactive footage based on the launch command and launching the corresponding interactive program include: The robot captures real-time interactive footage and issues commands to initiate corresponding interactive programs. 3.The robot and screen interaction AI picture exhibition method of claim 1, wherein, The step of updating the photo frame to be displayed includes: The interactive screen is cropped and adjusted sequentially; The cropped and adjusted interactive images are combined to obtain a display frame, wherein the display frame includes a unique number and the current play time.

4. The robot and screen interactive AI image display method according to claim 1, characterized in that, The steps for image processing include: The image is processed to identify the characteristics of the person and to segment the person and background in the image. Repair and extract the human feature map from the image, integrate commercial image information and human ID to perform facial fusion, and perform color correction on the image for output.

5. The robot and screen interactive AI image display method according to claim 4, characterized in that, The steps of repairing and extracting the human feature map from the image, and integrating commercial image information with the human ID to perform face fusion include: CFG technology is used to regulate the non-diffusion model Flux in order to guide the generation of characters with consistent features; PuLID technology is used to customize personal images to ensure consistency of identity characteristics.

6. The robot and screen interactive AI image display method according to claim 1, characterized in that, After the step of generating a QR code for the final result based on the picture frame to be displayed and the image stored in the image memory, the method further includes: Scan the QR code of the final result to obtain the final result, which includes an interactive video or image. The QR code of the final result includes a fixed URL of the image display page and a parameter pointing to a specific image address in the image storage.

7. A robot-screen interactive AI image display system, characterized in that, The system includes: A receiving and capturing module is used to receive a start command, capture real-time interactive footage based on the start command, and start the corresponding interactive program. The processing and compositing module is used to preprocess the interactive screen, preset the image storage address, generate a scannable QR code according to the storage address, and merge the scannable QR code with the QR code in the frame of the preset image to update the frame to be displayed. A creation update module is used to calculate the length of the image queue to be displayed based on the preprocessed interactive screen, create an image queue to be displayed, add the image frame to be displayed as a basic element to the image queue to be displayed, and perform image processing to obtain a task result. The task result is combined with the image frame to be displayed to obtain a final image result, and the final image result is updated to the image queue to be displayed. The storage module is used to store the processed images in the image storage container in the image queue to be displayed; The generation module is used to generate a QR code for the final result based on the picture frame to be displayed and the image stored in the image memory.

8. The robot and screen interactive AI image display system according to claim 7, characterized in that, The receiving and capturing module includes: The capture unit is used to capture real-time interactive images based on the robot and issue commands through the robot to start the corresponding interactive program.

9. An electronic device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that, When the processor executes the computer program, it implements the robot and screen interactive AI image display method as described in any one of claims 1 to 6.

10. A storage medium having a computer program stored thereon, characterized in that, When executed by the processor, the program implements the robot and screen interactive AI image display method as described in any one of claims 1 to 6.