A collaborative dual-band laser input method and terminal and system for spatial interaction

By employing a collaborative dual-band laser input method, combining visible indicator lasers and infrared positioning lasers, the problem of separation between visual feedback and positioning in large-space interaction is solved, achieving high-precision, real-time intelligent interactive control and satisfying the natural interactive experience of "what you see is what you get".

CN122308633APending Publication Date: 2026-06-30BEIJING HAOWANG TECHNOLOGY CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
BEIJING HAOWANG TECHNOLOGY CO LTD
Filing Date
2026-04-06
Publication Date
2026-06-30

AI Technical Summary

Technical Problem

Existing technologies struggle to achieve natural pointing, precise positioning, real-time response, and intelligent collaborative interactive control in large-space interactive scenarios, resulting in complex user operations, inaccurate positioning, and unclear feedback, failing to meet the demand for "what you see is what you decide, and what you point to is what you control".

Method used

By employing a collaborative dual-band laser input method, which simultaneously emits visible indicator lasers and infrared positioning lasers, and combines network communication with intelligent semantic parsing, visual feedback and precise positioning are integrated. This establishes a stable connection between the terminal and the back-end system, enabling the system to understand the user's operational intentions and generate corresponding control commands.

Benefits of technology

It achieves "what you see is what you get" in large-space interaction, provides intuitive visual feedback, ensures sub-centimeter-level positioning accuracy, has an interaction response latency of less than 50ms, reduces operational complexity, and supports efficient intelligent interaction in complex environments.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure CN122308633A_ABST
    Figure CN122308633A_ABST
Patent Text Reader

Abstract

This invention discloses a collaborative dual-band laser input method, terminal, and system for spatial interaction, aiming to solve problems such as unintuitive operation, susceptibility to accuracy interference, and high latency in large-space interactions. The method is applied to a smart handheld terminal and includes: a synchronous transmission step, responding to user operation by controlling the terminal to synchronously emit a visible indicator laser and an infrared positioning laser with parallel or coincident output optical axes, forming a light spot with completely consistent position on the target surface, achieving "what you see is what you set"; a communication establishment step, establishing a network communication connection between the terminal and the backend control system; and an intelligent interaction step, receiving and executing control commands generated by the backend control system, which are intelligently generated by the backend system based at least on the precise spatial coordinates of the infrared light spot and the semantic attributes of its associated target object. This invention achieves natural, accurate, real-time, and intelligent large-space human-computer interaction, suitable for scenarios such as smart exhibition halls and command centers.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This invention relates to the fields of human-computer interaction, spatial computing and intelligent control technology, specifically to a collaborative dual-band laser precision pointing and intelligent interaction method for large-space interactive scenarios such as smart exhibition halls, command centers, and large conference rooms, as well as an intelligent handheld terminal and system for implementing this method. Background Technology

[0002] In large-scale digital spaces such as smart exhibition halls, command centers, and large interactive conference rooms, users, including guides and commanders, need to intuitively, accurately, and flexibly interact with physical devices and digital content such as large screens, lighting, sand tables, and projectors from medium to long distances (several meters to tens of meters). Existing technological solutions struggle to meet the core requirements of "natural pointing, precise positioning, real-time response, and intelligent collaboration" in large spaces, exhibiting numerous technical limitations. 1. Traditional center console and dedicated remote control: The functions are fixed and the operation logic is complex. Users need to memorize a large number of button correspondences with devices. It cannot achieve the natural spatial mapping of "point and control", resulting in a fragmented human-computer interaction experience. It is also difficult to adapt to the operation needs of users who move freely in large spaces.

[0003] 2. Computer vision gesture recognition solution: It is greatly affected by factors such as ambient light intensity, user operation posture, and occlusion of objects in the scene. The stability and positioning accuracy of gesture recognition are difficult to guarantee. In addition, the lack of clear visual feedback during user operation can easily lead to operation fatigue and misoperation. It is not suitable for large-space interaction in complex environments.

[0004] 3. Air mouse and gyroscope remote control: They generally suffer from cursor drift and require frequent calibration. Their operation logic is still based on the indirect mapping of a planar mouse, which does not conform to the natural human interaction habit of "direct pointing". In addition, their effective operating distance is limited and cannot support large-space operation within a range of tens of meters.

[0005] 4. Ordinary laser pointers: They can only emit visible lasers to achieve simple visual indication of light spots. They lack spatial positioning capabilities, cannot accurately and reliably convert the user's pointing intentions into control commands for the digital system, and lack the ability to intelligently coordinate with the back-end control system and the ability to perceive the context of the object being operated. They can only be used as an indicating tool and cannot achieve interactive control.

[0006] In summary, existing technologies lack a next-generation human-computer interaction solution that can adapt to large-space interaction scenarios while simultaneously satisfying the requirements of intuitive operation, high positioning accuracy, real-time feedback, and intelligent interaction, making it difficult to achieve the ultimate interactive experience of "what you see is what you decide, and what you point to is what you control". Summary of the Invention

[0007] The technical problem this invention aims to solve is to overcome the aforementioned deficiencies of existing technologies and provide a collaborative dual-band laser input method, terminal, and system for spatial interaction. This solution addresses the core pain points in large-space interaction—"separation of visual feedback and positioning reference, positioning accuracy affected by environmental interference, delayed interaction response, and system inability to understand operational intentions"—through the deep integration and collaboration of physical optics, spatial positioning, network communication, and intelligent semantic parsing. It achieves natural intelligent spatial interaction where "what you see is what you decide, and what you point to is what you control."

[0008] To solve the above-mentioned technical problems, the present invention adopts the following technical solution: Firstly, this invention provides a collaborative dual-band laser input method for spatial interaction, applied to a smart handheld terminal. This method integrates visual feedback and precise positioning through the collaborative emission of dual-band lasers, and combines network communication and intelligent background analysis to achieve intelligent response to operational intentions. Specifically, it includes the following steps: Synchronous transmission steps: In response to user input, the intelligent handheld terminal is controlled to simultaneously transmit a first-band visible indicator laser and a second-band infrared positioning laser. The visible indicator laser and the infrared positioning laser are configured so that their emission optical axes are parallel or coincident, forming light spots with completely consistent spatial positions on the target surface. This design achieves dual technical effects: it provides the operator with intuitive and clear spatial directional visual feedback, and ensures that the precise spatial coordinates of the infrared positioning laser correspond one-to-one with the visual direction of the visible indicator laser in physical space. This fundamentally realizes the basis for "what you see is what you get" spatial interaction, solving the problem of separation between visual feedback and system positioning reference in traditional solutions.

[0009] Communication establishment steps: A stable network communication connection is established with the back-end control system through the communication unit of the intelligent handheld terminal, providing a communication foundation for the bidirectional transmission of subsequent control commands.

[0010] Intelligent interaction steps: Through the network communication connection, the system receives spatial interaction control commands from the backend control system and executes corresponding spatial interaction operations according to the control commands. The control commands are intelligently generated by the backend control system based on at least the precise spatial coordinates of the infrared positioning laser spot on the target surface and the semantic attributes of the target object associated with those coordinates. This enables the system to understand "what the user is pointing to," rather than merely perceiving "where the user is pointing," thus upgrading from "simple positioning" to "intelligent interaction."

[0011] Preferably, the "semantic attributes of the target object" include at least one of the target object's type, function, or executable operation. Through semantic attribute mapping, a relationship is established between physical space coordinates and the logical meaning of the target object, providing a basis for the background control system to parse the user's operation intention.

[0012] Preferably, the network communication connection established in the communication establishment step is a wireless communication connection, more preferably a Wi-Fi connection based on Transmission Control Protocol / Internet Protocol (TCP / IP), which adapts to the needs of mobile operation of user handheld terminals in large spaces and can seamlessly access the existing local area network in the scene, reducing system deployment costs. More preferably, the smart handheld terminal and the background control system transmit real-time interactive data required for spatial interaction via User Datagram Protocol (UDP). Utilizing the connectionless and low-latency technical characteristics of the UDP protocol, the real-time requirements for control command transmission in spatial interaction scenarios are effectively met, ensuring the smoothness of the interaction process.

[0013] Secondly, the present invention provides a collaborative dual-band laser input terminal for spatial interaction, configured to implement the aforementioned collaborative dual-band laser input method. This terminal is handheld and portable, adaptable to the needs of large-space mobile operation, and specifically includes: The laser emitting component is configured to simultaneously emit a first-band visible indicator laser and a second-band infrared positioning laser in response to user input. Through precise optical structure design and calibration, the output optical axes of the two laser beams are kept parallel or coincident, ensuring that the two laser beams form spatially consistent light spots on the target surface.

[0014] The main control component, as the core control unit of the terminal, is used to execute preset program logic, coordinate the work of various functional components such as the laser emission component, communication component, and user input component, and realize the parsing, triggering and feedback of instructions.

[0015] The communication component is electrically connected to the main control component and configured to establish a network communication connection with the background control system to realize bidirectional transmission of control commands, status information and other data between the terminal and the background control system.

[0016] The user input component is electrically connected to the main control component and is used to sense various user input operations and transmit operation signals to the main control component. The input operations include, but are not limited to, button operations, touch operations, and gesture operations.

[0017] Preferably, the laser emitting component includes a visible light laser, an infrared laser, and a precision optical mirror group. The optical mirror group is used to calibrate and adjust the output optical axes of the two laser beams to ensure that the optical axes are parallel or coincident. Preferably, the communication component is a wireless local area network (Wi-Fi) communication component with an embedded TCP / IP protocol stack, supporting real-time data transmission via the UDP protocol.

[0018] Thirdly, this invention provides a collaborative dual-band laser input system for spatial interaction, which achieves end-to-end spatial interaction from "laser pointing" to "intelligent control" in large spaces through the collaborative work of modular components. The system specifically includes: As described above, the intelligent handheld terminal serves as the front-end execution unit for user operation, enabling dual-band laser emission, user operation sensing, and network communication.

[0019] The spatial positioning subsystem is used to capture in real time the light spot formed on the target surface by the infrared positioning laser emitted by the smart handheld terminal, and calculate the precise spatial coordinates of the light spot on the target surface based on the light spot characteristics, so as to provide high-precision positioning data for subsequent semantic mapping and instruction generation.

[0020] The background control subsystem establishes a communication connection with the spatial positioning subsystem and simultaneously connects with the smart handheld terminal via a network, serving as the core intelligent parsing and command generation unit of the system. The background control subsystem includes: The semantic mapping unit is used to map the precise spatial coordinates calculated by the spatial positioning subsystem to the semantic attributes of the corresponding target object, establish a bridge between physical spatial coordinates and the logical meaning of the target object, and realize the system's context awareness of the operated object.

[0021] The instruction generation unit is used to intelligently parse, based at least on the precise spatial coordinates and the semantic attributes of the associated target object, combined with preset logical judgment rules, to generate spatial interaction control instructions that conform to the user's operation intentions and are context-adapted, and send them to the smart handheld terminal.

[0022] The communication interface unit, as the communication hub of the background control subsystem, is used to establish a stable network communication connection with the intelligent handheld terminal to realize the bidirectional transmission of control commands and status information, and at the same time establish a communication connection with the spatial positioning subsystem to receive precise spatial coordinate data.

[0023] The beneficial effects of this invention are that, compared with the prior art, it achieves a technological breakthrough in large-space interactive scenarios through the deep integration of physical optics, spatial computing, network communication, and intelligent semantic parsing, specifically including: 1. Intuitive and natural interaction with precise and clear feedback: Through the core design of "dual-band laser with parallel / coincident optical axes", the visible light spot seen by the user is completely consistent with the spatial position of the infrared light spot located by the system. This completely solves the pain point of separation between user visual feedback and system perception benchmark in traditional interaction solutions, realizing a natural interaction of "what you see is what you get". In addition, users can obtain real-time and clear directional feedback through the visible light spot during operation, which greatly reduces the error rate.

[0024] 2. High positioning accuracy and strong anti-interference capability: Using infrared positioning laser as a spatial positioning beacon, it is less affected by ambient visible light, on-site obstruction and other factors. Combined with a high-precision spatial positioning subsystem, it can achieve stable pointing and positioning with sub-centimeter accuracy in large space scenes with complex lighting, meeting the needs of precise interaction.

[0025] 3. Real-time interactive response and smooth operation experience: Through the optimized "TCP / IP protocol suite + Wi-Fi + UDP" network communication architecture, low-latency transmission of control commands and status information is achieved, and the data transmission latency can be controlled within 50ms, meeting the stringent requirements of real-time response for large-scale interaction and ensuring the smoothness of the entire interaction process.

[0026] 4. Intelligent understanding of intent and low operational complexity: The introduction of a "semantic attribute mapping mechanism for target objects" enables the back-end control system to understand the context of the user's operation, rather than just recognizing physical coordinates. It can automatically switch terminal working modes or trigger complex linkages of devices / scenes, achieving a leap from "manual precise control" to "intelligent intent interaction" and significantly reducing the user's learning cost.

[0027] 5. Modular system design with strong integration and scalability: The intelligent handheld terminal, spatial positioning subsystem, and back-end control subsystem of this invention adopt a modular combination architecture, which is easy to integrate into the intelligent control systems of existing smart exhibition halls, command centers, large conference rooms and other scenarios. Moreover, by expanding semantic mapping rules and logical judgment rules, it can support increasingly rich interactive objects and application scenarios, and has good engineering application value. Attached Figure Description

[0028] Figure 1 This is a schematic diagram of the external structure of a smart handheld terminal provided in an embodiment of the present invention.

[0029] Figure 2 This is a schematic diagram of the internal functional module structure of a smart handheld terminal provided in an embodiment of the present invention.

[0030] Figure 3 This is a schematic diagram of the overall architecture of a collaborative dual-band laser input system for spatial interaction provided in an embodiment of the present invention.

[0031] Figure 4 This is a schematic flowchart of a spatial interactive input method based on cooperative dual-band laser provided in an embodiment of the present invention.

[0032] In the diagram: 100 - Intelligent handheld terminal, 110 - Housing, 120 - Laser emission window, 130 - Operation buttons, 140 - Indicator component; 200 - Laser emission assembly, 210 - Visible laser, 220 - Infrared laser, 230 - Optical lens group; 300 - Main control assembly; 400 - Communication assembly; 500 - User input assembly; 600 - Power module; 700 - Feedback assembly; 800 - Spatial positioning subsystem; 900 - Backend control subsystem, 910 - Communication interface unit, 920 - Semantic mapping unit, 930 - Command generation unit. Detailed Implementation

[0033] To make the objectives, technical solutions, and advantages of this invention clearer, the invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. Obviously, the described embodiments are preferred embodiments of this invention and are only used to explain the invention, not to limit the scope of protection of this invention. All other embodiments obtained by those skilled in the art based on the embodiments of this invention without creative effort are within the scope of protection of this invention.

[0034] It should be noted that the "target surface" mentioned in this invention refers to any physical surface that can be pointed at by a laser in a large-space interactive scene, including but not limited to cultural walls, large screens, sand tables, walls, exhibition boards, etc.; the "spatial coordinates" can be two-dimensional rectangular coordinates or three-dimensional spatial coordinates, whichever is selected according to the positioning requirements of the actual scene, and both are within the protection scope of this invention.

[0035] Please see Figure 1 and Figure 2 In this embodiment, the intelligent handheld terminal 100 is designed with a pen-shaped portable structure, making it easy for users to hold with one hand and operate in large spaces. It includes a housing 110 and various functional components integrated within the housing. The front end of the housing 110 has a laser emission window 120 for laser emission; the side wall of the housing 110 has operation buttons 130 and an indicator component 140. The operation buttons 130 are part of the user input component 500, and the indicator component 140 is part of the feedback component 700, used to provide visual feedback to the user.

[0036] The internal functional components of the intelligent handheld terminal 100 include a laser emitting component 200, a main control component 300, a communication component 400, a user input component 500, a power module 600, and a feedback component 700. Each component is electrically connected to the main control component 300 and is uniformly coordinated and controlled by the main control component 300. Laser emitting assembly 200 includes a 650nm red visible light laser 210, an 850nm near-infrared laser 220, and a precision optical mirror group 230. The optical mirror group 230 includes a collimating lens and a beam-combining prism, used to calibrate and adjust the output optical axes of the two laser beams. This ensures that the optical axis calibration is completed before the terminal leaves the factory. Within an effective working distance of 5-50 meters, the center deviation of the light spots formed by the two laser beams on the target surface is less than 1-2 millimeters, achieving substantial overlap of the output optical axes and guaranteeing "what you see is what you get."

[0037] Main control component 300: It adopts STM32 series low-power MCU as the core control unit of the terminal. It has built-in program logic to parse user input operations, control the synchronous start and stop of laser emission component 200, coordinate the data transmission and reception of communication component 400, and control the feedback trigger of feedback component 700.

[0038] Communication Component 400: Utilizes the ESP32 series communication module supporting 2.4G / 5G dual-band Wi-Fi, with an embedded complete TCP / IP protocol stack. It can be programmed into UDP client mode for low-latency real-time communication with the backend control subsystem 900; it is also compatible with the TCP protocol and can be switched for high-reliability data transmission as needed.

[0039] User input component 500 includes physical operation buttons 130 and a thumb touchpad. The physical buttons are used to trigger basic operations such as laser emission and power on / off; the thumb touchpad is used to perform interactive operations such as single click, double click, and drag, and can convert all user operation signals into electrical signals for transmission to the main control component 300.

[0040] Power Module 600: Uses a rechargeable lithium battery to provide a stable 3.3V power supply to all components of the terminal, adapting to the portable use needs of handheld terminals.

[0041] Feedback component 700 includes an LED indicator (indicator component 140) and a miniature vibration motor. The LED indicator can use different colors (red / green / blue) to indicate the terminal's working status and mode switching results; the vibration motor provides tactile feedback for operation triggering and command reception through short vibrations. This achieves dual feedback of visual and tactile feedback, enhancing the user's operating experience.

[0042] Please see Figure 3 In this embodiment, the overall architecture of the collaborative dual-band laser input system for spatial interaction includes a smart handheld terminal 100, a spatial positioning subsystem 800, and a back-end control subsystem 900. Each module communicates collaboratively via a network, forming a complete spatial interaction system of "front-end operation - mid-end positioning - back-end intelligent analysis." Spatial positioning subsystem 800: Employs a high-precision positioning scheme based on multi-camera machine vision, deployed at the top of an unobstructed location in a large open space. Multiple sets of high-definition cameras capture the light spot formed by the infrared positioning laser on the target surface in real time. A visual positioning algorithm calculates the millimeter-level precision three-dimensional spatial coordinates of the light spot on the target surface, and the positioning data is transmitted in real time to the backend control subsystem 900 via a local area network. It should be noted that the spatial positioning subsystem 800 can also be replaced by other high-precision spatial sensing technologies such as UWB, LiDAR, and ultrasound. Any technology that can achieve accurate spatial coordinate calculation of the infrared positioning laser spot is considered an equivalent alternative to this invention.

[0043] The background control subsystem 900 is an industrial control computer deployed at the scene (an embedded computing device such as a Raspberry Pi 4B can also be used), running customized intelligent control software, including a communication interface unit 910, a semantic mapping unit 920, and an instruction generation unit 930. Among them: The communication interface unit 910 is configured as a UDP server to establish stable communication with the UDP client of the smart handheld terminal 100, and at the same time establish a data connection with the spatial positioning subsystem 800 to realize bidirectional transmission of positioning data, control commands and status information.

[0044] The semantic mapping unit 920 maintains a spatial object semantic mapping database, recording the spatial coordinate boundaries, object ID, and corresponding semantic attributes of each physical device / digital content in the scene. Semantic attributes include at least object type, function, and executable operation. An example mapping data is: {Object ID: “Screen_A”, Coordinate Boundaries: (X1,Y1,Z1)-(X2,Y2,Z2), Type: “4K LED Display Screen”, Function: “Product Display, Video Playback”, Executable Operations: [“Content Zooming”, “Content Dragging”, “Video Play / Pause”]}.

[0045] The instruction generation unit 930 has built-in preset logical judgment rules, which are used to intelligently parse spatial coordinates, semantic attributes and user operation events to generate spatial interaction control instructions that conform to the user's intentions.

[0046] Please see Figure 4 and combined Figure 1-3 Using the interactive digital city sand table in a smart exhibition hall as a typical application scenario, this paper details the specific execution process of the collaborative dual-band laser input method of the present invention. In this scenario, the user is an exhibition hall guide, the target object is the digital city sand table, and the interaction requirement is to highlight the sand table area and play narration audio and video through pointing and simple gestures. The specific process is as follows: S401: Terminal Power-On and Communication Establishment. The guide turns on the power to the smart handheld terminal 100. The main control component 300 controls each component to complete a self-test. After the self-test passes, the communication component 400 automatically connects to the exhibition hall's preset Wi-Fi LAN and establishes a UDP communication connection with the communication interface unit 910 of the backend control subsystem 900, completing the terminal device's registration and authentication. At this time, the terminal's LED indicator light is solid red, indicating successful communication establishment.

[0047] S402: Dual-band synchronous laser emission. The presenter presses the laser emission button on the terminal. The main control component 300 receives the user input signal and immediately controls the laser emission component 200 to simultaneously illuminate the 650nm visible light laser 210 and the 850nm infrared laser 220. The two laser beams with their optical axes aligned are emitted through the laser emission window 120, forming a red visible light spot and an infrared positioning spot with completely consistent spatial positions on the target surface. At this time, the terminal's LED indicator switches to a solid green light.

[0048] S403: User laser pointing operation. The guide holds a smart handheld terminal 100 and points a visible red spot at the central area of ​​the digital city model in the exhibition hall, providing intuitive visual directional feedback. Simultaneously, an invisible infrared positioning spot is projected onto the same location in the central area of ​​the model.

[0049] S404: Precise Spot Positioning and Coordinate Transmission. The multi-camera array of the spatial positioning subsystem 800 captures the infrared positioning spot in real time, calculates the precise three-dimensional spatial coordinates (X,Y,Z) of the spot in milliseconds using a visual positioning algorithm, and sends the coordinate data to the communication interface unit 910 of the background control subsystem 900 in real time via the local area network.

[0050] S405: Spatial coordinates and semantic attribute mapping. The communication interface unit 910 transmits the received three-dimensional coordinate data to the semantic mapping unit 920. The semantic mapping unit 920 queries the spatial object semantic mapping database based on the coordinates (X, Y, Z), determines that the coordinates fall within the spatial coordinate boundary range of "Digital City Sand Table - Central District", and retrieves the semantic attributes of the target object: {Type: "Interactive Sand Table Model", Function: "Regional Planning Display", Executable Operations: ["Highlight Area Building Cluster", "Play Area Explanation Audio and Video", "Retrieve Area Planning Data"]}.

[0051] S406: Background Intelligent Parsing and Mode Control Command Generation. The semantic mapping unit 920 transmits coordinate data and corresponding semantic attributes to the command generation unit 930. The command generation unit 930 generates a spatial interaction control command to "switch to model interaction mode" according to preset logical judgment rules, and sends this command to the intelligent handheld terminal 100 via the communication interface unit 910. After receiving the command, the main control component 300 controls the LED indicator to switch from green to blue and triggers a short vibration of the micro-vibration motor, providing visual and tactile feedback to the guide to indicate that the mode switch was successful.

[0052] S407: User interaction gesture operation and signal transmission. The instructor performs a "double-tap" gesture operation on the terminal's thumb touchpad. The user input component 500 converts this gesture operation into an electrical signal and transmits it to the main control component 300. The main control component 300 encapsulates the "double-tap" operation event into a UDP data packet and immediately sends it to the background control subsystem 900 via the communication component 400.

[0053] S408: Background Intelligent Parsing and Execution Command Generation. The communication interface unit 910 of the background control subsystem 900 receives and parses the "double-click" operation event data packet and transmits it to the command generation unit 930. The command generation unit 930 combines the currently identified target (digital city sand table - central area), the semantic attributes of the target object, and the "double-click" operation event, and generates the final system execution command according to preset logical judgment rules: "Highlight the central area building complex in the digital city sand table and trigger the playback of the 'Central Area Development Plan' narration audio and video." This execution command is sent to the sand table control host and the exhibition hall audio system through the communication interface unit 910.

[0054] S409: Execution Result Feedback. After receiving the execution command, the sand table control host and audio system immediately execute the corresponding operation, highlighting the central area of ​​the sand table and playing the narration audio and video. Simultaneously, the sand table control host sends a "successful operation" status message to the background control subsystem 900. The background control subsystem 900 forwards this status message to the smart handheld terminal 100, which triggers a vibration motor to vibrate twice briefly, indicating successful operation to the guide.

[0055] Thus, a complete, natural, intuitive, and intelligent large-space interaction process is completed. Throughout the process, the guide only needs to complete three steps: "power on → laser pointing → simple gestures," without needing to operate a fixed central control panel or memorize complex buttons. This achieves natural spatial interaction of "point and control," and the response latency of the entire interaction process is controlled within 50ms, resulting in a smooth and lag-free operating experience.

[0056] The scope of protection of this invention is not limited to the specific embodiments described above. For those skilled in the art, this invention can have various modifications and variations. Any modifications, equivalent substitutions, or improvements made to the above embodiments within the scope of the technical concept and inventive spirit of this invention should be included within the scope of protection of this invention. For example, the form of the intelligent handheld terminal 100 can be designed as a short stick shape, a handheld remote control shape, etc., according to scenario requirements; the visible light band of the laser emitting component 200 can be replaced with other visible light bands such as 532nm green light, and the infrared light band can be replaced with other near-infrared bands such as 980nm; the communication component 400 can add Bluetooth BLE 5.0 as a backup communication method on top of Wi-Fi; the spatial positioning subsystem 800 can select different high-precision positioning technologies according to the accuracy requirements of the scenario. As long as its technical solution is consistent with the core concept of this invention, it falls within the scope of protection of this invention.

Claims

1. A collaborative dual-band laser input method for spatial interaction, applied to a smart handheld terminal, characterized in that, The method includes the following steps: Synchronous transmission steps: In response to user operation, the intelligent handheld terminal is controlled to synchronously transmit a visible indicator laser in the first band and an infrared positioning laser in the second band. The visible indicator laser and the infrared positioning laser are configured to have parallel or coincident emission optical axes to form light spots with completely consistent spatial positions on the target surface. This provides the operator with intuitive spatial pointing visual feedback and ensures that the precise spatial coordinates of the infrared positioning laser correspond one-to-one with the visual pointing of the visible indicator laser, achieving a spatial interaction effect of "what you see is what you determine". Communication establishment steps: Establish a network communication connection with the back-end control system through the communication unit of the intelligent handheld terminal; Intelligent interaction steps: Receive control commands from the background control system via the network communication connection, and execute corresponding spatial interaction operations according to the control commands; wherein, the control commands are generated by the background control system based at least on the precise spatial coordinates of the infrared positioning laser spot and the semantic attributes of the target object associated with those coordinates.

2. The method according to claim 1, characterized in that, The "semantic attributes of the target object" include at least one of the target object's type, function, or executable operation.

3. The method according to claim 1 or 2, characterized in that, The network communication connection established in the communication establishment step is a wireless communication connection.

4. The method according to claim 3, characterized in that, The wireless communication connection is implemented based on Transmission Control Protocol / Internet Protocol (TCP / IP).

5. The method according to claim 4, characterized in that, The wireless communication connection is a wireless local area network (Wi-Fi) connection, and the smart handheld terminal and the background control system transmit real-time interactive data required for spatial interaction through User Datagram Protocol (UDP). By utilizing the connectionless and low-latency characteristics of the UDP protocol, the real-time requirements of spatial interaction are met.

6. A collaborative dual-band laser input terminal for spatial interaction, characterized in that, include: A laser emitting assembly is configured to simultaneously emit a first-band visible indicator laser and a second-band infrared positioning laser in response to a user input operation, and to keep the output optical axes of the two laser beams parallel or coincident. Main control component; A communication component, connected to the main control component, is configured to establish a network communication connection with the background control system; The user input component is connected to the main control component and is used to sense user input operations.

7. The terminal according to claim 6, characterized in that, The communication component is a wireless communication component.

8. The terminal according to claim 7, characterized in that, The wireless communication component is a wireless local area network (Wi-Fi) communication component.

9. A collaborative dual-band laser input system for spatial interaction, characterized in that, include: The intelligent handheld terminal as described in any one of claims 6 to 8; A spatial positioning subsystem is used to capture the light spot formed on the target surface by the infrared positioning laser emitted by the smart handheld terminal, and to calculate the precise spatial coordinates of the light spot on the target surface. A background control subsystem is communicatively connected to the spatial positioning subsystem, and the background control subsystem includes: A semantic mapping unit is used to map the precise spatial coordinates to the corresponding semantic attributes of the target object; The instruction generation unit is used to generate spatial interaction control instructions to be sent to the smart handheld terminal, based at least on the precise spatial coordinates and the semantic attributes of the associated target object. The communication interface unit is used to establish a network communication connection with the smart handheld terminal to transmit the control commands.

10. A computer-readable storage medium having a computer program stored thereon, characterized in that, When the computer program is executed by a processor, it implements the method as described in any one of claims 1 to 5.