system
The system addresses health and safety risks in children's digital device use by detecting posture and distance, filtering content, monitoring personal information, and managing time, ensuring a safe and healthy digital experience.
Patent Information
- Authority / Receiving Office
- JP · JP
- Patent Type
- Applications
- Current Assignee / Owner
- SOFTBANK GROUP CORP
- Filing Date
- 2024-12-16
- Publication Date
- 2026-06-26
AI Technical Summary
The increasing use of digital devices by children poses health risks such as vision decline, access to inappropriate content, and personal information leakage, with existing solutions failing to comprehensively address these issues.
A system that includes sensors to detect user distance and posture, filters inappropriate content, monitors personal information input, and manages usage time, providing alerts and notifications through an interactive interface to promote healthy internet use.
The system effectively protects children's eyesight, prevents access to harmful content, safeguards personal information, and manages usage time, creating a safe and healthy digital environment.
Smart Images

Figure 2026105332000001_ABST
Abstract
Description
Technical Field
[0001] The technology of the present disclosure relates to a system.
Background Art
[0002] Patent Document 1 discloses a method for controlling a persona chatbot, which is performed by at least one processor, including steps of receiving a user utterance, adding the user utterance to a prompt including an instruction sentence related to an explanation of a chatbot character, encoding the prompt, and inputting the encoded prompt into a language model to generate a chatbot utterance in response to the user utterance.
Prior Art Documents
Patent Documents
[0003]
Patent Document 1
Summary of the Invention
Problems to be Solved by the Invention
[0004] With the use of digital devices by children in modern times, many health problems and safety risks have emerged. Specifically, these include vision decline due to long-term use, the risk of accessing inappropriate content on the Internet, and the danger of personal information leakage. Furthermore, it is also necessary to minimize the friction regarding Internet use that occurs between parents and children. Means for comprehensively solving these problems are required.
Means for Solving the Problems
[0005] This invention provides a system that continuously detects the user's distance and posture to prevent vision deterioration caused by children's use of digital devices, and generates alerts to promote eye protection based on that information. Furthermore, to prevent access to inappropriate content on the internet, it classifies and filters content through data processing means to prevent problems before they occur. In addition, to protect personal information, it is equipped with a function to monitor the user's input actions and issue warnings as needed. Moreover, it is equipped with a timer and character display to notify users of usage time and break times in an enjoyable way, reducing conflicts between parents and children and promoting healthy internet use.
[0006] "Sensor means" refers to devices and technologies used to detect the distance and posture of a child user.
[0007] "Control means" refers to devices or processes that have the function of generating alerts to promote the protection of eyesight based on information obtained from sensor means.
[0008] "Data processing means" refers to technologies and algorithms used to classify and filter content on the internet and restrict access to inappropriate content.
[0009] "Monitoring measures" refer to devices and technologies that have the function of monitoring the actions that child users input on the internet and encouraging the protection of personal information.
[0010] A "timer mechanism" refers to a device or function that measures elapsed time and manages the user's status in order to notify the user of usage time and break time.
[0011] "Interface means" refers to an interactive display function that uses characters to display notifications and warnings presented to the user. [Brief explanation of the drawing]
[0012] [Figure 1]This is a conceptual diagram showing an example of the configuration of a data processing system according to the first embodiment. [Figure 2] This is a conceptual diagram showing an example of the essential functions of a data processing device and a smart device according to the first embodiment. [Figure 3] This is a conceptual diagram showing an example of the configuration of a data processing system according to the second embodiment. [Figure 4] This is a conceptual diagram showing an example of the main functions of a data processing device and smart glasses according to the second embodiment. [Figure 5] This is a conceptual diagram showing an example of the configuration of a data processing system according to the third embodiment. [Figure 6] This is a conceptual diagram showing an example of the main functions of a data processing device and a headset-type terminal according to the third embodiment. [Figure 7] This is a conceptual diagram showing an example of the configuration of a data processing system according to the fourth embodiment. [Figure 8] This is a conceptual diagram showing an example of the main functions of a data processing device and a robot according to the fourth embodiment. [Figure 9] This shows an emotion map where multiple emotions are mapped. [Figure 10] This shows an emotion map where multiple emotions are mapped. [Figure 11] This is a sequence diagram showing the processing flow of the data processing system in Example 1. [Figure 12] This is a sequence diagram showing the processing flow of the data processing system in Application Example 1. [Figure 13] This is a sequence diagram showing the processing flow of the data processing system in Example 2, when an emotion engine is combined. [Figure 14] This is a sequence diagram showing the processing flow of the data processing system in Application Example 2, which combines an emotion engine. [Modes for carrying out the invention]
[0013] Hereinafter, an example of an embodiment of the system relating to the technology of this disclosure will be described with reference to the attached drawings.
[0014] First, the terms used in the following description will be explained.
[0015] In the following embodiments, the labeled processor (hereinafter simply referred to as "processor") may be a single arithmetic unit or a combination of multiple arithmetic units. Also, the processor may be a single type of arithmetic unit or a combination of multiple types of arithmetic units. Examples of arithmetic units include a CPU (Central Processing Unit), a GPU (Graphics Processing Unit), a GPGPU (General-Purpose computing on Graphics Processing Units), an APU (Accelerated Processing Unit), and the like.
[0016] In the following embodiments, the labeled RAM (Random Access Memory) is a memory in which information is temporarily stored and is used as a work memory by the processor.
[0017] In the following embodiments, the labeled storage is one or more non-volatile storage devices that store various programs and various parameters, etc. Examples of non-volatile storage devices include flash memory (SSD (Solid State Drive)), magnetic disks (e.g., hard disks), or magnetic tapes, and the like.
[0018] In the following embodiments, the labeled communication I / F (Interface) is an interface including a communication processor and an antenna, etc. The communication I / F controls communication between multiple computers. Examples of communication standards applied to the communication I / F include wireless communication standards including 5G (5th Generation Mobile Communication System), Wi-Fi (registered trademark), or Bluetooth (registered trademark), and the like.
[0019] In the following embodiments, "A and / or B" is synonymous with "at least one of A and B." That is, "A and / or B" means that it may be A alone, or B alone, or a combination of A and B. Furthermore, in this specification, the same concept as "A and / or B" applies when expressing three or more things linked by "and / or."
[0020] [First Embodiment]
[0021] Figure 1 shows an example of the configuration of the data processing system 10 according to the first embodiment.
[0022] As shown in Figure 1, the data processing system 10 includes a data processing device 12 and a smart device 14. An example of the data processing device 12 is a server.
[0023] The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network).
[0024] The smart device 14 comprises a computer 36, a reception device 38, an output device 40, a camera 42, and a communication interface 44. The computer 36 comprises a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The reception device 38, output device 40, and camera 42 are also connected to the bus 52.
[0025] The reception device 38 is equipped with a touch panel 38A and a microphone 38B, etc., and receives user input. The touch panel 38A receives user input by detecting contact with an object (e.g., a pen or finger). The microphone 38B receives user input by detecting the user's voice. The control unit 46A transmits data indicating the user input received by the touch panel 38A and microphone 38B to the data processing device 12. In the data processing device 12, the specific processing unit 290 acquires the data indicating the user input.
[0026] The output device 40 includes a display 40A and a speaker 40B, and presents data to the user 20 by outputting the data in a form perceptible to the user 20 (e.g., audio and / or text). The display 40A displays visible information such as text and images according to instructions from the processor 46. The speaker 40B outputs audio according to instructions from the processor 46. The camera 42 is a small digital camera equipped with an optical system such as a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor.
[0027] Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various types of information between processor 46 and processor 28 via network 54.
[0028] Figure 2 shows an example of the main functions of the data processing device 12 and the smart device 14.
[0029] As shown in Figure 2, in the data processing device 12, a specific processing is performed by the processor 28. A specific processing program 56 is stored in the storage 32. The specific processing program 56 is an example of a "program" related to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 according to the specific processing program 56 executed on the RAM 30.
[0030] The storage 32 stores the data generation model 58 and the emotion identification model 59. The data generation model 58 and the emotion identification model 59 are used by the identification processing unit 290.
[0031] In the smart device 14, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The reception output program 60 is used in conjunction with a specific processing program 56 by the data processing system 10. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48.
[0032] Next, the specific processing performed by the specific processing unit 290 of the data processing device 12 will be described. In the following description, the data processing device 12 will be referred to as the "server" and the smart device 14 as the "terminal".
[0033] This invention provides a system that promotes healthy internet use among children by providing features such as eye protection, content filtering, personal information protection, and usage time management. The program's processing is described below in natural language.
[0034] First, the device uses sensors to detect the distance and posture of the user, i.e., the child's face. This activates a function that issues alerts for appropriate eye protection, prompting the user to take a break or say things like "You're too close to the screen" or "It's time for a break." These notifications are displayed using child-friendly characters, allowing for natural behavioral correction.
[0035] Next, the server receives the URL of the website the child is trying to access and checks it against a database to determine if it contains inappropriate content. This filtering technology is essential to protect children from harmful information. If the content is deemed inappropriate, the device immediately notifies the user and suggests safe sites and activities.
[0036] Furthermore, the device monitors children's attempts to enter personal information online. When a user enters an email address or other personal information, it immediately issues a warning and urges them to refrain from doing so, thereby reducing the risk of data breaches.
[0037] Furthermore, the device manages the child's internet usage time, notifies them when the set time is approaching, and automatically switches to rest mode when the actual time is up. This function helps maintain the child's health by avoiding prolonged continuous use and encouraging appropriate breaks.
[0038] As a concrete example, when a child uses the internet to supplement their studies, the server filters and displays only safe learning-related sites, and the device notifies the child to take a break before screen time becomes too long. This provides an environment where learning can be done efficiently while also protecting health and safety.
[0039] The following describes the processing flow.
[0040] Step 1:
[0041] The device activates its sensors to monitor the child's face position and orientation in real time. The detected data is used to calculate the distance and angle of the face.
[0042] Step 2:
[0043] Based on monitoring results, the device determines if the user is in a state that could negatively affect their eyesight, such as "holding the screen too close." If this is detected, an alert message "You need to take a break" is displayed on the screen.
[0044] Step 3:
[0045] A user attempts to access a website. At that time, the device captures the accessed URL and sends it to the server.
[0046] Step 4:
[0047] The server receives the URL, compares it against filtering criteria in the database, and analyzes whether the website in question is inappropriate.
[0048] Step 5:
[0049] If the server determines that the website in question is inappropriate, it returns that result to the device. The device then displays a message to the child stating, "This page is not accessible," and suggests safe alternative content.
[0050] Step 6:
[0051] The device monitors personal data (such as email addresses and credit card information) when a child starts entering information.
[0052] Step 7:
[0053] If personal information is detected in an input field, the device will immediately issue a warning and prompt the child not to enter the information.
[0054] Step 8:
[0055] The device counts the elapsed time since internet use began and manages the time to ensure it does not exceed the set limit. When the remaining time is running low, it notifies the child of the countdown.
[0056] Step 9:
[0057] When the set usage time is reached, the device locks the screen, prompting the child to take a break. At this time, the display screen shows relaxation methods and game suggestions accompanied by characters.
[0058] (Example 1)
[0059] Next, we will describe Example 1. In the following description, the data processing device 12 will be referred to as the "server," and the smart device 14 will be referred to as the "terminal."
[0060] In today's information society, ensuring children use the internet safely and healthily is a critical issue. In particular, concerns about vision problems, access to inappropriate content, personal data breaches, and health damage from prolonged use are serious. Traditional methods have struggled to comprehensively address these issues, necessitating a unified solution.
[0061] The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 1 is realized by the following means.
[0062] In this invention, the server includes detection means, determination means, information processing means, monitoring means, and time management means. This enables the protection of children's eyesight, control of inappropriate content access, protection of personal information, and management of usage time.
[0063] "Detection means" refers to devices or technologies used to sense the user's distance and posture.
[0064] The "decision-making mechanism" is a mechanism that generates a warning for eye protection based on the detection results.
[0065] "Information processing means" refers to technologies for classifying and selecting data and controlling access to inappropriate information.
[0066] A "monitoring measure" is a system designed to observe the user's input operations and prevent the unauthorized input of personal information.
[0067] A "time management system" is a management system used to limit a user's activity time and maintain their health.
[0068] "Means of expression" refers to a function that provides notifications or warnings to users through the display of characters.
[0069] This invention is a system that supports healthy internet use among children, and its specific embodiments are shown below.
[0070] The device features a built-in camera and infrared sensor to detect the child's distance and posture in real time. The data acquired by the sensors is used for vision protection, and if the distance is too close or the posture is inappropriate, a warning is displayed to the child using a character notification. In this process, the camera and infrared sensor are used as hardware, and a software algorithm for vision protection processes the data.
[0071] The server receives the URL of a website that a child is trying to access and filters it against a database to eliminate inappropriate content. The database contains keywords and site lists necessary for filtering, and machine learning techniques are used to identify inappropriate elements.
[0072] The device also monitors when children enter personal information into forms. For example, if it detects the input of an email address or phone number, it immediately issues a warning to prevent the leakage of personal information. This uses browser plugins and real-time data checking functions provided by the operating system.
[0073] Furthermore, the device manages the child's internet usage time, notifies them when the set time limit is reached, and switches to rest mode when the time is up. This allows users to take appropriate breaks. An installed timer application is used for time management.
[0074] As a concrete example, when a child uses the internet for online learning, the server displays only safe websites related to learning, and the device sends notifications at regular intervals recommending a break. This provides a safe and healthy learning environment.
[0075] The following are examples of prompts for a generative AI model.
[0076] "Please propose specific ways to ensure children can use the internet safely. This should include measures such as eye protection, content filtering, personal information protection, and managing usage time."
[0077] The flow of the specific processing in Example 1 will be explained using Figure 11.
[0078] Step 1:
[0079] The device activates its built-in camera and infrared sensor to detect the distance and posture of the child's face. Input includes image data of the child's face and distance measurement data. This data is analyzed by an algorithm to determine if the distance falls below a predetermined threshold (e.g., 30cm or more) or if the posture is inappropriate. If the distance is too close or the posture is inappropriate, the device generates a warning message using a character to protect the child's eyesight. This prompts the user to immediately move away from the screen.
[0080] Step 2:
[0081] The server receives the URL of the website that the child has requested to access. The input includes the URL data and metadata such as the access time. The server compares this URL with a database to check if it matches any pre-registered lists of inappropriate URLs or keywords. A text search algorithm is used for data processing. If it is determined to be inappropriate, the server sends an access denied message and a suggestion of a safe site to the device as output. This procedure guides the child to safe information without accessing inappropriate content.
[0082] Step 3:
[0083] The device monitors the input in real time when a child fills out a form in their web browser. The input is the text data entered by the child. The plugin used for monitoring performs regular expression matching to detect keywords that constitute personal information (e.g., email address, phone number). If such keywords are detected, a warning pop-up is displayed as output, and the submission function is disabled. This procedure prevents the leakage of personal information.
[0084] Step 4:
[0085] The device activates a built-in timer when a child starts using the internet and measures the usage time. The input recorded is the start time of use. Five minutes before the set usage limit is reached, a warning notification is generated, and the message "You have 5 minutes of usage time remaining" is displayed on the screen. When the set time is up, the internet connection is temporarily disabled, and a message prompting a break is displayed. This allows children to use the internet while taking appropriate breaks.
[0086] (Application Example 1)
[0087] Next, we will explain Application Example 1. In the following explanation, the data processing device 12 will be referred to as the "server," and the smart device 14 will be referred to as the "terminal."
[0088] In recent years, creating an environment where children can use the internet safely and healthily has become crucial. However, there are concerns about the risks children face when spending long hours online, such as eye strain, access to inappropriate content, and the leakage of personal information. There is a need to develop systems that mitigate these risks and appropriately manage usage time, thereby promoting the healthy physical and mental development of children.
[0089] The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 1 is realized by the following means.
[0090] In this invention, the server includes detection means for detecting the location and posture of a child user; control means for generating warnings to encourage the protection of the user's vision based on the detection means; processing means for classifying and filtering digital information and restricting access to inappropriate digital information; and filtering means for analyzing web location markers accessed by the user and allowing only safe digital information. This enables children to use the internet safely while protecting their eyesight and managing their usage time.
[0091] A "detection device" is a device that has the function of sensing the position and posture of a child user in real time and converting it into numerical data.
[0092] A "control means" is a device that generates warnings to reduce the visual strain on the user based on data obtained from a detection means.
[0093] A "processing means" is a component that handles data management functions to classify digital information, select inappropriate content, and restrict access to it.
[0094] A "monitoring device" is a device that has the function of monitoring user input operations and implementing security measures to prevent the leakage of personal information.
[0095] A "filtering method" is a system that analyzes the web location information accessed by the user and has the function of allowing only safe digital information.
[0096] The system that realizes this invention is configured using a server and a terminal.
[0097] The server's primary role is to classify and filter digital information, restricting access to inappropriate content. Specifically, the server receives web location markers that users attempt to access and compares them against a database to determine if the information is safe. This filters out inappropriate digital information, providing users with a safe internet environment.
[0098] On the other hand, the terminal functions as an interface for the user to access the internet. It uses a built-in camera and distance sensor to detect the user's position and posture in real time. Using image processing libraries such as OpenCV, it generates warnings through the display device if the user is too close to the display or has poor posture.
[0099] To manage usage time, the device manages time information through a real-time database and notifies the user when usage ends based on the set usage time. This helps users avoid excessive use and promotes healthy usage. A virtual character is used for notifications, providing user interaction in a friendly manner.
[0100] For example, when a user accesses learning content over the internet, the server provides only secure, educational digital information. Furthermore, the device helps maintain user concentration and well-being by prompting breaks before prolonged use.
[0101] By utilizing generative AI models, it is possible to optimize the content of the messages and warnings for individual users. An example of a prompt to input into the generative AI model is, "Please tell me how to guide my child to use the internet safely." Such a system can provide comprehensive support for children's digital lives.
[0102] The flow of a specific process in Application Example 1 will be explained using Figure 12.
[0103] Step 1:
[0104] The server receives a web location identifier (URL) sent by the user. The received URL is compared against a database on the server to determine if it is secure digital information. The database contains a list of URLs that have been previously verified as secure. If the URL is determined to be inappropriate, the server sends a corresponding notification to the device.
[0105] Step 2:
[0106] The device uses its built-in camera and distance sensor to detect the user's position and posture. Using the OpenCV library, it analyzes video data from the camera to determine if the user is too close to the display or has poor posture. This information is then used to evaluate the appropriateness of the distance and posture through programmatic thresholding. Depending on the result, the device displays visual or audible warnings to the user.
[0107] Step 3:
[0108] The device measures the user's internet usage time and manages the progress of that time in a real-time database. Timekeeping information is obtained by a timer on the device, and notifications are sent based on the set usage limit. When the end time is approaching, the device generates a voice message through a virtual character prompting the user to take a break.
[0109] Step 4:
[0110] When a user enters digital information, the terminal monitors keystrokes and detects specific patterns from a personal information protection perspective. If a specific email address or phone number input pattern is recognized, the system immediately issues a warning to the user prompting them to stop entering information. This warning is provided in either a visual or audio format.
[0111] Step 5:
[0112] By utilizing a generative AI model, the device provides user-friendly interactions. An example prompt is "How can I help my child use the internet safely?", and the generative AI model generates optimal advice and wording, delivering it to the user through communication. This enables flexible, personalized guidance.
[0113] Furthermore, an emotion engine that estimates the user's emotions may be incorporated. That is, the identification processing unit 290 may use the emotion identification model 59 to estimate the user's emotions and perform identification processing using the user's emotions.
[0114] This invention provides a system for children's internet use that combines eye protection, filtering of inappropriate content, protection of personal information, usage time management, and recognition of the user's emotions. This system is designed to ensure a safe and healthy digital experience for children. The program's processing is described below in natural language.
[0115] First, the device uses sensors to capture the child's face and detect their distance and posture. Based on this data, it determines if eye protection is needed and displays an appropriate alert. The content of the alert can be fine-tuned based on the user's emotions, and this is supported by an emotion engine.
[0116] The emotion engine analyzes sensor data and usage patterns from the device to predict emotions using the child's facial expressions, voice, and interaction data. This emotion data is used not only for alerts but also for providing safe content.
[0117] Next, the device captures the URLs of the web pages the child accesses, and inappropriate content is filtered using data processing tools. At this point, the emotion engine adjusts the filtering criteria and suggested safe alternative content based on the child's psychological state.
[0118] Furthermore, the device monitors whether personal information is included when a child enters data online, and if the emotion engine detects anxiety or stress, it provides more careful warnings and explanations. Especially when a child is emotional, a friendly character explains the situation and the interface is designed to reassure the child.
[0119] In usage time management, the device displays an alert when a set time has elapsed, guiding users to take breaks naturally. In this process as well, the emotion engine detects fatigue and boredom, supporting time management by issuing alerts at appropriate times.
[0120] For example, when a child is learning online, the device's emotion engine detects stress during learning and suggests interactions such as, "Let's refresh ourselves with fun quizzes and games," making breaks enjoyable. In this way, an approach that takes emotional states into account can provide children with a safe and enjoyable environment for using the internet.
[0121] The following describes the processing flow.
[0122] Step 1:
[0123] The device uses sensors to monitor the distance and posture of the child's face and processes this data in real time.
[0124] Step 2:
[0125] The device inputs the acquired data into an emotion engine, which analyzes the child's emotions based on their facial expressions and voice. Based on these results, it decides whether or not to issue an alert for vision protection.
[0126] Step 3:
[0127] The device uses the results of an emotion engine analysis to display alerts such as "Let's move a little further away from the screen!" in a more friendly tone.
[0128] Step 4:
[0129] When a user attempts to access a webpage, the device sends the URL to the server.
[0130] Step 5:
[0131] The server receives the URL, compares it with the database to determine if it contains inappropriate content, and sends the result back to the terminal.
[0132] Step 6:
[0133] The device's emotional engine assesses the child's psychological state and, if necessary, blocks access to inappropriate content, informing them in a gentle tone, "This site is not safe."
[0134] Step 7:
[0135] The device monitors whether any input actions performed by the child include personal information.
[0136] Step 8:
[0137] If personal information is detected, the device will display a warning, but simultaneously, taking into account the results of the emotion engine's analysis, it will offer reassuring words to reduce the child's stress.
[0138] Step 9:
[0139] The device tracks usage time, and an emotional engine displays a countdown that gently reminds the child, "It's almost time for a break," depending on their fatigue or boredom.
[0140] Step 10:
[0141] When usage time ends, the device switches to vacation mode, which includes relaxation suggestions, and automatically locks the screen. At this time, a character suggests an activity that children can enjoy, saying, "Here's what's next!"
[0142] (Example 2)
[0143] Next, we will describe Example 2. In the following description, the data processing device 12 will be referred to as the "server" and the smart device 14 as the "terminal".
[0144] When children use the internet, there is a need to protect their eyesight, restrict access to inappropriate content, protect their personal information, and provide a user environment that takes their emotions into consideration. In conventional methods, these elements are often managed individually, making it difficult to build an integrated digital protection environment.
[0145] The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 2 is realized by the following means.
[0146] In this invention, the server includes sensing means for detecting the face, facial expressions, and voice of a child user; analysis means for analyzing the user's emotions based on the sensing means and generating alerts corresponding to the emotional state; and processing means for classifying and filtering content, restricting access to inappropriate information, and suggesting alternative information. This enables a total digital protection environment for child users.
[0147] "Sensing means" refers to sensors or devices used to detect the child user's face, facial expressions, and voice in real time.
[0148] "Analysis means" refers to the process of analyzing the user's emotional state based on data obtained from sensing means and generating appropriate alerts.
[0149] "Processing means" refers to a series of information processing steps that not only classify content and filter out inappropriate information, but also suggest appropriate alternative information.
[0150] "Monitoring measures" refer to a system that constantly monitors the user's input actions and provides necessary warnings from the perspective of protecting personal information.
[0151] A "timer device" refers to equipment that notifies the user of usage time and break times, and generates alerts at appropriate times according to their emotional state.
[0152] "Interface means" refers to a user interface that provides notifications and warnings to the user through character displays tailored to their emotional state.
[0153] This invention is a system for realizing a safe and secure digital experience for children using the internet. Specifically, the system includes sensors, an analysis engine, data processing, and interface means.
[0154] The device is equipped with sensor devices such as a camera and microphone. These are used as "sensing means" to capture the child's face, expressions, and voice in real time. The device sends this data to a server as basic data for emotion analysis.
[0155] On the server, the collected data is processed using "analysis tools" to analyze the child's emotional state. Emotion prediction algorithms and machine learning models are used for this analysis. Based on these analysis results, the content of the alerts presented to the user on the terminal is adjusted.
[0156] To prevent access to inappropriate content, the device captures the browser's URL information and filters it using a "processing mechanism." This filtering system proactively blocks inappropriate information and suggests safe alternative content.
[0157] Furthermore, the device constantly monitors the child's input and uses "monitoring tools" to issue a warning if personal information is included. If stress or anxiety is detected, a friendly character will be displayed to calmly explain the situation.
[0158] To manage usage time, the device uses a "timer mechanism" to notify the user of usage time and break time. If fatigue is detected, it will make a suggestion such as "Let's take a little more break" depending on the user's emotional state.
[0159] For example, if a user is experiencing stress during online learning, the device might suggest interactions such as "Let's refresh ourselves with fun quizzes and games," offering suggestions to complement learning with other activities. This ensures a safe digital environment while maintaining learning effectiveness.
[0160] An example of a prompt would be, "Explain how to optimize alerts based on a child's emotions during online learning." This provides a basis for properly utilizing generative AI models.
[0161] The flow of the specific processing in Example 2 will be explained using Figure 13.
[0162] Step 1:
[0163] The device uses its built-in camera and microphone to collect sensor data such as the child's face, expressions, and voice. It acquires this data as input and transmits it to a server in real time. Specifically, the device activates its camera and periodically records facial movements and voice.
[0164] Step 2:
[0165] The server receives sensor data transmitted from the terminal as input to the emotion analysis engine. The data processing performed here involves analyzing the emotional state using machine learning algorithms. As output, it generates scores and labels indicating the child's emotional state. Specifically, it determines states such as "happy" or "stressed" based on factors such as the presence or absence of a smile and the tone of voice.
[0166] Step 3:
[0167] The server determines appropriate alert content for the user based on the analysis results and outputs it to the terminal. It receives the results of sentiment analysis as input and creates an alert message corresponding to the emotional state as output. A specific example of its operation is generating a message such as, "Please sit down and move away from the screen."
[0168] Step 4:
[0169] The device captures the URL of the webpage the user wants to access and passes it as input to the data processing system. In this step, calculations are performed to filter whether the URL is inappropriate. The output suggests safe alternative content to the user. Specifically, if there is a page that cannot be accessed, it will display a message such as "This site cannot be viewed, please try this instead."
[0170] Step 5:
[0171] The device monitors children's online input behavior and displays a warning if input containing personal information is detected. It acquires user typing data as input and uses an emotion engine to detect anxiety, thereby outputting a more considerate interface. Specifically, a friendly character appears on the screen saying, "Let's not enter addresses."
[0172] Step 6:
[0173] The device monitors the set usage time and passes the elapsed time as input data to the timer. The output is a break alert to the user. Specifically, it displays "It's almost time for a break" and has a function to temporarily lock the screen.
[0174] (Application Example 2)
[0175] Next, we will explain application example 2. In the following explanation, the data processing device 12 will be referred to as a "server" and the smart device 14 as a "terminal".
[0176] Children are required to have a safe and comfortable user experience in the digital information environment, while simultaneously receiving appropriate visual protection, filtering of digital content, protection of personal information, and management of usage time, all while considering their emotional state. However, conventional systems have struggled to comprehensively achieve these goals. In addition, there is a lack of means to effectively manage and improve children's digital experiences while reducing the burden on parents in the home environment.
[0177] The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 2 is realized by the following means.
[0178] In this invention, the server includes sensor means for detecting the user's distance and posture, analysis means for recognizing emotions and adjusting the digital experience, and adjustment means for communicating with an intelligent machine that monitors educational activities and provides support based on the user's emotions. This enables the protection of children's vision and support for digital education that is in harmony with their emotions.
[0179] "User" refers to an individual who uses a system in a digital information environment, and in particular, to a child who uses a device under the supervision of a parent or guardian.
[0180] "Sensor means" refers to devices and technologies for detecting the user's distance, posture, and facial expressions, and includes cameras, microphones, and other sensing devices.
[0181] "Control means" refers to technology that generates warnings and alerts to encourage visual protection in the user based on data obtained from sensor means.
[0182] "Data processing means" refers to a processing device and program for classifying digital information and filtering out inappropriate content.
[0183] "Monitoring measures" refer to technologies and devices that check the user's input activities and prevent the leakage of personal information.
[0184] "Analysis means" refers to technology that recognizes the user's emotional state based on information such as facial expressions and voice, and adjusts the digital experience accordingly.
[0185] "Adjustment means" refers to technology that improves and provides the user experience in cooperation with intelligent machines, etc., in response to the user's emotions recognized by the analysis means.
[0186] An "intelligent machine" refers to a robot or device equipped with information processing and communication capabilities to enhance children's learning and entertainment.
[0187] "Educational activities" refer to all forms of learning and training aimed at improving children's knowledge and skills.
[0188] This invention provides a system in which a server interacts with an intelligent machine installed in the home, namely a consumer robot, to support a safe and comfortable digital experience for children. The terminal uses sensor means to sense the user's face and body movements and detect distance and posture. Based on this, a control means functions to determine whether visual protection is needed and to display an alert.
[0189] The server collects and analyzes sensor data through cloud-based services. Here, it uses Google® Cloud Vision API and Microsoft® Azure® Emotion API to analyze the user's facial expressions and voice, and obtain emotional data. Furthermore, the analyzed emotional data is used as a means of tailoring content and activities to the user. For example, generative AI models using TENSORFLOW® or PyTorch analyze and predict emotions in real time, and fine-tune filtering criteria.
[0190] If a user attempts to access inappropriate digital information, data processing mechanisms on the server immediately evaluate the information and verify its suitability using the Google Safe Browsing API. During this process, the device monitors input operations using monitoring mechanisms to prevent the leakage of personal information and issues warnings as needed.
[0191] As a concrete example, when a child is using a digital device to participate in an educational program, a robot can be present to manage the appropriateness of the digital information and usage time while suggesting fun activities. For example, by entering a prompt such as, "Determine from this child's facial expression whether they are stressed and generate code to suggest appropriate countermeasures," the system can provide solutions tailored to individual needs.
[0192] In this way, the invention solves the challenges of protecting children's eyesight and digital safety, and realizes a system that reduces the burden on parents at home.
[0193] The flow of a specific process in Application Example 2 will be explained using Figure 14.
[0194] Step 1:
[0195] The device uses sensors to detect the user's facial and body movements in real time. It receives raw data from cameras and microphones as input and processes it as sensor data. Output includes distance, posture, and facial expression information. Specifically, it performs face detection from camera images and uses a distance sensor to measure the distance to the user.
[0196] Step 2:
[0197] The server receives sensor data via cloud services and performs emotion analysis using the Google Cloud Vision API and Emotion API. It receives facial images and voice data as input and sends this data to the API for analysis. The output is an estimate of the user's emotional state. Specifically, it analyzes facial expressions from images, predicts emotions based on tone of voice, and compiles this information.
[0198] Step 3:
[0199] The server uses data processing tools to analyze the URL of the digital content the user is trying to access. It receives URLs captured from a web browser as input and uses the Google Safe Browsing API for classification and filtering. The output evaluates whether the content is safe. Specifically, the API scans the URL and returns whether it poses a risk.
[0200] Step 4:
[0201] The terminal uses monitoring devices to monitor the user's input operations. It collects and analyzes log data from keyboard and touch operations as input. The output provides an indicator of whether personal information has been entered. Specifically, it compares the entered string with a pre-configured personal information pattern and generates an alert if a match is found.
[0202] Step 5:
[0203] The server utilizes a generative AI model based on analyzed emotional states to provide suggestions and content recommendations to the user using adjustment mechanisms. It receives emotional data as input and provides prompts to the generative AI model. As output, it provides the user with optimal action plans and content suggestions. Specifically, it receives a prompt message such as "Determine whether this child's facial expression indicates stress and generate code to suggest appropriate countermeasures," adjusts the generated suggestions, and responds to the user.
[0204] The specific processing unit 290 transmits the result of the specific processing to the smart device 14. In the smart device 14, the control unit 46A causes the output device 40 to output the result of the specific processing. The microphone 38B acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 38B to the data processing device 12. In the data processing device 12, the specific processing unit 290 acquires the audio data.
[0205] Data generation model 58 is a so-called generative AI (Artificial Intelligence). An example of data generation model 58 is ChatGPT (registered trademark) (Internet search).<URL: https: / / openai.com / blog / chatgpt> ), Gemini (registered trademark) (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions indicated by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization.
[0206] In the above embodiment, an example was given in which specific processing is performed by the data processing device 12, but the technology of this disclosure is not limited thereto, and the specific processing may also be performed by the smart device 14.
[0207] [Second Embodiment]
[0208] Figure 3 shows an example of the configuration of the data processing system 210 according to the second embodiment.
[0209] As shown in Figure 3, the data processing system 210 includes a data processing device 12 and smart glasses 214. An example of the data processing device 12 is a server.
[0210] The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network).
[0211] The smart glasses 214 include a computer 36, a microphone 238, a speaker 240, a camera 42, and a communication interface 44. The computer 36 includes a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The microphone 238, speaker 240, and camera 42 are also connected to the bus 52.
[0212] The microphone 238 receives voice signals from the user 20 and receives instructions from the user 20. The microphone 238 captures the voice signals from the user 20, converts the captured voice into audio data, and outputs it to the processor 46. The speaker 240 outputs audio according to the instructions from the processor 46.
[0213] Camera 42 is a small digital camera equipped with an optical system including a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor, and captures images of the area around the user 20 (for example, an imaging range defined by a field of view equivalent to the width of a typical healthy person's field of vision).
[0214] Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various information between processor 46 and processor 28 via network 54. The exchange of various information between processor 46 and processor 28 using communication interfaces 44 and 26 is performed in a secure manner.
[0215] Figure 4 shows an example of the main functions of the data processing device 12 and the smart glasses 214. As shown in Figure 4, the data processing device 12 performs specific processing using the processor 28. The storage 32 stores the specific processing program 56.
[0216] The specific processing program 56 is an example of a "program" relating to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 in accordance with the specific processing program 56 executed on the RAM 30.
[0217] The storage 32 stores the data generation model 58 and the emotion identification model 59. The data generation model 58 and the emotion identification model 59 are used by the identification processing unit 290.
[0218] In the smart glasses 214, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48.
[0219] Next, the identification processing performed by the identification processing unit 290 of the data processing device 12 will be described. In the following description, the data processing device 12 will be referred to as the "server" and the smart glasses 214 will be referred to as the "terminal".
[0220] This invention provides a system that promotes healthy internet use among children by providing features such as eye protection, content filtering, personal information protection, and usage time management. The program's processing is described below in natural language.
[0221] First, the device uses sensors to detect the distance and posture of the user, i.e., the child's face. This activates a function that issues alerts for appropriate eye protection, prompting the user to take a break or say things like "You're too close to the screen" or "It's time for a break." These notifications are displayed using child-friendly characters, allowing for natural behavioral correction.
[0222] Next, the server receives the URL of the website the child is trying to access and checks it against a database to determine if it contains inappropriate content. This filtering technology is essential to protect children from harmful information. If the content is deemed inappropriate, the device immediately notifies the user and suggests safe sites and activities.
[0223] Furthermore, the device monitors children's attempts to enter personal information online. When a user enters an email address or other personal information, it immediately issues a warning and urges them to refrain from doing so, thereby reducing the risk of data breaches.
[0224] Furthermore, the device manages the child's internet usage time, notifies them when the set time is approaching, and automatically switches to rest mode when the actual time is up. This function helps maintain the child's health by avoiding prolonged continuous use and encouraging appropriate breaks.
[0225] As a concrete example, when a child uses the internet to supplement their studies, the server filters and displays only safe learning-related sites, and the device notifies the child to take a break before screen time becomes too long. This provides an environment where learning can be done efficiently while also protecting health and safety.
[0226] The following describes the processing flow.
[0227] Step 1:
[0228] The device activates its sensors to monitor the child's face position and orientation in real time. The detected data is used to calculate the distance and angle of the face.
[0229] Step 2:
[0230] Based on monitoring results, the device determines if the user is in a state that could negatively affect their eyesight, such as "holding the screen too close." If this is detected, an alert message "You need to take a break" is displayed on the screen.
[0231] Step 3:
[0232] A user attempts to access a website. At that time, the device captures the accessed URL and sends it to the server.
[0233] Step 4:
[0234] The server receives the URL, compares it against filtering criteria in the database, and analyzes whether the website in question is inappropriate.
[0235] Step 5:
[0236] If the server determines that the website in question is inappropriate, it returns that result to the device. The device then displays a message to the child stating, "This page is not accessible," and suggests safe alternative content.
[0237] Step 6:
[0238] The device monitors personal data (such as email addresses and credit card information) when a child starts entering information.
[0239] Step 7:
[0240] If personal information is detected in an input field, the device will immediately issue a warning and prompt the child not to enter the information.
[0241] Step 8:
[0242] The device counts the elapsed time since internet use began and manages the time to ensure it does not exceed the set limit. When the remaining time is running low, it notifies the child of the countdown.
[0243] Step 9:
[0244] When the set usage time is reached, the device locks the screen, prompting the child to take a break. At this time, the display screen shows relaxation methods and game suggestions accompanied by characters.
[0245] (Example 1)
[0246] Next, we will describe Example 1. In the following description, the data processing device 12 will be referred to as the "server," and the smart glasses 214 will be referred to as the "terminal."
[0247] In today's information society, ensuring children use the internet safely and healthily is a critical issue. In particular, concerns about vision problems, access to inappropriate content, personal data breaches, and health damage from prolonged use are serious. Traditional methods have struggled to comprehensively address these issues, necessitating a unified solution.
[0248] The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 1 is realized by the following means.
[0249] In this invention, the server includes detection means, determination means, information processing means, monitoring means, and time management means. This enables the protection of children's eyesight, control of inappropriate content access, protection of personal information, and management of usage time.
[0250] "Detection means" refers to devices or technologies used to sense the user's distance and posture.
[0251] The "decision-making mechanism" is a mechanism that generates a warning for eye protection based on the detection results.
[0252] "Information processing means" refers to technologies for classifying and selecting data and controlling access to inappropriate information.
[0253] A "monitoring measure" is a system designed to observe the user's input operations and prevent the unauthorized input of personal information.
[0254] A "time management system" is a management system used to limit a user's activity time and maintain their health.
[0255] "Means of expression" refers to a function that provides notifications or warnings to users through the display of characters.
[0256] This invention is a system that supports healthy internet use among children, and its specific embodiments are shown below.
[0257] The device features a built-in camera and infrared sensor to detect the child's distance and posture in real time. The data acquired by the sensors is used for vision protection, and if the distance is too close or the posture is inappropriate, a warning is displayed to the child using a character notification. In this process, the camera and infrared sensor are used as hardware, and a software algorithm for vision protection processes the data.
[0258] The server receives the URL of a website that a child is trying to access and filters it against a database to eliminate inappropriate content. The database contains keywords and site lists necessary for filtering, and machine learning techniques are used to identify inappropriate elements.
[0259] The device also monitors when children enter personal information into forms. For example, if it detects the input of an email address or phone number, it immediately issues a warning to prevent the leakage of personal information. This uses browser plugins and real-time data checking functions provided by the operating system.
[0260] Furthermore, the device manages the child's internet usage time, notifies them when the set time limit is reached, and switches to rest mode when the time is up. This allows users to take appropriate breaks. An installed timer application is used for time management.
[0261] As a concrete example, when a child uses the internet for online learning, the server displays only safe websites related to learning, and the device sends notifications at regular intervals recommending a break. This provides a safe and healthy learning environment.
[0262] The following are examples of prompts for a generative AI model.
[0263] "Please propose specific ways to ensure children can use the internet safely. This should include measures such as eye protection, content filtering, personal information protection, and managing usage time."
[0264] The flow of the specific processing in Example 1 will be explained using Figure 11.
[0265] Step 1:
[0266] The device activates its built-in camera and infrared sensor to detect the distance and posture of the child's face. Input includes image data of the child's face and distance measurement data. This data is analyzed by an algorithm to determine if the distance falls below a predetermined threshold (e.g., 30cm or more) or if the posture is inappropriate. If the distance is too close or the posture is inappropriate, the device generates a warning message using a character to protect the child's eyesight. This prompts the user to immediately move away from the screen.
[0267] Step 2:
[0268] The server receives the URL of the website that the child has requested to access. The input includes the URL data and metadata such as the access time. The server compares this URL with a database to check if it matches any pre-registered lists of inappropriate URLs or keywords. A text search algorithm is used for data processing. If it is determined to be inappropriate, the server sends an access denied message and a suggestion of a safe site to the device as output. This procedure guides the child to safe information without accessing inappropriate content.
[0269] Step 3:
[0270] The device monitors the input in real time when a child fills out a form in their web browser. The input is the text data entered by the child. The plugin used for monitoring performs regular expression matching to detect keywords that constitute personal information (e.g., email address, phone number). If such keywords are detected, a warning pop-up is displayed as output, and the submission function is disabled. This procedure prevents the leakage of personal information.
[0271] Step 4:
[0272] The device activates a built-in timer when a child starts using the internet and measures the usage time. The input recorded is the start time of use. Five minutes before the set usage limit is reached, a warning notification is generated, and the message "You have 5 minutes of usage time remaining" is displayed on the screen. When the set time is up, the internet connection is temporarily disabled, and a message prompting a break is displayed. This allows children to use the internet while taking appropriate breaks.
[0273] (Application Example 1)
[0274] Next, we will explain Application Example 1. In the following explanation, the data processing device 12 will be referred to as the "server," and the smart glasses 214 will be referred to as the "terminal."
[0275] In recent years, creating an environment where children can use the internet safely and healthily has become crucial. However, there are concerns about the risks children face when spending long hours online, such as eye strain, access to inappropriate content, and the leakage of personal information. There is a need to develop systems that mitigate these risks and appropriately manage usage time, thereby promoting the healthy physical and mental development of children.
[0276] The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 1 is realized by the following means.
[0277] In this invention, the server includes detection means for detecting the location and posture of a child user; control means for generating warnings to encourage the protection of the user's vision based on the detection means; processing means for classifying and filtering digital information and restricting access to inappropriate digital information; and filtering means for analyzing web location markers accessed by the user and allowing only safe digital information. This enables children to use the internet safely while protecting their eyesight and managing their usage time.
[0278] A "detection device" is a device that has the function of sensing the child user's position and posture in real time and converting it into numerical data.
[0279] A "control means" is a device that generates warnings to reduce the visual strain on the user based on data obtained from a detection means.
[0280] A "processing means" is a component that handles data management functions to classify digital information, select inappropriate content, and restrict access to it.
[0281] A "monitoring device" is a device that has the function of monitoring user input operations and implementing security measures to prevent the leakage of personal information.
[0282] "Filtering means" is a system that analyzes the web location identifiers accessed by the user and has the function of allowing only safe digital information.
[0283] The system for realizing this invention is composed of a server and a terminal.
[0284] The server mainly classifies and selects digital information and plays a role in restricting access to inappropriate content. Specifically, the server receives the web location identifier that the user attempts to access and checks with the database whether it is safe information. Thereby, inappropriate digital information is filtered, and a safe Internet environment is provided for the user.
[0285] On the other hand, the terminal functions as an interface when the user uses the Internet. Using the built-in camera and distance sensor, the position and posture of the user are detected in real time. When the user is too close to the display or has a bad posture, a warning is generated through the display device by using an image processing library such as OpenCV.
[0286] For the management of usage time, the terminal manages the time information through the real-time database and notifies the end of usage based on the set usage time. Thereby, the user avoids excessive use, and healthy use is promoted. A virtual character is used for the notification, providing interaction to the user in an easy-to-approach form.
[0287] As a specific example, when the user accesses learning content on the Internet, the server provides only safe educational digital information. Also, the terminal supports maintaining the user's concentration and health by prompting a break before the usage time becomes long.
[0288] By utilizing generative AI models, it is possible to optimize the content of the messages and warnings for individual users. An example of a prompt to input into the generative AI model is, "Please tell me how to guide my child to use the internet safely." Such a system can provide comprehensive support for children's digital lives.
[0289] The flow of a specific process in Application Example 1 will be explained using Figure 12.
[0290] Step 1:
[0291] The server receives a web location identifier (URL) sent by the user. The received URL is compared against a database on the server to determine if it is secure digital information. The database contains a list of URLs that have been previously verified as secure. If the URL is determined to be inappropriate, the server sends a corresponding notification to the device.
[0292] Step 2:
[0293] The device uses its built-in camera and distance sensor to detect the user's position and posture. Using the OpenCV library, it analyzes video data from the camera to determine if the user is too close to the display or has poor posture. This information is then used to evaluate the appropriateness of the distance and posture through programmatic thresholding. Depending on the result, the device displays visual or audible warnings to the user.
[0294] Step 3:
[0295] The device measures the user's internet usage time and manages the progress of that time in a real-time database. Timekeeping information is obtained by a timer on the device, and notifications are sent based on the set usage limit. When the end time is approaching, the device generates a voice message through a virtual character prompting the user to take a break.
[0296] Step 4:
[0297] When a user enters digital information, the terminal monitors keystrokes and detects specific patterns from a personal information protection perspective. If a specific email address or phone number input pattern is recognized, the system immediately issues a warning to the user prompting them to stop entering information. This warning is provided in either a visual or audio format.
[0298] Step 5:
[0299] By utilizing a generative AI model, the device provides user-friendly interactions. An example prompt is "How can I help my child use the internet safely?", and the generative AI model generates optimal advice and wording, delivering it to the user through communication. This enables flexible, personalized guidance.
[0300] Furthermore, an emotion engine that estimates the user's emotions may be incorporated. That is, the identification processing unit 290 may use the emotion identification model 59 to estimate the user's emotions and perform identification processing using the user's emotions.
[0301] This invention provides a system for children's internet use that combines eye protection, filtering of inappropriate content, protection of personal information, usage time management, and recognition of the user's emotions. This system is designed to ensure a safe and healthy digital experience for children. The program's processing is described below in natural language.
[0302] First, the device uses sensors to capture the child's face and detect their distance and posture. Based on this data, it determines if eye protection is needed and displays an appropriate alert. The content of the alert can be fine-tuned based on the user's emotions, and this is supported by an emotion engine.
[0303] The emotion engine analyzes according to the sensor data and usage status of the terminal, and predicts emotions using children's facial expressions, voices, and interaction data. This emotion data is not only used for alerts but also for providing safe content.
[0304] Next, the terminal captures the URL of the web page accessed by the child, and the data processing means filters inappropriate content. At this point, the emotion engine adjusts the filtering criteria and proposed safe alternative content based on the child's psychological state.
[0305] Furthermore, the terminal monitors whether personal information is included when the child inputs on the Internet. When the emotion engine detects anxiety or stress, it provides more detailed warnings and explanations. Especially when the child is emotional, it has an interface with friendly characters to explain the situation and reassure the child.
[0306] In terms of usage time management, the terminal presents an alert when the set time has passed and guides the child to take a break naturally. In this process, the emotion engine also supports time management by detecting fatigue and boredom and issuing alerts at appropriate times.
[0307] As a specific example, when a child is engaged in online learning, when the terminal detects the stress during learning with the emotion engine, it proposes an interaction such as "Let's refresh with fun quizzes and games" to make the break enjoyable. In this way, by taking an approach that takes into account the emotional state, it is possible to provide an environment where children can use the Internet safely and enjoyably.
[0308] The following explains the processing flow.
[0309] Step 1:
[0310] The terminal monitors the distance and posture of the child's face using sensor means and processes this data in real time.
[0311] Step 2:
[0312] The device inputs the acquired data into an emotion engine, which analyzes the child's emotions based on their facial expressions and voice. Based on these results, it decides whether or not to issue an alert for vision protection.
[0313] Step 3:
[0314] The device uses the results of an emotion engine analysis to display alerts such as "Let's move a little further away from the screen!" in a more friendly tone.
[0315] Step 4:
[0316] When a user attempts to access a webpage, the device sends the URL to the server.
[0317] Step 5:
[0318] The server receives the URL, compares it with the database to determine if it contains inappropriate content, and sends the result back to the terminal.
[0319] Step 6:
[0320] The device's emotional engine assesses the child's psychological state and, if necessary, blocks access to inappropriate content, informing them in a gentle tone, "This site is not safe."
[0321] Step 7:
[0322] The device monitors whether any input actions performed by the child include personal information.
[0323] Step 8:
[0324] If personal information is detected, the device will display a warning, but simultaneously, taking into account the results of the emotion engine's analysis, it will offer reassuring words to reduce the child's stress.
[0325] Step 9:
[0326] The device tracks usage time, and an emotional engine displays a countdown that gently reminds the child, "It's almost time for a break," depending on their fatigue or boredom.
[0327] Step 10:
[0328] When usage time ends, the device switches to vacation mode, which includes relaxation suggestions, and automatically locks the screen. At this time, a character suggests an activity that children can enjoy, saying, "Here's what's next!"
[0329] (Example 2)
[0330] Next, we will describe Example 2. In the following description, the data processing device 12 will be referred to as the "server" and the smart glasses 214 will be referred to as the "terminal".
[0331] When children use the internet, there is a need to protect their eyesight, restrict access to inappropriate content, protect their personal information, and provide a user environment that takes their emotions into consideration. In conventional methods, these elements are often managed individually, making it difficult to build an integrated digital protection environment.
[0332] The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 2 is realized by the following means.
[0333] In this invention, the server includes sensing means for detecting the face, facial expressions, and voice of a child user; analysis means for analyzing the user's emotions based on the sensing means and generating alerts corresponding to the emotional state; and processing means for classifying and filtering content, restricting access to inappropriate information, and suggesting alternative information. This enables a total digital protection environment for child users.
[0334] "Sensing means" refers to sensors or devices used to detect the child user's face, facial expressions, and voice in real time.
[0335] "Analysis means" refers to the process of analyzing the user's emotional state based on data obtained from sensing means and generating appropriate alerts.
[0336] "Processing means" refers to a series of information processing steps that not only classify content and filter out inappropriate information, but also suggest appropriate alternative information.
[0337] "Monitoring measures" refer to a system that constantly monitors the user's input actions and provides necessary warnings from the perspective of protecting personal information.
[0338] A "timer device" refers to equipment that notifies the user of usage time and break times, and generates alerts at appropriate times according to their emotional state.
[0339] "Interface means" refers to a user interface that provides notifications and warnings to the user through character displays tailored to their emotional state.
[0340] This invention is a system for realizing a safe and secure digital experience for children using the internet. Specifically, the system includes sensors, an analysis engine, data processing, and interface means.
[0341] The device is equipped with sensor devices such as a camera and microphone. These are used as "sensing means" to capture the child's face, expressions, and voice in real time. The device sends this data to a server as basic data for emotion analysis.
[0342] On the server, the collected data is processed using "analysis tools" to analyze the child's emotional state. Emotion prediction algorithms and machine learning models are used for this analysis. Based on these analysis results, the content of the alerts presented to the user on the terminal is adjusted.
[0343] To prevent access to inappropriate content, the device captures the browser's URL information and filters it using a "processing mechanism." This filtering system proactively blocks inappropriate information and suggests safe alternative content.
[0344] Furthermore, the device constantly monitors the child's input and uses "monitoring tools" to issue a warning if personal information is included. If stress or anxiety is detected, a friendly character will be displayed to calmly explain the situation.
[0345] To manage usage time, the device uses a "timer mechanism" to notify the user of usage time and break time. If fatigue is detected, it will make a suggestion such as "Let's take a little more break" depending on the user's emotional state.
[0346] For example, if a user is experiencing stress during online learning, the device might suggest interactions such as "Let's refresh ourselves with fun quizzes and games," offering suggestions to complement learning with other activities. This ensures a safe digital environment while maintaining learning effectiveness.
[0347] An example of a prompt would be, "Explain how to optimize alerts based on a child's emotions during online learning." This provides a basis for properly utilizing generative AI models.
[0348] The flow of the specific processing in Example 2 will be explained using Figure 13.
[0349] Step 1:
[0350] The device uses its built-in camera and microphone to collect sensor data such as the child's face, expressions, and voice. It acquires this data as input and transmits it to a server in real time. Specifically, the device activates its camera and periodically records facial movements and voice.
[0351] Step 2:
[0352] The server receives sensor data transmitted from the terminal as input to the emotion analysis engine. The data processing performed here involves analyzing the emotional state using machine learning algorithms. As output, it generates scores and labels indicating the child's emotional state. Specifically, it determines states such as "happy" or "stressed" based on factors such as the presence or absence of a smile and the tone of voice.
[0353] Step 3:
[0354] The server determines appropriate alert content for the user based on the analysis results and outputs it to the terminal. It receives the results of sentiment analysis as input and creates an alert message corresponding to the emotional state as output. A specific example of its operation is generating a message such as, "Please sit down and move away from the screen."
[0355] Step 4:
[0356] The device captures the URL of the webpage the user wants to access and passes it as input to the data processing system. In this step, calculations are performed to filter whether the URL is inappropriate. The output suggests safe alternative content to the user. Specifically, if there is a page that cannot be accessed, it will display a message such as "This site cannot be viewed, please try this instead."
[0357] Step 5:
[0358] The device monitors children's online input behavior and displays a warning if input containing personal information is detected. It acquires user typing data as input and uses an emotion engine to detect anxiety, thereby outputting a more considerate interface. Specifically, a friendly character appears on the screen saying, "Let's not enter addresses."
[0359] Step 6:
[0360] The device monitors the set usage time and passes the elapsed time as input data to the timer. The output is a break alert to the user. Specifically, it displays "It's almost time for a break" and has a function to temporarily lock the screen.
[0361] (Application Example 2)
[0362] Next, we will explain application example 2. In the following explanation, the data processing device 12 will be referred to as the "server," and the smart glasses 214 will be referred to as the "terminal."
[0363] Children are required to have a safe and comfortable user experience in the digital information environment, while simultaneously receiving appropriate visual protection, filtering of digital content, protection of personal information, and management of usage time, all while considering their emotional state. However, conventional systems have struggled to comprehensively achieve these goals. In addition, there is a lack of means to effectively manage and improve children's digital experiences while reducing the burden on parents in the home environment.
[0364] The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 2 is realized by the following means.
[0365] In this invention, the server includes sensor means for detecting the user's distance and posture, analysis means for recognizing emotions and adjusting the digital experience, and adjustment means for communicating with an intelligent machine that monitors educational activities and provides support based on the user's emotions. This enables the protection of children's vision and support for digital education that is in harmony with their emotions.
[0366] "User" refers to an individual who uses a system in a digital information environment, and in particular, to a child who uses a device under the supervision of a parent or guardian.
[0367] "Sensor means" refers to devices and technologies for detecting the user's distance, posture, and facial expressions, and includes cameras, microphones, and other sensing devices.
[0368] "Control means" refers to technology that generates warnings and alerts to encourage visual protection in the user based on data obtained from sensor means.
[0369] "Data processing means" refers to a processing device and program for classifying digital information and filtering out inappropriate content.
[0370] "Monitoring measures" refer to technologies and devices that check the user's input activities and prevent the leakage of personal information.
[0371] "Analysis means" refers to technology that recognizes the user's emotional state based on information such as facial expressions and voice, and adjusts the digital experience accordingly.
[0372] "Adjustment means" refers to technology that improves and provides the user experience in cooperation with intelligent machines, etc., in response to the user's emotions recognized by the analysis means.
[0373] An "intelligent machine" refers to a robot or device equipped with information processing and communication capabilities to enhance children's learning and entertainment.
[0374] "Educational activities" refer to all forms of learning and training aimed at improving children's knowledge and skills.
[0375] This invention provides a system in which a server interacts with an intelligent machine installed in the home, namely a consumer robot, to support a safe and comfortable digital experience for children. The terminal uses sensor means to sense the user's face and body movements and detect distance and posture. Based on this, a control means functions to determine whether visual protection is needed and to display an alert.
[0376] The server collects and analyzes sensor data through cloud-based services. Here, it uses the Google Cloud Vision API and Microsoft Azure Emotion API to analyze the user's facial expressions and voice, and obtain emotional data. Furthermore, the analyzed emotional data is used as a means of tailoring content and activities to the user. For example, generative AI models using TensorFlow or PyTorch analyze and predict emotions in real time, and fine-tune filtering criteria.
[0377] If a user attempts to access inappropriate digital information, data processing mechanisms on the server immediately evaluate the information and verify its suitability using the Google Safe Browsing API. During this process, the device monitors input operations using monitoring mechanisms to prevent the leakage of personal information and issues warnings as needed.
[0378] As a concrete example, when a child is using a digital device to participate in an educational program, a robot can be present to manage the appropriateness of the digital information and usage time while suggesting fun activities. For example, by entering a prompt such as, "Determine from this child's facial expression whether they are stressed and generate code to suggest appropriate countermeasures," the system can provide solutions tailored to individual needs.
[0379] In this way, the invention solves the challenges of protecting children's eyesight and digital safety, and realizes a system that reduces the burden on parents at home.
[0380] The flow of a specific process in Application Example 2 will be explained using Figure 14.
[0381] Step 1:
[0382] The device uses sensors to detect the user's facial and body movements in real time. It receives raw data from cameras and microphones as input and processes it as sensor data. Output includes distance, posture, and facial expression information. Specifically, it performs face detection from camera images and uses a distance sensor to measure the distance to the user.
[0383] Step 2:
[0384] The server receives sensor data via cloud services and performs emotion analysis using the Google Cloud Vision API and Emotion API. It receives facial images and voice data as input and sends this data to the API for analysis. The output is an estimate of the user's emotional state. Specifically, it analyzes facial expressions from images, predicts emotions based on tone of voice, and compiles this information.
[0385] Step 3:
[0386] The server uses data processing tools to analyze the URL of the digital content the user is trying to access. It receives URLs captured from a web browser as input and uses the Google Safe Browsing API for classification and filtering. The output evaluates whether the content is safe. Specifically, the API scans the URL and returns whether it poses a risk.
[0387] Step 4:
[0388] The terminal uses monitoring devices to monitor the user's input operations. It collects and analyzes log data from keyboard and touch operations as input. The output provides an indicator of whether personal information has been entered. Specifically, it compares the entered string with a pre-configured personal information pattern and generates an alert if a match is found.
[0389] Step 5:
[0390] The server utilizes a generative AI model based on analyzed emotional states to provide suggestions and content recommendations to the user using adjustment mechanisms. It receives emotional data as input and provides prompts to the generative AI model. As output, it provides the user with optimal action plans and content suggestions. Specifically, it receives a prompt message such as "Determine whether this child's facial expression indicates stress and generate code to suggest appropriate countermeasures," adjusts the generated suggestions, and responds to the user.
[0391] The specific processing unit 290 transmits the result of the specific processing to the smart glasses 214. In the smart glasses 214, the control unit 46A causes the speaker 240 to output the result of the specific processing. The microphone 238 acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 238 to the data processing unit 12. In the data processing unit 12, the specific processing unit 290 acquires the audio data.
[0392] Data generation model 58 is a type of so-called generative AI (Artificial Intelligence). One example of data generation model 58 is ChatGPT (Internet search<URL: https: / / openai.com / blog / chatgpt> ), Gemini (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions indicated by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization.
[0393] In the above embodiment, an example was given in which specific processing is performed by the data processing device 12, but the technology of this disclosure is not limited thereto, and the specific processing may also be performed by the smart glasses 214.
[0394] [Third Embodiment]
[0395] Figure 5 shows an example of the configuration of the data processing system 310 according to the third embodiment.
[0396] As shown in Figure 5, the data processing system 310 includes a data processing device 12 and a headset terminal 314. An example of the data processing device 12 is a server.
[0397] The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network).
[0398] The headset terminal 314 includes a computer 36, a microphone 238, a speaker 240, a camera 42, a communication interface 44, and a display 343. The computer 36 includes a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The microphone 238, speaker 240, camera 42, and display 343 are also connected to the bus 52.
[0399] The microphone 238 receives voice signals from the user 20 and receives instructions from the user 20. The microphone 238 captures the voice signals from the user 20, converts the captured voice into audio data, and outputs it to the processor 46. The speaker 240 outputs audio according to the instructions from the processor 46.
[0400] Camera 42 is a small digital camera equipped with an optical system including a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor, and captures images of the area around the user 20 (for example, an imaging range defined by a field of view equivalent to the width of a typical healthy person's field of vision).
[0401] Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various information between processor 46 and processor 28 via network 54. The exchange of various information between processor 46 and processor 28 using communication interfaces 44 and 26 is performed in a secure manner.
[0402] Figure 6 shows an example of the main functions of the data processing device 12 and the headset terminal 314. As shown in Figure 6, the data processing device 12 performs specific processing using the processor 28. The storage 32 stores the specific processing program 56.
[0403] The specific processing program 56 is an example of a "program" relating to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 in accordance with the specific processing program 56 executed on the RAM 30.
[0404] The storage 32 stores the data generation model 58 and the emotion identification model 59. The data generation model 58 and the emotion identification model 59 are used by the identification processing unit 290.
[0405] In the headset terminal 314, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48.
[0406] Next, the specific processing performed by the specific processing unit 290 of the data processing device 12 will be described. In the following description, the data processing device 12 will be referred to as the "server" and the headset terminal 314 will be referred to as the "terminal".
[0407] This invention provides a system that promotes healthy internet use among children by providing features such as eye protection, content filtering, personal information protection, and usage time management. The program's processing is described below in natural language.
[0408] First, the device uses sensors to detect the distance and posture of the user, i.e., the child's face. This activates a function that issues alerts for appropriate eye protection, prompting the user to take a break or say things like "You're too close to the screen" or "It's time for a break." These notifications are displayed using child-friendly characters, allowing for natural behavioral correction.
[0409] Next, the server receives the URL of the website the child is trying to access and checks it against a database to determine if it contains inappropriate content. This filtering technology is essential to protect children from harmful information. If the content is deemed inappropriate, the device immediately notifies the user and suggests safe sites and activities.
[0410] Furthermore, the device monitors children's attempts to enter personal information online. When a user enters an email address or other personal information, it immediately issues a warning and urges them to refrain from doing so, thereby reducing the risk of data breaches.
[0411] Furthermore, the device manages the child's internet usage time, notifies them when the set time is approaching, and automatically switches to rest mode when the actual time is up. This function helps maintain the child's health by avoiding prolonged continuous use and encouraging appropriate breaks.
[0412] As a concrete example, when a child uses the internet to supplement their studies, the server filters and displays only safe learning-related sites, and the device notifies the child to take a break before screen time becomes too long. This provides an environment where learning can be done efficiently while also protecting health and safety.
[0413] The following describes the processing flow.
[0414] Step 1:
[0415] The device activates its sensors to monitor the child's face position and orientation in real time. The detected data is used to calculate the distance and angle of the face.
[0416] Step 2:
[0417] Based on monitoring results, the device determines if the user is in a state that could negatively affect their eyesight, such as "holding the screen too close." If this is detected, an alert message "You need to take a break" is displayed on the screen.
[0418] Step 3:
[0419] A user attempts to access a website. At that time, the device captures the accessed URL and sends it to the server.
[0420] Step 4:
[0421] The server receives the URL, compares it against filtering criteria in the database, and analyzes whether the website in question is inappropriate.
[0422] Step 5:
[0423] If the server determines that the website in question is inappropriate, it returns that result to the device. The device then displays a message to the child stating, "This page is not accessible," and suggests safe alternative content.
[0424] Step 6:
[0425] The device monitors personal data (such as email addresses and credit card information) when a child starts entering information.
[0426] Step 7:
[0427] If personal information is detected in an input field, the device will immediately issue a warning and prompt the child not to enter the information.
[0428] Step 8:
[0429] The device counts the elapsed time since internet use began and manages the time to ensure it does not exceed the set limit. When the remaining time is running low, it notifies the child of the countdown.
[0430] Step 9:
[0431] When the set usage time is reached, the device locks the screen, prompting the child to take a break. At this time, the display screen shows relaxation methods and game suggestions accompanied by characters.
[0432] (Example 1)
[0433] Next, we will describe Example 1. In the following description, the data processing device 12 will be referred to as the "server," and the headset-type terminal 314 will be referred to as the "terminal."
[0434] In today's information society, ensuring children use the internet safely and healthily is a critical issue. In particular, concerns about vision problems, access to inappropriate content, personal data breaches, and health damage from prolonged use are serious. Traditional methods have struggled to comprehensively address these issues, necessitating a unified solution.
[0435] The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 1 is realized by the following means.
[0436] In this invention, the server includes detection means, determination means, information processing means, monitoring means, and time management means. This enables the protection of children's eyesight, control of inappropriate content access, protection of personal information, and management of usage time.
[0437] "Detection means" refers to devices or technologies used to sense the user's distance and posture.
[0438] The "decision-making mechanism" is a mechanism that generates a warning for eye protection based on the detection results.
[0439] "Information processing means" refers to technologies for classifying and selecting data and controlling access to inappropriate information.
[0440] A "monitoring measure" is a system designed to observe the user's input operations and prevent the unauthorized input of personal information.
[0441] A "time management system" is a management system used to limit a user's activity time and maintain their health.
[0442] "Means of expression" refers to a function that provides notifications or warnings to users through the display of characters.
[0443] This invention is a system that supports healthy internet use among children, and its specific embodiments are shown below.
[0444] The device features a built-in camera and infrared sensor to detect the child's distance and posture in real time. The data acquired by the sensors is used for vision protection, and if the distance is too close or the posture is inappropriate, a warning is displayed to the child using a character notification. In this process, the camera and infrared sensor are used as hardware, and a software algorithm for vision protection processes the data.
[0445] The server receives the URL of a website that a child is trying to access and filters it against a database to eliminate inappropriate content. The database contains keywords and site lists necessary for filtering, and machine learning techniques are used to identify inappropriate elements.
[0446] The device also monitors when children enter personal information into forms. For example, if it detects the input of an email address or phone number, it immediately issues a warning to prevent the leakage of personal information. This uses browser plugins and real-time data checking functions provided by the operating system.
[0447] Furthermore, the device manages the child's internet usage time, notifies them when the set time limit is reached, and switches to rest mode when the time is up. This allows users to take appropriate breaks. An installed timer application is used for time management.
[0448] As a concrete example, when a child uses the internet for online learning, the server displays only safe websites related to learning, and the device sends notifications at regular intervals recommending a break. This provides a safe and healthy learning environment.
[0449] The following are examples of prompts for a generative AI model.
[0450] "Please propose specific ways to ensure children can use the internet safely. This should include measures such as eye protection, content filtering, personal information protection, and managing usage time."
[0451] The flow of the specific processing in Example 1 will be explained using Figure 11.
[0452] Step 1:
[0453] The device activates its built-in camera and infrared sensor to detect the distance and posture of the child's face. Input includes image data of the child's face and distance measurement data. This data is analyzed by an algorithm to determine if the distance falls below a predetermined threshold (e.g., 30cm or more) or if the posture is inappropriate. If the distance is too close or the posture is inappropriate, the device generates a warning message using a character to protect the child's eyesight. This prompts the user to immediately move away from the screen.
[0454] Step 2:
[0455] The server receives the URL of the website that the child has requested to access. The input includes the URL data and metadata such as the access time. The server compares this URL with a database to check if it matches any pre-registered lists of inappropriate URLs or keywords. A text search algorithm is used for data processing. If it is determined to be inappropriate, the server sends an access denied message and a suggestion of a safe site to the device as output. This procedure guides the child to safe information without accessing inappropriate content.
[0456] Step 3:
[0457] The device monitors the input in real time when a child fills out a form in their web browser. The input is the text data entered by the child. The plugin used for monitoring performs regular expression matching to detect keywords that constitute personal information (e.g., email address, phone number). If such keywords are detected, a warning pop-up is displayed as output, and the submission function is disabled. This procedure prevents the leakage of personal information.
[0458] Step 4:
[0459] The device activates a built-in timer when a child starts using the internet and measures the usage time. The input recorded is the start time of use. Five minutes before the set usage limit is reached, a warning notification is generated, and the message "You have 5 minutes of usage time remaining" is displayed on the screen. When the set time is up, the internet connection is temporarily disabled, and a message prompting a break is displayed. This allows children to use the internet while taking appropriate breaks.
[0460] (Application Example 1)
[0461] Next, we will explain Application Example 1. In the following explanation, the data processing device 12 will be referred to as the "server," and the headset-type terminal 314 will be referred to as the "terminal."
[0462] In recent years, creating an environment where children can use the internet safely and healthily has become crucial. However, there are concerns about the risks children face when spending long hours online, such as eye strain, access to inappropriate content, and the leakage of personal information. There is a need to develop systems that mitigate these risks and appropriately manage usage time, thereby promoting the healthy physical and mental development of children.
[0463] The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 1 is realized by the following means.
[0464] In this invention, the server includes detection means for detecting the location and posture of a child user; control means for generating warnings to encourage the protection of the user's vision based on the detection means; processing means for classifying and filtering digital information and restricting access to inappropriate digital information; and filtering means for analyzing web location markers accessed by the user and allowing only safe digital information. This enables children to use the internet safely while protecting their eyesight and managing their usage time.
[0465] A "detection device" is a device that has the function of sensing the child user's position and posture in real time and converting it into numerical data.
[0466] A "control means" is a device that generates warnings to reduce the visual strain on the user based on data obtained from a detection means.
[0467] A "processing means" is a component that handles data management functions to classify digital information, select inappropriate content, and restrict access to it.
[0468] A "monitoring device" is a device that has the function of monitoring user input operations and implementing security measures to prevent the leakage of personal information.
[0469] A "filtering method" is a system that analyzes the web location information accessed by the user and has the function of allowing only safe digital information.
[0470] The system that realizes this invention is configured using a server and terminals.
[0471] The server's primary role is to classify and filter digital information, restricting access to inappropriate content. Specifically, the server receives web location markers that users attempt to access and compares them against a database to determine if the information is safe. This filters out inappropriate digital information, providing users with a safe internet environment.
[0472] On the other hand, the terminal functions as an interface for the user to access the internet. It uses a built-in camera and distance sensor to detect the user's position and posture in real time. Using image processing libraries such as OpenCV, it generates warnings through the display device if the user is too close to the display or has poor posture.
[0473] To manage usage time, the device manages time information through a real-time database and notifies the user when usage ends based on the set usage time. This helps users avoid excessive use and promotes healthy usage. A virtual character is used for notifications, providing user interaction in a friendly manner.
[0474] For example, when a user accesses learning content over the internet, the server provides only secure, educational digital information. Furthermore, the device helps maintain user concentration and well-being by prompting breaks before prolonged use.
[0475] By utilizing generative AI models, it is possible to optimize the content of the messages and warnings for individual users. An example of a prompt to input into the generative AI model is, "Please tell me how to guide my child to use the internet safely." Such a system can provide comprehensive support for children's digital lives.
[0476] The flow of a specific process in Application Example 1 will be explained using Figure 12.
[0477] Step 1:
[0478] The server receives a web location identifier (URL) sent by the user. The received URL is compared against a database on the server to determine if it is secure digital information. The database contains a list of URLs that have been previously verified as secure. If the URL is determined to be inappropriate, the server sends a corresponding notification to the device.
[0479] Step 2:
[0480] The device uses its built-in camera and distance sensor to detect the user's position and posture. Using the OpenCV library, it analyzes video data from the camera to determine if the user is too close to the display or has poor posture. This information is then used to evaluate the appropriateness of the distance and posture through programmatic thresholding. Depending on the result, the device displays visual or audible warnings to the user.
[0481] Step 3:
[0482] The device measures the user's internet usage time and manages the progress of that time in a real-time database. Timekeeping information is obtained by a timer on the device, and notifications are sent based on the set usage limit. When the end time is approaching, the device generates a voice message through a virtual character prompting the user to take a break.
[0483] Step 4:
[0484] When a user enters digital information, the terminal monitors keystrokes and detects specific patterns from a personal information protection perspective. If a specific email address or phone number input pattern is recognized, the system immediately issues a warning to the user prompting them to stop entering information. This warning is provided in either a visual or audio format.
[0485] Step 5:
[0486] By utilizing a generative AI model, the device provides user-friendly interactions. An example prompt is "How can I help my child use the internet safely?", and the generative AI model generates optimal advice and wording, delivering it to the user through communication. This enables flexible, personalized guidance.
[0487] Furthermore, an emotion engine that estimates the user's emotions may be incorporated. That is, the identification processing unit 290 may use the emotion identification model 59 to estimate the user's emotions and perform identification processing using the user's emotions.
[0488] This invention provides a system for children's internet use that combines eye protection, filtering of inappropriate content, protection of personal information, usage time management, and recognition of the user's emotions. This system is designed to ensure a safe and healthy digital experience for children. The program's processing is described below in natural language.
[0489] First, the device uses sensors to capture the child's face and detect their distance and posture. Based on this data, it determines if eye protection is needed and displays an appropriate alert. The content of the alert can be fine-tuned based on the user's emotions, and this is supported by an emotion engine.
[0490] The emotion engine analyzes sensor data and usage patterns from the device to predict emotions using the child's facial expressions, voice, and interaction data. This emotion data is used not only for alerts but also for providing safe content.
[0491] Next, the device captures the URLs of the web pages the child accesses, and inappropriate content is filtered using data processing tools. At this point, the emotion engine adjusts the filtering criteria and suggested safe alternative content based on the child's psychological state.
[0492] Furthermore, the device monitors whether personal information is included when a child enters data online, and if the emotion engine detects anxiety or stress, it provides more careful warnings and explanations. Especially when a child is emotional, a friendly character explains the situation and the interface is designed to reassure the child.
[0493] In usage time management, the device displays an alert when a set time has elapsed, guiding users to take breaks naturally. In this process as well, the emotion engine detects fatigue and boredom, supporting time management by issuing alerts at appropriate times.
[0494] For example, when a child is learning online, the device's emotion engine detects stress during learning and suggests interactions such as, "Let's refresh ourselves with fun quizzes and games," making breaks enjoyable. In this way, an approach that takes emotional states into account can provide children with a safe and enjoyable environment for using the internet.
[0495] The following describes the processing flow.
[0496] Step 1:
[0497] The device uses sensors to monitor the distance and posture of the child's face and processes this data in real time.
[0498] Step 2:
[0499] The device inputs the acquired data into an emotion engine, which analyzes the child's emotions based on their facial expressions and voice. Based on these results, it decides whether or not to issue an alert for vision protection.
[0500] Step 3:
[0501] The device uses the results of an emotion engine analysis to display alerts such as "Let's move a little further away from the screen!" in a more friendly tone.
[0502] Step 4:
[0503] When a user attempts to access a webpage, the device sends the URL to the server.
[0504] Step 5:
[0505] The server receives the URL, compares it with the database to determine if it contains inappropriate content, and sends the result back to the terminal.
[0506] Step 6:
[0507] The device's emotional engine assesses the child's psychological state and, if necessary, blocks access to inappropriate content, informing them in a gentle tone, "This site is not safe."
[0508] Step 7:
[0509] The device monitors whether any input actions performed by the child include personal information.
[0510] Step 8:
[0511] If personal information is detected, the device will display a warning, but simultaneously, taking into account the results of the emotion engine's analysis, it will offer reassuring words to reduce the child's stress.
[0512] Step 9:
[0513] The device tracks usage time, and an emotional engine displays a countdown that gently reminds the child, "It's almost time for a break," depending on their fatigue or boredom.
[0514] Step 10:
[0515] When usage time ends, the device switches to vacation mode, which includes relaxation suggestions, and automatically locks the screen. At this time, a character suggests an activity that children can enjoy, saying, "Here's what's next!"
[0516] (Example 2)
[0517] Next, we will describe Example 2. In the following description, the data processing device 12 will be referred to as the "server," and the headset-type terminal 314 will be referred to as the "terminal."
[0518] When children use the internet, there is a need to protect their eyesight, restrict access to inappropriate content, protect their personal information, and provide a user environment that takes their emotions into consideration. In conventional methods, these elements are often managed individually, making it difficult to build an integrated digital protection environment.
[0519] The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 2 is realized by the following means.
[0520] In this invention, the server includes sensing means for detecting the face, facial expressions, and voice of a child user; analysis means for analyzing the user's emotions based on the sensing means and generating alerts corresponding to the emotional state; and processing means for classifying and filtering content, restricting access to inappropriate information, and suggesting alternative information. This enables a total digital protection environment for child users.
[0521] "Sensing means" refers to sensors and devices used to detect the child user's face, facial expressions, and voice in real time.
[0522] "Analysis means" refers to the process of analyzing the user's emotional state based on data obtained from sensing means and generating appropriate alerts.
[0523] "Processing means" refers to a series of information processing steps that not only classify content and filter out inappropriate information, but also suggest appropriate alternative information.
[0524] "Monitoring measures" refer to a system that constantly monitors the user's input actions and provides necessary warnings from the perspective of protecting personal information.
[0525] A "timer device" refers to equipment that notifies the user of usage time and break times, and generates alerts at appropriate times according to their emotional state.
[0526] "Interface means" refers to a user interface that provides notifications and warnings to the user through character displays tailored to their emotional state.
[0527] This invention is a system for realizing a safe and secure digital experience for children using the internet. Specifically, the system includes sensors, an analysis engine, data processing, and interface means.
[0528] The device is equipped with sensor devices such as a camera and microphone. These are used as "sensing means" to capture the child's face, expressions, and voice in real time. The device sends this data to a server as basic data for emotion analysis.
[0529] On the server, the collected data is processed using "analysis tools" to analyze the child's emotional state. Emotion prediction algorithms and machine learning models are used for this analysis. Based on these analysis results, the content of the alerts presented to the user on the terminal is adjusted.
[0530] To prevent access to inappropriate content, the device captures the browser's URL information and filters it using a "processing mechanism." This filtering system proactively blocks inappropriate information and suggests safe alternative content.
[0531] Furthermore, the device constantly monitors the child's input and uses "monitoring tools" to issue a warning if personal information is included. If stress or anxiety is detected, a friendly character will be displayed to calmly explain the situation.
[0532] To manage usage time, the device uses a "timer mechanism" to notify the user of usage time and break time. If fatigue is detected, it will make a suggestion such as "Let's take a little more break" depending on the user's emotional state.
[0533] For example, if a user is experiencing stress during online learning, the device might suggest interactions such as "Let's refresh ourselves with fun quizzes and games," offering suggestions to complement learning with other activities. This ensures a safe digital environment while maintaining learning effectiveness.
[0534] An example of a prompt message would be, "Explain how to optimize alerts based on a child's emotions during online learning." This provides a basis for properly utilizing generative AI models.
[0535] The flow of the specific processing in Example 2 will be explained using Figure 13.
[0536] Step 1:
[0537] The device uses its built-in camera and microphone to collect sensor data such as the child's face, expressions, and voice. It acquires this data as input and transmits it to a server in real time. Specifically, the device activates its camera and periodically records facial movements and voice.
[0538] Step 2:
[0539] The server receives sensor data transmitted from the terminal as input to the emotion analysis engine. The data processing performed here involves analyzing the emotional state using machine learning algorithms. As output, it generates scores and labels indicating the child's emotional state. Specifically, it determines states such as "happy" or "stressed" based on factors such as the presence or absence of a smile and the tone of voice.
[0540] Step 3:
[0541] The server determines appropriate alert content for the user based on the analysis results and outputs it to the terminal. It receives the results of sentiment analysis as input and creates an alert message corresponding to the emotional state as output. A specific example of its operation is generating a message such as, "Please sit down and move away from the screen."
[0542] Step 4:
[0543] The device captures the URL of the webpage the user wants to access and passes it as input to the data processing system. In this step, calculations are performed to filter whether the URL is inappropriate. The output suggests safe alternative content to the user. Specifically, if there is a page that cannot be accessed, it will display a message such as "This site cannot be viewed, please try this instead."
[0544] Step 5:
[0545] The device monitors children's online input behavior and displays a warning if input containing personal information is detected. It acquires user typing data as input and uses an emotion engine to detect anxiety, thereby outputting a more considerate interface. Specifically, a friendly character appears on the screen saying, "Let's not enter addresses."
[0546] Step 6:
[0547] The device monitors the set usage time and passes the elapsed time as input data to the timer. The output is a break alert to the user. Specifically, it displays "It's almost time for a break" and has a function to temporarily lock the screen.
[0548] (Application Example 2)
[0549] Next, we will explain application example 2. In the following explanation, the data processing device 12 will be referred to as the "server," and the headset-type terminal 314 will be referred to as the "terminal."
[0550] Children are required to have a safe and comfortable user experience in the digital information environment, while simultaneously receiving appropriate visual protection, filtering of digital content, protection of personal information, and management of usage time, all while considering their emotional state. However, conventional systems have struggled to comprehensively achieve these goals. In addition, there is a lack of means to effectively manage and improve children's digital experiences while reducing the burden on parents in the home environment.
[0551] The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 2 is realized by the following means.
[0552] In this invention, the server includes sensor means for detecting the user's distance and posture, analysis means for recognizing emotions and adjusting the digital experience, and adjustment means for communicating with an intelligent machine that monitors educational activities and provides support based on the user's emotions. This enables the protection of children's vision and support for digital education that is in harmony with their emotions.
[0553] "User" refers to an individual who uses a system in a digital information environment, and in particular, to a child who uses a device under the supervision of a parent or guardian.
[0554] "Sensor means" refers to devices and technologies for detecting the user's distance, posture, and facial expressions, and includes cameras, microphones, and other sensing devices.
[0555] "Control means" refers to technology that generates warnings and alerts to encourage visual protection in the user based on data obtained from sensor means.
[0556] "Data processing means" refers to a processing device and program for classifying digital information and filtering out inappropriate content.
[0557] "Monitoring measures" refer to technologies and devices that check the user's input activities and prevent the leakage of personal information.
[0558] "Analysis means" refers to technology that recognizes the user's emotional state based on information such as facial expressions and voice, and adjusts the digital experience accordingly.
[0559] "Adjustment means" refers to technology that improves and provides the user experience in cooperation with intelligent machines, etc., in response to the user's emotions recognized by the analysis means.
[0560] An "intelligent machine" refers to a robot or device equipped with information processing and communication capabilities to enhance children's learning and entertainment.
[0561] "Educational activities" refer to all forms of learning and training aimed at improving children's knowledge and skills.
[0562] This invention provides a system in which a server interacts with an intelligent machine installed in the home, namely a consumer robot, to support a safe and comfortable digital experience for children. The terminal uses sensor means to sense the user's face and body movements and detect distance and posture. Based on this, a control means functions to determine whether visual protection is needed and to display an alert.
[0563] The server collects and analyzes sensor data through cloud-based services. Here, it uses the Google Cloud Vision API and Microsoft Azure Emotion API to analyze the user's facial expressions and voice, and obtain emotional data. Furthermore, the analyzed emotional data is used as a means of tailoring content and activities to the user. For example, generative AI models using TensorFlow or PyTorch analyze and predict emotions in real time, and fine-tune filtering criteria.
[0564] If a user attempts to access inappropriate digital information, data processing mechanisms on the server immediately evaluate the information and verify its suitability using the Google Safe Browsing API. During this process, the device monitors input operations using monitoring mechanisms to prevent the leakage of personal information and issues warnings as needed.
[0565] As a concrete example, when a child is using a digital device to participate in an educational program, a robot can be present to manage the appropriateness of the digital information and usage time while suggesting fun activities. For example, by entering a prompt such as, "Determine from this child's facial expression whether they are stressed and generate code to suggest appropriate countermeasures," the system can provide solutions tailored to individual needs.
[0566] In this way, the invention solves the challenges of protecting children's eyesight and digital safety, and realizes a system that reduces the burden on parents at home.
[0567] The flow of a specific process in Application Example 2 will be explained using Figure 14.
[0568] Step 1:
[0569] The device uses sensors to detect the user's facial and body movements in real time. It receives raw data from cameras and microphones as input and processes it as sensor data. Output includes distance, posture, and facial expression information. Specifically, it performs face detection from camera images and uses a distance sensor to measure the distance to the user.
[0570] Step 2:
[0571] The server receives sensor data via cloud services and performs emotion analysis using the Google Cloud Vision API and Emotion API. It receives facial images and voice data as input and sends this data to the API for analysis. The output is an estimate of the user's emotional state. Specifically, it analyzes facial expressions from images, predicts emotions based on tone of voice, and compiles this information.
[0572] Step 3:
[0573] The server uses data processing tools to analyze the URL of the digital content the user is trying to access. It receives URLs captured from a web browser as input and uses the Google Safe Browsing API for classification and filtering. The output evaluates whether the content is safe. Specifically, the API scans the URL and returns whether it poses a risk.
[0574] Step 4:
[0575] The terminal uses monitoring devices to monitor the user's input operations. It collects and analyzes log data from keyboard and touch operations as input. The output provides an indicator of whether personal information has been entered. Specifically, it compares the entered string with a pre-configured personal information pattern and generates an alert if a match is found.
[0576] Step 5:
[0577] The server utilizes a generative AI model based on analyzed emotional states to provide suggestions and content recommendations to the user using adjustment mechanisms. It receives emotional data as input and provides prompts to the generative AI model. As output, it provides the user with optimal action plans and content suggestions. Specifically, it receives a prompt message such as "Determine whether this child's facial expression indicates stress and generate code to suggest appropriate countermeasures," adjusts the generated suggestions, and responds to the user.
[0578] The specific processing unit 290 transmits the result of the specific processing to the headset terminal 314. In the headset terminal 314, the control unit 46A causes the speaker 240 and display 343 to output the result of the specific processing. The microphone 238 acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 238 to the data processing unit 12. In the data processing unit 12, the specific processing unit 290 acquires the audio data.
[0579] Data generation model 58 is a type of so-called generative AI (Artificial Intelligence). One example of data generation model 58 is ChatGPT (Internet search<URL: https: / / openai.com / blog / chatgpt> ), Gemini (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions indicated by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization.
[0580] In the above embodiment, an example was given in which specific processing is performed by the data processing device 12, but the technology of this disclosure is not limited thereto, and specific processing may also be performed by the headset terminal 314.
[0581] [Fourth Embodiment]
[0582] Figure 7 shows an example of the configuration of the data processing system 410 according to the fourth embodiment.
[0583] As shown in Figure 7, the data processing system 410 includes a data processing device 12 and a robot 414. An example of the data processing device 12 is a server.
[0584] The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network).
[0585] The robot 414 includes a computer 36, a microphone 238, a speaker 240, a camera 42, a communication interface 44, and a controlled object 443. The computer 36 includes a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The microphone 238, speaker 240, camera 42, and controlled object 443 are also connected to the bus 52.
[0586] The microphone 238 receives voice signals from the user 20 and receives instructions from the user 20. The microphone 238 captures the voice signals from the user 20, converts the captured voice into audio data, and outputs it to the processor 46. The speaker 240 outputs audio according to the instructions from the processor 46.
[0587] Camera 42 is a small digital camera equipped with an optical system including a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor, and captures images of the area around the user 20 (for example, an imaging range defined by a field of view equivalent to the width of a typical healthy person's field of vision).
[0588] Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various information between processor 46 and processor 28 via network 54. The exchange of various information between processor 46 and processor 28 using communication interfaces 44 and 26 is performed in a secure manner.
[0589] The controlled object 443 includes a display device, LEDs in the eyes, and motors that drive the arms, hands, and feet. The posture and gestures of the robot 414 are controlled by controlling the motors of the arms, hands, and feet. Some of the robot 414's emotions can be expressed by controlling these motors. Furthermore, the robot 414's facial expressions can also be expressed by controlling the illumination state of the LEDs in its eyes.
[0590] Figure 8 shows an example of the main functions of the data processing device 12 and the robot 414. As shown in Figure 8, the data processing device 12 performs specific processing using the processor 28. The storage 32 stores the specific processing program 56.
[0591] The specific processing program 56 is an example of a "program" relating to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 in accordance with the specific processing program 56 executed on the RAM 30.
[0592] The storage 32 stores the data generation model 58 and the emotion identification model 59. The data generation model 58 and the emotion identification model 59 are used by the identification processing unit 290.
[0593] In robot 414, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48.
[0594] Next, the specific processing performed by the specific processing unit 290 of the data processing device 12 will be described. In the following description, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal".
[0595] This invention provides a system that promotes healthy internet use among children by providing features such as eye protection, content filtering, personal information protection, and usage time management. The program's processing is described below in natural language.
[0596] First, the device uses sensors to detect the distance and posture of the user, i.e., the child's face. This activates a function that issues alerts for appropriate eye protection, prompting the user to take a break or say things like "You're too close to the screen" or "It's time for a break." These notifications are displayed using child-friendly characters, allowing for natural behavioral correction.
[0597] Next, the server receives the URL of the website the child is trying to access and checks it against a database to determine if it contains inappropriate content. This filtering technology is essential to protect children from harmful information. If the content is deemed inappropriate, the device immediately notifies the user and suggests safe sites and activities.
[0598] Furthermore, the device monitors children's attempts to enter personal information online. When a user enters an email address or other personal information, it immediately issues a warning and urges them to refrain from doing so, thereby reducing the risk of data breaches.
[0599] Furthermore, the device manages the child's internet usage time, notifies them when the set time is approaching, and automatically switches to rest mode when the actual time is up. This function helps maintain the child's health by avoiding prolonged continuous use and encouraging appropriate breaks.
[0600] As a concrete example, when a child uses the internet to supplement their studies, the server filters and displays only safe learning-related sites, and the device notifies the child to take a break before screen time becomes too long. This provides an environment where learning can be done efficiently while also protecting health and safety.
[0601] The following describes the processing flow.
[0602] Step 1:
[0603] The device activates its sensors to monitor the child's face position and orientation in real time. The detected data is used to calculate the distance and angle of the face.
[0604] Step 2:
[0605] Based on monitoring results, the device determines if the user is in a state that could negatively affect their eyesight, such as "holding the screen too close." If this is detected, an alert message "You need to take a break" is displayed on the screen.
[0606] Step 3:
[0607] A user attempts to access a website. At that time, the device captures the accessed URL and sends it to the server.
[0608] Step 4:
[0609] The server receives the URL, compares it against filtering criteria in the database, and analyzes whether the website in question is inappropriate.
[0610] Step 5:
[0611] If the server determines that the website in question is inappropriate, it returns that result to the device. The device then displays a message to the child stating, "This page is not accessible," and suggests safe alternative content.
[0612] Step 6:
[0613] The device monitors personal data (such as email addresses and credit card information) when a child starts entering information.
[0614] Step 7:
[0615] If personal information is detected in an input field, the device will immediately issue a warning and prompt the child not to enter the information.
[0616] Step 8:
[0617] The device counts the elapsed time since internet use began and manages the time to ensure it does not exceed the set limit. When the remaining time is running low, it notifies the child of the countdown.
[0618] Step 9:
[0619] When the set usage time is reached, the device locks the screen, prompting the child to take a break. At this time, the display screen shows relaxation methods and game suggestions accompanied by characters.
[0620] (Example 1)
[0621] Next, we will describe Example 1. In the following description, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal".
[0622] In today's information society, ensuring children use the internet safely and healthily is a critical issue. In particular, concerns about vision problems, access to inappropriate content, personal data breaches, and health damage from prolonged use are serious. Traditional methods have struggled to comprehensively address these issues, necessitating a unified solution.
[0623] The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 1 is realized by the following means.
[0624] In this invention, the server includes detection means, determination means, information processing means, monitoring means, and time management means. This enables the protection of children's eyesight, control of inappropriate content access, protection of personal information, and management of usage time.
[0625] "Detection means" refers to devices or technologies used to sense the user's distance and posture.
[0626] The "decision-making mechanism" is a mechanism that generates a warning for eye protection based on the detection results.
[0627] "Information processing means" refers to technologies for classifying and selecting data and controlling access to inappropriate information.
[0628] A "monitoring measure" is a system designed to observe the user's input operations and prevent the unauthorized input of personal information.
[0629] A "time management system" is a management system used to limit a user's activity time and maintain their health.
[0630] "Means of expression" refers to a function that provides notifications or warnings to users through the display of characters.
[0631] This invention is a system that supports healthy internet use among children, and its specific embodiments are shown below.
[0632] The device features a built-in camera and infrared sensor to detect the child's distance and posture in real time. The data acquired by the sensors is used for vision protection, and if the distance is too close or the posture is inappropriate, a warning is displayed to the child using a character notification. In this process, the camera and infrared sensor are used as hardware, and a software algorithm for vision protection processes the data.
[0633] The server receives the URL of a website that a child is trying to access and filters it against a database to eliminate inappropriate content. The database contains keywords and site lists necessary for filtering, and machine learning techniques are used to identify inappropriate elements.
[0634] The device also monitors when children enter personal information into forms. For example, if it detects the input of an email address or phone number, it immediately issues a warning to prevent the leakage of personal information. This uses browser plugins and real-time data checking functions provided by the operating system.
[0635] Furthermore, the device manages the child's internet usage time, notifies them when the set time limit is reached, and switches to rest mode when the time is up. This allows users to take appropriate breaks. An installed timer application is used for time management.
[0636] As a concrete example, when a child uses the internet for online learning, the server displays only safe websites related to learning, and the device sends notifications at regular intervals recommending a break. This provides a safe and healthy learning environment.
[0637] The following are examples of prompts for a generative AI model.
[0638] "Please propose specific ways to ensure children can use the internet safely. This should include measures such as eye protection, content filtering, personal information protection, and managing usage time."
[0639] The flow of the specific processing in Example 1 will be explained using Figure 11.
[0640] Step 1:
[0641] The device activates its built-in camera and infrared sensor to detect the distance and posture of the child's face. Input includes image data of the child's face and distance measurement data. This data is analyzed by an algorithm to determine if the distance falls below a predetermined threshold (e.g., 30cm or more) or if the posture is inappropriate. If the distance is too close or the posture is inappropriate, the device generates a warning message using a character to protect the child's eyesight. This prompts the user to immediately move away from the screen.
[0642] Step 2:
[0643] The server receives the URL of the website that the child has requested to access. The input includes the URL data and metadata such as the access time. The server compares this URL with a database to check if it matches any pre-registered lists of inappropriate URLs or keywords. A text search algorithm is used for data processing. If it is determined to be inappropriate, the server sends an access denied message and a suggestion of a safe site to the device as output. This procedure guides the child to safe information without accessing inappropriate content.
[0644] Step 3:
[0645] The device monitors the input in real time when a child fills out a form in their web browser. The input is the text data entered by the child. The plugin used for monitoring performs regular expression matching to detect keywords that constitute personal information (e.g., email address, phone number). If such keywords are detected, a warning pop-up is displayed as output, and the submission function is disabled. This procedure prevents the leakage of personal information.
[0646] Step 4:
[0647] The device activates a built-in timer when a child starts using the internet and measures the usage time. The input recorded is the start time of use. Five minutes before the set usage limit is reached, a warning notification is generated, and the message "You have 5 minutes of usage time remaining" is displayed on the screen. When the set time is up, the internet connection is temporarily disabled, and a message prompting a break is displayed. This allows children to use the internet while taking appropriate breaks.
[0648] (Application Example 1)
[0649] Next, we will explain Application Example 1. In the following explanation, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal".
[0650] In recent years, creating an environment where children can use the internet safely and healthily has become crucial. However, there are concerns about the risks children face when spending long hours online, such as eye strain, access to inappropriate content, and the leakage of personal information. There is a need to develop systems that mitigate these risks and appropriately manage usage time, thereby promoting the healthy physical and mental development of children.
[0651] The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 1 is realized by the following means.
[0652] In this invention, the server includes detection means for detecting the location and posture of a child user; control means for generating warnings to encourage the protection of the user's vision based on the detection means; processing means for classifying and filtering digital information and restricting access to inappropriate digital information; and filtering means for analyzing web location markers accessed by the user and allowing only safe digital information. This enables children to use the internet safely while protecting their eyesight and managing their usage time.
[0653] A "detection device" is a device that has the function of sensing the child user's position and posture in real time and converting it into numerical data.
[0654] A "control means" is a device that generates warnings to reduce the visual strain on the user based on data obtained from a detection means.
[0655] A "processing means" is a component that handles data management functions to classify digital information, select inappropriate content, and restrict access to it.
[0656] A "monitoring device" is a device that has the function of monitoring user input operations and implementing security measures to prevent the leakage of personal information.
[0657] A "filtering method" is a system that analyzes the web location information accessed by the user and has the function of allowing only safe digital information.
[0658] The system that realizes this invention is configured using a server and terminals.
[0659] The server's primary role is to classify and filter digital information, restricting access to inappropriate content. Specifically, the server receives web location markers that users attempt to access and compares them against a database to determine if the information is safe. This filters out inappropriate digital information, providing users with a safe internet environment.
[0660] On the other hand, the terminal functions as an interface for the user to access the internet. It uses a built-in camera and distance sensor to detect the user's position and posture in real time. Using image processing libraries such as OpenCV, it generates warnings through the display device if the user is too close to the display or has poor posture.
[0661] To manage usage time, the device manages time information through a real-time database and notifies the user when usage ends based on the set usage time. This helps users avoid excessive use and promotes healthy usage. A virtual character is used for notifications, providing user interaction in a friendly manner.
[0662] For example, when a user accesses learning content over the internet, the server provides only secure, educational digital information. Furthermore, the device helps maintain user concentration and well-being by prompting breaks before prolonged use.
[0663] By utilizing generative AI models, it is possible to optimize the content of the messages and warnings for individual users. An example of a prompt to input into the generative AI model is, "Please tell me how to guide my child to use the internet safely." Such a system can provide comprehensive support for children's digital lives.
[0664] The flow of a specific process in Application Example 1 will be explained using Figure 12.
[0665] Step 1:
[0666] The server receives a web location identifier (URL) sent by the user. The received URL is compared against a database on the server to determine if it is secure digital information. The database contains a list of URLs that have been previously verified as secure. If the URL is determined to be inappropriate, the server sends a corresponding notification to the device.
[0667] Step 2:
[0668] The device uses its built-in camera and distance sensor to detect the user's position and posture. Using the OpenCV library, it analyzes video data from the camera to determine if the user is too close to the display or has poor posture. This information is then used to evaluate the appropriateness of the distance and posture through programmatic thresholding. Depending on the result, the device displays visual or audible warnings to the user.
[0669] Step 3:
[0670] The device measures the user's internet usage time and manages the progress of that time in a real-time database. Timekeeping information is obtained by a timer on the device, and notifications are sent based on the set usage limit. When the end time is approaching, the device generates a voice message through a virtual character prompting the user to take a break.
[0671] Step 4:
[0672] When a user enters digital information, the terminal monitors keystrokes and detects specific patterns from a personal information protection perspective. If a specific email address or phone number input pattern is recognized, the system immediately issues a warning to the user prompting them to stop entering information. This warning is provided in either a visual or audio format.
[0673] Step 5:
[0674] By utilizing a generative AI model, the device provides user-friendly interactions. An example prompt is "How can I help my child use the internet safely?", and the generative AI model generates optimal advice and wording, delivering it to the user through communication. This enables flexible, personalized guidance.
[0675] Furthermore, an emotion engine that estimates the user's emotions may be incorporated. That is, the identification processing unit 290 may use the emotion identification model 59 to estimate the user's emotions and perform identification processing using the user's emotions.
[0676] This invention provides a system for children's internet use that combines eye protection, filtering of inappropriate content, protection of personal information, usage time management, and recognition of the user's emotions. This system is designed to ensure a safe and healthy digital experience for children. The program's processing is described below in natural language.
[0677] First, the device uses sensors to capture the child's face and detect their distance and posture. Based on this data, it determines if eye protection is needed and displays an appropriate alert. The content of the alert can be fine-tuned based on the user's emotions, and this is supported by an emotion engine.
[0678] The emotion engine analyzes sensor data and usage patterns from the device to predict emotions using the child's facial expressions, voice, and interaction data. This emotion data is used not only for alerts but also for providing safe content.
[0679] Next, the device captures the URLs of the web pages the child accesses, and inappropriate content is filtered using data processing tools. At this point, the emotion engine adjusts the filtering criteria and suggested safe alternative content based on the child's psychological state.
[0680] Furthermore, the device monitors whether personal information is included when a child enters data online, and if the emotion engine detects anxiety or stress, it provides more careful warnings and explanations. Especially when a child is emotional, a friendly character explains the situation and the interface is designed to reassure the child.
[0681] In usage time management, the device displays an alert when a set time has elapsed, guiding users to take breaks naturally. In this process as well, the emotion engine detects fatigue and boredom, supporting time management by issuing alerts at appropriate times.
[0682] For example, when a child is learning online, the device's emotion engine detects stress during learning and suggests interactions such as, "Let's refresh ourselves with fun quizzes and games," making breaks enjoyable. In this way, an approach that takes emotional states into account can provide children with a safe and enjoyable environment for using the internet.
[0683] The following describes the processing flow.
[0684] Step 1:
[0685] The device uses sensors to monitor the distance and posture of the child's face and processes this data in real time.
[0686] Step 2:
[0687] The device inputs the acquired data into an emotion engine, which analyzes the child's emotions based on their facial expressions and voice. Based on these results, it decides whether or not to issue an alert for vision protection.
[0688] Step 3:
[0689] The device uses the results of an emotion engine analysis to display alerts such as "Let's move a little further away from the screen!" in a more friendly tone.
[0690] Step 4:
[0691] When a user attempts to access a webpage, the device sends the URL to the server.
[0692] Step 5:
[0693] The server receives the URL, compares it with the database to determine if it contains inappropriate content, and sends the result back to the terminal.
[0694] Step 6:
[0695] The device's emotional engine assesses the child's psychological state and, if necessary, blocks access to inappropriate content, informing them in a gentle tone, "This site is not safe."
[0696] Step 7:
[0697] The device monitors whether any input actions performed by the child include personal information.
[0698] Step 8:
[0699] If personal information is detected, the device will display a warning, but simultaneously, taking into account the results of the emotion engine's analysis, it will offer reassuring words to reduce the child's stress.
[0700] Step 9:
[0701] The device tracks usage time, and an emotional engine displays a countdown that gently reminds the child, "It's almost time for a break," depending on their fatigue or boredom.
[0702] Step 10:
[0703] When usage time ends, the device switches to vacation mode, which includes relaxation suggestions, and automatically locks the screen. At this time, a character suggests an activity that children can enjoy, saying, "Here's what's next!"
[0704] (Example 2)
[0705] Next, we will describe Example 2. In the following description, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal".
[0706] When children use the internet, there is a need to protect their eyesight, restrict access to inappropriate content, protect their personal information, and provide a user environment that takes their emotions into consideration. In conventional methods, these elements are often managed individually, making it difficult to build an integrated digital protection environment.
[0707] The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 2 is realized by the following means.
[0708] In this invention, the server includes sensing means for detecting the face, facial expressions, and voice of a child user; analysis means for analyzing the user's emotions based on the sensing means and generating alerts corresponding to the emotional state; and processing means for classifying and filtering content, restricting access to inappropriate information, and suggesting alternative information. This enables a total digital protection environment for child users.
[0709] "Sensing means" refers to sensors and devices used to detect the child user's face, facial expressions, and voice in real time.
[0710] "Analysis means" refers to the process of analyzing the user's emotional state based on data obtained from sensing means and generating appropriate alerts.
[0711] "Processing means" refers to a series of information processing steps that not only classify content and filter out inappropriate information, but also suggest appropriate alternative information.
[0712] "Monitoring measures" refer to a system that constantly monitors the user's input actions and provides necessary warnings from the perspective of protecting personal information.
[0713] A "timer device" refers to equipment that notifies the user of usage time and break times, and generates alerts at appropriate times according to their emotional state.
[0714] "Interface means" refers to a user interface that provides notifications and warnings to the user through character displays tailored to their emotional state.
[0715] This invention is a system for realizing a safe and secure digital experience for children using the internet. Specifically, the system includes sensors, an analysis engine, data processing, and interface means.
[0716] The device is equipped with sensor devices such as a camera and microphone. These are used as "sensing means" to capture the child's face, expressions, and voice in real time. The device sends this data to a server as basic data for emotion analysis.
[0717] On the server, the collected data is processed using "analysis tools" to analyze the child's emotional state. Emotion prediction algorithms and machine learning models are used for this analysis. Based on these analysis results, the content of the alerts presented to the user on the terminal is adjusted.
[0718] To prevent access to inappropriate content, the device captures the browser's URL information and filters it using a "processing mechanism." This filtering system proactively blocks inappropriate information and suggests safe alternative content.
[0719] Furthermore, the device constantly monitors the child's input and uses "monitoring tools" to issue a warning if personal information is included. If stress or anxiety is detected, a friendly character will be displayed to calmly explain the situation.
[0720] To manage usage time, the device uses a "timer mechanism" to notify the user of usage time and break time. If fatigue is detected, it will make a suggestion such as "Let's take a little more break" depending on the user's emotional state.
[0721] For example, if a user is experiencing stress during online learning, the device might suggest interactions such as "Let's refresh ourselves with fun quizzes and games," offering suggestions to complement learning with other activities. This ensures a safe digital environment while maintaining learning effectiveness.
[0722] An example of a prompt message would be, "Explain how to optimize alerts based on a child's emotions during online learning." This provides a basis for properly utilizing generative AI models.
[0723] The flow of the specific processing in Example 2 will be explained using Figure 13.
[0724] Step 1:
[0725] The device uses its built-in camera and microphone to collect sensor data such as the child's face, expressions, and voice. It acquires this data as input and transmits it to a server in real time. Specifically, the device activates its camera and periodically records facial movements and voice.
[0726] Step 2:
[0727] The server receives sensor data transmitted from the terminal as input to the emotion analysis engine. The data processing performed here involves analyzing the emotional state using machine learning algorithms. As output, it generates scores and labels indicating the child's emotional state. Specifically, it determines states such as "happy" or "stressed" based on factors such as the presence or absence of a smile and the tone of voice.
[0728] Step 3:
[0729] The server determines appropriate alert content for the user based on the analysis results and outputs it to the terminal. It receives the results of sentiment analysis as input and creates an alert message corresponding to the emotional state as output. A specific example of its operation is generating a message such as, "Please sit down and move away from the screen."
[0730] Step 4:
[0731] The device captures the URL of the webpage the user wants to access and passes it as input to the data processing system. In this step, calculations are performed to filter whether the URL is inappropriate. The output suggests safe alternative content to the user. Specifically, if there is a page that cannot be accessed, it will display a message such as "This site cannot be viewed, please try this instead."
[0732] Step 5:
[0733] The device monitors children's online input behavior and displays a warning if input containing personal information is detected. It acquires user typing data as input and uses an emotion engine to detect anxiety, thereby outputting a more considerate interface. Specifically, a friendly character appears on the screen saying, "Let's not enter addresses."
[0734] Step 6:
[0735] The device monitors the set usage time and passes the elapsed time as input data to the timer. The output is a break alert to the user. Specifically, it displays "It's almost time for a break" and has a function to temporarily lock the screen.
[0736] (Application Example 2)
[0737] Next, we will explain application example 2. In the following explanation, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal".
[0738] Children are required to have a safe and comfortable user experience in the digital information environment, while simultaneously receiving appropriate visual protection, filtering of digital content, protection of personal information, and management of usage time, all while considering their emotional state. However, conventional systems have struggled to comprehensively achieve these goals. In addition, there is a lack of means to effectively manage and improve children's digital experiences while reducing the burden on parents in the home environment.
[0739] The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 2 is realized by the following means.
[0740] In this invention, the server includes sensor means for detecting the user's distance and posture, analysis means for recognizing emotions and adjusting the digital experience, and adjustment means for communicating with an intelligent machine that monitors educational activities and provides support based on the user's emotions. This enables the protection of children's vision and support for digital education that is in harmony with their emotions.
[0741] "User" refers to an individual who uses a system in a digital information environment, and in particular, to a child who uses a device under the supervision of a parent or guardian.
[0742] "Sensor means" refers to devices and technologies for detecting the user's distance, posture, and facial expressions, and includes cameras, microphones, and other sensing devices.
[0743] "Control means" refers to technology that generates warnings and alerts to encourage visual protection in the user based on data obtained from sensor means.
[0744] "Data processing means" refers to a processing device and program for classifying digital information and filtering out inappropriate content.
[0745] "Monitoring measures" refer to technologies and devices that check the user's input activities and prevent the leakage of personal information.
[0746] "Analysis means" refers to technology that recognizes the user's emotional state based on information such as facial expressions and voice, and adjusts the digital experience accordingly.
[0747] "Adjustment means" refers to technology that improves and provides the user experience in cooperation with intelligent machines, etc., in response to the user's emotions recognized by the analysis means.
[0748] An "intelligent machine" refers to a robot or device equipped with information processing and communication capabilities to enhance children's learning and entertainment.
[0749] "Educational activities" refer to all forms of learning and training aimed at improving children's knowledge and skills.
[0750] This invention provides a system in which a server interacts with an intelligent machine installed in the home, namely a consumer robot, to support a safe and comfortable digital experience for children. The terminal uses sensor means to sense the user's face and body movements and detect distance and posture. Based on this, a control means functions to determine whether visual protection is needed and to display an alert.
[0751] The server collects and analyzes sensor data through cloud-based services. Here, it uses the Google Cloud Vision API and Microsoft Azure Emotion API to analyze the user's facial expressions and voice, and obtain emotional data. Furthermore, the analyzed emotional data is used as a means of tailoring content and activities to the user. For example, generative AI models using TensorFlow or PyTorch analyze and predict emotions in real time, and fine-tune filtering criteria.
[0752] If a user attempts to access inappropriate digital information, data processing mechanisms on the server immediately evaluate the information and verify its suitability using the Google Safe Browsing API. During this process, the device monitors input operations using monitoring mechanisms to prevent the leakage of personal information and issues warnings as needed.
[0753] As a concrete example, when a child is using a digital device to participate in an educational program, a robot can be present to manage the appropriateness of the digital information and usage time while suggesting fun activities. For example, by entering a prompt such as, "Determine from this child's facial expression whether they are stressed and generate code to suggest appropriate countermeasures," the system can provide solutions tailored to individual needs.
[0754] In this way, the invention solves the challenges of protecting children's eyesight and digital safety, and realizes a system that reduces the burden on parents at home.
[0755] The flow of a specific process in Application Example 2 will be explained using Figure 14.
[0756] Step 1:
[0757] The device uses sensors to detect the user's facial and body movements in real time. It receives raw data from cameras and microphones as input and processes it as sensor data. Output includes distance, posture, and facial expression information. Specifically, it performs face detection from camera images and uses a distance sensor to measure the distance to the user.
[0758] Step 2:
[0759] The server receives sensor data via cloud services and performs emotion analysis using the Google Cloud Vision API and Emotion API. It receives facial images and voice data as input and sends this data to the API for analysis. The output is an estimate of the user's emotional state. Specifically, it analyzes facial expressions from images, predicts emotions based on tone of voice, and compiles this information.
[0760] Step 3:
[0761] The server uses data processing tools to analyze the URL of the digital content the user is trying to access. It receives URLs captured from a web browser as input and uses the Google Safe Browsing API for classification and filtering. The output evaluates whether the content is safe. Specifically, the API scans the URL and returns whether it poses a risk.
[0762] Step 4:
[0763] The terminal uses monitoring devices to monitor the user's input operations. It collects and analyzes log data from keyboard and touch operations as input. The output provides an indicator of whether personal information has been entered. Specifically, it compares the entered string with a pre-configured personal information pattern and generates an alert if a match is found.
[0764] Step 5:
[0765] The server utilizes a generative AI model based on analyzed emotional states to provide suggestions and content recommendations to the user using adjustment mechanisms. It receives emotional data as input and provides prompts to the generative AI model. As output, it provides the user with optimal action plans and content suggestions. Specifically, it receives a prompt message such as "Determine whether this child's facial expression indicates stress and generate code to suggest appropriate countermeasures," adjusts the generated suggestions, and responds to the user.
[0766] The specific processing unit 290 transmits the result of the specific processing to the robot 414. In the robot 414, the control unit 46A causes the speaker 240 and the controlled object 443 to output the result of the specific processing. The microphone 238 acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 238 to the data processing unit 12. In the data processing unit 12, the specific processing unit 290 acquires the audio data.
[0767] Data generation model 58 is a type of so-called generative AI (Artificial Intelligence). One example of data generation model 58 is ChatGPT (Internet search<URL: https: / / openai.com / blog / chatgpt> ), Gemini (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions indicated by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization.
[0768] In the above embodiment, an example was given in which specific processing is performed by the data processing device 12, but the technology of this disclosure is not limited thereto, and the specific processing may also be performed by the robot 414.
[0769] Furthermore, the emotion identification model 59, acting as an emotion engine, may determine the user's emotion according to a specific mapping. Specifically, the emotion identification model 59 may determine the user's emotion according to a specific mapping, which is an emotion map (see Figure 9). Similarly, the emotion identification model 59 may also determine the robot's emotion, and the identification processing unit 290 may perform identification processing using the robot's emotion.
[0770] Figure 9 shows an emotion map 400 in which multiple emotions are mapped. In the emotion map 400, emotions are arranged in concentric circles radiating from the center. The closer to the center of the concentric circles, the more primitive the emotions are located. Further out of the concentric circles, emotions representing states and actions arising from mental states are located. Emotion is a concept that includes feelings and mental states. On the left side of the concentric circles, emotions that are generally generated from reactions occurring in the brain are located. On the right side of the concentric circles, emotions that are generally induced by situational judgment are located. Above and below the concentric circles, emotions that are generally generated from reactions occurring in the brain and induced by situational judgment are located. In addition, the emotion of "pleasure" is located on the upper side of the concentric circles, and the emotion of "displeasure" is located on the lower side. Thus, in the emotion map 400, multiple emotions are mapped based on the structure in which emotions arise, and emotions that are likely to occur simultaneously are mapped close together.
[0771] These emotions are distributed at the 3 o'clock position on the Emotion Map 400, and usually fluctuate between feelings of security and anxiety. In the right half of the Emotion Map 400, situational awareness takes precedence over internal feelings, resulting in a calm impression.
[0772] The inside of the Emotion Map 400 represents inner thoughts, while the outside represents actions. Therefore, the further you go from the outside of the Emotion Map 400, the more visible (expressed in actions) your emotions become.
[0773] Here, human emotions are based on various balances, such as posture and blood sugar levels. When these balances deviate from the ideal, it results in discomfort, and when they approach the ideal, it results in pleasure. Similarly, in robots, cars, motorcycles, etc., emotions can be created based on various balances, such as posture and battery level. When these balances deviate from the ideal, it results in discomfort, and when they approach the ideal, it results in pleasure. The emotion map can be generated, for example, based on Dr. Mitsuyoshi's emotion map (Research on a system for analyzing brain physiological signals of speech emotion recognition and emotion, Tokushima University, doctoral dissertation: https: / / ci.nii.ac.jp / naid / 500000375379). The left half of the emotion map contains emotions belonging to a region called "response," where sensation is dominant. The right half of the emotion map contains emotions belonging to a region called "situation," where situational awareness is dominant.
[0774] The emotion map defines two emotions that promote learning. One is the emotion around the middle of the negative "repentance" and "reflection" on the situation side. In other words, it is when the robot experiences negative emotions such as "I never want to feel this way again" or "I don't want to be scolded again." The other is the emotion around the positive "desire" on the reaction side. In other words, it is when the robot has positive feelings such as "I want more" or "I want to know more."
[0775] The emotion identification model 59 inputs user input into a pre-trained neural network, obtains emotion values representing each emotion shown in the emotion map 400, and determines the user's emotion. This neural network is pre-trained based on multiple training data sets, which are combinations of user input and emotion values representing each emotion shown in the emotion map 400. Furthermore, this neural network is trained so that emotions located close together have similar values, as shown in the emotion map 900 in Figure 10. Figure 10 shows an example where multiple emotions such as "reassured," "calm," and "confident" have similar emotion values.
[0776] The above description primarily focuses on the functions of the data processing device 12 in relation to this disclosure. However, the system related to this disclosure is not necessarily implemented on a server. The system related to this disclosure may be implemented as a general information processing system. This disclosure may be implemented, for example, as a software program that runs on a personal computer or as an application that runs on a smartphone. The method related to this disclosure may be provided to users in SaaS (Software as a Service) format.
[0777] In the above embodiment, an example was given in which a specific process is performed by a single computer 22. However, the technology of this disclosure is not limited thereto, and a distributed processing of the specific process may be performed by multiple computers, including computer 22. For example, a data generation model 58 may be provided in an external device of the data processing device 12, and the external device may generate data according to the input data.
[0778] In the above embodiment, an example was given in which the specific processing program 56 is stored in the storage 32, but the technology of this disclosure is not limited thereto. For example, the specific processing program 56 may be stored in a portable, computer-readable, non-temporary storage medium such as a USB (Universal Serial Bus) memory. The specific processing program 56 stored in the non-temporary storage medium is installed in the computer 22 of the data processing device 12. The processor 28 executes specific processing according to the specific processing program 56.
[0779] Alternatively, the specific processing program 56 may be stored in a storage device such as a server connected to the data processing device 12 via the network 54, and the specific processing program 56 may be downloaded and installed on the computer 22 in response to a request from the data processing device 12.
[0780] Furthermore, it is not necessary to store the entirety of the specific processing program 56 in a storage device such as a server connected to the data processing device 12 via the network 54, or to store the entirety of the specific processing program 56 in the storage 32; it is acceptable to store only a portion of the specific processing program 56.
[0781] The following types of processors can be used as hardware resources to perform specific processing. Examples of processors include a CPU, a general-purpose processor that functions as a hardware resource to perform specific processing by executing software, i.e., a program. Other examples of processors include dedicated electrical circuits, such as FPGAs (Field-Programmable Gate Arrays), PLDs (Programmable Logic Devices), or ASICs (Application Specific Integrated Circuits), which have circuit configurations specifically designed to perform specific processing. All of these processors have built-in or connected memory, and all of them perform specific processing by using memory.
[0782] The hardware resource that performs a specific process may consist of one of these various processors, or it may consist of a combination of two or more processors of the same or different types (for example, a combination of multiple FPGAs, or a combination of a CPU and an FPGA). Alternatively, the hardware resource that performs a specific process may consist of a single processor.
[0783] Examples of configurations using a single processor include, firstly, a configuration in which one or more CPUs and software are combined to form a single processor, and this processor functions as a hardware resource that performs a specific process. Secondly, there is a configuration using a processor that realizes the functions of the entire system, including multiple hardware resources that perform a specific process, on a single IC chip, as exemplified by SoCs (System-on-a-chip). In this way, a specific process is realized using one or more of the above types of processors as hardware resources.
[0784] Furthermore, the hardware structure of these various processors can more specifically utilize electrical circuits that combine circuit elements such as semiconductor devices. Also, the specific processing described above is merely an example. Therefore, it goes without saying that unnecessary steps can be deleted, new steps added, or the processing order rearranged, as long as it does not deviate from the main purpose.
[0785] The descriptions and illustrations presented above are detailed explanations of the technical aspects of this disclosure and are merely examples of the technical aspects. For example, the above descriptions of the structure, function, operation, and effect are examples of the structure, function, operation, and effect of the technical aspects of this disclosure. Therefore, it goes without saying that you may delete unnecessary parts, add new elements, or replace elements in the descriptions and illustrations presented above, as long as you do not deviate from the essence of the technical aspects of this disclosure. Furthermore, in order to avoid confusion and facilitate understanding of the technical aspects of this disclosure, explanations of common technical knowledge and the like that do not require special explanation to enable the implementation of the technical aspects of this disclosure have been omitted from the descriptions and illustrations presented above.
[0786] All documents, patent applications, and technical standards described herein are incorporated by reference to the same extent as if each individual document, patent application, and technical standard were specifically and individually noted to be incorporated by reference.
[0787] The following is further disclosed regarding the embodiments described above.
[0788] (Claim 1)
[0789] A sensor means for detecting the distance and posture of a child user,
[0790] A control means that generates an alert to encourage the user to protect their eyesight based on the aforementioned sensor means,
[0791] A data processing means for classifying and filtering content and restricting access to inappropriate content,
[0792] Monitoring means to monitor user input and promote the protection of personal information,
[0793] A system that includes this.
[0794] (Claim 2)
[0795] The system according to claim 1, further comprising a timer means for notifying the user of usage time and break time.
[0796] (Claim 3)
[0797] The system according to claim 1, further comprising interface means for providing notifications and warnings to the user by character display.
[0798] "Example 1"
[0799] (Claim 1)
[0800] A detection means for detecting the user's distance and posture,
[0801] A determination means that generates a warning to protect eyesight based on the detection means,
[0802] Information processing means for classifying and selecting data and controlling access to inappropriate information,
[0803] Monitoring means to monitor user input operations and promote the protection of personal information,
[0804] A time management tool to limit the user's activity time and maintain their health,
[0805] A system that includes this.
[0806] (Claim 2)
[0807] The system according to claim 1, further comprising time measurement means for notifying the user of activity time and rest time.
[0808] (Claim 3)
[0809] The system according to claim 1, comprising means for providing the presented notices and warnings in character representation.
[0810] "Application Example 1"
[0811] (Claim 1)
[0812] A detection means for detecting the position and posture of a child user,
[0813] A control means that generates a warning to encourage the protection of the user's vision based on the detection means,
[0814] Processing means for classifying and screening digital information and restricting access to inappropriate digital information,
[0815] Monitoring means to monitor user input operations and promote the security of personal information,
[0816] A filtering mechanism that analyzes the web location information accessed by the user and allows only secure digital information,
[0817] A system that includes this.
[0818] (Claim 2)
[0819] The system according to claim 1, further comprising a timing means for informing the user of usage time and break time.
[0820] (Claim 3)
[0821] The system according to claim 1, further comprising display means for providing notifications and warnings to the user by displaying virtual characters.
[0822] "Example 2 of combining an emotion engine"
[0823] (Claim 1)
[0824] A sensing means for detecting the face, facial expressions, and voice of a child user,
[0825] Analysis means for analyzing the user's emotions based on the sensing means and generating an alert corresponding to the emotional state,
[0826] A processing means for classifying and filtering content, restricting access to inappropriate information, and suggesting alternative information,
[0827] A monitoring system for monitoring user input behavior, encouraging the protection of personal information, and providing warnings through a user-friendly character interface when the user is in an unstable emotional state,
[0828] A system that includes this.
[0829] (Claim 2)
[0830] The system according to claim 1, further comprising a timer means for notifying the user of usage time and break time, and for generating alerts at appropriate times based on emotional state.
[0831] (Claim 3)
[0832] The system according to claim 1, further comprising an interface means for providing notifications and warnings presented to the user through character displays with content appropriate to the user's emotional state.
[0833] "Application example 2 when combining with an emotional engine"
[0834] (Claim 1)
[0835] Sensor means for detecting the user's distance and posture,
[0836] Control means for generating a warning to promote the protection of the user's vision based on the aforementioned sensor means,
[0837] A data processing means for classifying and filtering digital information and restricting access to inappropriate digital information,
[0838] Monitoring measures to monitor user input activity and promote the protection of personal information,
[0839] Analytical means for recognizing emotions and adjusting digital experiences,
[0840] An intelligent machine that monitors educational activities and communicates with a means of providing support based on the user's emotions,
[0841] A system that includes this.
[0842] (Claim 2)
[0843] The system according to claim 1, further comprising a timing means for notifying the user of usage time and break time.
[0844] (Claim 3)
[0845] The system according to claim 1, further comprising display means for providing notifications and warnings to the user by character display. [Explanation of Symbols]
[0846] 10, 210, 310, 410 Data Processing Systems 12 Data Processing Devices 14 Smart Devices 214 Smart Glasses 314 Headset-type terminal 414 Robots< / url:> < / url:> < / url:> < / url:>
Claims
1. A detection means for detecting the position and posture of a child user, A control means that generates a warning to encourage the protection of the user's vision based on the detection means, Processing means for classifying and screening digital information and restricting access to inappropriate digital information, Monitoring means to monitor user input operations and promote the security of personal information, A filtering mechanism that analyzes the web location information accessed by the user and allows only secure digital information, A system that includes this.
2. The system according to claim 1, further comprising a timing means for informing the user of usage time and break time.
3. The system according to claim 1, further comprising display means for providing notifications and warnings to the user by displaying virtual characters.