Data processing device, data processing method, and data processing program
The data processing device uses biometric sensors in earphones to generate personalized music and vibrations based on user state, addressing the limitations of conventional AI earphones by providing real-time stress relief.
Patent Information
- Authority / Receiving Office
- JP · JP
- Patent Type
- Applications
- Current Assignee / Owner
- SOFTBANK GROUP CORP
- Filing Date
- 2024-12-03
- Publication Date
- 2026-06-15
AI Technical Summary
Conventional AI earphones do not utilize real-time biometric information to personalize music and vibrations based on a user's instantaneous mental state, limiting their effectiveness in stress reduction and relaxation.
A data processing device and method that utilizes biometric information from earphones to evaluate a user's state and generate personalized music and vibration patterns through a data generation model, incorporating sensors for heart rate, galvanic skin response, and blood oxygen saturation to provide tailored audio and tactile feedback.
Enables real-time adjustment of music and vibrations to match the user's emotional and physical state, enhancing stress reduction and relaxation effects.
Smart Images

Figure 2026096757000001_ABST
Abstract
Description
【Technical Field】 【0001】 The technology of the present disclosure relates to a data processing device, a data processing method, and a data processing program. 【Background Art】 【0002】 Patent Document 1 discloses a persona chatbot control method performed by at least one processor, including steps of receiving a user utterance, adding the user utterance to a prompt including an instruction sentence related to an explanation of a chatbot character, encoding the prompt, and inputting the encoded prompt into a language model to generate a chatbot utterance in response to the user utterance. 【Prior Art Documents】 【Patent Documents】 【0003】 【Patent Document 1】 Japanese Unexamined Patent Application Publication No. 2022-180282 【Summary of the Invention】 【Problems to be Solved by the Invention】 【0004】 In modern society, stress and anxiety are increasing, and there are concerns about their impact on physical and mental health. Although conventional AI earphones have the function of providing information based on a user's utterance and action history, they do not have the function of utilizing a user's biometric information in real time and personalizing music and vibrations according to the state. Therefore, it has been difficult to maximize the stress reduction and relaxation effects corresponding to the instantaneous mental state of individual users. Also, Patent Document 1 discloses a technology for providing information based on a user's utterance and action history, but does not disclose a technology for acquiring and analyzing biometric information in real time and generating and providing music data and vibration pattern data according to the user's state. The object of the present invention is to provide a data processing device, a data processing method, and a data processing program that can generate music data and vibration pattern data corresponding to a user's biometric information. [Means for solving the problem] 【0005】 A first aspect of the technology of this disclosure is a data processing device comprising: an input unit that inputs user data including biometric information collected by two earphones worn on the user's ears, including a microphone, speaker, camera, biometric information sensor, and vibration application unit; a processing unit that performs a specific processing using a data generation model that generates a predetermined inference result corresponding to the user data; and an output unit that plays the result of the specific processing from the speaker and applies vibration to the vibration application unit, wherein the input unit inputs biometric information detected by the biometric information sensor as the user data, and the processing unit evaluates the user's state based on the biometric information and performs a process as the specific processing that generates music data and vibration pattern data corresponding to the evaluated user's state as the result of the specific processing. 【0006】 A second aspect of the technology of the present disclosure is a data processing method in which a computer performs a specific processing using a data generation model that inputs user data including biometric information collected by two earphones worn on the user's ears, which include a microphone, speaker, camera, biometric information sensor, and vibration application unit, and generates a predetermined inference result corresponding to the user data, wherein the specific processing includes inputting biometric information detected by the biometric information sensor as the user data, evaluating the user's state based on the biometric information, and generating music data and vibration pattern data corresponding to the evaluated user's state as the result of the specific processing, and the computer performs a process of playing the result of the specific processing from the speaker and applying vibration to the vibration application unit. 【0007】 A third aspect of the technology of this disclosure is a data processing program that causes a computer to perform a specific process using a data generation model that inputs user data including biometric information collected by two earphones worn on the user's ears, which include a microphone, speaker, camera, biometric information sensor, and vibration application unit, and generates a predetermined inference result corresponding to the user data, wherein the specific process is performed by inputting biometric information detected by the biometric information sensor as the user data, evaluating the user's state based on the biometric information, and generating music data and vibration pattern data corresponding to the evaluated user's state as the result of the specific process, and the computer is instructed to perform a process to play the result of the specific process from the speaker and apply vibration to the vibration application unit. [Brief explanation of the drawing] 【0008】 [Figure 1] Figure 1 is a conceptual diagram showing an example of the configuration of a data processing system. [Figure 2] Figure 2 is a conceptual diagram showing an example of the main functions of a data processing device and earphones. [Figure 3A] Figure 3A shows an example of an earphone configuration. [Figure 3B] Figure 3B shows the user wearing earphones. [Figure 3C] Figure 3C is a diagram illustrating the field of view of camera 42. [Figure 3D] Figure 3D shows the user wearing the earphones. [Figure 3E] Figure 3E shows the user wearing the earphones. [Figure 3F] Figure 3F shows the user wearing earphones. [Figure 4] The functional configuration of a specific processing unit of a data processing device is shown in general terms. [Figure 5] An example of the operation flow of a specific process performed by the data processing device in the first embodiment is schematically shown. [Figure 6]An example of the operation flow of a specific process performed by the data processing device in the second embodiment is schematically shown. [Modes for carrying out the invention] 【0009】 Hereinafter, an example of an embodiment of the data processing device, data processing method, and program relating to the technology of this disclosure will be described with reference to the attached drawings. 【0010】 First, let's explain the terminology used in the following explanation. 【0011】 In the following embodiments, the signed processor (hereinafter simply referred to as "processor") may be a single arithmetic unit or a combination of multiple arithmetic units. Furthermore, the processor may be a single type of arithmetic unit or a combination of multiple types of arithmetic units. Examples of arithmetic units include a CPU (Central Processing Unit), a GPU (Graphics Processing Unit), a GPGPU (General-Purpose computing on Graphics Processing Units), or an APU (Accelerated Processing Unit). 【0012】 In the following embodiments, signed RAM (Random Access Memory) is a memory that temporarily stores information and is used as work memory by the processor. 【0013】 In the following embodiments, the signed storage is one or more non-volatile storage devices that store various programs and various parameters. Examples of non-volatile storage devices include flash memory (SSD (Solid State Drive)), magnetic disks (e.g., hard disks), or magnetic tapes. 【0014】 In the following embodiments, the labeled communication I / F (Interface) is an interface including a communication processor, an antenna, and the like. The communication I / F manages communication between multiple computers. Examples of communication standards applied to the communication I / F include wireless communication standards such as 5G (5th Generation Mobile Communication System), Wi-Fi (registered trademark), or Bluetooth (registered trademark). 【0015】 In the following embodiments, "A and / or B" is synonymous with "at least one of A and B". That is, "A and / or B" means that it may be only A, only B, or a combination of A and B. Also, in this specification, when expressing three or more matters connected by "and / or", the same concept as "A and / or B" is applied. 【0016】 [First Embodiment] FIG. 1 shows an example of the configuration of a data processing system 10 according to an embodiment. 【0017】 As shown in FIG. 1, the data processing system 10 includes a data processing device 12 and earphones 14. An example of the data processing device 12 is a server. In this embodiment, the data processing device 12 is an example of the "data processing device" according to the technology of the present disclosure. 【0018】 The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network). 【0019】 The earphone 14 includes a computer 36, a microphone 38, a biometric information sensor 39, a speaker 40, a vibration application unit 41, a camera 42, and a communication interface 44. The computer 36 includes a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The microphone 38, speaker 40, and camera 42 are also connected to the bus 52. 【0020】 The microphone 38 receives voice signals from the user 20 and accepts instructions from the user 20. The microphone 38 captures the voice signals from the user 20, converts the captured voice into audio data, and outputs it to the processor 46. The speaker 40 outputs audio according to the instructions from the processor 46. Hereafter, the microphone 38 may be simply referred to as the microphone 38. The biometric information sensor 39 is a sensor that detects the user's biometric information and includes at least one of the following sensors: a heart rate sensor that detects heart rate (HR), a galvanic skin response (GSR) sensor that detects galvanic skin response, and a blood oxygen saturation concentration sensor (SpO2) that detects blood oxygen saturation. The biometric information sensor 39 is located inside the earphone 14 or in a part that comes into contact with the user's skin 20. The biometric information sensor 39 also performs noise reduction and smoothing processing on the detected biometric information to generate data suitable for analysis. The vibration-applying unit 41 applies vibration to the user 20. For example, the vibration-applying unit 41 is configured using an actuator. 【0021】 Camera 42 is a small digital camera equipped with an optical system including a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor, and captures images of the area around the user 20 (for example, an imaging range defined by a field of view equivalent to the width of a typical healthy person's field of vision). 【0022】 Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various types of information between processor 46 and processor 28 via network 54. 【0023】 Figure 2 shows an example of the main functions of the data processing device 12 and the earphone 14. 【0024】 As shown in Figure 2, in the data processing device 12, specific processing is performed by the processor 28. A specific processing program 56 is stored in the storage 32. The specific processing program 56 is an example of a "data processing program" related to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 according to the specific processing program 56 executed on the RAM 30. 【0025】 The storage 32 stores the data generation model 58. The data generation model 58 is used by the specific processing unit 290. 【0026】 (Earphones 14) In the earphone 14, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48. 【0027】 The earphone 14 may be interpreted as a canal-type earphone that is fitted into the ear canal of the user 20, as shown in Figure 3A. However, the earphone 14 is not limited to a canal type; it may also be an inner-ear type earphone that is inserted into the inner ear of the user 20, or a headphone type earphone that covers the entire ear of the user 20. Each of the two earphones 14 is equipped with a microphone 38, a speaker 40, and a camera 42. The sound and images collected by the two earphones 14 fitted into the ears of the user 20 may be recorded as a life log in the database 24. 【0028】 The life log can be interpreted as a history of the user 20's actions in daily life, and may include sounds and images associated with the user 20, specifically sounds collected by the microphone 38 and images taken by the camera 42 during daily life. The life log may record sounds and images associated with the user 20, along with the date, time, and location in which they were acquired. 【0029】 The sounds collected by the microphone 38 may include the voice of the person the user 20 is talking to, and sounds that occur around the user 20 while walking or cycling (such as the sound of cars driving, birds chirping, the babbling of a stream, and the sound of trees swaying in the wind). 【0030】 As shown in Figure 3C, the camera 42 may capture images of the scenery within its field of view that is in front of the user 20, or it may capture images of scenery within its field of view that is not in front of the user 20, for example, to the side, behind, below, or above the user 20. The images captured by the camera 42 may include images of the person the user 20 is talking to, the scenery around the user 20 when they are walking or cycling, and images of the pet the user 20 is walking with. 【0031】 Since each of the two earphones 14 is equipped with a camera 42, the two earphones 14 worn on the user's ears 20 are positioned at a specific distance apart, one on the left ear and the other on the right ear, as shown in Figure 3B. Therefore, compared to cases where two cameras are arranged side by side in a single housing, such as in a video camera, the spacing between the two cameras 42 can be increased, making 3D sensing easier. 3D sensing can be interpreted as measuring three-dimensional shapes. 【0032】 Furthermore, when the two earphones 14 are placed in the user 20's ears, the two cameras 42 are positioned close to the user 20's left and right eyes, allowing images (captured images) that are nearly identical to those seen with the naked eye to be recorded as a life log in the database 24. Consequently, in specific processing, it becomes easier to reproduce information corresponding to inquiries from the user 20, that is, information corresponding to the content of the user 20's speech. 【0033】 While the two earphones 14 are attached to the user 20, all or part of the images captured by the camera 42 may be recorded in the database 24 as a life log. Specifically, when the two earphones 14 are attached to the user 20, the recording of images captured by the camera 42 to the database 24 may begin, and when the two earphones 14 are removed from the user 20, the recording of those images to the database 24 may end. 【0034】 While the two earphones 14 are worn by the user 20, all or part of the sound collected by the microphone 38 may be recorded as a lifelog in the database 24. Specifically, when the two earphones 14 are worn by the user 20, the recording of the sound collected by the microphone 38 to the database 24 may begin, and when the two earphones 14 are removed from the user 20, the recording of the sound to the database 24 may end. 【0035】 Next, we will describe the processing of the specific processing unit 290 when the data processing device 12 receives an utterance from the user 20 wearing the earphones 14 regarding the user 20's memories or actions, and performs specific processing to propose information corresponding to the content of the user 20's utterance to the user 20. 【0036】 (Specific processing) In this embodiment, the specific processing involves inputting user data and performing specific processing using a data generation model that generates predetermined inference results corresponding to the input user data. Specifically, in the specific processing, when utterances related to the user's memories or actions are received as user data from a user 20 wearing earphones 14, the system refers to the database 24 and performs processing to propose information corresponding to the content of the utterances to the user 20. Specifically, after a life log is recorded in the database 24, if the user 20 wearing earphones 14 makes an utterance related to the user's memories or actions, the specific processing may involve referring to the database 24 and proposing information corresponding to the content of the utterances to the user 20. 【0037】 (Example of specific processing) If the user wearing the earphones requests a message that will trigger the recall of a specific memory, the specific processing unit 290 may propose one or more messages selected based on the life log to the user who made the request, as information corresponding to the content of the utterance (request). 【0038】 For example, if user 20, wearing earphones 14, tries to recall their memory and asks, "What did I say to person A around [date] at [time]?", the identification processing unit 290, as part of its identification process, inputs this message as a prompt to the data generation model 58. The identification processing unit 290 may refer to the life log in database 24 and, based on the output obtained from the data generation model 58, generate a message such as, "I think you said, 'I found a nice restaurant, let's make a reservation.'" This message may be interpreted as an example of information corresponding to the content of user 20's utterance. 【0039】 For example, if user 20 wearing earphones 14 tries to recall their memory and asks, "Who was I talking to around [date] at [time]?", the identification processing unit 290 will input this message as a prompt to the data generation model 58 as part of its identification process. The identification processing unit 290 may refer to the life log in database 24 and, based on the output obtained from the data generation model 58, generate a message such as, "It seems you were talking with two friends at that time, probably B and C." This message may be interpreted as an example of information corresponding to the content of user 20's utterance. 【0040】 For example, if user 20, wearing earphones 14, tries to recall their emotions and says, "How did I feel when I was talking to person A around [date] at [time]?", the identification processing unit 290, as part of its identification process, inputs this message as a prompt to the data generation model 58. The identification processing unit 290 may refer to the life log in database 24 and, based on the output obtained from the data generation model 58, generate a message such as, "At that time, you were laughing a lot, so it seems you had a good impression of your friend and were very happy." This message may be interpreted as an example of information corresponding to the content of user 20's utterance. 【0041】 (Example of specific processing, part 2) If a user 20 wearing earphones 14 mutters a specific matter as part of their utterance, the specific processing unit 290 may suggest to the user 20 who requested the message, based on their life log, recommended actions for the user 20 regarding that matter, as information corresponding to the content of their utterance (muttering). 【0042】 For example, when user 20 wearing earphones 14 is shopping at a specific retail store and says, "What should I buy?", the specific processing unit 290 inputs this message as a prompt to the data generation model 58 as a specific processing step. The specific processing unit 290 may refer to the life log in the database 24 and, based on the output obtained by the data generation model 58, generate a message such as, "A few months ago, you purchased product A at this store and commented that it wasn't very tasty, so how about purchasing recently released products B and C this time?" This message may be interpreted as an example of information corresponding to the content of user 20's utterance. 【0043】 (Third example of specific processing) As shown in Figure 3D, when user 20, wearing earphones 14, is operating a PC and says, "What was the name of product A that I searched for the day before yesterday?", the identification processing unit 290 inputs this message as a prompt to the data generation model 58 as part of its identification processing. The data generation model 58 refers to the life log in the database 24 and analyzes the video of the PC screen when user 20 was operating it in the past to generate a specific output. Based on the output obtained by the data generation model 58, the identification processing unit 290 may generate a message such as "Product A is ○○○". This message may be interpreted as an example of information corresponding to the content of user 20's utterance. 【0044】 (Fourth example of specific processing) As shown in Figure 3E, if user 20, wearing earphones 14, says "There was a place nearby with a great view, but I wonder where it is?" while cycling, the identification processing unit 290 inputs this message as a prompt to the data generation model 58 as part of its identification process. The data generation model 58 refers to the life log in database 24 and analyzes places previously visited by user 20 and the route to those places to generate a specific output. Based on the output obtained by the data generation model 58, the identification processing unit 290 may generate a message such as "I think it's Cape XX, about 500m from here." This message can be interpreted as an example of information corresponding to the content of user 20's utterance. 【0045】 (Example 5 of specific processing) As shown in Figure 3F, when user 20, wearing earphones 14, meets Mr. X at company A, the company he is visiting, and says, "Can you tell me this person's name?", the identification processing unit 290 inputs this message as a prompt to the data generation model 58 as part of the identification process. The data generation model 58 refers to the life log in database 24 and generates specific output from the history of people that user 20 met when he visited company A. Based on the output obtained from the data generation model 58, the identification processing unit 290 may generate a message such as, "I think his name is ○○." This message may be interpreted as an example of information corresponding to the content of user 20's utterance. 【0046】 As shown in Figure 4, the specific processing unit 290 includes an input unit 291, a processing unit 292, and an output unit 293. 【0047】 The input unit 291 acquires user input received through the earphone 14. Specifically, it acquires the user's voice received through the earphone 14. 【0048】 The processing unit 292 performs specific processing using the data generation model 58. Specifically, it inputs voice from the user into the data generation model 58 and obtains a generation result. More specifically, when it receives an utterance from the user 20 wearing the earphones 14 regarding the user 20's memories or actions, it performs a specific processing step of proposing information corresponding to the content of the utterance to the user 20. 【0049】 The output unit 293 transmits the result of the specific processing to the earphone 14. In the earphone 14, the control unit 46A causes the speaker 40 to output the result of the specific processing. The microphone 38 acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 38 to the data processing unit 12. In the data processing unit 12, the specific processing unit 290 acquires the audio data. 【0050】 Data generation model 58 is a so-called generative AI (Artificial Intelligence). An example of data generation model 58 is ChatGPT (registered trademark) (Internet search).<URL: https: / / openai.com / blog / chatgpt> ), Gemini (registered trademark) (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include those described above. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions shown by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization. 【0051】 Next, the operation of the data processing system 10 will be explained. 【0052】 An example of the flow of a specific processing method will be explained with reference to Figure 5. Note that the flow of a specific processing method shown in Figure 5 is an example of a "data processing method" related to the technology disclosed herein. 【0053】 In step S301, the data processing device 12 receives user data, including sound and images collected by the two earphones 14. 【0054】 In step S302, if the data processing device 12 receives an utterance from the user wearing the earphones 14 regarding the user's memories or actions, it executes a specific process to propose information corresponding to the content of the utterance to the user 20 based on the user's life log. 【0055】 In step S303, the data processing device 12 executes a process to play back the result of a specific process from the speaker 40. 【0056】 [Second Embodiment] Next, a data processing system according to the second embodiment will be described. Since the data processing system has the same configuration as the first embodiment, the same reference numerals are used and their description is omitted. 【0057】 As shown in Figure 4 above, the specific processing unit 290 of the data processing device 12 includes an input unit 291, a processing unit 292, and an output unit 293. 【0058】 The input unit 291 sequentially acquires user data, including biometric information detected by the biometric information sensor 39. 【0059】 Specifically, the input unit 291 acquires biometric information detected by the biometric information sensor 39 included in the earphone 14. 【0060】 The processing unit 292 evaluates the user's state based on biological information and generates music data and vibration pattern data corresponding to the evaluated user state as a result of specific processing. 【0061】 Specifically, the processing unit 292 evaluates the user's state based on the biometric information acquired by the input unit 291, and generates music data and vibration pattern data according to the evaluated user's state by inputting prompts to the data generation model 58. For example, the processing unit 292 inputs the biometric information acquired by the input unit 291 to the data generation model 58, and at the same time inputs the prompt, "This is the user's biometric information. Evaluate the user's state based on this biometric information, generate music data that is optimal for the evaluated user's state, and generate vibration pattern data that matches the playback of the music data." 【0062】 The processing unit 292 may also generate music data and vibration pattern data according to the user's state by inputting a prompt to the data generation model 58 instructing it to generate music data and vibration pattern data according to the user's state based on the biometric information acquired by the input unit 291, the user's life log which records past biometric information, music data, and vibration pattern data, and the user's music preference data. For example, the biometric information acquired by the input unit 291, the user's life log, and the user's music preference data are input to the data generation model 58, and the prompt "These are the user's biometric information, the user's life log, and the user's music preference data. Evaluate the user's state based on this biometric information, and considering the user's life log and the user's music preference data, generate music data that is optimal for the evaluated user's state, and generate vibration pattern data that matches the playback of the music data." is input to the data generation model 58. The life log and music preference data are stored in the database 24 and can be obtained by reading them from the database 24. The data generation model 58 includes a machine learning or deep learning algorithm and is a model trained to evaluate the user's mental and physical state from biometric information. 【0063】 The output unit 293 transmits the generated music data and vibration pattern data to the earphone 14 as a result of specific processing. In the earphone 14, the control unit 46A causes the speaker 40 to play the generated music data and controls the vibration application unit 41 to apply vibration to the user according to the vibration pattern data. As a result, the user 20 can obtain stress reduction and relaxation effects. 【0064】 Furthermore, the processing unit 292 may comprehensively analyze life log and music preference data and personalize music data and vibration pattern data according to the individual characteristics of the user. This allows the user 20 to obtain a personalized therapeutic experience. 【0065】 Furthermore, the processing unit 292 may continuously monitor biological information while providing music and vibration, and perform feedback control to adjust the music data and vibration pattern data in real time according to changes in the user's state. This allows the user 20 to more effectively experience stress reduction and relaxation effects. 【0066】 Next, the operation of the data processing system 10 will be explained. 【0067】 An example of the flow of a specific processing method will be explained with reference to Figure 6. Note that the flow of a specific processing method shown in Figure 6 is an example of a "data processing method" related to the technology disclosed herein. 【0068】 In step S400, the input unit 291 receives user data, which includes biometric information detected by the two earphones 14, a user 20 life log containing past biometric information, music data, and vibration pattern data, and user 20's music preference data. 【0069】 In step S401, the processing unit 292 inputs a prompt to the data generation model 58 instructing it to evaluate the state of user 20 based on the biometric information acquired by the input unit 291, the user 20's life log which contains past biometric information, music data, and vibration pattern data, and the user 20's music preference data, and to generate music data and vibration pattern data corresponding to the evaluated state of user 20. Specifically, it generates a prompt that reads, "The input data includes the user's current biometric information, a life log which includes past biometric information, music data, and vibration pattern data, and music preference data. Please generate music data and vibration pattern data that match this user data." 【0070】 In step S402, the processing unit 292 inputs the acquired user data and the prompt generated in step S401 to the data generation model 58 to generate music data and vibration pattern data. 【0071】 In step S403, the output unit 293 transmits the generated music data and vibration pattern data to the earphone 14. The earphone 14 then uses the control unit 46A to instruct the speaker 40 to play the generated music data and the vibration application unit 41 to apply vibration to the user according to the vibration pattern data. This allows the user 20 to experience stress reduction and relaxation. 【0072】 In step S404, the processing unit 292 determines whether or not to terminate the process. For example, if playback of music data has finished, or if the user has given an instruction to terminate playback of music data, it determines to terminate the process and terminates the specific process. On the other hand, if it determines not to terminate the process, it returns to step S400. 【0073】 Through the above process, music data and vibration pattern data corresponding to the user 20's biometric information can be generated. As a result, user 20 can experience stress reduction and relaxation effects. 【0074】 For example, if user 20 is experiencing stress, the biometric information sensor 39 detects an increase in heart rate and changes in skin electrical response as biometric information. Based on the biometric information, the specific processing unit 290 evaluates that user 20 is experiencing stress, and generates slow-tempo music and a gentle vibration pattern that have a high relaxation effect, corresponding to the stressed state, and provides them to user 20. 【0075】 Furthermore, when user 20 puts on the earphones 14 during a break from work, the biometric information sensor 39 detects an increase in heart rate and GSR value. The specific processing unit 290 evaluates that user 20 is experiencing stress and generates slow-tempo, low-frequency music and a gentle vibration pattern that have a high relaxation effect, and provides them to user 20. 【0076】 Furthermore, if user 20 uses earphones 14 while jogging, the biometric information sensor 39 detects an appropriate heart rate and GSR value. The specific processing unit 290 evaluates that user 20 is in an exercise state and generates upbeat music and a rhythmic vibration pattern to enhance motivation, in accordance with user 20's exercise state, and provides them to user 20. 【0077】 The above description primarily focuses on the functions of the data processing device 12 in relation to this disclosure. However, the system related to this disclosure is not necessarily implemented on a server. The system related to this disclosure may be implemented as a general information processing system. This disclosure may be implemented, for example, as a software program that runs on a personal computer or as an application that runs on a smartphone. The method related to this disclosure may be provided to users in SaaS (Software as a Service) format. 【0078】 In the above embodiment, an example was given in which a specific process is performed by a single computer 22. However, the technology of this disclosure is not limited thereto, and a distributed processing method for the specific process may be used, which includes computer 22 and multiple other computers. 【0079】 In the above embodiment, an example was given in which the specific processing program 56 is stored in the storage 32, but the technology of this disclosure is not limited thereto. For example, the specific processing program 56 may be stored in a portable, computer-readable, non-temporary storage medium such as a USB (Universal Serial Bus) memory. The specific processing program 56 stored in the non-temporary storage medium is installed in the computer 22 of the data processing device 12. The processor 28 executes specific processing according to the specific processing program 56. 【0080】 Alternatively, the specific processing program 56 may be stored in a storage device such as a server connected to the data processing device 12 via the network 54, and the specific processing program 56 may be downloaded and installed on the computer 22 in response to a request from the data processing device 12. 【0081】 Furthermore, it is not necessary to store the entirety of the specific processing program 56 in a storage device such as a server connected to the data processing device 12 via the network 54, or to store the entirety of the specific processing program 56 in the storage 32; it is acceptable to store only a portion of the specific processing program 56. 【0082】 The following types of processors can be used as hardware resources to perform specific processing. Examples of processors include a CPU, a general-purpose processor that functions as a hardware resource to perform specific processing by executing software, i.e., a program. Other examples of processors include dedicated electrical circuits, such as FPGAs (Field-Programmable Gate Arrays), PLDs (Programmable Logic Devices), or ASICs (Application Specific Integrated Circuits), which have circuit configurations specifically designed to perform specific processing. All of these processors have built-in or connected memory, and all of them perform specific processing by using memory. 【0083】 The hardware resource that performs a specific process may consist of one of these various processors, or it may consist of a combination of two or more processors of the same or different types (for example, a combination of multiple FPGAs, or a combination of a CPU and an FPGA). Alternatively, the hardware resource that performs a specific process may consist of a single processor. 【0084】 Examples of configurations using a single processor include, firstly, a configuration in which one or more CPUs and software are combined to form a single processor, and this processor functions as a hardware resource that performs a specific process. Secondly, there is a configuration using a processor that realizes the functions of the entire system, including multiple hardware resources that perform a specific process, on a single IC chip, as exemplified by SoCs (System-on-a-chip). In this way, a specific process is realized using one or more of the above types of processors as hardware resources. 【0085】 Furthermore, the hardware structure of these various processors can more specifically utilize electrical circuits that combine circuit elements such as semiconductor devices. Also, the specific processing described above is merely an example. Therefore, it goes without saying that unnecessary steps can be deleted, new steps added, or the processing order rearranged, as long as it does not deviate from the main purpose. 【0086】 The descriptions and illustrations presented above are detailed explanations of the technical aspects of this disclosure and are merely examples of the technical aspects. For example, the above descriptions of the structure, function, operation, and effect are examples of the structure, function, operation, and effect of the technical aspects of this disclosure. Therefore, it goes without saying that you may delete unnecessary parts, add new elements, or replace elements in the descriptions and illustrations presented above, as long as you do not deviate from the essence of the technical aspects of this disclosure. Furthermore, in order to avoid confusion and facilitate understanding of the technical aspects of this disclosure, explanations of common technical knowledge and the like that do not require special explanation to enable the implementation of the technical aspects of this disclosure have been omitted from the descriptions and illustrations presented above. 【0087】 All documents, patent applications, and technical standards described herein are incorporated by reference to the same extent as if each individual document, patent application, and technical standard were specifically and individually noted as being incorporated by reference. 【0088】 Furthermore, the following additional information is disclosed regarding the above explanation. 【0089】 (Note 1) An input unit that receives user data, including biometric information collected by two earphones worn on the user's ears, which include a microphone, speaker, camera, biometric information sensor, and vibration application unit, A processing unit that performs specific processing using a data generation model that generates predetermined inference results according to the user data, An output unit that reproduces the result of the specified processing from the speaker and applies vibration to the vibration application unit, Equipped with, The input unit inputs the biometric information detected by the biometric information sensor as user data. The processing unit evaluates the user's state based on the biological information and inputs a prompt to the data generation model instructing it to generate music data and vibration pattern data corresponding to the evaluated user's state, thereby performing a process to generate the music data and vibration pattern data as a result of the specific processing, as the specific processing. 【0090】 (Note 2) The data processing device according to Appendix 1, wherein the processing unit generates the music data and vibration pattern data as a result of the specific processing by inputting a prompt to the data generation model instructing it to generate music data and vibration pattern data according to the user's state, based on the user data, the user's life log in which past biometric information, music data, and vibration pattern data are recorded, and the user's music preference data. 【0091】 (Note 3) The processing unit comprehensively analyzes the life log and the music preference data and personalizes the music data and vibration pattern data according to the individual characteristics of the user, as described in Appendix 2. 【0092】 (Note 4) The processing unit is a data processing device according to any one of the appendices 1 to 3, wherein the processing unit continuously monitors the biological information while providing music and vibration, and performs feedback control to adjust the music data and vibration pattern data in real time according to changes in the user's state. 【0093】 (Note 5) The data generation model is a model trained to evaluate the user's mental and physical state from the biometric information, as described in any of the appendices 1 to 4. 【0094】 (Note 6) A data processing method in which a computer performs a specific process using a data generation model that inputs user data including biometric information collected by two earphones worn on the user's ears, which include a microphone, speaker, camera, biometric information sensor, and vibration application unit, and generates a predetermined inference result according to the user data, The biological information detected by the biological information sensor is input as user data. The process of generating the music data and vibration pattern data as a result of the specified process is executed by inputting a prompt to the data generation model that instructs it to evaluate the user's state based on the biometric information and generate music data and vibration pattern data corresponding to the evaluated user's state, The process of playing back the result of the specified process from the speaker and applying vibration to the vibration application unit is performed as follows: A data processing method performed by the aforementioned computer. 【0095】 (Note 7) A data processing program that causes a computer to perform specific processing using a data generation model that inputs user data including biometric information collected by two earphones worn on the user's ears, which include a microphone, speaker, camera, biometric information sensor, and vibration application unit, and generates predetermined inference results according to the user data, The biological information detected by the biological information sensor is input as user data. The process of generating the music data and vibration pattern data as a result of the specified process is executed by inputting a prompt to the data generation model that instructs it to evaluate the user's state based on the biometric information and generate music data and vibration pattern data corresponding to the evaluated user's state, The process of playing back the result of the specified process from the speaker and applying vibration to the vibration application unit is performed as follows: A data processing program to be executed by the aforementioned computer. [Explanation of symbols] 【0096】 10 Data Processing Systems 12 Data Processing Devices 14 Earphones 290 Specific Processing Unit 291 Input section 292 Processing Unit 293 Output section< / url:>
Claims
[Claim 1] An input unit that receives user data, including biometric information collected by two earphones worn on the user's ears, which include a microphone, speaker, camera, biometric information sensor, and vibration application unit, A processing unit that performs specific processing using a data generation model that generates predetermined inference results according to the user data, An output unit that reproduces the result of the specified processing from the speaker and applies vibration to the vibration application unit, Equipped with, The input unit inputs the biometric information detected by the biometric information sensor as user data. The processing unit evaluates the user's state based on the biological information and inputs a prompt to the data generation model instructing it to generate music data and vibration pattern data corresponding to the evaluated user's state, thereby performing a process to generate the music data and vibration pattern data as a result of the specific processing, as the specific processing. [Claim 2] The data processing apparatus according to claim 1, wherein the processing unit generates the music data and vibration pattern data as a result of the specific processing by inputting a prompt to the data generation model instructing it to generate music data and vibration pattern data according to the state of the user, based on the user data, the user's life log in which past biometric information, music data, and vibration pattern data are recorded, and the user's music preference data. [Claim 3] The data processing apparatus according to claim 2, wherein the processing unit comprehensively analyzes the life log and the music preference data and personalizes the music data and vibration pattern data according to the individual characteristics of the user. [Claim 4] The data processing apparatus according to claim 1, wherein the processing unit continuously monitors the biological information while providing music and vibration, and performs feedback control to adjust the music data and vibration pattern data in real time according to changes in the user's state. [Claim 5] The data processing apparatus according to claim 1, wherein the data generation model includes a machine learning or deep learning algorithm and is a model trained to evaluate the mental and physical state of a user from the biometric information. [Claim 6] A data processing method in which a computer performs a specific processing using a data generation model that inputs user data including biometric information collected by two earphones worn on the user's ears, which include a microphone, speaker, camera, biometric information sensor, and vibration application unit, and generates a predetermined inference result according to the user data, The biological information detected by the biological information sensor is input as user data. The process of generating the music data and vibration pattern data as a result of the specified process is executed by inputting a prompt to the data generation model that instructs it to evaluate the user's state based on the biometric information and generate music data and vibration pattern data corresponding to the evaluated user's state, The process of playing back the result of the specified process from the speaker and applying vibration to the vibration application unit is performed as follows: A data processing method performed by the aforementioned computer. [Claim 7] A data processing program that causes a computer to perform specific processing using a data generation model that inputs user data including biometric information collected by two earphones worn on the user's ears, which include a microphone, speaker, camera, biometric information sensor, and vibration application unit, and generates predetermined inference results according to the user data, The biological information detected by the biological information sensor is input as user data. The process of generating the music data and vibration pattern data as a result of the specified process is executed by inputting a prompt to the data generation model that instructs it to evaluate the user's state based on the biometric information and generate music data and vibration pattern data corresponding to the evaluated user's state, The process of playing back the result of the specified process from the speaker and applying vibration to the vibration application unit is performed as follows: A data processing program to be executed by the aforementioned computer.