Earphones, data processing device, data processing method, and data processing program

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
Earphones with data collection and processing units allow for the estimation and reproduction of emotions and intentions through vibrations, enhancing communication by conveying feelings beyond words or images.

JP2026096808APending Publication Date: 2026-06-15SOFTBANK GROUP CORP

View PDF 1 Cites 0 Cited by

Patent Information

Authority / Receiving Office: JP · JP
Patent Type: Applications
Current Assignee / Owner: SOFTBANK GROUP CORP
Filing Date: 2024-12-03
Publication Date: 2026-06-15

Application Information

Patent Timeline

03 Dec 2024

Application

15 Jun 2026

Publication

JP2026096808A

IPC: G10K15/04; H04R1/10; G10L13/00; G06F3/01; G06F3/16

AI Tagging

Application Domain

Input/output for user-computer interaction Earpiece/earphone attachments

Explore More Agents

Novelty Search
Search existing technologies and assess novelty
↗
FTO
Analyze whether a product may infringe others' patents
↗
Design FTO
Check prior-design risk for exterior design
↗
Drafting
Draft patent application text based on a technical solution
↗
Find Solutions with TRIZ
Generate feasible solution to solve your technical challenge
↗

Similar Technology Patents

Get free access to AI patent search and analysis

Check patentability, review prior art and ask IP Agent with full patent context.

Smart Images

Figure 2026096808000001_ABST

Patent Text Reader

Abstract

We provide earphones, a data processing device, a data processing method, and a program that enable sharing emotions and feelings with someone at a distance that cannot be fully conveyed through words or images alone. [Solution] The earphone comprises a housing worn on the ear of a transmitting second user who communicates with a receiving first user; a data collection unit that collects operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on the housing; a processing unit that performs identification processing using a data generation model that generates an estimation result that estimates at least one of the transmitting second user's intentions and emotions based on the operation data, and generates vibration data that causes vibrations corresponding to the result of the identification processing to be generated in the earphone worn on the receiving first user's ear; and an output unit that transmits the vibration data to the earphone worn on the receiving first user's ear.

Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] The technology of the present disclosure relates to earphones, a data processing device, a data processing method, and a data processing program.

Background Art

[0002] Patent Document 1 discloses a persona chatbot control method performed by at least one processor, including steps of receiving a user utterance, adding the user utterance to a prompt including an instruction sentence related to an explanation of a chatbot character, encoding the prompt, and inputting the encoded prompt into a language model to generate a chatbot utterance in response to the user utterance.

Prior Art Documents

Patent Documents

[0003]

Patent Document 1

Summary of the Invention

Problems to be Solved by the Invention

[0004] However, since the prior art mainly depends on vision and hearing as communication means for a distant partner, there is room for improvement in sharing emotions, feelings, etc. that cannot be fully conveyed only by words or images with a distant partner.

Means for Solving the Problems

[0005] A first aspect of the technology of this disclosure is an earphone comprising: a housing worn on the ear of a transmitting second user who communicates with a receiving first user; a data collection unit that collects operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on the housing; a processing unit that performs a specific processing using a data generation model that generates a predetermined estimation result according to the operation data and generates vibration data that causes vibrations to be generated in the earphone worn on the receiving first user's ear according to the result of the specific processing; and an output unit that transmits the vibration data to the earphone worn on the receiving first user's ear, wherein the processing unit performs a process to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data as the specific processing, thereby causing vibrations that reproduce at least one of the intentions and emotions according to the result of the specific processing to be generated in the earphone worn on the receiving first user's ear.

[0006] A second aspect of the technology of the present disclosure is an earphone comprising: a vibrating unit that vibrates a housing worn on the ear of a first receiving user; an input unit that receives operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on an earphone worn on the ear of a second transmitting user who communicates with the first receiving user; a processing unit that performs a specific processing using a data generation model that generates a predetermined estimation result corresponding to the input operation data; and a control unit that generates vibrations in the vibrating unit according to the result of the specific processing, wherein the processing unit performs a process to estimate at least one of the intentions and emotions of the second transmitting user based on the operation data as the specific processing, and the control unit generates vibrations in the vibrating unit that reproduce the intentions and at least one of the emotions according to the result of the specific processing.

[0007] A third aspect of the technology of the present disclosure is a data processing device comprising: an input unit that receives operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on an earphone worn in the ear of a transmitting second user who communicates with a receiving first user who wears an earphone; a processing unit that performs a specific processing using a data generation model that generates a predetermined estimation result according to the input operation data; a control unit that generates vibration data that causes vibrations to be generated in the earphone worn in the ear of the receiving first user according to the result of the specific processing; and an output unit that transmits the vibration data to the earphone worn in the ear of the receiving first user, wherein the processing unit performs a process to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data as the specific processing, and the control unit generates vibration data that reproduces at least one of the intentions and emotions according to the result of the specific processing.

[0008] A fourth aspect of the technology of the present disclosure is a data processing method that includes a process in which at least one processor performs a specific process using a data generation model that generates a predetermined estimation result in accordance with operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on an earphone worn in the ear of a receiving-side first user who is wearing an earphone; generates vibration data that causes vibrations to be generated in the earphone worn in the ear of the receiving-side first user in accordance with the result of the specific process; transmits the vibration data to the earphone worn in the ear of the receiving-side first user; performs a process as the specific process that estimates at least one of the intentions and emotions of the transmitting-side second user based on the operation data; and generates vibration data that reproduces at least one of the intentions and emotions in accordance with the result of the specific process.

[0009] A fifth aspect of the technology of the present disclosure is a data processing program that causes at least one processor to perform a specific process using a data generation model that generates a predetermined estimation result in accordance with operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on an earphone worn in the ear of a receiving-side first user who is communicating with the earphone; generate vibration data that causes vibrations in the earphone worn in the ear of the receiving-side first user in accordance with the result of the specific process; transmit the vibration data to the earphone worn in the ear of the receiving-side first user; perform a process as the specific process that estimates at least one of the intentions and emotions of the transmitting-side second user based on the operation data; and generate vibration data that reproduces at least one of the intentions and emotions in accordance with the result of the specific process. [Brief explanation of the drawing]

[0010] [Figure 1] Figure 1 is a conceptual diagram showing an example of the configuration of a data processing system according to the first embodiment of this disclosure. [Figure 2] Figure 2 is a conceptual diagram showing an example of the main functions of a data processing device and earphone according to the first embodiment. [Figure 3A] Figure 3A shows an example of the configuration of earphones according to the first embodiment. [Figure 3B] Figure 3B shows the state in which a user is wearing the earphones according to the first embodiment. [Figure 3C] Figure 3C is a diagram illustrating the field of view of camera 42. [Figure 3D] Figure 3D shows a user wearing the earphones according to the first embodiment. [Figure 3E] Figure 3E shows a user wearing the earphones according to the first embodiment. [Figure 3F] Figure 3F shows the state in which a user is wearing the earphones according to the first embodiment. [Figure 4]FIG. 4 schematically shows the functional configuration of the specific processing unit of the data processing apparatus according to the first embodiment. [Figure 5] FIG. 5 schematically shows an example of the operation flow of the specific processing by the data processing apparatus according to the first embodiment. [Figure 6] FIG. 6 is a conceptual diagram showing an example of the configuration of the data processing system according to the second embodiment of the present disclosure. [Figure 7] FIG. 7 is a conceptual diagram showing an example of the main functional parts of the data processing apparatus and the earphone according to the second embodiment. [Figure 8] FIG. 12 schematically shows the functional configuration of the specific processing unit of the data processing apparatus according to the second embodiment. [Figure 9] FIG. 15 schematically shows an example of the operation flow of the specific processing by the data processing apparatus according to the second embodiment. [Figure 10] FIG. 18 is a conceptual diagram showing an example of the configuration of the data processing system according to the third embodiment of the present disclosure. [Figure 11] FIG. 21 is a conceptual diagram showing an example of the main functional parts of the data processing apparatus and the earphone according to the third embodiment. [Figure 12] FIG. 24 schematically shows the functional configuration of the specific processing unit of the data processing apparatus according to the third embodiment. [Figure 13] FIG. 27 schematically shows an example of the operation flow of the specific processing by the data processing apparatus according to the third embodiment. [Figure 14] FIG. 30 is a conceptual diagram showing an example of the configuration of the data processing system according to the fourth embodiment of the present disclosure. [Figure 15] FIG. 33 is a conceptual diagram showing an example of the main functional parts of the data processing apparatus and the earphone according to the fourth embodiment. [Figure 16] FIG. 36 schematically shows the functional configuration of the specific processing unit of the data processing apparatus according to the fourth embodiment. [Figure 17] FIG. 39 schematically shows an example of the operation flow of the specific processing by the data processing apparatus according to the fourth embodiment.

BEST MODE FOR CARRYING OUT THE INVENTION

[0011] Hereinafter, an example of an embodiment of a data processing apparatus, a data processing method, and a program according to the technology of the present disclosure will be described with reference to the accompanying drawings.

[0012] First, the terms used in the following description will be explained.

[0013] In the following embodiments, the signed processor (hereinafter simply referred to as "processor") may be one arithmetic unit or a combination of a plurality of arithmetic units. Also, the processor may be one type of arithmetic unit or a combination of a plurality of types of arithmetic units. Examples of arithmetic units include a CPU (Central Processing Unit), a GPU (Graphics Processing Unit), a GPGPU (General-Purpose computing on Graphics Processing Units), or an APU (Accelerated Processing Unit), etc.

[0014] In the following embodiments, the signed RAM (Random Access Memory) is a memory in which information is temporarily stored and is used as a work memory by the processor.

[0015] In the following embodiments, the signed storage is one or more non-volatile storage devices that store various programs and various parameters, etc. Examples of non-volatile storage devices include flash memory (SSD (Solid State Drive)), magnetic disks (e.g., hard disks), or magnetic tapes, etc.

[0016] In the following embodiments, the signed communication interface (I / F) is an interface that includes a communication processor and an antenna, etc. The communication interface manages communication between multiple computers. Examples of communication standards applicable to the communication interface include wireless communication standards such as 5G (5th Generation Mobile Communication System), Wi-Fi (registered trademark), or Bluetooth (registered trademark).

[0017] In the following embodiments, "A and / or B" is synonymous with "at least one of A and B." That is, "A and / or B" means that it may be A alone, or B alone, or a combination of A and B. Furthermore, in this specification, the same concept as "A and / or B" applies when expressing three or more things linked by "and / or."

[0018] [First Embodiment] Figure 1 shows an example of the configuration of the data processing system 10 according to the first embodiment.

[0019] As shown in Figure 1, the data processing system 10 includes a data processing device 12 and earphones 14. An example of the data processing device 12 is a server. In the first embodiment, the data processing device 12 is an example of a "data processing device" according to the technology of this disclosure.

[0020] The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network).

[0021] The earphone 14 includes a computer 36, a microphone 38, a speaker 41, a camera 42, and a communication interface 44. The computer 36 includes a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The microphone 38, speaker 41, and camera 42 are also connected to the bus 52.

[0022] The microphone 38 receives voice signals from the user 20 and accepts instructions from the user 20. The microphone 38 captures the voice signals from the user 20, converts the captured voice into audio data, and outputs it to the processor 46. The speaker 41 outputs audio according to the instructions from the processor 46. Hereafter, the microphone 38 may be simply referred to as the microphone 38.

[0023] Camera 42 is a small digital camera equipped with an optical system including a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor, and captures images of the area around the user 20 (for example, an imaging range defined by a field of view equivalent to the width of a typical healthy person's field of vision).

[0024] Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various types of information between processor 46 and processor 28 via network 54.

[0025] Figure 2 shows an example of the main functions of the data processing device 12 and the earphone 14.

[0026] As shown in Figure 2, in the data processing device 12, specific processing is performed by the processor 28. A specific processing program 56 is stored in the storage 32. The specific processing program 56 is an example of a "data processing program" related to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 according to the specific processing program 56 executed on the RAM 30.

[0027] The storage 32 stores the data generation model 58. The data generation model 58 is used by the specific processing unit 290.

[0028] (Earphones 14) In the earphone 14, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48.

[0029] The earphone 14 may be interpreted as a canal-type earphone that is fitted into the ear canal of the user 20, as shown in Figure 3A. However, the earphone 14 is not limited to a canal type; it may also be an inner-ear type earphone that is inserted into the inner ear of the user 20, or a headphone type earphone that covers the entire ear of the user 20. Each of the two earphones 14 is equipped with a microphone 38, a speaker 41, and a camera 42. The sound and images collected by the two earphones 14 fitted into the ears of the user 20 may be recorded as a life log in the database 24.

[0030] The life log can be interpreted as a history of the user 20's actions in daily life, and may include sounds and images associated with the user 20, specifically sounds collected by the microphone 38 and images taken by the camera 42 during daily life. The life log may record sounds and images associated with the user 20, along with the date, time, and location in which they were acquired.

[0031] The sounds collected by the microphone 38 may include the voice of the person the user 20 is talking to, and sounds that occur around the user 20 while walking or cycling (such as the sound of cars driving, birds chirping, the babbling of a stream, and the sound of trees swaying in the wind).

[0032] As shown in Figure 3C, the camera 42 may capture images of the scenery within its field of view that is in front of the user 20, or it may capture images of scenery within its field of view that is not in front of the user 20, for example, to the side, behind, below, or above the user 20. The images captured by the camera 42 may include images of the person the user 20 is talking to, the scenery around the user 20 when they are walking or cycling, and images of the pet the user 20 is walking with.

[0033] Since each of the two earphones 14 is equipped with a camera 42, the two earphones 14 worn on the user's ears 20 are positioned at a specific distance apart, one on the left ear and the other on the right ear, as shown in Figure 3B. Therefore, compared to cases where two cameras are arranged side by side in a single housing, such as in a video camera, the spacing between the two cameras 42 can be increased, making 3D sensing easier. 3D sensing can be interpreted as measuring three-dimensional shapes.

[0034] Furthermore, when the two earphones 14 are placed in the user 20's ears, the two cameras 42 are positioned close to the user 20's left and right eyes, allowing images (captured images) that are nearly identical to those seen with the naked eye to be recorded as a life log in the database 24. Consequently, in specific processing, it becomes easier to reproduce information corresponding to inquiries from the user 20, that is, information corresponding to the content of the user 20's speech.

[0035] While the two earphones 14 are attached to the user 20, all or part of the images captured by the camera 42 may be recorded in the database 24 as a life log. Specifically, when the two earphones 14 are attached to the user 20, the recording of images captured by the camera 42 to the database 24 may begin, and when the two earphones 14 are removed from the user 20, the recording of those images to the database 24 may end.

[0036] While the two earphones 14 are worn by the user 20, all or part of the sound collected by the microphone 38 may be recorded as a lifelog in the database 24. Specifically, when the two earphones 14 are worn by the user 20, the recording of the sound collected by the microphone 38 to the database 24 may begin, and when the two earphones 14 are removed from the user 20, the recording of the sound to the database 24 may end.

[0037] Next, we will describe the processing of the specific processing unit 290 when the data processing device 12 receives an utterance from the user 20 wearing the earphones 14 regarding the user 20's memories or actions, and performs specific processing to propose information corresponding to the content of the user 20's utterance to the user 20.

[0038] (Specific processing) In the first embodiment, the identification process uses a data generation model that inputs user data and generates predetermined estimation results corresponding to the input user data. Specifically, in the identification process, when utterances related to the user 20's memories or actions are received as user data from a user 20 wearing earphones 14, the process is executed to propose information corresponding to the content of the utterances to the user 20 by referring to the database 24. Specifically, after a life log is recorded in the database 24, if the user 20 wearing earphones 14 makes an utterance related to the user 20's memories or actions, the identification process may be executed to propose information corresponding to the content of the utterances to the user 20 by referring to the database 24.

[0039] (Example of specific processing) If the user wearing the earphones requests a message that will trigger the recall of a specific memory, the specific processing unit 290 may propose one or more messages selected based on the life log to the user who made the request, as information corresponding to the content of the utterance (request).

[0040] For example, if user 20, wearing earphones 14, tries to recall their memory and asks, "What did I say to person A around [date] at [time]?", the identification processing unit 290, as part of its identification process, inputs this message as a prompt to the data generation model 58. The identification processing unit 290 may refer to the life log in database 24 and, based on the output obtained from the data generation model 58, generate a message such as, "I think you said, 'I found a nice restaurant, let's make a reservation.'" This message may be interpreted as an example of information corresponding to the content of user 20's utterance.

[0041] For example, if user 20 wearing earphones 14 tries to recall their memory and asks, "Who was I talking to around [date] at [time]?", the identification processing unit 290 will input this message as a prompt to the data generation model 58 as part of its identification process. The identification processing unit 290 may refer to the life log in database 24 and, based on the output obtained from the data generation model 58, generate a message such as, "It seems you were talking with two friends at that time, probably B and C." This message may be interpreted as an example of information corresponding to the content of user 20's utterance.

[0042] For example, if user 20, wearing earphones 14, tries to recall their emotions and says, "How did I feel when I was talking to person A around [date] at [time]?", the identification processing unit 290, as part of its identification process, inputs this message as a prompt to the data generation model 58. The identification processing unit 290 may refer to the life log in database 24 and, based on the output obtained from the data generation model 58, generate a message such as, "At that time, you were laughing a lot, so it seems you had a good impression of your friend and were very happy." This message may be interpreted as an example of information corresponding to the content of user 20's utterance.

[0043] (Second example of specific processing) If a user 20 wearing earphones 14 mutters a specific matter as part of their utterance, the specific processing unit 290 may suggest to the user 20 who requested the message, based on their life log, recommended actions for the user 20 regarding that matter, as information corresponding to the content of their utterance (muttering).

[0044] For example, when user 20 wearing earphones 14 is shopping at a specific retail store and says, "What should I buy?", the specific processing unit 290 inputs this message as a prompt to the data generation model 58 as a specific processing step. The specific processing unit 290 may refer to the life log in the database 24 and, based on the output obtained by the data generation model 58, generate a message such as, "A few months ago, you purchased product A at this store and commented that it wasn't very tasty, so how about purchasing recently released products B and C this time?" This message may be interpreted as an example of information corresponding to the content of user 20's utterance.

[0045] (Third example of specific processing) As shown in Figure 3D, when user 20, wearing earphones 14, is operating a PC and says, "What was the name of product A that I searched for the day before yesterday?", the identification processing unit 290 inputs this message as a prompt to the data generation model 58 as part of its identification processing. The data generation model 58 refers to the life log in the database 24 and analyzes the video of the PC screen when user 20 was operating it in the past to generate a specific output. Based on the output obtained by the data generation model 58, the identification processing unit 290 may generate a message such as "Product A is ○○○". This message may be interpreted as an example of information corresponding to the content of user 20's utterance.

[0046] (Fourth example of specific processing) As shown in Figure 3E, if user 20, wearing earphones 14, says "There was a place nearby with a great view, but I wonder where it is?" while cycling, the identification processing unit 290 inputs this message as a prompt to the data generation model 58 as part of its identification process. The data generation model 58 refers to the life log in database 24 and analyzes places previously visited by user 20 and the route to those places to generate a specific output. Based on the output obtained by the data generation model 58, the identification processing unit 290 may generate a message such as "I think it's Cape XX, about 500m from here." This message can be interpreted as an example of information corresponding to the content of user 20's utterance.

[0047] (Example 5 of specific processing) As shown in Figure 3F, when user 20, wearing earphones 14, meets Mr. X at company A, the company he is visiting, and says, "Can you tell me this person's name?", the identification processing unit 290 inputs this message as a prompt to the data generation model 58 as part of the identification process. The data generation model 58 refers to the life log in database 24 and generates specific output from the history of people that user 20 met when he visited company A. Based on the output obtained from the data generation model 58, the identification processing unit 290 may generate a message such as, "I think his name is ○○." This message may be interpreted as an example of information corresponding to the content of user 20's utterance.

[0048] As shown in Figure 4, the specific processing unit 290 includes an input unit 291, a processing unit 292, and an output unit 293.

[0049] The input unit 291 acquires user input received through the earphone 14. Specifically, it acquires the user's voice received through the earphone 14.

[0050] The processing unit 292 performs specific processing using the data generation model 58. Specifically, it inputs voice from the user into the data generation model 58 and obtains a generation result. More specifically, when it receives an utterance from the user 20 wearing the earphones 14 regarding the user 20's memories or actions, it performs a specific processing step of proposing information corresponding to the content of the utterance to the user 20.

[0051] The output unit 293 transmits the result of the specific processing to the earphone 14. In the earphone 14, the control unit 46A causes the speaker 41 to output the result of the specific processing. The microphone 38 acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 38 to the data processing unit 12. In the data processing unit 12, the specific processing unit 290 acquires the audio data.

[0052] Data generation model 58 is a type of so-called generative AI (Artificial Intelligence). One example of data generation model 58 is ChatGPT (Internet search<URL: https: / / openai.com / blog / chatgpt> ), Gemini (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include those described above. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and estimation data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 estimates the input estimation data according to the instructions shown by the prompts, and outputs the estimation results in data formats such as audio data and text data. Here, estimation refers to, for example, analysis, classification, prediction, and / or summarization.

[0053] Next, the operation of the data processing system 10 will be explained.

[0054] An example of the flow of a specific processing method will be explained with reference to Figure 5. Note that the flow of a specific processing method shown in Figure 5 is an example of a "data processing method" related to the technology disclosed herein.

[0055] In step S300, the data processing device 12 receives user data, including sound and images collected by the two earphones 14.

[0056] In step S302, if the data processing device 12 receives an utterance from the user wearing the earphones 14 regarding the user's memories or actions, it executes a specific process to propose information to the user 20 that corresponds to the content of the utterance, based on the user's life log.

[0057] In step S303, the data processing device 12 executes a process to play back the result of a specific process from the speaker 41.

[0058] The following describes the second to fourth embodiments for sharing emotions and feelings that cannot be fully conveyed through words or images alone between people who are far apart. In the second to fourth embodiments, functions and configurations that are the same as or equivalent to those in the first embodiment are denoted by the same reference numerals as in the first embodiment, and redundant explanations are omitted.

[0059] [Second Embodiment] (Summary of the second embodiment) The second embodiment describes a configuration in which the transmitting side performs operation data, emotion determination, and vibration data generation and transmission, and the receiving side generates vibration. In the second embodiment, as shown in Figure 6, an example is described in which an earphone 14-2 worn by a specific receiving first user (user 20) receives vibration data transmitted from an earphone 14-2 worn by a transmitting second user (user 20A) other than the receiving first user, and vibrates the housing 14a of the earphone 14-2 worn by the receiving first user based on the vibration data. The vibration data may be interpreted as data that reproduces at least one of the intentions and emotions of the transmitting second user as vibration in the earphone 14-2 worn by the receiving first user. In the following explanation, for the sake of simplicity, the earphone 14-2 may be simply referred to as the earphone.

[0060] When the sending user (second user) manipulates their own earphones with their fingers, adjusting the intensity and rhythm according to their mood and the message they want to convey, the receiving user (first user)'s earphones vibrate in conjunction with that manipulation. For example, if the sending user (second user) taps their own earphones at a steady rhythm to signal to the receiving user (first user) that they are about to begin transmitting a specific message, the receiving user's earphones will vibrate at the same rhythm. For example, if the sending user (second user) swipes their own earphones to convey a specific message (such as being tired or irritated), the receiving user's earphones will also vibrate intermittently in conjunction with the swipe. For example, if the sending user (second user) long-presses their own earphones to indicate that something urgent has happened, the receiving user's earphones will vibrate continuously for the duration of the long-press.

[0061] In this way, by having the first user's earphones vibrate in conjunction with the emotions and circumstances of the second user (sender), emotions and feelings that cannot be fully conveyed through words alone can be shared as vibrations between the first user and the second user (sender), enabling more intimate communication. Furthermore, even in situations where speaking aloud is not possible, such as during a meeting, and operating the screen of a terminal device such as a smartphone is difficult, communication can still take place between the first user and the second user (sender).

[0062] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, real-time communication with the other party is possible even in situations where voice communication is difficult, such as when there are many cars around the receiving user (first user) or when the receiving user (first user) is in a crowded place.

[0063] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, the receiving user (first user) can intuitively understand the emotions and intentions of the transmitting user (second user) without losing concentration, even while performing specific tasks such as desk work, attending meetings, or driving a car, compared to communicating by voice.

[0064] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, even in situations where conversation is difficult, such as when the receiving user (first user) is in a quiet space, there is no risk of people around the receiving user (first user) knowing the emotions of the transmitting user (second user).

[0065] In particular, if the earphones are of the in-ear (open-type) design, there is a risk that sound may leak to the surroundings of the receiving user (first user). In contrast, according to the second embodiment, even if the earphones are of the open-type design, there is no risk that people around the receiving user (first user) will be able to hear the emotions of the transmitting user (second user). The same applies to canal-type earphones and in-ear type earphones.

[0066] In the second embodiment, in addition to the emotions of the transmitting second user, information about the transmitting second user's physical condition, such as body temperature, pulse rate, and blood pressure fluctuations, may also be communicated via vibration. For example, the receiving first user can be notified by the vibration of the earphones that the transmitting second user's pulse rate is increasing due to exercise.

[0067] In the second embodiment, the earphone of the receiving first user may communicate directly wirelessly with the earphone of the transmitting second user.

[0068] In the second embodiment, the case where the terminal device on the transmitting side (second user) that transmits operation data is an earphone is described, but the terminal device on the transmitting side (second user) may be a terminal device other than an earphone, such as a smartphone or a laptop computer.

[0069] (Specific example of the configuration of the second embodiment) Figure 6 shows an example of the configuration of the data processing system 10-2 according to the second embodiment. The data processing system 10-2 may include a data processing device 12 and a plurality of earphones.

[0070] User 20, who is wearing one of the earphones, may be interpreted as the "receiving first user" in relation to the technology of this disclosure. User 20A, who is wearing the other earphone, may be interpreted as the "transmitting second user" in relation to the technology of this disclosure.

[0071] Each earphone shown in Figure 6 may include a processor 46-2, RAM 48, storage 50-2, camera 42, microphone 38, touch sensor 39, vibrator 43, speaker 41, and communication I / F 44.

[0072] The touch sensor 39 may detect at least one operation performed on the earphone worn by the transmitting second user, such as a swipe, tap, or long press. The transmitting second user's earphone may detect the operation using the touch sensor 39, generate operation data including information indicating the detected operation, and generate vibration data based on the generated operation data. Furthermore, it may transmit the vibration data to the receiving first user's earphone. The vibration data may be interpreted as data that causes the processor 46-2 to perform a specific processing using a data generation model that generates a predetermined estimation result according to the operation data, and then generates vibrations in the earphone worn in the receiving first user's ear according to the result of the specific processing.

[0073] The second user's earphones may transmit vibration data wirelessly directly to the first user's earphones, that is, without going through the data processing device 12.

[0074] The earphones of the receiving first user may perform a specific process to estimate at least one of the intentions and emotions of the transmitting second user based on the received vibration data, and may generate vibrations corresponding to the result of this specific process using the vibration unit 43 inside the earphones worn by the receiving first user.

[0075] Figure 7 shows an example of the main functions of the data processing device 12 and the earphone. In the earphone, data collection processing and specific processing are performed by the processor 46-2.

[0076] Storage 50-2 stores the data acquisition program 60-2, as well as the specific processing program 61-2 and the data generation model 62-2. The specific processing program 61-2 and the data generation model 62-2 are used for specific processing in the specific processing unit 46B-2.

[0077] The processor 46-2 reads the data acquisition program 60-2 from the storage 50-2 and executes the read data acquisition program 60-2 on the RAM 48. The data acquisition process is realized by the processor 46-2 operating as a control unit 46A-2 according to the data acquisition program 60-2 executed on the RAM 48.

[0078] The processor 46-2 reads a specific processing program 61-2 from the storage 50-2 and executes the read specific processing program 61-2 on the RAM 48. The specific processing is realized by the processor 46-2 operating as a specific processing unit 46B-2 according to the specific processing program 61-2 executed on the RAM 48.

[0079] Data generation model 62-2 is a so-called generative AI. An example of data generation model 62-2 is ChatGPT (registered trademark) (Internet search).<URL: https: / / openai.com / blog / chatgpt> ), Gemini (registered trademark) (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 62-2 is obtained by performing deep learning on a neural network. The data generation model 62-2 is input with prompts containing instructions, and estimation data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 62-2 estimates the input estimation data according to the instructions shown by the prompts, and outputs the estimation results in data formats such as audio data and text data. Here, estimation refers to, for example, analysis, classification, prediction, and / or summarization.

[0080] As shown in Figure 8, the control unit 46A-2 may include a data acquisition unit 100-2. The specific processing unit 46B-2 may include an input unit 200-2, a processing unit 201-2, and an output unit 202-2.

[0081] The data acquisition unit 100-2 may collect the outputs of the microphone 38, touch sensor 39, and camera 42 shown in Figure 6, and may also collect the operation data described above. As described above, the operation data may include information indicating the operation detected by the touch sensor 39.

[0082] The operations include swiping, tapping, and long-pressing on the earphones. A swipe operation can be interpreted as, for example, placing a finger on the touch sensor 39 and sliding that finger in any direction. A tap operation can be interpreted as, for example, briefly touching the touch sensor 39 with a finger and then releasing it. A tap operation may include a double tap, which is two quick taps in quick succession. A long-press operation can be interpreted as keeping a finger in contact with the touch sensor 39 and pressing down without lifting it.

[0083] For example, if user 20A, who is attending the same meeting as user 20, comes up with a good idea and wants to inform user 20 of it, user 20A can perform a tap operation, for example. This generates operation data indicating that it is a tap operation, and vibration data corresponding to this operation data is sent to the earphones.

[0084] For example, if user 20A becomes upset and irritated during a meeting, user 20A performs a double-tap operation to inform user 20 that they are in a bad mood. This generates operation data indicating a double-tap operation, and vibration data corresponding to this operation data is sent to the earphone.

[0085] (Processing section 201-2) The data collected by the data collection unit 100-2 may be input to the processing unit 201-2 via the input unit 200-2. The processing unit 201-2 may perform specific processing using a data generation model 62-2 (generation AI model) that generates a predetermined estimation result according to the input operation data. The processing unit 201-2 may generate vibration data that causes vibrations corresponding to the result of the specific processing to be generated in the earphones worn on the ears of the receiving first user. For example, the processing unit 201-2 may perform a process as specific processing to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data, thereby generating vibrations in the earphones worn on the ears of the receiving first user that reproduce at least one of the intentions and emotions of the transmitting second user according to the result of the specific processing.

[0086] Specifically, the processing unit 201-2 may use the data generation model 62-2 to estimate the user 20A's emotions, intentions, etc., based on information contained in the operation data, such as the strength of the touch operation, the rhythm of the touch operation, and the duration of the touch operation.

[0087] For example, if user 20A taps the earphones they are wearing three times, the processing unit 201-2 recognizes to a pre-configured user 20 that a situation has occurred where user 20 should be alerted to user 20A. In this case, the processing unit 201-2 may generate vibration data indicating a specific vibration period, specific vibration intensity, and specific number of vibrations, and transmit it to the output unit 202-2, in order to accurately convey user 20A's intention. Specifically, the processing unit 201-2 may transmit vibration data to the output unit 202-2 that generates the lowest vibration level three times at one-second intervals. This allows, for example, user 20A, who is participating in a meeting, to quickly inform user 20, who is also participating in the same meeting room, that user 20A wants to convey some kind of signal, instead of using a phone call or text message.

[0088] For example, if user 20A presses and holds the earphones they are wearing, for instance, if the housing 14a is pressed by a finger for 5 seconds, the processing unit 201-2 recognizes that user 20A is in a situation where they need to communicate an urgent matter or something important quickly to a pre-configured user 20. In this case, the processing unit 201-2 may generate vibration data indicating a specific vibration period, specific vibration intensity, and specific number of vibrations, and transmit it to the output unit 202-2 in order to accurately convey user 20A's intentions. Specifically, the processing unit 201-2 may transmit vibration data to the output unit 202-2 that generates the highest vibration level for 3 seconds continuously, followed by 10 consecutive vibrations with a 1-second pause in between. This allows, for example, user 20 to be quickly informed that user 20A needs to communicate an urgent matter, instead of using a phone call or text message.

[0089] The processing unit 201-2 may perform specific processing considering the operation history of the housing 14a worn on the ear of the second transmitting user.

[0090] The processing unit 201-2 may generate audio data, while generating vibrations, that, depending on the result of a specific processing, provides at least one of the intentions and emotions of the transmitting second user as voice guidance, to be played through earphones worn in the ears of the receiving first user. This allows the receiving first user to understand the content of the vibrations more clearly.

[0091] (Output section 202-2) The output unit 202-2 may transmit vibration data to the earphone 14-2 worn on the ear of the receiving first user. Specifically, the output unit 202-2 may transmit operation data wirelessly directly to the earphone 14-2 worn on the ear of the receiving first user. As a result, vibration data is transmitted in a shorter time compared to when it is transmitted to the earphone 14-2 worn on the ear of the receiving first user via the data processing device 12, so that if the receiving first user is at a distance where they can see the transmitting second user, communication can be established in a shorter time.

[0092] The earphone 14-2 of the receiving first user, which has received the vibration data, may generate vibrations in the vibration unit 43 according to the result of specific processing by the processing unit 201-2. Specifically, the earphone 14-2 of the receiving first user may generate vibration data that reproduces at least one of the user 20A's intentions and emotions based on the data generated according to the result of specific processing, and transmit it to the vibration unit 43.

[0093] For example, if the processing unit 201-2 transmits vibration data to the receiving first user's earphone 14-2 that generates the lowest vibration level three times at one-second intervals, the receiving first user's earphone 14-2 may transmit a pulse-shaped drive signal corresponding to the vibration data to the vibration unit 43 (e.g., a motor with an eccentric rotor). This drive signal is, for example, a rectangular signal with a low voltage level that repeats high and low levels dozens of times within one second, and generates three such signals at one-second intervals.

[0094] Furthermore, if the processing unit 201-2 transmits vibration data to the receiving first user's earphone 14-2, which consists of generating the highest vibration level for 3 seconds continuously, followed by a 1-second pause, and then generating 10 consecutive vibrations, the receiving first user's earphone 14-2 may transmit a pulse-like drive signal corresponding to the vibration data to the vibration unit 43 (for example, a motor with an eccentric rotor). This drive signal is, for example, a signal that repeats high and low levels hundreds of times within 3 seconds, generating a high-voltage rectangular signal 10 times at 1-second intervals.

[0095] Next, with reference to Figure 9, an example of operation related to specific processing by the data processing system 10-2 according to the second embodiment will be described.

[0096] The earphone of the transmitting second user may generate operation data in step S21, estimate emotions based on the operation data in step S22, generate vibration data in step S23, and transmit the vibration data to the receiving first user's earphone in step S24.

[0097] When the receiving first user's earphone receives vibration data in step S25, it may generate vibrations in the vibration unit 43 in step S26 according to the result of a specific processing.

[0098] In addition, in the data processing system 10-2 according to the second embodiment, a special application may be installed on the earphones used by user 20, and the results of the above-mentioned specific processing may be linked to the application. For example, by managing a history of the number of times the transmitting second user touched the earphones and the content of those operations as a log in the application, even if the receiving first user is unable to react when operation data is entered, they can check the history and take an appropriate reaction.

[0099] Furthermore, the operation data may include the intentions of the sending second user, such as being well or wanting to express gratitude, and may also include the emotions of the sending second user, such as joy, sadness, excitement, or a sense of relief.

[0100] As described above, according to the data processing system 10-2 of the second embodiment, earphones can communicate wirelessly with each other directly or via a data processing device, enabling real-time sharing of tactile sensations. Furthermore, because the earphones fit into the ears, tactile sensations through vibration can be felt naturally and directly. Conventional earphones are specialized for playing voice and music, but this disclosure combines vibration functionality with wireless communication to realize tactile sharing with remote locations. In addition, emotions and feelings that cannot be fully conveyed by words or images alone can be shared through touch, improving the quality of communication. It also enables intimate interaction, and tactile sharing makes it possible to have intimate real-time communication with someone far away. Modern digital communication relies mainly on sight and hearing, but it is difficult to fully convey emotions and subtle nuances with such means alone. Especially when communicating with someone in a remote location, the lack of physical contact can result in a lack of intimacy and presence. According to this disclosure, it is possible to provide a new means of communication that allows for sharing tactile sensations with someone far away and conveying emotions at a deeper level. Furthermore, by linking the data processing device 12 and the earphones, it is possible to share complex tactile patterns among multiple users via the data processing device 12 as needed.

[0101] [Third Embodiment] (Summary of the third embodiment) The third embodiment describes a configuration example in which the transmitting side generates and transmits operation data, and the receiving side determines emotion, generates vibration data, and vibrates. In the third embodiment, as shown in Figure 10, an example is described in which an earphone 14-3 worn by a specific receiving side first user (user 20) receives operation data transmitted from an earphone 14-3 worn by a transmitting side second user (user 20A) other than the receiving side first user, and vibrates the housing 14a of the earphone 14-3 worn by the receiving side first user based on the operation data. The operation data may include information indicating the detected operation content. In the following explanation, for simplicity, the earphone 14-3 may be simply referred to as the earphone.

[0102] When the sending user (second user) manipulates their own earphones with their fingers, adjusting the intensity and rhythm according to their mood and the message they want to convey, the receiving user (first user)'s earphones vibrate in conjunction with that manipulation. For example, if the sending user (second user) taps their own earphones at a steady rhythm to signal to the receiving user (first user) that they are about to begin transmitting a specific message, the receiving user's earphones will vibrate at the same rhythm. For example, if the sending user (second user) swipes their own earphones to convey a specific message (such as being tired or irritated), the receiving user's earphones will also vibrate intermittently in conjunction with the swipe. For example, if the sending user (second user) long-presses their own earphones to indicate that something urgent has happened, the receiving user's earphones will vibrate continuously for the duration of the long-press.

[0103] In this way, by having the first user's earphones vibrate in conjunction with the emotions and circumstances of the second user (sender), emotions and feelings that cannot be fully conveyed through words alone can be shared as vibrations between the first user and the second user (sender), enabling more intimate communication. Furthermore, even in situations where speaking aloud is not possible, such as during a meeting, and operating the screen of a terminal device such as a smartphone is difficult, communication can still take place between the first user and the second user (sender).

[0104] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, real-time communication with the other party is possible even in situations where voice communication is difficult, such as when there are many cars around the receiving user (first user) or when the receiving user (first user) is in a crowded place.

[0105] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, the receiving user (first user) can intuitively understand the emotions and intentions of the transmitting user (second user) without losing concentration, even while performing specific tasks such as desk work, attending meetings, or driving a car, compared to communicating by voice.

[0106] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, even in situations where conversation is difficult, such as when the receiving user (first user) is in a quiet space, there is no risk of people around the receiving user (first user) knowing the emotions of the transmitting user (second user).

[0107] In particular, if the earphones are of the in-ear (open-type) design, there is a risk that sound may leak to the surroundings of the receiving user (first user). In contrast, according to the third embodiment, even if the earphones are of the open-type design, there is no risk that people around the receiving user (first user) will be able to hear the emotions of the transmitting user (second user). The same applies to canal-type earphones and in-ear type earphones.

[0108] In the third embodiment, in addition to the emotions of the transmitting second user, information about the transmitting second user's physical condition, such as body temperature, pulse rate, and blood pressure fluctuations, may also be communicated via vibration. For example, the receiving first user can be notified by the vibration of the earphones that the transmitting second user's pulse rate is increasing due to exercise.

[0109] In the third embodiment, the earphone of the receiving first user may communicate directly wirelessly with the earphone of the transmitting second user.

[0110] In the third embodiment, the case where the terminal device on the transmitting side (second user) that transmits operation data is an earphone is described, but the terminal device on the transmitting side (second user) may be a terminal device other than an earphone, such as a smartphone or a laptop computer.

[0111] (Specific example of the configuration of the third embodiment) Figure 10 shows an example of the configuration of a data processing system 10-3 according to the third embodiment. The data processing system 10-3 may include a data processing device 12 and a plurality of earphones.

[0112] User 20, who is wearing one of the earphones, may be interpreted as the "receiving first user" according to the third embodiment of this disclosure. User 20A, who is wearing the other earphone, may be interpreted as the "transmitting second user" according to the third embodiment.

[0113] Each earphone shown in Figure 10 may include a processor 46-3, RAM 48, storage 50-3, camera 42, microphone 38, touch sensor 39, vibrator 43, speaker 41, and communication I / F 44.

[0114] The touch sensor 39 may detect at least one operation performed on the earphone worn by the transmitting second user, such as a swipe, tap, or long press. The transmitting second user's earphone may detect the operation using the touch sensor 39, generate operation data including information indicating the detected operation, and transmit the operation data to the receiving first user's earphone. Specifically, the transmitting second user's earphone may transmit the operation data wirelessly to the receiving first user's earphone directly, that is, without going through the data processing device 12.

[0115] The earphone of the receiving first user, which has received the operation data, may generate vibration data based on the operation data. The vibration data may be interpreted as data that causes the earphone worn in the ear of the receiving first user to generate vibrations corresponding to the result of a specific processing performed by the processor 46-3 using a data generation model that generates a predetermined estimation result according to the operation data.

[0116] The earphones of the receiving first user may perform a specific process to estimate at least one of the intentions and emotions of the transmitting second user based on vibration data, and may generate vibrations corresponding to the result of this specific process using the vibration unit 43 inside the earphones worn by the receiving first user.

[0117] Figure 11 shows an example of the main functions of the data processing device 12 and the earphone. In the earphone, data acquisition processing and specific processing are performed by the processor 46-3.

[0118] Storage 50-3 stores the data acquisition program 60-3, as well as the specific processing program 61-3 and the data generation model 62-3. The specific processing program 61-3 and the data generation model 62-3 are used for specific processing in the specific processing unit 46B-3.

[0119] The processor 46-3 reads the data acquisition program 60-3 from the storage 50-3 and executes the read data acquisition program 60-3 on the RAM 48. The data acquisition process is realized by the processor 46-3 operating as a control unit 46A-3 according to the data acquisition program 60-3 executed on the RAM 48.

[0120] The processor 46-3 reads a specific processing program 61-3 from the storage 50-3 and executes the read specific processing program 61-3 on the RAM 48. The specific processing is realized when the processor 46-3 operates as a specific processing unit 46B-3 according to the specific processing program 61-3 executed on the RAM 48.

[0121] Data generation model 62-3 is a so-called generative AI. An example of data generation model 62-3 is ChatGPT (registered trademark) (Internet search).<URL: https: / / openai.com / blog / chatgpt> ), Gemini (registered trademark) (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 62-3 is obtained by performing deep learning on a neural network. The data generation model 62-3 is input with prompts containing instructions, and estimation data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 62-3 estimates the input estimation data according to the instructions shown by the prompts, and outputs the estimation results in data formats such as audio data and text data. Here, estimation refers to, for example, analysis, classification, prediction, and / or summarization.

[0122] As shown in Figure 12, the control unit 46A-3 may include a data acquisition unit 100-3. The specific processing unit 46B-3 may include an input unit 200-3, a processing unit 201-3, and a control unit 202-3.

[0123] The data acquisition unit 100-3 may collect the outputs of the microphone 38, touch sensor 39, and camera 42 shown in Figure 10, and may also collect the operation data described above. As described above, the operation data may include information indicating the operation detected by the touch sensor 39.

[0124] The operations include swiping, tapping, and long-pressing on the earphones. A swipe operation can be interpreted as, for example, placing a finger on the touch sensor 39 and sliding that finger in any direction. A tap operation can be interpreted as, for example, briefly touching the touch sensor 39 with a finger and then releasing it. A tap operation may include a double tap, which is two quick taps in quick succession. A long-press operation can be interpreted as keeping a finger in contact with the touch sensor 39 and pressing down without lifting it.

[0125] For example, if User 20A, who is attending the same meeting as User 20, comes up with a good idea and wants to inform User 20, User 20A can perform a tap operation, for example. This generates operation data indicating that it is a tap operation, and this operation data is sent to User 20's earphones.

[0126] For example, if user 20A becomes upset and irritated during a meeting, user 20A performs a double-tap operation to inform user 20 that they are in a bad mood. This generates operation data indicating a double-tap operation, and this operation data is sent to user 20's earphones.

[0127] (Processing section 201-3) The data collected by the data acquisition unit 100-3 may be input to the processing unit 201-3 via the input unit 200-3.

[0128] The data acquisition unit 100-3 or the input unit 200-3 may receive operation data transmitted wirelessly from an earphone worn on the ear of the transmitting second user. Specifically, the input unit 200-3 may directly receive operation data wirelessly from an earphone 14-3 worn on the ear of the receiving first user. As a result, the transmitted data is transmitted in a shorter time compared to when the operation data is transmitted via the data processing device 12 to the earphone 14-3 worn on the ear of the receiving first user. Therefore, if the receiving first user is at a distance where they can see the transmitting second user, communication can be established in a shorter time.

[0129] The processing unit 201-3 may perform specific processing using a data generation model 62-3 (generating AI model) that generates a predetermined estimation result according to the input operation data. The processing unit 201-3 may generate vibration data that causes vibrations corresponding to the result of the specific processing to be generated in the earphones worn in the ears of the receiving first user. For example, the processing unit 201-3 may perform a process as specific processing that estimates at least one of the intentions and emotions of the transmitting second user based on the operation data, thereby generating vibrations in the earphones worn in the ears of the receiving first user that reproduce at least one of the intentions and emotions of the transmitting second user according to the result of the specific processing.

[0130] Specifically, the processing unit 201-3 may use the data generation model 62-3 to estimate the user 20A's emotions, intentions, etc., based on information contained in the operation data, such as the strength of the touch operation, the rhythm of the touch operation, and the duration of the touch operation.

[0131] For example, if user 20A taps the earphones they are wearing three times, the processing unit 201-3 recognizes that a situation has occurred where user 20 needs to be noticed by user 20, as set in advance. In this case, the processing unit 201-3 may generate vibration data indicating a specific vibration period, specific vibration intensity, and specific number of vibrations, and transmit it to the control unit 202-3, in order to accurately convey user 20A's intention. Specifically, the processing unit 201-3 may transmit vibration data to the control unit 202-3 that generates the lowest vibration level three times at one-second intervals. The control unit 202-3 then generates vibrations in the vibration unit 43 that reproduce at least one of the intentions and emotions of the transmitting second user, according to the result of the specific processing, i.e., the vibration data. This allows, for example, user 20A, who is participating in a meeting, to quickly inform user 20, who is also participating in the same meeting room, that user 20A wants to convey some kind of signal, instead of using a phone call or text message.

[0132] For example, if user 20A presses and holds the earphones they are wearing, for instance, if the housing 14a is pressed by a finger for 5 seconds, the processing unit 201-3 recognizes that user 20A is in a situation where they need to communicate an urgent matter or something important to a pre-configured user 20. In this case, the processing unit 201-3 may generate vibration data indicating a specific vibration period, specific vibration intensity, and specific number of vibrations, and transmit it to the control unit 202-3, in order to accurately convey user 20A's intentions. Specifically, the processing unit 201-3 may transmit vibration data to the control unit 202-3 that generates the highest vibration level for 3 seconds continuously, followed by 10 consecutive vibrations with a 1-second pause in between. The control unit 202-3 then generates vibrations in the vibration unit 43 that reproduce at least one of the intentions and emotions of the transmitting second user, according to the result of the specific processing, i.e., the vibration data. This allows, for example, to quickly inform user 20 that user 20A has an urgent matter to communicate, instead of using a phone call or text message.

[0133] The processing unit 201-3 may perform specific processing considering the operation history of the transmitting second user on the housing 14a worn on the transmitting second user's ear. For example, if the transmitting second user performs a touch operation at a specific time, the processing unit 201-3 may generate history data that associates the content of that operation with the content of a touch operation performed at a specific time after that time, in chronological order. This history data may be sent to the receiving first user's earphone, and the history content may be played back on the first earphone. The history data may include message information such as, for example, "Mr. / Ms. XX gave you some kind of signal 5 minutes ago and 1 minute ago," or "Mr. / Ms. XX sent you an agreement signal 3 minutes ago." By considering the operation history in this way, it is possible to prevent the first earphone from overlooking the transmitting second user's signals and to confirm what kind of reaction the operation specifically represented.

[0134] While generating vibration data, the processing unit 201-3 may, depending on the result of a specific processing, generate audio data that is played back as voice guidance from the earphones worn by the receiving user 1, representing at least one of the intentions and emotions of the transmitting user 2. This allows the receiving user 1 to understand the content of the vibrations more clearly.

[0135] (Control Unit 202-3) The control unit 202-3 may cause the vibration unit 43 to generate vibrations that reproduce at least one of the intentions and emotions of the transmitting second user, in accordance with the result of a specific process, i.e., vibration data. Specifically, the receiving first user's earphone 14-3 may generate a drive signal that reproduces at least one of the intentions and emotions of user 20A based on the vibration data generated in accordance with the result of the specific process, and transmit it to the vibration unit 43.

[0136] For example, if the processing unit 201-3 generates vibration data that generates the lowest vibration level three times at one-second intervals, the control unit 202-3 may transmit a pulse-shaped drive signal corresponding to the vibration data to the vibration unit 43 (e.g., a motor with an eccentric rotor). This drive signal is, for example, a rectangular signal with a low voltage level that repeats high and low levels dozens of times within one second, and generates three such signals at one-second intervals.

[0137] Furthermore, if the processing unit 201-3 generates vibration data that generates the highest vibration level for 3 seconds continuously, followed by a 1-second pause, and then generates 10 consecutive vibrations, the control unit 202-3 may transmit a pulse-shaped drive signal corresponding to the vibration data to the vibration unit 43 (for example, a motor with an eccentric rotor). This drive signal is, for example, a signal that repeats high and low levels hundreds of times within 3 seconds, generating a high-voltage rectangular signal 10 times at 1-second intervals.

[0138] Next, with reference to Figure 13, an example of operation related to specific processing by the data processing system 10-3 according to the third embodiment will be described.

[0139] In step S31, the second user's earphone generates operation data, and in step S32, transmits the operation data to the first user's earphone. In step S33, the second user's earphone receives the operation data, estimates emotions based on the operation data in step S34, generates vibration data in step S35, and in step S36, transmits a drive signal to the vibration unit 43, thereby generating vibrations in the vibration unit 43 according to the result of a specific process.

[0140] In addition, in the data processing system 10-3 according to the third embodiment, a special application may be installed on the earphones used by user 20, and the results of the above-mentioned specific processing may be linked to the application. For example, by managing a history of the number of times the transmitting second user touched the earphones and the content of those operations as a log in the application, even if the receiving first user is unable to react when operation data is entered, they can check the history and take an appropriate reaction.

[0141] Furthermore, the operation data may include the intentions of the sending second user, such as being well or wanting to express gratitude, and may also include the emotions of the sending second user, such as joy, sadness, excitement, or a sense of relief.

[0142] As described above, according to the data processing system 10-3 of the third embodiment, earphones can communicate wirelessly with each other directly or via a data processing device, enabling real-time sharing of tactile sensations. Furthermore, because the earphones fit into the ears, tactile sensations through vibration can be felt naturally and directly. Conventional earphones are specialized for playing voice and music, but this disclosure combines vibration functionality with wireless communication to realize tactile sharing with remote locations. In addition, emotions and feelings that cannot be fully conveyed by words or images alone can be shared through touch, improving the quality of communication. It also enables intimate interaction, and tactile sharing makes it possible to have intimate real-time communication with someone far away. Modern digital communication relies mainly on sight and hearing, but it is difficult to fully convey emotions and subtle nuances with such means alone. Especially when communicating with someone in a remote location, the lack of physical contact can result in a lack of intimacy and presence. According to this disclosure, it is possible to provide a new means of communication that allows for sharing tactile sensations with someone far away and conveying emotions at a deeper level. Furthermore, by linking the data processing device 12 and the earphones, it is possible to share complex tactile patterns among multiple users via the data processing device 12 as needed.

[0143] [Fourth Embodiment] (Summary of the fourth embodiment) The fourth embodiment describes a configuration in which operation data is generated and transmitted on the transmitting side, emotion determination, vibration data generation and transmission are performed by the processing unit, and vibration occurs on the receiving side. In the fourth embodiment, as shown in Figure 14, the data processing unit 12-4 receives operation data transmitted from the earphone 14-4 worn by the transmitting second user (user 20A), who is not the receiving first user, and vibrates the housing 14a of the earphone 14-4 worn by the receiving first user based on the operation data. The operation data may include information indicating the detected operation content. In the following explanation, for simplicity, the earphone 14-4 may be simply referred to as the earphone.

[0144] When the sending user (second user) manipulates their own earphones with their fingers, adjusting the intensity and rhythm according to their mood and the message they want to convey, the receiving user (first user)'s earphones vibrate in conjunction with that manipulation. For example, if the sending user (second user) taps their own earphones at a steady rhythm to signal to the receiving user (first user) that they are about to begin transmitting a specific message, the receiving user's earphones will vibrate at the same rhythm. For example, if the sending user (second user) swipes their own earphones to convey a specific message (such as being tired or irritated), the receiving user's earphones will also vibrate intermittently in conjunction with the swipe. For example, if the sending user (second user) long-presses their own earphones to indicate that something urgent has happened, the receiving user's earphones will vibrate continuously for the duration of the long-press.

[0145] In this way, by having the first user's earphones vibrate in conjunction with the emotions and circumstances of the second user (sender), emotions and feelings that cannot be fully conveyed through words alone can be shared as vibrations between the first user and the second user (sender), enabling more intimate communication. Furthermore, even in situations where speaking aloud is not possible, such as during a meeting, and operating the screen of a terminal device such as a smartphone is difficult, communication can still take place between the first user and the second user (sender).

[0146] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, real-time communication with the other party is possible even in situations where voice communication is difficult, such as when there are many cars around the receiving user (first user) or when the receiving user (first user) is in a crowded place.

[0147] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, the receiving user (first user) can intuitively understand the emotions and intentions of the transmitting user (second user) without losing concentration, even while performing specific tasks such as desk work, attending meetings, or driving a car, compared to communicating by voice.

[0148] Furthermore, by conveying the emotions of the transmitting user (second user) to the receiving user (first user) through vibration, even in situations where conversation is difficult, such as when the receiving user (first user) is in a quiet space, there is no risk of people around the receiving user (first user) knowing the emotions of the transmitting user (second user).

[0149] In particular, if the earphones are of the in-ear (open-type) design, there is a risk that sound may leak to the surroundings of the receiving user (first user). In contrast, according to the fourth embodiment, even if the earphones are of the open-type design, there is no risk that people around the receiving user (first user) will be able to hear the emotions of the transmitting user (second user). The same applies to canal-type earphones and in-ear type earphones.

[0150] In the fourth embodiment, in addition to the emotions of the transmitting second user, information about the transmitting second user's physical condition, such as body temperature, pulse rate, and blood pressure fluctuations, may also be communicated via vibration. For example, the receiving first user can be notified by the vibration of the earphones that the transmitting second user's pulse rate is increasing due to exercise.

[0151] In the fourth embodiment, the earphone of the receiving first user may communicate directly wirelessly with the earphone of the transmitting second user.

[0152] In the fourth embodiment, the case where the terminal device on the transmitting side (second user) that transmits operation data is an earphone is described, but the terminal device on the transmitting side (second user) may be a terminal device other than an earphone, such as a smartphone or a laptop computer.

[0153] (Specific example of the configuration of the fourth embodiment) Figure 14 shows an example of the configuration of a data processing system 10-4 according to the fourth embodiment. The data processing system 10-4 may include a data processing device 12 and a plurality of earphones.

[0154] User 20, who is wearing one of the earphones, may be interpreted as the "receiving first user" according to the fourth embodiment of this disclosure. User 20A, who is wearing the other earphone, may be interpreted as the "transmitting second user" according to the fourth embodiment.

[0155] Each earphone shown in Figure 14 may include a processor 46-4, RAM 48, storage 50-4, camera 42, microphone 38, touch sensor 39, vibrator 43, speaker 41, and communication I / F 44.

[0156] The touch sensor 39 may detect at least one operation performed on the earphone worn by the transmitting second user, such as a swipe, tap, or long press. The transmitting second user's earphone may detect the operation using the touch sensor 39, generate operation data including information indicating the detected operation, and transmit the operation data to the data processing system 10-4. Specifically, the transmitting second user's earphone may transmit the operation data to the data processing system 10-4 wirelessly.

[0157] The data processing system 10-4, upon receiving the operation data, may generate vibration data based on the operation data. The vibration data may be interpreted as data that generates vibrations in the earphone worn on the ear of the receiving first user, based on the result of a specific processing performed by the processor 46-4 using a data generation model that generates a predetermined estimation result corresponding to the operation data.

[0158] The data processing system 10-4 may perform a specific process to estimate at least one of the intentions and emotions of the transmitting second user based on the vibration data, and may generate vibrations corresponding to the result of this specific process using the vibration unit 43 inside the earphone worn by the receiving first user.

[0159] Figure 15 shows an example of the main functions of the data processing unit 12 and the earphone. In the earphone, data acquisition processing and specific processing are performed by the processor 46-4.

[0160] Storage 50-4 stores the data collection program 60-4 as well as the specific processing program 61-4. The specific processing program 61-4 is used for specific processing in the specific processing unit 46B-4.

[0161] The processor 46-4 reads the data acquisition program 60-4 from the storage 50-4 and executes the read data acquisition program 60-4 on the RAM 48. The data acquisition process is realized by the processor 46-4 operating as a control unit 46A according to the data acquisition program 60-4 executed on the RAM 48.

[0162] The processor 46-4 reads a specific processing program 61-4 from the storage 50-4 and executes the read specific processing program 61-4 on the RAM 48. The specific processing is realized by the processor 46-4 operating as a specific processing unit 46B-4 according to the specific processing program 61-4 executed on the RAM 48.

[0163] In the data processing device 12-4, data acquisition processing and specific processing are performed by the processor 28-4.

[0164] Storage 32-4 stores the data acquisition program 55-4, as well as the specific processing program 56-4 and the data generation model 58-4. The specific processing program 56-4 and the data generation model 58-4 are used for specific processing in the specific processing unit 290-4.

[0165] The processor 28-4 reads the data acquisition program 55-4 from the storage 32-4 and executes the read data acquisition program 55-4 on the RAM 30. The data acquisition process is realized by the processor 28-4 acting as a control unit 291-4 according to the data acquisition program 55-4 executed on the RAM 30.

[0166] The processor 28-4 reads a specific processing program 56-4 from the storage 32-4 and executes the read specific processing program 56-4 on the RAM 30. The specific processing is realized by the processor 28-4 acting as a specific processing unit 290-4 according to the specific processing program 56-4 executed on the RAM 30.

[0167] Data generation model 58-4 is a so-called generative AI. An example of data generation model 58-4 is ChatGPT (registered trademark) (Internet search).<URL: https: / / openai.com / blog / chatgpt> ), Gemini (registered trademark) (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58-4 is obtained by performing deep learning on a neural network. The data generation model 58-4 is input with prompts containing instructions, and estimation data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58-4 estimates the input estimation data according to the instructions shown by the prompts, and outputs the estimation results in data formats such as audio data and text data. Here, estimation refers to, for example, analysis, classification, prediction, and / or summarization.

[0168] Data generation model 58-4 is a so-called generative AI. An example of data generation model 58-4 is ChatGPT (registered trademark) (Internet search).<URL: https: / / openai.com / blog / chatgpt> ), Gemini (registered trademark) (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58-4 is obtained by performing deep learning on a neural network. The data generation model 58-4 is input with prompts containing instructions, and estimation data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58-4 estimates the input estimation data according to the instructions shown by the prompts, and outputs the estimation results in data formats such as audio data and text data. Here, estimation refers to, for example, analysis, classification, prediction, and / or summarization.

[0169] As shown in Figure 16, the control unit 291-4 may include a data acquisition unit 100-4. The specific processing unit 290-4 may include an input unit 200-4, a processing unit 201-4, and an output unit 202-4.

[0170] The data acquisition unit 100-4 may collect the outputs of the microphone 38, touch sensor 39, and camera 42 shown in Figure 14, and may also collect the operation data described above. As described above, the operation data may include information indicating the operation detected by the touch sensor 39.

[0171] The operations include swiping, tapping, and long-pressing on the earphones. A swipe operation can be interpreted as, for example, placing a finger on the touch sensor 39 and sliding that finger in any direction. A tap operation can be interpreted as, for example, briefly touching the touch sensor 39 with a finger and then releasing it. A tap operation may include a double tap, which is two quick taps in quick succession. A long-press operation can be interpreted as keeping a finger in contact with the touch sensor 39 and pressing down without lifting it.

[0172] For example, if user 20A, who is attending the same meeting as user 20, comes up with a good idea and wants to inform user 20 of it, user 20A can perform a tap operation, for example, to generate operation data indicating that it is a tap operation, and this operation data is sent to the data processing device 12-4.

[0173] For example, if user 20A becomes upset and irritated during a meeting, user 20A performs a double-tap operation to inform user 20 that they are in a bad mood. This generates operation data indicating that a double-tap operation is performed, and this operation data is sent to the data processing device 12-4.

[0174] (Processing section 201-4) The data collected by the data acquisition unit 100-4 may be input to the processing unit 201-4 via the input unit 200-4.

[0175] The data acquisition unit 100-4 or the input unit 200-4 may receive operation data transmitted wirelessly from an earphone worn in the ear of the transmitting second user. Specifically, the input unit 200-4 may receive operation data wirelessly directly from an earphone 14-4 worn in the ear of the receiving first user. This allows the receiving first user and the transmitting second user to communicate even if the receiving first user is not at a distance where they can see the transmitting second user and the transmitting second user's wireless data does not reach the receiving first user, as the operation data is received by the data processing device 12.

[0176] The processing unit 201-4 may perform specific processing using a data generation model 58-4 (generation AI model) that generates a predetermined estimation result according to the input operation data. The processing unit 201-4 may generate vibration data that causes vibrations corresponding to the result of the specific processing to be generated in the earphones worn in the ears of the receiving first user. For example, the processing unit 201-4 may perform a process as specific processing that estimates at least one of the intentions and emotions of the transmitting second user based on the operation data, thereby generating vibrations in the earphones worn in the ears of the receiving first user that reproduce at least one of the intentions and emotions of the transmitting second user according to the result of the specific processing.

[0177] Specifically, the processing unit 201-4 may use the data generation model 58-4 to estimate the user 20A's emotions, intentions, etc., based on information contained in the operation data, such as the strength of the touch operation, the rhythm of the touch operation, and the duration of the touch operation.

[0178] For example, if user 20A taps the earphones they are wearing three times, the processing unit 201-4 recognizes that a situation has occurred where user 20 should be alerted to user 20A. In this case, the processing unit 201-4 may generate vibration data indicating a specific vibration period, specific vibration intensity, specific number of vibrations, etc., in order to accurately convey user 20A's intention, and transmit it to the output unit 202-4. Specifically, the processing unit 201-4 may transmit vibration data to the output unit 202-4 that generates the lowest vibration level three times at one-second intervals.

[0179] The output unit 202-4 may transmit vibration data to an earphone worn in the ear of the receiving first user.

[0180] The data acquisition unit 70 of the receiving user's earphone, which receives vibration data, transfers the collected vibration data to the control unit 46A. The control unit 46A generates a vibration in the vibration unit 43 that reproduces at least one of the intentions and emotions of the transmitting user, the second user, according to the vibration data. This allows, for example, user 20A, who is participating in a meeting, to quickly inform user 20, who is also participating in the same meeting room, that user 20A wants to convey some kind of signal, instead of using a phone call or text message.

[0181] For example, if user 20A presses and holds the earphone being worn, for instance, if the housing 14a is pressed by a finger for 5 seconds, the processing unit 201-4 recognizes that user 20A is in a situation where they need to communicate an urgent matter or something important to a pre-configured user 20. In this case, the processing unit 201-4 may generate vibration data indicating a specific vibration period, specific vibration intensity, specific number of vibrations, etc., in order to accurately convey user 20A's intentions, and transmit it to the receiving first user's earphone via the output unit 202-4. Specifically, the processing unit 201-4 may transmit vibration data that generates the highest vibration level for 3 seconds continuously, followed by 10 consecutive vibrations with a 1-second pause in between. The control unit 46A of the receiving first user's earphone generates vibrations in the vibration unit 43 that reproduce at least one of the intentions and emotions of the transmitting second user, according to the vibration data. This allows, for example, to quickly inform user 20 that user 20A has an urgent matter to communicate, instead of using a phone call or text message.

[0182] The processing unit 201-4 may perform specific processing considering the operation history of the transmitting second user on the housing 14a worn on the transmitting second user's ear. For example, if the transmitting second user performs a touch operation at a specific time, the processing unit 201-4 may generate history data that associates the content of that operation with the content of a touch operation performed at a specific time after that time, in chronological order. This history data may be sent to the receiving first user's earphone, and the history content may be played back on the first earphone. The history data may include message information such as, for example, "Mr. / Ms. XX gave you some kind of signal 5 minutes ago and 1 minute ago," or "Mr. / Ms. XX sent you an agreement signal 3 minutes ago." By considering the operation history in this way, it is possible to prevent the first earphone from overlooking the transmitting second user's signals and to confirm what kind of reaction the operation specifically represented.

[0183] While generating vibration data, the processing unit 201-4 may, depending on the result of a specific processing, generate audio data that provides at least one of the intentions and emotions of the transmitting second user as voice guidance, to be played through the earphones worn by the receiving first user. This allows the receiving first user to understand the content of the vibrations more clearly.

[0184] (The control unit 46A is,) The control unit 46A may cause the vibration unit 43 to generate vibrations that reproduce at least one of the intentions and emotions of the transmitting second user, in accordance with the result of a specific process, i.e., vibration data. Specifically, the control unit 46A may generate a drive signal that reproduces at least one of the intentions and emotions of user 20A based on the vibration data generated in accordance with the result of a specific process, and transmit it to the vibration unit 43.

[0185] For example, if the processing unit 201-4 generates vibration data that generates the lowest vibration level three times at one-second intervals, the control unit 46A may transmit a pulse-shaped drive signal corresponding to the vibration data to the vibration unit 43 (e.g., a motor with an eccentric rotor). This drive signal is, for example, a rectangular signal with a low voltage level that repeats high and low levels dozens of times within one second, and generates three such signals at one-second intervals.

[0186] Furthermore, if the processing unit 201-4 generates vibration data that generates the highest vibration level for 3 seconds continuously, followed by a 1-second pause, and then generates 10 consecutive vibrations, the control unit 46A may transmit a pulse-shaped drive signal corresponding to the vibration data to the vibration unit 43 (for example, a motor with an eccentric rotor). This drive signal is, for example, a signal that repeats high and low levels hundreds of times within 3 seconds, generating a high-voltage rectangular signal 10 times at 1-second intervals.

[0187] Next, with reference to Figure 17, an example of operation related to specific processing by the data processing system 10-4 according to the fourth embodiment will be described.

[0188] In step S41, the second user's earphone generates operation data, and in step S42, it transmits the operation data to the data processing device 12-4. In step S43, the data processing device 12-4 receives the operation data, in step S44, it estimates emotions based on the operation data, and in step S45, it generates vibration data, and in step S46, it transmits the vibration data to the first user's earphone.

[0189] The receiving first user's earphone may receive vibration data in step S47 and transmit a drive signal to the vibration unit 43 in step S48, thereby generating vibrations in the vibration unit 43 according to the result of a specific process.

[0190] In addition, in the data processing system 10-4 according to the fourth embodiment, a special application may be installed on the earphones used by user 20, and the results of the above-mentioned specific processing may be linked to the application. For example, by managing a history of the number of times the transmitting second user touched the earphones and the content of those operations as a log in the application, even if the receiving first user is unable to react when operation data is entered, they can check the history and take an appropriate reaction.

[0191] Furthermore, the operation data may include the intentions of the sending second user, such as being well or wanting to express gratitude, and may also include the emotions of the sending second user, such as joy, sadness, excitement, or a sense of relief.

[0192] As described above, according to the data processing system 10-4 of the fourth embodiment, earphones can communicate wirelessly with each other directly or via a data processing device, enabling real-time sharing of tactile sensations. Furthermore, because the earphones fit into the ears, tactile sensations through vibration can be felt naturally and directly. Conventional earphones are specialized for playing voice and music, but this disclosure combines vibration functionality with wireless communication to realize tactile sharing with remote locations. In addition, emotions and feelings that cannot be fully conveyed by words or images alone can be shared through touch, improving the quality of communication. It also enables intimate interaction, and tactile sharing makes it possible to have intimate real-time communication with someone far away. Modern digital communication relies mainly on sight and hearing, but it is difficult to fully convey emotions and subtle nuances with such means alone. Especially when communicating with someone in a remote location, the lack of physical contact can result in a lack of intimacy and presence. According to this disclosure, it is possible to provide a new means of communication that allows for sharing tactile sensations with someone far away and conveying emotions at a deeper level. Furthermore, by linking the data processing device 12-4 with the earphones, it is possible to share complex tactile patterns among multiple users via the data processing device 12-4 as needed.

[0193] The above description primarily focuses on the functions of the data processing device 12 in relation to this disclosure. However, the system related to this disclosure is not necessarily implemented on a server. The system related to this disclosure may be implemented as a general information processing system. This disclosure may be implemented, for example, as a software program that runs on a personal computer or as an application that runs on a smartphone. The method related to this disclosure may be provided to users in SaaS (Software as a Service) format.

[0194] In the embodiments described above, examples were given in which a specific process is performed by a single computer 22. However, the technology of this disclosure is not limited thereto, and a distributed processing method for the specific process may be used, involving multiple computers, including computer 22.

[0195] In the embodiments described above, examples were given in which the specific processing program 56 is stored in the storage 32, but the technology of this disclosure is not limited thereto. For example, the specific processing program 56 may be stored in a portable, computer-readable, non-temporary storage medium such as a USB (Universal Serial Bus) memory. The specific processing program 56 stored in the non-temporary storage medium is installed in the computer 22 of the data processing device 12. The processor 28 executes specific processing according to the specific processing program 56.

[0196] Alternatively, the specific processing program 56 may be stored in a storage device such as a server connected to the data processing device 12 via the network 54, and the specific processing program 56 may be downloaded and installed on the computer 22 in response to a request from the data processing device 12.

[0197] Furthermore, it is not necessary to store the entirety of the specific processing program 56 in a storage device such as a server connected to the data processing device 12 via the network 54, or to store the entirety of the specific processing program 56 in the storage 32; it is acceptable to store only a portion of the specific processing program 56.

[0198] The following types of processors can be used as hardware resources to perform specific processing. Examples of processors include a CPU, a general-purpose processor that functions as a hardware resource to perform specific processing by executing software, i.e., a program. Other examples of processors include dedicated electrical circuits, such as FPGAs (Field-Programmable Gate Arrays), PLDs (Programmable Logic Devices), or ASICs (Application Specific Integrated Circuits), which have circuit configurations specifically designed to perform specific processing. All of these processors have built-in or connected memory, and all of them perform specific processing by using memory.

[0199] The hardware resource that performs a specific process may consist of one of these various processors, or it may consist of a combination of two or more processors of the same or different types (for example, a combination of multiple FPGAs, or a combination of a CPU and an FPGA). Alternatively, the hardware resource that performs a specific process may consist of a single processor.

[0200] Examples of configurations using a single processor include, firstly, a configuration in which one or more CPUs and software are combined to form a single processor, and this processor functions as a hardware resource that performs a specific process. Secondly, there is a configuration using a processor that realizes the functions of the entire system, including multiple hardware resources that perform a specific process, on a single IC chip, as exemplified by SoCs (System-on-a-chip). In this way, a specific process is realized using one or more of the above types of processors as hardware resources.

[0201] Furthermore, the hardware structure of these various processors can more specifically utilize electrical circuits that combine circuit elements such as semiconductor devices. Also, the specific processing described above is merely an example. Therefore, it goes without saying that unnecessary steps can be deleted, new steps added, or the processing order rearranged, as long as it does not deviate from the main purpose.

[0202] The descriptions and illustrations presented above are detailed explanations of the technical aspects of this disclosure and are merely examples of the technical aspects. For example, the above descriptions of the structure, function, operation, and effect are examples of the structure, function, operation, and effect of the technical aspects of this disclosure. Therefore, it goes without saying that you may delete unnecessary parts, add new elements, or replace elements in the descriptions and illustrations presented above, as long as you do not deviate from the essence of the technical aspects of this disclosure. Furthermore, in order to avoid confusion and facilitate understanding of the technical aspects of this disclosure, explanations of common technical knowledge and the like that do not require special explanation to enable the implementation of the technical aspects of this disclosure have been omitted from the descriptions and illustrations presented above.

[0203] All documents, patent applications, and technical standards described herein are incorporated by reference to the same extent as if each individual document, patent application, and technical standard were specifically and individually noted to be incorporated by reference.

[0204] Furthermore, the following additional information is disclosed regarding the above explanation.

[0205] (Note 1) A housing worn on the ear of the transmitting user (second user) who communicates with the receiving user (first user), A data collection unit collects operation data generated by detecting at least one of the following operations on the housing: swipe operation, tap operation, and long press operation. A processing unit that performs a specific processing using a data generation model that generates a predetermined estimation result according to the operation data, and generates vibration data that generates vibrations corresponding to the result of the specific processing in the earphone worn in the ear of the receiving first user, An output unit that transmits the vibration data to an earphone worn in the ear of the receiving first user, Equipped with, The processing unit performs a process to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data, as the specific processing, thereby generating vibrations in the earphone worn in the ear of the receiving first user that reproduce the intentions and emotions according to the result of the specific processing. (Note 2) The earphone as described in Appendix 1, wherein the processing unit performs the specific processing taking into consideration the operation history of the housing worn in the ear of the second transmitting user by the second transmitting user. (Note 3) The output unit is the earphone described in Appendix 1, which transmits the operation data wirelessly. (Note 4) The processing unit generates the vibration and, depending on the result of the specific processing, generates audio data to be played back as voice guidance from the earphones worn in the ear of the receiving first user, which includes at least one of the intention and the emotion. The output unit is the earphone described in Appendix 1, which transmits the audio data wirelessly. (Note 5) A vibrating unit that vibrates the housing worn on the ear of the first receiving user, An input unit that receives operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on an earphone worn in the ear of the transmitting second user who is communicating with the receiving first user, A processing unit that performs specific processing using a data generation model that generates a predetermined estimation result corresponding to the input operation data, A control unit that generates vibrations in the vibration unit according to the result of the specified processing, Equipped with, The processing unit performs a process, as the specific process, to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data. The control unit generates vibrations in the vibrating unit that reproduce at least one of the intention and the emotion, depending on the result of the specific processing, in an earphone. (Note 6) The earphone as described in Appendix 5, wherein the processing unit performs the specific processing taking into consideration the operation history of the earphone worn in the ear of the transmitting second user by the transmitting second user. (Note 7) The input unit is the earphone described in Appendix 5, which receives the operation data transmitted wirelessly from the earphone worn in the ear of the transmitting second user. (Note 8) The earphone as described in Appendix 5, wherein the control unit generates vibrations in the vibrating unit and, according to the result of the specific processing, generates audio data that plays at least one of the intention and the emotion as audio guidance. (Note 9) An input unit that receives operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on the earphone worn in the ear of a first user (receiver) who is wearing the earphone, by a second user (transmitter) who is communicating with the first user (receiver) who is wearing the earphone, A processing unit that performs specific processing using a data generation model that generates a predetermined estimation result corresponding to the input operation data, A control unit that generates vibration data to cause vibrations corresponding to the result of the specified processing to be generated in the earphone worn in the ear of the receiving first user, and An output unit that transmits the vibration data to an earphone worn in the ear of the receiving first user, Equipped with, The processing unit performs a process, as the specific process, to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data. The control unit generates vibration data that reproduces at least one of the intention and the emotion, according to the result of the specific processing. (Note 10) The processing unit performs the specific processing taking into consideration the operation history of the earphones worn in the ears of the transmitting second user, as described in Appendix 9. (Note 11) The data processing device described in Appendix 9, wherein the input unit receives the operation data transmitted by wireless communication from an earphone worn in the ear of the transmitting second user. (Note 12) The control unit generates vibrations in the earphone worn in the ear of the receiving first user, and generates audio data that plays at least one of the intention and emotion as voice guidance according to the result of the specific processing. The output unit transmits the voice guidance to an earphone worn in the ear of the receiving first user, as described in Appendix 9. (Note 13) At least one processor, A data generation model is used to perform a specific processing step that generates a predetermined estimation result according to the operation data generated by detecting at least one of the swipe, tap, and long-press operations performed on the earphone worn by the second transmitting user, who is communicating with the first receiving user wearing the earphone. The system generates vibration data that causes vibrations corresponding to the result of the specified processing to be generated in the earphone worn in the ear of the receiving first user. The vibration data is transmitted to the earphone worn in the ear of the receiving first user. Based on the aforementioned operation data, a process is performed as the specified process to estimate at least one of the intentions and emotions of the transmitting second user. Depending on the result of the specified processing, the vibration data is generated that reproduces at least one of the intention and the emotion. A data processing method that performs processing that includes the following. (Note 14) At least one processor, A data generation model is used to perform a specific processing step that generates a predetermined estimation result according to the operation data generated by detecting at least one of the swipe, tap, and long-press operations performed on the earphone worn by the second transmitting user, who is communicating with the first receiving user wearing the earphone. The system generates vibration data that causes vibrations corresponding to the result of the specified processing to be generated in the earphone worn in the ear of the receiving first user. The vibration data is transmitted to the earphone worn in the ear of the receiving first user. Based on the aforementioned operation data, a process is performed as the specified process to estimate at least one of the intentions and emotions of the transmitting second user. Depending on the result of the specified processing, the vibration data is generated that reproduces at least one of the intention and the emotion. A data processing program that performs a process that includes the following. [Explanation of Symbols]

[0206] 10 Data Processing Systems 12 Data Processing Devices 14 Earphones 290 Specific Processing Unit 291 Input section 292 Processing Unit 293 Output section< / url:> < / url:> < / url:> < / url:> < / url:>

Claims

1. A housing worn on the ear of the transmitting second user who communicates with the receiving first user, A data collection unit collects operation data generated by detecting at least one of the following operations on the housing: swipe operation, tap operation, and long press operation. A processing unit that performs a specific processing using a data generation model that generates a predetermined estimation result according to the operation data, and generates vibration data that generates vibrations corresponding to the result of the specific processing in the earphone worn in the ear of the receiving first user, An output unit that transmits the vibration data to an earphone worn in the ear of the receiving first user, Equipped with, The processing unit performs a process to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data, as the specific processing, and generates vibrations in the earphone worn in the ear of the receiving first user that reproduce the intentions and emotions according to the result of the specific processing.

2. The earphone according to claim 1, wherein the processing unit performs the specific processing taking into consideration the operation history of the housing worn on the ear of the transmitting second user by the transmitting second user.

3. The earphone according to claim 1, wherein the output unit transmits the operation data wirelessly.

4. The processing unit generates the vibration and, according to the result of the specific processing, generates audio data to be played back as voice guidance from the earphone worn in the ear of the receiving first user, which includes at least one of the intention and the emotion. The earphone according to claim 1, wherein the output unit transmits the audio data wirelessly.

5. A vibrating unit that vibrates the housing worn on the ear of the first receiving user, An input unit that receives operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on an earphone worn in the ear of the transmitting second user who is communicating with the receiving first user, A processing unit that performs specific processing using a data generation model that generates a predetermined estimation result corresponding to the input operation data, A control unit that generates vibrations in the vibration unit according to the result of the specified processing, Equipped with, The processing unit performs a process, as the specific process, to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data. The control unit generates vibrations in the vibrating unit that reproduce at least one of the intention and the emotion, according to the result of the specific processing, in an earphone.

6. The earphone according to claim 5, wherein the processing unit performs the specific processing taking into consideration the operation history of the earphone worn in the ear of the transmitting second user by the transmitting second user.

7. The earphone according to claim 5, wherein the input unit receives the operation data transmitted from the earphone worn in the ear of the transmitting second user via wireless communication.

8. The earphone according to claim 5, wherein the control unit generates vibrations in the vibrating unit and generates audio data that plays at least one of the intention and the emotion as audio guidance according to the result of the specific processing.

9. An input unit that receives operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on the earphones worn in the ear of a second user who is communicating with a first user who is wearing earphones, A processing unit that performs specific processing using a data generation model that generates a predetermined estimation result corresponding to the input operation data, A control unit that generates vibration data to cause vibrations corresponding to the result of the specified processing to be generated in the earphone worn in the ear of the receiving first user, and An output unit that transmits the vibration data to an earphone worn in the ear of the receiving first user, Equipped with, The processing unit performs a process, as the specific process, to estimate at least one of the intentions and emotions of the transmitting second user based on the operation data. The control unit is a data processing device that generates vibration data that reproduces at least one of the intention and the emotion, according to the result of the specific processing.

10. The data processing device according to claim 9, wherein the processing unit performs the specific processing taking into consideration the operation history of the earphones worn in the ear of the transmitting second user by the transmitting second user.

11. The data processing device according to claim 9, wherein the input unit receives the operation data transmitted by wireless communication from an earphone worn in the ear of the transmitting second user.

12. The control unit generates vibrations in the earphone worn in the ear of the receiving first user, and generates audio data that plays at least one of the intention and emotion as voice guidance according to the result of the specific processing. The data processing device according to claim 9, wherein the output unit transmits the voice guidance to an earphone worn in the ear of the receiving first user.

13. At least one processor, A data generation model is used to perform a specific processing that generates a predetermined estimation result according to the operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on the earphone worn in the ear of a second transmitting user who is communicating with a first receiving user who is wearing the earphone. The system generates vibration data that causes vibrations corresponding to the result of the specified processing to be generated in the earphone worn in the ear of the receiving first user. The vibration data is transmitted to the earphone worn in the ear of the receiving first user. Based on the aforementioned operation data, a process is performed as the specified process to estimate at least one of the intentions and emotions of the transmitting second user. Depending on the result of the specified processing, the vibration data is generated that reproduces at least one of the intention and the emotion. A data processing method that performs processing that includes the following.

14. At least one processor, A data generation model is used to perform a specific processing that generates a predetermined estimation result according to the operation data generated by detecting at least one of a swipe operation, a tap operation, and a long press operation on the earphone worn in the ear of a second transmitting user who is communicating with a first receiving user who is wearing the earphone. The system generates vibration data that causes vibrations corresponding to the result of the specified processing to be generated in the earphone worn in the ear of the receiving first user. The vibration data is transmitted to the earphone worn in the ear of the receiving first user. Based on the aforementioned operation data, a process is performed as the specified process to estimate at least one of the intentions and emotions of the transmitting second user. Depending on the result of the specified processing, the vibration data is generated that reproduces at least one of the intention and the emotion. A data processing program that performs a process that includes the following.