system
The system addresses the challenge of finding ideal partners by allowing users to input preferences, using image and natural language processing to create realistic interactions, enhancing immersion and satisfaction in romance simulations.
Patent Information
- Authority / Receiving Office
- JP · JP
- Patent Type
- Applications
- Current Assignee / Owner
- SOFTBANK GROUP CORP
- Filing Date
- 2024-12-09
- Publication Date
- 2026-06-19
AI Technical Summary
Individuals face difficulty in finding their ideal romantic partners, and existing dating simulation games lack natural interactions and immersion, failing to provide satisfying experiences.
A system that allows users to input their ideal romantic partner's appearance and personality, using image and natural language processing models to generate realistic interactions, learn from conversations, and improve over time.
Enables users to have natural and immersive interactions with their ideal partner, providing a personalized and fulfilling romance simulation experience.
Smart Images

Figure 2026100677000001_ABST
Abstract
Description
【Technical Field】 【0001】 The technology of the present disclosure relates to a system. 【Background Art】 【0002】 Patent Document 1 discloses a method for controlling a persona chatbot, which is performed by at least one processor and includes steps of receiving a user utterance, adding the user utterance to a prompt including an instruction sentence related to an explanation of a character of the chatbot, encoding the prompt, and inputting the encoded prompt into a language model to generate a chatbot utterance in response to the user utterance. 【Prior Art Documents】 【Patent Documents】 【0003】 【Patent Document 1】 Japanese Patent Application Laid-Open No. 2022-180282 【Summary of the Invention】 【Problems to be Solved by the Invention】 【0004】 In modern society, it is very difficult for individual users to find their ideal romantic partners. This difficulty becomes a time-consuming and mental burden, especially for busy individuals at a specific stage of life. Furthermore, existing dating simulation games fall into unnatural conversations and monotonous scenarios in user experience and cannot sufficiently provide the immersion and satisfaction that users seek. The present invention aims to solve this problem by enabling users to have natural interactions with romantic partners having their ideal appearance and personality and realizing a realistic dating simulation never before achieved. 【Means for Solving the Problems】 【0005】 This invention first provides a means for users to input the appearance and personality of their ideal romantic partner, thereby gaining a concrete understanding of the user's expectations and desires. Next, it includes means for applying an image generation model based on the acquired user input information to generate an image of a person with the ideal appearance and presenting that image. This allows the user to visually interact with their ideal partner. Furthermore, by incorporating means for realizing interaction with the user using a natural language processing model, the invention remembers the conversation content, continuously learns from it, and provides a more natural and effective conversation experience. Through these means, this invention provides users with a realistic dating simulation experience, enabling them to interact with their ideal partner in a natural way. 【0006】 A "user" is the individual who uses this system to input the appearance and personality traits of their ideal romantic partner. 【0007】 "Information about appearance and personality entered" refers to specific data that expresses the visual and personality characteristics of the user's ideal romantic partner. 【0008】 An "image generation model" is an algorithm that uses machine learning techniques to generate images that meet specified conditions based on features input by the user. 【0009】 A "natural language processing model" is a technology that includes machine learning to analyze, understand, and generate human language, and is a system that enables interaction with users through this technology. 【0010】 "Means for memorizing conversation content and continuously learning from it" refers to a technical process in which user interactions are recorded in a database, and the system uses this data to perform self-improvement. [Brief explanation of the drawing] 【0011】 [Figure 1] This is a conceptual diagram showing an example of the configuration of a data processing system according to the first embodiment. [Figure 2]This is a conceptual diagram showing an example of the essential functions of a data processing device and a smart device according to the first embodiment. [Figure 3] This is a conceptual diagram showing an example of the configuration of a data processing system according to the second embodiment. [Figure 4] This is a conceptual diagram showing an example of the main functions of a data processing device and smart glasses according to the second embodiment. [Figure 5] This is a conceptual diagram showing an example of the configuration of a data processing system according to the third embodiment. [Figure 6] This is a conceptual diagram showing an example of the main functions of a data processing device and a headset-type terminal according to the third embodiment. [Figure 7] This is a conceptual diagram showing an example of the configuration of a data processing system according to the fourth embodiment. [Figure 8] This is a conceptual diagram showing an example of the main functions of a data processing device and a robot according to the fourth embodiment. [Figure 9] This shows an emotion map where multiple emotions are mapped. [Figure 10] This shows an emotion map where multiple emotions are mapped. [Figure 11] This is a sequence diagram showing the processing flow of the data processing system in Example 1. [Figure 12] This is a sequence diagram showing the processing flow of the data processing system in Application Example 1. [Figure 13] This is a sequence diagram showing the processing flow of the data processing system in Example 2, when an emotion engine is combined. [Figure 14] This is a sequence diagram showing the processing flow of the data processing system in Application Example 2, which combines an emotion engine. [Modes for carrying out the invention] 【0012】 Hereinafter, an example of an embodiment of the system relating to the technology of this disclosure will be described with reference to the attached drawings. 【0013】 First, the terms used in the following description will be explained. 【0014】 In the following embodiments, the numbered processor (hereinafter simply referred to as "processor") may be a single arithmetic unit or a combination of multiple arithmetic units. Also, the processor may be a single type of arithmetic unit or a combination of multiple types of arithmetic units. Examples of arithmetic units include a CPU (Central Processing Unit), a GPU (Graphics Processing Unit), a GPGPU (General-Purpose computing on Graphics Processing Units), an APU (Accelerated Processing Unit), and the like. 【0015】 In the following embodiments, the numbered RAM (Random Access Memory) is a memory in which information is temporarily stored and is used as a work memory by the processor. 【0016】 In the following embodiments, the numbered storage is one or more non-volatile storage devices that store various programs and various parameters, etc. Examples of non-volatile storage devices include flash memory (SSD (Solid State Drive)), magnetic disks (e.g., hard disks), or magnetic tapes, and the like. 【0017】 In the following embodiments, the numbered communication I / F (Interface) is an interface including a communication processor and an antenna, etc. The communication I / F controls communication between multiple computers. Examples of communication standards applicable to the communication I / F include wireless communication standards including 5G (5th Generation Mobile Communication System), Wi-Fi (registered trademark), or Bluetooth (registered trademark), and the like. 【0018】 In the following embodiments, "A and / or B" is synonymous with "at least one of A and B." That is, "A and / or B" means that it may be A alone, or B alone, or a combination of A and B. Furthermore, in this specification, the same concept as "A and / or B" applies when expressing three or more things linked by "and / or." 【0019】 [First Embodiment] 【0020】 Figure 1 shows an example of the configuration of the data processing system 10 according to the first embodiment. 【0021】 As shown in Figure 1, the data processing system 10 includes a data processing device 12 and a smart device 14. An example of the data processing device 12 is a server. 【0022】 The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network). 【0023】 The smart device 14 comprises a computer 36, a reception device 38, an output device 40, a camera 42, and a communication interface 44. The computer 36 comprises a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The reception device 38, output device 40, and camera 42 are also connected to the bus 52. 【0024】 The reception device 38 is equipped with a touch panel 38A and a microphone 38B, etc., and receives user input. The touch panel 38A receives user input by detecting contact with an object (e.g., a pen or finger). The microphone 38B receives user input by detecting the user's voice. The control unit 46A transmits data indicating the user input received by the touch panel 38A and microphone 38B to the data processing device 12. In the data processing device 12, the specific processing unit 290 acquires the data indicating the user input. 【0025】 The output device 40 includes a display 40A and a speaker 40B, and presents data to the user 20 by outputting the data in a form perceptible to the user 20 (e.g., audio and / or text). The display 40A displays visible information such as text and images according to instructions from the processor 46. The speaker 40B outputs audio according to instructions from the processor 46. The camera 42 is a small digital camera equipped with an optical system such as a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor. 【0026】 Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various types of information between processor 46 and processor 28 via network 54. 【0027】 Figure 2 shows an example of the main functions of the data processing device 12 and the smart device 14. 【0028】 As shown in Figure 2, in the data processing device 12, a specific processing is performed by the processor 28. A specific processing program 56 is stored in the storage 32. The specific processing program 56 is an example of a "program" related to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 according to the specific processing program 56 executed on the RAM 30. 【0029】 The storage 32 stores the data generation model 58 and the emotion identification model 59. The data generation model 58 and the emotion identification model 59 are used by the identification processing unit 290. 【0030】 In the smart device 14, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The reception output program 60 is used in conjunction with a specific processing program 56 by the data processing system 10. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48. 【0031】 Next, the specific processing performed by the specific processing unit 290 of the data processing device 12 will be described. In the following description, the data processing device 12 will be referred to as the "server" and the smart device 14 as the "terminal". 【0032】 This invention provides a romance simulation system that enriches the user experience, with the user, terminal, and server working together to realize each function. 【0033】 First, the user inputs information about their ideal romantic partner's appearance and personality through the device's interface. The device receives this information and sends it to the server in the specified data format. 【0034】 The server uses an image generation model based on the received data to generate a person image with the features specified by the user. This image generation process employs machine learning algorithms to faithfully reproduce the shape and style desired by the user. The generated image is sent back to the user's device, allowing the user to confirm their visual representation of their romantic partner. 【0035】 Next, the server starts up its natural language processing model and prepares to interact with the user. It customizes the conversation style based on the personality traits entered by the user and sets up an initial conversation scenario. 【0036】 When a user starts a conversation via the chat interface, the server analyzes the user's input as the conversation progresses and instantly generates and returns natural-sounding responses. This gives the user the feeling of interacting with their ideal partner. 【0037】 Furthermore, the server records all conversations and uses machine learning techniques to continuously train the conversation model, improving the accuracy of future conversations. For example, if a user previously said "I like the sea," the server can generate context-related questions in the next conversation, such as "Have you been to the sea recently?" 【0038】 Through this interaction, users can engage with their ideal character and enjoy a personalized romance simulation experience. This invention provides the opportunity to experience ideal interactions in a virtual environment that are difficult to achieve in the real world. 【0039】 The following describes the processing flow. 【0040】 Step 1: 【0041】 The user inputs information about their ideal romantic partner's appearance and personality through the device's interface. 【0042】 Step 2: 【0043】 The terminal receives input from the user and sends it to the server in a formatted data format. 【0044】 Step 3: 【0045】 The server analyzes the received data and uses an image generation model to generate images of people based on features specified by the user. 【0046】 Step 4: 【0047】 The server sends the generated image of a person to the terminal, and the terminal displays it to the user. 【0048】 Step 5: 【0049】 The server activates a natural language processing model and customizes the conversational agent based on the user's personality settings. 【0050】 Step 6: 【0051】 When a user initiates a conversation via their device, the server analyzes the user's input and generates an appropriate response. 【0052】 Step 7: 【0053】 The server records the content of the conversation with the user in a database and provides feedback to the model for continuous learning. 【0054】 Step 8: 【0055】 Based on the conversation data collected by the server, the entire system is improved so that users can enjoy a more natural and personalized experience in subsequent interactions. 【0056】 (Example 1) 【0057】 Next, we will describe Example 1. In the following description, the data processing device 12 will be referred to as the "server," and the smart device 14 will be referred to as the "terminal." 【0058】 Current interactive systems with virtual characters lack the ability to easily generate characters with the appearance and personality desired by the user and to provide natural dialogue based on them. Furthermore, they suffer from insufficient continuity of dialogue and poor understanding of context, limiting the user experience. Additionally, they are unable to leverage the user's past statements to generate more relevant conversations, making it difficult to continuously improve user satisfaction. 【0059】 The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 1 is realized by the following means. 【0060】 In this invention, the server includes means for acquiring information about features input by the user, means for generating an image with ideal features using a generative model based on the user's input, and means for using a language processing model to enable natural interaction with the user. This enables the generation of a character with the appearance and personality desired by the user, and natural, contextual interaction with that character. 【0061】 A "user" is an individual or end-user who utilizes the system and is the entity that provides input and interacts with the simulation. 【0062】 "Characteristics" refer to information such as specific attributes, personality traits, and appearance that users input, and are elements that influence character creation. 【0063】 A "generative model" is an algorithm or software that creates realistic images or visual outputs based on user input. 【0064】 An "image" is a visual representation created by a generative model, depicting a character with characteristics specified by the user. 【0065】 "Display" refers to the process of visually unfolding and making viewable the generated image on the user's device. 【0066】 "Dialogue" refers to communication between a user and a system, including exchanges via text and voice. 【0067】 A "language processing model" is an algorithm or software that processes text-based input and generates appropriate responses to enable natural communication with users. 【0068】 "Memory" refers to the act of storing past conversations and interactions with users in a database or similar system, for use in future interactions. 【0069】 "Learning" is the process of analyzing accumulated dialogue data and improving algorithms to enhance the system's capabilities. 【0070】 The system of this invention allows the user to input ideal characteristics using a terminal interface, and then uses a generative AI model to create a virtual character based on that input, enabling natural interaction. Specifically, the server and terminal work together to provide the user with a personalized simulation experience. 【0071】 The user first inputs information about the characteristics of the character they want, such as "black hair" and "kind personality," through the input device on the terminal. This information is collected by the terminal and sent to the server via the internet. 【0072】 The server processes the received data and uses a generative AI model to generate images of characters with specified features. Software used here includes Stable Diffusion and similar image generation algorithms. This enables visual simulations tailored to user requirements. 【0073】 The generated image is sent from the server to the terminal and displayed for the user to visually confirm. The user can then use this image to enjoy an interactive virtual experience. 【0074】 Furthermore, the server utilizes language processing models to achieve natural dialogue. The server analyzes user input and instantly generates responses based on an optimized dialogue model. This allows users to enjoy natural conversations with the generated character. Examples of language processing models used here include GPT-3®. 【0075】 As a concrete example, an example of a prompt is as follows: "Create a visual and conversational style based on the characteristics of your ideal partner." Based on this, the user can gain a new conversational experience in a virtual environment. 【0076】 This system is designed to provide users with ideal interactions that are difficult to experience in reality, and to realize personalized romance simulations. 【0077】 The flow of the specific processing in Example 1 will be explained using Figure 11. 【0078】 Step 1: 【0079】 The user inputs the characteristics of their ideal character through the terminal's interface. This input includes details such as appearance and personality. The terminal converts this information into JSON data and sends it to the server. In this step, the user's characteristic request is taken as input, and formatted request data is generated as output. 【0080】 Step 2: 【0081】 The server parses the JSON data received from the terminal and uses a generative AI model to generate an image of a character with features specified by the user. Specifically, it uses an algorithm such as Stable Diffusion to synthesize a visual based on the specified attributes. The input is the user's feature data, and the output is the generated character image. 【0082】 Step 3: 【0083】 The generated image is sent from the server to the terminal. The terminal displays the received image on its display device, allowing the user to visually confirm it. In this step, the generated image data is received as input, and presented visually to the user as output. 【0084】 Step 4: 【0085】 The user initiates interaction with the character through the terminal. The server activates a language processing model and generates natural responses based on the user's input. A model such as GPT-3 is used in this process. The user's message is used as input, and the generated response is returned as output. The server analyzes the input in real time to ensure natural and relevant dialogue. 【0086】 Step 5: 【0087】 The server records the conversation content in a database and continuously improves the conversation model through machine learning. This learning process leverages past conversation data to improve the accuracy of subsequent conversations. Past conversation data is used as input, and an improved conversation algorithm is obtained as output. 【0088】 These processing steps allow users to obtain a highly customized virtual experience. 【0089】 (Application Example 1) 【0090】 Next, we will explain Application Example 1. In the following explanation, the data processing device 12 will be referred to as the "server," and the smart device 14 will be referred to as the "terminal." 【0091】 In the modern era, the means by which users can interact intimately with their ideal characters are limited, and integrating that experience across virtual and physical environments is particularly difficult. There is a need for a system that provides physically actionable dialogue in accordance with user expectations. 【0092】 The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 1 is realized by the following means. 【0093】 In this invention, the server includes means for acquiring information about characteristics entered by the user, means for generating a person image with an ideal appearance using an image generation algorithm based on the user's input, and means for controlling a humanoid automated device to act as the user's ideal partner. This makes it possible for the user to experience a virtual lover in the real world. 【0094】 A "user" is an individual who seeks to interact with their ideal character using the system. 【0095】 "Characteristics" refer to information about the ideal appearance and personality that the user enters. 【0096】 "Means of acquiring information" refers to an interface for collecting requests and information entered by the user. 【0097】 An "image generation algorithm" is a machine learning model that creates images of people with ideal appearances based on user requests. 【0098】 "Means for generating human images" refers to the process of creating digital images using image generation algorithms. 【0099】 A "natural language processing algorithm" is a language analysis and response generation technique that enables natural dialogue between the user and the system. 【0100】 A "humanoid automated device" is a human-shaped device that physically acts based on user instructions. 【0101】 "Means of making it act as an ideal partner" refers to a mechanism that controls a humanoid automated device based on the user's wishes, causing it to perform specific actions or engage in dialogue. 【0102】 To realize this invention, the user first inputs the characteristics of their ideal character through the terminal's interface. These characteristics include details about appearance and personality. This information is collected by the terminal and sent to the server. 【0103】 On the server, an image generation algorithm is activated based on the collected information, and an image of a person with the appearance desired by the user is generated. Image generation software such as Stable Diffusion is used in this process. The generated image is sent to the user's terminal for visual confirmation. 【0104】 Furthermore, the server uses natural language processing algorithms to refine the content of the interaction with the user. Technologies such as OpenAI® GPT are utilized here to analyze the flow of the conversation and generate natural responses that correspond to user input. The generated responses are sent to a humanoid robot, which then behaves as the user's ideal character. 【0105】 For example, if a user enters "a kind person who likes the outdoors," the device can respond in a gentle voice, "Have you been hiking recently?" Through such interactions, users can experience a virtual romantic partner in the real world. 【0106】 Examples of prompts used for the generating AI model include sentences like, "Please create a dialogue scenario with a character that has the personality and hobbies desired by the user." 【0107】 The flow of a specific process in Application Example 1 will be explained using Figure 12. 【0108】 Step 1: 【0109】 The user uses an interface via their device to input the characteristics of their ideal character. Specifically, they enter text information about the character's appearance and personality into the input fields. This input is stored on the device as data necessary for subsequent processing. 【0110】 Step 2: 【0111】 The terminal converts the collected user characteristic information into a specified data format and sends it to the server. This process is carried out by a data transmission protocol, and the server analyzes this information for use in subsequent processing. 【0112】 Step 3: 【0113】 The server activates an image generation algorithm based on the characteristic information received from the user. It processes the input characteristics to create an image generation prompt, and then uses an image generation model (e.g., Stable Diffusion) based on that prompt to generate an image of a person with an ideal appearance. The output image is sent back to the terminal for visual confirmation. 【0114】 Step 4: 【0115】 The user reviews the character image received on their device. They check whether the character's appearance matches their preferences, and if dissatisfied, they adjust the characteristics and re-enter the information. 【0116】 Step 5: 【0117】 The server invokes natural language processing algorithms to enable natural interaction with the user. Based on the characteristics input by the user, it generates appropriate conversation setting prompts and creates a dialogue scenario using a natural language model (e.g., OpenAI GPT). This output is used for subsequent processing as response data for the conversation. 【0118】 Step 6: 【0119】 The server transmits the generated dialogue response data to a humanoid automated device. This device initiates actions, including specific voice tones and gestures, to interact with the user. Through this device, the user can experience interaction with a virtual character in the real world. 【0120】 Furthermore, an emotion engine that estimates the user's emotions may be incorporated. That is, the identification processing unit 290 may use the emotion identification model 59 to estimate the user's emotions and perform identification processing using the user's emotions. 【0121】 This invention realizes a system in a romance simulation game that enables emotionally rich dialogue with the user's desired ideal romantic partner. The entire system includes image generation based on user input, natural language processing to enable natural conversation, and an emotion engine to recognize the user's emotions. 【0122】 First, the user uses the interface on their device to input visual characteristics and personality parameters as traits of their ideal romantic partner. This information is sent from the device to a server, where it is analyzed. 【0123】 The server invokes an image generation model based on the input data and generates a human image that faithfully represents the specified features. This image is sent back to the terminal, allowing the user to visually confirm their ideal character. 【0124】 Next, the server applies a natural language processing model to deploy a conversational agent that incorporates the user's personality information. This model is designed to record the content of conversations with the user and optimize response generation for future interactions. 【0125】 Furthermore, an emotion engine operates to analyze emotions from user input and voice. This emotion analysis is performed using an algorithm that identifies emotional states based on the timing of text input, the tone of words used, or, in the case of voice input, the tone and speed. 【0126】 For example, if a user enters "I'm feeling good today," the emotion engine can detect a positive emotion, and the server can generate a correspondingly cheerful response. If the user enters "I'm a little tired today," the emotion engine will detect a negative emotion, and the natural language processing model will react by changing the response to something more encouraging. 【0127】 A system configured in this way can provide users with realistic and emotional interactions, enhancing immersion in dating simulations. Users can experience more engaging and fulfilling interactions by receiving responses that take into account their own emotions and moods. 【0128】 The following describes the processing flow. 【0129】 Step 1: 【0130】 The user uses an interface on their device to input visual characteristics and personality information of their ideal romantic partner. 【0131】 Step 2: 【0132】 The device formats the data collected from the user and sends it to the server. 【0133】 Step 3: 【0134】 The server analyzes the received data, activates an image generation model, and generates a person image based on the input features. 【0135】 Step 4: 【0136】 The server generates image data of a person and sends it back to the terminal, which then visually presents the image to the user. 【0137】 Step 5: 【0138】 The server uses a natural language processing model to set up a conversational agent based on the user's personality information. 【0139】 Step 6: 【0140】 The user enters text or voice input through the device's conversation interface and sends it to the server. 【0141】 Step 7: 【0142】 The server analyzes the input information through a natural language processing model and uses an emotion engine to determine the user's emotional state. 【0143】 Step 8: 【0144】 Based on the emotional data determined by the emotion engine, the server generates a response adapted to the user's emotions and sends it to the terminal. 【0145】 Step 9: 【0146】 The terminal receives a response from the server and presents it to the user either visually or audibly. 【0147】 Step 10: 【0148】 The server records conversation history and sentiment data in a database, and continuously uses it to improve the accuracy of future responses. 【0149】 (Example 2) 【0150】 Next, we will describe Example 2. In the following description, the data processing device 12 will be referred to as the "server," and the smart device 14 will be referred to as the "terminal." 【0151】 Traditional romance simulation games have only been able to offer users generalized characters and dialogue, making it difficult to achieve interactions that reflect the user's individual emotions and desires. As a result, users were unable to fully immerse themselves, and the simulation experience was limited. There is a need for a system that can solve this problem and enable more individualized and emotionally rich dialogue. 【0152】 The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 2 is realized by the following means. 【0153】 In this invention, the server includes means for acquiring information on characteristics and temperament entered by the user, means for generating a character image with ideal characteristics using digital image generation technology based on the user's input, and means for presenting the character image. This enables the provision of visuals and conversational content based on the user's individual preferences, resulting in an immersive simulation experience. 【0154】 "Characteristics" refer to specific features related to the appearance and behavior of the character that the user considers ideal. 【0155】 "Temperament" refers to specific attributes related to the personality and disposition of the character that the user idealizes. 【0156】 "Digital image generation technology" refers to technology that uses a computer to automatically generate character images based on characteristics input by the user. 【0157】 A "character image" refers to a visual digital image generated based on characteristics specified by the user. 【0158】 "Presentation" refers to the act of displaying the generated character image so that the user can visually confirm it. 【0159】 "Language processing technology" refers to the technology that processes language data to enable natural dialogue with users, performing semantic analysis and response generation. 【0160】 An "emotion engine" refers to a system that analyzes emotions from user input and voice, identifies those emotions, and utilizes that information. 【0161】 This invention realizes a system that enables users to create their ideal romantic partner in a dating simulation game. Users input information about the characteristics and temperament of their ideal partner through the terminal interface. For example, it is possible to specify visual characteristics such as "black hair and large eyes" or personality parameters such as "highly kind and moderately intelligent." 【0162】 User input is transmitted to the server via the terminal. The server utilizes digital image generation technology based on the received information and generates the ideal character image using a generation AI model. Specifically, the generation AI model uses generation technology that can faithfully reproduce a variety of features. 【0163】 The generated character image is sent back to the device, allowing the user to visually confirm the image. Furthermore, the server applies language processing technology based on the temperament specified by the user to set up a conversational agent that enables natural dialogue. This allows the user to enjoy personalized conversations. 【0164】 In addition, the emotion engine analyzes the user's emotions from text and voice input, and the server generates a response based on those emotions. For example, if the user inputs "I'm happy today," the emotion engine can capture that positive emotion and generate a cheerful response. 【0165】 Examples of prompts include, "What are the physical characteristics of your ideal partner?" and "Please tell me how you feel today." 【0166】 With a system configured in this way, users can experience interactions that match their desires and emotions, leading to a deeper sense of immersion and satisfaction in romance simulation games. 【0167】 The flow of the specific processing in Example 2 will be explained using Figure 13. 【0168】 Step 1: 【0169】 The user uses the terminal's interface to input information about the characteristics and temperament of their ideal character. This input data includes physical features (e.g., hair color, eye shape) and personality parameters (e.g., kindness, intelligence). The terminal collects this input data and prepares to send it to the server. 【0170】 Step 2: 【0171】 The terminal sends the collected user input data to the server. The server receives this data and performs analysis. The analysis involves data processing to determine how the input characteristics will be reflected in the digital image. Based on these results, the system is ready to use digital image generation technology. 【0172】 Step 3: 【0173】 The server uses a generative AI model based on the analysis results to generate a character image that reflects the user's characteristics. The generative AI model used here incorporates an algorithm that accurately reproduces features. The generated character image is output as digital data and sent to the terminal. 【0174】 Step 4: 【0175】 The terminal receives character images sent from the server and presents them to the user. The user can visually confirm these images. This process allows the user to concretely evaluate the appearance of their ideal character. 【0176】 Step 5: 【0177】 The server uses language processing techniques to configure conversational agents based on temperament information input from the user. Here, natural language processing algorithms are used to construct conversation scenarios that enable natural dialogue with the user. This allows users to experience smooth communication when interacting with the characters. 【0178】 Step 6: 【0179】 The server uses an emotion engine to analyze the user's text and voice input and identify their emotional state. Emotional data is extracted by analyzing the input words and tone. This analysis makes it possible to generate appropriate responses that match the user's emotions. 【0180】 Step 7: 【0181】 The server generates a response based on the emotion engine's output and sends it to the terminal. The terminal receives this response and displays it to the user. This allows the user to experience a realistic and rich dialogue that resonates with their emotions. 【0182】 (Application Example 2) 【0183】 Next, we will explain application example 2. In the following explanation, the data processing device 12 will be referred to as the "server," and the smart device 14 will be referred to as the "terminal." 【0184】 There is a challenge in providing a system that allows users to interactively engage in emotionally rich conversations with their ideal companion. In particular, there is a need to generate personalized responses that respond to the user's emotional state and provide a more immersive experience. Existing systems have struggled to accurately understand the user's emotions and provide natural interactions accordingly. 【0185】 The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 2 is realized by the following means. 【0186】 In this invention, the server includes means for acquiring information on characteristics and properties entered by the user, means for generating a target image with ideal characteristics using an image generation device based on the user's input, and means for displaying the target image. This allows the user to enjoy emotionally harmonious dialogue in real time, enabling a more engaging interaction experience. 【0187】 A "user" is a person who operates this system and inputs information about ideal characteristics and traits. 【0188】 "Input" refers to the act of a user providing information about characteristics and properties to the system. 【0189】 "Characteristics" refer to the specific physical and personality traits that the user's ideal partner should possess. 【0190】 "Information" refers to data that a user provides to the system, including details about its characteristics and properties. 【0191】 An "image generation device" is a system component that generates an image embodying specified characteristics based on user input. 【0192】 The "target image" is a visual representation of a person with ideal characteristics, generated by an image generation device. 【0193】 "Displaying" refers to the act of showing the generated target image in a way that allows the user to visually confirm it. 【0194】 "Natural language processing technology" is a language analysis technology that enables natural conversations with users. 【0195】 "Dialogue" refers to two-way communication that takes place between the user and the system. 【0196】 An "emotion analysis device" is a system component that analyzes emotions from user input or voice and selects an appropriate response based on that analysis. 【0197】 A "response" is a reply that a system generates in response to user input or state. 【0198】 "Recording" refers to the act of saving the content of conversations with users for later reference and learning. 【0199】 "Learning" is the process by which a system uses past dialogue data to improve the accuracy and naturalness of its responses. 【0200】 In carrying out this invention, the system consists of a user, a server, and an end device. First, the user inputs information about the characteristics and properties of the ideal companion through an interface on the end device. This input includes visual characteristics and personality traits. 【0201】 The server receives the user's input data and first uses an image generator to produce a target image that embodies the specified characteristics. The generated target image is sent to the end device's display device so that the user can visually confirm it. Open-source generative AI models (e.g., DALL-E or Stable Diffusion) are used for image generation. 【0202】 Furthermore, the server utilizes natural language processing technology to enable natural conversations with the user. This conversation includes analyzing user input data, recording the conversation content, and learning from it. The Python Transformers library is used for natural language processing. 【0203】 Furthermore, the server operates an emotion analysis device to understand the user's emotional state. This device analyzes the user's input and voice and selects an appropriate response based on their emotions. Emotion analysis uses analysis libraries such as TextBlob. This enables the provision of emotionally harmonious responses to the user. 【0204】 For example, if a user enters "I was happy today," the server's emotion analyzer can detect a positive emotion and generate a response such as, "What a great day! Did something good happen?" 【0205】 An example of a prompt is, "Generate a blonde character with a gentle smile, as the ideal companion." This prompt is used to instruct the image generation model on the specific characteristics of the ideal companion. 【0206】 The flow of a specific process in Application Example 2 will be explained using Figure 14. 【0207】 Step 1: 【0208】 The user uses the terminal interface to input information about the characteristics and traits of their ideal companion. This information includes visual characteristics and personality parameters. The entered data is sent to the server. 【0209】 Step 2: 【0210】 The server analyzes the input data received from the user and creates prompt statements for the generating AI model. These prompt statements are designed to specifically express the characteristics desired by the user. This prepares the instructions for the AI model to generate appropriate images. 【0211】 Step 3: 【0212】 The server uses a generative AI model to generate a target image based on the prompt text. The prompt is input to the image generation model (e.g., DALL-E), and the generated image is received. The obtained image is sent to the terminal for the user to visually confirm. 【0213】 Step 4: 【0214】 The terminal displays the target image received from the server on its screen, enabling the user to visually confirm the image. The user can then examine the generated image and verify whether the desired characteristics are reflected. 【0215】 Step 5: 【0216】 The server uses natural language processing technology to set up the initial interaction with the user. Based on the content of the initial conversation, it builds the context of the conversation and prepares for continuous learning. This enables more natural conversation. 【0217】 Step 6: 【0218】 An emotion analysis device analyzes the user's emotions from their input and voice. The server analyzes the input data and identifies the emotional state. Based on this, the system determines the optimal response. 【0219】 Step 7: 【0220】 Based on the sentiment analysis results, the server optimizes the generated response using natural language processing techniques and sends a reply to the user. If the user provides positive input, it can return a similarly positive response. 【0221】 The specific processing unit 290 transmits the result of the specific processing to the smart device 14. In the smart device 14, the control unit 46A causes the output device 40 to output the result of the specific processing. The microphone 38B acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 38B to the data processing device 12. In the data processing device 12, the specific processing unit 290 acquires the audio data. 【0222】 Data generation model 58 is a so-called generative AI (Artificial Intelligence). An example of data generation model 58 is ChatGPT (registered trademark) (Internet search).<URL: https: / / openai.com / blog / chatgpt> ), Gemini (registered trademark) (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions indicated by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization. 【0223】 In the above embodiment, an example was given in which specific processing is performed by the data processing device 12, but the technology of this disclosure is not limited thereto, and the specific processing may also be performed by the smart device 14. 【0224】 [Second Embodiment] 【0225】 Figure 3 shows an example of the configuration of the data processing system 210 according to the second embodiment. 【0226】 As shown in Figure 3, the data processing system 210 includes a data processing device 12 and smart glasses 214. An example of the data processing device 12 is a server. 【0227】 The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network). 【0228】 The smart glasses 214 include a computer 36, a microphone 238, a speaker 240, a camera 42, and a communication interface 44. The computer 36 includes a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The microphone 238, speaker 240, and camera 42 are also connected to the bus 52. 【0229】 The microphone 238 receives voice signals from the user 20 and receives instructions from the user 20. The microphone 238 captures the voice signals from the user 20, converts the captured voice into audio data, and outputs it to the processor 46. The speaker 240 outputs audio according to the instructions from the processor 46. 【0230】 Camera 42 is a small digital camera equipped with an optical system including a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor, and captures images of the area around the user 20 (for example, an imaging range defined by a field of view equivalent to the width of a typical healthy person's field of vision). 【0231】 Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various information between processor 46 and processor 28 via network 54. The exchange of various information between processor 46 and processor 28 using communication interfaces 44 and 26 is performed in a secure manner. 【0232】 Figure 4 shows an example of the main functions of the data processing device 12 and the smart glasses 214. As shown in Figure 4, the data processing device 12 performs specific processing using the processor 28. The storage 32 stores the specific processing program 56. 【0233】 The specific processing program 56 is an example of a "program" relating to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 in accordance with the specific processing program 56 executed on the RAM 30. 【0234】 The storage 32 stores the data generation model 58 and the emotion identification model 59. The data generation model 58 and the emotion identification model 59 are used by the identification processing unit 290. 【0235】 In the smart glasses 214, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48. 【0236】 Next, the identification processing performed by the identification processing unit 290 of the data processing device 12 will be described. In the following description, the data processing device 12 will be referred to as the "server" and the smart glasses 214 will be referred to as the "terminal". 【0237】 This invention provides a romance simulation system that enriches the user experience, with the user, terminal, and server working together to realize each function. 【0238】 First, the user inputs information about their ideal romantic partner's appearance and personality through the device's interface. The device receives this information and sends it to the server in the specified data format. 【0239】 The server uses an image generation model based on the received data to generate a person image with the features specified by the user. This image generation process employs machine learning algorithms to faithfully reproduce the shape and style desired by the user. The generated image is sent back to the user's device, allowing the user to confirm their visual representation of their romantic partner. 【0240】 Next, the server starts up its natural language processing model and prepares to interact with the user. It customizes the conversation style based on the personality traits entered by the user and sets up an initial conversation scenario. 【0241】 When a user starts a conversation via the chat interface, the server analyzes the user's input as the conversation progresses and instantly generates and returns natural-sounding responses. This gives the user the feeling of interacting with their ideal partner. 【0242】 Furthermore, the server records all conversations and uses machine learning techniques to continuously train the conversation model, improving the accuracy of future conversations. For example, if a user previously said "I like the sea," the server can generate context-related questions in the next conversation, such as "Have you been to the sea recently?" 【0243】 Through this interaction, users can engage with their ideal character and enjoy a personalized romance simulation experience. This invention provides the opportunity to experience ideal interactions in a virtual environment that are difficult to achieve in the real world. 【0244】 The following describes the processing flow. 【0245】 Step 1: 【0246】 The user inputs information about their ideal romantic partner's appearance and personality through the device's interface. 【0247】 Step 2: 【0248】 The terminal receives input from the user and sends it to the server in a formatted data format. 【0249】 Step 3: 【0250】 The server analyzes the received data and uses an image generation model to generate images of people based on features specified by the user. 【0251】 Step 4: 【0252】 The server sends the generated image of a person to the terminal, and the terminal displays it to the user. 【0253】 Step 5: 【0254】 The server activates a natural language processing model and customizes the conversational agent based on the user's personality settings. 【0255】 Step 6: 【0256】 When a user initiates a conversation via their device, the server analyzes the user's input and generates an appropriate response. 【0257】 Step 7: 【0258】 The server records the content of the conversation with the user in a database and provides feedback to the model for continuous learning. 【0259】 Step 8: 【0260】 Based on the conversation data collected by the server, the entire system is improved so that users can enjoy a more natural and personalized experience in subsequent interactions. 【0261】 (Example 1) 【0262】 Next, we will describe Example 1. In the following description, the data processing device 12 will be referred to as the "server," and the smart glasses 214 will be referred to as the "terminal." 【0263】 Current interactive systems with virtual characters lack the ability to easily generate characters with the appearance and personality desired by the user and to provide natural dialogue based on them. Furthermore, they suffer from insufficient continuity of dialogue and poor understanding of context, limiting the user experience. Additionally, they are unable to leverage the user's past statements to generate more relevant conversations, making it difficult to continuously improve user satisfaction. 【0264】 The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 1 is realized by the following means. 【0265】 In this invention, the server includes means for acquiring information about features input by the user, means for generating an image with ideal features using a generative model based on the user's input, and means for using a language processing model to enable natural interaction with the user. This enables the generation of a character with the appearance and personality desired by the user, and natural, contextual interaction with that character. 【0266】 A "user" is an individual or end-user who utilizes the system and is the entity that provides input and interacts with the simulation. 【0267】 "Characteristics" refer to information such as specific attributes, personality traits, and appearance that users input, and are elements that influence character creation. 【0268】 A "generative model" is an algorithm or software that creates realistic images or visual outputs based on user input. 【0269】 An "image" is a visual representation created by a generative model, depicting a character with characteristics specified by the user. 【0270】 "Display" refers to the process of visually unfolding and making viewable the generated image on the user's device. 【0271】 "Dialogue" refers to communication between a user and a system, including exchanges via text and voice. 【0272】 A "language processing model" is an algorithm or software that processes text-based input and generates appropriate responses to enable natural communication with users. 【0273】 "Memory" refers to the act of storing past conversations and interactions with users in a database or similar system, for use in future interactions. 【0274】 "Learning" is the process of analyzing accumulated dialogue data and improving algorithms to enhance the system's capabilities. 【0275】 The system of this invention allows the user to input ideal characteristics using a terminal interface, and then uses a generative AI model to create a virtual character based on that input, enabling natural interaction. Specifically, the server and terminal work together to provide the user with a personalized simulation experience. 【0276】 The user first inputs information about the characteristics of the character they want, such as "black hair" and "kind personality," through the input device on the terminal. This information is collected by the terminal and sent to the server via the internet. 【0277】 The server processes the received data and uses a generative AI model to generate images of characters with specified features. Software used here includes Stable Diffusion and similar image generation algorithms. This enables visual simulations tailored to user requirements. 【0278】 The generated image is sent from the server to the terminal and displayed so that the user can visually confirm it. The user can use this image to enjoy an interactive virtual experience. 【0279】 Furthermore, the server utilizes a language processing model to achieve natural conversations. The server analyzes the input from the user and immediately generates a response based on an optimized conversation model. As a result, the user can enjoy a natural conversation with the generated character. Examples of language processing models used here include GPT-3. 【0280】 As a specific example, an example of a prompt sentence is as follows: "Please create a visual and conversation style based on the characteristics of an ideal partner." Based on this, the user can obtain a new conversation experience in a virtual environment. 【0281】 This system is designed to provide ideal interactions that are difficult for users to experience in reality and to realize individualized love simulations. 【0282】 The flow of specific processing in Example 1 will be described using FIG. 11. 【0283】 Step 1: 【0284】 The user inputs the characteristics of an ideal character through the interface of the terminal. The input content includes appearance, personality, etc. The terminal converts this information into data in JSON format and sends it to the server. In this step, the user's characteristic request is taken as input, and formatted request data is generated as output. 【0285】 Step 2: 【0286】 The server analyzes the JSON data received from the terminal and generates an image of a character with the features specified by the user using a generative AI model. As a specific operation, it synthesizes visuals based on the specified attributes by utilizing an algorithm such as Stable Diffusion. The input is the user's feature data, and the output is the generated character image. 【0287】 Step 3: 【0288】 The generated image is sent from the server to the terminal. The terminal displays the received image on a display device so that the user can visually confirm it. In this step, the input is the generated image data, and the output is to visually present it to the user. 【0289】 Step 4: 【0290】 The user starts interacting with the character through the terminal. The server activates a language processing model and generates a natural response based on the user's input. At this time, a model such as GPT-3 is used. The input is the user's message, and the output is the generated response that is returned. The server analyzes the input in real time to ensure a natural and relevant conversation. 【0291】 Step 5: 【0292】 The server records the conversation content in a database and continuously improves the conversation model through machine learning. Through this learning process, the past conversation data is utilized to improve the accuracy of the next conversation. The input is the past conversation data, and the output is the improved conversation algorithm. 【0293】 Through such processing steps, the user can obtain a highly customized virtual experience. 【0294】 (Application Example 1) 【0295】 Next, we will explain Application Example 1. In the following explanation, the data processing device 12 will be referred to as the "server," and the smart glasses 214 will be referred to as the "terminal." 【0296】 In the modern era, the means by which users can interact intimately with their ideal characters are limited, and integrating that experience across virtual and physical environments is particularly difficult. There is a need for a system that provides physically actionable dialogue in accordance with user expectations. 【0297】 The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 1 is realized by the following means. 【0298】 In this invention, the server includes means for acquiring information about characteristics entered by the user, means for generating a person image with an ideal appearance using an image generation algorithm based on the user's input, and means for controlling a humanoid automated device to act as the user's ideal partner. This makes it possible for the user to experience a virtual lover in the real world. 【0299】 A "user" is an individual who seeks to interact with their ideal character using the system. 【0300】 "Characteristics" refer to information about the ideal appearance and personality that the user enters. 【0301】 "Means of acquiring information" refers to an interface for collecting requests and information entered by the user. 【0302】 An "image generation algorithm" is a machine learning model that creates images of people with ideal appearances based on user requests. 【0303】 "Means for generating human images" refers to the process of creating digital images using image generation algorithms. 【0304】 The "Natural Language Processing Algorithm" is a language analysis and response generation technology that enables natural interaction between users and systems. 【0305】 The "Humanoid-Type Automatic Device" is a humanoid device that physically acts based on user instructions. 【0306】 The "Means for Acting as an Ideal Partner" is a mechanism that controls the humanoid-type automatic device based on user preferences and causes it to perform specific actions and interactions. 【0307】 To realize this invention, the user first inputs the characteristics of the ideal character through the interface of the terminal. The characteristics include details regarding appearance and personality. This information is collected by the terminal and transmitted to the server. 【0308】 On the server, based on the collected information, an image generation algorithm is activated, and a portrait of a person with the appearance desired by the user is generated. In this process, image generation software such as Stable Diffusion is used. The generated image is transmitted to the user's terminal, enabling visual confirmation. 【0309】 Furthermore, the server uses the natural language processing algorithm to adjust the content of the interaction with the user. Here, technologies such as OpenAI GPT are utilized to analyze the flow of the conversation and generate natural responses according to the user input. The generated responses are sent to the humanoid-type automatic device, and the device acts as the user's ideal character. 【0310】 As a specific example, when the user inputs "a partner with a kind personality who likes the outdoors", the device can talk to the user in a gentle voice, saying "Have you been hiking recently?" Through such interactions, the user can experience a virtual lover in the real world. 【0311】 Examples of prompts used for the generating AI model include sentences like, "Please create a dialogue scenario with a character that has the personality and hobbies desired by the user." 【0312】 The flow of a specific process in Application Example 1 will be explained using Figure 12. 【0313】 Step 1: 【0314】 The user uses an interface via their device to input the characteristics of their ideal character. Specifically, they enter text information about the character's appearance and personality into the input fields. This input is stored on the device as data necessary for subsequent processing. 【0315】 Step 2: 【0316】 The terminal converts the collected user characteristic information into a specified data format and sends it to the server. This process is carried out by a data transmission protocol, and the server analyzes this information for use in subsequent processing. 【0317】 Step 3: 【0318】 The server activates an image generation algorithm based on the characteristic information received from the user. It processes the input characteristics to create an image generation prompt, and then uses an image generation model (e.g., Stable Diffusion) based on that prompt to generate an image of a person with an ideal appearance. The output image is sent back to the terminal for visual confirmation. 【0319】 Step 4: 【0320】 The user reviews the character image received on their device. They check whether the character's appearance matches their preferences, and if dissatisfied, they adjust the characteristics and re-enter the information. 【0321】 Step 5: 【0322】 The server invokes natural language processing algorithms to enable natural interaction with the user. Based on the characteristics input by the user, it generates appropriate conversation setting prompts and creates a dialogue scenario using a natural language model (e.g., OpenAI GPT). This output is used for subsequent processing as response data for the conversation. 【0323】 Step 6: 【0324】 The server transmits the generated dialogue response data to a humanoid automated device. This device initiates actions, including specific voice tones and gestures, to interact with the user. Through this device, the user can experience interaction with a virtual character in the real world. 【0325】 Furthermore, an emotion engine that estimates the user's emotions may be incorporated. That is, the identification processing unit 290 may use the emotion identification model 59 to estimate the user's emotions and perform identification processing using the user's emotions. 【0326】 This invention realizes a system in a romance simulation game that enables emotionally rich dialogue with the user's desired ideal romantic partner. The entire system includes image generation based on user input, natural language processing to enable natural conversation, and an emotion engine to recognize the user's emotions. 【0327】 First, the user uses the interface on their device to input visual characteristics and personality parameters as traits of their ideal romantic partner. This information is sent from the device to a server, where it is analyzed. 【0328】 The server invokes an image generation model based on the input data and generates a human image that faithfully represents the specified features. This image is sent back to the terminal, allowing the user to visually confirm their ideal character. 【0329】 Next, the server applies a natural language processing model to deploy a conversational agent that incorporates the user's personality information. This model is designed to record the content of conversations with the user and optimize response generation for future interactions. 【0330】 Furthermore, an emotion engine operates to analyze emotions from user input and voice. This emotion analysis is performed using an algorithm that identifies emotional states based on the timing of text input, the tone of words used, or, in the case of voice input, the tone and speed. 【0331】 For example, if a user enters "I'm feeling good today," the emotion engine can detect a positive emotion, and the server can generate a correspondingly cheerful response. If the user enters "I'm a little tired today," the emotion engine will detect a negative emotion, and the natural language processing model will react by changing the response to something more encouraging. 【0332】 A system configured in this way can provide users with realistic and emotional interactions, enhancing immersion in dating simulations. Users can experience more engaging and fulfilling interactions by receiving responses that take into account their own emotions and moods. 【0333】 The following describes the processing flow. 【0334】 Step 1: 【0335】 The user uses an interface on their device to input visual characteristics and personality information of their ideal romantic partner. 【0336】 Step 2: 【0337】 The device formats the data collected from the user and sends it to the server. 【0338】 Step 3: 【0339】 The server analyzes the received data, activates an image generation model, and generates a person image based on the input features. 【0340】 Step 4: 【0341】 The server generates image data of a person and sends it back to the terminal, which then visually presents the image to the user. 【0342】 Step 5: 【0343】 The server uses a natural language processing model to set up a conversational agent based on the user's personality information. 【0344】 Step 6: 【0345】 The user enters text or voice input through the device's conversation interface and sends it to the server. 【0346】 Step 7: 【0347】 The server analyzes the input information through a natural language processing model and uses an emotion engine to determine the user's emotional state. 【0348】 Step 8: 【0349】 Based on the emotional data determined by the emotion engine, the server generates a response adapted to the user's emotions and sends it to the terminal. 【0350】 Step 9: 【0351】 The terminal receives a response from the server and presents it to the user either visually or audibly. 【0352】 Step 10: 【0353】 The server records conversation history and sentiment data in a database, and continuously uses it to improve the accuracy of future responses. 【0354】 (Example 2) 【0355】 Next, we will describe Example 2. In the following description, the data processing device 12 will be referred to as the "server" and the smart glasses 214 will be referred to as the "terminal". 【0356】 Traditional romance simulation games have only been able to offer users generalized characters and dialogue, making it difficult to achieve interactions that reflect the user's individual emotions and desires. As a result, users were unable to fully immerse themselves, and the simulation experience was limited. There is a need for a system that can solve this problem and enable more individualized and emotionally rich dialogue. 【0357】 The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 2 is realized by the following means. 【0358】 In this invention, the server includes means for acquiring information on characteristics and temperament entered by the user, means for generating a character image with ideal characteristics using digital image generation technology based on the user's input, and means for presenting the character image. This enables the provision of visuals and conversational content based on the user's individual preferences, resulting in an immersive simulation experience. 【0359】 "Characteristics" refer to specific features related to the appearance and behavior of the character that the user considers ideal. 【0360】 "Temperament" refers to specific attributes related to the personality and disposition of the character that the user idealizes. 【0361】 "Digital image generation technology" refers to technology that uses a computer to automatically generate character images based on characteristics input by the user. 【0362】 A "character image" refers to a visual digital image generated based on characteristics specified by the user. 【0363】 "Presentation" refers to the act of displaying the generated character image so that the user can visually confirm it. 【0364】 "Language processing technology" refers to the technology that processes language data to enable natural dialogue with users, performing semantic analysis and response generation. 【0365】 An "emotion engine" refers to a system that analyzes emotions from user input and voice, identifies those emotions, and utilizes that information. 【0366】 This invention realizes a system that enables users to create their ideal romantic partner in a dating simulation game. Users input information about the characteristics and temperament of their ideal partner through the terminal interface. For example, it is possible to specify visual characteristics such as "black hair and large eyes" or personality parameters such as "highly kind and moderately intelligent." 【0367】 User input is transmitted to the server via the terminal. The server utilizes digital image generation technology based on the received information and generates the ideal character image using a generation AI model. Specifically, the generation AI model uses generation technology that can faithfully reproduce a variety of features. 【0368】 The generated character image is sent back to the device, allowing the user to visually confirm the image. Furthermore, the server applies language processing technology based on the temperament specified by the user to set up a conversational agent that enables natural dialogue. This allows the user to enjoy personalized conversations. 【0369】 In addition, the emotion engine analyzes the user's emotions from text and voice input, and the server generates a response based on those emotions. For example, if the user inputs "I'm happy today," the emotion engine can capture that positive emotion and generate a cheerful response. 【0370】 Examples of prompts include, "What are the physical characteristics of your ideal partner?" and "Please tell me how you feel today." 【0371】 With a system configured in this way, users can experience interactions that match their desires and emotions, leading to a deeper sense of immersion and satisfaction in romance simulation games. 【0372】 The flow of the specific processing in Example 2 will be explained using Figure 13. 【0373】 Step 1: 【0374】 The user uses the terminal's interface to input information about the characteristics and temperament of their ideal character. This input data includes physical features (e.g., hair color, eye shape) and personality parameters (e.g., kindness, intelligence). The terminal collects this input data and prepares to send it to the server. 【0375】 Step 2: 【0376】 The terminal sends the collected user input data to the server. The server receives this data and performs analysis. The analysis involves data processing to determine how the input characteristics will be reflected in the digital image. Based on these results, the system is ready to use digital image generation technology. 【0377】 Step 3: 【0378】 The server uses a generative AI model based on the analysis results to generate a character image that reflects the user's characteristics. The generative AI model used here incorporates an algorithm that accurately reproduces features. The generated character image is output as digital data and sent to the terminal. 【0379】 Step 4: 【0380】 The terminal receives character images sent from the server and presents them to the user. The user can visually confirm these images. This process allows the user to concretely evaluate the appearance of their ideal character. 【0381】 Step 5: 【0382】 The server uses language processing techniques to configure conversational agents based on temperament information input from the user. Here, natural language processing algorithms are used to construct conversation scenarios that enable natural dialogue with the user. This allows users to experience smooth communication when interacting with the characters. 【0383】 Step 6: 【0384】 The server uses an emotion engine to analyze the user's text and voice input and identify their emotional state. Emotional data is extracted by analyzing the input words and tone. This analysis makes it possible to generate appropriate responses that match the user's emotions. 【0385】 Step 7: 【0386】 The server generates a response based on the emotion engine's output and sends it to the terminal. The terminal receives this response and displays it to the user. This allows the user to experience a realistic and rich dialogue that resonates with their emotions. 【0387】 (Application Example 2) 【0388】 Next, we will explain application example 2. In the following explanation, the data processing device 12 will be referred to as the "server," and the smart glasses 214 will be referred to as the "terminal." 【0389】 There is a challenge in providing a system that allows users to interactively engage in emotionally rich conversations with their ideal companion. In particular, there is a need to generate personalized responses that respond to the user's emotional state and provide a more immersive experience. Existing systems have struggled to accurately understand the user's emotions and provide natural interactions accordingly. 【0390】 The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 2 is realized by the following means. 【0391】 In this invention, the server includes means for acquiring information on characteristics and properties entered by the user, means for generating a target image with ideal characteristics using an image generation device based on the user's input, and means for displaying the target image. This allows the user to enjoy emotionally harmonious dialogue in real time, enabling a more engaging interaction experience. 【0392】 A "user" is a person who operates this system and inputs information about ideal characteristics and traits. 【0393】 "Input" refers to the act of a user providing information about characteristics and properties to the system. 【0394】 "Characteristics" refer to the specific physical and personality traits that the user's ideal partner should possess. 【0395】 "Information" refers to data that a user provides to the system, including details about its characteristics and properties. 【0396】 An "image generation device" is a system component that generates an image embodying specified characteristics based on user input. 【0397】 The "target image" is a visual representation of a person with ideal characteristics, generated by an image generation device. 【0398】 "Displaying" refers to the act of showing the generated target image in a way that allows the user to visually confirm it. 【0399】 "Natural language processing technology" is a language analysis technology that enables natural conversations with users. 【0400】 "Dialogue" refers to two-way communication that takes place between the user and the system. 【0401】 An "emotion analysis device" is a system component that analyzes emotions from user input or voice and selects an appropriate response based on that analysis. 【0402】 A "response" is a reply that a system generates in response to user input or state. 【0403】 "Recording" refers to the act of saving the content of conversations with users for later reference and learning. 【0404】 "Learning" is the process by which a system uses past dialogue data to improve the accuracy and naturalness of its responses. 【0405】 In carrying out this invention, the system consists of a user, a server, and an end device. First, the user inputs information about the characteristics and properties of the ideal companion through an interface on the end device. This input includes visual characteristics and personality traits. 【0406】 The server receives the user's input data and first uses an image generator to produce a target image that embodies the specified characteristics. The generated target image is sent to the end device's display device so that the user can visually confirm it. Open-source generative AI models (e.g., DALL-E or Stable Diffusion) are used for image generation. 【0407】 Furthermore, the server utilizes natural language processing technology to enable natural conversations with the user. This conversation includes analyzing user input data, recording the conversation content, and learning from it. The Python Transformers library is used for natural language processing. 【0408】 Furthermore, the server operates an emotion analysis device to understand the user's emotional state. This device analyzes the user's input and voice and selects an appropriate response based on their emotions. Emotion analysis uses analysis libraries such as TextBlob. This enables the provision of emotionally harmonious responses to the user. 【0409】 For example, if a user enters "I was happy today," the server's emotion analyzer can detect a positive emotion and generate a response such as, "What a great day! Did something good happen?" 【0410】 An example of a prompt is, "Generate a blonde character with a gentle smile, as the ideal companion." This prompt is used to instruct the image generation model on the specific characteristics of the ideal companion. 【0411】 The flow of a specific process in Application Example 2 will be explained using Figure 14. 【0412】 Step 1: 【0413】 The user uses the terminal interface to input information about the characteristics and traits of their ideal companion. This information includes visual characteristics and personality parameters. The entered data is sent to the server. 【0414】 Step 2: 【0415】 The server analyzes the input data received from the user and creates prompt statements for the generating AI model. These prompt statements are designed to specifically express the characteristics desired by the user. This prepares the instructions for the AI model to generate appropriate images. 【0416】 Step 3: 【0417】 The server uses a generative AI model to generate a target image based on the prompt text. The prompt is input to the image generation model (e.g., DALL-E), and the generated image is received. The obtained image is sent to the terminal for the user to visually confirm. 【0418】 Step 4: 【0419】 The terminal displays the target image received from the server on its screen, enabling the user to visually confirm the image. The user can then examine the generated image and verify whether the desired characteristics are reflected. 【0420】 Step 5: 【0421】 The server uses natural language processing technology to set up the initial interaction with the user. Based on the content of the initial conversation, it builds the context of the conversation and prepares for continuous learning. This enables more natural conversation. 【0422】 Step 6: 【0423】 An emotion analysis device analyzes the user's emotions from their input and voice. The server analyzes the input data and identifies the emotional state. Based on this, the system determines the optimal response. 【0424】 Step 7: 【0425】 Based on the sentiment analysis results, the server optimizes the generated response using natural language processing techniques and sends a reply to the user. If the user provides positive input, it can return a similarly positive response. 【0426】 The specific processing unit 290 transmits the result of the specific processing to the smart glasses 214. In the smart glasses 214, the control unit 46A causes the speaker 240 to output the result of the specific processing. The microphone 238 acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 238 to the data processing unit 12. In the data processing unit 12, the specific processing unit 290 acquires the audio data. 【0427】 Data generation model 58 is a type of so-called generative AI (Artificial Intelligence). An example of data generation model 58 is ChatGPT (Internet Search<URL: https: / / openai.com / blog / chatgpt> ), Gemini (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions indicated by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization. 【0428】 In the above embodiment, an example was given in which specific processing is performed by the data processing device 12, but the technology of this disclosure is not limited thereto, and the specific processing may also be performed by the smart glasses 214. 【0429】 [Third Embodiment] 【0430】 Figure 5 shows an example of the configuration of the data processing system 310 according to the third embodiment. 【0431】 As shown in Figure 5, the data processing system 310 includes a data processing device 12 and a headset terminal 314. An example of the data processing device 12 is a server. 【0432】 The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network). 【0433】 The headset terminal 314 includes a computer 36, a microphone 238, a speaker 240, a camera 42, a communication interface 44, and a display 343. The computer 36 includes a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The microphone 238, speaker 240, camera 42, and display 343 are also connected to the bus 52. 【0434】 The microphone 238 receives voice signals from the user 20 and receives instructions from the user 20. The microphone 238 captures the voice signals from the user 20, converts the captured voice into audio data, and outputs it to the processor 46. The speaker 240 outputs audio according to the instructions from the processor 46. 【0435】 Camera 42 is a small digital camera equipped with an optical system including a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor, and captures images of the area around the user 20 (for example, an imaging range defined by a field of view equivalent to the width of a typical healthy person's field of vision). 【0436】 Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various information between processor 46 and processor 28 via network 54. The exchange of various information between processor 46 and processor 28 using communication interfaces 44 and 26 is performed in a secure manner. 【0437】 Figure 6 shows an example of the main functions of the data processing device 12 and the headset terminal 314. As shown in Figure 6, the data processing device 12 performs specific processing using the processor 28. The storage 32 stores the specific processing program 56. 【0438】 The specific processing program 56 is an example of a "program" relating to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 in accordance with the specific processing program 56 executed on the RAM 30. 【0439】 The storage 32 stores the data generation model 58 and the emotion identification model 59. The data generation model 58 and the emotion identification model 59 are used by the identification processing unit 290. 【0440】 In the headset terminal 314, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48. 【0441】 Next, the specific processing performed by the specific processing unit 290 of the data processing device 12 will be described. In the following description, the data processing device 12 will be referred to as the "server" and the headset terminal 314 will be referred to as the "terminal". 【0442】 This invention provides a romance simulation system that enriches the user experience, with the user, terminal, and server working together to realize each function. 【0443】 First, the user inputs information about their ideal romantic partner's appearance and personality through the device's interface. The device receives this information and sends it to the server in the specified data format. 【0444】 The server uses an image generation model based on the received data to generate a person image with the features specified by the user. This image generation process employs machine learning algorithms to faithfully reproduce the shape and style desired by the user. The generated image is sent back to the user's device, allowing the user to confirm their visual representation of their romantic partner. 【0445】 Next, the server starts up its natural language processing model and prepares to interact with the user. It customizes the conversation style based on the personality traits entered by the user and sets up an initial conversation scenario. 【0446】 When a user starts a conversation via the chat interface, the server analyzes the user's input as the conversation progresses and instantly generates and returns natural-sounding responses. This gives the user the feeling of interacting with their ideal partner. 【0447】 Furthermore, the server records all conversations and uses machine learning techniques to continuously train the conversation model, improving the accuracy of future conversations. For example, if a user previously said "I like the sea," the server can generate context-related questions in the next conversation, such as "Have you been to the sea recently?" 【0448】 Through this interaction, users can engage with their ideal character and enjoy a personalized romance simulation experience. This invention provides the opportunity to experience ideal interactions in a virtual environment that are difficult to achieve in the real world. 【0449】 The following describes the processing flow. 【0450】 Step 1: 【0451】 The user inputs information about their ideal romantic partner's appearance and personality through the device's interface. 【0452】 Step 2: 【0453】 The terminal receives input from the user and sends it to the server in a formatted data format. 【0454】 Step 3: 【0455】 The server analyzes the received data and uses an image generation model to generate images of people based on features specified by the user. 【0456】 Step 4: 【0457】 The server sends the generated image of a person to the terminal, and the terminal displays it to the user. 【0458】 Step 5: 【0459】 The server activates a natural language processing model and customizes the conversational agent based on the user's personality settings. 【0460】 Step 6: 【0461】 When a user initiates a conversation via their device, the server analyzes the user's input and generates an appropriate response. 【0462】 Step 7: 【0463】 The server records the content of the conversation with the user in a database and provides feedback to the model for continuous learning. 【0464】 Step 8: 【0465】 Based on the conversation data collected by the server, the entire system is improved so that users can enjoy a more natural and personalized experience in subsequent interactions. 【0466】 (Example 1) 【0467】 Next, we will describe Example 1. In the following description, the data processing device 12 will be referred to as the "server," and the headset-type terminal 314 will be referred to as the "terminal." 【0468】 Current interactive systems with virtual characters lack the ability to easily generate characters with the appearance and personality desired by the user and to provide natural dialogue based on them. Furthermore, they suffer from insufficient continuity of dialogue and poor understanding of context, limiting the user experience. Additionally, they are unable to leverage the user's past statements to generate more relevant conversations, making it difficult to continuously improve user satisfaction. 【0469】 The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 1 is realized by the following means. 【0470】 In this invention, the server includes means for acquiring information about features input by the user, means for generating an image with ideal features using a generative model based on the user's input, and means for using a language processing model to enable natural interaction with the user. This enables the generation of a character with the appearance and personality desired by the user, and natural, contextual interaction with that character. 【0471】 A "user" is an individual or end-user who utilizes the system and is the entity that provides input and interacts with the simulation. 【0472】 "Characteristics" refer to information such as specific attributes, personality traits, and appearance that users input, and are elements that influence character creation. 【0473】 A "generative model" is an algorithm or software that creates realistic images or visual outputs based on user input. 【0474】 An "image" is a visual representation created by a generative model, depicting a character with characteristics specified by the user. 【0475】 "Display" refers to the process of visually unfolding and making viewable the generated image on the user's device. 【0476】 "Dialogue" refers to communication between a user and a system, including exchanges via text and voice. 【0477】 A "language processing model" is an algorithm or software that processes text-based input and generates appropriate responses to enable natural communication with users. 【0478】 "Memory" refers to the act of storing past conversations and interactions with users in a database or similar system, for use in future interactions. 【0479】 "Learning" is the process of analyzing accumulated dialogue data and improving algorithms to enhance the system's capabilities. 【0480】 The system of this invention allows the user to input ideal characteristics using a terminal interface, and then uses a generative AI model to create a virtual character based on that input, enabling natural interaction. Specifically, the server and terminal work together to provide the user with a personalized simulation experience. 【0481】 The user first inputs information about the characteristics of the character they want, such as "black hair" and "kind personality," through the input device on the terminal. This information is collected by the terminal and sent to the server via the internet. 【0482】 The server processes the received data and uses a generative AI model to generate images of characters with specified features. Software used here includes Stable Diffusion and similar image generation algorithms. This enables visual simulations tailored to user requirements. 【0483】 The generated image is sent from the server to the terminal and displayed for the user to visually confirm. The user can then use this image to enjoy an interactive virtual experience. 【0484】 Furthermore, the server utilizes language processing models to achieve natural dialogue. The server analyzes user input and instantly generates responses based on an optimized dialogue model. This allows users to enjoy natural conversations with the generated character. Examples of language processing models used here include GPT-3. 【0485】 As a concrete example, an example of a prompt is as follows: "Create a visual and conversational style based on the characteristics of your ideal partner." Based on this, the user can gain a new conversational experience in a virtual environment. 【0486】 This system is designed to provide users with ideal interactions that are difficult to experience in reality, and to realize personalized romance simulations. 【0487】 The flow of the specific processing in Example 1 will be explained using Figure 11. 【0488】 Step 1: 【0489】 The user inputs the characteristics of their ideal character through the terminal's interface. This input includes details such as appearance and personality. The terminal converts this information into JSON data and sends it to the server. In this step, the user's characteristic request is taken as input, and formatted request data is generated as output. 【0490】 Step 2: 【0491】 The server parses the JSON data received from the terminal and uses a generative AI model to generate an image of a character with features specified by the user. Specifically, it uses an algorithm such as Stable Diffusion to synthesize a visual based on the specified attributes. The input is the user's feature data, and the output is the generated character image. 【0492】 Step 3: 【0493】 The generated image is sent from the server to the terminal. The terminal displays the received image on its display device, allowing the user to visually confirm it. In this step, the generated image data is received as input, and presented visually to the user as output. 【0494】 Step 4: 【0495】 The user initiates interaction with the character through the terminal. The server activates a language processing model and generates natural responses based on the user's input. A model such as GPT-3 is used in this process. The user's message is used as input, and the generated response is returned as output. The server analyzes the input in real time to ensure natural and relevant dialogue. 【0496】 Step 5: 【0497】 The server records the conversation content in a database and continuously improves the conversation model through machine learning. This learning process leverages past conversation data to improve the accuracy of subsequent conversations. Past conversation data is used as input, and an improved conversation algorithm is obtained as output. 【0498】 These processing steps allow users to obtain a highly customized virtual experience. 【0499】 (Application Example 1) 【0500】 Next, we will explain Application Example 1. In the following explanation, the data processing device 12 will be referred to as the "server," and the headset-type terminal 314 will be referred to as the "terminal." 【0501】 In the modern era, the means by which users can interact intimately with their ideal characters are limited, and integrating that experience across virtual and physical environments is particularly difficult. There is a need for a system that provides physically actionable dialogue in accordance with user expectations. 【0502】 The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 1 is realized by the following means. 【0503】 In this invention, the server includes means for acquiring information about characteristics entered by the user, means for generating a person image with an ideal appearance using an image generation algorithm based on the user's input, and means for controlling a humanoid automated device to act as the user's ideal partner. This makes it possible for the user to experience a virtual lover in the real world. 【0504】 A "user" is an individual who seeks to interact with their ideal character using the system. 【0505】 "Characteristics" refer to information about the ideal appearance and personality that the user enters. 【0506】 "Means of acquiring information" refers to an interface for collecting requests and information entered by the user. 【0507】 An "image generation algorithm" is a machine learning model that creates images of people with ideal appearances based on user requests. 【0508】 "Means for generating human images" refers to the process of creating digital images using image generation algorithms. 【0509】 A "natural language processing algorithm" is a language analysis and response generation technique that enables natural dialogue between the user and the system. 【0510】 A "humanoid automated device" is a human-shaped device that physically acts based on user instructions. 【0511】 "Means of making it act as an ideal partner" refers to a mechanism that controls a humanoid automated device based on the user's wishes, causing it to perform specific actions or engage in dialogue. 【0512】 To realize this invention, the user first inputs the characteristics of their ideal character through the terminal's interface. These characteristics include details about appearance and personality. This information is collected by the terminal and sent to the server. 【0513】 On the server, an image generation algorithm is activated based on the collected information, and an image of a person with the appearance desired by the user is generated. Image generation software such as Stable Diffusion is used in this process. The generated image is sent to the user's terminal for visual confirmation. 【0514】 Furthermore, the server uses natural language processing algorithms to refine the content of the interaction with the user. Technologies such as OpenAI GPT are utilized here to analyze the flow of the conversation and generate natural responses that correspond to user input. The generated responses are sent to a humanoid robot, which then behaves as the user's ideal character. 【0515】 For example, if a user enters "a kind person who likes the outdoors," the device can respond in a gentle voice, "Have you been hiking recently?" Through such interactions, users can experience a virtual romantic partner in the real world. 【0516】 Examples of prompts used for the generating AI model include sentences like, "Please create a dialogue scenario with a character that has the personality and hobbies desired by the user." 【0517】 The flow of a specific process in Application Example 1 will be explained using Figure 12. 【0518】 Step 1: 【0519】 The user uses an interface via their device to input the characteristics of their ideal character. Specifically, they enter text information about the character's appearance and personality into the input fields. This input is stored on the device as data necessary for subsequent processing. 【0520】 Step 2: 【0521】 The terminal converts the collected user characteristic information into a specified data format and sends it to the server. This process is carried out by a data transmission protocol, and the server analyzes this information for use in subsequent processing. 【0522】 Step 3: 【0523】 The server activates an image generation algorithm based on the characteristic information received from the user. It processes the input characteristics to create an image generation prompt, and then uses an image generation model (e.g., Stable Diffusion) based on that prompt to generate an image of a person with an ideal appearance. The output image is sent back to the terminal for visual confirmation. 【0524】 Step 4: 【0525】 The user reviews the character image received on their device. They check whether the character's appearance matches their preferences, and if dissatisfied, they adjust the characteristics and re-enter the information. 【0526】 Step 5: 【0527】 The server invokes natural language processing algorithms to enable natural interaction with the user. Based on the characteristics input by the user, it generates appropriate conversation setting prompts and creates a dialogue scenario using a natural language model (e.g., OpenAI GPT). This output is used for subsequent processing as response data for the conversation. 【0528】 Step 6: 【0529】 The server transmits the generated dialogue response data to a humanoid automated device. This device initiates actions, including specific voice tones and gestures, to interact with the user. Through this device, the user can experience interaction with a virtual character in the real world. 【0530】 Furthermore, an emotion engine that estimates the user's emotions may be incorporated. That is, the identification processing unit 290 may use the emotion identification model 59 to estimate the user's emotions and perform identification processing using the user's emotions. 【0531】 This invention realizes a system in a romance simulation game that enables emotionally rich dialogue with the user's desired ideal romantic partner. The entire system includes image generation based on user input, natural language processing to enable natural conversation, and an emotion engine to recognize the user's emotions. 【0532】 First, the user uses the interface on their device to input visual characteristics and personality parameters as traits of their ideal romantic partner. This information is sent from the device to a server, where it is analyzed. 【0533】 The server invokes an image generation model based on the input data and generates a human image that faithfully represents the specified features. This image is sent back to the terminal, allowing the user to visually confirm their ideal character. 【0534】 Next, the server applies a natural language processing model to deploy a conversational agent that incorporates the user's personality information. This model is designed to record the content of conversations with the user and optimize response generation for future interactions. 【0535】 Furthermore, an emotion engine operates to analyze emotions from user input and voice. This emotion analysis is performed using an algorithm that identifies emotional states based on the timing of text input, the tone of words used, or, in the case of voice input, the tone and speed. 【0536】 For example, if a user enters "I'm feeling good today," the emotion engine can detect a positive emotion, and the server can generate a correspondingly cheerful response. If the user enters "I'm a little tired today," the emotion engine will detect a negative emotion, and the natural language processing model will react by changing the response to something more encouraging. 【0537】 A system configured in this way can provide users with realistic and emotional interactions, enhancing immersion in dating simulations. Users can experience more engaging and fulfilling interactions by receiving responses that take into account their own emotions and moods. 【0538】 The following describes the processing flow. 【0539】 Step 1: 【0540】 The user uses an interface on their device to input visual characteristics and personality information of their ideal romantic partner. 【0541】 Step 2: 【0542】 The device formats the data collected from the user and sends it to the server. 【0543】 Step 3: 【0544】 The server analyzes the received data, activates an image generation model, and generates a person image based on the input features. 【0545】 Step 4: 【0546】 The server generates image data of a person and sends it back to the terminal, which then visually presents the image to the user. 【0547】 Step 5: 【0548】 The server uses a natural language processing model to set up a conversational agent based on the user's personality information. 【0549】 Step 6: 【0550】 The user enters text or voice input through the device's conversation interface and sends it to the server. 【0551】 Step 7: 【0552】 The server analyzes the input information through a natural language processing model and uses an emotion engine to determine the user's emotional state. 【0553】 Step 8: 【0554】 Based on the emotional data determined by the emotion engine, the server generates a response adapted to the user's emotions and sends it to the terminal. 【0555】 Step 9: 【0556】 The terminal receives a response from the server and presents it to the user either visually or audibly. 【0557】 Step 10: 【0558】 The server records conversation history and sentiment data in a database, and continuously uses it to improve the accuracy of future responses. 【0559】 (Example 2) 【0560】 Next, we will describe Example 2. In the following description, the data processing device 12 will be referred to as the "server," and the headset-type terminal 314 will be referred to as the "terminal." 【0561】 Traditional romance simulation games have only been able to offer users generalized characters and dialogue, making it difficult to achieve interactions that reflect the user's individual emotions and desires. As a result, users were unable to fully immerse themselves, and the simulation experience was limited. There is a need for a system that can solve this problem and enable more individualized and emotionally rich dialogue. 【0562】 The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 2 is realized by the following means. 【0563】 In this invention, the server includes means for acquiring information on characteristics and temperament entered by the user, means for generating a character image with ideal characteristics using digital image generation technology based on the user's input, and means for presenting the character image. This enables the provision of visuals and conversational content based on the user's individual preferences, resulting in an immersive simulation experience. 【0564】 "Characteristics" refer to specific features related to the appearance and behavior of the character that the user considers ideal. 【0565】 "Temperament" refers to specific attributes related to the personality and disposition of the character that the user idealizes. 【0566】 "Digital image generation technology" refers to technology that uses a computer to automatically generate character images based on characteristics input by the user. 【0567】 A "character image" refers to a visual digital image generated based on characteristics specified by the user. 【0568】 "Presentation" refers to the act of displaying the generated character image so that the user can visually confirm it. 【0569】 "Language processing technology" refers to the technology that processes language data to enable natural dialogue with users, performing semantic analysis and response generation. 【0570】 An "emotion engine" refers to a system that analyzes emotions from user input and voice, identifies those emotions, and utilizes that information. 【0571】 This invention realizes a system that enables users to create their ideal romantic partner in a dating simulation game. Users input information about the characteristics and temperament of their ideal partner through the terminal interface. For example, it is possible to specify visual characteristics such as "black hair and large eyes" or personality parameters such as "highly kind and moderately intelligent." 【0572】 User input is transmitted to the server via the terminal. The server utilizes digital image generation technology based on the received information and generates the ideal character image using a generation AI model. Specifically, the generation AI model uses generation technology that can faithfully reproduce a variety of features. 【0573】 The generated character image is sent back to the device, allowing the user to visually confirm the image. Furthermore, the server applies language processing technology based on the temperament specified by the user to set up a conversational agent that enables natural dialogue. This allows the user to enjoy personalized conversations. 【0574】 In addition, the emotion engine analyzes the user's emotions from text and voice input, and the server generates a response based on those emotions. For example, if the user inputs "I'm happy today," the emotion engine can capture that positive emotion and generate a cheerful response. 【0575】 Examples of prompts include, "What are the physical characteristics of your ideal partner?" and "Please tell me how you feel today." 【0576】 With a system configured in this way, users can experience interactions that match their desires and emotions, leading to a deeper sense of immersion and satisfaction in romance simulation games. 【0577】 The flow of the specific processing in Example 2 will be explained using Figure 13. 【0578】 Step 1: 【0579】 The user uses the terminal's interface to input information about the characteristics and temperament of their ideal character. This input data includes physical features (e.g., hair color, eye shape) and personality parameters (e.g., kindness, intelligence). The terminal collects this input data and prepares to send it to the server. 【0580】 Step 2: 【0581】 The terminal sends the collected user input data to the server. The server receives this data and performs analysis. The analysis involves data processing to determine how the input characteristics will be reflected in the digital image. Based on these results, the system is ready to use digital image generation technology. 【0582】 Step 3: 【0583】 The server uses a generative AI model based on the analysis results to generate a character image that reflects the user's characteristics. The generative AI model used here incorporates an algorithm that accurately reproduces features. The generated character image is output as digital data and sent to the terminal. 【0584】 Step 4: 【0585】 The terminal receives character images sent from the server and presents them to the user. The user can visually confirm these images. This process allows the user to concretely evaluate the appearance of their ideal character. 【0586】 Step 5: 【0587】 The server uses language processing techniques to configure conversational agents based on temperament information input from the user. Here, natural language processing algorithms are used to construct conversation scenarios that enable natural dialogue with the user. This allows users to experience smooth communication when interacting with the characters. 【0588】 Step 6: 【0589】 The server uses an emotion engine to analyze the user's text and voice input and identify their emotional state. Emotional data is extracted by analyzing the input words and tone. This analysis makes it possible to generate appropriate responses that match the user's emotions. 【0590】 Step 7: 【0591】 The server generates a response based on the emotion engine's output and sends it to the terminal. The terminal receives this response and displays it to the user. This allows the user to experience a realistic and rich dialogue that resonates with their emotions. 【0592】 (Application Example 2) 【0593】 Next, we will explain application example 2. In the following explanation, the data processing device 12 will be referred to as the "server," and the headset-type terminal 314 will be referred to as the "terminal." 【0594】 There is a challenge in providing a system that allows users to interactively engage in emotionally rich conversations with their ideal companion. In particular, there is a need to generate personalized responses that respond to the user's emotional state and provide a more immersive experience. Existing systems have struggled to accurately understand the user's emotions and provide natural interactions accordingly. 【0595】 The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 2 is realized by the following means. 【0596】 In this invention, the server includes means for acquiring information on characteristics and properties entered by the user, means for generating a target image with ideal characteristics using an image generation device based on the user's input, and means for displaying the target image. This allows the user to enjoy emotionally harmonious dialogue in real time, enabling a more engaging interaction experience. 【0597】 A "user" is a person who operates this system and inputs information about ideal characteristics and traits. 【0598】 "Input" refers to the act of a user providing information about characteristics and properties to the system. 【0599】 "Characteristics" refer to the specific physical and personality traits that the user's ideal partner should possess. 【0600】 "Information" refers to data that a user provides to the system, including details about its characteristics and properties. 【0601】 An "image generation device" is a system component that generates an image embodying specified characteristics based on user input. 【0602】 The "target image" is a visual representation of a person with ideal characteristics, generated by an image generation device. 【0603】 "Displaying" refers to the act of showing the generated target image in a way that allows the user to visually confirm it. 【0604】 "Natural language processing technology" is a language analysis technology that enables natural conversations with users. 【0605】 "Dialogue" refers to two-way communication that takes place between the user and the system. 【0606】 An "emotion analysis device" is a system component that analyzes emotions from user input or voice and selects an appropriate response based on that analysis. 【0607】 A "response" is a reply that a system generates in response to user input or state. 【0608】 "Recording" refers to the act of saving the content of conversations with users for later reference and learning. 【0609】 "Learning" is the process by which a system uses past dialogue data to improve the accuracy and naturalness of its responses. 【0610】 In carrying out this invention, the system consists of a user, a server, and an end device. First, the user inputs information about the characteristics and properties of the ideal companion through an interface on the end device. This input includes visual characteristics and personality traits. 【0611】 The server receives the user's input data and first uses an image generator to produce a target image that embodies the specified characteristics. The generated target image is sent to the end device's display device so that the user can visually confirm it. Open-source generative AI models (e.g., DALL-E or Stable Diffusion) are used for image generation. 【0612】 Furthermore, the server utilizes natural language processing technology to enable natural conversations with the user. This conversation includes analyzing user input data, recording the conversation content, and learning from it. The Python Transformers library is used for natural language processing. 【0613】 Furthermore, the server operates an emotion analysis device to understand the user's emotional state. This device analyzes the user's input and voice and selects an appropriate response based on their emotions. Emotion analysis uses analysis libraries such as TextBlob. This enables the provision of emotionally harmonious responses to the user. 【0614】 For example, if a user enters "I was happy today," the server's emotion analyzer can detect a positive emotion and generate a response such as, "What a great day! Did something good happen?" 【0615】 An example of a prompt is, "Generate a blonde character with a gentle smile, as the ideal companion." This prompt is used to instruct the image generation model on the specific characteristics of the ideal companion. 【0616】 The flow of a specific process in Application Example 2 will be explained using Figure 14. 【0617】 Step 1: 【0618】 The user uses the terminal interface to input information about the characteristics and traits of their ideal companion. This information includes visual characteristics and personality parameters. The entered data is sent to the server. 【0619】 Step 2: 【0620】 The server analyzes the input data received from the user and creates prompt statements for the generating AI model. These prompt statements are designed to specifically express the characteristics desired by the user. This prepares the instructions for the AI model to generate appropriate images. 【0621】 Step 3: 【0622】 The server uses a generative AI model to generate a target image based on the prompt text. The prompt is input to the image generation model (e.g., DALL-E), and the generated image is received. The obtained image is sent to the terminal for the user to visually confirm. 【0623】 Step 4: 【0624】 The terminal displays the target image received from the server on its screen, enabling the user to visually confirm the image. The user can then examine the generated image and verify whether the desired characteristics are reflected. 【0625】 Step 5: 【0626】 The server uses natural language processing technology to set up the initial interaction with the user. Based on the content of the initial conversation, it builds the context of the conversation and prepares for continuous learning. This enables more natural conversation. 【0627】 Step 6: 【0628】 An emotion analysis device analyzes the user's emotions from their input and voice. The server analyzes the input data and identifies the emotional state. Based on this, the system determines the optimal response. 【0629】 Step 7: 【0630】 Based on the sentiment analysis results, the server optimizes the generated response using natural language processing techniques and sends a reply to the user. If the user provides positive input, it can return a similarly positive response. 【0631】 The specific processing unit 290 transmits the result of the specific processing to the headset terminal 314. In the headset terminal 314, the control unit 46A causes the speaker 240 and display 343 to output the result of the specific processing. The microphone 238 acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 238 to the data processing unit 12. In the data processing unit 12, the specific processing unit 290 acquires the audio data. 【0632】 Data generation model 58 is a type of so-called generative AI (Artificial Intelligence). An example of data generation model 58 is ChatGPT (Internet Search<URL: https: / / openai.com / blog / chatgpt> ), Gemini (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions indicated by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization. 【0633】 In the above embodiment, an example was given in which specific processing is performed by the data processing device 12, but the technology of this disclosure is not limited thereto, and specific processing may also be performed by the headset terminal 314. 【0634】 [Fourth Embodiment] 【0635】 Figure 7 shows an example of the configuration of the data processing system 410 according to the fourth embodiment. 【0636】 As shown in Figure 7, the data processing system 410 includes a data processing device 12 and a robot 414. An example of the data processing device 12 is a server. 【0637】 The data processing device 12 comprises a computer 22, a database 24, and a communication interface 26. The computer 22 is an example of a "computer" related to the technology of this disclosure. The computer 22 comprises a processor 28, RAM 30, and storage 32. The processor 28, RAM 30, and storage 32 are connected to a bus 34. The database 24 and the communication interface 26 are also connected to the bus 34. The communication interface 26 is connected to a network 54. An example of the network 54 is a WAN (Wide Area Network) and / or a LAN (Local Area Network). 【0638】 The robot 414 includes a computer 36, a microphone 238, a speaker 240, a camera 42, a communication interface 44, and a controlled object 443. The computer 36 includes a processor 46, RAM 48, and storage 50. The processor 46, RAM 48, and storage 50 are connected to a bus 52. The microphone 238, speaker 240, camera 42, and controlled object 443 are also connected to the bus 52. 【0639】 The microphone 238 receives voice signals from the user 20 and receives instructions from the user 20. The microphone 238 captures the voice signals from the user 20, converts the captured voice into audio data, and outputs it to the processor 46. The speaker 240 outputs audio according to the instructions from the processor 46. 【0640】 Camera 42 is a small digital camera equipped with an optical system including a lens, aperture, and shutter, and an image sensor such as a CMOS (Complementary Metal-Oxide-Semiconductor) image sensor or a CCD (Charge Coupled Device) image sensor, and captures images of the area around the user 20 (for example, an imaging range defined by a field of view equivalent to the width of a typical healthy person's field of vision). 【0641】 Communication interface 44 is connected to network 54. Communication interfaces 44 and 26 are responsible for the exchange of various information between processor 46 and processor 28 via network 54. The exchange of various information between processor 46 and processor 28 using communication interfaces 44 and 26 is performed in a secure manner. 【0642】 The controlled object 443 includes a display device, LEDs in the eyes, and motors that drive the arms, hands, and feet. The posture and gestures of the robot 414 are controlled by controlling the motors of the arms, hands, and feet. Some of the robot 414's emotions can be expressed by controlling these motors. Furthermore, the robot 414's facial expressions can also be expressed by controlling the illumination state of the LEDs in its eyes. 【0643】 Figure 8 shows an example of the main functions of the data processing device 12 and the robot 414. As shown in Figure 8, the data processing device 12 performs specific processing using the processor 28. The storage 32 stores the specific processing program 56. 【0644】 The specific processing program 56 is an example of a "program" relating to the technology of this disclosure. The processor 28 reads the specific processing program 56 from the storage 32 and executes the read specific processing program 56 on the RAM 30. The specific processing is realized by the processor 28 operating as a specific processing unit 290 in accordance with the specific processing program 56 executed on the RAM 30. 【0645】 The storage 32 stores the data generation model 58 and the emotion identification model 59. The data generation model 58 and the emotion identification model 59 are used by the identification processing unit 290. 【0646】 In robot 414, the processor 46 performs the reception output processing. The storage 50 stores the reception output program 60. The processor 46 reads the reception output program 60 from the storage 50 and executes the read reception output program 60 on the RAM 48. The reception output processing is realized by the processor 46 operating as a control unit 46A according to the reception output program 60 executed on the RAM 48. 【0647】 Next, the specific processing performed by the specific processing unit 290 of the data processing device 12 will be described. In the following description, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal". 【0648】 This invention provides a romance simulation system that enriches the user experience, with the user, terminal, and server working together to realize each function. 【0649】 First, the user inputs information about their ideal romantic partner's appearance and personality through the device's interface. The device receives this information and sends it to the server in the specified data format. 【0650】 The server uses an image generation model based on the received data to generate a person image with the features specified by the user. This image generation process employs machine learning algorithms to faithfully reproduce the shape and style desired by the user. The generated image is sent back to the user's device, allowing the user to confirm their visual representation of their romantic partner. 【0651】 Next, the server starts up its natural language processing model and prepares to interact with the user. It customizes the conversation style based on the personality traits entered by the user and sets up an initial conversation scenario. 【0652】 When a user starts a conversation via the chat interface, the server analyzes the user's input as the conversation progresses and instantly generates and returns natural-sounding responses. This gives the user the feeling of interacting with their ideal partner. 【0653】 Furthermore, the server records all conversations and uses machine learning techniques to continuously train the conversation model, improving the accuracy of future conversations. For example, if a user previously said "I like the sea," the server can generate context-related questions in the next conversation, such as "Have you been to the sea recently?" 【0654】 Through this interaction, users can engage with their ideal character and enjoy a personalized romance simulation experience. This invention provides the opportunity to experience ideal interactions in a virtual environment that are difficult to achieve in the real world. 【0655】 The following describes the processing flow. 【0656】 Step 1: 【0657】 The user inputs information about their ideal romantic partner's appearance and personality through the device's interface. 【0658】 Step 2: 【0659】 The terminal receives input from the user and sends it to the server in a formatted data format. 【0660】 Step 3: 【0661】 The server analyzes the received data and uses an image generation model to generate images of people based on features specified by the user. 【0662】 Step 4: 【0663】 The server sends the generated image of a person to the terminal, and the terminal displays it to the user. 【0664】 Step 5: 【0665】 The server activates a natural language processing model and customizes the conversational agent based on the user's personality settings. 【0666】 Step 6: 【0667】 When a user initiates a conversation via their device, the server analyzes the user's input and generates an appropriate response. 【0668】 Step 7: 【0669】 The server records the content of the conversation with the user in a database and provides feedback to the model for continuous learning. 【0670】 Step 8: 【0671】 Based on the conversation data collected by the server, the entire system is improved so that users can enjoy a more natural and personalized experience in subsequent interactions. 【0672】 (Example 1) 【0673】 Next, we will describe Example 1. In the following description, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal". 【0674】 Current interactive systems with virtual characters lack the ability to easily generate characters with the appearance and personality desired by the user and to provide natural dialogue based on them. Furthermore, they suffer from insufficient continuity of dialogue and poor understanding of context, limiting the user experience. Additionally, they are unable to leverage the user's past statements to generate more relevant conversations, making it difficult to continuously improve user satisfaction. 【0675】 The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 1 is realized by the following means. 【0676】 In this invention, the server includes means for acquiring information about features input by the user, means for generating an image with ideal features using a generative model based on the user's input, and means for using a language processing model to enable natural interaction with the user. This enables the generation of a character with the appearance and personality desired by the user, and natural, contextual interaction with that character. 【0677】 A "user" is an individual or end-user who utilizes the system and is the entity that provides input and interacts with the simulation. 【0678】 "Characteristics" refer to information such as specific attributes, personality traits, and appearance that users input, and are elements that influence character creation. 【0679】 A "generative model" is an algorithm or software that creates realistic images or visual outputs based on user input. 【0680】 An "image" is a visual representation created by a generative model, depicting a character with characteristics specified by the user. 【0681】 "Display" refers to the process of visually unfolding and making viewable the generated image on the user's device. 【0682】 "Dialogue" refers to communication between a user and a system, including exchanges via text and voice. 【0683】 A "language processing model" is an algorithm or software that processes text-based input and generates appropriate responses to enable natural communication with users. 【0684】 "Memory" refers to the act of storing past conversations and interactions with users in a database or similar system, for use in future interactions. 【0685】 "Learning" is the process of analyzing accumulated dialogue data and improving algorithms to enhance the system's capabilities. 【0686】 The system of this invention allows the user to input ideal characteristics using a terminal interface, and then uses a generative AI model to create a virtual character based on that input, enabling natural interaction. Specifically, the server and terminal work together to provide the user with a personalized simulation experience. 【0687】 The user first inputs information about the characteristics of the character they want, such as "black hair" and "kind personality," through the input device on the terminal. This information is collected by the terminal and sent to the server via the internet. 【0688】 The server processes the received data and uses a generative AI model to generate images of characters with specified features. Software used here includes Stable Diffusion and similar image generation algorithms. This enables visual simulations tailored to user requirements. 【0689】 The generated image is sent from the server to the terminal and displayed for the user to visually confirm. The user can then use this image to enjoy an interactive virtual experience. 【0690】 Furthermore, the server utilizes language processing models to achieve natural dialogue. The server analyzes user input and instantly generates responses based on an optimized dialogue model. This allows users to enjoy natural conversations with the generated character. Examples of language processing models used here include GPT-3. 【0691】 As a concrete example, an example of a prompt is as follows: "Create a visual and conversational style based on the characteristics of your ideal partner." Based on this, the user can gain a new conversational experience in a virtual environment. 【0692】 This system is designed to provide users with ideal interactions that are difficult to experience in reality, and to realize personalized romance simulations. 【0693】 The flow of the specific processing in Example 1 will be explained using Figure 11. 【0694】 Step 1: 【0695】 The user inputs the characteristics of their ideal character through the terminal's interface. This input includes details such as appearance and personality. The terminal converts this information into JSON data and sends it to the server. In this step, the user's characteristic request is taken as input, and formatted request data is generated as output. 【0696】 Step 2: 【0697】 The server parses the JSON data received from the terminal and uses a generative AI model to generate an image of a character with features specified by the user. Specifically, it uses an algorithm such as Stable Diffusion to synthesize a visual based on the specified attributes. The input is the user's feature data, and the output is the generated character image. 【0698】 Step 3: 【0699】 The generated image is sent from the server to the terminal. The terminal displays the received image on its display device, allowing the user to visually confirm it. In this step, the generated image data is received as input, and presented visually to the user as output. 【0700】 Step 4: 【0701】 The user initiates interaction with the character through the terminal. The server activates a language processing model and generates natural responses based on the user's input. A model such as GPT-3 is used in this process. The user's message is used as input, and the generated response is returned as output. The server analyzes the input in real time to ensure natural and relevant dialogue. 【0702】 Step 5: 【0703】 The server records the conversation content in a database and continuously improves the conversation model through machine learning. This learning process leverages past conversation data to improve the accuracy of subsequent conversations. Past conversation data is used as input, and an improved conversation algorithm is obtained as output. 【0704】 These processing steps allow users to obtain a highly customized virtual experience. 【0705】 (Application Example 1) 【0706】 Next, we will explain Application Example 1. In the following explanation, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal". 【0707】 In the modern era, the means by which users can interact intimately with their ideal characters are limited, and integrating that experience across virtual and physical environments is particularly difficult. There is a need for a system that provides physically actionable dialogue in accordance with user expectations. 【0708】 The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 1 is realized by the following means. 【0709】 In this invention, the server includes means for acquiring information about characteristics entered by the user, means for generating a person image with an ideal appearance using an image generation algorithm based on the user's input, and means for controlling a humanoid automated device to act as the user's ideal partner. This makes it possible for the user to experience a virtual lover in the real world. 【0710】 A "user" is an individual who seeks to interact with their ideal character using the system. 【0711】 "Characteristics" refer to information about the ideal appearance and personality that the user enters. 【0712】 "Means of acquiring information" refers to an interface for collecting requests and information entered by the user. 【0713】 An "image generation algorithm" is a machine learning model that creates images of people with ideal appearances based on user requests. 【0714】 "Means for generating human images" refers to the process of creating digital images using image generation algorithms. 【0715】 A "natural language processing algorithm" is a language analysis and response generation technique that enables natural dialogue between the user and the system. 【0716】 A "humanoid automated device" is a human-shaped device that physically acts based on user instructions. 【0717】 "Means of making it act as an ideal partner" refers to a mechanism that controls a humanoid automated device based on the user's wishes, causing it to perform specific actions or engage in dialogue. 【0718】 To realize this invention, the user first inputs the characteristics of their ideal character through the terminal's interface. These characteristics include details about appearance and personality. This information is collected by the terminal and sent to the server. 【0719】 On the server, an image generation algorithm is activated based on the collected information, and an image of a person with the appearance desired by the user is generated. Image generation software such as Stable Diffusion is used in this process. The generated image is sent to the user's terminal for visual confirmation. 【0720】 Furthermore, the server uses natural language processing algorithms to refine the content of the interaction with the user. Technologies such as OpenAI GPT are utilized here to analyze the flow of the conversation and generate natural responses that correspond to user input. The generated responses are sent to a humanoid robot, which then behaves as the user's ideal character. 【0721】 For example, if a user enters "a kind person who likes the outdoors," the device can respond in a gentle voice, "Have you been hiking recently?" Through such interactions, users can experience a virtual romantic partner in the real world. 【0722】 Examples of prompts used for the generating AI model include sentences like, "Please create a dialogue scenario with a character that has the personality and hobbies desired by the user." 【0723】 The flow of a specific process in Application Example 1 will be explained using Figure 12. 【0724】 Step 1: 【0725】 The user uses an interface via their device to input the characteristics of their ideal character. Specifically, they enter text information about the character's appearance and personality into the input fields. This input is stored on the device as data necessary for subsequent processing. 【0726】 Step 2: 【0727】 The terminal converts the collected user characteristic information into a specified data format and sends it to the server. This process is carried out by a data transmission protocol, and the server analyzes this information for use in subsequent processing. 【0728】 Step 3: 【0729】 The server activates an image generation algorithm based on the characteristic information received from the user. It processes the input characteristics to create an image generation prompt, and then uses an image generation model (e.g., Stable Diffusion) based on that prompt to generate an image of a person with an ideal appearance. The output image is sent back to the terminal for visual confirmation. 【0730】 Step 4: 【0731】 The user reviews the character image received on their device. They check whether the character's appearance matches their preferences, and if dissatisfied, they adjust the characteristics and re-enter the information. 【0732】 Step 5: 【0733】 The server invokes natural language processing algorithms to enable natural interaction with the user. Based on the characteristics input by the user, it generates appropriate conversation setting prompts and creates a dialogue scenario using a natural language model (e.g., OpenAI GPT). This output is used for subsequent processing as response data for the conversation. 【0734】 Step 6: 【0735】 The server transmits the generated dialogue response data to a humanoid automated device. This device initiates actions, including specific voice tones and gestures, to interact with the user. Through this device, the user can experience interaction with a virtual character in the real world. 【0736】 Furthermore, an emotion engine that estimates the user's emotions may be incorporated. That is, the identification processing unit 290 may use the emotion identification model 59 to estimate the user's emotions and perform identification processing using the user's emotions. 【0737】 This invention realizes a system in a romance simulation game that enables emotionally rich dialogue with the user's desired ideal romantic partner. The entire system includes image generation based on user input, natural language processing to enable natural conversation, and an emotion engine to recognize the user's emotions. 【0738】 First, the user uses the interface on their device to input visual characteristics and personality parameters as traits of their ideal romantic partner. This information is sent from the device to a server, where it is analyzed. 【0739】 The server invokes an image generation model based on the input data and generates a human image that faithfully represents the specified features. This image is sent back to the terminal, allowing the user to visually confirm their ideal character. 【0740】 Next, the server applies a natural language processing model to deploy a conversational agent that incorporates the user's personality information. This model is designed to record the content of conversations with the user and optimize response generation for future interactions. 【0741】 Furthermore, an emotion engine operates to analyze emotions from user input and voice. This emotion analysis is performed using an algorithm that identifies emotional states based on the timing of text input, the tone of words used, or, in the case of voice input, the tone and speed. 【0742】 For example, if a user enters "I'm feeling good today," the emotion engine can detect a positive emotion, and the server can generate a correspondingly cheerful response. If the user enters "I'm a little tired today," the emotion engine will detect a negative emotion, and the natural language processing model will react by changing the response to something more encouraging. 【0743】 A system configured in this way can provide users with realistic and emotional interactions, enhancing immersion in dating simulations. Users can experience more engaging and fulfilling interactions by receiving responses that take into account their own emotions and moods. 【0744】 The following describes the processing flow. 【0745】 Step 1: 【0746】 The user uses an interface on their device to input visual characteristics and personality information of their ideal romantic partner. 【0747】 Step 2: 【0748】 The device formats the data collected from the user and sends it to the server. 【0749】 Step 3: 【0750】 The server analyzes the received data, activates an image generation model, and generates a person image based on the input features. 【0751】 Step 4: 【0752】 The server generates image data of a person and sends it back to the terminal, which then visually presents the image to the user. 【0753】 Step 5: 【0754】 The server uses a natural language processing model to set up a conversational agent based on the user's personality information. 【0755】 Step 6: 【0756】 The user enters text or voice input through the device's conversation interface and sends it to the server. 【0757】 Step 7: 【0758】 The server analyzes the input information through a natural language processing model and uses an emotion engine to determine the user's emotional state. 【0759】 Step 8: 【0760】 Based on the emotional data determined by the emotion engine, the server generates a response adapted to the user's emotions and sends it to the terminal. 【0761】 Step 9: 【0762】 The terminal receives a response from the server and presents it to the user either visually or audibly. 【0763】 Step 10: 【0764】 The server records conversation history and sentiment data in a database, and continuously uses it to improve the accuracy of future responses. 【0765】 (Example 2) 【0766】 Next, we will describe Example 2. In the following description, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal". 【0767】 Traditional romance simulation games have only been able to offer users generalized characters and dialogue, making it difficult to achieve interactions that reflect the user's individual emotions and desires. As a result, users were unable to fully immerse themselves, and the simulation experience was limited. There is a need for a system that can solve this problem and enable more individualized and emotionally rich dialogue. 【0768】 The identification process performed by the identification processing unit 290 of the data processing device 12 in Example 2 is realized by the following means. 【0769】 In this invention, the server includes means for acquiring information on characteristics and temperament entered by the user, means for generating a character image with ideal characteristics using digital image generation technology based on the user's input, and means for presenting the character image. This enables the provision of visuals and conversational content based on the user's individual preferences, resulting in an immersive simulation experience. 【0770】 "Characteristics" refer to specific features related to the appearance and behavior of the character that the user considers ideal. 【0771】 "Temperament" refers to specific attributes related to the personality and disposition of the character that the user idealizes. 【0772】 "Digital image generation technology" refers to technology that uses a computer to automatically generate character images based on characteristics input by the user. 【0773】 A "character image" refers to a visual digital image generated based on characteristics specified by the user. 【0774】 "Presentation" refers to the act of displaying the generated character image so that the user can visually confirm it. 【0775】 "Language processing technology" refers to the technology that processes language data to enable natural dialogue with users, performing semantic analysis and response generation. 【0776】 An "emotion engine" refers to a system that analyzes emotions from user input and voice, identifies those emotions, and utilizes that information. 【0777】 This invention realizes a system that enables users to create their ideal romantic partner in a dating simulation game. Users input information about the characteristics and temperament of their ideal partner through the terminal interface. For example, it is possible to specify visual characteristics such as "black hair and large eyes" or personality parameters such as "highly kind and moderately intelligent." 【0778】 User input is transmitted to the server via the terminal. The server utilizes digital image generation technology based on the received information and generates the ideal character image using a generation AI model. Specifically, the generation AI model uses generation technology that can faithfully reproduce a variety of features. 【0779】 The generated character image is sent back to the device, allowing the user to visually confirm the image. Furthermore, the server applies language processing technology based on the temperament specified by the user to set up a conversational agent that enables natural dialogue. This allows the user to enjoy personalized conversations. 【0780】 In addition, the emotion engine analyzes the user's emotions from text and voice input, and the server generates a response based on those emotions. For example, if the user inputs "I'm happy today," the emotion engine can capture that positive emotion and generate a cheerful response. 【0781】 Examples of prompts include, "What are the physical characteristics of your ideal partner?" and "Please tell me how you feel today." 【0782】 With a system configured in this way, users can experience interactions that match their desires and emotions, leading to a deeper sense of immersion and satisfaction in romance simulation games. 【0783】 The flow of the specific processing in Example 2 will be explained using Figure 13. 【0784】 Step 1: 【0785】 The user uses the terminal's interface to input information about the characteristics and temperament of their ideal character. This input data includes physical features (e.g., hair color, eye shape) and personality parameters (e.g., kindness, intelligence). The terminal collects this input data and prepares to send it to the server. 【0786】 Step 2: 【0787】 The terminal sends the collected user input data to the server. The server receives this data and performs analysis. The analysis involves data processing to determine how the input characteristics will be reflected in the digital image. Based on these results, the system is ready to use digital image generation technology. 【0788】 Step 3: 【0789】 The server uses a generative AI model based on the analysis results to generate a character image that reflects the user's characteristics. The generative AI model used here incorporates an algorithm that accurately reproduces features. The generated character image is output as digital data and sent to the terminal. 【0790】 Step 4: 【0791】 The terminal receives character images sent from the server and presents them to the user. The user can visually confirm these images. This process allows the user to concretely evaluate the appearance of their ideal character. 【0792】 Step 5: 【0793】 The server uses language processing techniques to configure conversational agents based on temperament information input from the user. Here, natural language processing algorithms are used to construct conversation scenarios that enable natural dialogue with the user. This allows users to experience smooth communication when interacting with the characters. 【0794】 Step 6: 【0795】 The server uses an emotion engine to analyze the user's text and voice input and identify their emotional state. Emotional data is extracted by analyzing the input words and tone. This analysis makes it possible to generate appropriate responses that match the user's emotions. 【0796】 Step 7: 【0797】 The server generates a response based on the emotion engine's output and sends it to the terminal. The terminal receives this response and displays it to the user. This allows the user to experience a realistic and rich dialogue that resonates with their emotions. 【0798】 (Application Example 2) 【0799】 Next, we will explain application example 2. In the following explanation, the data processing device 12 will be referred to as the "server" and the robot 414 as the "terminal". 【0800】 There is a challenge in providing a system that allows users to interactively engage in emotionally rich conversations with their ideal companion. In particular, there is a need to generate personalized responses that respond to the user's emotional state and provide a more immersive experience. Existing systems have struggled to accurately understand the user's emotions and provide natural interactions accordingly. 【0801】 The specific processing performed by the specific processing unit 290 of the data processing device 12 in Application Example 2 is realized by the following means. 【0802】 In this invention, the server includes means for acquiring information on characteristics and properties entered by the user, means for generating a target image with ideal characteristics using an image generation device based on the user's input, and means for displaying the target image. This allows the user to enjoy emotionally harmonious dialogue in real time, enabling a more engaging interaction experience. 【0803】 A "user" is a person who operates this system and inputs information about ideal characteristics and traits. 【0804】 "Input" refers to the act of a user providing information about characteristics and properties to the system. 【0805】 "Characteristics" refer to the specific physical and personality traits that the user's ideal partner should possess. 【0806】 "Information" refers to data that a user provides to the system, including details about its characteristics and properties. 【0807】 An "image generation device" is a system component that generates an image embodying specified characteristics based on user input. 【0808】 The "target image" is a visual representation of a person with ideal characteristics, generated by an image generation device. 【0809】 "Displaying" refers to the act of showing the generated target image in a way that allows the user to visually confirm it. 【0810】 "Natural language processing technology" is a language analysis technology that enables natural conversations with users. 【0811】 "Dialogue" refers to two-way communication that takes place between the user and the system. 【0812】 An "emotion analysis device" is a system component that analyzes emotions from user input or voice and selects an appropriate response based on that analysis. 【0813】 A "response" is a reply that a system generates in response to user input or state. 【0814】 "Recording" refers to the act of saving the content of conversations with users for later reference and learning. 【0815】 "Learning" is the process by which a system uses past dialogue data to improve the accuracy and naturalness of its responses. 【0816】 In carrying out this invention, the system consists of a user, a server, and an end device. First, the user inputs information about the characteristics and properties of the ideal companion through an interface on the end device. This input includes visual characteristics and personality traits. 【0817】 The server receives the user's input data and first uses an image generator to produce a target image that embodies the specified characteristics. The generated target image is sent to the end device's display device so that the user can visually confirm it. Open-source generative AI models (e.g., DALL-E or Stable Diffusion) are used for image generation. 【0818】 Furthermore, the server utilizes natural language processing technology to enable natural conversations with the user. This conversation includes analyzing user input data, recording the conversation content, and learning from it. The Python Transformers library is used for natural language processing. 【0819】 Furthermore, the server operates an emotion analysis device to understand the user's emotional state. This device analyzes the user's input and voice and selects an appropriate response based on their emotions. Emotion analysis uses analysis libraries such as TextBlob. This enables the provision of emotionally harmonious responses to the user. 【0820】 For example, if a user enters "I was happy today," the server's emotion analyzer can detect a positive emotion and generate a response such as, "What a great day! Did something good happen?" 【0821】 An example of a prompt is, "Generate a blonde character with a gentle smile, as the ideal companion." This prompt is used to instruct the image generation model on the specific characteristics of the ideal companion. 【0822】 The flow of a specific process in Application Example 2 will be explained using Figure 14. 【0823】 Step 1: 【0824】 The user uses the terminal interface to input information about the characteristics and traits of their ideal companion. This information includes visual characteristics and personality parameters. The entered data is sent to the server. 【0825】 Step 2: 【0826】 The server analyzes the input data received from the user and creates prompt statements for the generating AI model. These prompt statements are designed to specifically express the characteristics desired by the user. This prepares the instructions for the AI model to generate appropriate images. 【0827】 Step 3: 【0828】 The server uses a generative AI model to generate a target image based on the prompt text. The prompt is input to the image generation model (e.g., DALL-E), and the generated image is received. The obtained image is sent to the terminal for the user to visually confirm. 【0829】 Step 4: 【0830】 The terminal displays the target image received from the server on its screen, enabling the user to visually confirm the image. The user can then examine the generated image and verify whether the desired characteristics are reflected. 【0831】 Step 5: 【0832】 The server uses natural language processing technology to set up the initial interaction with the user. Based on the content of the initial conversation, it builds the context of the conversation and prepares for continuous learning. This enables more natural conversation. 【0833】 Step 6: 【0834】 An emotion analysis device analyzes the user's emotions from their input and voice. The server analyzes the input data and identifies the emotional state. Based on this, the system determines the optimal response. 【0835】 Step 7: 【0836】 Based on the sentiment analysis results, the server optimizes the generated response using natural language processing techniques and sends a reply to the user. If the user provides positive input, it can return a similarly positive response. 【0837】 The specific processing unit 290 transmits the result of the specific processing to the robot 414. In the robot 414, the control unit 46A causes the speaker 240 and the controlled object 443 to output the result of the specific processing. The microphone 238 acquires audio indicating user input for the result of the specific processing. The control unit 46A transmits the audio data indicating user input acquired by the microphone 238 to the data processing unit 12. In the data processing unit 12, the specific processing unit 290 acquires the audio data. 【0838】 Data generation model 58 is a type of so-called generative AI (Artificial Intelligence). An example of data generation model 58 is ChatGPT (Internet Search<URL: https: / / openai.com / blog / chatgpt> ), Gemini (Internet search) <url: https: gemini.google.com ?hl="ja">Examples of generative AI include the following. The data generation model 58 is obtained by performing deep learning on a neural network. The data generation model 58 is input with prompts containing instructions, and with inference data such as audio data representing speech, text data representing text, and image data representing images. The data generation model 58 infers from the input inference data according to the instructions indicated by the prompts, and outputs the inference results in data formats such as audio data and text data. Here, inference refers to, for example, analysis, classification, prediction, and / or summarization. 【0839】 In the above embodiment, an example was given in which the specific processing is performed by the data processing device 12, but the technology of this disclosure is not limited thereto, and the specific processing may also be performed by the robot 414. 【0840】 Furthermore, the emotion identification model 59, acting as an emotion engine, may determine the user's emotion according to a specific mapping. Specifically, the emotion identification model 59 may determine the user's emotion according to a specific mapping, which is an emotion map (see Figure 9). Similarly, the emotion identification model 59 may also determine the robot's emotion, and the identification processing unit 290 may perform identification processing using the robot's emotion. 【0841】 Figure 9 shows an emotion map 400 in which multiple emotions are mapped. In the emotion map 400, emotions are arranged in concentric circles radiating from the center. The closer to the center of the concentric circles, the more primitive the emotions are located. Further out of the concentric circles, emotions representing states and actions arising from mental states are located. Emotion is a concept that includes feelings and mental states. On the left side of the concentric circles, emotions that are generally generated from reactions occurring in the brain are located. On the right side of the concentric circles, emotions that are generally induced by situational judgment are located. Above and below the concentric circles, emotions that are generally generated from reactions occurring in the brain and induced by situational judgment are located. In addition, the emotion of "pleasure" is located on the upper side of the concentric circles, and the emotion of "displeasure" is located on the lower side. Thus, in the emotion map 400, multiple emotions are mapped based on the structure in which emotions arise, and emotions that are likely to occur simultaneously are mapped close together. 【0842】 These emotions are distributed at the 3 o'clock position on the Emotion Map 400, and usually fluctuate between feelings of security and anxiety. In the right half of the Emotion Map 400, situational awareness takes precedence over internal feelings, resulting in a calm impression. 【0843】 The inside of the Emotion Map 400 represents inner thoughts, while the outside represents actions. Therefore, the further you go from the outside of the Emotion Map 400, the more visible (expressed in actions) your emotions become. 【0844】 Here, human emotions are based on various balances, such as posture and blood sugar levels. When these balances deviate from the ideal, it results in discomfort, and when they approach the ideal, it results in pleasure. Similarly, in robots, cars, motorcycles, etc., emotions can be created based on various balances, such as posture and battery level. When these balances deviate from the ideal, it results in discomfort, and when they approach the ideal, it results in pleasure. The emotion map can be generated, for example, based on Dr. Mitsuyoshi's emotion map (Research on a system for analyzing brain physiological signals of speech emotion recognition and emotion, Tokushima University, doctoral dissertation: https: / / ci.nii.ac.jp / naid / 500000375379). The left half of the emotion map contains emotions belonging to a region called "response," where sensation is dominant. The right half of the emotion map contains emotions belonging to a region called "situation," where situational awareness is dominant. 【0845】 The emotion map defines two emotions that promote learning. One is the emotion around the middle of the negative "repentance" and "reflection" on the situation side. In other words, it is when the robot experiences negative emotions such as "I never want to feel this way again" or "I don't want to be scolded again." The other is the emotion around the positive "desire" on the reaction side. In other words, it is when the robot has positive feelings such as "I want more" or "I want to know more." 【0846】 The emotion identification model 59 inputs user input into a pre-trained neural network, obtains emotion values representing each emotion shown in the emotion map 400, and determines the user's emotion. This neural network is pre-trained based on multiple training data sets, which are combinations of user input and emotion values representing each emotion shown in the emotion map 400. Furthermore, this neural network is trained so that emotions located close together have similar values, as shown in the emotion map 900 in Figure 10. Figure 10 shows an example where multiple emotions such as "reassured," "calm," and "confident" have similar emotion values. 【0847】 The above description primarily focuses on the functions of the data processing device 12 in relation to this disclosure. However, the system related to this disclosure is not necessarily implemented on a server. The system related to this disclosure may be implemented as a general information processing system. This disclosure may be implemented, for example, as a software program that runs on a personal computer or as an application that runs on a smartphone. The method related to this disclosure may be provided to users in SaaS (Software as a Service) format. 【0848】 In the above embodiment, an example was given in which a specific process is performed by a single computer 22. However, the technology of this disclosure is not limited thereto, and a distributed processing of the specific process may be performed by multiple computers, including computer 22. For example, a data generation model 58 may be provided in an external device of the data processing device 12, and the external device may generate data according to the input data. 【0849】 In the above embodiment, an example was given in which the specific processing program 56 is stored in the storage 32, but the technology of this disclosure is not limited thereto. For example, the specific processing program 56 may be stored in a portable, computer-readable, non-temporary storage medium such as a USB (Universal Serial Bus) memory. The specific processing program 56 stored in the non-temporary storage medium is installed in the computer 22 of the data processing device 12. The processor 28 executes specific processing according to the specific processing program 56. 【0850】 Alternatively, the specific processing program 56 may be stored in a storage device such as a server connected to the data processing device 12 via the network 54, and the specific processing program 56 may be downloaded and installed on the computer 22 in response to a request from the data processing device 12. 【0851】 Furthermore, it is not necessary to store the entirety of the specific processing program 56 in a storage device such as a server connected to the data processing device 12 via the network 54, or to store the entirety of the specific processing program 56 in the storage 32; it is acceptable to store only a portion of the specific processing program 56. 【0852】 The following types of processors can be used as hardware resources to perform specific processing. Examples of processors include a CPU, a general-purpose processor that functions as a hardware resource to perform specific processing by executing software, i.e., a program. Other examples of processors include dedicated electrical circuits, such as FPGAs (Field-Programmable Gate Arrays), PLDs (Programmable Logic Devices), or ASICs (Application Specific Integrated Circuits), which have circuit configurations specifically designed to perform specific processing. All of these processors have built-in or connected memory, and all of them perform specific processing by using memory. 【0853】 The hardware resource that performs a specific process may consist of one of these various processors, or it may consist of a combination of two or more processors of the same or different types (for example, a combination of multiple FPGAs, or a combination of a CPU and an FPGA). Alternatively, the hardware resource that performs a specific process may consist of a single processor. 【0854】 Examples of configurations using a single processor include, firstly, a configuration in which one or more CPUs and software are combined to form a single processor, and this processor functions as a hardware resource that performs a specific process. Secondly, there is a configuration using a processor that realizes the functions of the entire system, including multiple hardware resources that perform a specific process, on a single IC chip, as exemplified by SoCs (System-on-a-chip). In this way, a specific process is realized using one or more of the above types of processors as hardware resources. 【0855】 Furthermore, the hardware structure of these various processors can more specifically utilize electrical circuits that combine circuit elements such as semiconductor devices. Also, the specific processing described above is merely an example. Therefore, it goes without saying that unnecessary steps can be deleted, new steps added, or the processing order rearranged, as long as it does not deviate from the main purpose. 【0856】 The descriptions and illustrations presented above are detailed explanations of the technical aspects of this disclosure and are merely examples of the technical aspects. For example, the above descriptions of the structure, function, operation, and effect are examples of the structure, function, operation, and effect of the technical aspects of this disclosure. Therefore, it goes without saying that you may delete unnecessary parts, add new elements, or replace elements in the descriptions and illustrations presented above, as long as you do not deviate from the essence of the technical aspects of this disclosure. Furthermore, in order to avoid confusion and facilitate understanding of the technical aspects of this disclosure, explanations of common technical knowledge and the like that do not require special explanation to enable the implementation of the technical aspects of this disclosure have been omitted from the descriptions and illustrations presented above. 【0857】 All documents, patent applications, and technical standards described herein are incorporated by reference to the same extent as if each individual document, patent application, and technical standard were specifically and individually noted to be incorporated by reference. 【0858】 The following is further disclosed regarding the embodiments described above. 【0859】 (Claim 1) 【0860】 A means of obtaining information about appearance and personality entered by the user, 【0861】 A means for generating a person image with an ideal appearance using an image generation model based on the user's input, 【0862】 The means for displaying the aforementioned person image, 【0863】 Means of using natural language processing models to enable natural conversations with users, 【0864】 A means for memorizing the aforementioned conversation content and continuously learning it, 【0865】 A system that includes this. 【0866】 (Claim 2) 【0867】 The system according to claim 1, characterized in that the image generation model includes an algorithm for generating images based on specific attributes. 【0868】 (Claim 3) 【0869】 The system according to claim 1, characterized in that the natural language processing model includes an algorithm that generates a response while maintaining the context of the conversation based on the user's input history. 【0870】 "Example 1" 【0871】 (Claim 1) 【0872】 A means of obtaining information about the characteristics entered by the user, 【0873】 A means for generating an image with ideal features using a generative model based on the user's input, 【0874】 means for displaying the aforementioned image, 【0875】 Means of using language processing models to enable natural interaction with users, 【0876】 A means for memorizing the content of the aforementioned dialogue and for continuous learning, 【0877】 A system that includes this. 【0878】 (Claim 2) 【0879】 The system according to claim 1, characterized in that the generative model includes an algorithm for generating images based on certain data. 【0880】 (Claim 3) 【0881】 The system according to claim 1, characterized in that the language processing model includes an algorithm that generates a response while maintaining the context of the dialogue based on the user's input history. 【0882】 "Application Example 1" 【0883】 (Claim 1) 【0884】 A means of obtaining information about characteristics entered by the user, 【0885】 A means for generating a person image with an ideal appearance using an image generation algorithm based on the user's input, 【0886】 The means for displaying the aforementioned person image, 【0887】 Means of using natural language processing algorithms to enable natural conversation with users, 【0888】 A means of recording the content of the aforementioned dialogue and allowing for continuous learning, 【0889】 A means of controlling a humanoid automated device to make it act as the user's ideal partner, 【0890】 A system that includes this. 【0891】 (Claim 2) 【0892】 The system according to claim 1, characterized in that the image generation algorithm includes a process for generating an image based on specific features. 【0893】 (Claim 3) 【0894】 The system according to claim 1, characterized in that the natural language processing algorithm includes a process for generating a response while maintaining the flow of the conversation based on the user's input history. 【0895】 "Example 2 of combining an emotion engine" 【0896】 (Claim 1) 【0897】 A means for obtaining information on characteristics and temperament entered by the user, 【0898】 A means for generating a character image with ideal characteristics using digital image generation technology based on the user's input, 【0899】 The means for presenting the aforementioned character image, 【0900】 Means of using language processing techniques to enable natural interaction with users, 【0901】 A means for memorizing the content of the aforementioned dialogue and for continuous learning, 【0902】 An emotion engine means for analyzing emotions from user input and voice, 【0903】 means for generating a response based on the aforementioned emotion analysis information, 【0904】 A system that includes this. 【0905】 (Claim 2) 【0906】 The system according to claim 1, characterized in that the digital image generation technology includes an algorithm for generating images based on specific features. 【0907】 (Claim 3) 【0908】 The system according to claim 1, characterized in that the language processing technology includes an algorithm that generates a response while maintaining the context of the conversation based on the user's input history. 【0909】 "Application example 2 when combining with an emotional engine" 【0910】 (Claim 1) 【0911】 A means for obtaining information on characteristics and properties entered by the user, 【0912】 A means for generating a target image with ideal characteristics using an image generation device based on the user's input, 【0913】 means for displaying the aforementioned target image, 【0914】 Means of using natural language processing techniques to enable natural conversation with users, 【0915】 A means for recording the content of the aforementioned dialogue and allowing it to be continuously learned, 【0916】 Means including an emotion analysis device that analyzes the user's emotions and selects a response accordingly, 【0917】 A system that includes this. 【0918】 (Claim 2) 【0919】 The system according to claim 1, characterized in that the image generation device includes a mechanism for generating images based on specific characteristics. 【0920】 (Claim 3) 【0921】 The system according to claim 1, characterized in that the natural language processing technology includes a mechanism for generating a response while maintaining the context of the dialogue based on user input information. [Explanation of symbols] 【0922】 10, 210, 310, 410 Data Processing Systems 12 Data Processing Devices 14 Smart Devices 214 Smart Glasses 314 Headset-type terminal 414 Robots< / url:> < / url:> < / url:> < / url:>
Claims
[Claim 1] A means of obtaining information about appearance and personality entered by the user, A means for generating a person image with an ideal appearance using an image generation model based on the user's input, The means for displaying the aforementioned person image, Means of using natural language processing models to enable natural conversations with users, A means for memorizing the aforementioned conversation content and continuously learning it, A system that includes this. [Claim 2] The system according to claim 1, characterized in that the image generation model includes an algorithm for generating images based on specific attributes. [Claim 3] The system according to claim 1, characterized in that the natural language processing model includes an algorithm that generates a response while maintaining the context of the conversation based on the user's input history.
Citation Information
Patent Citations
Persona chatbot control method and system
JP2022180282A