Information processing terminal, information processing method, information processing program

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
The information processing system addresses the inconvenience in posting voice data on SNS by allowing voice message exchange and filtered posting between user terminals, enhancing convenience and appropriateness of content sharing with position tracking.

JP7875649B1Active Publication Date: 2026-06-18BSIZE INC

View PDF 5 Cites 0 Cited by

Patent Information

Authority / Receiving Office: JP · JP
Patent Type: Patents
Current Assignee / Owner: BSIZE INC
Filing Date: 2026-05-13
Publication Date: 2026-06-18

Application Information

Patent Timeline

13 May 2026

Application

18 Jun 2026

Publication

JP7875649B1

IPC: H04L51/52

CPC: H04L51/10; H04L51/52; H04L51/212; G10L15/26; G10L17/00; G06F3/167; G06Q10/40; G10L17/02

AI Tagging

Application Domain

Speech recognition Transmission

Explore More Agents

Novelty Search
Search existing technologies and assess novelty
↗
FTO
Analyze whether a product may infringe others' patents
↗
Design FTO
Check prior-design risk for exterior design
↗
Drafting
Draft patent application text based on a technical solution
↗
Find Solutions with TRIZ
Generate feasible solution to solve your technical challenge
↗

Similar Technology Patents

Get free access to AI patent search and analysis

Check patentability, review prior art and ask IP Agent with full patent context.

Smart Images

Figure 0007875649000001_ABST

Patent Text Reader

Abstract

To provide an information processing terminal, information processing method, and information processing program that offer high convenience for posting. and. First, obtain the first designated information that specifies the audio data to be posted from one or more audio data. The acquisition unit and the posting unit which posts the audio data specified by the first designated information acquired by the first acquisition unit. An information processing terminal equipped with [a specific feature].

Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0006] , ,

[0005]

[0001] The present invention relates to an information processing terminal, an information processing method, and an information processing program capable of data posting. It does.

Background Art

[0002] SNS (Social Networking Service) has become widely popular. With the enrichment of content, not only character data but also image data, video data, audio data, etc. can now be posted.

[0003] For example, in Patent Document 1, when posting audio data to SNS, the conversation between users is recorded, and the recorded conversation data is posted on SNS, so that the conversation content between users can be heard by a third user who did not participate in the conversation. An information processing system is described.

Prior Art Documents

Patent Documents

[0004]

Patent Document 1

Summary of the Invention

Problems to be Solved by the Invention

[0005] However, in the conventional information processing system, there is still room for improvement in the convenience of data posting. There is.

[0006] The present invention has been made in view of the above problems, and an object thereof is to provide an information processing terminal, an information processing method, and an information processing program with high convenience for data posting.

Means for Solving the Problems

[0007] To solve the above problems, the information processing terminal according to the present invention posts from one or more audio data A first acquisition unit obtains first designation information that specifies the audio data to be used, and the first acquisition unit has acquired It comprises a posting unit that posts audio data specified in the first designated information. [Effects of the Invention]

[0008] According to the present invention, an information processing terminal, an information processing method, and an information processing program are available that offer high convenience for posting. We can serve lamb. [Brief explanation of the drawing]

[0009] [Figure 1] This figure shows an example of the arrangement of an information processing system according to the embodiment. [Figure 2] This figure shows an example of the configuration of an information processing server according to the embodiment. [Figure 3] This figure shows an example of the configuration of an information processing device server according to the embodiment. [Figure 4] This figure shows an example of the configuration of the first user terminal according to the embodiment. [Figure 5] This figure shows an example of the configuration of the first user terminal according to the embodiment. [Figure 6] This figure shows an example of the configuration of a second user terminal according to the embodiment. [Figure 7] This figure shows an example of the configuration of a second user terminal according to the embodiment. [Figure 8] This figure shows an example of a screen displayed on the display device of the first user terminal according to the embodiment. [Figure 9] This flowchart shows an example of processing performed by the information processing system according to the embodiment. [Figure 10] This flowchart shows an example of processing performed by the information processing system according to the embodiment. [Figure 11]It is a flowchart showing an example of processing by the information processing system according to the embodiment. [Figure 12] It is a flowchart showing an example of processing by the information processing system according to the embodiment. [Figure 13] It is a flowchart showing an example of processing by the information processing system according to the embodiment.

Mode for Carrying Out the Invention

[0010] [Embodiment] Hereinafter, the information processing system 1 according to the embodiment will be described with reference to the drawings. The information processing system 1 according to the embodiment is a so-called monitoring system, and from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user. from the second user terminal 4 that can be carried and used by the person being monitored (for example, a child), information uploaded to the server 2 at regular intervals such as every 1.5 minutes, the position of the second user terminal 4 is determined, and the determined position is notified to the first user terminal 3 carried or used by the monitor (for example, a family member such as a parent or a grandparent) from the server 2. Further, in the information processing system 1 according to the present embodiment, the first user terminal 3 and the second user terminal 4 are provided with microphones and speakers, and are configured to be able to transmit and receive voice messages (hereinafter also referred to as voices) to each other. That is, it is configured to be able to exchange voice messages between the monitor and the person being monitored. Further, the monitor is configured to be able to select an arbitrary message from the conversation with the person being monitored and post it on SNS or the like. In the following description, the monitor is also referred to as the first user. Also, the person being monitored is also referred to as the second user.

[0011] As shown in Figure 1, the information processing system 1 consists of a server 2 and a network 5 connected to the server 2. It comprises one or more first user terminals 3 and second user terminals 4 connected via [a certain method]. The first user terminal 3 transmits voice data to an external SNS server (not shown) via the network 5. Data, text data, image data, video data (Note that in the following explanation, image data and At least one of the video data (also called image data, etc.), location data, time data, etc. It is configured to allow data to be posted. In the example shown in Figure 1, the information processing system 1 is configured to allow data to be posted. The configuration consists of one server 2, one first user terminal 3, and one second user terminal 4. However, the server 2, first user terminal 3, and second user terminal 4 provided by the information processing system 1 The numbers are arbitrary.

[0012] (Server 2) Figures 2 and 3 are configuration diagrams of Server 2. Figure 2 shows the main hardware configuration of Server 2. Server 2 is equipped with a communication IF200A, storage device 200B, CPU200C, etc. Although not shown in Figure 2, Server 2 is an input device (e.g., mouse, keyboard). Display devices (such as touch panels, etc.) and display devices (CRT (Cathode Ray Tube), LCD displays) It may also be equipped with features such as an OLED display.

[0013] The communication IF200A is connected to other devices (for example, the first user terminal 3, the second user terminal 4, etc.) It is an interface for communication with [the other party].

[0014] Storage device 200B is, for example, an HDD (Hard Disk Drive) or a semiconductor storage device (SSD). It is a Solid State Drive. The storage device 200B contains various data and information processing programs. This is stored. Furthermore, some or all of the various data stored in memory device 200B are, USB (Universal Serial Bus) external storage devices such as memory sticks and external HDDs, and network connections The data may be stored in the memory of another information processing device connected via 5. In this case, -ver 2 refers to various data stored in external storage devices or other information processing devices. It will be acquired.

[0015] The storage device 200B contains account information for the first user terminal 3, for example, the first user terminal 3. Identification number, name, contact information (email address, phone number), second user (for example, yourself) The identification number of the second user terminal 4, which is owned by a child (or similar), is stored. 200B contains account information for the second user terminal 4, for example, the identification number of the second user terminal 4. Number, name, first user terminal owned by the first user (for example, a family member such as your parent or grandparent) The last three identification numbers are stored. Also, the storage device 200B contains the first user terminal 3 and Logs, including data sent and received by the second user terminal 4, are stored in association with the account. It is.

[0016] The CPU200C controls the server 2 according to this embodiment, and includes ROM and RAM (not shown). It is equipped with the following features.

[0017] Figure 3 is a functional block diagram of Server 2. As shown in Figure 3, Server 2 is a receiving unit 2 It includes functions such as 01, a transmission unit 202, and a storage device control unit 203. Note that the functions shown in Figure 3 are also included. The CPU 200C executes the information processing program stored in the memory device 200B. This is achieved by doing so.

[0018] The receiving unit 201 receives data transmitted from the first user terminal 3 or the second user terminal 4, for example Then, it receives audio data, etc.

[0019] The transmitting unit 202 receives data from the first user terminal 3, for example, voice data, and transmits it to the second user terminal 3. It is sent to the second user terminal 4. The transmission unit 202 also receives data from the second user terminal 4. For example, audio data is sent to the first user terminal 3.

[0020] The storage device control unit 203 processes the data transmitted and received by the first user terminal 3 and the second user terminal 4. The data is associated with the identification number of the account or user terminal that sent or received the data and stored in the storage device 20. Store in 0B.

[0021] (First user terminal 3) The first user terminal 3 is a terminal owned by the first user, for example, the first user terminal 3 Application software for making a terminal function with each of the functions shown in this embodiment This includes smartphones with the software installed. The first user is the first user terminal. By sending and receiving voice data with the second user terminal 4 registered using 3, the second user ( For example, you can communicate with your child (using voice). Figure 4 shows the first user terminal. The main hardware configuration of the end 3 is shown, consisting of a communication IF300A, a storage device 300B, and an input device 3. 00C, Display device 300D, CPU 300E, Microphone 300F, Speaker 300 It is equipped with G, etc.

[0022] The communication IF300A is an interface for communicating with other devices (in this embodiment, server 2). - It's the face.

[0023] The storage device 300B is, for example, an HDD (Hard Disk Drive) or a semiconductor storage device (SSD). It is a Solid State Drive. The storage device 300B contains the terminal identification number and information processing program. This includes things like RAM (application software), dictionaries containing prohibited words and phrases for posting, etc. It is stored in memory. Also, the storage device 300B contains information between the first user terminal 3 and the second user terminal 4. The transmitted and received data is stored. For example, the storage device 300B contains the data of the first user terminal 3 The audio data sent and received between the second user terminal 4 and the text converted from this audio data. The text data is stored in association with the data. The terminal identification number identifies the first user terminal 3. This is a number used for that purpose. The terminal identification number is assigned to the data transmitted from the first user terminal 3. By doing so, Server 2 can determine which first user terminal 3 the received data was sent from. It can determine whether or not. Note that the terminal identification number is an IP (Internet Protocol) You may also use the ID, MAC (Media Access Control) address, etc., and Server 2 is the It may also be possible to grant it to each user terminal 3.

[0024] The input device 300C is, for example, an input device such as a keyboard, mouse, or touch panel. However, other devices or equipment may be used as long as they can be used for input. Also, voice input devices are also acceptable. That's fine.

[0025] Display device 300D is, for example, a liquid crystal display, a plasma display, an organic EL display. While displays are an option, other devices or equipment (e.g., CRT: Cathode) can display the image if possible. (Ray Tube) is also acceptable.

[0026] The CPU300E controls the first user terminal 3 according to this embodiment, and includes a ROM (not shown) and It is equipped with RAM.

[0027] The Microphone 300F is an audio device that converts sound into electrical signals. (First User Terminal) 3 users can input audio using microphone 300F. The recorded audio is transmitted to server 2 by the transmission unit 302, which will be described later.

[0028] The Speaker 300G is an audio device that converts electrical signals into sound. For example, For example, transmitted from the second user terminal 4 via server 2 and stored in storage device 300B Play the audio data.

[0029] Figure 5 shows a functional block diagram of the first user terminal 3, where the first user terminal 3 is a receiving terminal. Unit 301, transmission unit 302, storage device control unit 303, input reception unit 304 (reception unit), display unit Control unit 305, acquisition unit 306 (1st to 3rd acquisition units), posting unit 307, generation unit 308, transformation It has functions such as a replacement unit 309, a recognition unit 310, a notification unit 311, and a registration unit 312. The function shown in 5 is that the CPU 300E processes the information processing program stored in the storage device 300B. This is achieved by executing `ram`.

[0030] The receiving unit 301 receives data transmitted from, for example, the server 2.

[0031] The transmission unit 302 transmits data in response to input operations received by the input reception unit 304, for example. Send to -ver2.

[0032] The storage device control unit 303 controls the storage device 300B. For example, the storage device control unit 30 3 is the data transmitted and received by the first user terminal 3 and the second user terminal 4. The verified account or user terminal identification number is associated with the stored information in the storage device 300B. Furthermore, the storage device control unit 303, for example, between the first user terminal 3 and the second user terminal 4 The transmitted and received audio data is associated with the text data obtained by converting this audio data into text. Store in memory device 300B.

[0033] The input receiving unit 304 receives input operations from the input device 300C. For example, the input receiving unit The attachment unit 304 accepts the selection of whether or not to post the audio data notified by the notification unit 311. ru.

[0034] The display control unit 305 controls the display device 300D and the data received by the receiving unit 301. These are displayed on the display device 300D.

[0035] The acquisition unit 306 selects the audio to be posted from one or more audio data received by the input reception unit 304. The first specification information that specifies the data is obtained. Here, the acquisition unit 306 specifies character data. The information to be acquired may be obtained in units of two or more sentences or audio data. Unit 306 acquires first specified information based on the character data converted by the conversion unit 309. It may be done in this way. Also, the acquisition unit 306 acquires the image data etc. to be posted (image data and video). Second designation information is obtained that specifies at least one of the data. Here, image data, etc. Image data etc. automatically specified by the generation unit 308 of the first user terminal 3 may also be used, or input reception unit The 304 response may also contain image data or other data specified by the user.

[0036] The posting unit 307 receives the audio data specified in the first designated information acquired by the acquisition unit 306 as the SN. The post is submitted to S, etc. Additionally, the posting unit 307 receives the playback data generated by the generation unit 308 from SNS. It posts to, etc. Also, the posting unit 307 posts the audio data that was specified, along with the text data. The data may also be posted to social media, etc. Note that the text data must be posted as specified. This is data obtained by converting audio data into text. Furthermore, the submission section 307 is the data obtained when a submission is specified. It is acceptable to post information about the speaker of the audio data along with the audio data on social media, etc. Furthermore, the submission section 307 refers to a dictionary containing words and phrases that are prohibited from being submitted and registers them in the dictionary. The posting of audio data containing the specified words to social media and other platforms will be restricted. The posting unit 307 accepts input. Based on the selection of whether the submission received by unit 304 is acceptable or not, the audio is broadcast by the notification unit 311. Post the data to social media, etc. Here, "posting" refers to sharing data with third parties such as social media or websites. The data should be posted to a platform where the device can view or download the posted data. This refers to the process of uploading data and making it viewable or downloadable by third-party devices. Oh, this platform will exchange data etc. with the terminals of an unspecified number of recipients. Not only things, but also the recipients of the unspecified large number of people are limited to a specific large number of people or a single person. It functions as a communication tool between two parties, such as for exchanging data between devices. This includes things that a single recipient can view or download using the communication tool in question. The process of uploading data to make it downloadable also constitutes posting. Furthermore, if the recipient's device is able to download... As for the form of posting for viewing or downloading data, as already mentioned, This is not limited to cases where the user uploads the data itself, but also includes alternatives such as the poster's device or A URL created by a server or other entity that received data for download from the poster's device. This also includes methods that send information for downloading data, such as links, to the recipient's device. On the recipient's device, by selecting the link in question, they can access the link in question. It is possible to download the associated data.

[0037] The generation unit 308 uses the audio data specified by the first designated information acquired by the acquisition unit 306, and the image It generates playback data that is created by combining it with image data, etc. The data may be image data etc. automatically acquired by the generation unit 308 of the first user terminal 3, The user may specify image data, etc. Note that playback data refers to the playback of audio data. The speaker (recognized by the recognition unit 310) and the conversion unit 309 of the audio data being played are converted accordingly. The system may also be configured to present the text data of the audio data.

[0038] The conversion unit 309 converts the audio data acquired by the acquisition unit 306 into text data.

[0039] The recognition unit 310 recognizes the speaker of the voice data. Here, the speaker of the voice data is the voice data You can recognize it from the identification number assigned to the data, or you can analyze the audio data to extract features. Even if recognition is performed by comparing it with the speech features of pre-registered speakers, good.

[0040] The notification unit 311 notifies that it has received audio. This notification is, for example, from the application Push notifications via software (for example, notification sounds or displays on the 300D display device) It is carried out by [unclear]. In addition, the news department 311 has audio data containing words and phrases registered in the dictionary. The status is notified. The content of the notification by the notification unit 311 is displayed on the display device control unit 305. It is displayed on 300D. In addition, the content of the notification from the notification unit 311 is broadcast audibly from speaker 300G. It is also acceptable to announce it verbally.

[0041] The registration unit 312 registers (stores) the words acquired by the acquisition unit 306 in the dictionary. Section 312 removes the words obtained by the acquisition section 306 from the list of words that are prohibited from posting (post The words (that can be written) are registered (stored) in the dictionary. The words registered in the dictionary by the registration unit 312 are The data is stored in the storage device 200B by the storage device control unit 303.

[0042] (Second user terminal 4) The second user terminal 4 is a terminal used by the second user of this information processing system 1. The user sends and receives voice data using the second user terminal 4 with the first user terminal 3 that was registered. This allows you to communicate with the first user (for example, your family) via voice. Figure 6 shows the main hardware configuration of the second user terminal 4, and the second user terminal 4 is Signal IF400A, Storage device 400B, Input device 400C, Display device 400D, CPU 40 It features 0E, microphone 400F, speaker 400G, GPS sensor 400H, etc. .

[0043] The communication IF400A is an interface for communicating with other devices (in this embodiment, Server 2). - It's the face.

[0044] Storage devices 400B include, for example, HDDs (Hard Disk Drives) and semiconductor storage devices (SSDs). It is a Solid State Drive. The storage device 400B contains the terminal identification number and information processing program. RAM, audio data transmitted from the first user terminal 3, etc. are stored. The terminal identification number is This is a number used to identify the second user terminal 4. Data transmitted from the second user terminal 4. By assigning a terminal identification number to the terminal, Server 2 can determine which second user terminal the received data is from. It is possible to determine whether it was sent from 4. Note that the terminal identification number is IP Uses (Internet Protocol) addresses, MAC (Media Access Control) addresses, etc. Alternatively, server 2 may grant it to the second user terminal 4.

[0045] Input device 400C is an input device such as a keyboard, mouse, or touch panel. However, other devices or equipment may be used as long as they can be used for input. Also, voice input devices are also acceptable. It is also permissible. The second user operates the input device 400C to input audio to the first user terminal. It can send data to terminal 3, or play audio data sent from the first user terminal 3. .

[0046] The display device 400D is, for example, an LED. The display device 400D lights up in a predetermined pattern. The device will indicate that it has received audio by illuminating or flashing.

[0047] The CPU400E controls the second user terminal 4 according to this embodiment, and includes a ROM (not shown) and It is equipped with RAM.

[0048] The Microphone 400F is an audio device that converts sound into electrical signals. (Second User Terminal) Users 4 can input audio using the microphone 400F. The recorded audio is transmitted to server 2 by the transmission unit 402, which will be described later.

[0049] The Speaker 400G is an audio device that converts electrical signals into sound. For example, For example, transmitted from the first user terminal 3 via server 2 and stored in storage device 400B Play audio data. Also, speaker 400G generates sound in a predetermined pattern. This notifies that audio has been received.

[0050] The GPS sensor 400H collects time data from the atomic clock onboard the satellite, and the satellite's ephemeris data. The system receives signals from GPS satellites that include data such as orbit, and the transmission time of the received signals is calculated as follows: The current location is determined by calculating the distance from the satellite based on the difference in reception time. Sensor 400H outputs the identified current location.

[0051] Figure 7 shows a functional block diagram of the second user terminal 4, where the second user terminal 4 is a receiving terminal. Unit 401, transmission unit 402, storage device control unit 403, input reception unit 404, display device control unit 4 It has functions such as 05. Note that the functions shown in Figure 7 are performed by the CPU 400E on the storage device 400. This is achieved by executing the information processing program stored in B.

[0052] The receiving unit 401 receives data transmitted from, for example, server 2, such as voice data. ru.

[0053] The transmission unit 402 transmits data, for example, in response to an input operation received by the input reception unit 304. For example, the audio data will be sent to server 2.

[0054] The storage device control unit 403 controls the storage device 400B. For example, the storage device control unit 40 3 controls the storage device 400B to write and read data. For example, 403 stores the data received by the receiving unit 401 in the storage device 400B.

[0055] The input receiving unit 404 receives input operations from the input device 400C. For example, unit 4 accepts a playback command for audio data stored in the memory device 400B.

[0056] The display device control unit 405 controls the display device 400D. For example, when the receiving unit 401 receives audio data, the display device 400D (LED) will turn on a predetermined pattern Light up or flash using a button or similar.

[0057] (display screen) Figure 8 shows an example of screen G1 displayed on the display device 300D of the first user terminal 3. Yes. See Figure 8 below for the screen G displayed on the display device 300D of the first user terminal 3. Let's explain one example. Note that the same configuration as the one described with reference to Figures 1 to 7 is not the same as the configuration described in Figures 1 to 7. The same symbols are used to omit redundant explanations.

[0058] As shown in Figure 8, the display device 300D has a connection between the first user terminal 3 and the second user terminal 4. The text data obtained by converting the audio data sent and received between them is chronologically organized by the audio data file. They are displayed in order (hereinafter also referred to as timeline display).

[0059] In the example shown in Figure 8, the name of the second user 11 (or handle name) is displayed at the top of screen G1. ) is displayed. Also, on the left side of screen G1, the audio data sent from the second user terminal 4 is displayed. The time 12D when the character data 12B converted from the audio file of the second user (Ta) was sent ( (Uses timestamp information) and is displayed along with icon 12A. Also, the play button Selecting n12C displays the corresponding audio data (audio file) for the character data 12B. ) is played and the audio can be heard. Also, on the right side of screen G1, the first user terminal 3 or The text data 13A obtained by converting the transmitted audio data (the first user's audio file) is sent. It is displayed along with the time of transmission (using timestamp information). Also, the first unit Each character data sent from terminal 3 has status 13D (for example, second user terminal Whether or not the audio data was played in step 4 is also indicated. Also, select the play button 13B. Then, the audio data (audio file) corresponding to the displayed text data 13A is played. You can listen to it.

[0060] When the first user posts audio data to social media, etc., they use an input device such as a touch panel 3 By operating 00C and specifying the converted text data of the audio data you want to post, you can post to social media. Specify the audio data to post. Post by operating the input device 300C, such as a touch panel. When you specify the text data to be converted from the audio data you want to convert, the original audio data is posted in section 30. It will be posted to social media etc. via 7. Note that audio data is specified for each audio data file. You can specify it individually, or you can specify multiple audio data files at once. If specified, the first audio data and the last audio data will be in the timeline shown in Figure 8. If specified, the audio data from the first to the last audio data, including any intermediate audio data, will be selected. It may be configured in this way.

[0061] In the example shown in Figure 8, the text data converted from the audio data is the same as the audio data file. The data is displayed in chronological order by rank, but the 12B text data converted from the audio data is not displayed. It may be configured as follows. In this case, for example, instead of character data 12B, audio data While it's also possible to display the playback time, this is not the only example.

[0062] (Information processing) Figures 9 to 13 are flowcharts illustrating an example of information processing in Information Processing System 1. The information processing of Information Processing System 1 will be explained below with reference to Figures 9 to 13. The same reference numerals are used to denote the same components as those described with reference to Figures 1 to 8, and the explanations are redundant. Omit it.

[0063] (Call processing) Figure 9 is a flowchart showing an example of call processing in Information Processing System 1. Refer to Figure 9 to explain an example of call processing in Information Processing System 1. Note that in Figure 9 This section describes the case where voice data is transmitted from the second user terminal 4 to the first user terminal 3.

[0064] (Step S101) The second user inputs voice by operating the input device 400C of the second user terminal 4.

[0065] (Step S102) The input audio is converted into an electrical signal by microphone 400F and then processed as audio data. It is transmitted from the transmission unit 402 to the server 2 as a data entry. The data includes the identification number of the second user terminal 4, a timestamp, and other data. Yes, they are.

[0066] (Step S103) The receiving unit 201 of server 2 receives the voice data transmitted from the second user terminal 4.

[0067] (Step S104) The storage device control unit 203 receives and transmits audio data transmitted by the second user terminal 4. The account or user terminal identification number is associated with the stored information and stored in the storage device 200B.

[0068] (Step S105) The transmitting unit 202 of server 2 refers to the storage device 200B, and the receiving unit 201 receives the audio. Identify the identification number of the first user terminal 3 that is associated with the identification number assigned to the data, The audio data sent from the second user terminal 4 is transmitted to the designated first user terminal 3.

[0069] (Step S106) The receiving unit 301 of the first user terminal 3 receives the voice data transmitted from the server 2.

[0070] (Step S107) The conversion unit 309 of the first user terminal 3 converts the audio data received by the receiving unit 301 into text data. Convert to.

[0071] (Step S108) The storage device control unit 303 of the first user terminal 3 receives audio data from the second user terminal 4. The audio data and the resulting text data are associated and recorded in the storage device 300B. To remember.

[0072] (Step S109) The notification unit 311 of the first user terminal 3 notifies that it has received audio.

[0073] (Call processing) Figure 10 is a flowchart showing an example of call processing in Information Processing System 1. An example of call processing in the information processing system 1 will be explained with reference to Figure 10. Section 0 explains the case where voice data is transmitted from the first user terminal 3 to the second user terminal 4. do.

[0074] (Step S201) The first user inputs voice by operating the input device 300C of the first user terminal 3. The recorded audio is converted into an electrical signal by microphone 400F.

[0075] (Step S202) The conversion unit 309 of the first user terminal 3 converts the input voice data into text data.

[0076] (Step S203) The storage device control unit 303 of the first user terminal 3 processes the input audio data and this audio data The character data obtained by converting "タ" into a character is associated with the data and stored in the memory device 300B.

[0077] (Step S204) The transmission unit 302 of the first user terminal 3 transmits the input voice data to the server 2. Oh, the data transmitted from the first user terminal 3 includes the identification number of the first user terminal 3, time, Data such as stamps is attached.

[0078] (Step S205) The receiving unit 201 of server 2 receives the voice data transmitted from the first user terminal 3.

[0079] (Step S206) The storage device control unit 203 receives and transmits audio data transmitted by the first user terminal 3. The account or user terminal identification number is associated with the stored information and stored in the storage device 200B.

[0080] (Step S207) The transmitting unit 202 of server 2 refers to the storage device 200B, and the receiving unit 201 receives the audio. Identify the identification number of the second user terminal 4 that is associated with the identification number assigned to the data, The audio data sent from the first user terminal 3 is transmitted to the designated second user terminal 4.

[0081] (Step S208) The receiving unit 401 of the second user terminal 4 receives the voice data transmitted from the server 2.

[0082] (Step S209) The LED on the second user terminal 4 indicates that audio has been received by lighting up or by making a sound.

[0083] (Submission process) Figure 11 is a flowchart showing an example of the submission process in Information Processing System 1. Referring to Figure 11, an example of the posting process of Information Processing System 1 will be explained.

[0084] (Step S301) The first user operates the input device 300C of the first user terminal 3 to post to social media, etc. Specify the audio data. This audio data specification can be done for each audio data file. Yes, or you may specify multiple audio data files at once or by selecting them. (First User) The acquisition unit 306 of terminal 3 selects one or more voice data received by the input reception unit 304 to post. Obtain the first specification information that specifies the audio data to be used. In this embodiment, by specifying text data, the corresponding audio data can be transmitted via SNS, etc. It will be posted to, but by directly specifying the audio data (audio data file), you can use SNS etc. The configuration may also include specifying the audio data to be posted.

[0085] (Step S302) The notification unit 311 of the first user terminal 3 refers to the dictionary and the voice data specified in the first designated information. The system determines whether the data contains any prohibited words. The notification unit 311 analyzes the audio data to determine whether any prohibited words are present. The conversion unit 309 may determine whether or not the phrase is included, or it may determine based on the converted character data. You may also determine whether or not a forbidden word is included. If the audio data contains a forbidden word (YES), the first user terminal 3 executes the process in step S303. If the clause is not included (NO), the first user terminal 3 executes the process in step S305. do.

[0086] (Step S303) The notification unit 311 notifies the existence of audio data containing words registered in the dictionary. The content of the notification from 11 is displayed on the display device 300D by the display device control unit 305. Furthermore, the content of the notification by the notification unit 311 may also be broadcast as audio from the speaker 300G. .

[0087] (Step S304) The input receiving unit 304 selects whether or not to allow the posting of the voice data notified by the notification unit 311. Accepted. If the user selects OK to post (YES), the first user terminal 3 will step Execute the process for S305. If the user selects NO to submit the post, the first user will proceed. End 3 terminates the process (the first user starts again from specifying the audio data in step S301). (I will stop posting to Meruka.)

[0088] (Step S305) The recognition unit 310 of the first user terminal 3 recognizes the speaker of the selected voice data. The speaker of the audio data may be identified from the identification number assigned to the audio data, or the audio data The system analyzes the data to extract features and compares them with the features of pre-registered speaker voices. It may also be possible to recognize it by [this method].

[0089] (Step S306) The first user may, if necessary, operate the input device 300C of the first user terminal 3 to access SNS. Specify the image data etc. to be posted to etc. The input receiving unit 304 receives the specified image data etc. If enabled (YES), the acquisition unit 306 of the first user terminal 3 specifies the image data, etc. 2. After obtaining the specified information, the first user terminal 3 executes the process in step S307. As described above, the image data, etc., is automatically generated by the generation unit 308 of the first user terminal 3. You may specify the following. If the input receiving unit 304 has not received a specification of image data, etc. ( NO), the first user terminal 3 executes the process in step S308. Note that the first user is In step S306, without going through the process of specifying image data, etc., audio data You may only post this.

[0090] (Step S307) The generation unit 308 of the first user terminal 3 is specified by the first designated information acquired by the acquisition unit 306. The system generates playback data by combining the audio data with image data, etc., for playback. As described above, the image data etc. is automatically acquired by the generation unit 308 of the first user terminal 3. Data can be used, or image data specified by the user can be used.

[0091] (Step S308) The posting section 307 of the first user terminal 3 will not accept the case where image data etc. is not specified (step S306 NO), the audio data specified in the first designated information acquired by acquisition unit 306 is SN Post to S, etc. Also, if image data etc. is specified in the posting section 307 (step If the input S306 is YES, the playback data generated by the generation unit 308 is posted to social media, etc.

[0092] Furthermore, in the posting process illustrated in Figure 1 above, when posting audio data to social media, etc. It would also be possible to have a configuration that allows users to post both the converted audio data and the resulting text data. When posting data to social media, etc., the first user entered the data using the input device 300C. Unrelated text, sentences, hashtags, URLs, etc., converted from audio data. It would also be good to have a structure that allows you to post both together.

[0093] (Registration process) Figure 12 is a flowchart showing an example of the registration process in Information Processing System 1. Referring to Figure 12, an example of the registration process of the information processing system 1 will be explained.

[0094] (Step S401) The first user operates the input device 300C of the first user terminal 3 to enter the words or phrases that are prohibited from being posted. Input. The acquisition unit 306 of the first user terminal 3 acquires the words received by the input reception unit 304. It's advantageous.

[0095] (Step S402) The registration unit 312 of the first user terminal 3 prohibits posting of the words acquired by the acquisition unit 306. The words are registered (stored) in the dictionary. The words registered in the dictionary by the registration unit 312 are stored in the memory system. The data is stored in memory device 200B by unit 303.

[0096] In the registration process described above with reference to Figure 12, the first user accesses the first user terminal 3. The user is operating the input device 300C to enter a phrase that should be prohibited from posting, but this does not necessarily mean that the first user is being targeted. It is not necessary to input anything. For example, the first user terminal 3 automatically determines which words are prohibited from being posted. It may be registered in the dictionary as a word or phrase that is prohibited from being posted, or as a word or phrase that is prohibited from being posted (first category). It may be configured to recommend to the user. In this case, for example, if the first user has previously registered The system learns the words and phrases that should be prohibited from being posted, and based on these learning results, the first user terminal 3 posts... The system may also be configured to automatically determine which words or phrases are prohibited.

[0097] (Exclusion process) Figure 13 is a flowchart showing an example of the exclusion process in Information Processing System 1. Referring to Figure 13, an example of exclusion processing in information processing system 1 will be explained.

[0098] (Step S501) The first user operates the input device 300C of the first user terminal 3 to enter a phrase or word that is prohibited from being posted. Enter the words to be excluded. The acquisition unit 306 of the first user terminal 3 receives the input reception unit 304. Retrieve the attached word or phrase.

[0099] (Step S502) The registration unit 312 of the first user terminal 3 prohibits posting of the words acquired by the acquisition unit 306. The word(s) to be excluded from the list (words that can be submitted) are registered (stored) in the dictionary. Registration section 312 The words and phrases registered in the dictionary are stored in the memory device 200B by the memory device control unit 303.

[0100] As described above, the first user terminal 3 (information processing terminal) according to the embodiment has one or more voice data An acquisition unit 306 that acquires first designated information that specifies the audio data to be posted from the data, and the acquisition unit 306 The posting unit 307 posts the audio data specified in the first designated information acquired by the acquisition unit 306. It is equipped with the following. This method allows you to specify audio data and post it to social media, making it highly convenient. .

[0101] Furthermore, the first user terminal 3 in the embodiment is specified by the first designated information acquired by the acquisition unit 306. This process generates playback data that combines predefined audio data with image data and other elements for playback. It comprises a generation unit 308 and a posting unit 307, which receives the playback data generated by the generation unit 308. Post a tag. In this way, audio data and image data are combined to generate playback data, and SNS This improves convenience by allowing users to post to various platforms.

[0102] Furthermore, the first user terminal 3 according to the embodiment provides a second designation for specifying the image data, etc. to be posted. The generation unit 308 includes an acquisition unit 306 that acquires information, and the generation unit 308 generates the first designated information acquired by the acquisition unit 306. The audio data specified in the information and the image specified in the second specified information acquired by the acquisition unit 306. It generates playback data that is created by combining it with other data. In this way, it is possible to generate playback data by specifying image data, etc., which is convenient. To improve.

[0103] Furthermore, the first user terminal 3 according to this embodiment converts the voice data acquired by the acquisition unit 306 into text The acquisition unit 306 includes a conversion unit 309 that converts data, and the acquisition unit 306 converts the data converted by the conversion unit 309. The first specified information is obtained based on the character data. This is how audio data is converted into text, and then posted to social media, etc. Since you can specify the type of post, you can understand the content of the post at a glance, improving convenience. do.

[0104] Furthermore, the acquisition unit 306 of the first user terminal 3 according to the embodiment provides information that specifies character data. This information is obtained on a sentence-by-sentence or audio data-by-sentence basis. In this way, audio data to be posted to social media, etc., can be specified either sentence by sentence or by audio data by data. This improves convenience.

[0105] Furthermore, the acquisition unit 306 of the first user terminal 3 according to the embodiment provides information that specifies character data. This is obtained in units of two or more sentences or audio data. In this way, you can specify multiple audio files at once to post to social media, etc. This improves convenience.

[0106] Furthermore, the posting unit 307 of the first user terminal 3 according to the embodiment receives the audio data specified for posting. Along with the text data, post it to social media, etc. This improves convenience by allowing users to post not only audio data but also text data.

[0107] Furthermore, the text data is text data converted from audio data that was specified for posting to social media, etc. That is the case. This allows you to post text data converted from audio data to social media, etc. Convenience will improve.

[0108] Furthermore, the first user terminal 3 according to this embodiment includes a recognition unit 310 that recognizes the speaker of the voice data. The posting unit 307 includes the audio data to which the posting is specified, along with the speaker of the audio data. Submit information. Therefore, it becomes possible to show who is speaking on social media, improving convenience.

[0109] Furthermore, the posting unit 307 of the first user terminal 3 according to this embodiment registers words and phrases that are prohibited from being posted. Referencing the dictionary, the system restricts the posting of audio data containing words or phrases registered in the dictionary. In this way, pre-registered words (for example, school names, place names, names, etc., which identify personal information) can be used to identify individuals. Posting audio data containing (such as) words that could be used to create such content to social media will be prohibited, thus improving convenience. To improve.

[0110] Furthermore, the first user terminal 3 according to this embodiment contains audio data including words and phrases registered in the dictionary. It is equipped with a notification unit 311 that notifies of its presence. In this way, pre-registered words (for example, school names, place names, names, etc., which identify personal information) can be used to identify individuals. It improves convenience by notifying the presence of audio data containing phrases that are likely to be used.

[0111] Furthermore, the first user terminal 3 in this embodiment receives voice data broadcast by the notification unit 311. It includes a reception unit 304 that accepts the option of whether or not to submit a post. The posting unit 307 is located at the reception unit Based on the selection of whether the submission received by 304 is acceptable or not, the audio data was reported by the notification unit 311. Submit a post. Thus, if audio data containing words registered in the dictionary exists, it can be shared on social media, etc. The ability to choose whether or not to post improves convenience.

[0112] Furthermore, the first user terminal 3 according to the embodiment includes an acquisition unit 30 that acquires words or phrases that are prohibited from being posted. The system includes a 6 and a registration unit 312 that registers the words acquired by the acquisition unit 306 into a dictionary. In this way, you can register words that you want to be prohibited from posting on social media, etc., making it very convenient. To rise.

[0113] [Modified examples of embodiments] In addition, in the above embodiment, at least the functions of the first user terminal 3 shown in Figure 5 It is also possible for server 2 to possess a portion of it. For example, the first user terminal 3 shown in Figure 5 possesses Among the functions are the acquisition unit 306 (1st to 3rd acquisition units), the generation unit 308, the conversion unit 309, and the recognition unit. Server 2 shall have some or all of the functions of 310, the notification unit 311, the registration unit 312, etc. This is also acceptable. In this case, for example, on server 2, the steps described with reference to Figure 11 Processes S301 to S307 are executed, and the posting unit 307 of the first user terminal 3 sends a message to the SNS. Audio data and playback data are posted to these sites. Please refer to Figures 12 and 13 for further explanation. The processing in steps S401 to S402 and at least one of steps S501 to S502 This will be executed.

[0114] Furthermore, the above embodiments and modifications are all examples of how the present invention can be implemented. This is merely an example, and the technical scope of the present invention should not be interpreted as being limited by it. It does not deviate from its essence or its main features. It can be implemented in various forms. [Explanation of symbols]

[0115] 1. Information Processing System 2. Server (Information Processing Device) 200A Communication IF 200B storage device 200C CPU 201 Receiver 202 Transmitter 203 Storage Unit 3. First User Terminal (Information Processing Terminal) 300A communication IF 300B storage device 300C Input Device 300D display device 300E CPU 300F Microphone 300G Speaker 301 Receiver 302 Transmitter 303 Storage Unit 304 Input Reception Department (Reception Department) 305 Display device control unit 306 Acquisition Department (1st ~ 3rd Acquisition Department) 307 Submissions 308 Generation part 309 Conversion Unit 310 Recognition part 311 Hochi Department 312 Registration Department 4. Second User Terminal 400A Communication IF 400B storage device 400C Input Device 400D display device (LED) 400E CPU 400F Microphone 400G Speaker 400H GPS Sensor 401 Receiver 402 Transmitter 403 Storage Unit 404 Input Reception Section 405 Display Unit 5 Network

Claims

1. In a monitoring system comprising a first terminal used by the caregiver and a second terminal carried by the person being cared for, the first terminal is defined as the first terminal, wherein the location information of the second terminal is transmitted to the first terminal. A first means for acquiring first voice data sent from the second terminal to the first terminal, A second means for receiving the designation of at least one of the first audio data from the aforementioned caregiver, A program that functions as a third means for generating playback data, which is generated by combining the specified first audio data with image data or video data.

2. In a monitoring system comprising a first terminal used by the caregiver, a second terminal carried by the person being cared for, and an information processing device that transmits location information of the second terminal to the first terminal, the information processing device is described as follows: A first means that receives first voice data transmitted from the second terminal and transmits it to the first terminal, A second means for receiving the designation of at least one of the first audio data from the aforementioned caregiver, A program that functions as a third means for generating playback data, which is generated by combining the specified first audio data with image data or video data.

3. The program according to claim 1 or 2, wherein the second means accepts the designation of at least one first audio data by the caregiver designating a first object representing one first audio data on a screen where a first object representing the first audio data is displayed.

4. The aforementioned screen shows: The first object is displayed at one end in chronological order, The program according to claim 3, wherein a second object indicating second audio data sent from the first terminal to the second terminal is displayed on the other end in chronological order.

5. The program according to claim 3 or 4, wherein the first object includes characters obtained by converting the first audio data.

6. The third means is a program according to any one of claims 1 to 5, which generates the playback data including time data associated with the specified first audio data.

7. The image data or video data is a program according to any one of claims 1 to 6, as specified by the caregiver.

8. The program according to any one of claims 1 to 6, wherein the image data or video data is automatically specified.

9. The program according to any one of claims 1 to 8, wherein the playback data includes information about the person being monitored.

10. The program according to any one of claims 1 to 9, wherein the playback data is data posted to a social networking service (SNS).

11. The program according to any one of claims 1 to 10, wherein the first audio data is a voice message input to the microphone of the second terminal.

12. The program according to any one of claims 1 to 11, wherein the first audio data is not attached to a video.

13. The program according to any one of claims 1 to 12, wherein the person being monitored is a child, and the person monitoring is their guardian.

14. The program according to claim 1, wherein the first terminal functions as a fourth means for acquiring location information of the second terminal at predetermined intervals.

15. The program according to claim 2, wherein the information processing device functions as a fourth means for acquiring location information of the second terminal at predetermined intervals.

16. An information processing method in a monitoring system comprising a first terminal used by a caregiver and a second terminal carried by the person being cared for, wherein the location information of the second terminal is transmitted to the first terminal, The system receives a designation from the caregiver for at least one of the first voice data sent from the second terminal to the first terminal, An information processing method comprising generating playback data that combines the specified first audio data with image data or video data for playback.

17. The information processing method according to claim 16, further comprising posting the aforementioned playback data to a social networking service (SNS).

18. The monitoring system comprises a first terminal used by the caregiver and a second terminal carried by the person being cared for, wherein the location information of the second terminal is transmitted to the first terminal, and the first terminal is a monitoring system in which A first means for acquiring first voice data sent from the second terminal to the first terminal, A second means for receiving the designation of at least one of the first audio data from the aforementioned caregiver, A first terminal comprising a third means for generating playback data that combines the specified first audio data with image data or video data for playback.

19. The first terminal according to claim 18, further comprising a fourth means for posting the aforementioned playback data to social media.

20. The monitoring system comprises a first terminal used by the caregiver, a second terminal carried by the person being cared for, and an information processing device that transmits location information of the second terminal to the first terminal, wherein the information processing device is described above. A first means that receives first voice data transmitted from the second terminal and transmits it to the first terminal, A second means for receiving the designation of at least one of the first audio data from the aforementioned caregiver, An information processing apparatus comprising: a third means for generating playback data which is played back by combining the specified first audio data and image data or video data.

21. A monitoring system comprising the first terminal and the second terminal according to claim 18 or 19.

22. A monitoring system comprising the first terminal, the second terminal, and the information processing device described in claim 20.

23. The monitoring system according to claim 21 or 22, wherein the second terminal is not a smartphone and has a shape that is roughly square when viewed from the front.