Electronic device, method, and non-transitory computer-readable storage medium for determining uplink transmission power
By controlling uplink transmission power within a time window to avoid exceeding a threshold, the device mitigates the health risks from electromagnetic waves during calls, providing a safer user experience.
Patent Information
- Authority / Receiving Office
- WO · WO
- Patent Type
- Applications
- Current Assignee / Owner
- SAMSUNG ELECTRONICS CO LTD
- Filing Date
- 2025-10-27
- Publication Date
- 2026-06-25
AI Technical Summary
Electromagnetic waves generated by uplink transmission in electronic devices during calls can be harmful to human health, and existing systems struggle to effectively manage transmission power to minimize this impact.
The electronic device controls uplink transmission power by ensuring that the power accumulated within a time window does not exceed a threshold, thereby reducing the potential harm from electromagnetic waves.
This approach minimizes the health risks associated with electromagnetic exposure by dynamically managing transmission power, ensuring safe operation during calls.
Smart Images

Figure KR2025017164_25062026_PF_FP_ABST
Abstract
Description
Electronic device, method, and non-transient computer-readable storage medium for determining uplink transmission power
[0001] The following descriptions relate to an electronic device, a method, and a non-transient computer-readable storage medium for determining uplink transmission power.
[0002] The electronic device can perform uplink transmission based on the user's utterance while a call is being performed. Electromagnetic waves generated by uplink transmission may be harmful to the human body. The electronic device (101) can control the transmission power so that the transmission power accumulated within a time window does not exceed a threshold power in order to minimize the impact of electromagnetic waves generated by uplink transmission.
[0003] The information described above may be provided as related art for the purpose of aiding understanding of the present disclosure. No claim or determination is made as to whether any of the foregoing may be applied as prior art related to the present disclosure.
[0004] An electronic device is provided. The electronic device may include at least one speaker. The electronic device may include at least one microphone. The electronic device may include a communication circuit. The electronic device may include a memory that stores instructions and includes one or more storage media. The electronic device may include at least one processor that includes a processing circuit. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause uplink voice data of a first language generated based on a user's utterance to be transmitted while an application for providing interpretation services for a call performed using the communication circuit is executed. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause a transmission time interval for transmitting uplink voice data of a second language based on generating uplink voice data of a second language from the utterance of the first language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to determine a transmission power for transmitting uplink voice data of the second language based on the transmission time interval for transmitting uplink voice data of the second language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to transmit uplink voice data of the second language based on the transmission power.
[0005] A method is provided by an electronic device comprising at least one speaker, at least one microphone, and a communication circuit. The method may include the operation of transmitting uplink voice data of a first language generated based on a user's utterance while an application for providing interpretation services for a call performed using the communication circuit is executed. The method may include the operation of determining a transmission time interval for transmitting uplink voice data of the second language based on generating uplink voice data of the second language from the utterance of the first language. The method may include the operation of determining a transmission power for transmitting uplink voice data of the second language based on the transmission time interval for transmitting uplink voice data of the second language. The method may include the operation of transmitting uplink voice data of the second language based on the transmission power.
[0006] A non-transient computer-readable storage medium is provided. The non-transient computer-readable storage medium may store one or more programs. The one or more programs may include instructions that, when executed individually or collectively by at least one processor of an electronic device, cause the electronic device to transmit uplink voice data of a first language generated based on a user's utterance while an application for providing interpretation services for a call performed using the communication circuit is executed. The one or more programs may include instructions that, when executed individually or collectively by at least one processor of an electronic device, cause the electronic device to determine a transmission time interval for transmitting uplink voice data of a second language based on generating uplink voice data of a second language from the utterance of the first language. The above one or more programs may include instructions that, when executed individually or collectively by at least one processor of an electronic device, cause the electronic device to determine a transmission power for transmitting uplink voice data of the second language based on the transmission time interval for transmitting uplink voice data of the second language. The above one or more programs may include instructions that, when executed individually or collectively by at least one processor of an electronic device, cause the electronic device to transmit uplink voice data of the second language based on the transmission power.
[0007] In relation to the description of the drawings, the same or similar reference numerals may be used for identical or similar components.
[0008] Figure 1 is a block diagram of an electronic device in a network environment.
[0009] Figure 2 illustrates the components of an electronic device.
[0010] Figure 3 illustrates an artificial intelligence system.
[0011] FIG. 4 illustrates a system for providing voice call services.
[0012] FIGS. 5A and FIGS. 5B illustrate graphs representing the transmission power of an electronic device during a voice call service.
[0013] FIG. 6a illustrates a flowchart showing the operations of an electronic device for determining transmission power in a transmission state.
[0014] FIG. 6b is a flowchart showing the operations of an electronic device for determining transmission power in a transmission state.
[0015] FIG. 7 illustrates a flowchart showing the operations of an electronic device for determining transmission power in a receiving state.
[0016] Figure 8 shows a graph representing the transmission power of an electronic device during a voice call service.
[0017] FIG. 9 illustrates a flowchart showing the operations of an electronic device for determining transmission power.
[0018] FIG. 10 illustrates a flowchart showing the operations of an electronic device for determining transmission power.
[0019] FIG. 11 illustrates a flowchart showing the operations of an electronic device for determining transmission power.
[0020] FIG. 12 is a flowchart showing the operations of an electronic device for determining transmission power.
[0021] The terms used in this disclosure are used merely to describe specific embodiments and are not intended to limit the scope of other embodiments. A singular expression may include a plural expression unless the context clearly indicates otherwise. Terms used herein, including technical or scientific terms, may have the same meaning as generally understood by those skilled in the art described in this disclosure. Terms used in this disclosure that are defined in a general dictionary may be interpreted as having the same or similar meaning as they have in the context of the relevant technology, and are not to be interpreted in an ideal or overly formal sense unless explicitly defined in this disclosure. In some cases, even terms defined in this disclosure are not to be interpreted to exclude the embodiments of this disclosure.
[0022] In the various embodiments of the present disclosure described below, a hardware-based approach is described as an example. However, since the various embodiments of the present disclosure include techniques using both hardware and software, the various embodiments of the present disclosure do not exclude a software-based approach.
[0023] Additionally, in this disclosure, expressions of "greater than" or "less than" may be used to determine whether a specific condition is satisfied or fulfilled; however, this is merely for the purpose of expressing an example and does not exclude descriptions of "greater than" or "less than." Conditions described as "greater than" may be replaced with "greater than," conditions described as "less than" may be replaced with "less than," and conditions described as "greater than and less than" may be replaced with "greater than and less than." Furthermore, "A" to "B" below refer to at least one of elements from A (including A) to B (including B). Below, "C" and / or "D" refers to including at least one of "C" or "D," i.e., {"C", "D", "C" and "D"}.
[0024] Figure 1 is a block diagram of an electronic device in a network environment.
[0025] Referring to FIG. 1, in a network environment (100), an electronic device (101) may communicate with an electronic device (102) through a first network (198) (e.g., a short-range wireless communication network) or with at least one of an electronic device (104) or a server (108) through a second network (199) (e.g., a long-range wireless communication network). According to one embodiment, the electronic device (101) may communicate with the electronic device (104) through a server (108). According to one embodiment, the electronic device (101) may include a processor (120), memory (130), input module (150), sound output module (155), display module (160), audio module (170), sensor module (176), interface (177), connection terminal (178), haptic module (179), camera module (180), power management module (188), battery (189), communication module (190), subscriber identification module (196), or antenna module (197). In some embodiments, at least one of these components (e.g., connection terminal (178)) may be omitted from the electronic device (101), or one or more other components may be added. In some embodiments, some of these components (e.g., sensor module (176), camera module (180), or antenna module (197)) may be integrated into a single component (e.g., display module (160)).
[0026] The processor (120) can control at least one other component (e.g., a hardware or software component) of the electronic device (101) connected to the processor (120) by executing software (e.g., a program (140)), for example, and can perform various data processing or operations. According to one embodiment, as at least part of the data processing or operations, the processor (120) can store commands or data received from other components (e.g., a sensor module (176) or a communication module (190)) in volatile memory (132), process the commands or data stored in volatile memory (132), and store the resulting data in non-volatile memory (134). According to one embodiment, the processor (120) may include a main processor (121) (e.g., a central processing unit or an application processor) or an auxiliary processor (123) that can operate independently or together with it (e.g., a graphics processing unit, a neural processing unit (NPU), an image signal processor, a sensor hub processor, or a communication processor). For example, if the electronic device (101) includes a main processor (121) and an auxiliary processor (123), the auxiliary processor (123) may be configured to use lower power than the main processor (121) or to be specialized for a designated function. The auxiliary processor (123) may be implemented separately from the main processor (121) or as part thereof.
[0027] The auxiliary processor (123) may control at least some of the functions or states associated with at least one component of the electronic device (101) (e.g., display module (160), sensor module (176), or communication module (190)) on behalf of the main processor (121) while the main processor (121) is in an inactive (e.g., sleep) state, or together with the main processor (121) while the main processor (121) is in an active (e.g., application execution) state. According to one embodiment, the auxiliary processor (123) (e.g., image signal processor or communication processor) may be implemented as part of another functionally related component (e.g., camera module (180) or communication module (190)). According to one embodiment, the auxiliary processor (123) (e.g., neural network processing unit) may include a hardware structure specialized for processing an artificial intelligence model. The artificial intelligence model may be generated through machine learning. Such learning may be performed, for example, on the electronic device (101) itself where the artificial intelligence model is executed, or through a separate server (e.g., server (108)). The learning algorithm may include, for example, supervised learning, unsupervised learning, semi-supervised learning, or reinforcement learning, but is not limited to the examples described above. The artificial intelligence model may include a plurality of artificial neural network layers.An artificial neural network may be a deep neural network (DNN), a convolutional neural network (CNN), a recurrent neural network (RNN), a restricted Boltzmann machine (RBM), a deep belief network (DBN), a bidirectional recurrent deep neural network (BRDNN), a deep Q-network, or a combination of two or more of the above, but is not limited to the examples described above. In addition to the hardware structure, the artificial intelligence model may include a software structure, either additionally or substantially.
[0028] The memory (130) can store various data used by at least one component of the electronic device (101) (e.g., processor (120) or sensor module (176)). The data may include, for example, software (e.g., program (140)) and input or output data for related commands. The memory (130) may include volatile memory (132) or non-volatile memory (134).
[0029] The program (140) may be stored as software in memory (130) and may include, for example, an operating system (142), middleware (144), or an application (146).
[0030] The input module (150) can receive commands or data to be used for a component of the electronic device (101) (e.g., processor (120)) from outside the electronic device (101) (e.g., user). The input module (150) may include, for example, a microphone, a mouse, a keyboard, a key (e.g., a button), or a digital pen (e.g., a stylus pen).
[0031] The sound output module (155) can output a sound signal to the outside of the electronic device (101). The sound output module (155) may include, for example, a speaker or a receiver. The speaker may be used for general purposes, such as multimedia playback or recording playback. The receiver may be used to receive incoming calls. According to one embodiment, the receiver may be implemented separately from the speaker or as part thereof.
[0032] The display module (160) can visually provide information to an external (e.g., user) of the electronic device (101). The display module (160) may include, for example, a display, a holographic device, or a projector and a control circuit for controlling said device. According to one embodiment, the display module (160) may include a touch sensor configured to detect a touch, or a pressure sensor configured to measure the intensity of the force generated by said touch.
[0033] The audio module (170) can convert sound into an electrical signal or, conversely, convert an electrical signal into sound. According to one embodiment, the audio module (170) can acquire sound through the input module (150) or output sound through the sound output module (155) or an external electronic device (e.g., electronic device (102)) (e.g., speaker or headphones) connected directly or wirelessly to the electronic device (101).
[0034] The sensor module (176) can detect the operating state of the electronic device (101) (e.g., power or temperature) or the external environmental state (e.g., user state) and generate an electrical signal or data value corresponding to the detected state. According to one embodiment, the sensor module (176) may include, for example, a gesture sensor, a gyroscope sensor, a barometric pressure sensor, a magnetic sensor, an accelerometer sensor, a grip sensor, a proximity sensor, a color sensor, an IR (infrared) sensor, a biosensor, a temperature sensor, a humidity sensor, or an illuminance sensor.
[0035] The interface (177) may support one or more specified protocols that can be used for the electronic device (101) to be connected directly or wirelessly to an external electronic device (e.g., electronic device (102)). According to one embodiment, the interface (177) may include, for example, a high definition multimedia interface (HDMI), a universal serial bus (USB) interface, an SD card interface, or an audio interface.
[0036] The connection terminal (178) may include a connector through which the electronic device (101) can be physically connected to an external electronic device (e.g., electronic device (102)). According to one embodiment, the connection terminal (178) may include, for example, an HDMI connector, a USB connector, an SD card connector, or an audio connector (e.g., a headphone connector).
[0037] The haptic module (179) can convert an electrical signal into a mechanical stimulus (e.g., vibration or movement) or an electrical stimulus that can be perceived by the user through tactile or kinesthetic senses. According to one embodiment, the haptic module (179) may include, for example, a motor, a piezoelectric element, or an electric stimulation device.
[0038] The camera module (180) can capture still images and video. According to one embodiment, the camera module (180) may include one or more lenses, image sensors, image signal processors, or flashes.
[0039] The power management module (188) can manage power supplied to the electronic device (101). According to one embodiment, the power management module (188) can be implemented, for example, as at least part of a power management integrated circuit (PMIC).
[0040] The battery (189) can supply power to at least one component of the electronic device (101). According to one embodiment, the battery (189) may include, for example, a non-rechargeable primary battery, a rechargeable secondary battery, or a fuel cell.
[0041] The communication module (190) can support the establishment of a direct (e.g., wired) communication channel or a wireless communication channel between an electronic device (101) and an external electronic device (e.g., electronic device (102), electronic device (104), or server (108)), and the performance of communication through the established communication channel. The communication module (190) may include one or more communication processors that operate independently of the processor (120) (e.g., application processor) and support direct (e.g., wired) communication or wireless communication. According to one embodiment, the communication module (190) may include a wireless communication module (192) (e.g., cellular communication module, short-range wireless communication module, or GNSS (global navigation satellite system) communication module) or a wired communication module (194) (e.g., LAN (local area network) communication module, or power line communication module). The corresponding communication module among these communication modules can communicate with an external electronic device (104) through a first network (198) (e.g., a short-range communication network such as Bluetooth, WiFi (wireless fidelity) direct, or IrDA (infrared data association)) or a second network (199) (e.g., a legacy cellular network, a 5G network, a next-generation communication network, the Internet, or a computer network (e.g., a LAN or WAN). These various types of communication modules may be integrated into a single component (e.g., a single chip) or implemented as multiple separate components (e.g., multiple chips). The wireless communication module (192) can identify or authenticate the electronic device (101) within a communication network such as the first network (198) or the second network (199) using subscriber information (e.g., International Mobile Subscriber Identifier (IMSI)) stored in the subscriber identification module (196).
[0042] The wireless communication module (192) can support 5G networks and next-generation communication technologies following 4G networks, for example, new radio access technology. NR access technology can support high-speed transmission of high-capacity data (enhanced mobile broadband (eMBB)), minimization of terminal power and connection of multiple terminals (massive machine type communications (mMTC)), or high reliability and low latency (ultra-reliable and low-latency communications (URLLC)). The wireless communication module (192) can support a high-frequency band (e.g., mmWave band) to achieve a high data transmission rate, for example. The wireless communication module (192) can support various technologies for securing performance in the high-frequency band, such as beamforming, massive MIMO (multiple-input and multiple-output), full-dimensional MIMO (FD-MIMO), array antenna, analog beam-forming, or large-scale antenna. The wireless communication module (192) can support various requirements specified in the electronic device (101), external electronic device (e.g., electronic device (104)), or network system (e.g., second network (199)). According to one embodiment, the wireless communication module (192) may support a Peak data rate (e.g., 20 Gbps or more) for eMBB realization, loss coverage (e.g., 164 dB or less) for mMTC realization, or U-plane latency (e.g., downlink (DL) and uplink (UL) each 0.5 ms or less, or round trip 1 ms or less) for URLLC realization.
[0043] An antenna module (197) can transmit a signal or power to or from an external source (e.g., an external electronic device). According to one embodiment, the antenna module (197) may include an antenna comprising a radiator made of a conductor or a conductive pattern formed on a substrate (e.g., a PCB). According to one embodiment, the antenna module (197) may include a plurality of antennas (e.g., an array antenna). In this case, at least one antenna suitable for a communication method used in a communication network, such as a first network (198) or a second network (199), may be selected from the plurality of antennas, for example, by a communication module (190). A signal or power may be transmitted or received between the communication module (190) and an external electronic device through the selected at least one antenna. According to some embodiments, in addition to the radiator, other components (e.g., a radio frequency integrated circuit (RFIC)) may be additionally formed as part of the antenna module (197).
[0044] According to various embodiments, the antenna module (197) may form a mmWave antenna module. According to one embodiment, the mmWave antenna module may include a printed circuit board, an RFIC disposed on or adjacent to a first surface (e.g., bottom surface) of the printed circuit board and capable of supporting a specified high frequency band (e.g., mmWave band), and a plurality of antennas (e.g., array antennas) disposed on or adjacent to a second surface (e.g., top surface or side surface) of the printed circuit board and capable of transmitting or receiving a signal of the specified high frequency band.
[0045] At least some of the above components can be connected to each other via a communication method between peripheral devices (e.g., bus, GPIO (general purpose input and output), SPI (serial peripheral interface), or MIPI (mobile industry processor interface)) and exchange signals (e.g., commands or data) with each other.
[0046] According to one embodiment, commands or data may be transmitted or received between the electronic device (101) and an external electronic device (104) through a server (108) connected to a second network (199). Each of the external electronic devices (102, or 104) may be the same or a different type of device as the electronic device (101). According to one embodiment, all or part of the operations performed on the electronic device (101) may be performed on one or more of the external electronic devices (102, 104, or 108). For example, if the electronic device (101) needs to perform a function or service automatically or in response to a request from a user or another device, the electronic device (101) may request one or more external electronic devices to perform at least part of the function or service instead of performing the function or service itself or additionally. One or more external electronic devices that receive the above request may execute at least part of the requested function or service, or additional function or service related to the request, and transmit the result of the execution to the electronic device (101). The electronic device (101) may provide the result as is or additionally processed as at least part of the response to the request. For this purpose, for example, cloud computing, distributed computing, mobile edge computing (MEC), or client-server computing technology may be used. The electronic device (101) may provide ultra-low latency services using, for example, distributed computing or mobile edge computing. In another embodiment, the external electronic device (104) may include an Internet of Things (IoT) device. The server (108) may be an intelligent server using machine learning and / or neural networks. According to one embodiment, the external electronic device (104) or the server (108) may be included within the second network (199).The electronic device (101) can be applied to intelligent services (e.g., smart home, smart city, smart car, or healthcare) based on 5G communication technology and IoT-related technology.
[0047] Figure 2 illustrates the components of an electronic device.
[0048] In FIG. 2, the electronic device (101) may include a processor (201), memory (202), communication circuit (203), display (204), microphone (205), speaker (206), and artificial intelligence module (207). For example, the processor (201), memory (202), communication circuit (203), display (204), microphone (205), speaker (206), and artificial intelligence module (207) may be electrically and / or operably coupled with each other by a communication bus. The hardware components being operationally connected may mean that a direct or indirect connection between the hardware components is established via wired or wireless means so that a second hardware component (e.g., memory (202), communication circuit (203), display (204), microphone (205), speaker (206), and / or artificial intelligence module (207)) is controlled by a first hardware component (e.g., processor (201)) among the hardware components. The artificial intelligence module (207) illustrated in FIG. 2 is illustrated as a hardware component, but the present disclosure is not limited thereto. For example, the artificial intelligence module (207) may correspond to a software component. The hardware components illustrated in FIG. 2 are illustrated based on different blocks, but the present disclosure is not limited thereto. For example, at least a portion of the hardware components illustrated in FIG. 2 (e.g., processor (201), memory (202), communication circuit (203), display (204), microphone (205), speaker (206), and / or artificial intelligence module (207)) may be included in a single integrated circuit, such as a system on chip (SoC) or a system in package (SIP). The type and number of hardware components included in the electronic device (101) are not limited to those illustrated in FIG. 2.For example, the electronic device (101) may include only some of the hardware components shown in FIG. 2.
[0049] In one embodiment, the electronic device (101) may include a processor (201). The processor (201) may include a hardware component for processing data based on one or more instructions. The hardware component for processing data may include, for example, an arithmetic and logic unit (ALU), a floating point unit (FPU), and a field programmable gate array (FPGA). As an example, the hardware component for processing data may include a central processing unit (CPU), a graphics processing unit (GPU), a digital signal processing unit (DSP), a microcontroller (MCU), and / or a neural processing unit (NPU). The number of processors (201) may be one or more. For example, the processor (201) may have the structure of a multi-core processor, such as a dual core, a quad core, or a hexa core. The processor (201) of FIG. 2 can have the same content as the processor (120) of FIG. 1 applied substantially.
[0050] In one embodiment, the processor (201) may include various processing circuits and / or a plurality of processors. For example, the term “processor” as used herein, including in the claims, may include various processing circuits including at least one processor, and one or more of the at least one processor may be configured to perform the various functions described below in a distributed manner, individually and / or collectively. As used below, where “processor,” “at least one processor,” and “one or more processors” are described as being configured to perform various functions, these terms encompass, for example, but not limited to, situations where one processor performs some of the cited functions and other processor(s) perform other parts of the cited functions, and also situations where one processor can perform all of the cited functions. Additionally, the at least one processor may include a combination of processors that perform the enumerated / disclosed various functions, for example, in a distributed manner. The at least one processor may execute program instructions to achieve or perform the various functions.
[0051] In one embodiment, the electronic device (101) may include a memory (202). The memory (202) may include a hardware component for storing data and / or instructions that are input to or output from the processor (201). For example, the memory (202) may include a volatile memory such as random-access memory (RAM) and / or a non-volatile memory such as read-only memory (ROM). The volatile memory may include, for example, at least one of dynamic RAM (DRAM), static RAM (SRAM), cache RAM, and pseudo SRAM (PSRAM). The non-volatile memory may include, for example, programmable ROM (PROM), erasable PROM (EPROM), electrically erasable PROM (EEPROM), flash memory, a hard disk, a compact disk, and an embedded multimedia card (eMMC).
[0052] In one embodiment, one or more instructions (or commands) representing operations and / or operations performed by the processor (201) of the electronic device (101) may be stored within the memory (202) of the electronic device (101). A set of one or more instructions may be referred to as a program, firmware, operating system, process, routine, sub-routine, and / or application. Hereinafter, the statement that an application is installed within the electronic device (101) may mean that one or more instructions provided in the form of an application are stored within the memory (202), and that one or more applications are stored in an executable format by the processor (201) of the electronic device (101). The specific details regarding the memory (202) of FIG. 2 may be substantially the same as the details regarding the memory (130) of FIG. 1.
[0053] In one embodiment, the electronic device (101) may include a communication circuit (203). The communication circuit (203) may include a circuit for supporting the transmission and / or reception of electrical signals between the electronic device (101) and an external device (e.g., a base station, another electronic device). The communication circuit (203) may include at least one of a modem, an antenna, and an O / E (optic / electronic) converter. The communication circuit (203) may support the transmission and / or reception of electrical signals based on various types of communication means such as Ethernet, Bluetooth, BLE (Bluetooth Low Energy), ZigBee, LTE (Long Term Evolution), and 5G NR (New Radio). The specific details regarding the communication circuit (203) of FIG. 2 may be substantially the same as those regarding the communication module (109) and / or antenna module (197) of FIG. 1.
[0054] In one embodiment, the electronic device (101) may include a display (204). The display (204) may include a display panel, a touch sensor, and / or a processing circuit. In one embodiment, the display panel may be used to display visual information (e.g., an image, affordance, screen, object, UI (user interface), GUI (graphic user interface), and / or visual object). For example, the display panel may have a display area capable of receiving touch input. In one embodiment, the touch sensor may be used to obtain data about an external object located on the display panel. For example, the touch sensor may be located within or on the display panel to provide an area of the display panel capable of receiving touch input. For example, the touch sensor may be configured to obtain data about contact points on at least some part of said area. In one embodiment, the processing circuit may control the touch sensor. For example, the processing circuit may process signals or data obtained (or received) through the touch sensor. The specific details regarding the display (204) of FIG. 2 can be substantially applied in the same way as the details regarding the display module (160) of FIG. 1.
[0055] In one embodiment, the electronic device (101) may include a microphone (205). The microphone (205) may be configured to acquire sound (e.g., voice input, utterance) acquired from outside the electronic device (101). The microphone (205) of FIG. 2 may correspond to the input module (150) and / or audio module (170) of FIG. 1.
[0056] In one embodiment, the electronic device (101) may include a speaker (206). The speaker (206) may be configured to output a voice output, a voice signal, and / or sound based on voice data. The speaker (260) of FIG. 2 may correspond to the sound output module (155) of FIG. 1.
[0057] In one embodiment, the electronic device (101) may include an artificial intelligence module (207). The artificial intelligence module (207) may be a unit (function code, separate device, circuit, or set of instructions) for performing functions. For example, the artificial intelligence module (207) may be a unit (function code, separate device, circuit, or set of instructions) for automatic speech recognition (ASR), machine translation (MT), large language model (LLM), and / or large multimodal model (LMM). Hereinafter, the artificial intelligence module (207) may be referred to as an artificial intelligence model or other terms having an equivalent technical and functional meaning. The specific details regarding the artificial intelligence module (207) of FIG. 2 may be substantially identical to the details regarding the artificial intelligence system (300) of FIG. 3.
[0058] Figure 3 illustrates an artificial intelligence system.
[0059] Referring to FIG. 3, an artificial intelligence (AI) system (300) may include an input / output interface (310), an AI framework (320), a generative AI model (330), and / or a knowledge repository (390). The artificial intelligence system of FIG. 3 may be referred to as an artificial intelligence model, an artificial intelligence module, or other terms having an equivalent technical or functional meaning.
[0060] In one embodiment, the input / output interface (310) may receive input. The input may include user input and / or data obtained or generated by the electronic device (101). The data may include images, video, audio, and / or text generated by the processor (201) of the electronic device (101). User input may include natural language, touch data obtained through a touch circuit included in the display (204), and images and / or videos displayed (and / or to be displayed) through the display (204). In examples that are not limited, user input may be received by the input / output interface (310) along with context information. The context information may relate to the state at the time the user input is received (e.g., the state of the electronic device (101) and / or the state surrounding the electronic device (101) (e.g., user state). For example, the context information may include information about one or more software applications executed within the electronic device (101) at the time the user input is received. For example, the context information may include information about the location of the electronic device (101) when user input is received. For example, the user input may be integrated with the context information. For example, user input with context information integrated as input may be received by the input / output interface (310).
[0061] In one embodiment, the input / output interface (310) may transmit (or provide) an output. The output may include a result (or result information) generated or obtained by the AI system (300) based on the input. The format of the output may vary. For example, the output may include natural language. For example, the output may include content (e.g., media content and / or multimedia content). For example, the output may include an action related to the user of the electronic device (101). For example, the output may have a format according to the user settings of the electronic device (101). For example, the input / output interface (310) may be described as a user question / response interface (310).
[0062] In one embodiment, the AI framework (320) may be used to obtain information (or data) about an input from an input / output interface (310) and to control one or more components related to the AI system (300) using the obtained information.
[0063] For example, a prompt design component (321) within an AI framework (320) can generate or acquire prompts for a generative AI model (330) (e.g., including automatic speech recognition (ASR), large language model (LLM), machine translation (MT), large vision model (LVM), and / or large multimodal model (LMM)) using the acquired information. For example, the prompt design component (321) may be described as an AI component that uses a learning algorithm and / or a neural network to provide prompts that are enhanced over time. For example, the prompt design component (321) can generate or acquire prompts by accessing a knowledge component (e.g., knowledge repository (390)) containing user preference data, a prompt library, and / or prompt examples using the acquired information. The generated prompts may be provided to the generative AI model (330).
[0064] For example, an API / plugin management component (322) within the AI framework (320) may be used to support communication for additional information requested (or induced) in relation to the prompt provided (or to be provided) to the generative AI model (330). For example, the API / plugin management component (322) may be used to create or establish a channel for communication with various data sources (e.g., knowledge repository (390)). For example, the API / plugin management component (322) may support access to at least some of the data sources. For example, the API / plugin management component (322) may be used to request other components (e.g., application / service component (380)) that perform feedback (or response) according to the prompt. As an example without limitation, information obtained (or generated) through the API / plugin management component (322) may be provided to the prompt design component (321) for the creation of the prompt. As an example that is not limited, information obtained (or generated) through the API / plugin management component (322) can be provided to the generative AI model (330).
[0065] For example, an improvement component (323) within the AI framework (320) can at least partially tune (or adjust) (or change) the result (e.g., content) obtained (or output) from the generative AI model (330). For example, the improvement component (323) can determine or verify whether the content obtained from the generative AI model (330) is related to the input. For example, the improvement component (323) can determine or verify whether the content obtained from the generative AI model (330) contains biased content. For example, the improvement component (323) can determine or verify whether the content obtained from the generative AI model (330) contains harmful content. For example, the improvement component (323) can support or assist in performing additional processing to improve the content obtained from the generative AI model (330). For example, the improvement component (323) may support providing a hint to the user to improve the content.
[0066] In one embodiment, the generative AI model (330) may be described as an artificial intelligence neural network that generates feedback in response to a prompt. For example, the feedback may include additional data and / or information relative to the prompt, but relative to the prompt. For example, the feedback may include new content relative to the prompt. For example, the generative AI model (330) may include a model that generates images and / or a model that generates language. For example, the model that generates images may include a generative adversarial network (GAN) and / or a variational autoencoder (VAE). For example, the model that generates images may include a diffusion-based generative model (e.g., a transformer VAE). For example, the generative AI model (330) may include an LMM that generates the feedback by recognizing text, images, and / or speech.
[0067] As an example without limitation, an AI framework (320) and / or a generative AI model (330) may be included within an artificial intelligence module (207) (e.g., including a processing circuit) within the electronic device (101). For example, the artificial intelligence module (207) may be operatively coupled with a processor (201) of the electronic device (101). For example, the artificial intelligence module (207) may be operatively coupled with a display driving circuit of the electronic device (101). For example, the artificial intelligence module (207) may be operatively coupled with a sensor hub of the electronic device (101) for one or more sensors within the electronic device (101).
[0068] Some of the operations described below may be executed (or performed) through the artificial intelligence system described with reference to FIG. 3.
[0069] FIG. 4 illustrates a system for providing voice call services.
[0070] Referring to FIG. 4, a system for providing voice call services may include an electronic device (101), a base station (410), a network (420), a base station (430), and / or an electronic device (440). The electronic device (101) may be owned by a user (401), and the electronic device (101) may be owned by a user (402). FIG. 4 describes a system for providing voice call services between the electronic device (101) of the user (401) and the electronic device (440) of the user (402). Referring to FIG. 4, the electronic device (101) may be connected to a base station (410), and the electronic device (440) may be connected to a base station (430). FIG. 4 illustrates a case where the electronic device (101) and the electronic device (440) are connected through two base stations (e.g., base station (410) and base station (430)), but the present disclosure is not limited thereto. For example, the electronic device (101) and the electronic device (404) may be connected through a single base station (e.g., base station (410) or base station (430)). The structure, arrangement, and / or connection status of the system of FIG. 4 may be changed.
[0071] In one embodiment, the network (420) may include a core network and an IMS (internet protocol multimedia subsystem) network. For example, the core network may include an EPC (evolved packet core) of LTE (long term evolution) and / or a 5GC (5th generation core) of NR (new radio). For example, the IMS network may be a network for supporting VoLTE (voice over LTE) and / or VoNR (voice over NR). For example, the network (420) may further include at least one network entity used for processing and transmitting voice data. An IMS network is described in FIG. 4, but the present disclosure is not limited thereto. For example, the descriptions according to the present disclosure may also apply to VoIP (voice over internet protocol) or CSFB (circuit switched fallback) based calls.
[0072] In one embodiment, the electronic device (101) can make a call with the electronic device (440) using a communication circuit (203). For example, the electronic device (101) can generate voice data based on the utterance of the user (401). The electronic device (101) can transmit the voice data to the base station (410). The base station (410) can transmit the voice data received from the electronic device (101) to the electronic device (440) via the network (420) and the base station (430). For example, the electronic device (440) can transmit the voice data generated by the utterance of the user (402) to the base station (410) via the base station (430) and the network (420). The electronic device (101) can receive the voice data generated by the utterance of the user (402) from the base station (410). In the following, operations according to the present disclosure are described with respect to the electronic device (101) for convenience of explanation. For example, voice data transmitted by the electronic device (101) may be referred to as uplink voice data or other terms having an equivalent technical / functional meaning. For example, voice data received by the electronic device (101) may be referred to as downlink voice data or other terms having an equivalent technical / functional meaning.
[0073] For example, uplink transmission performed by the electronic device (101) may cause electromagnetic waves to the user. Electromagnetic waves from uplink transmission may cause damage to the user's health. To reduce damage to the user's health caused by electromagnetic waves, the transmit power of the electronic device (101) may be limited by a time-averaged specific absorption rate (SAR). The time-averaged SAR may represent the value of the electromagnetic wave energy absorbed by the human body per unit mass averaged over time. For example, the transmit power accumulated within a time window may be controlled so as not to exceed a threshold power. For example, the electronic device (101) may reduce the current transmit power so that the transmit power accumulated within a time window does not exceed the threshold power.
[0074] For example, the frequency and / or time interval of uplink transmission caused by the utterance of a user (e.g., user (401)) during a call may be difficult to predict. Since the frequency and / or time interval of uplink transmission are difficult to predict, the threshold power, which is the criterion for reducing transmission power, may be set relatively low. Since the threshold power, which is the criterion for reducing transmission power, is set relatively low, transmission power may be controlled to be reduced even when there is no need to limit transmission power. As transmission power is reduced by the above control, a degradation of communication performance (or call quality) may occur. For example, in a weak electric field such as a cell edge, a link failure (or call drop) may occur as transmission power is reduced by the above control.
[0075] In the following, to solve the aforementioned problems, an electronic device, a method, and a non-transient computer-readable storage medium are described for preventing a reduction in transmission power by predicting the frequency and / or time interval of uplink transmission in a call situation where an AI-based interpretation service is provided.
[0076] FIGS. 5A and 5B illustrate graphs representing the transmission power of an electronic device during a voice call service. FIGS. 5A and 5B illustrate graphs representing the transmission power and the cumulative transmission power within a time window in a call situation where an artificial intelligence-based interpretation service is provided. For example, FIG. 5A illustrates a graph (510) representing the transmission power and the cumulative transmission power within a time window in the transmitting state of the electronic device (101). For example, FIG. 5B illustrates a graph (520) representing the transmission power and the cumulative transmission power within a time window in the receiving state of the electronic device (101).
[0077] Referring to FIG. 5a, for example, an electronic device (101) may transmit voice data generated by a user's utterance during a time interval (511). The user's utterance may be based on a first language (e.g., Korean). The electronic device (101) may transmit voice data based on a transmission power (e.g., 0.1 W (watt)) during a time interval (511) between a first time instance and a fifth time instance. For example, the electronic device (101) may generate voice data of a second language (e.g., English) based on an utterance of the first language (e.g., Korean) during at least part of the time interval (511) and / or a sixth time instance. In one example, the electronic device (101) may generate text corresponding to an utterance of the first language by performing speech-to-text (STT) on an utterance of the first language. The electronic device (101) can generate text of a second language by performing machine translation (MT) on text corresponding to utterances of a first language. The electronic device (101) can generate voice data of a second language by performing text to speech (TTS) on text of a second language. For example, the electronic device (101) can transmit voice data of a second language during a time interval (512). The electronic device (101) can transmit voice data of a second language based on transmission power (e.g., 0.1W) during the time interval (512) between the 7th time instance and the 11th time instance.
[0078] As described above, after transmitting voice data corresponding to a user's utterance, transmission of voice data corresponding to the interpretation of the utterance may be anticipated. For example, in a call situation where an AI-based interpretation service is supported, an electronic device (101) in a transmission state can predict a time interval (512) during which uplink transmission will be performed. Therefore, the electronic device (101) can determine the transmission power based on the time interval (512). For example, the electronic device (101) can determine whether the total accumulated transmission power within the time window exceeds a threshold power based on the accumulated transmission power predicted during the time interval (512). For example, if the total accumulated transmission power within the time window does not exceed the threshold power, the electronic device (101) can increase the current transmission power. By increasing the current transmission power, the communication performance (or call quality) of the electronic device (101) can be improved. In this regard, more specific operations of the electronic device (101) are described in FIG. 6a and FIG. 6b.
[0079] Referring to FIG. 5b, for example, an electronic device (101) may receive voice data generated by the speech of another user (e.g., user (402) of FIG. 4) during a time interval (521). The speech of the other user may be based on a second language (e.g., English). The electronic device (101) may receive voice data during a time interval (521) between a first time instance and a fifth time instance. Since no uplink transmission by the electronic device (101) is performed during the time interval (521), the cumulative transmission power may be reduced. For example, the electronic device (101) may generate voice data of the first language from voice data of the second language during at least part of the time interval (521) and / or a sixth time instance. In one example, the electronic device (101) may generate text corresponding to the voice data of the second language by performing STT on the voice data of the second language. The electronic device (101) can generate text of the first language by performing MT on text corresponding to voice data of the second language. The electronic device (101) can generate voice data of the first language by performing TTS on text of the first language. For example, the electronic device (101) can output a sound based on voice data of the first language to the outside through a speaker (206) during a time interval (522). The electronic device (101) can output a sound based on voice data of the first language to the outside during a time interval (522) between the 7th time instance and the 11th time instance. Since uplink transmission by the electronic device (101) is not performed during the time interval (522), the accumulated transmission power may be reduced.
[0080] As described above, after receiving voice data, the output of voice data according to interpretation of the voice data can be anticipated. For example, in a call situation where an AI-based interpretation service is supported, the electronic device (101) in the receiving state can predict a time interval (522) during which uplink transmission will not be performed. Additionally, the electronic device (101) can predict that the transmission power accumulated during the time interval (522) will decrease. Therefore, the electronic device (101) can determine the transmission power based on the total accumulated transmission power within the time window predicted during the time interval (522). For example, since the total accumulated power within the time window predicted during the time interval (522) will decrease, the electronic device (101) can increase the current transmission power. By increasing the current transmission power, the communication performance (or call quality) of the electronic device (101) can be improved. In this regard, more specific operations of the electronic device (101) are described in FIG. 7.
[0081] FIG. 6a illustrates a flowchart showing the operations of an electronic device for determining transmission power in a transmitting state. The operations of FIG. 6a can be performed by the electronic device (101) of FIG. 1 and FIG. 2. For example, at least some of the operations can be controlled by the processor (201) of the electronic device (101). In the following, each operation may be performed sequentially, but is not necessarily performed sequentially. For example, the order of each operation may be changed. For example, at least two operations may be performed in parallel. FIG. 6a describes the operations of the electronic device (101) for determining transmission power in a transmitting state.
[0082] Referring to FIG. 6a, in operation 601, an electronic device (101) according to one embodiment may transmit voice data of a first language (e.g., Korean) generated based on a user's utterance. For example, the electronic device (101) may acquire the user's utterance through a microphone (205) while an application for providing interpretation services for a call is running. For example, the user's utterance may be based on a first language (e.g., Korean). For example, the application may be an artificial intelligence-based application for providing interpretation services. In one example, the application may be referred to as an assistant application, an interpretation application, a translation application, or other terms having an equivalent technical / functional meaning. For example, the electronic device (101) may generate voice data of the first language based on the user's utterance acquired through the microphone (205). For example, an electronic device (101) can transmit voice data of a first language during a time interval based on a first transmission power (e.g., 0.5W). In one example, the first transmission power may be determined based on a maximum value of transmission power (e.g., Pmax) set by a radio resource control (RRC) message and / or a transmit power control (TPC) command of downlink control information (DCI).
[0083] In operation 602, an electronic device (101) according to one embodiment may determine a time interval for transmitting voice data of a second language corresponding to voice data of a first language. For example, operation 602 may be performed over at least a portion of the time interval during which voice data of the first language is transmitted. For example, voice data of the second language may be the result of performing interpretation on a user's speech.
[0084] In one embodiment, the electronic device (101) can generate text corresponding to an utterance of the first language by performing speech-to-text (STT) on an utterance of the first language (e.g., Korean). The electronic device (101) can generate text of the second language by performing machine translation (MT) on the text of the first language. The electronic device (101) can generate voice data of the second language by performing text-to-speech (TTS) on the text of the second language. For example, the electronic device (101) can determine a time interval for transmitting voice data of the second language based on the length of an utterance according to the voice data of the second language. In one example, the length of an utterance can be determined based on the size of the voice data of the second language. In another example, the electronic device (101) can determine a time interval for transmitting voice data of the second language based on the time interval in which the voice data of the first language was transmitted. The time interval for transmitting voice data of the second language may be proportional to the time interval in which voice data of the first language is transmitted.
[0085] In operation 603, an electronic device (101) according to one embodiment can determine transmission power based on a time interval for transmitting voice data of a second language.
[0086] In one embodiment, the electronic device (101) can determine the cumulative transmission power predicted during the time interval in which voice data of the second language is transmitted. For example, the current transmission power may be the first transmission power. The electronic device (101) can identify the second transmission power increased by a predefined value from the first transmission power. The electronic device (101) can determine the cumulative transmission power predicted when uplink transmission is performed based on the second transmission power during the time interval in which voice data of the second language is transmitted. For example, the cumulative transmission power may mean the total cumulative transmission power within the time window.
[0087] In one embodiment, the electronic device (101) may determine whether the accumulated transmission power exceeds a threshold power. For example, the electronic device (101) may increase the transmission power from a first transmission power to a second transmission power based on the determination that the accumulated transmission power is less than the threshold power. In another example, the electronic device (101) may maintain the transmission power at the first transmission power based on the determination that the accumulated transmission power exceeds the threshold power.
[0088] In operation 604, an electronic device (101) according to one embodiment may transmit voice data of a second language based on transmission power. For example, the electronic device (101) may transmit uplink voice data of a second language based on transmission power during a time interval. Another electronic device (e.g., electronic device (440) of FIG. 4) that receives the uplink voice data from a base station (e.g., base station (430) of FIG. 4) may output a sound corresponding to the voice data of the second language through a speaker. For example, the electronic device (101) may display texts corresponding to the voice data of the second language through a display (204). For example, the texts displayed through the display (204) may include text of the first language and / or text of the second language.
[0089] FIG. 6b is a flowchart illustrating the operations of an electronic device for determining transmission power in a transmitting state. The operations of FIG. 6b may be performed by the electronic device (101) of FIG. 1 and FIG. 2. For example, at least some of the operations may be controlled by the processor (201) of the electronic device (101). In the following, each operation may be performed sequentially, but is not necessarily performed sequentially. For example, the order of each operation may be changed. For example, at least two operations may be performed in parallel. FIG. 6b describes the operations of the electronic device (101) for determining transmission power in a transmitting state.
[0090] Referring to FIG. 6b, in operation 611, an electronic device (101) according to one embodiment may acquire a user utterance while the accumulated transmission power exceeds a predefined threshold power. For example, the electronic device (101) may acquire the user's utterance through a microphone (205) while an application for providing interpretation services for a call is running. For example, the user's utterance may be based on a first language (e.g., Korean). The current accumulated transmission power may exceed a predefined threshold power.
[0091] In operation 612, an electronic device (101) according to one embodiment may refrain from transmitting voice data of the first language based on the user's speech. Voice data of the first language may be unnecessary to the other person on the call who uses the second language. Since voice data of the first language is unnecessary to the other person on the call who uses the second language, the electronic device (101) may refrain from transmitting voice data of the first language based on the user's speech. Since voice data of the first language is not transmitted through the communication circuit (203), an increase in cumulative transmission power due to the transmission of said voice data can be prevented.
[0092] In operation 613, an electronic device (101) according to one embodiment may determine a time interval for transmitting voice data of a second language generated based on a user's speech. For example, the voice data of the second language may be the result of performing interpretation on the user's speech.
[0093] In one embodiment, the electronic device (101) can generate text corresponding to an utterance of the first language by performing speech-to-text (STT) on an utterance of the first language (e.g., Korean). The electronic device (101) can generate text of the second language by performing machine translation (MT) on the text of the first language. The electronic device (101) can generate voice data of the second language by performing text-to-speech (TTS) on the text of the second language. For example, the electronic device (101) can determine a time interval for transmitting voice data of the second language based on the length of an utterance according to the voice data of the second language. In one example, the length of an utterance can be determined based on the size of the voice data of the second language. In another example, the electronic device (101) can determine a time interval for transmitting voice data of the second language based on the time interval in which the voice data of the first language was transmitted. The time interval for transmitting voice data of the second language may be proportional to the time interval in which voice data of the first language is transmitted.
[0094] In operation 614, an electronic device (101) according to one embodiment can determine transmission power based on a time interval for transmitting voice data of a second language.
[0095] In one embodiment, the electronic device (101) can determine the cumulative transmission power predicted during the time interval in which voice data of the second language is transmitted. For example, the current transmission power may be the first transmission power. The electronic device (101) can identify the second transmission power increased by a predefined value from the first transmission power. The electronic device (101) can determine the cumulative transmission power predicted when uplink transmission is performed based on the second transmission power during the time interval in which voice data of the second language is transmitted. For example, the cumulative transmission power may mean the total cumulative transmission power within the time window.
[0096] In one embodiment, the electronic device (101) may determine whether the accumulated transmission power exceeds a threshold power. For example, the electronic device (101) may increase the transmission power from a first transmission power to a second transmission power based on the determination that the accumulated transmission power is less than the threshold power. In another example, the electronic device (101) may maintain the transmission power at the first transmission power based on the determination that the accumulated transmission power exceeds the threshold power.
[0097] In operation 615, an electronic device (101) according to one embodiment may transmit voice data of a second language based on transmission power. For example, the electronic device (101) may transmit uplink voice data of a second language based on transmission power during a time interval. Another electronic device (e.g., electronic device (440) of FIG. 4) that receives the uplink voice data from a base station (e.g., base station (430) of FIG. 4) may output a sound corresponding to the voice data of the second language through a speaker. For example, the electronic device (101) may display texts corresponding to the voice data of the second language through a display (204). For example, the texts displayed through the display (204) may include text of the first language and / or text of the second language.
[0098] FIG. 7 illustrates a flowchart showing the operations of an electronic device for determining transmission power in a receiving state. The operations of FIG. 7 may be performed by the electronic device (101) of FIG. 1 and FIG. 2. For example, at least some of the operations may be controlled by the processor (201) of the electronic device (101). In the following, each operation may be performed sequentially, but is not necessarily performed sequentially. For example, the order of each operation may be changed. For example, at least two operations may be performed in parallel. For example, the operations of the electronic device (101) described in FIG. 7 may be performed subsequently to the operations of the electronic device (101) described in FIG. 6a or FIG. 6b. FIG. 7 describes the operations of the electronic device (101) for determining transmission power in a receiving state.
[0099] Referring to FIG. 7, in operation 701, an electronic device (101) according to one embodiment may receive voice data of a second language (e.g., English). For example, the electronic device (101) may receive voice data of the second language through a communication circuit (203) while an application for providing interpretation services for a call is running. For example, the voice data of the second language may be based on the utterance of another user (e.g., user (402) of FIG. 4). For example, the utterance of another user may be based on the second language (e.g., English). For example, the application may be an artificial intelligence-based application for providing interpretation services. In one example, the application may be referred to as an assistant application, an interpretation application, a translation application, or other terms having an equivalent technical / functional meaning. For example, the electronic device (101) may receive voice data of the second language during a time interval. The electronic device (101) can output a sound according to voice data of the second language through a speaker (206) during at least part of the time interval. For example, it may be predicted that uplink transmission by the electronic device (101) will not be performed during the time interval. For example, if uplink transmission by the electronic device (101) is not performed, the accumulated transmission power within the time window may be reduced.
[0100] In operation 702, an electronic device (101) according to one embodiment can generate voice data of a first language from voice data of a second language. For example, the electronic device (101) can generate text corresponding to voice data of a second language by performing speech to text (STT) on voice data of a second language. The electronic device (101) can generate text of a first language by performing machine translation (MT) on text of a second language. The electronic device (101) can generate voice data of a first language by performing text to speech (TTS) on text of a first language. For example, voice data of a first language may be the result of performing interpretation on the speech of another user (e.g., voice data of a second language).
[0101] In operation 703, an electronic device (101) according to one embodiment may determine an output time interval for outputting a sound according to voice data of a first language. For example, the electronic device (101) may determine an output time interval during which a sound according to voice data of a first language is output through a speaker (206). For example, the electronic device (101) may determine an output time interval based on the size of the voice data of the first language.
[0102] In operation 704, an electronic device (101) according to one embodiment can determine transmission power based on an output time interval.
[0103] In one embodiment, the electronic device (101) can determine the cumulative transmission power predicted during the time interval in which a sound according to the voice data of the first language is output. For example, the current transmission power may be the second transmission power. For example, it may be assumed that uplink transmission by the electronic device (101) will not be performed during the output time interval. The electronic device (101) can determine the cumulative transmission power predicted when uplink transmission is not performed during the output time interval based on the output time interval. For example, the cumulative transmission power may mean the total cumulative transmission power within the time window.
[0104] In one embodiment, the electronic device (101) may determine whether the accumulated transmission power exceeds a threshold power. For example, the electronic device (101) may increase the transmission power from a second transmission power to a third transmission power based on the determination that the accumulated transmission power is less than the threshold power. In another example, the electronic device (101) may maintain the transmission power at a first transmission power based on the determination that the accumulated transmission power exceeds the threshold power.
[0105] In one embodiment, the electronic device (101) may output a sound corresponding to voice data of a first language through a speaker (206) during an output time interval. For example, the electronic device (101) may display texts corresponding to voice data of a first language through a display (204). For example, the texts displayed through the display (204) may include text of a first language and / or text of a second language.
[0106] In one embodiment, the electronic device (101) may perform operations according to FIG. 6a or FIG. 6b based on an increased third transmission power after an output time interval.
[0107] FIG. 8 illustrates a graph showing the transmission power of an electronic device during a voice call service. FIG. 8 illustrates a graph (800) showing the maximum transmission power (801) and transmission power (802) of an electronic device (101) in a call situation where an artificial intelligence-based interpretation service is provided.
[0108] Referring to FIG. 8, in one embodiment, an electronic device (101) may transmit voice data of a first language based on a first transmission power (e.g., 0.5 W (watt)) during a first time interval (810). In one example, the voice data transmitted by the electronic device (101) may be referred to as uplink voice data or other terms having an equivalent technical / functional meaning. For example, the electronic device (101) may acquire a user's utterance through a microphone (205) while an application for providing interpretation services for a call is running. For example, the user's utterance may be based on a first language (e.g., Korean). For example, the user's utterance may occur in at least part of the first time interval (810) between a first time instance and a fifth time instance. The electronic device (101) may generate voice data of the first language based on the user's utterance. The electronic device (101) can transmit voice data of a first language based on a first transmission power (e.g., 0.5W) during a first time interval (810). For example, the first transmission power may be determined based on a transmit power control (TPC) command of downlink control information (DCI) obtained from a base station. In another example, the first transmission power may have a value reduced by a predetermined value (e.g., 1dB) compared to the transmission power determined based on the TPC command.
[0109] In one embodiment, the electronic device (101) may determine a second time interval (820) for transmitting voice data of a second language (e.g., English). For example, the voice data of the second language may be the result of performing interpretation on a user's utterance. For example, the electronic device (101) may generate voice data of the second language based on an utterance of the first language in at least part of the first time interval (810) and / or in a sixth time instance. For example, the electronic device (101) may generate text corresponding to an utterance of the first language by performing speech to text (STT) on an utterance of the first language. The electronic device (101) may generate text of the second language by performing machine translation (MT) on the text of the first language. The electronic device (101) may generate voice data of the second language by performing text to speech (TTS) on the text of the second language. For example, the electronic device (101) may determine a second time interval (820) based on the length of a speech according to the voice data of the second language. In one example, the length of the speech may be determined based on the size of the voice data of the second language. In another example, the electronic device (101) may determine a second time interval (820) based on a first time interval (810). The second time interval (820) may be proportional to the first time interval (810). For example, the second time interval (820) may represent a time interval between the seventh time instance and the eleventh time instance.
[0110] In one embodiment, the electronic device (101) may determine a second transmission power (e.g., 0.7W) for transmitting voice data in the second time interval (820) based on the second time interval (820) during which voice data of the second language is to be transmitted. For example, the electronic device (101) may identify a second transmission power (e.g., 0.7W) that is increased by a predefined value (e.g., 0.2W) from a first transmission power (e.g., 0.5W). The electronic device (101) may determine an expected cumulative transmission power when transmitting voice data of the second language during the second time interval (820) based on the second transmission power. In the example illustrated in FIG. 8, the expected cumulative transmission power may be 3.5. Based on the expected cumulative transmission power, the electronic device (101) may determine whether the total cumulative transmission power within the time window in the eleventh time instance exceeds a threshold power. For example, the electronic device (101) may increase the transmission power from a first transmission power (e.g., 0.5W) to a second transmission power (e.g., 0.7W) upon determining that the total accumulated transmission power within the time window is less than the threshold power. In another example, the electronic device (101) may maintain the transmission power at the first transmission power (e.g., 0.5W) upon determining that the total accumulated transmission power within the time window exceeds the threshold power. The electronic device (101) may transmit voice data of the second language during a second time interval (820) based on the determined transmission power. Another electronic device (e.g., the electronic device (440) of FIG. 4) receiving the voice data from a base station (e.g., the base station (430) of FIG. 4) may output a sound corresponding to the voice data of the second language through a speaker during at least part of the second time interval (820).
[0111] As described above, the electronic device (101) can predict a second time interval (820) during which data related to the interpretation of a user's speech is to be transmitted. For example, the occurrence of uplink transmission by the electronic device (101) during the second time interval (820) can be predicted. For example, the electronic device (101) can increase the transmission power based on the second time interval (820). By increasing the transmission power, the communication performance (or call quality) of the electronic device (101) can be improved.
[0112] Referring to FIG. 8, in one embodiment, an electronic device (101) may receive voice data of a second language during a third time interval (830). In one example, the voice data received by the electronic device (101) may be referred to as downlink voice data or other terms having an equivalent technical / functional meaning. For example, the electronic device (101) may output a sound corresponding to the voice data of the second language through a speaker (206) during the third time interval (830). For example, it may be predicted that uplink transmission by the electronic device (101) will not be performed during the third time interval (830). For example, if uplink transmission by the electronic device (101) is not performed, the total accumulated transmission power within the time window may be reduced.
[0113] In one embodiment, the electronic device (101) may determine a fourth time interval (840) for outputting sound according to voice data of the first language through a speaker (206). For example, the voice data of the first language may be the result of performing interpretation on the speech of another user (e.g., voice data of the second language). For example, the electronic device (101) may generate voice data of the first language based on voice data of the second language in at least part of the third time interval (830). For example, the electronic device (101) may generate text corresponding to voice data of the second language by performing STT on voice data of the second language. The electronic device (101) may generate text of the first language by performing MT on text of the second language. The electronic device (101) may generate voice data of the first language by performing TTS on text of the first language. For example, the electronic device (101) may determine a fourth time interval (840) based on the length of a speech according to the speech data of the first language. In one example, the length of the speech may be determined based on the size of the speech data of the first language. In another example, the electronic device (101) may determine a fourth time interval (840) based on a third time interval (830). The fourth time interval (840) may be proportional to the third time interval (830). For example, the fourth time interval (840) may represent a time interval between the 18th time instance and the 23rd time instance.
[0114] In one embodiment, the electronic device (101) may determine a third transmission power (e.g., 0.9W) in the fourth time interval (840) based on the fourth time interval (840) during which voice data of the first language is output. For example, it may be assumed that no uplink transmission by the electronic device (101) will be performed during the fourth time interval (840). The predicted cumulative transmission power during the fourth time interval (840) may be 0W. Based on the predicted cumulative transmission power (e.g., 0W), the electronic device (101) may determine whether the total cumulative transmission power within the time window in the 23rd time instance exceeds a threshold power. For example, the electronic device (101) may increase the transmission power from a second transmission power (e.g., 0.7W) to a third transmission power (e.g., 0.9W) upon determining that the total cumulative transmission power within the time window is less than the threshold power. In another example, the electronic device (101) can maintain the transmission power at a second transmission power (e.g., 0.7W) upon determining that the total accumulated transmission power within the time window exceeds a threshold power.
[0115] As described above, the electronic device (101) can predict a fourth time interval (840) during which data related to the interpretation of another user's speech is received. The likelihood that uplink transmission by the electronic device (101) will be performed during the fourth time interval (840) may be relatively low. If uplink transmission is not performed, the accumulated power within the time window may decrease. Accordingly, the electronic device (101) may increase transmission power based on the fourth time interval (820) during which uplink transmission is predicted not to be performed. By increasing transmission power, the communication performance (or call quality) of the electronic device (101) may be improved.
[0116] FIG. 9 illustrates a flowchart showing the operations of an electronic device for determining transmission power. The operations of FIG. 9 may be performed by the electronic device (101) of FIG. 1 and FIG. 2. For example, at least some of the operations may be controlled by a processor (201) of the electronic device (101). In the following, each operation may be performed sequentially, but is not necessarily performed sequentially. For example, the order of each operation may be changed. For example, at least two operations may be performed in parallel.
[0117] Referring to FIG. 9, in operation 901, an electronic device (101) according to one embodiment may receive voice data of a second language (e.g., English). In one example, the voice data received by the electronic device (101) may be referred to as downlink voice data or other terms having an equivalent technical / functional meaning. For example, the voice data of the second language may be based on the utterance of another user (e.g., user (402) of FIG. 4). For example, the electronic device (101) may receive voice data of the second language while an application for providing interpretation services for a call is running. In one example, the application may be referred to as an assistant application, an interpretation application, a translation application, or other terms having an equivalent technical / functional meaning.
[0118] In operation 902, an electronic device (101) according to one embodiment can display texts of a first language that respond to voice data of a second language through a display (204).
[0119] In one embodiment, the electronic device (101) can generate text corresponding to downlinked speech data of the second language by performing speech-to-text (STT) on speech data of the second language (e.g., English). The electronic device (101) can generate text of the first language (e.g., Korean) by performing machine translation (MT) on text of the second language. The electronic device (101) can generate texts that are responses to the utterances of other users based on text of the first language. For example, an artificial intelligence model may be used to generate the texts. In one example, the artificial intelligence model may be a large language model (LLM) or a large multimodal model (LMM). However, this is merely an example and the present disclosure is not limited thereto. For example, other artificial intelligence models may be used to generate the texts. The electronic device (101) can display the texts through a display (204).
[0120] In operation 903, an electronic device (101) according to one embodiment can identify user input for text among texts displayed through a display (204). For example, the electronic device (101) can identify one text among texts displayed through the display (204) based on the user's touch input.
[0121] In operation 904, an electronic device (101) according to one embodiment can generate voice data of a second language based on text of a first language. For example, the electronic device (101) can generate text of a second language by performing machine translation (MT) on text of a first language. The electronic device (101) can generate voice data of a second language by performing text to speech (TTS) on text of a second language.
[0122] In operation 905, an electronic device (101) according to one embodiment may determine a transmission time interval for transmitting voice data of a second language. For example, the electronic device (101) may determine a time interval for transmitting the voice data based on the size of the voice data of the second language.
[0123] In operation 906, an electronic device (101) according to one embodiment may determine a transmission power based on a time interval during which voice data of a second language is transmitted. For example, the current transmission power of the electronic device (101) may be a first transmission power. For example, the electronic device (101) may identify a second transmission power that is increased by a predefined value from the first transmission power. The electronic device (101) may determine an expected cumulative transmission power when transmitting voice data of the second language during a time interval based on the second transmission power. The electronic device (101) may determine whether the total cumulative transmission power within a time window exceeds a threshold power based on the expected cumulative transmission power. For example, the electronic device (101) may increase the transmission power from the first transmission power to the second transmission power based on the determination that the total cumulative transmission power is less than the threshold power. In another example, the electronic device (101) can maintain the transmission power at a first transmission power based on the determination that the total accumulated transmission power exceeds a threshold power.
[0124] In operation 907, an electronic device (101) according to one embodiment may transmit voice data of a second language during a time interval based on transmission power. For example, another electronic device (e.g., electronic device (440) of FIG. 4) that receives the voice data from a base station (e.g., base station (430) of FIG. 4) may output a sound corresponding to the voice data of the second language through a speaker. For example, the electronic device (101) may display texts corresponding to the voice data of the second language through a display (204). For example, the texts displayed through the display (204) may include text of the first language and / or text of the second language.
[0125] FIG. 10 illustrates a flowchart showing the operations of an electronic device for determining transmission power. The operations of FIG. 10 may be performed by the electronic device (101) of FIG. 1 and FIG. 2. For example, at least some of the operations may be controlled by a processor (201) of the electronic device (101). In the following, each operation may be performed sequentially, but is not necessarily performed sequentially. For example, the order of each operation may be changed. For example, at least two operations may be performed in parallel.
[0126] Referring to FIG. 10, in operation 1001, an electronic device (101) according to one embodiment may generate voice data of a first language based on a user's utterance. For example, the electronic device (101) may acquire the user's utterance through a microphone (205) while an application for providing interpretation services for a call is running. For example, the user's utterance may be based on a first language (e.g., Korean). For example, the application may be an artificial intelligence-based application for providing interpretation services. In one example, the application may be referred to as an assistant application, an interpretation application, a translation application, or other terms having an equivalent technical / functional meaning. For example, the electronic device (101) may generate voice data of a first language based on the user's utterance acquired through the microphone (205).
[0127] In operation 1002, an electronic device (101) according to one embodiment may determine a first prediction time interval for receiving a response in a second language that is predicted or expected from an utterance in a first language. For example, the electronic device (101) may generate text corresponding to an utterance in the first language by performing speech-to-text (STT) on an utterance in the first language. The electronic device (101) may generate text in the second language by performing machine translation (MT) on an utterance in the first language. Based on the text in the second language, the electronic device (101) may generate (or predict) a response in the second language (e.g., English) to an utterance in the first language (e.g., Korean). For example, an artificial intelligence model may be used to generate a response in the second language. In one example, the artificial intelligence model may be a large language model (LLM) or a large multimodal model (LMM). However, this is merely an example, and the present disclosure is not limited thereto. For example, other artificial intelligence models may be used to generate a response in a second language to an utterance in a first language. The electronic device (101) may determine a first predicted time interval for receiving voice data for a response generated using an artificial intelligence model.
[0128] In operation 1003, an electronic device (101) according to one embodiment may determine a second predicted time interval for outputting voice data of a first language corresponding to a response of a second language. For example, the electronic device (101) may generate text of the first language by performing MT on a response of the second language. The electronic device (101) may determine a second predicted time interval for outputting voice data of the first language.
[0129] In operation 1004, an electronic device (101) according to one embodiment may determine transmission power based on a first predicted time interval and a second predicted time interval. For example, the first predicted time interval and the second predicted time interval may represent time intervals during which uplink transmission by the electronic device (101) is predicted not to be performed. For example, during the first predicted time interval and the second predicted time interval, the total accumulated transmission power within the time window may be expected to decrease. The electronic device (101) may determine transmission power based on the first predicted time interval and the second predicted time interval. For example, the electronic device (101) may increase transmission power from a first transmission power to a second transmission power. For example, the difference between the first transmission power and the second transmission power may be predefined. The electronic device (101) may transmit voice data of the first language generated in operation 1001 based on the second transmission power. In examples not limited to this, the transmission power may be determined further based on the time interval during which the voice data of the second language is transmitted. The voice data of the second language may be the result of performing interpretation on the user's speech. For example, the time interval during which the voice data of the second language is transmitted may precede the first predicted time interval and the second predicted time interval. For example, the time interval during which the voice data of the second language is transmitted may represent the time interval during which uplink transmission by the electronic device (101) is predicted to be performed. During the time interval during which the voice data of the second language is transmitted, the total accumulated transmission power within the time window may be expected to increase. The electronic device (101) may determine the transmission power based on the time interval during which the voice data of the second language is transmitted, the first predicted time interval, and / or the second predicted time interval.
[0130] FIG. 11 illustrates a flowchart showing the operations of an electronic device for determining transmission power. The operations of FIG. 11 may be performed by the electronic device (101) of FIG. 1 and FIG. 2. For example, at least some of the operations may be controlled by a processor (201) of the electronic device (101). In the following, each operation may be performed sequentially, but is not necessarily performed sequentially. For example, the order of each operation may be changed. For example, at least two operations may be performed in parallel.
[0131] Referring to FIG. 11, in operation 1101, an electronic device (101) according to one embodiment can receive voice data of a second language (e.g., English). For example, the electronic device (101) can receive voice data of the second language through a communication circuit (203) while an application for providing interpretation services for a call is running.
[0132] In operation 1102, an electronic device (101) according to one embodiment may determine a first predicted time interval for transmitting an utterance of a first language that is expected or predicted from voice data of a second language. For example, the electronic device (101) may generate text corresponding to the voice data of the second language by performing speech to text (STT) on the voice data of the second language. The electronic device (101) may generate text of the first language by performing machine translation (MT) on the text of the second language. The electronic device (101) may generate (or predict) a response (e.g., user's response) to voice data of the second language (e.g., utterance of a call partner) based on the text of the first language. For example, an artificial intelligence model may be used to generate (or predict) the response. In one example, the artificial intelligence model may be a large language model (LLM) or a large multimodal model (LMM). However, this is merely an example and the present disclosure is not limited thereto. For example, other artificial intelligence models may be used to generate (or predict) a response. The electronic device (101) may determine a first predicted time interval for transmitting voice data based on the generated response.
[0133] In operation 1103, an electronic device (101) according to one embodiment may determine a second predicted time interval for transmitting voice data of a second language corresponding to an utterance of a first language. For example, the electronic device (101) may generate text of the second language by performing MT on a response of the first language. Based on the text of the second language, the electronic device (101) may determine a second predicted time interval for transmitting voice data of the second language.
[0134] In operation 1104, an electronic device (101) according to one embodiment may determine transmission power based on a first predicted time interval and a second predicted time interval. For example, the first predicted time interval and the second predicted time interval may represent time intervals in which uplink transmission by the electronic device (101) is predicted to be performed. For example, during the first predicted time interval and the second predicted time interval, the total cumulative transmission power within the time window may be expected to increase. The electronic device (101) may determine transmission power based on the first predicted time interval and the second predicted time interval. In an example that is not limited, transmission power may be determined further based on a time interval in which an utterance according to voice data of the first language is output. The utterance of the first language may be the result of performing interpretation on the utterance of the other person on the call. For example, the time interval during which the utterance of the first language is output may precede the first predicted time interval and the second predicted time interval. For example, the time interval during which the utterance of the first language is output may represent a time interval during which uplink transmission by the electronic device (101) is predicted not to be performed. During the time interval during which the utterance of the first language is output, the total accumulated transmission power within the time window may be expected to decrease. The electronic device (101) may determine the transmission power based on the time interval during which the utterance of the first language is output, the first predicted time interval, and / or the second predicted time interval.
[0135] For example, the current transmission power may be a first transmission power. The electronic device (101) may identify a second transmission power that is increased from the first transmission power by a predefined value. The electronic device (101) may determine whether the predicted cumulative transmission power exceeds a threshold power when uplink transmission is performed based on the second transmission power during predicted time intervals. For example, the electronic device (101) may increase the transmission power from the first transmission power to the second transmission power based on the determination that the cumulative transmission power is less than the threshold power. In another example, the electronic device (101) may maintain the transmission power at the first transmission power based on the determination that the cumulative transmission power exceeds the threshold power.
[0136] FIG. 12 is a flowchart illustrating the operations of an electronic device for determining transmission power. The operations of FIG. 12 may be performed by the electronic device (101) of FIG. 1 and FIG. 2. For example, at least some of the operations may be controlled by the processor (201) of the electronic device (101). In the following, each operation may be performed sequentially, but is not necessarily performed sequentially. For example, the order of each operation may be changed. For example, at least two operations may be performed in parallel. The operations of the electronic device (101) described in FIG. 12 may be performed subsequently to the operations of the electronic device (101) described in FIG. 10 or FIG. 11.
[0137] Referring to FIG. 12, in operation 1201, an electronic device (101) according to one embodiment may change the transmission power from a first transmission power to a second transmission power based on predicted time intervals. In one example, the electronic device (101) may change (or increase) the transmission power from a first transmission power to a second transmission power according to the operations described in FIG. 10. In one example, the electronic device (101) may change (or increase) the transmission power from a first transmission power to a second transmission power according to the operations described in FIG. 11.
[0138] In operation 1202, an electronic device (101) according to one embodiment can determine whether an error for a predicted time interval is less than a threshold value. For example, the error for a predicted time interval may represent the difference between the predicted time intervals (e.g., a first predicted time interval and a second predicted time interval) and the actual time intervals exemplified in FIG. 10 and FIG. 11.
[0139] In operation 1203, an electronic device (101) according to one embodiment can maintain the transmission power at a second transmission power. For example, the electronic device (101) can maintain the transmission power at a second transmission power based on a determination that the error for the prediction time interval is less than a threshold value.
[0140] In operation 1204, an electronic device (101) according to one embodiment may change the transmission power from a second transmission power to a first transmission power. For example, the electronic device (101) may change (or decrease) the transmission power from the second transmission power to the first transmission power based on a determination that the error for the prediction time interval exceeds a threshold value.
[0141] The technical problems to be solved in this disclosure are not limited to those mentioned above, and other technical problems not mentioned will be clearly understood by those skilled in the art to which this disclosure pertains.
[0142] The electronic device described above may include at least one speaker. The electronic device may include at least one microphone. The electronic device may include a communication circuit. The electronic device may include a memory that stores instructions and includes one or more storage media. The electronic device may include at least one processor that includes a processing circuit. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause uplink voice data of a first language generated based on a user's utterance while an application for providing interpretation services for a call performed using the communication circuit is executed. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause a transmission time interval for transmitting uplink voice data of a second language based on generating uplink voice data of a second language from the utterance of the first language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to determine a transmission power for transmitting uplink voice data of the second language based on the transmission time interval for transmitting uplink voice data of the second language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to transmit uplink voice data of the second language based on the transmission power.
[0143] For example, when the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause voice data of the first language to be generated from the downlink voice data of the second language based on receiving downlink voice data of the second language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause an output time interval to be determined for outputting a sound according to the voice data of the first language through the at least one speaker. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the transmission power to be increased based on the output time interval for outputting the sound.
[0144] For example, when the instructions are executed individually or collectively by the at least one processor, the electronic device may cause to determine the cumulative transmission power to be used within a time window based on the transmission time interval and a second transmission power greater than the first transmission power. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause to increase the transmission power for transmitting uplink voice data of the second language from the first transmission power to the second transmission power based on the determination that the cumulative transmission power is less than the threshold power. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause to maintain the transmission power for transmitting uplink voice data of the second language at the first transmission power based on the determination that the cumulative transmission power exceeds the threshold power.
[0145] For example, when the instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to generate text of the first language corresponding to the utterance by performing speech-to-text (STT) on the utterance. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to generate text of the second language by performing machine translation (MT) on the text of the first language. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to generate uplink speech data of the second language by performing text-to-speech (TTS) on the text of the second language.
[0146] For example, when the instructions are executed individually or collectively by the at least one processor, the electronic device may cause sound corresponding to uplink voice data of the second language to be output through the at least one speaker within the transmission time interval. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause texts corresponding to uplink voice data of the second language to be displayed through the display within the transmission time interval. The texts may include text of the first language and text of the second language.
[0147] For example, when the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause an artificial intelligence model to determine a predicted time interval for receiving downlink voice data of the second language predicted from uplink voice data of the first language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause an artificial intelligence model to determine a predicted output time interval for outputting a sound according to voice data of the first language generated based on downlink voice data of the second language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause an artificial intelligence model to increase the transmission power from a first transmission power to a second transmission power based on the predicted time interval and the predicted output time interval.
[0148] For example, when the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to receive downlink voice data of the second language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to determine a first prediction time interval for transmitting second uplink voice data of the first language predicted from the downlink voice data of the second language using an artificial intelligence model. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to determine a second prediction time interval for transmitting third uplink voice data of the second language corresponding to the second uplink voice data of the first language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the transmission power to increase from the first transmission power to the second transmission power based on the first prediction time interval and the second prediction time interval.
[0149] For example, when the instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to receive downlink voice data of the second language. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to display responses to the downlink voice data of the second language, generated using an artificial intelligence model, through the display. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to generate second uplink voice data of the second language from the responses based on identifying user input for the response among the responses. When the instructions are executed individually or collectively by the at least one processor, the electronic device may cause the electronic device to determine a second transmission time interval for transmitting the second uplink voice data of the second language. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the transmission power to be changed based on the second transmission time interval. When the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the second uplink voice data to be transmitted based on the changed transmission power.
[0150] For example, the above threshold power can be configured to satisfy a predefined specific absorption rate (SAR).
[0151] For example, when the above instructions are executed individually or collectively by the at least one processor, the electronic device may cause the transmission time interval to be determined based on the size of the uplink voice data of the second language.
[0152] A method performed by an electronic device comprising at least one speaker, at least one microphone, and a communication circuit as described above may include the operation of transmitting uplink voice data of a first language generated based on a user's utterance while an application for providing interpretation services for a call performed using the communication circuit is executed. The method may include the operation of determining a transmission time interval for transmitting uplink voice data of the second language based on generating uplink voice data of the second language from the utterance of the first language. The method may include the operation of determining a transmission power for transmitting uplink voice data of the second language based on the transmission time interval for transmitting uplink voice data of the second language. The method may include the operation of transmitting uplink voice data of the second language based on the transmission power.
[0153] For example, the above method may include an operation of generating voice data of the first language from downlink voice data of the second language based on receiving downlink voice data of the second language. The above method may include an operation of determining an output time interval for outputting a sound according to the voice data of the first language through the at least one speaker. The above method may include an operation of increasing the transmission power based on the output time interval for outputting the sound.
[0154] For example, the method may include an operation to determine an accumulated transmission power to be used within a time window based on the transmission time interval and a second transmission power greater than the first transmission power. The method may include an operation to increase the transmission power for transmitting uplink voice data of the second language from the first transmission power to the second transmission power based on a determination that the accumulated transmission power is less than a threshold power. The method may include an operation to maintain the transmission power for transmitting uplink voice data of the second language at the first transmission power based on a determination that the accumulated transmission power exceeds the threshold power.
[0155] For example, the above method may include an operation of generating text of the first language corresponding to the utterance by performing speech to text (STT) on the utterance. The above method may include an operation of generating text of the second language by performing machine translation (MT) on the text of the first language. The above method may include an operation of generating uplink voice data of the second language by performing text to speech (TTS) on the text of the second language.
[0156] For example, the above method may include an operation of outputting a sound corresponding to the uplink voice data of the second language through the at least one speaker within the transmission time interval. The above method may include an operation of displaying texts corresponding to the uplink voice data of the second language through the display within the transmission time interval. The texts may include text of the first language and text of the second language.
[0157] For example, the above method may include an operation of determining a predicted time interval for receiving downlink voice data of the second language predicted from uplink voice data of the first language using an artificial intelligence model. The above method may include an operation of determining a predicted output time interval for outputting a sound according to voice data of the first language generated based on downlink voice data of the second language. The above method may include an operation of increasing the transmission power from a first transmission power to a second transmission power based on the predicted time interval and the predicted output time interval.
[0158] For example, the above method may include an operation of receiving downlink voice data of the second language. The above method may include an operation of determining a first prediction time interval for transmitting second uplink voice data of the first language predicted from the downlink voice data of the second language using an artificial intelligence model. The above method may include an operation of determining a second prediction time interval for transmitting third uplink voice data of the second language corresponding to the second uplink voice data of the first language. The above method may include an operation of increasing the transmission power from the first transmission power to the second transmission power based on the first prediction time interval and the second prediction time interval.
[0159] For example, the above method may include an operation of receiving downlink voice data of the second language. The above method may include an operation of displaying responses to the downlink voice data of the second language, generated using an artificial intelligence model, through the display. The above method may include an operation of generating second uplink voice data of the second language from the responses based on identifying user input for the responses among the responses. The above method may include an operation of determining a second transmission time interval for transmitting the second uplink voice data of the second language. The above method may include an operation of changing the transmission power based on the second transmission time interval. The above method may include an operation of transmitting the second uplink voice data based on the changed transmission power.
[0160] For example, the above threshold power can be configured to satisfy a predefined specific absorption rate (SAR).
[0161] For example, the above method may include an operation to determine the transmission time interval based on the size of the uplink voice data of the second language.
[0162] The effects obtainable from the present disclosure are not limited to those mentioned above, and other unmentioned effects will be clearly understood by those skilled in the art to which the present disclosure belongs.
[0163] For one or more embodiments, at least one of the components described in one or more of the prior art drawings may be configured to perform one or more operations, techniques, processes and / or methods as described in the present disclosure. For example, a processor (e.g., a baseband processor) described in the present disclosure in relation to one or more of the prior art drawings may be configured to operate according to one or more examples described in the present disclosure. As another example, circuits associated with user equipment (UE), a base station, a network element, etc., as described above in relation to one or more of the prior art drawings may be configured to operate according to one or more examples described herein.
[0164] Any of the embodiments described above may be combined with any other embodiment (or combination of embodiments) unless otherwise explicitly stated. The foregoing description of one or more embodiments is for illustrative and explanatory purposes only, and is not intended to limit or exhaust the scope of the embodiments in the exact form disclosed. Modifications and variations are possible in light of the foregoing teachings or may be obtained from the practice of various embodiments.
[0165] The electronic devices according to the various embodiments disclosed in this document may be of various forms. The electronic devices may include, for example, portable communication devices (e.g., smartphones), computer devices, portable multimedia devices, portable medical devices, cameras, electronic devices, or consumer electronics. The electronic devices according to the embodiments of this document are not limited to the devices described above.
[0166] The various embodiments of this document and the terms used therein are not intended to limit the technical features described in this document to specific embodiments, and should be understood to include various modifications, equivalents, or substitutions of said embodiments. In connection with the description of the drawings, similar reference numerals may be used for similar or related components. The singular form of a noun corresponding to an item may include one or more of said items unless the relevant context clearly indicates otherwise. In this document, phrases such as "A or B," "at least one of A and B," "at least one of A or B," "A, B or C," "at least one of A, B and C," and "at least one of A, B, or C" may each include any one of the items listed together in the corresponding phrase, or all possible combinations thereof. Terms such as "first," "second," or "first" or "second" may be used simply to distinguish said components from other said components and do not limit said components in any other aspect (e.g., importance or order). Where any (e.g., 1st) component is referred to as "coupled" or "connected" to another (e.g., 2nd) component, with or without the terms "functionally" or "communicationly," it means that said any component may be connected to said other component directly (e.g., via a wire), wirelessly, or through a third component.
[0167] The term “module” as used in the various embodiments of this document may include a unit implemented in hardware, software, or firmware, and may be used interchangeably with terms such as logic, logic block, component, or circuit, for example. A module may be a component formed integrally, or a minimum unit of said component or a part thereof that performs one or more functions. For example, according to one embodiment, a module may be implemented in the form of an application-specific integrated circuit (ASIC).
[0168] Various embodiments of the present document may be implemented as software (e.g., program (140)) comprising one or more instructions stored in a storage medium (e.g., internal memory (136) or external memory (138)) readable by a machine (e.g., electronic device (101)). For example, a processor (e.g., processor (120)) of the machine (e.g., electronic device (101)) may call at least one of the one or more instructions stored in the storage medium and execute it. This enables the machine to be operated to perform at least one function according to the at least one called instruction. The one or more instructions may include code generated by a compiler or code that can be executed by an interpreter. The storage medium readable by the machine may be provided in the form of a non-transitory storage medium. Here, 'non-temporary' simply means that the storage medium is a tangible device and does not contain a signal (e.g., electromagnetic waves), and the term does not distinguish between cases where data is stored semi-permanently and cases where it is stored temporarily.
[0169] According to one embodiment, the method according to the various embodiments disclosed herein may be provided as included in a computer program product. The computer program product may be traded between a seller and a buyer as a product. The computer program product may be distributed in the form of a device-readable storage medium (e.g., compact disc read-only memory (CD-ROM)), or distributed online (e.g., download or upload) through an application store (e.g., Play Store™) or directly between two user devices (e.g., smartphones). In the case of online distribution, at least a portion of the computer program product may be temporarily stored or temporarily created on a device-readable storage medium, such as the memory of a manufacturer's server, an application store's server, or a relay server.
[0170] According to various embodiments, each component (e.g., module or program) of the components described above may include a singular or multiple entities, and some of the multiple entities may be separated and placed in other components. According to various embodiments, one or more of the components or operations of the aforementioned components may be omitted, or one or more other components or operations may be added. Generally or additionally, multiple components (e.g., module or program) may be integrated into a single component. In this case, the integrated component may perform one or more functions of each of the multiple components in the same or similar manner as those performed by the corresponding component among the multiple components prior to integration. According to various embodiments, operations performed by the module, program, or other components may be executed sequentially, in parallel, iteratively, or heuristically, or one or more of the operations may be executed in a different order, omitted, or one or more other operations may be added.
Claims
1. In an electronic device, At least one speaker; At least one microphone; Communication circuit; Memory for storing instructions and including one or more storage media; and It includes at least one processor comprising a processing circuit, and When the above instructions are executed individually or collectively by the at least one processor, the electronic device, While an application for providing interpretation services for a call performed using the above communication circuit is running, uplink voice data of a first language generated based on a user's utterance is transmitted, and Based on generating uplink voice data of a second language from the utterance of the first language, a transmission time interval for transmitting uplink voice data of the second language is determined, and Based on the transmission time interval for transmitting uplink voice data of the second language, a transmission power for transmitting uplink voice data of the second language is determined, and Causing to transmit uplink voice data of the second language based on the above transmission power, Electronic device.
2. In Paragraph 1, When the above instructions are executed individually or collectively by the at least one processor, the electronic device, Based on receiving downlink voice data of the second language, voice data of the first language is generated from the downlink voice data of the second language, and Determining an output time interval for outputting a sound according to the voice data of the first language through at least one speaker, and Causing to increase the transmission power based on the output time interval for outputting the sound above, Electronic device.
3. In Paragraph 1, When the above instructions are executed individually or collectively by the at least one processor, the electronic device, Based on the above transmission time interval and a second transmission power greater than the first transmission power, the cumulative transmission power to be used within the time window is determined, and Based on the determination that the above accumulated transmission power is less than the threshold power, the transmission power for transmitting uplink voice data of the second language is increased from the first transmission power to the second transmission power, and, Causing the transmission power to be maintained at the first transmission power for transmitting uplink voice data of the second language, in accordance with the determination that the above accumulated transmission power exceeds the above threshold power. Electronic device.
4. In Paragraph 1, When the above instructions are executed individually or collectively by the at least one processor, the electronic device, By performing STT (speech to text) on the above utterance, text of the first language corresponding to the above utterance is generated, and By performing machine translation (MT) on the text of the first language, the text of the second language is generated, and By performing TTS (text to speech) on the text of the second language, causing to generate the uplink voice data of the second language, Electronic device.
5. In Paragraph 1, When the above instructions are executed individually or collectively by the at least one processor, the electronic device, Within the transmission time interval, a sound according to the uplink voice data of the second language is output through the at least one speaker, and Causing texts corresponding to the uplink voice data of the second language to be displayed through the display within the above transmission time interval, The above texts include the text of the first language and the text of the second language, Electronic device.
6. In Paragraph 1, When the above instructions are executed individually or collectively by the at least one processor, the electronic device, Determining a prediction time interval for receiving downlink voice data of the second language predicted from uplink voice data of the first language using an artificial intelligence model, and Determining a predicted output time interval for outputting a sound according to the voice data of the first language generated based on the downlink voice data of the second language, and Based on the above-mentioned predicted time interval and the above-mentioned predicted output time interval, causing the transmission power to increase from the first transmission power to the second transmission power, Electronic device.
7. In Paragraph 1, When the above instructions are executed individually or collectively by the at least one processor, the electronic device, Receiving downlink voice data of the above second language, and Determining a first prediction time interval for transmitting second uplink voice data of the first language predicted from downlink voice data of the second language using an artificial intelligence model, and Determining a second prediction time interval for transmitting third uplink voice data of the second language corresponding to second uplink voice data of the first language, and Based on the first prediction time interval and the second prediction time interval, causing the transmission power to increase from the first transmission power to the second transmission power, Electronic device.
8. In Paragraph 1, When the above instructions are executed individually or collectively by the at least one processor, the electronic device, Receiving downlink voice data of the above second language, and Responses to downlink voice data of the second language, generated using an artificial intelligence model, are displayed through the display, and Based on identifying user input for a response among the above responses, second uplink voice data of the second language is generated from the response, and Determining a second transmission time interval for transmitting the second uplink voice data of the second language, and Based on the above second transmission time interval, the transmission power is changed, and Based on the above-mentioned changed transmission power, causing the second uplink voice data to be transmitted, Electronic device.
9. In Paragraph 3, The above critical power is configured to satisfy a predefined specific absorption rate (SAR), Electronic device.
10. In Paragraph 1, When the above instructions are executed individually or collectively by the at least one processor, the electronic device, Causing to determine the transmission time interval based on the size of the uplink voice data of the second language, Electronic device.
11. A method performed by an electronic device comprising at least one speaker, at least one microphone, and a communication circuit, The operation of transmitting uplink voice data of a first language generated based on a user's utterance while an application for providing interpretation services for a call performed using the above communication circuit is executed; An operation to determine a transmission time interval for transmitting uplink voice data of the second language based on generating uplink voice data of the second language from the utterance of the first language; An operation to determine a transmission power for transmitting uplink voice data of the second language based on the transmission time interval for transmitting uplink voice data of the second language; and Based on the transmission power, the operation of transmitting uplink voice data of the second language, method.
12. In Paragraph 11, An operation to generate voice data of the first language from the downlink voice data of the second language based on receiving downlink voice data of the second language; An operation to determine an output time interval for outputting a sound according to voice data of the first language through at least one speaker; and Based on the output time interval for outputting the sound, further including an operation to increase the transmission power, method.
13. In paragraph 11, the operation of determining the transmission power is, An operation to determine the cumulative transmission power to be used within a time window based on the above transmission time interval and a second transmission power greater than the first transmission power; An operation to increase the transmission power for transmitting uplink voice data of the second language from the first transmission power to the second transmission power, in accordance with the determination that the accumulated transmission power is less than the threshold power; and Based on the determination that the accumulated transmission power exceeds the threshold power, the operation of maintaining the transmission power for transmitting uplink voice data of the second language at the first transmission power, method.
14. In Paragraph 11, The operation of generating text of the first language corresponding to the utterance by performing STT (speech to text) on the utterance; The operation of generating text of the second language by performing machine translation (MT) on the text of the first language; and The method further includes the operation of generating uplink voice data of the second language by performing text-to-speech (TTS) on the text of the second language. method.
15. In Paragraph 11, The operation of outputting a sound according to the uplink voice data of the second language through the at least one speaker within the transmission time interval; The method further includes the operation of displaying texts corresponding to the uplink voice data of the second language through the display within the transmission time interval. The above texts include the text of the first language and the text of the second language, method.