Dynamic video playing method and device, electronic equipment and medium

By displaying panoramic images captured during dynamic video recording on electronic devices, users can trigger the playback of dynamic videos by selecting image areas, solving the problem of cumbersome operation in existing technologies and achieving the effects of simplified operation and improved user experience.

CN119767091BActive Publication Date: 2026-06-19VIVO MOBILE COMM CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
VIVO MOBILE COMM CO LTD
Filing Date
2024-12-31
Publication Date
2026-06-19

AI Technical Summary

Technical Problem

In the existing technology, the operation process of playing dynamic videos on electronic devices is cumbersome and requires multiple input steps.

Method used

By displaying a panoramic image associated with a dynamic video, which is formed by merging at least two images acquired during the capture of the dynamic video, each image being associated with at least one video clip, the user can trigger playback of the video clip associated with the target image in the dynamic video by selecting the first image region in the panoramic image.

🎯Benefits of technology

The process of playing dynamic videos has been simplified, allowing users to directly trigger the playback of video clips in dynamic videos by selecting a panoramic image area, thus improving the convenience of operation and user experience.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure CN119767091B_ABST
    Figure CN119767091B_ABST
Patent Text Reader

Abstract

This application discloses a dynamic video playback method, apparatus, electronic device, and medium, belonging to the field of camera technology. The method may include: displaying a panoramic image associated with a dynamic video, the panoramic image being obtained by fusing at least two images, the at least two images being images acquired during the shooting of the dynamic video, the dynamic video including at least two video segments, each image being associated with at least one video segment; receiving a first input, the first input being used to select a first image region in the panoramic image, the first image region including image content of at least one target image, the target image being an image from the at least two images; and responding to the first input, playing the first video segment in the dynamic video associated with at least one target image based on the first image region.
Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This application belongs to the field of terminal technology, specifically relating to a dynamic video playback method, device, electronic device, and medium. Background Technology

[0002] With the development of terminal technology, electronic devices are becoming increasingly feature-rich; for example, electronic devices can play dynamic videos.

[0003] For example, if a user needs an electronic device to play a dynamic video, the user can first trigger the electronic device to display the album interface by inputting information, and then the user can click on the video icon in the album interface to trigger the electronic device to play the video; similarly, if a user needs an electronic device to play a video, the user can first find the video icon in the album interface and click on the video icon to trigger the electronic device to play the video.

[0004] However, the above method requires multiple inputs to trigger the electronic device to play dynamic videos, making the process of playing dynamic videos on the electronic device rather cumbersome. Summary of the Invention

[0005] The purpose of this application is to provide a dynamic video playback method, apparatus, electronic device, and medium that can simplify the operation process of playing dynamic videos on electronic devices.

[0006] In a first aspect, embodiments of this application provide a dynamic video playback method, the method comprising: displaying a panoramic image associated with a dynamic video, the panoramic image being obtained by fusing at least two images, the at least two images being images acquired during the shooting of the dynamic video, the dynamic video including at least two video segments, each image being associated with at least one video segment; receiving a first input, the first input being used to select a first image region in the panoramic image, the first image region including image content of at least one target image, the target image being an image among the at least two images; and responding to the first input, playing a first video segment in the dynamic video associated with at least one target image based on the first image region.

[0007] Secondly, embodiments of this application provide a dynamic video playback device, comprising a display module, a receiving module, and a processing module; the display module is used to display a panoramic image associated with a dynamic video, the panoramic image being obtained by fusing at least two images, the at least two images being images acquired during the shooting of the dynamic video, the dynamic video including at least two video segments, each image being associated with at least one video segment; the receiving module is used to receive a first input, the first input being used to select a first image region in the panoramic image, the first image region including image content of at least one target image, the target image being an image among the at least two images; the processing module is used to, in response to the first input received by the receiving module, play a first video segment in the dynamic video associated with at least one target image based on the first image region.

[0008] Thirdly, embodiments of this application provide an electronic device including a processor and a memory, wherein the memory stores programs or instructions executable on the processor, and the programs or instructions, when executed by the processor, implement the steps of the method described in the first aspect.

[0009] Fourthly, embodiments of this application provide a readable storage medium on which a program or instructions are stored, which, when executed by a processor, implement the steps of the method described in the first aspect.

[0010] Fifthly, embodiments of this application provide a chip, the chip including a processor and a communication interface, the communication interface being coupled to the processor, the processor being used to run programs or instructions to implement the method as described in the first aspect.

[0011] In a sixth aspect, embodiments of this application provide a computer program product stored in a storage medium, which is executed by at least one processor to implement the method described in the first aspect.

[0012] In this embodiment, since the electronic device can display a panoramic image associated with a dynamic video, the panoramic image is obtained by fusing at least two images acquired during the shooting of the dynamic video, and each of the at least two images is associated with at least one video segment in the dynamic video, when the user triggers the electronic device to select a first image area in the panoramic image (the first image area includes the image content of at least one target image) through a first input, the electronic device can play the first video segment associated with at least one target image in the dynamic video. That is, by selecting an image area in the panoramic image, the electronic device can be triggered to play at least a portion of the video segments in the dynamic video associated with the panoramic image, thus simplifying the operation process of playing dynamic videos. Attached Figure Description

[0013] Figure 1 This is a flowchart illustrating a dynamic video playback method provided in an embodiment of this application;

[0014] Figure 2A This is a schematic diagram of an interface for a dynamic video playback method provided in an embodiment of this application;

[0015] Figure 2B This is a schematic diagram of an interface for a dynamic video playback method provided in an embodiment of this application;

[0016] Figure 3A This is a schematic diagram of an interface for a dynamic video playback method provided in an embodiment of this application;

[0017] Figure 3B This is a schematic diagram of an interface for a dynamic video playback method provided in an embodiment of this application;

[0018] Figure 3C This is a schematic diagram of an interface for a dynamic video playback method provided in an embodiment of this application;

[0019] Figure 4A This is a schematic diagram of an interface for a dynamic video playback method provided in an embodiment of this application;

[0020] Figure 4B This is a schematic diagram of an interface for a dynamic video playback method provided in an embodiment of this application;

[0021] Figure 5 This is a schematic diagram of the structure of the dynamic video playback device provided in the embodiments of this application;

[0022] Figure 6 This is a schematic diagram of the structure of an electronic device provided in an embodiment of this application;

[0023] Figure 7 This is a schematic diagram of the structure of an electronic device provided in an embodiment of this application. Detailed Implementation

[0024] The technical solutions of the embodiments of this application will be clearly described below with reference to the accompanying drawings. Obviously, the described embodiments are only some, not all, of the embodiments of this application. All other embodiments obtained by those skilled in the art based on the embodiments of this application are within the scope of protection of this application.

[0025] The terms "first," "second," etc., used in the specification and claims of this application are used to distinguish similar objects and not to describe a specific order or sequence. It should be understood that such use of data can be interchanged where appropriate so that embodiments of this application can be implemented in orders other than those illustrated or described herein, and the objects distinguished by "first," "second," etc., are generally of the same class and the number of objects is not limited; for example, a first object can be one or more. Furthermore, in the specification and claims, "and / or" indicates at least one of the connected objects, and the character " / " generally indicates that the preceding and following objects are in an "or" relationship.

[0026] The following description, in conjunction with the accompanying drawings, details the imaging methods, apparatus, electronic devices, and media provided in this application through specific embodiments and application scenarios.

[0027] The dynamic video playback method provided in this application can be executed by an electronic device, or at least one of the functional modules and entity modules within the electronic device capable of implementing the dynamic video playback method. The specific implementation can be determined based on actual usage requirements, and this application does not impose any limitations. This application uses an electronic device executing the dynamic video playback method as an example to illustrate the dynamic video playback method provided in this application.

[0028] This application provides a method for playing dynamic videos. Figure 1 A flowchart illustrating the dynamic video playback method provided in an embodiment of this application is shown, as follows: Figure 1 As shown, the dynamic video playback method provided in this application embodiment may include the following steps 101 to 103.

[0029] Step 101: The electronic device displays a panoramic image associated with dynamic video.

[0030] The panoramic image is obtained by fusing at least two images, which are acquired during the shooting of a dynamic video. The dynamic video includes at least two video segments, and each image is associated with at least one video segment.

[0031] Among them, at least one of the above video segments is included in the above-mentioned video segments.

[0032] In some embodiments of this application, the aforementioned at least one video segment can be any of the following: video segments with consecutive playback times, or video segments with matching content (such as including the same object).

[0033] In some embodiments of this application, the electronic device's dynamic video and the aforementioned at least two images can be images captured by the electronic device through the same camera.

[0034] In some embodiments of this application, the above-mentioned at least two images are captured during the process of the electronic device capturing dynamic video. Specifically, the electronic device can determine a portion of the video frames during the process of capturing dynamic video as at least two images.

[0035] In some embodiments of this application, each of the at least two images mentioned above corresponds to a video frame in a dynamic video.

[0036] In some embodiments of this application, the association of an image with at least one video segment can be any of the following: the image identifier of the image is associated with the segment identifier of the at least one video segment, or the image identifier of the image is associated with the identifier of a key video frame in the at least one video segment.

[0037] Among them, the aforementioned key video frames are at least one of the following:

[0038] A video frame in a video clip whose image content matches the image content of that image;

[0039] Video frames captured at the same time as the image;

[0040] The video frame with the middle playback time in a video clip. For example, if the duration of a video clip is A seconds (s), then the key video frame can be the video frame corresponding to A / 2 seconds.

[0041] The centered video frame in a video clip. For example, if a video clip consists of 51 video frames, the key video frame could be the 26th video frame.

[0042] In some embodiments of this application, the shooting angles of at least two of the above images are different.

[0043] Step 102: The electronic device receives the user's first input.

[0044] Step 103: In response to the first input, the electronic device plays a first video segment in the dynamic video associated with at least one target image, based on the first image region.

[0045] The first input can be used to select a first image region in the panoramic image. The first image region includes the image content of at least one target image, which can be one of the at least two images mentioned above.

[0046] In some embodiments of this application, "the first image region includes the image content of at least one target image" may include: for each of the at least one target image, the first image region includes at least a portion of the image content of one target image.

[0047] For example, the first image region may include all the image content of the i-th target image and a portion of the image content of the j-th target image, where i and j are positive integers.

[0048] In some embodiments of this application, if the first image region includes the image content of a target image, the electronic device can play at least one video segment in the dynamic video associated with the target image.

[0049] If the first image region includes the image content of at least two target images, the electronic device can continuously play the video segments associated with the at least two target images in the playback order of the video segments associated with the at least two target images.

[0050] In some embodiments of this application, the aforementioned first input includes, but is not limited to: touch input by the user through a touch device such as a finger or stylus, or voice commands input by the user, or specific gestures input by the user, or other feasible inputs. The specific input can be determined according to actual usage needs, and this application does not limit it.

[0051] In some embodiments of this application, the aforementioned specific gesture can be any one of the following: a single-click gesture, a swipe gesture, a drag gesture, a pressure-recognition gesture, a two-finger spread gesture, a two-finger pinch gesture, a long-press gesture, an area change gesture, a double-press gesture, or a double-click gesture.

[0052] In some embodiments of this application, the aforementioned touch input can be click input, long press input, hard press input, two-finger spread input, swipe input, two-finger pinch input, or any possible touch input, and this application embodiment does not limit it. The click input can be a single click, double click, or any number of clicks.

[0053] In some embodiments of this application, step 103 can be implemented by steps 103A and 103B as described below.

[0054] Step 103A: The electronic device responds to the first input and updates the display area of ​​the first image according to the input parameters of the first input.

[0055] In some embodiments of this application, the input parameters of the first input may include, but are not limited to: input location, input trajectory, input time, and input object.

[0056] In some embodiments of this application, updating the display of the first image area includes at least one of the following: updating the display size of the first image area (method 1), and rotating the display angle of the first image area (method 2).

[0057] The method of updating the display size of the first image area can be either to enlarge or shrink the display of the first image area.

[0058] In some embodiments of this application, in method 1, the first input can be any of the following:

[0059] (1) Update the input for the display range of the panoramic image;

[0060] (2) Adjust the input of the display parameters of the range selection box displayed on the panoramic image.

[0061] In some embodiments of this application, under normal circumstances, the electronic device can display the entire image area of ​​a panoramic image. In (1) above, the user can trigger the electronic device to reduce the display range of the panoramic image through a first input, thereby enlarging the display size of the first image area in the panoramic image. It can be understood that the first image area is the display area of ​​the panoramic image after adjusting the display range.

[0062] In some embodiments of this application, in (1) above, the first input may include, but is not limited to, any of the following: double-click input by the user on the panoramic image, input by the user spreading two fingers on the panoramic image, swipe input by the user on the panoramic image, preset gesture input or preset voice input by the user to trigger the zooming of the display size of the panoramic image, etc.

[0063] For example, such as Figure 2A As shown, by double-clicking on the panoramic image 20, the electronic device can reduce the display area of ​​the panoramic image, and based on the reduced display area, such as... Figure 2B As shown, the area double-clicked by the user is magnified, which is a portion of the panoramic image, also known as the first image area 22. In other words, the electronic device updates the display of the first image area based on the input position of the first input. It can be seen that before the user double-clicks, the electronic device displays the entire panoramic image area; after the user double-clicks, the electronic device magnifies and displays the double-clicked area within the panoramic image.

[0064] It's understandable that when an electronic device adjusts the display range of a panoramic image, the display ratio of the image area also changes. For example, if a user double-clicks on a panoramic image, the electronic device can display the double-clicked image area at a 16:9 aspect ratio.

[0065] In some embodiments of this application, when the first input is a double-click input, the electronic device can determine the image area within a preset range centered on the input position of the first input as the first image area.

[0066] In some embodiments of this application, when the first input is an input of two fingers spread apart, the electronic device can determine the first image region based on the input trajectory of the first input. For example, the image region located between the two fingers in the panoramic image can be determined as the first image region.

[0067] In some embodiments of this application, in mode 2, the first input can be: input from a user sliding clockwise or counterclockwise on a panoramic image.

[0068] For example, if a user slides 30 degrees counterclockwise on the first image area of ​​a panoramic image, the electronic device can respond to the first input and rotate the first image area counterclockwise by 30 degrees according to the angle corresponding to the input trajectory of the first input.

[0069] In some embodiments of this application, in the above (2), step 102 can be implemented by step 102A, and step 103A can be implemented by step 103A1.

[0070] Step 102A: When a range selection box is displayed on the panoramic image, the electronic device receives the user's first input to the range selection box.

[0071] Step 103A1: The electronic device responds to the first input and, based on the input parameters of the first input, magnifies the first image area selected by the display range selection box in the panoramic image.

[0072] For further description of the input parameters of the first input, please refer to the relevant descriptions in the above embodiments.

[0073] In some embodiments of this application, when an electronic device displays a panoramic image, the electronic device can display a range selection box on the panoramic image. The user can then perform a first input on the range selection box to trigger the electronic device to adjust the display parameters of the range selection box. The electronic device can then determine the image area located within the adjusted range selection box as the first image area.

[0074] In some embodiments of this application, the display parameters of the range selection box may include at least one of the following: display position, display size, and aspect ratio.

[0075] In some embodiments of this application, the user moves the range selection box to adjust its display position.

[0076] In some embodiments of this application, the user can drag a border or a top corner of the range to adjust the display size and aspect ratio of the range selection box.

[0077] In some embodiments of this application, the electronic device displays a range selection box before, such as Figure 2A As shown, the user can first perform an input on the panoramic image 20, such as clicking "Input," and then... Figure 3A As shown, the electronic device can display a range selection box 21 on the panoramic image 20, allowing the user to drag the right border of the range selection box 21 to the right, i.e., the first input, as shown. Figure 3B As shown, the electronic device can increase the display size of the range selection box 21, thereby allowing the electronic device to... Figure 3B The image area within the range selection box 21 shown is defined as the first image area 22, and, as Figure 2B As shown, the electronic device can magnify and display the first image area 22.

[0078] It is understandable that since electronic devices can magnify the first image area, users can confirm whether the selected image area in the panoramic image meets the requirements based on the magnified first image area, thus improving human-computer interaction performance and user experience.

[0079] Thus, since users can trigger the electronic device to select and zoom in on a first image area by making a preliminary input to a selection box on the panoramic image, it facilitates visual selection of the image area, thereby improving the accuracy of image area selection. Furthermore, after selecting the first image area, the ability to zoom in allows users to confirm the selection, further enhancing the user experience and improving human-computer interaction performance. In this way, both the accuracy and the overall selection experience of the image area can be improved.

[0080] In some embodiments of this application, in (iii) above, the electronic device may, in response to a first input, determine the image area traversed by the sliding trajectory of the sliding input in the panoramic image as a first image area, and enlarge and display the first image area when the first input is a sliding input by a user on a panoramic image.

[0081] Step 103B: The electronic device plays a first video segment in the dynamic video that corresponds to at least one target image, based on the updated first image region.

[0082] In some embodiments of this application, the electronic device may play a first video segment corresponding to at least one target image in a dynamic video after receiving confirmation input from a user regarding the updated first image region.

[0083] In some embodiments of this application, the aforementioned confirmation input includes, but is not limited to: touch input by the user using a touch device such as a finger or stylus on the updated first image area, or voice commands input by the user, or specific gestures input by the user, or other feasible inputs. The specific input can be determined according to actual usage needs, and this application embodiment does not impose limitations.

[0084] In some embodiments of this application, the specific gesture mentioned above can be any one of a single-click gesture, a swipe gesture, a drag gesture, a pressure recognition gesture, a long-press gesture, an area change gesture, a double-press gesture, or a double-tap gesture.

[0085] In some embodiments of this application, the aforementioned touch input can be click input, long press input, hard press input, swipe input, two-finger pinch input, or any possible touch input, and this application embodiment does not limit it. The click input can be a single click, double click, or any number of clicks.

[0086] For example, such as Figure 2B As shown, the user can long-press on the magnified first image area 22 to confirm the input, thus... Figure 3C As shown, the electronic device can display a dynamic video playback interface 23, that is, play a first video segment in the dynamic video that corresponds to at least one target image.

[0087] In some embodiments of this application, the electronic device can automatically play a first video segment corresponding to at least one target image in a dynamic video after a preset time period following the update and display of the first image area.

[0088] For example, after enlarging and displaying the first image area, if no user input is received within a preset time period, the electronic device determines that the user is satisfied with the selected image area, and can then automatically play the first video segment in the dynamic video that corresponds to at least one target image.

[0089] In some embodiments of this application, if a user is not satisfied with the first image area currently displayed by the electronic device, the user can execute a cancel input to trigger the electronic device to re-display the entire image area of ​​the panoramic image. Then, the user can re-execute the first input to reselect the first image area until the user is satisfied with the selected first image area. Then, based on the updated first image area that satisfies the user, the first video segment corresponding to at least one target image in the dynamic video is played.

[0090] It is understandable that the purpose of updating the display of the first image area based on the input parameters of the first input is to highlight the first image area so that the user can confirm whether the selected first image area meets the user's needs.

[0091] Thus, when a user triggers the electronic device to select a first image region in the panoramic image through the first input, the electronic device can update the display of the first image region first. Based on the updated first image region, the user can determine whether the image region selected by the electronic device meets the user's expectations or requirements. This ensures that the video segment played by the electronic device meets the user's playback needs, thereby improving the accuracy of playing video segments in dynamic videos.

[0092] In some embodiments of this application, step 103 may include steps 103C to 103E.

[0093] Step 103C: In response to the first input, the electronic device determines at least one video frame in the dynamic video that is associated with at least one target image, based on the first association relationship.

[0094] The first association relationship can be the association relationship between an image in the above at least two images and a video frame in the above dynamic video, and the video segment associated with the image is determined by the video frame associated with the image.

[0095] In some embodiments of this application, at least one target image corresponds one-to-one with at least one video frame.

[0096] In some embodiments of this application, the first association relationship can be a list of relationships between the identifiers of at least two images and the frame numbers or cache identifiers of video frames in a dynamic video, such as cache timestamps.

[0097] In some embodiments of this application, determining the associated video segment from the associated video frame may include: assuming an image (hereinafter referred to as image 1) is associated with video frame a in a dynamic video, then:

[0098] 1) An electronic device can identify a video segment consisting of video frame a and at least one video frame in the dynamic video preceding video frame a as the video segment associated with the image;

[0099] 2) An electronic device can identify a video segment consisting of video frame a and at least one video frame in the dynamic video that is located after video frame a as the video segment associated with the image.

[0100] 3) An electronic device may identify a video segment consisting of video frame a, at least one video frame preceding video frame a, and at least one video frame following video frame a as the video segment associated with the image.

[0101] For example, an electronic device can identify a video segment in a dynamic video whose playback time falls within a first playback time range as an image-associated video segment. Specifically, the first playback time range includes at least one of the following: a playback time range extending forward for a third preset duration from the playback time of the image-associated video frame (e.g., video frame a); or a playback time range extending backward for a fourth preset duration from the playback time of video frame a.

[0102] The third preset duration and the fourth preset duration can be the same or different.

[0103] For example, the third preset duration can be 1.5s, and the fourth preset duration can be 1.2s.

[0104] For example, the third preset duration can be 1.5s, and the fourth preset duration can be 1.5s.

[0105] It should be noted that the above embodiment illustrates the first association relationship between an image and a video frame in a dynamic video, and plays at least one video frame in the dynamic video that is associated with at least one target image.

[0106] In practice, since a panoramic image is obtained by fusing at least two images, each image region in the panoramic image includes at least a portion of the image content of a video frame in the dynamic video. This allows for the direct establishment of a correlation between the image regions in the panoramic image and the video frames in the dynamic video, enabling the subsequent playback of the corresponding video segments in the dynamic video based on the user-selected image regions more directly and quickly.

[0107] For example, assuming the first image region includes N fusion regions, the first image region corresponds to N+1 video frames in the dynamic video. Here, the fusion region is the fused image part between two adjacent images, and one fusion region corresponds to two images. Thus, one fusion region is associated with two video frames in the dynamic video, and N can be a positive integer.

[0108] For example, suppose the panoramic image is formed by sequentially fusing image 1, image 2, image 3, and image 4, and image 1 corresponds to video frame a1, image 2 corresponds to video frame a2, image 3 corresponds to video frame a3, and image 4 corresponds to video frame a4, then:

[0109] If the first image region includes the fused image region between image 1 and image 2, the electronic device can determine that the first image region is associated with video frame a1 and video frame a2 in the dynamic video.

[0110] If the first image region includes the image region corresponding to image 1 in the panoramic image, but does not include the fused region, then the electronic device can determine that the first image region is associated with video frame a1.

[0111] Step 103D: The electronic device determines the playback time period based on the playback time of the first video frame and the playback time of the second video frame.

[0112] Step 103E: The first video segment whose playback time falls within the playback time period in the dynamic video played by the electronic device.

[0113] Among them, the first video frame and the second video frame are the video frames with the smallest and largest frame numbers, respectively, among the at least one video frame mentioned above.

[0114] In some embodiments of this application, when the number of video frames associated with the first image region is greater than 1, the electronic device can determine the start time of the playback period based on the video frame with the smallest frame number among these video frames, and determine the end time of the playback period based on the video frame with the largest frame number. Thus, the electronic device can play a continuous video segment from a dynamic video based on the first image region selected by the user.

[0115] In some embodiments of this application, the aforementioned playback time period is a relative time period. Specifically, the playback time period is a relative time period with reference to the total duration of the dynamic video.

[0116] For example, assuming the first video frame plays at the 2nd second and the second video frame plays at the 2.5th second, the electronic device can determine the playback time period as the 2nd to 2.5th seconds of the playback duration corresponding to the dynamic video.

[0117] It can be understood that when the number of video frames in at least one of the aforementioned video frames is 1, the first video frame and the second video frame are the same video frame. When the number of video frames in at least one of the aforementioned video frames is greater than 1, the first video frame and the second video frame are different.

[0118] In some embodiments of this application, the electronic device can determine the playback time period based on the playback time of the first video frame, the playback time of the second video frame, the third preset duration, and the fourth preset duration.

[0119] For example, the electronic device can determine the start time of the playback period as: the time corresponding to the first preset duration before the playback time of the first video frame, that is, the start time of the playback period = the playback time of the first video frame - the third preset duration; and the electronic device can determine the end time of the playback period as: the time corresponding to the first preset duration after the playback time of the second video frame, that is, the end time of the playback period = the playback time of the second video frame + the fourth preset duration.

[0120] In some embodiments of this application, the electronic device may also determine the playback time period based on any one of the playback time of the first video frame, the playback time of the second video frame, the third preset duration, and the fourth preset duration.

[0121] For example, the electronic device determines the start time of the playback period based on the playback time of the first video frame and the third preset duration, and determines the playback time of the second video frame as the end time of the playback period.

[0122] For example, the electronic device determines the start time of the playback period based on the playback time of the first video frame, and determines the end time of the playback period based on the playback time of the second video frame and the fourth preset duration.

[0123] Thus, since at least one video frame corresponding to at least one target image in the video dynamic can be determined based on the first correlation, and a playback time period can be determined based on the earliest and latest playback times corresponding to the at least one video frame, and video segments in the dynamic video whose playback time is within the playback time period can be played, the accuracy of playing video segments in the dynamic video can be improved.

[0124] Thus, since users can trigger electronic devices to select the image region in the panoramic image corresponding to the input by inputting the panoramic image, and play specific video segments in the dynamic video according to the correlation between the target image corresponding to the image region in the panoramic image and the video frames in the dynamic video, it is possible to achieve joint playback of dynamic video and panoramic image, thereby improving playback flexibility.

[0125] The dynamic video playback method provided in the embodiments of this application will be described below with reference to examples.

[0126] For example, such as Figure 2A As shown, the electronic device displays a panoramic image 20, which the user can click on, and then... Figure 3A As shown, the electronic device can display a range selection box 21 on the panoramic image 20, allowing the user to drag the right border of the range selection box 21 to the right, i.e., a second input, such as... Figure 3B As shown, the electronic device can increase the display size of the range selection box 21. Then, the electronic device can define the image area within the resized range selection box as the first image area, and as shown... Figure 2B As shown, the electronic device can magnify and display the first image area 22. Based on the magnified first image area and the first correlation relationship, the electronic device can determine video frames 1, 2, and 3 in the dynamic video, with playback times of T2, T3, and T4 respectively. Therefore, the electronic device can select the time period for playing the dynamic video as T2-1.5s to T4+1.5s.

[0127] Then, the electronic device can switch from the panoramic image display interface to... Figure 3C The dynamic video playback interface shown in Figure 23.

[0128] Then, in one approach, the electronic device can automatically start playing the dynamic video from time T2-1.5s and end the dynamic video playback at time T4+1.5s, thereby displaying the effect of dynamic panoramic image display and realizing panoramic dynamic image.

[0129] Or, in another way, such as Figure 3CAs shown, the electronic device can start playing dynamic video from time T2-1.5s after the user presses and holds the playback interface 23, and end the dynamic video playback from time T4+1.5s, thereby displaying the effect of dynamic panoramic image display and realizing panoramic dynamic image.

[0130] Furthermore, after the electronic device finishes playing, such as Figure 2A As shown, the electronic device can re-display the panoramic image 20.

[0131] Thus, when a user views a panoramic image, they can select a portion of the image area using a range adjustment box, triggering the electronic device to play a video segment corresponding to that portion of the image area in a dynamic video. This enhances the dynamic interaction between the video and the panoramic image, increasing the viewing experience.

[0132] For example, such as Figure 2A As shown, the electronic device displays a panoramic image 20. The user can double-click on the first image area of ​​the panoramic image 20 to reduce the display area of ​​the panoramic image, thus... Figure 2B As shown, the first image area 22 double-clicked by the user is magnified. Based on the magnified first image area and the first correlation, the electronic device can determine video frames 1, 2, and 3 in the dynamic video. The playback times of these three video frames are T2, T3, and T4, respectively. Therefore, the electronic device can select the time period for playing the dynamic video as T2-1.5s to T4+1.5s.

[0133] Then, the electronic device can switch from the panoramic image display interface to... Figure 3C The dynamic video playback interface shown in Figure 23.

[0134] Then, in one approach, the electronic device can automatically start playing the dynamic video from time T2-1.5s and end the dynamic video playback at time T4+1.5s, thereby displaying the effect of dynamic panoramic image display and realizing panoramic dynamic image.

[0135] Or, in another way, such as Figure 3C As shown, the electronic device can start playing dynamic video from time T2-1.5s after the user presses and holds the playback interface 23, and end the dynamic video playback from time T4+1.5s, thereby displaying the effect of dynamic panoramic image display and realizing panoramic dynamic image.

[0136] Furthermore, after the electronic device finishes playing, such as Figure 2A As shown, the electronic device can re-display the panoramic image 20.

[0137] Thus, when a user views a panoramic image, they can adjust the display range of the panoramic image or increase the display size of a certain image area to trigger the electronic device to select an image area within the adjusted display range of the panoramic image and play the video segment corresponding to that image area in the dynamic video. This can improve the dynamic linkage between the video and the panoramic image and enhance the viewing experience.

[0138] Furthermore, users can select the area from which to play dynamic videos by zooming in on the panoramic image, which is more in line with user habits and improves the ease of operation.

[0139] In the dynamic video playback method provided in this application embodiment, since the electronic device can display a panoramic image associated with a dynamic video, the panoramic image is obtained by fusing at least two images acquired during the shooting process of the dynamic video, and each of the at least two images is associated with at least one video segment in the dynamic video, when the user triggers the electronic device to select a first image area in the panoramic image (the first image area includes the image content of at least one target image) through a first input, the electronic device can play the first video segment associated with at least one target image in the dynamic video. That is, by selecting an image area in the panoramic image, the electronic device can be triggered to play at least a portion of the video segments in the dynamic video associated with the panoramic image, thus simplifying the operation process of playing dynamic videos.

[0140] In some embodiments of this application, the dynamic video playback method provided in this application may further include the following step 104.

[0141] Step 104: During the playback of the first video segment, the electronic device adjusts the display range of the panoramic image according to the playback progress of the first video segment.

[0142] In some embodiments of this application, the electronic device can adjust the display range of the panoramic image based on the first association relationship and the playback progress of the first video segment. For a description of the first association relationship, please refer to the relevant descriptions in the above embodiments; to avoid repetition, they will not be repeated here.

[0143] In some embodiments of this application, the electronic device can display the playback screen of the first video segment and a portion of the panoramic image area in different regions during the playback of the first video segment, and adjust the display range of the panoramic image according to the playback progress of the first video segment and the first correlation relationship, so that the display range of the panoramic image is consistent with the playback progress of the first video segment.

[0144] In some embodiments of this application, the electronic device can display the playback screen of the first video segment and the display range of the panoramic image with the same aspect ratio.

[0145] In some embodiments of this application, a user can trigger an electronic device to simultaneously display a portion of the panoramic image and a playback interface of a first video segment by inputting an identifier for the panoramic image; or, a user can trigger an electronic device to simultaneously display a portion of the panoramic image and a playback screen of a dynamic video by inputting an identifier for the dynamic video.

[0146] The dynamic video playback method provided in the embodiments of this application will be described below with reference to the accompanying drawings.

[0147] For example, during the playback of the first video segment, such as Figure 4A As shown, the electronic device can simultaneously display a first image region 40 in a panoramic image and a playback interface 41 for a first video segment. The playback interface 41 includes a progress indicator 42 indicating the playback progress of the dynamic video. Figure 4A As shown, the user can input the playback indicator 43 in the playback interface 41, such as... Figure 4B As shown, the electronic device can play a first video segment and, based on the playback progress of the first video segment, determine the frame number of at least one video frame corresponding to the first image region. Then, it determines whether these frame numbers include the frame number of the currently playing video frame of the first video segment. If not, and it is determined that the image region matching the currently playing video frame is located to the right of the current display range of the panoramic image, then... Figure 4B As shown, the electronic device automatically slides the panoramic image to the left until the displayed image area corresponds to the frame number of the currently playing video frame of the first video segment, thus achieving synchronized playback of the panoramic image and the dynamic video. This allows users to simultaneously watch video segments from the dynamic video and view the panoramic image, further enhancing the playback flexibility and effect of the electronic device. It should be noted that... Figure 4B The arrow in the image is only used to indicate the direction of updating the display range of the panoramic image. In actual implementation, the arrow may not be displayed or may not be visible to the user.

[0148] Thus, as electronic devices can adjust the display range of the panoramic image according to the playback progress of video clips in dynamic videos, users can view the corresponding image area in the panoramic image while watching the dynamic video, thereby improving playback efficiency, entertainment, and convenience.

[0149] In some embodiments of this application, before step 101 above, the dynamic video playback method provided in the embodiments of this application may further include steps 105 to 107 below.

[0150] Step 105: When the electronic device displays the panoramic preview interface, it caches the preview video frames and the corresponding audio data in the panoramic preview interface.

[0151] In some embodiments of this application, the panoramic preview interface can also be called the panoramic shooting preview interface; the two terms have the same meaning and can be used interchangeably. Both refer to the preview interface displayed when the electronic device is in panoramic shooting mode.

[0152] In some embodiments of this application, the electronic device can collect audio data through its microphone, and then the electronic device can buffer the audio data collected by the microphone.

[0153] For example, when an electronic device displays a panoramic preview interface, the user can narrate the shooting scene. The electronic device can capture the user's narration audio through a microphone and cache the narration audio and the preview video frames in the panoramic preview interface.

[0154] Step 106: During the process of caching preview video frames and corresponding audio data, the electronic device acquires at least two images.

[0155] In some embodiments of this application, the electronic device may begin acquiring at least two images after a fifth preset time period of displaying the panoramic preview interface; or, the electronic device may begin acquiring at least two images after receiving the user's shooting input, such as the second input described below.

[0156] It is understandable that the update frame rate of the preview video frames in the panoramic preview interface is greater than the acquisition frame rate of the electronic device when acquiring at least two images.

[0157] For example, the update frame rate of the preview video frames in the panoramic preview interface is 30 frames per second, and the electronic device can acquire images at a frequency of 2 images per second.

[0158] It should be noted that the panoramic preview interface and at least two images correspond to the same camera; that is, the preview video frames cached by the electronic device and the aforementioned at least two images were captured by the same camera. It can be understood that each of these at least two images corresponds to one preview video frame cached by the electronic device.

[0159] For example, taking the display of a panoramic preview interface corresponding to the front-facing camera on an electronic device as an example, the electronic device can display the preview video frame in the panoramic preview interface for each preview video frame captured by the front-facing camera. Suppose that when the front-facing camera captures the i-th preview video frame, the acquisition time of the j-th image in at least two images has been reached, then: on the one hand, the electronic device can cache the i-th preview video frame and display the i-th preview video frame in the panoramic preview interface; on the other hand, the electronic device can use the i-th preview video frame as the j-th image in at least two images, where j is less than i, and i and j are both positive integers.

[0160] In some embodiments of this application, the duration of the electronic device caching preview video frames and corresponding audio data is greater than or equal to the target duration, where the target duration is the time difference between the time when the electronic device acquires the first image and the time when it acquires the last image.

[0161] It is understandable that during the process of acquiring at least two images, the electronic device can continuously cache preview video frames and corresponding audio data to make a more comprehensive video recording of the panoramic image shooting scene.

[0162] Step 107: The electronic device generates a dynamic video based on the preview video frames and corresponding audio data cached within the first time period, and performs image fusion processing on at least two images to obtain a panoramic image.

[0163] The first time period can be determined by the acquisition time periods of at least two images.

[0164] In some embodiments of this application, the electronic device can encode the cached preview video frames and corresponding audio data in chronological order according to the cached preview video frames and corresponding audio data within a first time period to obtain dynamic video.

[0165] In some embodiments of this application, the electronic device generates dynamic video based on preview video frames cached within a first time period and corresponding audio data, which may include:

[0166] The electronic device encodes the preview video frames and corresponding audio data cached between the second and third time points to generate dynamic video.

[0167] The second time point is the time point corresponding to the first preset duration before the first time point, and the third time point is the time point corresponding to the second preset duration after the acquisition end time of the above at least two images.

[0168] In some embodiments of this application, the second preset duration can be any possible duration such as 1 second, 2 seconds, 3 seconds or 1.5 seconds.

[0169] In some embodiments of this application, the first preset duration and the second preset duration may be the same or different. For example, taking the first preset duration and the second preset duration as the same, the first preset duration and the second preset duration may be 1.5 seconds.

[0170] In some embodiments of this application, the electronic device may perform image fusion processing on at least two images in the order of their acquisition time to obtain a panoramic image.

[0171] It is understood that an electronic device can perform image fusion processing on at least two images based on the shooting time and image content of at least two images to obtain a panoramic image.

[0172] In some embodiments of this application, the electronic device may encode the cached audio data and preview video frames during the process of caching preview video frames and corresponding audio data; or, the cached audio data and preview video frames may be encoded after the caching of audio data and preview video frames has ended.

[0173] This is understandable; it can be viewed as a dynamic video of the panoramic image shooting scene.

[0174] In some embodiments of this application, after the electronic device obtains the first video and the panoramic image, it can establish a first association relationship between the image region in the panoramic image and the video frame in the first video based on the correspondence between the first image and the preview image.

[0175] For example, an electronic device can establish a list of correspondences between region identifiers in a panoramic image and frame numbers in a first video to obtain a first association.

[0176] In some embodiments of this application, the region identifier of the image region can be identified by the cache sequence number of the preview image corresponding to the image region, such as a cache timestamp.

[0177] Thus, when the electronic device displays a panoramic preview interface, it can cache preview video frames and corresponding audio data, and acquire at least two images during the caching process. It can also generate dynamic video based on the cached preview video frames and corresponding audio data within the first time period, and perform image fusion processing on the acquired at least two images to obtain a panoramic image. In other words, the electronic device can simultaneously capture dynamic video and panoramic images within the same time period, thereby improving shooting efficiency.

[0178] In some embodiments of this application, before step 106 above, the dynamic video playback method provided in the embodiments of this application may further include the following step 108, and the above step 106 may be implemented by the following step 106A.

[0179] Step 108: During the process of caching preview video frames and corresponding audio data, the electronic device receives a second input from the user to the shooting controls in the panoramic preview interface.

[0180] Step 106A: In response to the second input, the electronic device selects at least two images from the preview video frames cached after the first time point.

[0181] The first time point is the input time point of the second input.

[0182] In some embodiments of this application, after receiving the second input, the electronic device can use the most recently captured preview video frame as the first image and detect the shooting angle. After detecting a change in the shooting angle by a preset angle, the electronic device can select the second image, and so on, until the electronic device selects the last image out of at least two images. Of course, the electronic device can also periodically acquire images until it obtains at least two images.

[0183] It is understandable that after the user makes a second input to the shooting control, the user can continuously adjust the shooting angle of the electronic device so that the electronic device can select images from different shooting angles.

[0184] It is understandable that the number of images, at least two, can be preset or determined by user input.

[0185] For example, an electronic device can end the image selection operation after receiving a third input from the user to the shooting control.

[0186] In some embodiments of this application, the electronic device ends the buffering of preview video frames later than the time when the electronic device selects the last image, for example, later by a second preset duration. Thus, since the electronic device can end the buffering of audio data and preview video frames a second preset duration after completing image selection—meaning that the electronic device buffers preview video frames and audio data for a period of time after completing image selection—the video content covered in the dynamic video can be richer than the image content in the panoramic image, thereby enabling the dynamic video to more completely reproduce the shooting scene of the panoramic image.

[0187] In some embodiments of this application, the aforementioned second input includes, but is not limited to: touch input of the user to the shooting control via a touch device such as a finger or stylus, or voice commands input by the user, or specific gestures input by the user, or other feasible inputs. The specific input can be determined according to actual usage needs, and this application does not limit it.

[0188] In some embodiments of this application, the specific gesture mentioned above can be any one of a single-click gesture, a swipe gesture, a drag gesture, a pressure recognition gesture, a long-press gesture, an area change gesture, a double-press gesture, or a double-tap gesture.

[0189] In some embodiments of this application, the touch input can be a click input, a long press input, a hard press input, or any possible touch input, and this application does not limit the scope of the embodiments. Specifically, the click input can be a single click, a double click, or any number of clicks.

[0190] In some embodiments of this application, the electronic device may end the shooting after capturing a preset number of first images, or the electronic device may end the image selection operation after receiving user input, such as the user inputting again into the shooting control.

[0191] In some embodiments of this application, the electronic device stops previewing the video frame and the corresponding audio data after a second preset time period following the selection of at least two images.

[0192] Thus, since the electronic device can capture images after receiving the user's shooting input, i.e., the second input, to obtain at least two images, it can ensure that at least two images captured by the electronic device meet the user's shooting requirements, thereby improving shooting accuracy.

[0193] In some embodiments of this application, before step 107 above, the dynamic video playback method provided in the embodiments of this application may further include the following step 109.

[0194] Step 109: The electronic device determines the time period between the second time point and the third time point as the first time period.

[0195] The second time point is the time point corresponding to the first preset duration before the first time point, and the third time point is the time point corresponding to the second preset duration after the acquisition end time of the above at least two images.

[0196] For a description of the second time point, please refer to the relevant description of the second time point in the above embodiments. To avoid repetition, it will not be repeated here.

[0197] It is understandable that the second time point is earlier than the input time point of the first input by a preset duration.

[0198] For example, assuming the input time of the first input is t1 and the first preset duration is T0, then the second time point is: t1-T0.

[0199] In some embodiments of this application, the electronic device can begin encoding the preview video frames and audio data cached after the second time point after receiving the second input; that is, encoding is performed during the caching of the preview video frames. Thus, compared to a scheme where encoding is performed after caching ends, encoding during the caching of preview images and audio data can shorten the time required for the electronic device to capture dynamic video.

[0200] In some embodiments of this application, the first preset duration can be any possible duration such as 1s, 1.5s, 2s or 3s.

[0201] It is understandable that electronic devices can discard preview video frames and corresponding audio data cached outside the first time period.

[0202] In some embodiments of this application, after the electronic device enters the panoramic image mode, i.e., displays the panoramic preview interface, the electronic device can cache the preview image and audio data in the background for a first preset duration in real time. Specifically, when the caching duration exceeds the first preset duration, the electronic device can retain the audio data and preview image cached within the most recent first preset duration and discard the remaining audio data and preview image until the electronic device receives the second input.

[0203] The dynamic video playback method provided in the embodiments of this application will be described exemplarily below with reference to the accompanying drawings.

[0204] For example, after the electronic device displays the panoramic preview interface, it caches the preview video frames and audio data in real time. Then, at time t5, the user clicks the shooting control in the panoramic preview interface, allowing the electronic device to intermittently select images, record the selection time, and maintain the cached preview video frames and audio data. The cached preview video frames and audio data after t5-1.5s are then encoded. It can be understood that the cached preview video frames and audio data during image selection also need to be encoded in real time.

[0205] After shooting for a period of time, if the user clicks the shooting control again, the electronic device can end the image selection operation, indicating that at least two images have been selected.

[0206] Furthermore, after the user clicks the shooting control again, the electronic device continues to cache 1.5 seconds of preview video frames and corresponding audio data, and encodes them to ultimately generate a dynamic video containing a 1.5-second video segment before image selection begins, a video segment during image selection, and a 1.5-second video segment after selection ends. Thus, since panoramic images can be captured while the dynamic video is being shot, the shooting scene corresponding to the panoramic image can be reconstructed from the dynamic video, thereby improving shooting efficiency and adding fun to the shooting experience.

[0207] Thus, since electronic devices can determine the time period between the second and third time points as the encoding range corresponding to the dynamic video, it can ensure that the video content of the dynamic video includes, but is not limited to, the image content in the panoramic image. This allows the video content covered in the dynamic video to be richer than the image content in the panoramic image, thereby enabling the dynamic video to more completely record or restore the shooting scene of the panoramic image.

[0208] It should be noted that the above embodiments are illustrated using the example of an electronic device in panoramic shooting mode, i.e., displaying a panoramic preview image. In actual implementation, the electronic device can cache preview video frames and corresponding audio data during the process of shooting a large-view image with a certain shooting process. Thus, while completing the shooting of the large-view image, a dynamic video is generated to record the shooting scene corresponding to the large-view image, thereby improving shooting efficiency and shooting effect and increasing the fun.

[0209] The dynamic video playback method provided in this application can be executed by a dynamic video playback device. This application uses a dynamic video playback device executing the dynamic video playback method as an example to illustrate the dynamic video playback device provided in this application.

[0210] This application provides a dynamic video playback device. Figure 5 A schematic diagram of the structure of the dynamic video playback device is shown, as follows: Figure 5 As shown, the dynamic video playback device 500 may include a display module 501, a receiving module 502, and a processing module 503;

[0211] The display module 501 is used to display a panoramic image associated with a dynamic video. The panoramic image is obtained by fusing at least two images, which are images acquired during the shooting of the dynamic video. The dynamic video includes at least two video segments, and each image is associated with at least one video segment.

[0212] The receiving module 502 is used to receive a first input, which is used to select a first image region in the panoramic image. The first image region includes the image content of at least one target image, and the target image is an image among the at least two images.

[0213] The processing module 503 is configured to, in response to the first input received by the receiving module 502, play a first video segment in the dynamic video associated with the at least one target image based on the first image region.

[0214] In some embodiments of this application, the display module is further configured to update the display area of ​​the first image in response to the first input, based on the input parameters of the first input;

[0215] The processing module is specifically used to play a first video segment in the dynamic video associated with the at least one target image based on the updated first image region.

[0216] In some embodiments of this application, the receiving module is specifically used to receive the first input to the range selection box when a range selection box is displayed on the panoramic image;

[0217] The display module is specifically configured to respond to the first input and, based on the input parameters of the first input, magnify and display the first image area selected by the range selection box in the panoramic image.

[0218] In some embodiments of this application, the processing module is specifically used for:

[0219] Based on the first association relationship, at least one video frame in the dynamic video that is associated with the at least one target image is determined. The first association relationship is the association relationship between the image in the at least two images and the video frame in the dynamic video. The video segment associated with the image is determined by the video frame associated with the image.

[0220] The playback time period is determined based on the playback time of the first video frame and the playback time of the second video frame, wherein the first video frame and the second video frame are the video frames with the smallest and largest frame numbers, respectively, among the at least one video frame.

[0221] Play the first video segment in the dynamic video whose playback time falls within the playback time period.

[0222] In some embodiments of this application, the processing module is further configured to adjust the display range of the panoramic image according to the playback progress of the first video segment during playback.

[0223] In some embodiments of this application, the processing module is further configured to:

[0224] Before the display module displays the panoramic image associated with the dynamic video, when the display module displays the panoramic preview interface, the preview video frames and the corresponding audio data in the panoramic preview interface are cached.

[0225] During the process of caching preview video frames and corresponding audio data, the at least two images are acquired;

[0226] Based on the preview video frames and corresponding audio data cached within the first time period, the dynamic video is generated, and image fusion processing is performed on the at least two images to obtain the panoramic image;

[0227] The first time period is determined by the acquisition time period of the at least two images.

[0228] In some embodiments of this application, the receiving module is further configured to receive a second input to the shooting control in the panoramic preview interface before the processing module acquires the at least two images;

[0229] The processing module is specifically configured to, in response to the second input received by the receiving module, select at least two images from the preview video frames cached after the first time point, wherein the first time point is the input time point of the second input.

[0230] In some embodiments of this application, the processing module is further configured to determine the time period between the second time point and the third time point as the first time period before generating the dynamic video based on the preview video frames and corresponding audio data cached within the first time period.

[0231] Wherein, the second time point is the time point corresponding to the first preset duration before the first time point, and the third time point is the time point corresponding to the second preset duration after the acquisition end time of the at least two images.

[0232] In the dynamic video playback device provided in this application, since the electronic device can display a panoramic image associated with a dynamic video, the panoramic image is obtained by fusing at least two images acquired during the shooting of the dynamic video, and each of the at least two images is associated with at least one video segment in the dynamic video, when the user triggers the electronic device to select a first image area in the panoramic image (the first image area includes the image content of at least one target image) through a first input, the electronic device can play the first video segment associated with at least one target image in the dynamic video. That is, by selecting an image area in the panoramic image, the electronic device can be triggered to play at least a portion of the video segments in the dynamic video associated with the panoramic image, thus simplifying the operation process of playing dynamic videos.

[0233] The dynamic video playback device in this application embodiment can be an electronic device or a component within an electronic device, such as an integrated circuit or a chip. The electronic device can be a terminal or other devices besides a terminal. For example, the electronic device can be a mobile phone, tablet computer, laptop computer, PDA, in-vehicle electronic device, mobile internet device (MID), augmented reality (AR) / virtual reality (VR) device, robot, wearable device, ultra-mobile personal computer (UMPC), netbook, or personal digital assistant (PDA), etc. It can also be a server, network attached storage (NAS), personal computer (PC), television set (TV), ATM, or self-service machine, etc. This application embodiment does not specifically limit the specific device.

[0234] The dynamic video playback device in this application embodiment can be a device with an operating system. This operating system can be Android, iOS, or other possible operating systems; this application embodiment does not specifically limit the specific operating system used.

[0235] The dynamic video playback device provided in this application embodiment can achieve... Figures 1 to 4B The various processes implemented in the method embodiments achieve the same technical effect, and will not be described again here to avoid repetition.

[0236] Optionally, such as Figure 6 As shown, this application embodiment also provides an electronic device 600, including a processor 601 and a memory 602. The memory 602 stores a program or instructions that can run on the processor 601. When the program or instructions are executed by the processor 601, they implement the various steps of the above-described dynamic video playback method embodiment and can achieve the same technical effect. To avoid repetition, they will not be described again here.

[0237] It should be noted that the electronic devices in the embodiments of this application include mobile electronic devices and non-mobile electronic devices.

[0238] Figure 7 A schematic diagram of the hardware structure of an electronic device to implement an embodiment of this application.

[0239] The electronic device 1500 includes, but is not limited to, components such as: radio frequency unit 1501, network module 1502, audio output unit 1503, input unit 1504, sensor 1505, display unit 1506, user input unit 1507, interface unit 1508, memory 1509, and processor 1510.

[0240] Those skilled in the art will understand that the electronic device 1500 may also include a power supply (such as a battery) for supplying power to various components. The power supply may be logically connected to the processor 1510 through a power management system, thereby enabling functions such as managing charging, discharging, and power consumption through the power management system. Figure 7 The electronic device structure shown does not constitute a limitation on the electronic device. The electronic device may include more or fewer components than shown, or combine certain components, or have different component arrangements, which will not be elaborated here.

[0241] The display unit 1506 is used to display a panoramic image associated with a dynamic video. The panoramic image is obtained by fusing at least two images, which are images acquired during the shooting of the dynamic video. The dynamic video includes at least two video segments, and each image is associated with at least one video segment.

[0242] The user input unit 1507 is used to receive a first input, which is used to select a first image region in the panoramic image. The first image region includes the image content of at least one target image, and the target image is an image among the at least two images.

[0243] The processing unit 1510 is configured to, in response to the first input received by the user input unit 1507, play a first video segment in the dynamic video associated with the at least one target image based on the first image region.

[0244] In some embodiments of this application, the display unit 1506 is further configured to update the display area of ​​the first image in response to the first input, based on the input parameters of the first input.

[0245] The processing unit 1510 is specifically used to play the first video segment associated with the at least one target image in the dynamic video based on the updated first image region.

[0246] In some embodiments of this application, the user input unit 1507 is specifically used to receive the first input to the range selection box when a range selection box is displayed on the panoramic image;

[0247] The display unit 1506 is specifically used to respond to the first input and, according to the input parameters of the first input, to magnify and display the first image area selected by the range selection box in the panoramic image.

[0248] In some embodiments of this application, the processing unit 1510 is specifically used for:

[0249] Based on the first association relationship, at least one video frame in the dynamic video that is associated with the at least one target image is determined. The first association relationship is the association relationship between the image in the at least two images and the video frame in the dynamic video. The video segment associated with the image is determined by the video frame associated with the image.

[0250] The playback time period is determined based on the playback time of the first video frame and the playback time of the second video frame, wherein the first video frame and the second video frame are the video frames with the smallest and largest frame numbers, respectively, among the at least one video frame.

[0251] Play the first video segment in the dynamic video whose playback time falls within the playback time period.

[0252] In some embodiments of this application, the processing unit 1510 is further configured to adjust the display range of the panoramic image according to the playback progress of the first video segment during playback of the first video segment.

[0253] In some embodiments of this application, the processing unit 1510 is further configured to:

[0254] Before the display unit 1506 displays the panoramic image associated with the dynamic video, when the display unit 1506 displays the panoramic preview interface, the preview video frames and the audio data corresponding to the preview video frames in the panoramic preview interface are cached.

[0255] During the process of caching preview video frames and corresponding audio data, the at least two images are acquired;

[0256] Based on the preview video frames and corresponding audio data cached within the first time period, the dynamic video is generated, and image fusion processing is performed on the at least two images to obtain the panoramic image;

[0257] The first time period is determined by the acquisition time period of the at least two images.

[0258] In some embodiments of this application, the user input unit 1507 is further configured to receive a second input to the shooting control in the panoramic preview interface before the processing unit 1510 acquires the at least two images;

[0259] The processing unit 1510 is specifically configured to, in response to the second input received by the user input unit 1507, select the at least two images from the preview video frames cached after the first time point, wherein the first time point is the input time point of the second input.

[0260] In some embodiments of this application, the processing unit 1510 is further configured to determine the time period between the second time point and the third time point as the first time period before generating the dynamic video based on the preview video frames and corresponding audio data cached within the first time period.

[0261] Wherein, the second time point is the time point corresponding to the first preset duration before the first time point, and the third time point is the time point corresponding to the second preset duration after the acquisition end time of the at least two images.

[0262] In the electronic device provided in this application, since the electronic device can display a panoramic image associated with a dynamic video, the panoramic image is obtained by fusing at least two images acquired during the shooting of the dynamic video, and each of the at least two images is associated with at least one video segment in the dynamic video, when the user triggers the electronic device to select a first image area in the panoramic image (the first image area includes the image content of at least one target image) through a first input, the electronic device can play the first video segment associated with at least one target image in the dynamic video. That is, by selecting an image area in the panoramic image, the electronic device can be triggered to play at least a portion of the video segments in the dynamic video associated with the panoramic image, thus simplifying the operation process of playing dynamic videos.

[0263] It should be understood that, in this embodiment, the input unit 1504 may include a graphics processing unit (GPU) 15041 and a microphone 15042. The GPU 15041 processes image data of still images or videos obtained by an image capture device (such as a camera) in video capture mode or image capture mode. The display unit 1506 may include a display panel 15061, which may be configured in the form of a liquid crystal display, an organic light-emitting diode, or the like. The user input unit 1507 includes a touch panel 15071 and at least one of other input devices 15072. The touch panel 15071 is also called a touch screen. The touch panel 15071 may include a touch detection device and a touch controller. Other input devices 15072 may include, but are not limited to, physical keyboards, function keys (such as volume control buttons, power buttons, etc.), trackballs, mice, and joysticks, which will not be described in detail here.

[0264] The memory 1509 can be used to store software programs and various data. The memory 1509 may primarily include a first storage area for storing programs or instructions and a second storage area for storing data. The first storage area may store the operating system, application programs or instructions required for at least one function (such as sound playback, image playback, etc.). Furthermore, the memory 1509 may include volatile memory or non-volatile memory, or both. The non-volatile memory may be read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), or flash memory. Volatile memory can be random access memory (RAM), static random access memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (SDRAM), double data rate synchronous dynamic random access memory (DDRSDRAM), enhanced synchronous dynamic random access memory (ESDRAM), synchronous link dynamic random access memory (SLDRAM), and direct memory bus RAM (DRRAM). The memory 1509 in this embodiment includes, but is not limited to, these and any other suitable types of memory.

[0265] Processor 1510 may include one or more processing units; optionally, processor 1510 integrates an application processor and a modem processor, wherein the application processor mainly handles operations involving the operating system, user interface, and applications, and the modem processor mainly handles wireless communication signals, such as a baseband processor. It is understood that the aforementioned modem processor may also not be integrated into processor 1510.

[0266] This application also provides a readable storage medium storing a program or instructions. When the program or instructions are executed by a processor, they implement the various processes of the above-described dynamic video playback method embodiments and achieve the same technical effect. To avoid repetition, they will not be described again here.

[0267] The processor is the processor in the electronic device described in the above embodiments. The readable storage medium includes computer-readable storage media, such as computer read-only memory (ROM), random access memory (RAM), magnetic disk, or optical disk.

[0268] This application embodiment also provides a chip, which includes a processor and a communication interface. The communication interface is coupled to the processor. The processor is used to run programs or instructions to implement the various processes of the above-described dynamic video playback method embodiments and can achieve the same technical effect. To avoid repetition, it will not be described again here.

[0269] It should be understood that the chip mentioned in the embodiments of this application may also be referred to as a system-on-a-chip, system chip, chip system, or system-on-a-chip, etc.

[0270] This application provides a computer program product, which is stored in a storage medium and executed by at least one processor to implement the various processes of the above-described dynamic video playback method embodiments, and can achieve the same technical effect. To avoid repetition, it will not be described again here.

[0271] It should be noted that, in this document, the terms "comprising," "including," or any other variations thereof are intended to cover non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitations, an element defined by the phrase "comprising one..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that includes that element. Furthermore, it should be noted that the scope of the methods and apparatuses in the embodiments of this application is not limited to performing functions in the order shown or discussed, but may also include performing functions substantially simultaneously or in the reverse order, depending on the functions involved. For example, the described methods may be performed in a different order than described, and various steps may be added, omitted, or combined. Additionally, features described with reference to certain examples may be combined in other examples.

[0272] Through the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus necessary general-purpose hardware platforms. Of course, they can also be implemented by hardware, but in many cases the former is a better implementation method. Based on this understanding, the technical solution of this application, in essence, or the part that contributes to the prior art, can be embodied in the form of a computer software product. This computer software product is stored in a storage medium (such as ROM / RAM, magnetic disk, optical disk) and includes several instructions to cause a terminal (which may be a mobile phone, computer, server, or network device, etc.) to execute the methods described in the various embodiments of this application.

[0273] The embodiments of this application have been described above with reference to the accompanying drawings. However, this application is not limited to the specific embodiments described above. The specific embodiments described above are merely illustrative and not restrictive. Those skilled in the art can make many other forms under the guidance of this application without departing from the spirit and scope of the claims, and all of these forms are within the protection scope of this application.

Claims

1. A method for playing dynamic video, characterized in that, The method includes: Display a panoramic image associated with a dynamic video, the panoramic image being obtained by fusing at least two images, the at least two images being images acquired during the shooting of the dynamic video, the dynamic video including at least two video segments, each image being associated with at least one video segment; wherein, each image being associated with at least one video segment includes any of the following: the image identifier of each image being associated with the segment identifier of at least one video segment, and the image identifier of each image being associated with the identifier of a key video frame in at least one video segment; Receive a first input, the first input being used to select a first image region in the panoramic image, the first image region including image content of at least one target image, the target image being an image among the at least two images; In response to the first input, a first video segment associated with the at least one target image in the dynamic video is played based on the first image region.

2. The method of claim 1, wherein, The step of responding to the first input and playing a first video segment associated with the at least one target image in the dynamic video based on the first image region includes: In response to the first input, the first image region is updated and displayed according to the input parameters of the first input; Based on the updated first image region, a first video segment associated with the at least one target image in the dynamic video is played.

3. The method of claim 2, wherein, The receiving of the first input includes: When a range selection box is displayed on the panoramic image, the first input to the range selection box is received; The step of updating the displayed first image region in response to the first input, based on the input parameters of the first input, includes: In response to the first input, the first image region selected by the range selection box in the panoramic image is magnified according to the input parameters of the first input.

4. The method according to any one of claims 1 to 3, characterized in that, Playing the first video segment associated with the at least one target image in the dynamic video includes: Based on the first association relationship, at least one video frame in the dynamic video that is associated with the at least one target image is determined. The first association relationship is the association relationship between the image in the at least two images and the video frame in the dynamic video. The video segment associated with the image is determined by the video frame associated with the image. The playback time period is determined based on the playback time of the first video frame and the playback time of the second video frame, wherein the first video frame and the second video frame are the video frames with the smallest and largest frame numbers, respectively, among the at least one video frame. Play the first video segment in the dynamic video whose playback time falls within the playback time period.

5. The method according to any one of claims 1 to 4, characterized in that, The method further includes: During the playback of the first video segment, the display range of the panoramic image is adjusted according to the playback progress of the first video segment.

6. The method according to any one of claims 1 to 4, characterized in that, Before displaying the panoramic image associated with the dynamic video, the method further includes: When displaying the panoramic preview interface, cache the preview video frames and the corresponding audio data in the panoramic preview interface. During the process of caching preview video frames and corresponding audio data, the at least two images are acquired; Based on the preview video frames and corresponding audio data cached within the first time period, the dynamic video is generated, and image fusion processing is performed on the at least two images to obtain the panoramic image; The first time period is determined by the acquisition time period of the at least two images.

7. The method of claim 6, wherein, Before acquiring the at least two images, the method further includes: Receive a second input to the shooting controls in the panoramic preview interface; The acquisition of at least two images includes: In response to the second input, at least two images are selected from preview video frames cached after the first time point, where the first time point is the input time point of the second input.

8. The method of claim 7, wherein, Before generating the dynamic video based on the cached preview video frames and corresponding audio data within the first time period, the method further includes: The time period between the second time point and the third time point is defined as the first time period; Wherein, the second time point is the time point corresponding to the first preset duration before the first time point, and the third time point is the time point corresponding to the second preset duration after the acquisition end time of the at least two images.

9. A dynamic video playback device, characterized in that, The device includes: a display module, a receiving module, and a processing module; The display module is used to display a panoramic image associated with a dynamic video. The panoramic image is obtained by fusing at least two images, which are acquired during the shooting of the dynamic video. The dynamic video includes at least two video segments, and each image is associated with at least one video segment. The association of each image with at least one video segment includes any of the following: the image identifier of each image is associated with the segment identifier of at least one video segment, and the image identifier of each image is associated with the identifier of a key video frame in at least one video segment. The receiving module is used to receive a first input, which is used to select a first image region in the panoramic image. The first image region includes the image content of at least one target image, and the target image is an image among the at least two images. The processing module is configured to, in response to the first input received by the receiving module, play a first video segment in the dynamic video associated with the at least one target image based on the first image region.

10. The apparatus of claim 9, wherein, The display module is further configured to, in response to the first input, update the display area of ​​the first image according to the input parameters of the first input; The processing module is specifically used to play the first video segment associated with the at least one target image in the dynamic video based on the updated first image region.

11. The apparatus of claim 10, wherein, The receiving module is specifically used to receive the first input to the range selection box when a range selection box is displayed on the panoramic image; The display module is specifically configured to respond to the first input and, based on the input parameters of the first input, magnify and display the first image area selected by the range selection box in the panoramic image.

12. The apparatus of any one of claims 9-11, wherein, The processing module is specifically used for: Based on the first association relationship, at least one video frame in the dynamic video that is associated with the at least one target image is determined. The first association relationship is the association relationship between the image in the at least two images and the video frame in the dynamic video. The video segment associated with the image is determined by the video frame associated with the image. The playback time period is determined based on the playback time of the first video frame and the playback time of the second video frame, wherein the first video frame and the second video frame are the video frames with the smallest and largest frame numbers, respectively, among the at least one video frame. Play the first video segment in the dynamic video whose playback time falls within the playback time period.

13. The apparatus of any one of claims 9-12, wherein, The processing module is further configured to adjust the display range of the panoramic image according to the playback progress of the first video segment during playback.

14. The apparatus of any one of claims 9-13, wherein, The processing module is further configured to: Before the display module displays the panoramic image associated with the dynamic video, when the display module displays the panoramic preview interface, the preview video frames and the corresponding audio data in the panoramic preview interface are cached. During the process of caching preview video frames and corresponding audio data, the at least two images are acquired; Based on the preview video frames and corresponding audio data cached within the first time period, the dynamic video is generated, and image fusion processing is performed on the at least two images to obtain the panoramic image; The first time period is determined by the acquisition time period of the at least two images.

15. The apparatus of claim 14, wherein, The receiving module is further configured to receive a second input to the shooting control in the panoramic preview interface before the processing module acquires the at least two images; The processing module is specifically configured to, in response to the second input received by the receiving module, select at least two images from the preview video frames cached after the first time point, wherein the first time point is the input time point of the second input.

16. The apparatus of claim 15, wherein, The processing module is further configured to determine the time period between the second time point and the third time point as the first time period before generating the dynamic video based on the preview video frames and corresponding audio data cached within the first time period; Wherein, the second time point is the time point corresponding to the first preset duration before the first time point, and the third time point is the time point corresponding to the second preset duration after the acquisition end time of the at least two images.

17. An electronic device, comprising: It includes a processor and a memory, the memory storing a program or instructions that can run on the processor, the program or instructions being executed by the processor to implement the steps of the dynamic video playback method as described in any one of claims 1 to 8.

18. A readable storage medium, characterized by, The readable storage medium stores a program or instructions that, when executed by a processor, implement the steps of the dynamic video playback method as described in any one of claims 1 to 8.

Citation Information

Patent Citations

  • Panoramic video live broadcast method and system and computer readable storage medium

    CN113099245A