A video processing method, apparatus, device, and storage medium
By displaying a mask at the bottom of the video editing window and showing an editing reference track on the mask, the problem of inconvenient operation caused by a complicated video processing interface is solved, thus improving the user experience.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- BEIJING ZITIAO NETWORK TECH CO LTD
- Filing Date
- 2020-11-24
- Publication Date
- 2026-06-19
AI Technical Summary
With the diversification of video processing functions, the interactive functions on video processing interfaces are becoming increasingly complex, leading to inconvenience for users during the video processing process and affecting the user experience.
Display a mask at the bottom of the video editing window and show editing reference tracks, including video and audio tracks, on the mask, using the timeline in the video editing window to assist the user in editing.
By displaying the editing reference track through a mask, the impact on user operations is reduced, thus improving the user experience of video processing.
Smart Images

Figure CN117750112B_ABST
Abstract
Description
Technical Field
[0001] This disclosure relates to the field of data processing, and more particularly to a video processing method, apparatus, device, and storage medium. Background Technology
[0002] With the continuous development of video processing technology, video processing functions have become more diversified. For example, video editing not only provides users with the ability to add music, special effects, and other sound effects, but also allows users to add stickers, text, and other elements to the video, enriching the user's video processing experience.
[0003] However, with the diversification of video processing functions, the interactive functions on the video processing interface have become increasingly complex, leading to inconvenience for users during the video processing process and thus affecting the user's video processing experience. Summary of the Invention
[0004] To address, or at least partially address, the aforementioned technical problems, this disclosure provides a video processing method, apparatus, device, and storage medium that redisplays the editing reference track as a mask when the editing reference track is moved out of the video editing window, thereby reducing the impact on the user's video processing and improving the user experience.
[0005] In a first aspect, this disclosure provides a video processing method, the method comprising:
[0006] When the editing reference track of the video to be processed is detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window; the editing reference track includes a video track and / or an audio track;
[0007] Based on the timeline in the video editing window, an editing reference track for the video to be processed is displayed on the overlay; wherein, the editing reference track displayed on the overlay is used to assist the user in editing the video to be processed in the video editing window.
[0008] In one optional implementation, the method further includes:
[0009] When the editing reference track that has been moved out of the video editing window is detected to move back into the video editing window, the overlay and the editing reference track displayed on the overlay are hidden.
[0010] In one optional implementation, displaying an editing reference track of the video to be processed on the overlay based on the timeline on the video editing window includes:
[0011] Based on the timeline on the video editing window, the audio track of the video to be processed is displayed on the overlay in a preset format.
[0012] In one optional implementation, before displaying the audio track of the video to be processed on the overlay in a preset format based on the timeline on the video editing window, the method further includes:
[0013] Determine the time point at which the target audio data exists on the audio track of the video to be processed; the target audio data includes music-type audio data or sound effect-type audio data;
[0014] Based on the time point where the target audio data exists, a target audio visual line is generated for the audio track of the video to be processed; wherein, the target audio visual line is used to visualize the playback position of the target audio data in the video to be processed;
[0015] Correspondingly, the step of displaying the audio track of the video to be processed in a preset format on the overlay based on the timeline on the video editing window includes:
[0016] Based on the timeline in the video editing window, the target audio is displayed on the overlay as a visible visual element.
[0017] In one optional implementation, the audio track of the video to be processed includes a first audio track and a second audio track, and the step of determining the time point at which the target audio data exists in the audio track of the video to be processed includes:
[0018] Determine the time points when the target audio data exists on the first audio track and the second audio track, respectively;
[0019] The time points where the target audio data exists on the first audio track and the second audio track are merged to obtain the merged time points;
[0020] Accordingly, generating a target audio visible line of sight for the audio track of the video to be processed based on the time point where the target audio data exists includes:
[0021] Based on the merged time point, a target audio visual line is generated for the video to be processed.
[0022] In one optional implementation, the target audio data includes audio data of a music genre; the step of displaying the visible target audio on the overlay based on the timeline on the video editing window further includes:
[0023] Music beats are marked on the target audio visual line of sight.
[0024] In one alternative implementation, the target audio data includes audio data of the sound effect type.
[0025] Secondly, this disclosure provides a video processing apparatus, the apparatus comprising:
[0026] The first display module is used to display a mask in the bottom area of the video editing window when the video track of the video to be processed is detected to have moved out of the video editing window;
[0027] The second display module is used to display an editing reference track of the video to be processed on the overlay based on the timeline on the video editing window; wherein the editing reference track includes a video track and / or an audio track, and the editing reference track is used to assist the user in editing the video to be processed in the video editing window.
[0028] Thirdly, this disclosure provides a computer-readable storage medium storing instructions that, when executed on a terminal device, cause the terminal device to implement the above-described method.
[0029] Fourthly, this disclosure provides an apparatus comprising: a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the method described above.
[0030] The technical solution provided in this disclosure has the following advantages compared with the prior art:
[0031] This disclosure provides a video processing method in which, when the editing reference track of the video to be processed is detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window; the editing reference track includes a video track and / or an audio track. Then, based on the timeline on the video editing window, the editing reference track of the video to be processed is displayed on the mask; wherein, the editing reference track displayed on the mask is used to assist the user in editing the video to be processed in the video editing window. Therefore, the video processing method provided by this disclosure can redisplay the editing reference track in the form of a mask when it moves out of the video editing window, reducing the impact on the user's video processing and improving the user experience. Attached Figure Description
[0032] The accompanying drawings, which are incorporated in and form a part of this specification, illustrate embodiments consistent with this disclosure and, together with the description, serve to explain the principles of this disclosure.
[0033] To more clearly illustrate the technical solutions in the embodiments of this disclosure or the prior art, the accompanying drawings used in the description of the embodiments or the prior art will be briefly introduced below. Obviously, those skilled in the art can obtain other drawings based on these drawings without creative effort.
[0034] Figure 1 A flowchart of a video processing method provided in this embodiment of the disclosure;
[0035] Figure 2 This is a schematic diagram illustrating the effect of a video editing window provided in an embodiment of the present disclosure;
[0036] Figure 3 A schematic diagram illustrating the overlay of target audio data on multiple audio tracks provided in this embodiment of the disclosure;
[0037] Figure 4 This is a schematic diagram illustrating the effect of another video editing window provided in an embodiment of this disclosure;
[0038] Figure 5 This is a schematic diagram of the structure of a video processing apparatus provided in an embodiment of the present disclosure;
[0039] Figure 6 This is a schematic diagram of the structure of a video processing device provided in an embodiment of the present disclosure. Detailed Implementation
[0040] To better understand the above-mentioned objectives, features, and advantages of this disclosure, the solutions disclosed herein will be further described below. It should be noted that, unless otherwise specified, the embodiments and features described herein can be combined with each other.
[0041] Numerous specific details are set forth in the following description in order to provide a full understanding of this disclosure, but this disclosure may also be implemented in other ways different from those described herein; obviously, the embodiments in the specification are only some, and not all, of the embodiments of this disclosure.
[0042] With the diversification of video processing functions, the editing functions on the video editing window are becoming increasingly complex, which may cause inconvenience for users during the video processing process and affect the user experience.
[0043] In real-world applications, video editing windows can display more and more video editing functions via scrollbars. However, scrolling based on scrollbars may cause tracks such as video tracks or audio tracks, which are used to assist users in video editing, to no longer be displayed in the video editing window, thus affecting the user's video editing experience.
[0044] To this end, this disclosure provides a video processing method. Specifically, when the editing reference track of the video to be processed is detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window; the editing reference track includes a video track and / or an audio track. Then, based on the timeline on the video editing window, the editing reference track of the video to be processed is displayed on the mask; wherein, the editing reference track displayed on the mask is used to assist the user in editing the video to be processed in the video editing window.
[0045] As can be seen, the video processing method provided in this embodiment can redisplay the editing reference track in the form of a mask when the editing reference track moves out of the video editing window, thereby reducing the impact on the user's video processing and improving the user experience.
[0046] Based on this, the present disclosure provides a video processing method, referring to... Figure 1 The flowchart below illustrates a video processing method provided in this embodiment of the present disclosure. The method includes:
[0047] S101: When the editing reference track of the video to be processed is detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window; the editing reference track includes a video track and / or an audio track.
[0048] The video processing method provided in this disclosure can be applied to functions such as video editing.
[0049] In practical applications, video and audio tracks play a certain auxiliary role in the user's video processing. Therefore, when the user is processing the video to be processed, the video and audio tracks of the video to be processed should be displayed in the video editing window as much as possible to improve the user's video processing experience.
[0050] Therefore, in this embodiment of the present disclosure, when the editing reference track of the video to be processed is detected to have moved out of the video editing window, the editing reference track that has moved out of the video editing window is displayed in the form of a mask to reduce the impact on the user's video processing.
[0051] In one optional implementation, when processing a video, the user may need to scroll up and down the video editing window. During this scrolling, the editing reference track may move out of the video editing window and no longer be displayed. To reduce the impact on the user's video processing, this embodiment displays a mask at the bottom area of the video editing window and redisplays the editing reference track on the mask.
[0052] The bottom area of the video editing window can be a display area defined by the bottom boundary of the video editing window. The overlay displayed in the bottom area has a certain degree of transparency, showing the editing reference track on the overlay without affecting the content display and operation control of the underlying area covered by the overlay. For example, the content displayed in the underlying area covered by the overlay will update as the video editing window scrolls up and down.
[0053] In one optional implementation, when the video track of the video to be processed is detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window. Alternatively, when the audio track of the video to be processed is detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window. Alternatively, when both the video track and the audio track of the video to be processed are detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window. The video track may include the main video track, or the video track may include the main video track and a picture-in-picture track.
[0054] S102: Based on the timeline on the video editing window, display the editing reference track of the video to be processed on the overlay; wherein, the editing reference track displayed on the overlay is used to assist the user in editing the video to be processed in the video editing window.
[0055] In this embodiment of the disclosure, a timeline is displayed on the video editing window, and an editing reference track of the video to be processed is displayed on the overlay based on the timeline, so that the editing reference track displayed on the overlay is consistent with the timeline displayed in the video editing window, which makes it convenient for the user to edit the video to be processed.
[0056] In one optional implementation, after the editing reference track moves out of the video editing window, it remains displayed on the overlay at the bottom of the window to facilitate user editing. When the editing reference track is detected to have moved back into the video editing window, the overlay and the edit reference track displayed on it can be hidden to avoid redundant display. At this point, the user can edit the video based on the re-entered editing reference track.
[0057] In the video processing method provided in this disclosure, when the editing reference track of the video to be processed is detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window; the editing reference track includes a video track and / or an audio track. Then, based on the timeline on the video editing window, the editing reference track of the video to be processed is displayed on the mask; wherein, the editing reference track displayed on the mask is used to assist the user in editing the video to be processed in the video editing window. It is evident that the video processing method provided in this disclosure can redisplay the editing reference track in the form of a mask when it moves out of the video editing window, reducing the impact on the user's video processing and improving the user experience.
[0058] In practical applications, when editing reference tracks that include audio tracks, the audio track of the video to be processed can be displayed on the mask in a preset format.
[0059] In one optional implementation, the audio track of the video to be processed can be displayed as a visual representation. Specifically, firstly, the time points in the audio track of the video to be processed where the target audio data exists are determined. Then, based on the time points where the target audio data exists, a visual representation of the target audio is generated for the audio track of the video to be processed. This visual representation of the target audio is used to visualize the playback position of the target audio data in the video to be processed. In other words, the visual representation of the target audio can determine which positions in the video to be processed contain the target audio data, for example, determining the playback position of background music in the video to be processed.
[0060] like Figure 2 The diagram illustrates the effect of a video editing window according to an embodiment of this disclosure. A mask is displayed at the bottom of the video editing window, and a video track and a target audio line of sight are simultaneously displayed on the mask. The target audio line of sight may include at least one line segment, and the video track may include a main video track and a picture-in-picture track.
[0061] Of course, in some implementations, only the video track or only the target audio line of sight may be displayed on the overlay at the bottom area of the video editing window, and this disclosure does not limit this.
[0062] In addition, the target audio data in this embodiment may include music-type audio data and / or sound effect-type audio data.
[0063] Specifically, the music-type audio data is typically electronic music, such as songs or light music. In this embodiment, determining the time point at which the target audio data exists in the audio track of the video to be processed can include determining the time point at which music-type audio data exists in the audio track of the video to be processed. The audio track can include at least one music-type audio track.
[0064] Audio data of the sound effect type typically includes special effects audio, dubbing, text reading, etc., such as a cat-like meowing sound effect "meow~". The time point at which the target audio data exists in the audio track of the video to be processed in this embodiment can include the time point at which the sound effect type audio data exists in the audio track of the video to be processed. The audio track can include at least one sound effect type audio track.
[0065] In one optional implementation, if there is only one audio track containing the target audio data in the video to be processed, then the time point in which the target audio data exists on that audio track can be determined as the time point in which the target audio data exists in the audio track of the video to be processed.
[0066] In another optional implementation, if the audio track containing the target audio data in the video to be processed includes at least two audio tracks (taking the first audio track and the second audio track as examples), the time point at which the target audio data exists in the audio track of the video to be processed is determined specifically includes:
[0067] First, the time points in the first audio track and the second audio track where the target audio data exists are determined respectively; then, the time points in the first audio track and the second audio track where the target audio data exists are merged to obtain merged time points; finally, based on the merged time points, a target audio visual line is generated for the video to be processed.
[0068] like Figure 3 The diagram illustrates the merging of time points containing target audio data across multiple audio tracks according to an embodiment of this disclosure. The first audio track includes audio data A and audio data B, and the second audio track includes audio data C. The time points corresponding to audio data A, audio data B, and audio data C are merged based on a time axis to obtain merged time points, i.e., time points where audio data A, audio data B, or audio data C exists. Specifically, merging the time points corresponding to audio data A, audio data B, and audio data C involves retaining at least one of the time points in the video to be processed that contain audio data A, audio data B, and audio data C, thus forming the visible line of sight for the target audio.
[0069] In one optional implementation, the target audio data may include music-type audio data and sound effect-type audio data. Therefore, embodiments of this disclosure can generate corresponding visual lines for music-type audio data and sound effect-type audio data, respectively. Specifically, a music audio visual line is generated based on the time points in the audio track of the video to be processed where music-type audio data exists, and simultaneously, a sound effect audio visual line is generated based on the time points in the audio track of the video to be processed where sound effect-type audio data exists.
[0070] like Figure 4 The diagram shown illustrates the effect of another video editing window provided in this embodiment. In this window, the main video track, picture-in-picture track, audio view L1 (for music), and audio view L2 (for sound effects) are displayed on the same timeline overlay.
[0071] The video processing method provided in this disclosure can display the time points in the audio track where the target audio data exists in a visual form to the user. Without affecting video processing, it optimizes the display effect of the video editing window, facilitates user operation, and improves the user experience.
[0072] In practical applications, music timing is an important reference factor in video processing. Therefore, in this embodiment of the present disclosure, before displaying the target audio visual line on the overlay based on the timeline on the video editing window, it is also necessary to determine the music timing of the video to be processed and then display the music timing to facilitate video processing and improve user experience.
[0073] Specifically, when the target audio data is music-type audio data, music timing is determined from the time points when the target audio data exists on the audio track of the video to be processed, and then the music timing is marked on the visible line of sight of the music-type audio.
[0074] Music timing is used to identify rhythm, melody, and other elements in music. Music timing can be pre-generated automatically or manually determined by the user; no specific limitation is made here.
[0075] In one optional implementation, if there is only one audio track containing music-type audio data in the video to be processed, then the music beats on that audio track are determined as the music beats of the video to be processed.
[0076] In another optional implementation, if the video to be processed contains audio data of music type and includes at least two audio tracks (taking the third and fourth audio tracks as an example), the music beats on the third and fourth audio tracks can be superimposed based on the timeline to determine the music beats in the video to be processed. Figure 4As shown, the music beats have been marked on the audio track for the music genre.
[0077] In the video processing method provided in this embodiment, not only is the target audio visible to the user on the overlay, but the music beats are also displayed to the user. This optimizes the display effect of the video editing window without affecting video processing, making it easier for users to operate and improving the user experience.
[0078] Based on the same inventive concept as the above-described method embodiments, this disclosure also provides a video processing apparatus, with reference to... Figure 5 This is a schematic diagram of the structure of a video processing apparatus provided in an embodiment of the present disclosure. The apparatus includes:
[0079] The first display module 501 is used to display a mask in the bottom area of the video editing window when the video track of the video to be processed is detected to have moved out of the video editing window;
[0080] The second display module 502 is used to display an editing reference track of the video to be processed on the overlay based on the timeline on the video editing window; wherein the editing reference track includes a video track and / or an audio track, and the editing reference track is used to assist the user in editing the video to be processed in the video editing window.
[0081] In one optional embodiment, the apparatus further includes:
[0082] A hiding module is used to hide the overlay and the editing reference track displayed on the overlay when the editing reference track that has been moved out of the video editing window is detected to have moved back into the video editing window.
[0083] In one optional implementation, the second display module is specifically used for:
[0084] Based on the timeline on the video editing window, the audio track of the video to be processed is displayed on the overlay in a preset format.
[0085] In one optional embodiment, the apparatus further includes:
[0086] The first determining module is used to determine the time point at which target audio data exists on the audio track of the video to be processed; the target audio data includes music-type audio data or sound effect-type audio data;
[0087] The generation module is used to generate a target audio visual line for the audio track of the video to be processed based on the time point where the target audio data exists; wherein, the target audio visual line is used to visualize the playback position of the target audio data in the video to be processed;
[0088] Accordingly, the second display module is specifically used for:
[0089] Based on the timeline in the video editing window, the target audio is displayed on the overlay as a visible visual element.
[0090] In one optional implementation, the audio track of the video to be processed includes a first audio track and a second audio track, and the first determining module includes:
[0091] The first determining submodule is used to determine the time points when target audio data exists on the first audio track and the second audio track, respectively.
[0092] The merging submodule is used to merge the time points where the target audio data exists on the first audio track and the second audio track to obtain the merged time points;
[0093] Accordingly, the generation module is specifically used for:
[0094] Based on the merged time point, a target audio visual line is generated for the video to be processed.
[0095] In one optional implementation, the target audio data includes audio data of a music genre; the apparatus further includes:
[0096] A marking module is used to mark music beats on the visible surface of the target audio.
[0097] In one alternative implementation, the target audio data includes audio data of the sound effect type.
[0098] In the video processing apparatus provided in this embodiment, when the editing reference track of the video to be processed is detected to have moved out of the video editing window, a mask is displayed in the bottom area of the video editing window; the editing reference track includes a video track and / or an audio track. Then, based on the timeline on the video editing window, the editing reference track of the video to be processed is displayed on the mask; wherein, the editing reference track displayed on the mask is used to assist the user in editing the video to be processed in the video editing window. Therefore, the video processing apparatus provided in this embodiment can redisplay the editing reference track in the form of a mask when the editing reference track moves out of the video editing window, reducing the impact on the user's video processing and improving the user experience.
[0099] In addition to the methods and apparatus described above, this disclosure also provides a computer-readable storage medium storing instructions that, when executed on a terminal device, cause the terminal device to implement the video processing method described in this disclosure.
[0100] In addition, this disclosure also provides a video processing device, see [link to relevant documentation]. Figure 6 As shown, it may include:
[0101] The video processing device includes a processor 601, a memory 602, an input device 603, and an output device 604. The number of processors 601 in the video processing device can be one or more. Figure 6 Taking a processor as an example. In some embodiments of this disclosure, the processor 601, memory 602, input device 603, and output device 604 can be connected via a bus or other means, wherein, Figure 6 Taking the example of a connection between China and Israel via a bus.
[0102] The memory 602 can be used to store software programs and modules. The processor 601 executes various functional applications and data processing of the video processing device by running the software programs and modules stored in the memory 602. The memory 602 may mainly include a program storage area and a data storage area. The program storage area may store the operating system, at least one application program required for a function, etc. In addition, the memory 602 may include high-speed random access memory, and may also include non-volatile memory, such as at least one disk storage device, flash memory device, or other volatile solid-state storage device. The input device 603 can be used to receive input digital or character information, and to generate signal inputs related to user settings and function control of the video processing device.
[0103] Specifically in this embodiment, the processor 601 loads the executable files corresponding to the processes of one or more applications into the memory 602 according to the following instructions, and the processor 601 runs the applications stored in the memory 602, thereby realizing the various functions of the video processing device described above.
[0104] It should be noted that, in this document, relational terms such as "first" and "second" are used merely to distinguish one entity or operation from another, and do not necessarily require or imply any such actual relationship or order between these entities or operations. Furthermore, the terms "comprising," "including," or any other variations thereof are intended to cover non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitations, an element defined by the phrase "comprising one..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that includes said element.
[0105] The above description is merely a specific embodiment of this disclosure, enabling those skilled in the art to understand or implement it. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of this disclosure. Therefore, this disclosure is not to be limited to the embodiments described herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Claims
1. A method of video processing, the method comprising: The method includes: The video editing window displays the editing reference track of the video to be processed; the editing reference track includes a video track and an audio track; When the editing reference track is detected to have moved out of the video editing window, the time point at which the target audio data exists on the audio track of the video to be processed is determined; the target audio data includes music-type audio data or sound effect-type audio data; Based on the time point where the target audio data exists, a target audio visual line is generated for the audio track of the video to be processed; wherein, the target audio visual line is used to visualize the playback position of the target audio data in the video to be processed; the target audio visual line includes a music audio visual line or a sound effect audio visual line, the music audio visual line is generated based on the time point of the audio data of the music type, and the sound effect audio visual line is generated based on the time point of the audio data of the sound effect type; A display overlay is added to the bottom area of the video editing window, and the video track and the target audio line of the video to be processed are displayed on the overlay; wherein, the video track and the target audio line of the overlay are the timeline on the video editing window, and the video track and the target audio line of the overlay are used to assist the user in editing the video to be processed in the video editing window.
2. The method of claim 1, wherein, The method further includes: In response to a scrolling operation on the video editing window, the video editing window is controlled to scroll so that the editing track displayed in the video editing window moves following the scrolling operation.
3. The method of claim 1, wherein, The method further includes: When the editing reference track that has been moved out of the video editing window is detected to move back into the video editing window, the overlay and the editing reference track displayed on the overlay are hidden.
4. The method of claim 1, wherein, The audio track of the video to be processed includes a first audio track and a second audio track. The step of determining the time point at which the target audio data exists in the audio track of the video to be processed includes: Determine the time points when the target audio data exists on the first audio track and the second audio track, respectively; The time points where the target audio data exists on the first audio track and the second audio track are merged to obtain the merged time points; Accordingly, generating a target audio visible line of sight for the audio track of the video to be processed based on the time point where the target audio data exists includes: Based on the merged time point, a target audio visual line is generated for the video to be processed.
5. The method according to claim 1 or 4, characterized in that, The target audio data includes audio data of a music genre; the step of displaying the target audio visual line on the overlay based on the timeline on the video editing window further includes: Music beats are marked on the target audio visual line of sight.
6. The method according to claim 1 or 4, characterized in that, The target audio data includes audio data of the sound effect type.
7. A video processing apparatus, characterized in that, The device includes: The track display module is used to display the editing reference track of the video to be processed in the video editing window; the editing reference track includes a video track and an audio track; The first determining module is used to determine the time point at which target audio data exists on the audio track of the video to be processed when the editing reference track is detected to have moved out of the video editing window; the target audio data includes music-type audio data or sound effect-type audio data; A generation module is used to generate a target audio visual line for the audio track of the video to be processed based on the time point where the target audio data exists; wherein, the target audio visual line is used to visualize the playback position of the target audio data in the video to be processed; the target audio visual line includes a music audio visual line or a sound effect audio visual line, wherein the music audio visual line is generated based on the time point of the audio data of the music type, and the sound effect audio visual line is generated based on the time point of the audio data of the sound effect type; The first display module is used to add a display mask to the bottom area of the video editing window and display the video track and the target audio line of sight on the mask; wherein the video track and the target audio line of sight on the mask adopt the timeline on the video editing window, and the video track and the target audio line of sight are used to assist the user in editing the video to be processed in the video editing window.
8. A computer-readable storage medium, characterized in that, The computer-readable storage medium stores instructions that, when executed on a terminal device, cause the terminal device to perform the method as described in any one of claims 1-6.
9. A video processing device, characterized in that, include: A memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the method as claimed in any one of claims 1-6.