Video editing methods, equipment, electronic devices, media, and products

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
The method automates language conversion in video editing templates, addressing the inefficiencies of separate template creation for different languages, enhancing efficiency and accessibility of multilingual templates.

JP2026521917APending Publication Date: 2026-07-02BEIJING ZITIAO NETWORK TECH CO LTD

View PDF 0 Cites 0 Cited by

Patent Information

Authority / Receiving Office: JP · JP
Patent Type: Applications
Current Assignee / Owner: BEIJING ZITIAO NETWORK TECH CO LTD
Filing Date: 2024-09-13
Publication Date: 2026-07-02

Application Information

Patent Timeline

13 Sep 2024

Application

02 Jul 2026

Publication

JP2026521917A

IPC: G06F40/58; H04N21/81; H04N21/854; H04N21/472; H04N21/44

AI Tagging

Technology Topics

Software engineering MediaFLO

Explore More Agents

Novelty Search
Search existing technologies and assess novelty
↗
FTO
Analyze whether a product may infringe others' patents
↗
Design FTO
Check prior-design risk for exterior design
↗
Drafting
Draft patent application text based on a technical solution
↗
Find Solutions with TRIZ
Generate feasible solution to solve your technical challenge
↗

Similar Technology Patents

Get free access to AI patent search and analysis

Check patentability, review prior art and ask IP Agent with full patent context.

Smart Images

Figure 2026521917000001_ABST

Patent Text Reader

Abstract

Embodiments of this disclosure disclose video editing methods, apparatus, electronic devices, storage media, and products, the method comprising: obtaining a first video editing draft; determining replaceable material within first video material; generating second text material by converting text content in a first language to text content in a second language based on first text material; marking replaceable material within the first video editing draft; replacing the first text material with the second text material; and generating a first video editing template, the first video editing template being used to introduce second video material for replacing replaceable material and to generate a second video editing draft, the second video editing draft being used to composite a target video. According to the technical embodiments of the embodiments of this disclosure, before publishing the video editing template, the text language conversion of the template can be performed automatically, multiple language versions of a single video editing template can be efficiently generated, and the acceptance range of the video editing template can be expanded.

Need to check novelty before this filing date? Find Prior Art

Description

Technical Field

[0001] This application claims the priority of a Chinese patent application filed on September 15, 2023, with an application number of 202311199524.3 and an invention title of "Video Editing Method, Apparatus, Electronic Device, Medium and Product", and all the contents of the application are incorporated herein by reference.

[0002] Embodiments of the present disclosure relate to the field of computer technology, and in particular, to video editing methods, apparatuses, electronic devices, media and products.

Background Art

[0003] In video editing applications, users can create and publish video editing templates. The published video editing templates can be used by more users. Usually, the languages of the operating environments in which different users apply video editing applications and video editing templates are different, and a single-language video editing template cannot meet the video editing needs of more users.

[0004] However, currently, when the publisher of a video editing template publishes video editing templates in multiple language versions, it is necessary to edit and create the video editing templates separately for each language version. For the publisher of the video editing template, the editing process of duplicate templates is also cumbersome, and there are also language ability barriers. In addition, there are certain limitations to the popularization and application of video editing templates.

Summary of the Invention

[0005] The present disclosure provides a video editing method, apparatus, electronic device, medium and product, which automatically performs language conversion of text materials based on one video editing draft, efficiently generates video editing templates in languages different from the video editing draft, is used in the video editing process, improves the creation and writing efficiency of video editing templates in multiple language versions, and can also expand the acceptance range of video editing templates.

[0006] In a first embodiment, an embodiment of the present disclosure provides a video editing method, the method comprising: obtaining a first video editing draft, the first video editing draft comprising multimedia material and editing information, the editing information being used to indicate editing operations on the multimedia material, the multimedia material comprising first video material and first text material, the first text material being used to indicate text content in a first language; determining replaceable material within the first video material; generating second text material by converting the text content in a first language to text content in a second language based on the first text material; marking the replaceable material within the first video editing draft, replacing the first text material with the second text material, and generating a first video editing template, the first video editing template being used to introduce second video material to replace the replaceable material, generate a second video editing draft, and the second video editing draft being used to composite a target video.

[0007] In a second embodiment, an embodiment of the present disclosure further provides a video editing apparatus, the apparatus comprising: an original video editing draft acquisition module for acquiring a first video editing draft, wherein the first video editing draft includes multimedia material and editing information, the editing information is used to indicate editing operations on the multimedia material, the multimedia material includes first video material and first text material, and the first text material is used to display text content in a first language; a replaceable material determination module used to determine replaceable material within the first video material; and the The system includes a text material conversion module used to generate a second text material by converting the text content of a first language to text content of a second language based on the first text material, and a video editing template generation module that marks the replaceable material in the first video editing draft, replaces the first text material with the second text material, and generates a first video editing template, wherein the first video editing template is used to introduce the second video material to replace the replaceable material and generate a second video editing draft, and the second video editing draft is used to composite a target video.

[0008] In a fourth embodiment, an embodiment of the present disclosure provides a storage medium containing computer executable instructions, which, when executed by a computer processor, is used to perform, for example, a video editing method described in any of the embodiments of the present disclosure.

[0009] In a fifth embodiment, the embodiments of the present disclosure further provide a computer program product including a computer program, which, when executed by a processor, performs, for example, a video editing method described in any of the embodiments of the present invention. [Brief explanation of the drawing]

[0010] The above and other features, advantages, and aspects of each embodiment of this disclosure will become more apparent by referring to the following specific embodiments in conjunction with the accompanying drawings. Throughout the accompanying drawings, the same or similar reference numerals indicate the same or similar elements. It should be understood that the accompanying drawings are schematic diagrams and that parts and elements are not necessarily drawn to scale. [Figure 1] This is a flowchart of the video editing method provided in the embodiments of this disclosure. [Figure 2] This is a schematic diagram of the template post settings page provided in the embodiments of this disclosure. [Figure 3] This is a flowchart of the video editing method provided in the embodiments of this disclosure. [Figure 4] This is a flowchart of the video editing method provided in the embodiments of this disclosure. [Figure 5] This is a schematic diagram of the template detail page provided in the embodiments of this disclosure. [Figure 6] This is a schematic diagram of the configuration of a video editing device provided in the embodiments of this disclosure. [Figure 7] This is a schematic diagram of the configuration of an electronic device provided by an embodiment of the present disclosure. [Modes for carrying out the invention]

[0011] The embodiments of this disclosure will be described in more detail below with reference to the attached drawings. While the drawings show several embodiments of this disclosure, it should be understood that this disclosure is achievable in various forms and should not be construed as being limited to the embodiments described herein. Rather, these embodiments are provided to provide a more complete and thorough understanding of this disclosure. The drawings and embodiments of this disclosure are for illustrative purposes only and should not be used to limit the scope of protection of this disclosure.

[0012] It should be understood that each step described in the embodiments of the method of this disclosure may be performed in a different order and / or in parallel. Furthermore, embodiments of the method may include additional steps and / or omit the performance of the indicated steps. The scope of this disclosure is not limited in this respect.

[0013] As used herein, the expression “including” and its variations thereof are open inclusions, meaning “including but not limited to XXX.” The expression “based on XXX” means “at least partially based on XXX.” The expression “one embodiment” means “at least one embodiment.” The expression “another embodiment” means “at least one other embodiment.” The expression “several embodiments” means “at least several embodiments.” Related definitions of other terms are given below.

[0014] It should be noted that the concepts such as “first,” “second,” etc., as used in this disclosure are used solely to distinguish different devices, modules, or units, and do not limit the order or interdependence of the functions performed by these devices, modules, or units.

[0015] It should be noted that the modifiers “one” and “multiple” as used in this disclosure are illustrative and non-exclusive, and should be understood by those skilled in the art as “one or more” unless otherwise specified in the context.

[0016] It should be understood that before using any of the technical forms disclosed in each embodiment of this disclosure, the user should be informed of the type, scope, and context of use of the personal information related to this disclosure, and the user's consent should be obtained through appropriate means in accordance with applicable laws and regulations.

[0017] For example, by responding to a user's voluntary request and sending the user information, the system clearly indicates to the user that the operation being performed requires the acquisition and use of the user's personal information. This allows the user to autonomously choose, based on the information provided, whether or not to provide personal information to software or hardware such as electronic devices, applications, servers, or storage media that perform the operation in the technical form of this disclosure.

[0018] As a selective and non-restrictive implementation method, a method for responding to a user's voluntary request and sending information to the user is, for example, a pop-up window method, in which the information can be displayed as text in the pop-up window. Furthermore, the pop-up window may provide electronic devices with control over the selection of personal information by displaying "Agree" or "Disagree" for the user to choose.

[0019] As described above, the process for notifying and obtaining user consent is merely illustrative and does not limit the embodiments of this disclosure. It should be understood that other methods that comply with applicable laws and regulations are also applicable to the embodiments of this disclosure.

[0020] It should be understood that data relating to this technological form (including, but not limited to, the data itself, its acquisition, and its use) should be handled in accordance with applicable laws, regulations, and related provisions. Furthermore, during data collection and use, users' personal information will not be collected or processed in a manner that does not reveal the user's identity.

[0021] FIG. 1 is a flowchart of a video editing method provided by an embodiment of the present disclosure. The embodiments of the present disclosure are applicable to video editing scenarios, particularly video editing using a multi-language version video editing template. The method is executable by a video editing apparatus, which can be implemented in the form of software and / or hardware. Optionally, the video editing apparatus can be implemented by an electronic device, which may be a mobile terminal, a PC terminal, a server, etc.

[0022] As shown in FIG. 1, the video editing method includes steps S110, S120, S130, and S140.

[0023] In S110, a first video editing draft is obtained. The multimedia material includes a first video material and a first text material. The first text material is used to display text content in a first language.

[0024] A video editing draft is a draft of a project in which a user edits video clips and includes multimedia material and editing information. The editing information is used to indicate an editing operation on the multimedia material, and the multimedia material includes video material and text material.

[0025] In some video and audiovisual processing applications, video editing templates can be published to allow users to retain their creative ideas while performing video editing and creation, and these templates can also be shared with other users and utilized in their video editing creations. To save users time in editing and creating multilingual video editing templates, embodiments of this disclosure provide a function to automatically generate cross-language video editing templates usable for subsequent video editing from a video editing draft, eliminating the need for video editing template creators to duplicately compile video editing drafts in different language versions. In other words, after a user decides to compile a video editing draft in a single source language version, they can obtain video editing templates in at least one other language version besides the source language version. In video editing templates of different language versions, the video editing processing process and rendering display effects are all identical, with only the language corresponding to the text material content of each template differing. In some embodiments, the source language version may be understood as the template version corresponding to the language used in the editing process of the single video editing draft in question.

[0026] Specifically, in this embodiment, the first video editing draft is a video editing draft that requires cross-language processing to generate a target video editing target. The first video editing draft includes multimedia material and editing information. The editing information is used to indicate editing operations on the multimedia material, and the multimedia material includes first video material and first text material. In some embodiments, the first text material is used to display text content in a first language.

[0027] In S120, the replaceable material within the first video material is determined.

[0028] The determination of replaceable material is based on user interaction in the post-configuration interface of the video editing template. Replaceable material is material that can be substituted to support the template while other users are using the video editing template generated based on the first video editing draft.

[0029] The first text material is typically text content added by the creator of the video editing draft, and it should be understood that this text content is usually editable and can be partially or completely replaced.

[0030] In S130, a second text material is generated by converting the text content of the first language to text content of the second language based on the first text material.

[0031] In this embodiment, accessing the post-configuration stage of the video editing template determines whether the template language conversion function is enabled. That is, it determines whether the text material in the current video editing draft needs to be converted to a different language. The template language conversion function may be enabled or disabled.

[0032] The enabled state of the video editing function may be the default enabled state in the video and audiovisual processing application after the function is provided, or it may be a function enabled state set by the user. In possible embodiments, the default enabled state of the template language conversion function in the video and audiovisual processing application may be directly set to an on or off state, or it may be a default state that is automatically set by the video and audiovisual processing application based on the device system's functional support capabilities.

[0033] When the template language conversion function is enabled, the first text material is translated and the text content in the first language is converted to the second language based on the function placement attribute information of the template language conversion function. In some embodiments, the second language includes at least one language different from the first language.

[0034] In some embodiments, the functional placement attribute information includes placement attribute information such as the target text conversion language type and text conversion rules. The target text conversion language type may be one or more, for example, converting a first text material whose source language version (first language) is French into a template for at least one language version corresponding to English, German, Spanish, and / or other language types.

[0035] In the text material conversion process, first, the target text language type is determined. Then, the first text material in the source language version is input into the corresponding language conversion neural network model or other text translation tool to obtain at least one type of second text material.

[0036] In S140, the replaceable material in the first video editing draft is marked, the first text material is replaced with the second text material, and the first video editing template is generated.

[0037] In some embodiments, the first video editing template is used to introduce a second video material to replace the replaceable material, to introduce a second video editing draft, and the second video editing draft is used to composite the target video. The second video material is material that generates a new video editing result based on the first video editing template.

[0038] In the process of generating the first video editing template, marking replaceable material may involve retaining the replaceable material in the template and marking it as replaceable, or it may involve deleting the replaceable material and then marking its position in the template, allowing subsequent new video material to be placed at the correct position on the material's time track.

[0039] Following the steps described above, if text conversion is required for the Chinese version of the video editing draft, the text material within the video editing draft can be converted into single-language or multi-language text material, generating single-language or multi-language video editing templates. This eliminates the need to repeatedly organize video editing drafts for different language versions, reduces the time required for draft editing, lowers the difficulty of creating multi-language target video editing templates, and allows template users to easily obtain video editing templates corresponding to their language, thus facilitating video creation.

[0040] In the technical embodiments of the embodiments of the present disclosure, a video editing method includes obtaining a first video editing draft, wherein the first video editing draft includes multimedia material and editing information, the editing information is used to indicate editing operations on the multimedia material, the multimedia material includes first video material and first text material, the first text material is used to indicate text content in a first language, determining replaceable material within the first video material, generating second text material by converting the text content in a first language to text content in a second language based on the first text material, marking the replaceable material within the first video editing draft, replacing the first text material with the second text material, and generating a first video editing template, wherein the first video editing template is used to introduce second video material to replace the replaceable material and generate a second video editing draft, and the second video editing draft is used to composite a target video. This method solves the problem of current video processing applications not being able to automatically generate cross-language video editing templates in languages different from the language of the video editing draft. It provides an automatic translation function for video editing drafts and can automatically perform language conversion of text material in the video editing draft before the source language video editing template is published. This allows for the efficient generation of cross-language versions of video editing drafts, improves the efficiency of creating and exporting multilingual video editing templates, and expands the acceptance range of video editing templates.

[0041] Furthermore, regarding the new template language conversion function in video and audiovisual processing applications, as shown in Figure 2, a template language conversion function control can be set in the video editing template post-configuration interface. This provides the user with a function entry point and a switch for the function's enabled state in the template language conversion function interaction settings. In Figure 2, a video editing draft with five video clips (clip 1 to clip 5) is shown, and the "Language" control is the template language conversion function control. Based on the user's interaction with the template language conversion function control, the enabled state and / or placement attribute information of the template language conversion function is set.

[0042] For example, when a user clicks the "Language" control, further attribute placement controls for function switches and language version switching can be displayed. If the video processing application temporarily cannot support video editing functions, the user can click the "Language" control to gray out the next level of attribute placement controls and display a message such as "The current template does not support template language conversion."

[0043] After a user launches a video processing application, the "Language" control that appears for the first time can display information announcing the release of a new feature, such as "The template language conversion function has been activated, and you can now configure and experience the function." Of course, the application can also present the user with an introduction to the template language conversion function and instructions on how to use it through language text. If the user closes the template language conversion function, a pop-up window may appear with a message such as "Do you want to close? Activating the template language conversion function will..."

[0044] Furthermore, after a target video editing template is published, a corresponding "Language" control may be set on the template details page, allowing users to further switch between different language versions using the control on the details page. Guidance text such as "Switch Successful," "Switch Loading," and "Switch Failed" can also be displayed. It should be understood that if the template language conversion function is turned off, the language version switching control will be grayed out and unresponsive.

[0045] Figure 3 is a flowchart of another video editing method provided by an embodiment of the present disclosure. The process of generating a video editing template in the process of implementing the video editing method procedure described above will be further interpreted and explained. The method can be implemented by a video editing device, which can be implemented in software and / or hardware form. Optionally, it can be implemented by an electronic device, which may be a mobile terminal, PC terminal, or server.

[0046] As shown in Figure 3, the video editing method includes steps S210, S220, S230, S240, S250, and S260.

[0047] In S210, accessing the post settings page for video editing templates determines whether the template language conversion function is enabled.

[0048] When accessing the post settings page of a video editing template, a first video editing draft can be obtained, which includes multimedia material and editing information. The editing information is used to indicate editing operations on the multimedia material, which includes first video material and first text material, the first text material being used to display text content in a first language.

[0049] When accessing the post-configuration stage of the source language version of a video editing template, users can set template material attributes such as replaceable materials in the first video editing draft and determine the enabled state of the template language conversion function.

[0050] After the user confirms the completion of editing the video editing draft on the editing and creation page, they access the post-setting stage of the video editing template, that is, the final video editing draft setting stage before the template is published.

[0051] In video processing applications that provide a template language conversion function, it is possible to determine whether or not it is necessary to perform language conversion of draft text content for text material within a video editing draft.

[0052] In S220, if the enabled state is the function-on state, the second language is determined based on the function placement attribute information of the template language conversion function.

[0053] In some embodiments, the second language includes at least one language different from the first language.

[0054] If the template language conversion function is enabled, at least one second language is determined based on the translation placement attribute information of the template language conversion function.

[0055] The function placement attribute information includes pre-configured text conversion language information. This information may be set by the user, or it may be one or more target conversion languages that are set by default in the video processing application system.

[0056] In S230, the text category of each text content contained in the first text material within the first video editing draft is identified, and the text content to be converted within the first text material is determined based on the pre-set text conversion attributes corresponding to the text category.

[0057] In determining the text content to be converted within the first text material, first, the text category of each text content included in the first text material can be identified, and then the text content to be converted within the first text material can be determined based on the pre-set text conversion attributes corresponding to each text category.

[0058] The pre-configured text conversion attribute may be information indicating whether the text needs to be converted from the source language version (first language) of the first text material to the second language. In the first text material, the pre-configured translation attributes corresponding to different text categories will differ. For example, draft text belonging to categories such as digital, coded, or special effects text does not require text conversion and only needs to be returned as the second text material by combining the material text of the corresponding source language version and the subsequent text conversion result.

[0059] Specifically, the first text material in the source language version within the first video editing draft is divided into external language text and internal language text. In some embodiments, the external language text includes subtext categories such as cover, theme, and descriptive information, while the internal language refers to the draft text added to the video clip. The draft text within the video clip should exclude text content that does not require text conversion, such as subtitles, lyrics, and text attached to the video material.

[0060] The text content to be converted is obtained by removing text content that does not require line-by-line text conversion from the first text material. The text conversion principle followed by the text material conversion process is not to change the subject matter, template material, or screen effects. If the source language version of the text material contains corresponding characters, digital or special symbols, the corresponding source language version of the text content is returned directly without modification. If the source language version of the text material contains garbled or other unconvertible text, the source language version of the text content is returned directly.

[0061] In S240, the text content to be converted is input into a pre-configured text conversion model corresponding to the second language, and the converted text in the second language is obtained.

[0062] In a selective embodiment, the text conversion may employ artificial intelligence, inputting the text to be converted into a text conversion model that supports at least one second language, and obtaining the corresponding language-converted text. The text conversion model that supports at least one second language may be a composite model with multilingual conversion capabilities, or it may be different text conversion models that support multiple different languages. Based on the setting of the target conversion language, the corresponding text conversion model can be selected.

[0063] Before performing text conversion, the text to be converted can be further divided into paragraphs, emojis and symbols can be removed, and content that does not need to be converted, such as automatically recognized lyrics, subtitles, TTS (text-to-speech) text, and rich text, can be removed.

[0064] In S250, a second text material is generated based on the non-convertible text content in the first text material and the second language conversion text.

[0065] By directly integrating text material that does not require text conversion with a second language conversion text corresponding to the text to be converted, a second text material can be obtained.

[0066] In S260, the replaceable material of the first video editing draft is marked as described above, the first text material is replaced with the second text material, the content display effect is applied to the first video material and the second text material of the first video editing draft, and the first video editing template is generated.

[0067] The content display effect application process performed on the second text material and the first video material includes various application methods, such as screen application, character limit application, and rich text effect application.

[0068] In some implementations, screen adaptation refers to the situation where, when expressing the same meaning in different languages, the number of characters does not match, resulting in the converted text not fitting into the corresponding source language version's text box, and thus the text and screen becoming inconsistent. Screen adaptation can be achieved by automatically wrapping the text based on the length of the string. In adaptation processes with character limits, the conversion results are applied to themes and explanatory texts with character limits, and if the character limit is exceeded, only the draft text content of the corresponding source language version is retained. For rich text effects, a method that does not convert rich text is employed.

[0069] After text conversion and adaptation processing, the multilingual version of the video editing draft may be exported asynchronously in the background without increasing front-end latency. In some cases, language conversion of the text material in the video editing draft is not required, and it is sufficient to simply export the video within the video editing template corresponding to the original video editing draft. If the video editing draft does not contain draft text or only contains symbols, it is not necessary to export a separate multilingual version of the video editing template, as only the theme and description content needs to be converted text.

[0070] In some embodiments, the generated first video editing template is used to introduce second video material to replace the replaceable material and generate a second video editing draft. The second video editing draft is used to composite the target video. Users can create videos based on different language versions of the video editing template and generate new video creation results.

[0071] According to the technical embodiment of the present disclosure, when accessing the post settings page of a video editing template, the enabled state of the template language conversion function is determined, and if the enabled state is the function ON state, the second language is determined based on the function placement attribute information of the template language conversion function, the text category of each text content contained in the first text material of the first video editing draft is identified, the text content to be converted in the first text material is determined based on the pre-configured text conversion attribute corresponding to the text category, the text content to be converted is input into a pre-configured text conversion model corresponding to the second language, and the second language converted text is obtained. Based on the non-convertible text content in the first text material and the second language-converted text, a second text material is generated; the replaceable material in the first video editing draft is marked as described above; the first text material is replaced with the second text material; content display effect application processing is performed on the first video material and the second text material in the first video editing draft; a first video editing template is generated; the generated first video editing template is used to introduce the second video material for replacing the replaceable material and to generate a second video editing draft; and the second video editing draft is used to composite the target video. This method solves the problem of current video processing applications not being able to automatically generate cross-language video editing templates in languages different from the language of the video editing draft. It provides an automatic translation function for video editing drafts and can automatically perform language conversion of text material in the video editing draft before the source language video editing template is published. This allows for the efficient generation of cross-language versions of video editing drafts, improves the efficiency of creating and exporting multilingual video editing templates, and expands the acceptance range of video editing templates.

[0072] Figure 4 is a flowchart of a video editing method provided by an embodiment of the present disclosure. Based on the above embodiment, the setting of a template switching function, which is performed at the consumption stage for multilingual versions of video editing templates, will be further explained. This method can be implemented by a video editing device, which can be implemented by software and / or hardware. Optionally, it can be implemented by an electronic device, which may be a mobile terminal, PC terminal, or server.

[0073] As shown in Figure 4, the video editing method includes steps S310, S320, S330, S340, S350, S360, S370 and S380.

[0074] In S310, the enabled state of the template language conversion function is determined when accessing the post settings page for a video editing template.

[0075] In S320, if the enabled state is the function-on state, the second language is determined based on the function placement attribute information of the template language conversion function.

[0076] In S330, the text category of the text content included in the first text material of the first video editing draft is identified, and the text content to be converted within the first text material is determined based on the pre-configured text conversion attributes corresponding to the text category.

[0077] In S340, the text content to be converted is input into a pre-configured text conversion model corresponding to the second language, and the converted text in the second language is obtained. In S350, a second text material is determined based on the second language conversion text.

[0078] Specifically, the content in the first text material that does not require text conversion is combined with the second language conversion text to determine the second text material.

[0079] In S360, the replaceable material in the first video editing draft is marked, content adaptation processing is performed on the second text material and the first video material in the first video editing draft, and the first video editing template is generated.

[0080] S370 generates a second video editing template based on a first video editing draft in which replaceable material is marked, and displays the first or second video editing template in a language version that is compatible with the language of the application environment in which the video editing template is displayed or used.

[0081] In some embodiments, the second video editing template is a video editing template that has not yet undergone text conversion, corresponding to the first video editing draft. When creating a template, multiple video editing templates can be generated based on the first video editing draft. For example, multiple corresponding video editing templates can be generated by converting the first text material into multiple versions, such as a second language text material and a third language text material.

[0082] Based on the language of the application environment in which the video editing template is displayed or applied, the system can display the language version of the video editing template that corresponds to the language of the application environment. For example, if the terminal system used by the target video editing template creator switches from a first language to a second language, the system will respond by prioritizing the display of the second language version of the video editing template when the creator views the template.

[0083] In S380, in response to a template version switching operation, the first video editing template or the second video editing template is switched and displayed.

[0084] In this embodiment, the video processing application system automatically and preferentially displays a video editing template in a specific language version to the user, and also provides the user with a function to switch between different versions of the template.

[0085] For example, the template details page of the video editing template shown in Figure 5 has multiple template interaction options, and specific interaction items can be configured according to specific requirements. The schematic diagram of this details page applies when a general user, both the creator and non-creator of the video editing template, previews the template. The "Language" control corresponds to the template language conversion function and allows access to the next level of option menu items. In the next level of "Language" menu items, it is possible to set whether or not to display multilingual versions. When this item is turned on, any language version of the target video editing template for a specific display target can be provided to the user, and the user can select the target video editing version themselves. When this item is turned off, only the source language version of the target video editing template is displayed.

[0086] Furthermore, in this embodiment, consumption information for the first video editing template is determined based on the usage status of the first video editing template, consumption information for the second video editing template is determined based on the usage status of the second video editing template, and target consumption information for displaying the usage status of the target video editing template is determined based on the consumption information for the first video editing template and the consumption information for the second video editing template, and the target video editing template displays any video editing template generated based on the first video editing draft. When a text material conversion process is performed based on the user's video editing draft and multiple different language versions of the video editing template are generated, consumption statistics information for the different language versions of the video editing template are added together to obtain total consumption statistics information for the template corresponding to the video editing draft, and this information is displayed to the user to promote the user's video creation.

[0087] According to the technical embodiment of the present disclosure, when accessing the post settings page of a video editing template, the enabled state of the template language conversion function is checked, and if the enabled state is the function ON state, the second language is determined based on the function placement attribute information of the template language conversion function, the text category of each text content contained in the first text material of the first video editing draft is identified, the text content to be converted in the first text material is determined based on the pre-configured text conversion attribute corresponding to the text category, the text content to be converted is input into a pre-configured text conversion model corresponding to the second language, and the second language converted text is obtained. The system marks the replaceable material in the first video editing draft, performs content adaptation processing on the second text material and the first video material in the first video editing draft to generate a first video editing template, generates a second video editing template based on the first video editing draft with the replaceable material marked, displays the first or second video editing template in a language compatible with the language of the application environment based on the language of the application environment to which the video editing template is displayed or applied, and switches and displays the first or second video editing template in response to a template version switching operation. This method solves the problem of current video processing applications not being able to automatically generate cross-language video editing templates in languages different from the language of the video editing draft. It provides an automatic translation function for video editing drafts and can automatically perform language conversion of text material in the video editing draft before the source language video editing template is published. This allows for the efficient generation of cross-language versions of video editing drafts, improves the efficiency of creating and exporting multilingual video editing templates, and expands the acceptance range of video editing templates.

[0088] Figure 6 shows a video editing device provided by an embodiment of the present disclosure. The device is applicable to video editing scenarios, particularly video editing based on multilingual video editing templates. The video editing device can be implemented by software and / or hardware and can be deployed on an electronic device, which may be a mobile terminal, PC terminal or server, etc.

[0089] As shown in Figure 6, the video editing device includes an original video editing draft acquisition module 410, a replaceable material determination module 420, a text material conversion module 430, and a video editing template generation module 440.

[0090] In some embodiments, the original video editing draft acquisition module 410 is used to acquire a first video editing draft, the first video editing draft includes multimedia material and editing information, the editing information is used to indicate editing operations on the multimedia material, the multimedia material includes first video material and first text material, and the first text material is used to display text content in a first language. The replaceable material determination module 420 is used to determine replaceable material within the first video material. The text material conversion module 430 is used to generate second text material by converting the text content in a first language to text content in a second language based on the first text material. The video editing template generation module 440 is used to mark the replaceable material within the first video editing draft, replace the first text material with the second text material, and generate a first video editing template. The first video editing template is used to introduce a second video material to replace the replaceable material and generate a second video editing draft, which is then used to composite the target video.

[0091] In the technical embodiments of the embodiments of the present disclosure, a first video editing draft is obtained, the first video editing draft includes multimedia material and editing information, the editing information is used to indicate editing operations on the multimedia material, the multimedia material includes first video material and first text material, the first text material is used to display text content in a first language, replaceable material is determined within the first video material, second text material is generated by converting the text content in a first language to text content in a second language based on the first text material, the replaceable material is marked within the first video editing draft, the first text material is replaced with the second text material, a first video editing template is generated, the first video editing template is used to introduce second video material to replace the replaceable material, a second video editing draft is generated, and the second video editing draft is used to composite a target video. This method solves the problem of current video processing applications not being able to automatically generate cross-language video editing templates in languages different from the language of the video editing draft. It provides an automatic translation function for video editing drafts and can automatically perform language conversion of text material in the video editing draft before the source language video editing template is published. This allows for the efficient generation of cross-language versions of video editing drafts, improves the efficiency of creating and exporting multilingual video editing templates, and expands the acceptance range of video editing templates.

[0092] In an optional embodiment, the video editing device further includes a text material conversion function determination module. The text material conversion function determination module is used to check the enabled state of the template language conversion function when accessing the post settings page of the video editing template before generating the second text material, and if the enabled state is the function on state, it determines the second language based on the function placement attribute information of the template language conversion function, wherein the second language includes at least one language different from the first language.

[0093] In a selective embodiment, the text material conversion module 430 is used to specifically identify the text category of each text content included in the first text material, determine the text content to be converted within the first text material based on pre-configured text conversion attributes corresponding to the text category, input the text content to be converted into a pre-configured text conversion model corresponding to the second language, obtain the second language converted text, and generate the second text material based on the non-converted text content within the first text material and the second language converted text.

[0094] In an optional embodiment, the video editing template generation module 440 may be used to perform content display effect application processing on the first video material and the second text material in the first video editing draft.

[0095] In a selective embodiment, the video editing template device further includes a template language conversion function setting module. The template language conversion function setting module is used to set template language conversion function controls on the post-setting page and the template detail page of the first video editing template, respectively, and to set the enabled state and / or placement attribute information of the template language conversion function in response to interaction operations acquired by the template language conversion function controls.

[0096] In an optional embodiment, the video editing template device further includes a template display module. The template display module is used to generate a second video editing template based on a first video editing draft on which replaceable material is marked, and to display a language version of the first or second video editing template that is compatible with the language of the application environment on which the video editing template is displayed or applied.

[0097] In a selective embodiment, the template display module may be used to switch and display the first video editing template or the second video editing template in response to a template version switching operation.

[0098] In a selective embodiment, the video editing template device further includes a template consumption data statistics module. The template consumption data statistics module determines consumption information for the first video editing template based on the usage status of the first video editing template, determines consumption information for the second video editing template based on the usage status of the second video editing template, and determines target consumption information indicating the usage status of a target video editing template based on the consumption information for the first video editing template and the consumption information for the second video editing template. The target video editing template represents any video editing template generated based on the first video editing draft.

[0099] The video editing apparatus provided in the embodiments of this disclosure is capable of performing the video editing method provided in any embodiment of this disclosure and comprises a functional module and beneficial effects corresponding to the performance of the method.

[0100] It should be noted that the units and modules included in the above-described apparatus are merely categorized based on functional logic and are not limited to such classifications, as long as they can realize the corresponding functions. Furthermore, the specific names of each functional unit are merely for distinguishing purposes and do not limit the scope of protection of the embodiments of this disclosure.

[0101] Figure 7 is a schematic diagram of the configuration of an electronic device provided by an embodiment of the present disclosure. Below, Figure 7 shows a schematic diagram of a configuration suitable for realizing the electronic device (e.g., terminal device or server in Figure 7) 500 of an embodiment of the present disclosure. The terminal devices in the embodiment of the present disclosure include, but are not limited to, mobile terminals such as mobile phones, laptop computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), and in-vehicle terminals (e.g., in-vehicle navigation terminals), and fixed terminals such as digital TVs and desktop computers. The electronic device shown in Figure 16 is merely an example and should not impose any limitations on the functions and scope of use of the embodiment of the present disclosure.

[0102] As shown in Figure 7, the electronic device 500 includes a processing unit (e.g., a central processing unit, a graphics processor, etc.) 501, which can perform various appropriate operations and processes based on programs stored in read-only memory (ROM) 502 or programs loaded from storage device 508 into random access memory (RAM) 503. RAM 503 further stores various programs and data necessary for the operation of the electronic device 500. The processing unit 501, ROM 502, and RAM 503 are connected to each other via a bus 504. An edit / output (I / O) interface 505 is also connected to the bus 504.

[0103] Typically, the I / O interface 505 can be connected to input devices 506 such as touchscreens, touchpads, keyboards, mice, cameras, microphones, accelerometers, and gyroscopes; output devices 507 such as liquid crystal displays (LCDs), speakers, and vibrators; storage devices 508 such as magnetic tape and hard disks; and communication devices 509. The communication device 509 enables the electronic device 500 to communicate with other equipment wirelessly or via wired connections to exchange data. Figure 7 shows an electronic device 500 with various devices, but it should be understood that it is not necessary to implement or include all of the devices shown. More or fewer devices may be implemented or included as alternatives.

[0104] In particular, according to embodiments of the present disclosure, the processes described with reference to the flowchart described above can be implemented as a computer software program. For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-temporary computer-readable medium, the computer program comprising program code for performing the method shown in the flowchart. In such embodiments, the computer program can be downloaded and installed from a network via a communication device 509, or installed from a storage device 508, or installed from a ROM 502. When executed by a processing unit 501, the computer program performs the functions limited to the methods of embodiments of the present disclosure.

[0105] The names of messages or information exchanged between multiple devices in embodiments of this disclosure are for illustrative purposes only and do not limit the scope of such messages or information.

[0106] The electronic devices provided in the embodiments of this disclosure belong to the same inventive concept as the video editing method provided in the above-described embodiments. Technical features not described in detail in these embodiments can be referred to in the above-described embodiments, and these embodiments have the same beneficial effects as the above-described embodiments.

[0107] The embodiments of this disclosure provide a computer storage medium that stores a program, when executed by a processor, that implements the video editing method provided by the embodiments.

[0108] The computer-readable medium described herein may be a computer-readable signal medium, a computer-readable storage medium, or any combination thereof. A computer-readable storage medium is, for example, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples of computer-readable storage media include, but are not limited to, electrical connections having one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fibers, portable compact disk read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination thereof. In this disclosure, a computer-readable storage medium is any physical medium that contains or stores a program used by or in combination with an instruction execution system, apparatus, or device. On the other hand, in this disclosure, a computer-readable signal medium includes a data signal transmitted in baseband or as part of a carrier wave, wherein a computer-readable program code is carried on the data signal. The data signals transmitted in this manner can take various forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination thereof. The computer-readable signal medium may be any computer-readable medium other than a computer-readable storage medium, and is used to transmit, transmit or communicate programs used by or in combination with instruction execution systems, apparatus, or devices. The program code contained in the computer-readable medium is transmittable by any suitable medium, including but not limited to cables, optical cables, RF (radio frequency), etc., or any suitable combination thereof.

[0109] In some embodiments, clients and servers may communicate using any currently known or future-developed network protocol, such as HTTP (Hypertext Transfer Protocol), and may connect to digital data communications (e.g., communication networks) in any form or medium. Examples of communication networks include local area networks ("LANs"), wide area networks ("WANs"), internetworks (e.g., the Internet), and peer-to-peer networks (e.g., ad-hoc peer-to-peer networks), as well as any currently known or future-developed networks.

[0110] The computer-readable media described above may be included in the electronic device described above, or it may exist independently and not be incorporated into the electronic device.

[0111] The computer-readable medium described above carries one or more programs, and when the one or more programs described above are executed by the electronic device, the electronic device is caused to perform the following: obtain a first video editing draft, wherein the first video editing draft includes multimedia material and editing information, the editing information is used to indicate editing operations on the multimedia material, the multimedia material includes first video material and first text material, the first text material is used to indicate text content in a first language; determine replaceable material within the first video material; generate second text material by converting the text content in a first language to text content in a second language based on the first text material; mark the replaceable material within the first video editing draft, replace the first text material with the second text material, and generate a first video editing template, the first video editing template is used to introduce second video material to replace the replaceable material, generate a second video editing draft, and the second video editing draft is used to composite a target video.

[0112] Computer program code for performing the operations described herein can be organized using one or more programming languages or a combination thereof, and the programming languages described herein include, but are not limited to, object-oriented programming languages such as Java®, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language, or similar programming languages. The program code may run entirely on the user's computer, partially on the user's computer, run as a standalone software package, run partially on the user's computer and partially on a remote computer, or run entirely on a remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer or an external computer (for example, via the Internet using an Internet service provider) via any type of network, including a local area network (LAN) or a wide area network (WAN).

[0113] The flowcharts and block diagrams in the drawings illustrate the architecture, functions, and operations that can be realized by the systems, methods, and computer program products in each embodiment of the present disclosure. In this regard, each block in the flowchart or block diagram represents a module, program segment, or part of code containing one or more executable instructions for realizing a given logical function. It should also be noted that in some alternative embodiments, the functions described in the blocks may occur in a different order than that shown. For example, two consecutively shown blocks may actually be executed nearly parallel or in reverse order, depending on the functions involved. It should also be noted that each block in the block diagram and / or flowchart, and any combination of blocks in the block diagram and / or flowchart, can be realized by a dedicated system based on hardware that performs a given function or operation, or by a combination of dedicated hardware and computer instructions.

[0114] The units described in the embodiments of this disclosure can be implemented in software or in hardware. The names of the units are not limiting to the units themselves. For example, the first acquisition unit may be described as "a unit for acquiring at least two Internet Protocol addresses."

[0115] The functions described above may be performed, at least in part, by one or more hardware logic components. For example, typical types of hardware logic components that can be used include, but are not limited to, field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), application-specific standard products (ASSPs), systems on chips (SOCs), and complex programmable logic devices (CPLDs).

[0116] In this disclosure, a device-readable medium is a physical medium containing or storing a program used in a command execution system, apparatus or equipment, or a combination of command execution systems, apparatus or equipment. A device-readable medium is a device-readable signal medium or a device-readable storage medium. A device-readable medium is an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or any combination thereof, but is not limited to these. More specific examples of device-readable storage media include, but are not limited to, electrical connections having one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fibers, portable compact disk read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination thereof.

[0117] Embodiments of the present disclosure further provide a computer program product which includes a computer program, when the computer program is executed by a processor, performs, for example, a video editing method provided by any embodiment of the present disclosure.

[0118] In the process of realizing a computer program product, computer program code for performing the operations described herein can be organized using one or more programming languages or a combination thereof. These programming languages include, but are not limited to, object-oriented programming languages such as Java®, Smalltalk, and C++, and may also include conventional procedural programming languages such as "C" or similar programming languages. The program code may run entirely on the user's computer, partially on the user's computer, as a standalone software package, partially on the user's computer, partially on a remote computer, or entirely on a remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer or an external computer (for example, via the Internet using an Internet service provider) through any type of network, including a local area network (LAN) or wide area network (WAN).

[0119] According to one or more embodiments of the present disclosure, [Example 1] provides a video editing method, the method comprising: obtaining a first video editing draft, the first video editing draft comprising multimedia material and editing information, the editing information being used to indicate editing operations on the multimedia material, the multimedia material comprising first video material and first text material, the first text material being used to indicate text content in a first language; determining replaceable material within the first video material; generating second text material by converting the text content in a first language to text content in a second language based on the first text material; marking the replaceable material within the first video editing draft, replacing the first text material with the second text material, and generating a first video editing template, the first video editing template being used to introduce second video material for replacing the replaceable material and generate a second video editing draft, the second video editing draft being used to composite a target video.

[0120] According to one or more embodiments of the present disclosure, [Example 2] provides a video editing method, in some selective implementations, before generating the second text material, the method further includes determining the enabled state of a template language conversion function when entering the post-settings page of a video editing template, and, if the enabled state is the function-on state, determining the second language based on the function placement attribute information of the template language conversion function, wherein the second language includes at least one language of a different kind from the first language.

[0121] According to one or more embodiments of the present disclosure, [Example III] provides a video editing method, in several selective implementations, for generating a second text material by converting the text content of the first language to text content of the second language as described above, the method includes: identifying the text category of each text content contained in the first text material; determining the text content to be converted in the first text material based on pre-configured text conversion attributes corresponding to the text category; inputting the text content to be converted into a pre-configured text conversion model corresponding to the second language to obtain second language converted text; and generating the second text material based on the non-converted text content in the first text material and the second language converted text.

[0122] According to one or more embodiments of the present disclosure, [Example IV] provides a video editing method, in some optional implementations, which further includes marking the replaceable material in the first video editing draft as described above, replacing the first text material with the second text material, and generating the first video editing template, by performing a content display effect adaptation process on the first video material and the second text material in the first video editing draft.

[0123] According to one or more embodiments of the present disclosure, [Example V] provides a video editing method, in some optional implementations, the method further includes setting a template language conversion function control on the post-settings page and the template details page of the first video editing template, respectively, and setting the enabled state and / or placement attribute information of the template language conversion function by interaction operations obtained in response to the template language conversion function control.

[0124] According to one or more embodiments of the present disclosure, [Example VI] provides a video editing method, in some optional implementations, the method further includes generating a second video editing template from a first video editing draft marked with replaceable material, and displaying a language version of the first or second video editing template that matches the language of the application environment, depending on the language of the application environment in which the video editing template is displayed or applied.

[0125] According to one or more embodiments of the present disclosure, [Example VII] provides a video editing method, in some selective implementations, the method further includes switching and displaying the first video editing template or the second video editing template in response to a template version switching operation.

[0126] According to one or more embodiments of the present disclosure, [Example VIII] provides a video editing method, in some optional implementations, the method further includes determining consumption information for the first video editing template based on the usage status of the first video editing template, determining consumption information for the second video editing template based on the usage status of the second video editing template, and determining target consumption information for displaying the usage status of a target video editing template based on the consumption information for the first video editing template and the consumption information for the second video editing template, wherein the target video editing template represents any of the video editing templates generated based on the first video editing draft.

[0127] According to one or more embodiments of the present disclosure, [Example 9] is an original video editing draft acquisition module for providing a video editing apparatus and acquiring a first video editing draft, wherein the first video editing draft includes multimedia material and editing information, the editing information is used to indicate editing operations on the multimedia material, the multimedia material includes first video material and first text material, and the first text material is used to display text content in a first language; and a replaceable material determination module is used to determine replaceable material within the first video material. The system includes a text material conversion module used to generate a second text material by converting the text content of a first language to text content of a second language based on the first text material, and a video editing template generation module that marks the replaceable material in the first video editing draft, replaces the first text material with the second text material, and generates a first video editing template, wherein the first video editing template is used to introduce the second video material to replace the replaceable material and generate a second video editing draft, and the second video editing draft is used to composite a target video.

[0128] According to one or more embodiments of the present disclosure, [Example 10] provides a video editing device, in an optional embodiment, the video editing device further includes a text material conversion function determination module, which is used to determine the enabled state of a template language conversion function when accessing the post-settings page of a video editing template before generating the second text material, and if the enabled state is a function-on state, it determines the second language based on the function placement attribute information of the template language conversion function, wherein the second language includes at least one language different from the first language.

[0129] According to one or more embodiments of the present disclosure, [Example 11] provides a video editing device, in an optional embodiment, the text material conversion module is used to specifically identify text categories of text content contained in the first text material, determine the text content to be converted in the first text material based on pre-configured text conversion attributes corresponding to the text categories, input the text content to be converted into a pre-configured text conversion model corresponding to the second language, obtain second language converted text, and generate a second text material based on the non-converted text content in the first text material and the second language converted text.

[0130] According to one or more embodiments of the present disclosure, [Example 12] provides a video editing apparatus, in an optional embodiment, the video editing template generation module is used to perform content display effect adaptation processing on the first video material and the second text material in the first video editing draft.

[0131] According to one or more embodiments of the present disclosure, [Example 13] provides a video editing device, in an optional embodiment, the video editing template device further includes a template language conversion function setting module, the template language conversion function setting module is used to set template language conversion function controls on the post setting page and the template detail page of the first video editing template, respectively, and to set the enabled state and / or placement attribute information of the template language conversion function by interaction operations obtained in response to the template language conversion function controls.

[0132] According to one or more embodiments of the present disclosure, [Example 14] provides a video editing device, in an optional embodiment, the video editing template device further includes a template display module, the template display module generating a second video editing template based on a first video editing draft on which replaceable material is marked, and displaying the first or second video editing template in a language version that is compatible with the language of the application environment, based on the language of the application environment on which the video editing template is displayed or applied.

[0133] According to one or more embodiments of the present disclosure, [Example 15] provides a video editing device, in which, in an optional embodiment, the template display module is also used to switch and display the first video editing template or the second video editing template in response to a template version switching operation.

[0134] According to one or more embodiments of the present disclosure, [Example 16] provides a video editing device, in an optional embodiment, the video editing template device further includes a template consumption data statistics module, which is used to determine consumption information for the first video editing template based on the usage of the first video editing template, to determine consumption information for the second video editing template based on the usage of the second video editing template, and to determine target consumption information for displaying the usage of a target video editing template based on the consumption information for the first video editing template and the consumption information for the second video editing template, wherein the target video editing template represents one of the video editing templates generated based on the first video editing draft.

[0135] The above description is merely a description of preferred embodiments and applicable technical principles of the present disclosure. Those skilled in the art will understand that the scope of the disclosure is not limited to technical modes comprising specific combinations of the above-described technical features, but should also encompass other technical modes comprising arbitrary combinations of the above-described technical features or equivalent features, without departing from the spirit of the disclosure. For example, it should be understood that it should encompass technical modes comprising the mutual substitution of the above-described features with similar functional technical features disclosed herein (but not limited to these).

[0136] Furthermore, although each operation is described in a specific order, it should not be understood that these operations must necessarily be performed in the specific order or sequence shown. In certain environments, multitasking and parallel processing may be advantageous. Similarly, the foregoing statements should not be construed as limitations on the scope of this disclosure, including some specific implementation details. Certain features described in the context of individual embodiments may also be realized by combining them in a single embodiment. Conversely, various features described in the context of a single embodiment may also be realized individually in multiple embodiments or by any appropriate sub-combination.

[0137] Although the present invention has been described in a language specific to structural features and / or methodological logic, it should be understood that the subject matter limited by the appended claims is not necessarily limited to the specific features or operations described above. Conversely, the specific features and operations described above are merely exemplary forms of realizing the claims.

Claims

1. This is a video editing method, Obtaining a first video editing draft, wherein the first video editing draft includes multimedia material and editing information, the editing information is used to indicate editing operations on the multimedia material, the multimedia material includes first video material and first text material, and the first text material is used to indicate text content in a first language. To determine the replaceable material within the first video material, Based on the first text material, a second text material is generated by converting the text content in the first language to text content in the second language. This includes marking the replaceable material in the first video editing draft, replacing the first text material with the second text material, and generating the first video editing template. A video editing method wherein the first video editing template is used to introduce a second video material for replacing the replaceable material and to generate a second video editing draft, and the second video editing draft is used to composite a target video.

2. Before generating the second text material, the method, When accessing the post settings page for a video editing template, the enabled state of the template language conversion function is determined, The method according to claim 1, further comprising determining the second language based on the function placement attribute information of the template language conversion function when the enabled state is a function-on state, wherein the second language includes at least one language of a different type from the first language.

3. The process of generating a second text material by converting the text content of the first language to text content of the second language is as follows: Identifying the text category of each text content included in the first text material described above, Based on the pre-configured text conversion attributes corresponding to the aforementioned text category, the text content to be converted within the first text material is determined, The process involves inputting the text content to be converted into a pre-configured text conversion model corresponding to the second language, and obtaining the converted text in the second language. The method according to claim 1 or 2, comprising generating a second text material based on the non-convertible text content in the first text material and the second language-converted text.

4. Marking the replaceable material in the first video editing draft, replacing the first text material with the second text material, and generating the first video editing template are: The method according to claim 1, further comprising applying a content display effect to the first video material and the second text material in the first video editing draft.

5. The template language conversion function control is set on the aforementioned post settings page and the template details page of the first video editing template, respectively. The method according to claim 2, further comprising setting the enabled state and / or placement attribute information of the template language conversion function by interaction operations obtained in response to the template language conversion function control.

6. The process involves generating a second video editing template from a first video editing draft with replaceable elements marked, The method according to claim 1, further comprising displaying a language version of the first or second video editing template that matches the language of the application environment, depending on the language of the application environment to which the video editing template is displayed or applied.

7. The method according to claim 6, further comprising switching and displaying the first video editing template or the second video editing template in response to a template version switching operation.

8. Based on the usage status of the first video editing template, the consumption information for the first video editing template is determined, and based on the usage status of the second video editing template, the consumption information for the second video editing template is determined. The method further includes determining target consumption information for displaying the usage status of a target video editing template based on the consumption information of the first video editing template and the consumption information of the second video editing template, The method according to claim 6, wherein the target video editing template indicates one of the video editing templates generated based on the first video editing draft.

9. It is a video editing device, An original video editing draft acquisition module for obtaining a first video editing draft, wherein the first video editing draft includes multimedia material and editing information, the editing information is used to indicate editing operations on the multimedia material, the multimedia material includes first video material and first text material, and the first text material is used to display text content in a first language; A replaceable material determination module used to determine replaceable material within the first video material, A text material conversion module used to generate a second text material by converting the text content of the first language to text content of the second language based on the first text material, The system includes a video editing template generation module that marks the replaceable material in the first video editing draft, replaces the first text material with the second text material, and generates a first video editing template, A video editing apparatus in which the first video editing template is used to introduce a second video material for replacing the replaceable material and to generate a second video editing draft, and the second video editing draft is used to composite a target video.

10. It is an electronic device, One or more processors, A storage device for storing one or more programs, An electronic device that, when the one or more programs described above are executed by the one or more processors, causes the one or more processors to implement, for example, the video editing method described in any one of claims 1 to 8.

11. A computer-readable storage medium storing a computer program that, when executed by a processor, realizes, for example, the video editing method described in any one of claims 1 to 8.

12. A computer program product including a computer program that, when executed by a processor, realizes, for example, the video editing method described in any one of claims 1 to 8.