Interaction method and apparatus, electronic device, and storage medium
By configuring keyword selection and display on the page, users can quickly generate media files that meet their expectations, solving creative obstacles caused by a lack of experience and inspiration, and making media file generation convenient.
Patent Information
- Authority / Receiving Office
- WO · WO
- Patent Type
- Applications
- Current Assignee / Owner
- BEIJING ZITIAO NETWORK TECH CO LTD
- Filing Date
- 2025-12-11
- Publication Date
- 2026-07-02
AI Technical Summary
Non-professional users often lack experience and inspiration when generating media files, making it difficult to accurately describe the desired style, which leads to creative obstacles.
A configuration page is displayed that includes a first area and a second area. The first area contains multiple keywords that characterize the style of a media file. After the user selects a target keyword, it is displayed in the second area; on confirmation, a media file with the corresponding style is displayed.
It reduces the difficulty of generating media files, helping users, even those with little experience, to determine the desired style and quickly generate media files that meet expectations.
Abstract
Description
Interaction methods, devices, electronic devices, and storage media
[0001] Cross-references to related applications
[0002] This application claims priority to Chinese Patent Application No. 202411919065.6, filed on December 24, 2024, entitled "Interactive Method, Apparatus, Electronic Device and Storage Medium", the entire contents of which are incorporated herein by reference.
Technical Field
[0003] This disclosure relates to the field of artificial intelligence technology, and in particular to an interaction method, apparatus, electronic device, and storage medium.
Background Technology
[0004] With the rapid development of technology, media file generation technology has become a significant achievement in the digital field. Utilizing advanced algorithmic models, it is possible to generate corresponding media files based on specific input instructions or parameters. This technology has brought innovative opportunities to numerous fields.
Summary of the Invention
[0005] This disclosure provides an interaction method, apparatus, electronic device, and storage medium.
[0006] In a first aspect, this disclosure provides an interaction method, including:
[0007] A configuration page is displayed, which includes a first area and a second area; the first area includes multiple keywords; the keywords are used to characterize the style of the media file.
[0008] In response to the selection of a target keyword in the first area, the target keyword is displayed in the second area of the configuration page;
[0009] In response to the confirmation operation command, the first media file is displayed in conjunction with the style of the media file represented by the target keywords shown in the second area.
[0010] In a second aspect, this disclosure also provides an interaction apparatus, including:
[0011] The display module is used to display the configuration page, which includes a first area and a second area; the first area includes multiple keywords; the keywords are used to characterize the style of the media file.
[0012] The selection module is used to display the target keyword in the second area of the configuration page in response to the selection operation of the target keyword in the first area;
[0013] The generation module is used to display the first media file in response to the confirmation operation command, in conjunction with the style of the media file represented by the target keywords displayed in the second area.
[0014] In a third aspect, this disclosure also provides an electronic device, the electronic device comprising:
[0015] One or more processors;
[0016] A storage device for storing one or more programs;
[0017] When the one or more programs are executed by the one or more processors, the one or more processors implement the interaction method as described above.
[0018] In a fourth aspect, this disclosure also provides a computer-readable storage medium having a computer program stored thereon that, when executed by a processor, implements the interaction method described above.
Attached Figure Description
[0019] The accompanying drawings, which are incorporated in and form a part of this specification, illustrate embodiments consistent with this disclosure and, together with the description, serve to explain the principles of this disclosure.
[0020] To more clearly illustrate the technical solutions in the embodiments of this disclosure or the prior art, the accompanying drawings used in the description of the embodiments or the prior art will be briefly introduced below. Obviously, those skilled in the art can obtain other drawings based on these drawings without creative effort.
[0021] Figure 1 is a flowchart of an interaction method provided in an embodiment of this disclosure;
[0022] Figures 2-9 are schematic diagrams of a terminal display interface provided in an embodiment of this disclosure;
[0023] Figure 10 is a schematic diagram of the structure of an interactive device according to an embodiment of this disclosure;
[0024] Figure 11 is a schematic diagram of the structure of an electronic device according to an embodiment of this disclosure.
Detailed Implementation
[0025] To better understand the above-mentioned objectives, features, and advantages of this disclosure, the solutions disclosed herein will be further described below. It should be noted that, unless otherwise specified, the embodiments and features described herein can be combined with each other.
[0026] Numerous specific details are set forth in the following description in order to provide a full understanding of this disclosure, but this disclosure may also be implemented in other ways different from those described herein; obviously, the embodiments in the specification are only some, and not all, of the embodiments of this disclosure.
[0027] Currently, many non-professional users still face the challenge of accurately describing scenes and constructing creative ideas. Especially for those lacking creative experience or inspiration, facing a blank page often leaves them feeling lost and even discouraged. Therefore, how to help users overcome the barriers created by a lack of experience and inspiration to generate and create media files is a pressing issue that needs to be addressed.
[0028] The technical solution provided in this disclosure has the following advantages compared with the prior art: The technical solution provided in this disclosure displays a configuration page, which includes a first area and a second area. The first area includes multiple keywords; the keywords are used to characterize the style of the media file. In response to the selection of a target keyword in the first area, the target keyword is displayed in the second area of the configuration page. In response to a confirmation command, the first media file is displayed, combining the media file style characterized by the target keywords displayed in the second area. Essentially, it uses options to display the available media file styles for the user to choose from. Thus, even if the user lacks experience and inspiration, making it difficult to describe the desired style of the generated media file, or lacks understanding of media file styles, they can still determine the required style by selecting keywords, thereby reducing the difficulty of using media file generation technology.
[0029] Figure 1 is a flowchart of an interaction method provided in an embodiment of this disclosure. This embodiment can be applied to the situation where media files are generated in the client. The method can be executed by an interaction device, which can be implemented in software and / or hardware. The device can be configured in an electronic device, such as a terminal, including but not limited to smartphones, PDAs, tablets, wearable devices with displays, desktops, laptops, all-in-ones, smart home devices, etc.
[0030] As shown in Figure 1, the method may specifically include:
[0031] S110. Display the configuration page, which includes a first area and a second area; the first area includes multiple keywords; the keywords are used to characterize the style of the media file.
[0032] A configuration page can be, for example, a page that helps users experience the artistic expression of a media file's style. In some scenarios, the configuration page can also help users configure information used to guide the generation of the media file. Media files can include at least one of the following: images, videos, and audio. Audio can be, for example, music.
[0033] The first and second areas are two distinct sections on the configuration page. The first area displays pre-defined keywords for the user to choose from. The second area displays keywords selected by the user from the first area. Subsequently, keywords displayed in the second area can be input into the media file generation model to guide it in generating media files.
[0034] In some scenarios, keywords can be terms describing the artistic effects of a media file. The keywords displayed in the first area provide one or more options for the user to choose from. Selecting a keyword indicates that the user wants to generate a media file with the artistic effect that keyword refers to. For example, where the media file is an image, keywords can be terms describing artistic effects such as image style, image color, image lighting, image portrait representation, or image framing. Keywords describing image style can be, for example, realistic, illustrative, or 3D. Keywords describing image color can be, for example, vibrant, summery, or warm. Keywords describing image lighting can be, for example, metallic, contour, or ambient lighting. Keywords describing image portrait representation can be, for example, street photography, retro, or dreamlike. Keywords describing image framing can be, for example, close-up, wide-angle, or telephoto. Where the media file is a video, keywords can be terms describing artistic effects such as video style, video camera movement, or video effects. Keywords describing video style can be, for example, illustrative, pop, or 3D. Keywords describing video camera movement can be, for example, zooming in, zooming out, or slow evolution. Keywords describing video effects can be, for example, glitches, backlogs, or explosions. Where the media file is music, keywords can be terms describing artistic effects such as singing style, sound pack, genre, or rhythm. Keywords describing singing style can be, for example, traditional, pop, or classical. Keywords describing sound packs can be, for example, male, female, or electronic music. Keywords describing genre can be, for example, hip-hop, R&B, or country. Keywords describing rhythm can be, for example, 1/16, 1/8, or 1/4 time.
[0035] In other scenarios, keywords can also be pre-given text-based prompts or reference images used as examples or templates.
[0036] S120. In response to the selection of target keywords in the first area, display the target keywords in the second area of the configuration page.
[0037] The selection of target keywords in the first area can be an action where the user chooses one or more keywords from the keywords displayed in the first area. The selected keyword is the target keyword. Specific actions for selecting target keywords in the first area can include clicking or dragging the target keyword.
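The selection step S120 can be sketched as a small data model. This is only an illustrative sketch: the class name ConfigPage, its fields, and the choice to ignore a repeated selection of the same keyword are assumptions, not details given in this disclosure.

```python
from dataclasses import dataclass, field


@dataclass
class ConfigPage:
    """Hypothetical model of the configuration page (names are illustrative)."""
    first_area: list[str]                                  # predefined keywords offered to the user
    second_area: list[str] = field(default_factory=list)   # target keywords selected so far

    def select_keyword(self, keyword: str) -> None:
        """S120: show a keyword chosen in the first area in the second area."""
        if keyword not in self.first_area:
            raise ValueError(f"{keyword!r} is not offered in the first area")
        if keyword not in self.second_area:  # assumption: a repeated selection is ignored
            self.second_area.append(keyword)


page = ConfigPage(first_area=["realistic", "illustration", "wide-angle", "street photography"])
page.select_keyword("wide-angle")
page.select_keyword("street photography")
print(page.second_area)
```

A click or a drag on a keyword would both end in the same select_keyword call; how the gesture is detected is left to the UI layer.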
[0038] S130. In response to the confirmation operation command, the first media file is displayed in conjunction with the style of the media file represented by the target keywords displayed in the second area.
[0039] In this application, the first media file can be generated or pre-defined. In some embodiments, the first media file is generated by combining the style of the media file represented by the target keywords displayed in the second area. For example, the target keywords displayed in the second area can be input into a media file generation model to generate and display the first media file. In other embodiments, multiple sets of keyword-media file correspondences are pre-constructed, and the first media file is determined based on the target keywords displayed in the second area. For example, the pre-constructed keyword-media file correspondences can be searched for the correspondence that includes the target keywords displayed in the second area, and the media file in that correspondence is used as the first media file. This approach can output the first media file quickly, allowing users to quickly experience the artistic effect of the style they chose by selecting target keywords, and providing a reference for subsequent media file creation.
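The two ways of obtaining the first media file described above could look roughly like the following sketch. The table contents, file names, and the stand-in fake_generate function are hypothetical, and using generation as a fallback when the lookup finds nothing is an assumption; the disclosure presents lookup and generation as separate embodiments.

```python
from typing import Callable, Optional

# Hypothetical pre-constructed keyword-media file correspondences.
# Keys are keyword sets; values are placeholder file names.
CORRESPONDENCES: dict[frozenset[str], str] = {
    frozenset({"wide-angle", "street photography"}): "sample_wide_street.png",
    frozenset({"realistic"}): "sample_realistic.png",
}


def lookup_first_media_file(target_keywords: list[str]) -> Optional[str]:
    """Search the pre-constructed correspondences for the selected keywords."""
    return CORRESPONDENCES.get(frozenset(target_keywords))


def determine_first_media_file(
    target_keywords: list[str],
    generate: Callable[[list[str]], str],
) -> str:
    """Use a pre-defined file when one matches, otherwise call the model (assumed fallback)."""
    found = lookup_first_media_file(target_keywords)
    return found if found is not None else generate(target_keywords)


def fake_generate(keywords: list[str]) -> str:
    """Stand-in for a media file generation model; returns a label, not a real file."""
    return "generated:" + "+".join(sorted(keywords))


print(determine_first_media_file(["street photography", "wide-angle"], fake_generate))
print(determine_first_media_file(["vivid"], fake_generate))
```

Keying the table on a frozenset makes the lookup independent of the order in which the user selected the keywords.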
[0040] The confirmation operation instruction could be, for example, an instruction indicating that the user wants to know what media file to obtain based on the keywords displayed in the current second area. Optionally, a confirmation operation instruction is generated when a third preset trigger operation on the configuration page is detected. This third preset trigger operation could be, for example, a swipe operation (such as a horizontal or vertical swipe), a drag operation, a click operation, or a click operation on a specific area or option of the configuration page. For example, the configuration page includes a generate option. If a user click operation on this generate option is detected, a confirmation operation instruction is generated. The generate option could be, for example, an option that triggers the media file generation model to run.
[0041] Furthermore, the display state of the configuration page can be set to include a first state and a second state; in the first state, the number of keywords included in the first area is less than the number of keywords included in the first area in the second state; the method can also include: when the display state of the configuration page is the first state, in response to a first preset trigger operation on the configuration page, switching the display state of the configuration page from the first state to the second state.
[0042] The first preset trigger operation on the configuration page can indicate an operation that allows switching between a first state and a second state of the configuration page. For example, the first preset trigger operation can be a swipe operation (such as a horizontal swipe or a vertical swipe), a drag operation, a click operation, or a click operation on a specific area or a specific option of the configuration page in the first state.
[0043] For example, consider image generation. Both Figure 2 and Figure 3 show configuration pages. Each configuration page includes a first area and a second area. The first area is located below the second area. The first area in Figure 2 is smaller than the first area in Figure 3. The first area in Figure 2 includes five keywords: "Childhood Dream Core," "Pop Kitten," "Realistic," "Illustration," and "Vivid." The first area in Figure 3 includes ten keywords: "Childhood Dream Core," "Pop Kitten," "Realistic," "Illustration," "Vivid," "Summer," "Close-up," "Wide Angle," "Outline," and "Street Photography." The first area in Figure 2 therefore includes fewer keywords than the first area in Figure 3. The configuration page in Figure 2 is in the first state, and the configuration page in Figure 3 is in the second state. When the electronic device displays the configuration page in Figure 2, if the user clicks the first area, clicks the "More" option in the first area, or slides within the first area, the electronic device displays the configuration page in Figure 3, thus switching the configuration page from the first state to the second state.
[0044] If a user clicks on the keywords "wide-angle" and "street photography" based on Figure 3, these two keywords will be used as target keywords. The two keywords will then be displayed in the second area of the configuration page. If the user clicks the "Generate" option in Figure 3, the system will subsequently determine and display the first media file based on the style of the media file represented by the keywords "wide-angle" and "street photography."
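The first and second display states can be sketched as follows, using the five and ten keywords listed for Figures 2 and 3. The enum, the constant names, and the handler name are illustrative assumptions.

```python
from enum import Enum


class DisplayState(Enum):
    FIRST = 1    # collapsed: fewer keywords shown (Figure 2)
    SECOND = 2   # expanded: more keywords shown (Figure 3)


ALL_KEYWORDS = [
    "Childhood Dream Core", "Pop Kitten", "Realistic", "Illustration", "Vivid",
    "Summer", "Close-up", "Wide Angle", "Outline", "Street Photography",
]
COLLAPSED_COUNT = 5  # Figure 2 shows the first five keywords


def visible_keywords(state: DisplayState) -> list[str]:
    """Return the keywords shown in the first area for the given display state."""
    if state is DisplayState.SECOND:
        return list(ALL_KEYWORDS)
    return ALL_KEYWORDS[:COLLAPSED_COUNT]


def on_first_preset_trigger(state: DisplayState) -> DisplayState:
    """Switch from the first state to the second; other states are left unchanged."""
    return DisplayState.SECOND if state is DisplayState.FIRST else state


state = DisplayState.FIRST
print(len(visible_keywords(state)))
state = on_first_preset_trigger(state)
print(len(visible_keywords(state)))
```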
[0045] The above technical solution displays a configuration page, which includes a first area and a second area. The first area includes multiple keywords, which are used to characterize the style of the media file. In response to the selection of a target keyword in the first area, the target keyword is displayed in the second area of the configuration page. In response to a confirmation command, the first media file is displayed, combining the media file style characterized by the target keywords displayed in the second area. Essentially, it uses options to showcase the available media file styles for the user to choose from. This way, even if the user lacks experience and inspiration, struggles to describe the desired style of the generated media file, or lacks understanding of media file styles, they can still determine the required style by selecting keywords, thereby reducing the difficulty of using media file generation technology.
[0046] Based on the above technical solution, optionally, the configuration page also includes a third area; the method further includes: collecting generation prompt information through the third area; and displaying the generation prompt information in the second area of the configuration page. S130 may include: in response to the confirmation operation instruction, generating and displaying the first media file by combining the style of the media file represented by the target keywords displayed in the second area with the generation prompt information.
[0047] The third area is an area of the configuration page distinct from the first and second areas, and assists users in configuring generation prompt information. The generation prompt information can be text information or image information.
[0048] For example, let's continue with image generation. Referring to Figure 4, the configuration page includes a first area, a second area, and a third area. The first area is located below the second area, and the third area is located below the first area. The first area includes multiple keywords. If the user clicks on the keywords "wide-angle" and "street photography," these two keywords will be used as target keywords. The keywords "wide-angle" and "street photography" will be displayed in the second area of the configuration page. The user enters the text "balloons and astronauts in childhood" in the text input box in the third area. This text will be displayed as a generation prompt in the second area. Then, if the user clicks the "Generate" option in Figure 4, a first media file will be generated based on "wide-angle," "street photography," and the text "balloons and astronauts in childhood."
[0049] The system collects generation prompts in the third area and displays them in the second area of the configuration page. Upon confirmation, it combines the target keywords representing the media file's style displayed in the second area with the generation prompts to generate and display the first media file. This process essentially allows users to edit text and / or upload images according to their needs. The edited text and / or uploaded images serve as generation prompts, participating in the media file generation process along with the target keywords. This combines style experience with image generation functionality, ensuring the generated media file better matches user expectations.
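One way to picture how the target keywords and the generation prompt information travel together is the sketch below. The GenerationRequest type and the flattening into a single comma-separated string are assumptions made for illustration; the disclosure does not specify the input format of the media file generation model.

```python
from dataclasses import dataclass, field


@dataclass
class GenerationRequest:
    """Hypothetical container for everything shown in the second area."""
    style_keywords: list[str]
    prompt_text: str = ""
    reference_images: list[str] = field(default_factory=list)


def to_model_input(request: GenerationRequest) -> str:
    """Flatten prompt text and style keywords into one string (assumed model input)."""
    parts = [request.prompt_text] if request.prompt_text else []
    parts.extend(request.style_keywords)
    return ", ".join(parts)


request = GenerationRequest(
    style_keywords=["wide-angle", "street photography"],
    prompt_text="balloons and astronauts in childhood",
)
print(to_model_input(request))
```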
[0050] Based on the above technical solution, optionally, the keywords displayed in the first area of the configuration page include first-category keywords and second-category keywords; the display position of the first-category keywords is located before the display position of the second-category keywords; the first-category keywords include multiple sub-keywords, and different sub-keywords correspond to different categories; S120 may include: in response to the selection operation of the target keyword in the first area, if the target keyword is a first-category keyword, displaying the sub-keywords included in the target keyword in the second area of the configuration page; if the target keyword is a second-category keyword, displaying the target keyword in the second area of the configuration page.
[0051] The sub-keywords of the first-category keywords and the second-category keywords are words of the same type. For example, where the media file is an image, both the sub-keywords of the first-category keywords and the second-category keywords can be descriptive words about image style, image color, image lighting, image portrait representation, or image framing. The main difference is that a first-category keyword is an aggregation of multiple sub-keywords, while a second-category keyword is not an aggregation.
[0052] Those skilled in the art will understand that the artistic effect of a high-quality media file can be interpreted from multiple dimensions. Taking images as an example, when analyzing images with superior visual effects, one can analyze them from multiple dimensions such as color, lighting, and shot composition. In other words, the superior artistic effect of a media file is formed by the interweaving, combination, and interaction of features (i.e., keywords) from multiple analytical dimensions. It is precisely the combination of these features (i.e., keywords) from different analytical dimensions that achieves the superior artistic effect. Multiple sub-keywords within the first category of keywords have a collocation relationship. When determining which keywords to use as sub-keywords within the first category of keywords, one should consider which keyword combinations result in the best artistic effect.
[0053] For example, referring to Figure 5, among the 10 keywords displayed on this configuration page, the first two keywords (i.e., "Childhood Dream Core" and "Pop Kitten") are Category 1 keywords, and the remaining keywords are Category 2 keywords. Category 1 keywords are located in the first two display positions in the first area, and Category 2 keywords (i.e., all keywords except "Childhood Dream Core" and "Pop Kitten") are located after Category 1 keywords. Assuming that the Category 1 keyword "Childhood Dream Core" includes two sub-keywords, "Dreamy" and "Retro," if a user clicks on "Childhood Dream Core" in the first area, "Dreamy" and "Retro" will be displayed in the second area.
[0054] Furthermore, a first-category keyword can also include preset generation prompt information. The preset generation prompt information can include one or more of preset text information, preset image information, preset video information, and preset audio information. For example, suppose the first-category keyword "Childhood Dream Core" includes two sub-keywords, "Dreamy" and "Retro," as well as preset text information and a preset reference image. The preset text information is "XXXXX," and the preset reference image is image A. Referring to Figure 4, if the user clicks on "Childhood Dream Core" in the first area, "Dreamy," "Retro," "XXXXX," and image A will be displayed in the second area.
[0055] By setting the first category of keywords to include multiple sub-keywords, users can quickly generate media files with better artistic effects when they select the first category, saving them time from trying different keyword combinations. Positioning the first category of keywords before the second category of keywords makes it easier for users to find them quickly.
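The difference between first-category and second-category keywords can be sketched as a lookup that expands an aggregate keyword into its sub-keywords. The dictionary layout and function names are assumptions; only "Childhood Dream Core" with its sub-keywords "Dreamy" and "Retro" is taken from the example above.

```python
# First-category keywords aggregate several sub-keywords.
FIRST_CATEGORY: dict[str, list[str]] = {
    "Childhood Dream Core": ["Dreamy", "Retro"],
}
# Second-category keywords stand for themselves.
SECOND_CATEGORY: list[str] = ["Realistic", "Illustration", "Vivid"]


def first_area_order() -> list[str]:
    """First-category keywords are displayed before second-category keywords."""
    return list(FIRST_CATEGORY) + SECOND_CATEGORY


def keywords_to_display(target: str) -> list[str]:
    """Return what appears in the second area when `target` is selected."""
    if target in FIRST_CATEGORY:
        return list(FIRST_CATEGORY[target])   # show the sub-keywords
    if target in SECOND_CATEGORY:
        return [target]                       # show the keyword itself
    raise KeyError(f"unknown keyword: {target!r}")


print(first_area_order())
print(keywords_to_display("Childhood Dream Core"))
print(keywords_to_display("Vivid"))
```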
[0056] Furthermore, a deletion function can also be provided: in response to a deletion instruction for keywords and/or generation prompt information displayed in the second area, the corresponding keywords and/or generation prompt information are deleted from the second area.
[0057] Based on the above technical solution, optionally, displaying the first media file may include: displaying, on the results display page, the first media file and the target keywords used to obtain the first media file.
[0058] The results display page could be, for example, a page displaying the first media file. Furthermore, the results display page could be configured to show the first media files obtained when the same account executed the technical solution provided in this application at different times.
[0059] Furthermore, the method may also include: in response to a selection operation on the target keyword on the results display page, displaying candidate keywords, where the candidate keywords belong to the same style category as the target keyword and were not used in obtaining the first media file; in response to a selection operation on a candidate keyword, using the selected candidate keyword as a new target keyword; and, in combination with the style of the media file represented by the new target keyword, displaying on the results display page the second media file and the target keywords used to obtain the second media file.
[0060] In practice, each style category can be assigned a corresponding keyword set, which includes multiple keywords. If a keyword in the keyword set corresponding to a particular style category is used as the target keyword, all other keywords in that set become candidate keywords. The target keyword influences the determination of the first media file. The selected candidate keywords influence the determination of the second media file.
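The relationship between a target keyword and its candidate keywords can be sketched with one keyword set per style category. The category names and keyword sets below come from the examples in this description; the function names and the dictionary layout are illustrative assumptions.

```python
# One keyword set per style category (contents taken from the examples in this description).
STYLE_CATEGORIES: dict[str, list[str]] = {
    "shot type": ["close-up", "wide-angle", "telephoto"],
    "portrait representation": ["street photography", "retro", "dreamlike"],
}


def candidate_keywords(target: str) -> list[str]:
    """All other keywords in the target keyword's style category."""
    for keywords in STYLE_CATEGORIES.values():
        if target in keywords:
            return [k for k in keywords if k != target]
    return []


def replace_target(targets: list[str], old: str, new: str) -> list[str]:
    """Swap a target keyword for one of its candidates, keeping the others unchanged."""
    if new not in candidate_keywords(old):
        raise ValueError(f"{new!r} is not a candidate for {old!r}")
    return [new if k == old else k for k in targets]


first_targets = ["wide-angle", "street photography"]
print(candidate_keywords("wide-angle"))
print(replace_target(first_targets, "wide-angle", "close-up"))
```

The list returned by replace_target would then drive the determination of the second media file, while first_targets stays associated with the first media file.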
[0061] The method of "determining the second media file by combining the style of the media file represented by the new target keywords" is similar to the method, described above, of determining the first media file by combining the style of the media file represented by the target keywords, and will not be repeated here.
[0062] Furthermore, the results display page includes multiple display positions; the first media file is displayed in the Nth display position on the results display page; displaying the second media file on the results display page includes: displaying the second media file in the Nth display position on the results display page; and, in response to a media file switching instruction for the Nth display position, switching the second media file to the first media file, or switching the first media file to the second media file. In effect, the second media file and the first media file share the same display position and can be switched within that position.
[0063] A media file switching command for the Nth display position could be, for example, an instruction indicating that the user wants to switch between multiple media files displayed at the Nth display position. In practice, this command could be a swipe or click operation at the Nth display position.
[0064] For example, suppose a user clicks the "Generate" option in Figure 4. Based on "wide-angle," "street photography," and the text information "balloons and astronauts in childhood," a first media file, image B, is generated. Referring to Figure 6, the results display page shows image B along with the target keywords "wide-angle" and "street photography" used to generate image B. The style category of "wide-angle" is "shot type," and the keyword set corresponding to this style category also includes "close-up" and "telephoto," so "close-up" and "telephoto" are the candidate keywords corresponding to "wide-angle." If the user clicks "wide-angle" in Figure 6, the drop-down list for "wide-angle" displays the two candidate keywords "close-up" and "telephoto." If the user selects "close-up," a second media file is generated based on "close-up," "street photography," and the text information "balloons and astronauts in childhood." Assume the second media file is image D. Referring to Figure 7, the results display page shows image D along with the target keywords "close-up" and "street photography" used to generate image D. Comparing Figures 6 and 7, image D is now displayed in the position where image B was originally displayed. In Figure 6, the target keywords used to generate image B are displayed above image B; in Figure 7, the target keywords used to generate image D are displayed above image D. Further, based on Figure 7, if the user swipes horizontally (e.g., to the left) in the area occupied by image D, a media file switching instruction is generated, and based on this instruction image D is replaced with image B. If the user then swipes horizontally again (e.g., to the right) in the area occupied by image B, another media file switching instruction is generated, and based on this instruction image B is replaced with image D.
[0065] In response to the selection of a target keyword on the results display page, candidate keywords are displayed; in response to the selection of a candidate keyword, the selected candidate keyword is used as the new target keyword; and, in combination with the style of the media file represented by the new target keyword, a second media file is displayed on the results display page along with the target keywords used to obtain it. This allows users to quickly see, on the results display page, how different keywords affect the generated media file, helping them generate media files that meet their expectations. To a certain extent, the second media file is the first media file with some of its generation elements modified. By displaying the second media file in the same display position as the first media file, and by switching between the second media file and the first media file in response to a media file switching instruction for the Nth display position, users can compare how a keyword change affects the artistic effect of the media file, which helps them find a keyword configuration that generates media files meeting their expectations.
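Sharing one display position between the first and second media files can be sketched as a small container with a cursor. The class name, the step-based switch method, and the clamping at either end are assumptions; which swipe direction maps to which step is also left open here.

```python
from dataclasses import dataclass, field


@dataclass
class DisplayPosition:
    """Hypothetical Nth display position holding every media file shown in it."""
    media_files: list[str] = field(default_factory=list)
    current: int = -1   # index of the media file currently shown

    def show(self, media_file: str) -> None:
        """Place a newly obtained media file in this position and display it."""
        self.media_files.append(media_file)
        self.current = len(self.media_files) - 1

    def switch(self, step: int) -> str:
        """Handle a media file switching instruction; clamps at both ends (assumption)."""
        if not self.media_files:
            raise IndexError("display position is empty")
        last = len(self.media_files) - 1
        self.current = max(0, min(last, self.current + step))
        return self.media_files[self.current]


position_n = DisplayPosition()
position_n.show("image B")   # first media file
position_n.show("image D")   # second media file replaces it in the same position
back = position_n.switch(-1)
forward = position_n.switch(+1)
print(back, forward)
```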
[0066] Based on the above technical solutions, in response to the display instruction for the information flow page, the information flow page is displayed. The information flow page includes multiple media file display positions; the media file display positions are used to display media files; multiple media files produced by the same generation iteration sequence occupy the same media file display position in the information flow page.
[0067] An information flow page can be, for example, a page or area that displays information as an information flow (feed). An information flow is a data format that continuously provides content to users and consists of multiple pieces of information arranged in a specific order. Information in an information flow can include, for example, media files. In an information flow, different media files can be created by different users. All users can view media files created by themselves or by other users through the information flow page.
[0068] The same generation iteration sequence refers to a media file creation process that starts from an initial idea and evolves gradually over multiple rounds. In each round, the generation prompt information and/or keywords required for media file generation are modified and optimized, and the corresponding media file is generated based on the modified generation prompt information and/or keywords. This cycle repeats, allowing the media file to develop iteratively from an initial concept in a more refined and expressive direction, forming a collection of media works that are logically connected and progressively evolve. The multiple media files produced by the same generation iteration sequence are the media files in this collection.
[0069] On the information flow page, multiple media files produced by the same generation iteration sequence occupying the same media file display position means that these media files are grouped together and displayed in a single position. For example, one cover image can be assigned to the multiple media files produced by the same generation iteration sequence, and this cover image is displayed on the information flow page. If the user selects the cover image, all media files produced by that generation iteration sequence are displayed. This helps users understand how the works in the same generation iteration sequence were progressively improved.
[0070] For example, referring to Figure 8, on this information flow page, media file 1 is labeled "43 ideas," meaning that this display position is occupied by multiple media files generated by the same generation iteration sequence. This same generation iteration sequence performed 43 rounds of media file generation.
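As a rough sketch of this grouping, the snippet below collapses media files that share a generation iteration sequence into one display position and derives a label in the style of "43 ideas". The `sequence_id` field, the dictionary layout, and the choice of the most recent round as the cover are hypothetical and are not specified by this disclosure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeedItem:
    media_id: str
    sequence_id: str  # hypothetical identifier of the generation iteration sequence


def build_display_positions(items: list[FeedItem]) -> list[dict]:
    """Group feed items so that each iteration sequence occupies one position."""
    groups: dict[str, list[FeedItem]] = {}
    for item in items:
        # dict preserves insertion order, so positions follow first appearance.
        groups.setdefault(item.sequence_id, []).append(item)
    positions = []
    for sequence_id, members in groups.items():
        positions.append({
            "sequence_id": sequence_id,
            "cover": members[-1].media_id,  # assumption: latest round is the cover
            "label": f"{len(members)} ideas",
            "members": [m.media_id for m in members],
        })
    return positions


feed = [FeedItem("a1", "seq-a"), FeedItem("a2", "seq-a"),
        FeedItem("b1", "seq-b"), FeedItem("a3", "seq-a")]
positions = build_display_positions(feed)
```

Selecting the cover of a position would then reveal every entry in its `members` list.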
[0071] Based on the above technical solution, optionally, the configuration page includes an image configuration page, a video configuration page, and a music configuration page; the image configuration page, video configuration page, and music configuration page are arranged in a preset order to form a configuration page sequence; the method may further include: in response to a second trigger operation on the currently displayed configuration page, switching the currently displayed configuration page to the next configuration page in the configuration page sequence.
[0072] The currently displayed configuration page is one of the image configuration page, the video configuration page, or the music configuration page. The second trigger operation on the currently displayed configuration page can be any operation indicating that the user wishes to switch between configuration pages. Specifically, it can be a swipe (horizontal or vertical), drag, or click operation on the currently displayed configuration page.
[0073] For example, suppose the configuration pages in the configuration page sequence are arranged in the order of image configuration page, video configuration page, and music configuration page. If, at a certain moment, referring to Figure 2, the currently displayed configuration page is the image configuration page, and the user performs a second trigger operation (such as a left swipe operation) on this configuration page (i.e., the configuration page in Figure 2), the image configuration page will be switched to the video configuration page. The page displayed on the electronic device at this time is shown in Figure 10.
[0074] This setting allows for quick switching between different types of configuration pages, thus helping users quickly find the configuration page they need.
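The page switching described above can be sketched as follows. The page names and the function `next_config_page` are illustrative, and the behavior at the end of the sequence (staying on the last page) is an assumption, since this disclosure does not state what happens after the final configuration page.

```python
# Preset order of configuration pages, as in the example above.
CONFIG_PAGE_SEQUENCE = ["image", "video", "music"]


def next_config_page(current: str, sequence: list[str] = CONFIG_PAGE_SEQUENCE) -> str:
    """Return the page to display after a second trigger operation."""
    index = sequence.index(current)  # raises ValueError for an unknown page
    # Assumption: remain on the last page instead of wrapping around.
    return sequence[min(index + 1, len(sequence) - 1)]


after_image = next_config_page("image")
after_music = next_config_page("music")
```

A wrap-around variant would replace the `min(...)` expression with `(index + 1) % len(sequence)`.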
[0075] It is understood that before using the technical solutions disclosed in the various embodiments of this disclosure, users should be informed of the types, scope of use, and usage scenarios of the personal information involved in this disclosure in an appropriate manner in accordance with relevant laws and regulations, and user authorization should be obtained.
[0076] For example, upon receiving a user's active request, a prompt message is sent to the user to explicitly inform them that the requested operation will require the acquisition and use of the user's personal information. This allows the user to independently choose whether to provide personal information to the software or hardware, such as the electronic device, application, server, or storage medium performing the operations of this disclosed technical solution, based on the prompt message.
[0077] As an optional but non-limiting implementation, in response to a user's active request, sending a prompt message to the user can be done via a pop-up window, where the prompt message can be presented in text format. Furthermore, the pop-up window can also include a selection control allowing the user to choose "agree" or "disagree" to provide personal information to the electronic device.
[0078] It is understood that the above notification and user authorization process are merely illustrative and do not constitute a limitation on the implementation of this disclosure. Other methods that comply with relevant laws and regulations may also be applied to the implementation of this disclosure.
[0079] It should be noted that, for the sake of simplicity, the foregoing method embodiments are all described as a series of actions. However, those skilled in the art should understand that the present invention is not limited to the described order of actions, because according to the present invention, some steps can be performed in other orders or simultaneously. Furthermore, those skilled in the art should also understand that the embodiments described in the specification are preferred embodiments, and the actions and modules involved are not necessarily essential to the present invention.
[0080] Figure 10 is a schematic diagram of an interactive device according to an embodiment of this disclosure. The interactive device provided in this embodiment can be configured in a client or in a server. Referring to Figure 10, the interactive device specifically includes:
[0081] The first display module 310 is used to display a configuration page, which includes a first area and a second area; the first area includes multiple keywords; the keywords are used to characterize the style of the media file.
[0082] The second display module 320 is used to display the target keyword in the second area of the configuration page in response to the selection operation of the target keyword in the first area;
[0083] The third display module 330 is used to display the first media file in response to a confirmation operation command, in conjunction with the style of the media file represented by the target keywords displayed in the second area.
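The cooperation of the three modules can be pictured with a small state model. This is a sketch under stated assumptions: `ConfigurationPage`, `select`, `confirm`, and `placeholder_generate` are hypothetical names, and the generator is a stand-in string function rather than an actual media generation model.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class ConfigurationPage:
    first_area: list[str]                                  # selectable style keywords
    second_area: list[str] = field(default_factory=list)   # selected target keywords

    def select(self, keyword: str) -> None:
        # Selection operation: copy the target keyword into the second area.
        if keyword not in self.first_area:
            raise ValueError(f"unknown keyword: {keyword}")
        if keyword not in self.second_area:
            self.second_area.append(keyword)

    def confirm(self, generate: Callable[[list[str]], str]) -> str:
        # Confirmation operation: generate from whatever the second area shows.
        return generate(list(self.second_area))


def placeholder_generate(keywords: list[str]) -> str:
    # Stand-in for a media generation backend (assumption).
    return "media styled as: " + ", ".join(keywords)


page = ConfigurationPage(first_area=["cyberpunk", "watercolor", "film grain"])
page.select("watercolor")
page.select("film grain")
first_media_file = page.confirm(placeholder_generate)
```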
[0084] Furthermore, the configuration page also includes a third area; the device also includes a collection module, used to collect generation prompt information through the third area;
[0085] The second display module 320 is used to display the generation prompt information in the second area of the configuration page;
[0086] The third display module 330 is used to generate and display the first media file in response to the confirmation operation command, combining the style of the media file represented by the target keyword displayed in the second area and the generation prompt information.
[0087] Furthermore, the keywords displayed in the first area of the configuration page include first-category keywords and second-category keywords; the first-category keywords are displayed before the second-category keywords; the first-category keywords include multiple sub-keywords, and different sub-keywords correspond to different categories;
[0088] The second display module 320 is used for:
[0089] In response to the selection operation on the target keyword in the first area, if the target keyword is a first-category keyword, the sub-keywords included in the target keyword are displayed in the second area of the configuration page; if the target keyword is a second-category keyword, the target keyword itself is displayed in the second area of the configuration page.
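The two branches above can be sketched as a single lookup. The keyword tables and the function name `keywords_for_second_area` are hypothetical examples; the actual keywords and their categories are not given in this disclosure.

```python
# Hypothetical first-category keyword: a bundle of sub-keywords,
# each belonging to a different style category.
FIRST_CATEGORY = {
    "retro film": ["35mm grain", "faded color", "soft vignette"],
}
# Hypothetical second-category keywords: plain, single keywords.
SECOND_CATEGORY = {"watercolor", "neon"}


def keywords_for_second_area(target: str) -> list[str]:
    """Return what the second area should display for a selected keyword."""
    if target in FIRST_CATEGORY:
        # First-category keyword: show its sub-keywords.
        return list(FIRST_CATEGORY[target])
    if target in SECOND_CATEGORY:
        # Second-category keyword: show the keyword itself.
        return [target]
    raise KeyError(target)


expanded = keywords_for_second_area("retro film")
single = keywords_for_second_area("neon")
```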
[0090] Furthermore, the third display module 330 is used to display the first media file and the target keywords used in the first media file on the results display page.
[0091] Furthermore, the third display module 330 is also used for:
[0092] In response to the selection of the target keyword on the results display page, candidate keywords are displayed, which belong to the same style category as the target keyword; the candidate keywords were not used in the process of obtaining the first media file.
[0093] In response to the selection operation of the candidate keywords, the selected candidate keywords are used as new target keywords;
[0094] Based on the style of the media file characterized by the new target keywords, the second media file and the target keywords used by the second media file are displayed on the results display page.
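A minimal sketch of the candidate keyword logic follows. The category table `STYLE_CATEGORIES` and both function names are assumptions for illustration; the only rules taken from the text are that candidates share the style category of the selected target keyword and were not used for the first media file.

```python
# Hypothetical style categories and their keywords.
STYLE_CATEGORIES = {
    "medium": ["watercolor", "oil painting", "ink wash"],
    "lighting": ["warm light", "neon glow"],
}


def candidate_keywords(target: str, used: list[str]) -> list[str]:
    """Keywords in the target's style category that were not used before."""
    for members in STYLE_CATEGORIES.values():
        if target in members:
            return [k for k in members if k not in used]
    return []


def replace_keyword(used: list[str], target: str, candidate: str) -> list[str]:
    """Build the keyword list for the second media file."""
    return [candidate if k == target else k for k in used]


used_for_first = ["watercolor", "warm light"]
candidates = candidate_keywords("watercolor", used_for_first)
used_for_second = replace_keyword(used_for_first, "watercolor", "ink wash")
```

Because only the selected keyword is swapped, the remaining keywords carry over unchanged to the second media file.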
[0095] Furthermore, the result display page includes multiple display positions; the first media file is displayed at the Nth display position on the result display page; the third display module is also used for:
[0096] The second media file is displayed in the Nth display position on the results display page;
[0097] In response to the media file switching instruction for the Nth display position, the second media file is switched to the first media file.
[0098] Furthermore, the display state of the configuration page includes a first state and a second state; in the first state, the number of keywords included in the first area is less than the number of keywords included in the first area in the second state; the device also includes a first switching module;
[0099] The first switching module is used to switch the display state of the configuration page from the first state to the second state in response to a first preset trigger operation on the configuration page when the display state of the configuration page is the first state.
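The two display states can be sketched as follows. The keyword list, the collapsed count of three, and the names `DisplayState`, `visible_keywords`, and `on_first_trigger` are illustrative assumptions; the text only requires that the first state show fewer keywords than the second.

```python
from enum import Enum


class DisplayState(Enum):
    FIRST = "first"    # fewer keywords shown
    SECOND = "second"  # more keywords shown


ALL_KEYWORDS = ["watercolor", "neon", "film grain", "ink wash", "pixel art"]
COLLAPSED_COUNT = 3  # assumption: number of keywords shown in the first state


def visible_keywords(state: DisplayState) -> list[str]:
    if state is DisplayState.FIRST:
        return ALL_KEYWORDS[:COLLAPSED_COUNT]
    return list(ALL_KEYWORDS)


def on_first_trigger(state: DisplayState) -> DisplayState:
    # The first preset trigger operation only acts in the first state.
    return DisplayState.SECOND if state is DisplayState.FIRST else state


state = on_first_trigger(DisplayState.FIRST)
```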
[0100] Furthermore, the device also includes a third display module for:
[0101] In response to a display instruction for an information flow page, the information flow page is displayed, the information flow page including multiple media file display positions; the media file display positions are used to display media files; multiple media files generated by the same generation iteration sequence occupy the same media file display position on the information flow page.
[0102] Furthermore, the configuration page includes an image configuration page, a video configuration page, and a music configuration page; the image configuration page, the video configuration page, and the music configuration page are arranged sequentially in a preset order to form a configuration page sequence; the device also includes a second switching module;
[0103] The second switching module is used to switch the currently displayed configuration page to the next configuration page in the configuration page sequence in response to a second trigger operation on the currently displayed configuration page.
[0104] The interactive device provided in this disclosure can execute the steps performed by the client or the server in the interaction method provided in this disclosure, and has the corresponding functions and beneficial effects, which will not be described in detail here.
[0105] Figure 11 is a schematic diagram of the structure of an electronic device according to an embodiment of this disclosure. Referring specifically to Figure 11 below, it shows a schematic diagram of the structure suitable for implementing the electronic device 1000 in the embodiments of this disclosure. The electronic device 1000 in the embodiments of this disclosure may include, but is not limited to, mobile terminals such as mobile phones, laptops, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), in-vehicle terminals (e.g., in-vehicle navigation terminals), wearable electronic devices, etc., as well as fixed terminals such as digital TVs, desktop computers, smart home devices, etc. The electronic device shown in Figure 11 is merely an example and should not impose any limitations on the functionality and scope of use of the embodiments of this disclosure.
[0106] As shown in Figure 11, the electronic device 1000 may include a processing device (e.g., a central processing unit, a graphics processing unit, etc.) 1001, which can perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 1002 or a program loaded from a storage device 1008 into a random access memory (RAM) 1003 to implement the interaction method as described in the embodiments of the present disclosure. The RAM 1003 also stores various programs and information required for the operation of the electronic device 1000. The processing device 1001, ROM 1002, and RAM 1003 are interconnected via a bus 1004. An input / output (I / O) interface 1005 is also connected to the bus 1004.
[0107] Typically, the following devices can be connected to the I / O interface 1005: input devices 1006 including, for example, a touchscreen, touchpad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; output devices 1007 including, for example, a liquid crystal display (LCD), speaker, vibrator, etc.; storage devices 1008 including, for example, magnetic tape, hard disk, etc.; and communication devices 1009. The communication device 1009 allows the electronic device 1000 to communicate with other devices, wirelessly or by wire, to exchange information. Although Figure 11 shows an electronic device 1000 with various devices, it should be understood that it is not required to implement or possess all the devices shown. Alternatively, more or fewer devices may be implemented or provided.
[0108] In particular, according to embodiments of this disclosure, the processes described above with reference to the flowcharts can be implemented as computer software programs. For example, embodiments of this disclosure include a computer program product comprising a computer program carried on a non-transitory computer-readable medium, the computer program containing program code for performing the methods shown in the flowcharts, thereby implementing the interactive methods as described above. In such embodiments, the computer program can be downloaded and installed from a network via communication device 1009, or installed from storage device 1008, or installed from ROM 1002. When the computer program is executed by processing device 1001, it performs the functions defined in the methods of embodiments of this disclosure.
[0109] It should be noted that the computer-readable medium described in this disclosure can be a computer-readable signal medium or a computer-readable storage medium, or any combination thereof. A computer-readable storage medium can be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples of a computer-readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination thereof. In this disclosure, a computer-readable storage medium can be any tangible medium containing or storing a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. In this disclosure, a computer-readable signal medium can include information signals propagated in baseband or as part of a carrier wave, carrying computer-readable program code. Such propagated information signals can take various forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination thereof. A computer-readable signal medium can be any computer-readable medium other than a computer-readable storage medium, which can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device. The program code contained on the computer-readable medium can be transmitted using any suitable medium, including but not limited to: wires, optical fibers, RF (radio frequency), etc., or any suitable combination thereof.
[0110] In some implementations, clients and servers may communicate using any currently known or future-developed network protocol, such as HTTP (Hypertext Transfer Protocol), and may be interconnected with digital data communication of any form or medium (e.g., a communication network). Examples of communication networks include local area networks (“LANs”), wide area networks (“WANs”), internetworks (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future-developed networks.
[0111] The aforementioned computer-readable medium may be included in the aforementioned electronic device; or it may exist independently and not assembled into the electronic device.
[0112] The aforementioned computer-readable medium carries one or more programs that, when executed by the electronic device, cause the electronic device to:
[0113] A configuration page is displayed, which includes a first area and a second area; the first area includes multiple keywords; the keywords are used to characterize the style of the media file.
[0114] In response to the selection of a target keyword in the first area, the target keyword is displayed in the second area of the configuration page;
[0115] In response to the confirmation operation command, the first media file is displayed in conjunction with the style of the media file represented by the target keywords shown in the second area.
[0116] Optionally, when one or more of the above-described procedures are executed by the electronic device, the electronic device may also perform other steps described in the above embodiments.
[0117] Computer program code for performing the operations of this disclosure can be written in one or more programming languages or a combination thereof, including but not limited to object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code can be executed entirely on the user's computer, partially on the user's computer, as a standalone software package, partially on the user's computer and partially on a remote computer, or entirely on a remote computer or server. In cases involving remote computers, the remote computer can be connected to the user's computer via any type of network—including a local area network (LAN) or a wide area network (WAN)—or can be connected to an external computer (e.g., via the Internet using an Internet service provider).
[0118] The flowcharts and block diagrams in the accompanying drawings illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of this disclosure. In this regard, each block in a flowchart or block diagram may represent a module, segment, or portion of code containing one or more executable instructions for implementing a specified logical function. It should also be noted that in some alternative implementations, the functions indicated in the blocks may occur in a different order than those indicated in the drawings. For example, two consecutively indicated blocks may actually be executed substantially in parallel, and they may sometimes be executed in reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams and / or flowcharts, and combinations of blocks in the block diagrams and / or flowcharts, can be implemented using a dedicated hardware-based system that performs the specified function or operation, or using a combination of dedicated hardware and computer instructions.
[0119] The units described in the embodiments of this disclosure can be implemented in software or hardware. In some cases, the name of a unit does not constitute a limitation on the unit itself.
[0120] The functions described above in this document can be performed, at least in part, by one or more hardware logic components. For example, exemplary types of hardware logic components that can be used include, without limitation: Field Programmable Gate Arrays (FPGAs), Application-Specific Integrated Circuits (ASICs), Application-Specific Standard Products (ASSPs), Systems-on-Chip (SoCs), Complex Programmable Logic Devices (CPLDs), and so on.
[0121] In the context of this disclosure, a machine-readable medium can be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device. A machine-readable medium can be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium can be, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media include electrical connections based on one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing.
[0122] According to one or more embodiments of this disclosure, this disclosure provides an electronic device, including:
[0123] One or more processors;
[0124] Memory, used to store one or more programs;
[0125] When the one or more programs are executed by the one or more processors, the one or more processors implement any of the interaction methods provided in this disclosure.
[0126] According to one or more embodiments of the present disclosure, the present disclosure provides a computer-readable storage medium having a computer program stored thereon that, when executed by a processor, implements the interaction method described in any embodiment of the present disclosure.
[0127] This disclosure also provides a computer program product, which includes a computer program or instructions that, when executed by a processor, implement the interaction method described above.
[0128] It should be noted that, in this document, relational terms such as "first" and "second" are used merely to distinguish one entity or operation from another, and do not necessarily require or imply any such actual relationship or order between these entities or operations. Furthermore, the terms "comprising," "including," or any other variations thereof are intended to cover non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitations, an element defined by the phrase "comprising one..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that includes said element.
[0129] The above description is merely a specific embodiment of this disclosure, enabling those skilled in the art to understand or implement it. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of this disclosure. Therefore, this disclosure is not to be limited to the embodiments described herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Claims
1. An interaction method, comprising: displaying a configuration page, wherein the configuration page includes a first area and a second area, the first area includes multiple keywords, and the keywords are used to characterize the style of a media file; in response to a selection operation on a target keyword in the first area, displaying the target keyword in the second area of the configuration page; and in response to a confirmation operation command, displaying a first media file in conjunction with the style of the media file characterized by the target keyword displayed in the second area.
2. The method of claim 1, wherein the configuration page further includes a third area; the method further comprises: collecting generation prompt information through the third area; and displaying the generation prompt information in the second area of the configuration page; and wherein displaying the first media file in response to the confirmation operation command, in conjunction with the style of the media file characterized by the target keyword displayed in the second area, comprises: in response to the confirmation operation command, generating and displaying the first media file in conjunction with the style of the media file characterized by the target keyword displayed in the second area and the generation prompt information.
3. The method of claim 1, wherein the keywords displayed in the first area of the configuration page include first-category keywords and second-category keywords; the first-category keywords are displayed before the second-category keywords; each first-category keyword includes multiple sub-keywords, and different sub-keywords correspond to different categories; and wherein displaying the target keyword in the second area of the configuration page in response to the selection operation on the target keyword in the first area comprises: in response to the selection operation on the target keyword in the first area, if the target keyword is a first-category keyword, displaying the sub-keywords included in the target keyword in the second area of the configuration page; and if the target keyword is a second-category keyword, displaying the target keyword in the second area of the configuration page.
4. The method of claim 1, wherein displaying the first media file comprises: displaying, on a results display page, the first media file and the target keywords used by the first media file.
5. The method of claim 4, further comprising: in response to a selection operation on a target keyword on the results display page, displaying candidate keywords, wherein the candidate keywords belong to the same style category as the target keyword and were not used in the process of obtaining the first media file; in response to a selection operation on a candidate keyword, using the selected candidate keyword as a new target keyword; and based on the style of the media file characterized by the new target keyword, displaying, on the results display page, a second media file and the target keywords used by the second media file.
6. The method of claim 5, wherein the results display page includes multiple display positions; the first media file is displayed at the Nth display position on the results display page; and displaying the second media file on the results display page comprises: displaying the second media file at the Nth display position on the results display page; and in response to a media file switching instruction for the Nth display position, switching the second media file to the first media file.
7. The method of claim 1, wherein the display state of the configuration page includes a first state and a second state; the number of keywords included in the first area in the first state is less than the number of keywords included in the first area in the second state; and the method further comprises: when the display state of the configuration page is the first state, in response to a first preset trigger operation on the configuration page, switching the display state of the configuration page from the first state to the second state.
8. The method of claim 1, further comprising: in response to a display instruction for an information flow page, displaying the information flow page, wherein the information flow page includes multiple media file display positions; the media file display positions are used to display media files; and multiple media files generated by the same generation iteration sequence occupy the same media file display position in the information flow page.
9. The method of claim 1, wherein the configuration page includes an image configuration page, a video configuration page, and a music configuration page; the image configuration page, the video configuration page, and the music configuration page are arranged sequentially in a preset order to form a configuration page sequence; and the method further comprises: in response to a second trigger operation on the currently displayed configuration page, switching the currently displayed configuration page to the next configuration page in the configuration page sequence.
10. An interactive device, comprising: a first display module, used to display a configuration page, wherein the configuration page includes a first area and a second area, the first area includes multiple keywords, and the keywords are used to characterize the style of a media file; a second display module, used to display a target keyword in the second area of the configuration page in response to a selection operation on the target keyword in the first area; and a generation module, used to display a first media file in response to a confirmation operation command, in conjunction with the style of the media file characterized by the target keyword displayed in the second area.
11. An electronic device, comprising: one or more processors; and a storage device for storing one or more programs; wherein, when the one or more programs are executed by the one or more processors, the one or more processors implement the method of any one of claims 1-9.
12. A computer-readable storage medium having a computer program stored thereon, wherein, when the program is executed by a processor, the method of any one of claims 1-9 is implemented.