Page interaction method and device, electronic equipment, storage medium and program product
By displaying the video playback mode corresponding to the target text in the text reader, and combining it with listening and control modes, the problem of insufficient information dissemination in text recommendation is solved, achieving more efficient information dissemination and user attraction.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- BEIJING DAJIA INTERNET INFORMATION TECH CO LTD
- Filing Date
- 2026-02-28
- Publication Date
- 2026-06-30
AI Technical Summary
In existing technologies, text-based recommendation methods suffer from insufficient expressiveness of text itself, which affects the effectiveness and efficiency of information dissemination.
By displaying the video corresponding to the target text in the text reader, a video playback mode is achieved, and the text content can be presented in a combination of visual and auditory means when necessary, switching to a listening mode or other control modes.
It has improved the reach and efficiency of information dissemination, enhanced users' interest in and understanding of the text, and extended the time users spend on the page.
Smart Images

Figure CN122309806A_ABST
Abstract
Description
Technical Field
[0001] This disclosure relates to the field of computer technology, and in particular to a page interaction method, apparatus, electronic device, storage medium, and program product. Background Technology
[0002] Applications in related technologies can recommend text to users, for example, by showing videos related to the plot of a novel. While text recommendations allow users to read text, in many cases, directly reading text may hinder the effective dissemination of information due to the limited expressiveness of the text itself, thus affecting the reach and efficiency of the text's communication. Summary of the Invention
[0003] This disclosure provides a page interaction method, apparatus, electronic device, storage medium, and program product to at least solve one of the aforementioned technical problems. The technical solution of this disclosure is as follows: According to a first aspect of the present disclosure, a page interaction method is provided, comprising: Display the media content corresponding to the target text; When the media content is triggered, the target page corresponding to the target text is displayed. The target page is used to display the content corresponding to the target text, and the target page is in video playback mode. On the target page in the video playback mode, the target video corresponding to the target text is played.
[0004] In one exemplary embodiment, the method further includes: On the target page in the video playback mode, a first prompt message is displayed, which is used to instruct the reader to read the target text; When the first prompt message is triggered, the playback of the target video is stopped and the target page is switched to listening mode; On the target page in the listening mode, the text content in the target text that corresponds to the playback progress of the target video is displayed; Play the audio corresponding to the text content.
[0005] In one exemplary implementation, when the first prompt message is triggered, the method further includes: On the target page in the listening mode, a second prompt message is displayed. The second prompt message includes an identifier corresponding to the video playback mode and a status control corresponding to the video playback mode. The status control is used to indicate or control the status of the video playback mode. When the identifier or the status control is triggered, the target page is switched to the video playback mode, and playback of the target video is resumed, with the playback progress of the target video corresponding to the playback progress of the audio.
[0006] In one exemplary embodiment, the second prompt message further includes a close control, and the method further includes: When the close control is triggered, the second prompt message is canceled from being displayed; A third prompt message is displayed, which is used to prompt the user to use the video playback mode. The page area occupied by the third prompt message is smaller than that occupied by the second prompt message.
[0007] In one exemplary implementation, playing the audio corresponding to the text content includes: Upon receiving a preset control operation, the preset position in the currently displayed text content will be used as the starting point for audio playback.
[0008] In one exemplary embodiment, the method further includes: On the target page in the video playback mode, display settings controls; When the settings control is triggered, the target page is switched from the video playback mode to the video control mode; In the target page of the video control mode, a tone control control is displayed; When the timbre control is triggered, a timbre selection panel is displayed, the timbre selection panel including at least one timbre; When the timbre selection panel is triggered, the target video is played using the selected timbre.
[0009] In one exemplary embodiment, the method further includes: On the target page in the video control mode, display the subtitle control; When the subtitle control is enabled, the corresponding subtitles are displayed on the playback screen of the target video. When the subtitle control is set to off, the corresponding subtitles will not be displayed on the playback screen of the target video.
[0010] In one exemplary embodiment, the method further includes: On the target page in the video control mode, the first control is displayed; When the first control is triggered, the target page of the video control mode is switched to audio playback mode; On the target page in the audio playback mode, audio corresponding to the playback progress of the target video and determined based on the target text is played.
[0011] In one exemplary embodiment, the method further includes: On the target page in the audio playback mode, a first prompt message is displayed, which instructs the reader to read the target text. When the first prompt message is triggered, the target page will be switched to audio-visual mode; On the target page in the listening mode, the text content in the target text that corresponds to the current audio playback progress is displayed; Continue playing the audio.
[0012] In one exemplary embodiment, the method further includes: On the target page in the video control mode, a second control is displayed; When the second control is triggered, the target page of the video control mode is switched to video playback mode.
[0013] In one exemplary embodiment, the method further includes: When the content corresponding to the target text has finished playing, or when an operation to exit the target page is received, a fourth prompt message is displayed, which is used to display at least one recommended text. When any of the recommended texts is triggered, the associated target page corresponding to the triggered recommended text is displayed. The associated target page is used to display the content corresponding to the triggered recommended text, and the associated target page is in video playback mode. In the associated target page that is in the video playback mode, the associated target video corresponding to the triggered recommendation text is played.
[0014] According to a second aspect of the present disclosure, a page interaction device is provided, comprising: The display module is configured to display the media content corresponding to the target text; The interaction module is configured to, when the media content is triggered, display a target page corresponding to the target text, the target page being used to display the content corresponding to the target text, and the target page being in video playback mode; and play the target video corresponding to the target text on the target page in video playback mode.
[0015] In one exemplary implementation, the interaction module is configured to execute: On the target page in the video playback mode, a first prompt message is displayed, which is used to instruct the reader to read the target text; When the first prompt message is triggered, the playback of the target video is stopped and the target page is switched to listening mode; On the target page in the listening mode, the text content in the target text that corresponds to the playback progress of the target video is displayed; Play the audio corresponding to the text content.
[0016] In one exemplary implementation, the interaction module is configured to execute: On the target page in the listening mode, a second prompt message is displayed. The second prompt message includes an identifier corresponding to the video playback mode and a status control corresponding to the video playback mode. The status control is used to indicate or control the status of the video playback mode. When the identifier or the status control is triggered, the target page is switched to the video playback mode, and playback of the target video is resumed, with the playback progress of the target video corresponding to the playback progress of the audio.
[0017] In one exemplary embodiment, the second prompt message further includes a close control, and the interaction module is configured to execute: When the close control is triggered, the second prompt message is canceled from being displayed; A third prompt message is displayed, which is used to prompt the user to use the video playback mode. The page area occupied by the third prompt message is smaller than that occupied by the second prompt message.
[0018] In one exemplary implementation, the interaction module is configured to execute: Upon receiving a preset control operation, the preset position in the currently displayed text content will be used as the starting point for audio playback.
[0019] In one exemplary implementation, the interaction module is configured to execute: On the target page in the video playback mode, display settings controls; When the settings control is triggered, the target page is switched from the video playback mode to the video control mode; In the target page of the video control mode, a tone control control is displayed; When the timbre control is triggered, a timbre selection panel is displayed, the timbre selection panel including at least one timbre; When the timbre selection panel is triggered, the target video is played using the selected timbre.
[0020] In one exemplary implementation, the interaction module is configured to execute: On the target page in the video control mode, display the subtitle control; When the subtitle control is enabled, the corresponding subtitles are displayed on the playback screen of the target video. When the subtitle control is set to off, the corresponding subtitles will not be displayed on the playback screen of the target video.
[0021] In one exemplary implementation, the interaction module is configured to execute: On the target page in the video control mode, the first control is displayed; When the first control is triggered, the target page of the video control mode is switched to audio playback mode; On the target page in the audio playback mode, audio corresponding to the playback progress of the target video and determined based on the target text is played.
[0022] In one exemplary implementation, the interaction module is configured to execute: On the target page in the audio playback mode, a first prompt message is displayed, which instructs the reader to read the target text. When the first prompt message is triggered, the target page will be switched to audio-visual mode; On the target page in the listening mode, the text content in the target text that corresponds to the current audio playback progress is displayed; Continue playing the audio.
[0023] In one exemplary implementation, the interaction module is configured to execute: On the target page in the video control mode, a second control is displayed; When the second control is triggered, the target page of the video control mode is switched to video playback mode.
[0024] In one exemplary implementation, the interaction module is configured to execute: When the content corresponding to the target text has finished playing, or when an operation to exit the target page is received, a fourth prompt message is displayed, which is used to display at least one recommended text. When any of the recommended texts is triggered, the associated target page corresponding to the triggered recommended text is displayed. The associated target page is used to display the content corresponding to the triggered recommended text, and the associated target page is in video playback mode. In the associated target page that is in the video playback mode, the associated target video corresponding to the triggered recommendation text is played.
[0025] According to a third aspect of the present disclosure, an electronic device is provided, comprising: a processor; and a memory for storing processor-executable instructions; wherein the processor is configured to execute the instructions to implement the page interaction method described above.
[0026] According to a fourth aspect of the present disclosure, a computer-readable storage medium is provided that, when instructions in the computer-readable storage medium are executed by a processor of an electronic device, enables the electronic device to perform the page interaction method as described above.
[0027] According to a fifth aspect of the present disclosure, a computer program product is provided, the computer program product including a computer program stored in a readable storage medium, wherein at least one processor of a computer device reads from the readable storage medium and executes the computer program, causing the device to perform the page interaction method described above.
[0028] The technical solutions provided by the embodiments of this disclosure have at least the following beneficial effects: The page interaction method, apparatus, electronic device, storage medium, and program product disclosed herein display media content corresponding to target text; when the media content is triggered, a target page corresponding to the target text is displayed, the target page being used to display the content corresponding to the target text, and the target page being in video playback mode; and in the target page being in video playback mode, a target video corresponding to the target text is played.
[0029] This method achieves a smooth transition and deep association between the media content related to the target text and the video playback after the target text is triggered. It displays the target page in video playback mode and plays the target video, breaking away from the traditional model of directly presenting text to users in text promotion scenarios. Instead, it directly displays text-based videos, enhancing the expressiveness of information through video, conveying more information, and increasing user appeal. This makes it easier for users to understand the substantive information conveyed by the text and makes them more willing to stay on the target page to receive text-related information, thus improving the reach and efficiency of the dissemination.
[0030] It should be understood that the above general description and the following detailed description are exemplary and explanatory only, and are not intended to limit this disclosure. Attached Figure Description
[0031] The accompanying drawings, which are incorporated in and form part of this specification, illustrate embodiments consistent with this disclosure and, together with the description, serve to explain the principles of this disclosure, and are not intended to unduly limit this disclosure.
[0032] Figure 1 This is a schematic diagram of an implementation environment according to an exemplary embodiment.
[0033] Figure 2 This is a flowchart illustrating a page interaction method according to an exemplary embodiment.
[0034] Figure 3 This is a schematic diagram of a page interaction method according to an exemplary embodiment. Figure 1 .
[0035] Figure 4 This is a schematic diagram of page interaction according to an exemplary embodiment. Figure 2 .
[0036] Figure 5 This is a mode switching diagram of a target page according to an exemplary embodiment.
[0037] Figure 6 This is a schematic diagram of page interaction based on an exemplary embodiment. Figure 3 .
[0038] Figure 7 This is a block diagram of a page interaction device according to an exemplary embodiment.
[0039] Figure 8 This is a block diagram illustrating an electronic device for page interaction according to an exemplary embodiment.
[0040] Figure 9 This is another block diagram illustrating an electronic device for page interaction according to an exemplary embodiment. Detailed Implementation
[0041] To enable those skilled in the art to better understand the technical solutions of this disclosure, the technical solutions in the embodiments of this disclosure will be clearly and completely described below with reference to the accompanying drawings.
[0042] It should be noted that the terms "first," "second," etc., used in the specification, claims, and accompanying drawings of this disclosure are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It should be understood that such data can be interchanged where appropriate so that the embodiments of this disclosure described herein can be implemented in orders other than those illustrated or described herein. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with this disclosure. Rather, they are merely examples of apparatuses and methods consistent with some aspects of this disclosure as detailed in the appended claims.
[0043] It should be noted that the user information (including but not limited to user device information, user personal information, etc.) and data (including but not limited to data used for display, data used for analysis, etc.) involved in this disclosure are all information and data authorized by the user or fully authorized by all parties.
[0044] Applications in related technologies can make text-based recommendations to users, such as recommending novels by showing videos related to the novel's plot. For example, short video applications can display short videos related to the novel's plot within the video stream. When a user becomes interested in the short video and clicks on it, the application will redirect to the corresponding novel's reader, where they can read the novel. However, due to the nature of novels as text-based mediums, it is difficult for them to capture users' interest in a short period of time. This may cause users to quickly exit the reader and stop reading the novel, thus affecting the promotional effect of the novel.
[0045] In view of this, this disclosure proposes a page interaction method that breaks the inherent pattern of text promotion. This method can directly play text-generated videos for users within the text reader, allowing users to watch dynamic videos that are highly matched with the current reading content. This approach leverages the characteristics of the video medium, fully mobilizing users' visual senses and enhancing their immersion and understanding of the text's plot. Users no longer rely solely on textual imagination but can experience the world constructed by the text more intuitively and vividly. This increases the attractiveness to users' interests in a short period of time, effectively prolongs the time users spend in the reader, enhances users' interest and stickiness to the text, and ultimately improves the promotional effect of the text.
[0046] The technical solution provided in this disclosure will be described in detail below: Please see Figure 1 The illustration shows an implementation environment provided by an embodiment of the present disclosure. The implementation environment may include at least one page interaction terminal 110 and an information acquisition server 120, wherein the page interaction terminal 110 and the information acquisition server 120 can communicate with each other via a network.
[0047] Specifically, the page interaction terminal 110 interacts with the user through interaction with the information acquisition server 120. Specifically, the page interaction terminal 110 can display media content corresponding to the target text; when the media content is triggered, it displays a target page corresponding to the target text, the target page being used to display the content corresponding to the target text, and the target page being in video playback mode; within the target page in video playback mode, the target video corresponding to the target text is played.
[0048] The page interaction terminal 110 can communicate with the information acquisition server 120 based on a browser / server (B / S) mode or a client / server (C / S) mode. The page interaction terminal 110 may include physical devices such as smartphones, tablets, laptops, digital assistants, smart wearable devices, in-vehicle terminals, and servers, and may also include software running on the physical device, such as applications. The operating system running on the page interaction terminal 110 in this embodiment may include, but is not limited to, Android, iOS, Linux, and Windows.
[0049] The information acquisition server 120 and the page interaction terminal 110 can establish and display a communication connection through wired or wireless means. The information acquisition server 120 may include a stand-alone server, a distributed server, or a server cluster composed of multiple servers, wherein the server may be a cloud server.
[0050] Please refer to Figure 2 The diagram illustrates a page interaction method flowchart in an exemplary embodiment of this disclosure. The execution subject of this method can be the aforementioned page interaction terminal. Please refer to [link / reference] for details. Figure 2 The method may include: S210. Display the media content corresponding to the target text.
[0051] This disclosure does not limit the page displaying media content corresponding to the target text. The page can be any page of an application with text promotion functionality. This page promotes the target text by playing media content corresponding to the target text; that is, the target text itself is the object of promotion. In other words, the page in the application aims to promote information (target text) by playing media content. This disclosure does not limit the form of the media content; for example, it can be in the form of video, short video, animation, audio, or a combination of text and images.
[0052] For example, the target text can be any type of information in text form, such as novels, academic papers, poems, essays, or news reports. For instance, when the target text is an online novel, the page can play media content adapted from the novel, such as audio dramas, animated shorts, or dynamic graphics, vividly presenting the novel's plot and characters to users, attracting their interest in reading the novel, and thus promoting the text. If the target text is a review paper about the latest technological products, the page can display video media content including the paper's core viewpoints, visual charts of experimental data, and physical demonstrations of the product, allowing users to more intuitively and quickly understand the professional information conveyed in the paper, thereby achieving the goal of promoting the paper.
[0053] S220. When the media content is triggered, display the target page corresponding to the target text, the target page being used to display the content corresponding to the target text, and the target page being in video playback mode.
[0054] In related technologies, after media content triggers promotional text, a corresponding reader is displayed, showing the text in its original text form for user reading, thus achieving the promotional purpose. For example, if the promoted text is a novel, a novel reader is displayed. Unlike related technologies, this disclosure breaks with the conventional approach of displaying text in its original text form in a reader after triggering promotional text. Instead, it displays the corresponding video in a video playback mode within the reader.
[0055] In this step, the target page is a reader corresponding to the target text. However, unlike readers in related technologies that directly display the text, this reader prioritizes displaying the target video generated based on the target text. This reader can have various modes, with the video playback mode being the priority. By using the video playback mode, it enhances the reader's appeal and better disseminates the substantive content of the target text.
[0056] Taking a novel as an example, when a user triggers media content promoting the novel, what is displayed is not the static text layout of a traditional novel reader, but a novel reader with a video playback mode. This novel reader features a video playback interface. In this video playback mode, the novel's plot is presented through animation, live-action footage, and character performances on the video playback interface. For example, dialogues are played with voice-over narration, and scene descriptions are shown visually. Users can enjoy the novel like watching a movie or TV series. They can also pause and resume playback using the playback control buttons on the target page, or drag the progress bar to jump to chapters of interest. This changes the traditional method of reading novels solely through text, presenting the novel's content in a more vivid and intuitive video format.
[0057] S230. On the target page in the video playback mode, play the target video corresponding to the target text.
[0058] This disclosure does not limit the method of obtaining the target video. For example, the server can automatically generate the target video based on the target text using text-based video generation technology, and send it to the client for storage or real-time loading and playback; alternatively, content creators can pre-produce videos based on the storyline of the target text and upload them to the server for the client to invoke when video playback is triggered; or user-uploaded self-made videos can be matched and associated with the target text content to form a playable target video resource. Regardless of the acquisition method used, as long as it ensures that the target video corresponds to the target text content and accurately conveys the plot, characters, and scene information in the text, it is acceptable.
[0059] In one exemplary implementation, the server can generate text-based videos using a text-based video big data model, thereby directly generating target videos from target text. The process is described in detail below: 1. Text Input Processing: Text input processing is the first step in the operation of the Wensheng video large-scale model. The model segments the target text into words or phrases. This step typically uses natural language processing techniques. Then, the model performs word embedding on the segmented results, converting each word or phrase into a vector representation for subsequent text encoding. Word embedding techniques are typically implemented using bag-of-words models or word embedding models.
[0060] 2. Text Encoding: The aforementioned vector representations are then processed by a text encoder to convert the text into high-dimensional vector representations. These high-dimensional vectors are mathematical representations in a high-dimensional space, capturing the semantic and syntactic information of the text. This step typically uses deep learning techniques such as Recurrent Neural Networks (RNNs) or Transformers. During text encoding, the model learns the text's syntax and semantics and converts it into vector representations based on contextual relationships.
[0061] 3. Video Generation Network: After the text encoder obtains a high-dimensional vector representation, it is input into the video generation network. The video generation network typically consists of a generator and a discriminator. The generator is responsible for generating video content, while the discriminator is responsible for evaluating the realism of the generated video. The video generation network is the core component of the text-to-video model, responsible for converting the high-dimensional vector representation obtained from text encoding into video content. This step typically uses deep learning techniques such as Generative Adversarial Networks (GANs) or Variational Autoencoders (VAEs).
[0062] 4. Video Generation by Generator: The generator receives a high-dimensional vector representation from the text encoder and generates the corresponding video content based on this representation. The generator is typically a deep neural network that can learn the relationships between different elements. The generator is a crucial part of the video generation network, responsible for generating video content from the high-dimensional vector representation obtained from the text encoding.
[0063] 5. Training Process: During video generation, the generator continuously optimizes its parameters to make the generated video content more realistic. This optimization process is achieved through a game between the generator and the discriminator. The discriminator evaluates whether the video generated by the generator is realistic, and the generator adjusts its generated video content based on the feedback.
[0064] 6. Generate video output: After the generator produces video content, it can output it as a video file of the target video.
[0065] Please refer to Figure 3 It illustrates the page interaction method in this disclosure. Figure 1 For example, if the target text is a novel, please refer to... Figure 3 (a) On page 310 of the application used to promote the novel, media content 320 related to the novel's plot and a reading entry point 330 can be displayed. If a user clicks on the reading entry point 330, the novel text can be displayed in the novel reader (target page), and the user can browse the novel in its original text form.
[0066] However, if the user clicks on media content 320, the novel reader (target page) will still be displayed, but it will be presented in video playback mode. Please refer to [link / reference]. Figure 3 (b) shows the target page 340 in video playback mode, which can directly play the video corresponding to the novel.
[0067] This design demonstrates the compatibility of related technologies with the solutions disclosed herein. Specifically, when users select the traditional reading entry point, the original text reading logic can be followed, ensuring a stable experience for users in their familiar reading habits. When users interact with media content, the system automatically switches to video playback mode, transforming the plot that users originally needed to construct through textual imagination into intuitive dynamic images. This compatibility is not a simple addition of functions, but rather, while preserving the core reading experience, it expands the dimensions of content presentation through precise recognition and response to user interaction behaviors. This allows the same target page to provide differentiated services based on the user's immediate choices, satisfying the preference of some users for text reading while also providing a more vivid way for users seeking an immersive experience. Thus, without increasing the complexity of user operations, it enhances the flexibility of page interaction and the attractiveness of the content.
[0068] As can be seen, the method disclosed herein achieves a smooth transition and deep association between the media content related to the text and the playback of the corresponding video after the media content is displayed and triggered. This is achieved by displaying the target page in video playback mode and playing the target video after the media content is shown. This method breaks away from the inherent pattern of directly displaying text to users in text promotion scenarios in related technologies. Instead, it directly displays text-based videos, enhancing the expressiveness of information through video, conveying more information, and increasing user appeal. This makes it easier for users to understand the substantive information conveyed by the text and makes them more willing to stay on the target page to receive text-related information, thereby improving the effectiveness, reach, and efficiency of text information dissemination.
[0069] In one exemplary embodiment, the method further includes: displaying a first prompt message on the target page in the video playback mode, the first prompt message being used to instruct the reading of the target text; stopping the playback of the target video and switching the target page to a listening mode when the first prompt message is triggered; displaying text content in the target text corresponding to the playback progress of the target video on the target page in the listening mode; and playing audio corresponding to the text content.
[0070] Taking a novel as the target text as an example, the novel reader (target page) supports multiple modes, one of which is "video playback mode". This mode is the first-use mode for users who enter the novel reading page through promotion from a media content promotion page. In this mode, the novel reader plays a video corresponding to the novel.
[0071] During video playback, a "first prompt message" is also displayed in this video playback mode. Its purpose is to provide an entry point for users to access other modes of the novel reader, allowing them to switch from the current video playback mode to another mode offered by the novel reader—the audio-visual mode. For example, the text content of this first prompt message could be: "Read the original text" or "Listen to the original text."
[0072] "Listen / Read Mode" is another state of the target page, as opposed to video playback mode. In this mode, the target page is mainly used to display text content and play the corresponding audio. The text content corresponding to the playback progress of the target video refers to the part of the novel text that matches the content at a certain point in the video playback. For example, if the video plays to the 5-minute mark and narrates the content of the third section of the first chapter, then the text displayed at that time is the text of the third section of the first chapter.
[0073] Playing audio corresponding to text content means playing the audio corresponding to that part of the text. For example, the audible audio can be generated by the server through technologies such as speech synthesis and is pre-stored in the cloud or locally. When the user switches to the listening mode, the corresponding text paragraph is located according to the current video playback progress, and then the audio file of that paragraph is retrieved from the storage location for playback.
[0074] In audio-visual mode, the target page displays text content in a clear layout, using appropriately readable font size, line spacing, and background color. The text corresponding to the playing audio is highlighted or dynamically marked, allowing users to visually see the position of the currently playing content within the text while listening to the audio. This synchronization of auditory and visual elements enhances the user's reading experience. When the audio finishes playing or the user manually switches text paragraphs, the displayed text content is automatically updated, and the corresponding audio is played, ensuring a smooth and continuous audio-visual experience.
[0075] In this implementation, after watching the video, users can naturally transition to the audio-visual mode through the initial prompt, catering to the diverse content consumption habits of different users. Some users may prefer the dynamic presentation of videos, while others may prefer the immersive experience of listening to and reading text simultaneously. This mode switching allows users to choose according to their needs, improving user satisfaction with the product. In audio-visual mode, the text content corresponds to the video playback progress, and audio is played synchronously. This is equivalent to providing users with a multi-sensory learning method that combines "audiovisual" elements. Users can cross-reference text reading, audio listening, and their memory of previous video footage, thereby gaining a deeper understanding of the information conveyed by the text. This method effectively lowers the comprehension threshold, especially for texts with complex plots or abstract content.
[0076] Furthermore, this implementation method increases the time users spend on the target page and the depth of their interaction. The smooth switching from video playback mode to audio-visual mode, along with the rich content presentation formats offered by the audio-visual mode itself, can continuously attract users' attention, reduce the likelihood of users leaving the page due to boredom from a single mode, and ultimately improve the dissemination effect of text information.
[0077] In one exemplary embodiment, when the first prompt message is triggered, the method further includes: On the target page in the listening mode, a second prompt message is displayed. The second prompt message includes an identifier corresponding to the video playback mode and a status control corresponding to the video playback mode. The status control is used to indicate or control the status of the video playback mode. When the identifier or the status control is triggered, the target page is switched to the video playback mode, and playback of the target video is resumed, with the playback progress of the target video corresponding to the playback progress of the audio.
[0078] The second prompt refers to the set of interactive elements displayed on the target page in audio-visual mode to guide users back to video playback mode. This includes a "video playback mode identifier" and "status controls." The video playback mode identifier can be a visual symbol (such as a "video" icon, text label, etc.) to clearly indicate to the user that the element is associated with video playback mode, helping them quickly identify the function entry point. Status controls are used to indicate or control the video playback mode status, commonly in the form of play / pause buttons, which intuitively reflect whether the video is currently playing and support direct user operation.
[0079] Taking the novel "Star Trek" as an example, when a user triggers the "listen-reading mode", the page switches to a state where text and audio play simultaneously. When the user sees "the spaceship engine suddenly overloads and the instrument panel flashes red" in the listen-reading mode, if they want to watch the corresponding animated scene, they can quickly locate the switching entry through the second prompt information. When the user clicks the "video playback mode corresponding icon" or the status control, the page immediately switches from the listen-reading mode back to the video playback mode. The video continues to play from 15 minutes and 3 seconds, and at this time the video screen corresponds completely to the current progress, that is, the scene of the spaceship engine overload appears in the video.
[0080] In this implementation, users do not need to exit the listening / reading mode to find the video entry point again. They can switch back to video playback mode with one click through the second prompt, and the progress is automatically aligned, avoiding gaps in content comprehension caused by interruptions in operation. For example, if a user makes a visual association with a certain scene while listening / reading, they can quickly resume video playback through the status control, achieving a seamless connection between "audio-visual" and meeting personalized needs such as "listening while watching" and "switching as needed." For example, users can first quickly browse the text content in listening / reading mode, and then switch to video to watch the details when encountering key scenes. This design makes the three media of text, audio, and video complementary, retaining the "immersive information reception" advantage of listening / reading mode while enhancing the visual impact through video mode, ultimately improving the user's overall understanding and memory of the content.
[0081] Please refer to Figure 3 (b) shows the target page 340 in video playback mode, which displays the first prompt message 350. The text corresponding to the first prompt message 350 is "Read the original text". When the first prompt message 350 is triggered, please refer to... Figure 3 (c) The target page 340 enters the listening / reading mode. In listening / reading mode, the target page displays a second prompt message 360, which includes a video playback mode corresponding identifier 361 and a status control 362 corresponding to the video playback mode. When any one of the second prompt message 360, the video playback mode corresponding identifier 361, or the status control 362 corresponding to the video playback mode is triggered, the page can switch back to video playback mode.
[0082] In one exemplary embodiment, the second prompt message further includes a close control, and the method further includes: when the close control is triggered, canceling the display of the second prompt message; displaying a third prompt message, the third prompt message being used to prompt the use of the video playback mode, the page area occupied by the third prompt message being smaller than the page area occupied by the second prompt message.
[0083] When the page is in audio-visual mode, the second prompt message, in addition to the corresponding identifier and status controls for switching back to video playback mode, also includes a "close control." When the user triggers this close control (e.g., by clicking the button), the previously displayed second prompt message disappears from the page and is no longer displayed; then, a new third prompt message is displayed on the page. The purpose of this third prompt message is to continue to remind the user that video playback mode is available, but the space it occupies on the page (e.g., height, width, or overall area) is smaller than that of the previous second prompt message.
[0084] This design enhances user flexibility and autonomy. If users find the second prompt in audio-visual mode too bulky and disruptive to their reading experience, they can disable it by triggering a close control, resulting in a cleaner interface and improved immersion. Secondly, even after disabling the second prompt, a smaller third prompt still guides the user. This means users won't completely forget about the video playback mode; when they need to switch back, they can easily find the entry point through the third prompt, preventing functional disconnection. The smaller size of the third prompt helps maintain interface cleanliness and information accessibility without interfering with the main audio content, achieving a balance between functional prompts and user experience. It satisfies some users' need for a simple interface while ensuring the discoverability of core functions.
[0085] Please refer to Figure 3 (b) The second notification message 360 also includes a close control 363. When the close control 363 is triggered, the display of the second notification message 360 can be cancelled, and instead... Figure 3 In (d), the third prompt message 370 is displayed. In addition, the page also includes a control with the text label "listen". When the control is triggered, it enters the audio playback mode. This mode will be described in detail below.
[0086] In one exemplary embodiment, playing the audio corresponding to the text content includes: upon receiving a preset control operation, using a preset position in the currently displayed text content as the starting point for audio playback and then playing the audio.
[0087] This disclosure does not limit the preset control operations. These operations can be triggered by specific gestures (such as double-clicking text) or keyboard shortcuts, or by setting corresponding controls on the target page and triggering these controls to execute the preset control operations. The preset position is a default position, such as the beginning of the current page. Once the preset control operation is received, the audio automatically plays from the beginning. The advantage of this design is that it allows users to quickly calibrate the audio playback progress through preset control operations, enabling users to quickly adjust the audio playback progress even when the audio is out of sync.
[0088] In one exemplary embodiment, the method further includes: displaying settings controls on a target page in the video playback mode; switching the target page from the video playback mode to a video control mode when the settings controls are triggered; displaying tone control controls on the target page in the video control mode; displaying a tone selection panel, which includes at least one tone, when the tone selection panel is triggered; and playing the target video using the selected tone when the tone selection panel is triggered.
[0089] The target page in video playback mode also displays settings controls that can switch the target page to another mode not mentioned earlier—video control mode. In video control mode, the target page displays more video-related control options (such as tone adjustment, picture quality settings, etc.), allowing users to fine-tune playback parameters. The tone control controls are interactive elements used to adjust audio tone in video control mode; triggering them opens the tone selection panel. The tone selection panel includes a selection interface for preset tones (such as "clear vocals," "deep narration," "cartoon sound effects," etc.), which users can select by clicking.
[0090] Taking a science fiction novel as an example, a dynamic comic video corresponding to the novel is playing for the user. In the video, the protagonist's mechanical prosthetic limbs emit metallic scraping sounds, and the narration is a cold, electronic voice. If the user feels the electronic voice lacks emotion and wants to change it to a more human-like tone, they can use the settings controls to switch to video control mode. In this mode, clicking the tone control will bring up a tone selection panel listing four options: "Warm Human Voice," "Mechanical Electronic Voice," "Youthful Voice," and "Calm Narration." Selecting "Warm Human Voice" closes the panel, and the video automatically replays with the new tone. The originally cold narration is replaced by a soft human voice, which better matches the protagonist's delicate psychological state.
[0091] In this implementation, the timbre selection function allows users to adjust the audio style in the video according to the type of content or personal preference, avoiding the monotony of a single timbre and enhancing immersion. The hierarchical design of "Settings Controls → Video Control Mode → Timbre Selection Panel" hides advanced functions in secondary interfaces, avoiding clutter in playback mode while ensuring users can quickly access adjustment functions, thus balancing simplicity and functionality.
[0092] In one exemplary embodiment, the method further includes: displaying a subtitle control on a target page in the video control mode; displaying corresponding subtitles on the playback screen of the target video when the subtitle control is set to enabled; and de-displaying the corresponding subtitles on the playback screen of the target video when the subtitle control is set to disabled.
[0093] When the user switches the page to video control mode, a subtitle control is displayed. The user can click this control to toggle the subtitle display status. Specifically, when the user sets the subtitle control to "on," subtitles corresponding to the video content will be automatically overlaid and displayed synchronously on the screen of the currently playing target video, using a preset style (such as a specific font, size, color, and background transparency). Conversely, when the user sets the subtitle control to "off," the subtitles originally displayed on the video screen will be immediately hidden and no longer displayed.
[0094] Subtitles aid in better understanding video content, and this design enhances the user's personalized viewing experience. Different users have different needs for subtitles; some users prefer subtitles to aid understanding or note-taking, while others prefer a clean, unedited view for a more immersive visual experience. The subtitle control gives users the right to choose, satisfying diverse viewing habits. Similar to the aforementioned audio selection function, this design also reflects a balance between hierarchical and simple interface interaction. The subtitle control is only displayed in video control mode, avoiding an overly cluttered main interface in playback mode, maintaining a clean look, while allowing users who need it to easily find and use the subtitle function, further optimizing the overall user experience.
[0095] In conjunction with the preceding text Figure 3 To see, when the settings control 380 is triggered on the target page in video playback mode, please refer to... Figure 4 It illustrates the page interaction diagram of this disclosure. Figure 2 When the settings control 380 on the target page in video playback mode is triggered, the following will be displayed: Figure 4 (a), Figure 4 (a) is the target page in video control mode. This page displays the timbre control 410 and the subtitle control 420, allowing users to adjust the timbre and subtitles according to their needs. The timbre control 410 provides multiple preset timbre options; clicking this control will... Figure 4 (b) A pop-up sound selection list appears, containing different sound styles. After the user selects the corresponding option, the background music or narration sound of the video will switch in real time to match the user's auditory preferences. The subtitle control 420 adopts a switch design. When the control is on, subtitles will be displayed on the video screen according to the preset style. Users can quickly switch between on and off states by clicking the control, making the operation convenient and intuitive.
[0096] In one exemplary embodiment, the method further includes: displaying a first control on a target page in the video control mode; switching the target page in the video control mode to an audio playback mode when the first control is triggered; and playing audio on the target page in the audio playback mode that corresponds to the playback progress of the target video and is determined based on the target text.
[0097] In video control mode, a primary control is displayed on the target page. When the user triggers this primary control, the current video control mode switches to audio playback mode, a new mode not mentioned earlier. In audio playback mode, the target page focuses on playing audio and performing related controls. For example, if the user triggers the primary control to switch to audio playback mode at the 10-minute mark of the video playback, the audio will resume playback from the 10-minute mark, achieving a seamless transition in playback progress.
[0098] This implementation satisfies users' need to listen to audio content only in specific scenarios. For example, when users are commuting, engaged in physical activity, or in environments where watching video is inconvenient, they can switch to audio mode by triggering the first control without interrupting their current viewing progress, thus continuing to acquire information and improving ease of use and scenario adaptability. Users can more flexibly choose the content presentation method according to their own state and environment. By ensuring that audio playback starts from the current video progress, seamless switching between video and audio modes is achieved, avoiding the need for users to reposition the playback position and ensuring the continuity and smoothness of the experience. Adding this interaction logic makes the page interaction method more complete, providing users with richer functional options, reflecting the design consideration for diverse user needs.
[0099] Please refer to Figure 4 ,exist Figure 4 (a) In the target page of the video control mode, a first control 430 is displayed. When the user triggers the first control 430, it displays... Figure 4 (c) shows the target page 440 in audio playback mode.
[0100] In one exemplary embodiment, the method further includes: displaying a first prompt message on the target page in the audio playback mode, the first prompt message being used to instruct the reading of the target text; switching the target page to a listening / reading mode when the first prompt message is triggered; displaying text content in the target text corresponding to the current audio playback progress on the target page in the listening / reading mode; and continuing to play the audio.
[0101] When a user enters audio playback mode by triggering the first control, a first prompt message will be displayed on the target page. This prompt message, as mentioned earlier, guides the user into the previously mentioned listening / reading mode. When the user triggers (e.g., clicks) this prompt message, the target page switches from simple audio playback mode to listening / reading mode. In listening / reading mode, in addition to continuing to play the audio, the page will simultaneously display the relevant content of the target text, clearly identifying the specific text segment corresponding to the current audio playback progress through methods such as highlighting, underlining, font color changes, or scrolling positioning.
[0102] The listening-to-read mode combines auditory and visual information, allowing users to deepen their understanding of the content by reading synchronized text while listening to audio. From video playback mode to video control mode, from video control mode to audio playback mode, and from audio playback mode back to listening-to-read mode, each mode switch is based on the user's active trigger and explicit needs. The audio playback progress remains consistent throughout the switching process, preventing audio content repetition or interruption due to mode switching, ensuring a smooth and seamless listening-to-read experience. This design fully considers user habits in different scenarios, significantly improving interaction flexibility and meeting various user needs through diverse mode switching, thus enhancing the scenario inclusivity of this disclosure.
[0103] Please refer to Figure 4 (c) The target page 440 in the audio playback mode shown includes a first prompt message 450, which reads "Read the original text," similar to the previous text. Figure 3 The first prompt message is the same as the previous one. When this first prompt message 450 is triggered, the target page in the listening / reading mode is displayed. This target page in the listening / reading mode is the same as the one mentioned earlier. Figure 3 (c) Maintain consistency.
[0104] In one exemplary embodiment, the method further includes: displaying a second control on a target page in the video control mode; and switching the target page in the video control mode to a video playback mode when the second control is triggered.
[0105] The second control displayed on the target page in video control mode serves as a core function, providing users with an intuitive and convenient entry point for quickly switching between video control and video playback modes. This second control further enhances the closed-loop interaction between modes. As mentioned earlier, mode switching is based on user-initiated triggers and explicit needs. The existence of the second control creates a bidirectional and convenient switching channel between video control and playback modes, avoiding any operational lag that might occur when switching between different modes and improving the smoothness and intuitiveness of the operation. Secondly, it strengthens the user's sense of control over the video playback process. Users can flexibly switch between control and playback modes according to their needs, precisely adjusting the video while readily returning to an immersive viewing experience. This flexible switching further solidifies the foundation for a seamless and smooth user experience.
[0106] Please refer to Figure 4 (c) The target page 440 in the audio playback mode shown includes a second control 460. When the second control 460 is triggered, the target page in the video playback mode is displayed. The target page in the video playback mode is the same as described above. Figure 3 (a) Maintain consistency.
[0107] Please refer to Figure 5 The diagram illustrates the mode switching of the target page in this disclosure. The target page offers four modes: video playback mode, video control mode, audio playback mode, and audio reading mode.
[0108] like Figure 5 As shown, when the target page is in video playback mode, the user can switch to video control mode by triggering the aforementioned settings control. At this time, the interface will display detailed playback control options, such as progress bar, volume adjustment, playback speed selection, etc., to facilitate precise operation by the user. In video control mode, after the user completes the operation, they can return to video playback mode by triggering the aforementioned second control.
[0109] When a user triggers the aforementioned first prompt in video playback mode, the target page switches from video playback mode to audio-visual mode. In audio-visual mode, the user can switch back to video playback mode by triggering the second or third prompt.
[0110] In video control mode, triggering the aforementioned first control switches to audio playback mode. In this mode, the target text information is not displayed; only core audio control elements such as play / pause, previous / next track, and volume adjustment are shown. In audio playback mode, triggering the aforementioned first prompt message switches to listening / reading mode.
[0111] The seamless switching between these four modes enhances user convenience, allowing users to quickly adjust the page state according to their immediate needs and avoiding the problems of functional redundancy or inadequacy in a single mode. Flexible mode switching effectively reduces the number of steps and cognitive burden for users in different usage scenarios. For example, switching from video playback to audio-only playback can be completed directly within the target page without exiting the current page and opening other applications, significantly improving the continuity and smoothness of the user experience. At the same time, this multi-mode design fully covers users' needs in video viewing, content control, and audio listening, enhancing the target page's scenario adaptability and functional completeness.
[0112] In one exemplary embodiment, the method further includes: displaying a fourth prompt message when the content corresponding to the target text has finished playing or when an operation to exit the target page is received, the fourth prompt message being used to display at least one recommended text; and continuing to display the content corresponding to the triggered recommended text on the target page when any of the recommended texts is triggered.
[0113] When the content corresponding to the current target text finishes playing, or when the user actively exits the target page (e.g., clicks the back button or closes the window), a "fourth prompt message" containing at least one recommended text can be displayed to the user. These recommended texts can have a certain degree of relevance to the target content that the user has just viewed or read, such as being thematically related, having a similar style, or being text related to potential points of interest mined from the user's historical behavior data.
[0114] When a user selects and triggers one of these recommended texts, the content corresponding to that triggered recommended text can be immediately displayed on the user's current target page, without needing to jump to a new page or window. This achieves a seamless content transition and continuous browsing experience. This design effectively reduces the user's operation path, improves the convenience of content acquisition, and maintains the continuity of user attention by switching recommended content within the same page, enhancing the smoothness of page interaction and user engagement.
[0115] This design provides users with seamless content transitions, preventing them from feeling lost about "what to watch next" after finishing one piece of content. It reduces the operational and time costs of searching for new content, improves content discovery efficiency and overall user experience, and extends the time users spend on target pages. This design subtly guides users to explore content, forming a virtuous cycle of "watching - exiting / ending - receiving recommendations - triggering new content - continuing to watch," enhancing the efficiency and depth of content interaction.
[0116] Please refer to Figure 6It illustrates the page interaction diagram in this disclosure. Figure 3 Please refer to this. Figure 6 (a) The user is watching the video on the target page in video playback mode. If the user wants to exit video playback mode and leave the target page, they can... Figure 6 (b) Display a fourth prompt message to the user, which includes multiple recommended texts, if the user clicks... Figure 6 (b) The close control allows the user to exit the target page. If the user clicks on a recommendation text, the video corresponding to that recommendation text can continue to be displayed on the target page.
[0117] like Figure 6 As shown in (c), if the user scrolls to the last episode, continuing to scroll up will display a prompt informing the user that they have reached the last episode. Continuing to scroll up at this point will then display the aforementioned fourth prompt message, such as... Figure 6 (b) The interaction logic with the fourth prompt has been described above and will not be repeated here.
[0118] Figure 7 This is a block diagram illustrating a page interaction device according to an exemplary embodiment. (Refer to...) Figure 7 The device includes: The display module 710 is configured to display the media content corresponding to the target text. The interaction module 720 is configured to, when the media content is triggered, display a target page corresponding to the target text, the target page being used to display the content corresponding to the target text, and the target page being in video playback mode; and to play the target video corresponding to the target text on the target page being in video playback mode.
[0119] In one exemplary implementation, the interaction module 720 is configured to perform: On the target page in the video playback mode, a first prompt message is displayed, which is used to instruct the reader to read the target text; When the first prompt message is triggered, the playback of the target video is stopped and the target page is switched to listening mode; On the target page in the listening mode, the text content in the target text that corresponds to the playback progress of the target video is displayed; Play the audio corresponding to the text content.
[0120] In one exemplary implementation, the interaction module 720 is configured to perform: On the target page in the listening mode, a second prompt message is displayed. The second prompt message includes an identifier corresponding to the video playback mode and a status control corresponding to the video playback mode. The status control is used to indicate or control the status of the video playback mode. When the identifier or the status control is triggered, the target page is switched to the video playback mode, and playback of the target video is resumed, with the playback progress of the target video corresponding to the playback progress of the audio.
[0121] In one exemplary embodiment, the second prompt message further includes a close control, and the interaction module 720 is configured to execute: When the close control is triggered, the second prompt message is canceled from being displayed; A third prompt message is displayed, which is used to prompt the user to use the video playback mode. The page area occupied by the third prompt message is smaller than that occupied by the second prompt message.
[0122] In one exemplary implementation, the interaction module 720 is configured to perform: Upon receiving a preset control operation, the preset position in the currently displayed text content will be used as the starting point for audio playback.
[0123] In one exemplary implementation, the interaction module 720 is configured to perform: On the target page in the video playback mode, display settings controls; When the settings control is triggered, the target page is switched from the video playback mode to the video control mode; In the target page of the video control mode, a tone control control is displayed; When the timbre control is triggered, a timbre selection panel is displayed, the timbre selection panel including at least one timbre; When the timbre selection panel is triggered, the target video is played using the selected timbre.
[0124] In one exemplary implementation, the interaction module 720 is configured to perform: On the target page in the video control mode, display the subtitle control; When the subtitle control is enabled, the corresponding subtitles are displayed on the playback screen of the target video. When the subtitle control is set to off, the corresponding subtitles will not be displayed on the playback screen of the target video.
[0125] In one exemplary implementation, the interaction module 720 is configured to perform: On the target page in the video control mode, the first control is displayed; When the first control is triggered, the target page of the video control mode is switched to audio playback mode; On the target page in the audio playback mode, audio corresponding to the playback progress of the target video and determined based on the target text is played.
[0126] In one exemplary implementation, the interaction module 720 is configured to perform: On the target page in the audio playback mode, a first prompt message is displayed, which instructs the reader to read the target text. When the first prompt message is triggered, the target page will be switched to audio-visual mode; On the target page in the listening mode, the text content in the target text that corresponds to the current audio playback progress is displayed; Continue playing the audio.
[0127] In one exemplary implementation, the interaction module 720 is configured to perform: On the target page in the video control mode, a second control is displayed; When the second control is triggered, the target page of the video control mode is switched to video playback mode.
[0128] In one exemplary implementation, the interaction module 720 is configured to perform: When the content corresponding to the target text has finished playing, or when an operation to exit the target page is received, a fourth prompt message is displayed, which is used to display at least one recommended text. When any of the recommended texts is triggered, the associated target page corresponding to the triggered recommended text is displayed. The associated target page is used to display the content corresponding to the triggered recommended text, and the associated target page is in video playback mode. In the associated target page that is in the video playback mode, the associated target video corresponding to the triggered recommendation text is played.
[0129] Regarding the pilot device in the above embodiments, the specific manner in which each module performs its operation has been described in detail in the embodiments related to the method, and will not be elaborated upon here.
[0130] In an exemplary embodiment, a computer-readable storage medium including instructions is also provided. Optionally, the computer-readable storage medium may be a ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, and optical data storage device, etc. When the instructions in the computer-readable storage medium are executed by a processor of an electronic device, the electronic device is able to perform any of the methods described above.
[0131] In an exemplary embodiment, a computer program product is also provided, the computer program product including a computer program stored in a readable storage medium, wherein at least one processor of a computer device reads from the readable storage medium and executes the computer program, causing the device to perform any of the methods described above.
[0132] Figure 8 This is a block diagram illustrating an electronic device for page interaction according to an exemplary embodiment. The electronic device may be a terminal, and its internal structure diagram may be as follows: Figure 8 As shown, the device may include an RF (Radio Frequency) circuit 810, a memory 820 including one or more computer-readable storage media, an input unit 830, a display unit 840, a sensor 850, an audio circuit 860, a WiFi (Wireless Fidelity) module 870, a processor 880 including one or more processing cores, and a power supply 890, among other components. Those skilled in the art will understand that... Figure 8 The terminal structure shown does not constitute a limitation on the terminal and may include more or fewer components than shown, or combine certain components, or have different component arrangements. Wherein: The RF circuit 810 can be used for receiving and transmitting signals during information transmission or calls. Specifically, it receives downlink information from the base station and hands it over to one or more processors 880 for processing; additionally, it transmits uplink data to the base station. Typically, the RF circuit 810 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a Subscriber Identity Module (SIM) card, a transceiver, a coupler, an LNA (Low Noise Amplifier), a duplexer, etc. Furthermore, the RF circuit 810 can also communicate wirelessly with networks and other terminals. Wireless communication can use any communication standard or protocol, including but not limited to GSM (Global System for Mobile communication), GPRS (General Packet Radio Service), CDMA (Code Division Multiple Access), WCDMA (Wideband Code Division Multiple Access), LTE (Long Term Evolution), email, SMS (Short Messaging Service), etc.
[0133] The memory 820 can be used to store software programs and modules. The processor 880 executes various functional applications and data processing by running the software programs and modules stored in the memory 820. The memory 820 may mainly include a program storage area and a data storage area. The program storage area may store the operating system, application programs required for the functions, etc.; the data storage area may store data created according to the use of the terminal, etc. In addition, the memory 820 may include high-speed random access memory, and may also include non-volatile memory, such as at least one disk storage device, flash memory device, or other volatile solid-state storage device. Accordingly, the memory 820 may also include a memory controller to provide access to the memory 820 for the processor 880 and the input unit 830.
[0134] The input unit 830 can be used to receive input digital or character information, and to generate keyboard, mouse, joystick, optical, or trackball signal inputs related to user settings and function control. Specifically, the input unit 830 may include a touch-sensitive surface 831 and other input devices 832. The touch-sensitive surface 831, also known as a touch display screen or touchpad, can collect touch operations performed by the user on or near it (such as operations performed by the user using a finger, stylus, or any suitable object or accessory on or near the touch-sensitive surface 831), and drive the corresponding connected devices according to a pre-set program. Optionally, the touch-sensitive surface 831 may include two parts: a touch detection device and a touch controller. The touch detection device detects the user's touch position and the signal generated by the touch operation, and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device, converts it into touch point coordinates, and sends it to the processor 880, and can also receive and execute commands sent by the processor 880. In addition, the touch-sensitive surface 831 can be implemented using various types such as resistive, capacitive, infrared, and surface acoustic wave. In addition to the touch-sensitive surface 831, the input unit 830 may also include other input devices 832. Specifically, other input devices 832 may include, but are not limited to, one or more of the following: physical keyboard, function keys (such as volume control buttons, power buttons, etc.), trackball, mouse, joystick, etc. The display unit 840 can be used to display information input by the user or information provided to the user, as well as various graphical user interfaces of the terminal. These graphical user interfaces can be composed of graphics, text, icons, video, and any combination thereof. The display unit 840 may include a display panel 841, which may optionally be configured as an LCD (Liquid Crystal Display), OLED (Organic Light-Emitting Diode), or similar display panel 841. Further, a touch-sensitive surface 831 may cover the display panel 841. When the touch-sensitive surface 831 detects a touch operation on or near it, it transmits the information to the processor 880 to determine the type of touch event. Subsequently, the processor 880 provides corresponding visual output on the display panel 841 according to the type of touch event. The touch-sensitive surface 831 and the display panel 841 can be two independent components to implement input and output functions. However, in some embodiments, the touch-sensitive surface 831 and the display panel 841 can be integrated to achieve input and output functions.
[0135] The terminal may also include at least one sensor 850, such as a light sensor, a motion sensor, and other sensors. Specifically, the light sensor may include an ambient light sensor and a proximity sensor. The ambient light sensor can adjust the brightness of the display panel 841 according to the ambient light level, and the proximity sensor can turn off the display panel 841 and / or the backlight when the terminal is moved to the ear. As a type of motion sensor, a gravity acceleration sensor can detect the magnitude of acceleration in various directions (generally three axes). When stationary, it can detect the magnitude and direction of gravity and can be used for applications that identify the terminal's posture (such as landscape / portrait switching, related games, magnetometer posture calibration), vibration recognition-related functions (such as pedometer, tapping), etc. Other sensors that may be configured on the terminal, such as gyroscopes, barometers, hygrometers, thermometers, and infrared sensors, will not be described in detail here.
[0136] Audio circuitry 860, speaker 861, and microphone 862 provide an audio interface between the user and the terminal. Audio circuitry 860 converts received audio data into electrical signals, which are then transmitted to speaker 861, where they are converted into sound signals for output. Conversely, microphone 862 collects sound signals, converts them into electrical signals, which are then received by audio circuitry 860, converted back into audio data, processed by processor 880, and transmitted via RF circuitry 810 to, for example, another terminal, or output to memory 820 for further processing. Audio circuitry 860 may also include an earphone jack to facilitate communication between a peripheral headset and the terminal.
[0137] WiFi is a short-range wireless transmission technology. This terminal, through the WiFi module 870, can help users send and receive emails, browse web pages, and access streaming media, providing users with wireless broadband internet access. Although Figure 8 WiFi module 870 is shown, but it is understood that it is not a necessary component of the terminal and can be omitted as needed without changing the nature of the invention.
[0138] The processor 880 is the control center of the terminal, connecting various parts of the terminal through various interfaces and lines. It executes software programs and / or modules stored in the memory 820, and calls data stored in the memory 820 to perform various functions and process data, thereby enabling overall monitoring of the terminal. Optionally, the processor 880 may include one or more processing cores; preferably, the processor 880 may integrate an application processor and a modem processor, wherein the application processor mainly handles the operating system, user interaction area, and applications, while the modem processor mainly handles wireless communication. It is understood that the modem processor may not be integrated into the processor 880.
[0139] The terminal also includes a power supply 890 (such as a battery) to power various components. Preferably, the power supply can be logically connected to the processor 880 through a power management system, thereby enabling functions such as charging, discharging, and power consumption management through the power management system. The power supply 890 may also include one or more DC or AC power supplies, a recharging system, a power fault detection circuit, a power converter or inverter, a power status indicator, and other arbitrary components.
[0140] Although not shown, the terminal may also include a camera, Bluetooth module, etc., which will not be described in detail here. Specifically, in this embodiment, the display unit of the terminal is a touch screen display, and the terminal also includes a memory and one or more programs, wherein one or more programs are stored in the memory and configured to be executed by one or more processors of the instructions in the method embodiment of the present invention.
[0141] Please refer to Figure 9 This illustration shows another block diagram of an electronic device for page interaction provided in another exemplary embodiment of this disclosure. The computer device may be a server for performing the page interaction method described above. Specifically: Computer device 900 includes a Central Processing Unit (CPU) 901, a system memory 904 including Random Access Memory (RAM) 902 and Read Only Memory (ROM) 903, and a system bus 905 connecting the system memory 904 and the CPU 901. Computer device 900 also includes a basic input / output system (I / O system) 906 that facilitates information transfer between various devices within the computer, and a mass storage device 907 for storing the operating system 913, application programs 914, and other program modules 911.
[0142] The basic input / output system 906 includes a display 908 for displaying information and an input device 909 for user input, such as a mouse or keyboard. Both the display 908 and the input device 909 are connected to the central processing unit 901 via an input / output controller 190 connected to the system bus 905. The basic input / output system 906 may also include the input / output controller 190 for receiving and processing input from multiple other devices such as a keyboard, mouse, or electronic stylus. Similarly, the input / output controller 190 also provides output to a display screen, printer, or other types of output devices.
[0143] Mass storage device 907 is connected to central processing unit 901 via a mass storage controller (not shown) connected to system bus 905. Mass storage device 907 and its associated computer-readable media provide non-volatile storage for computer device 900. That is, mass storage device 907 may include computer-readable media (not shown) such as hard disk or CD-ROM (CompactDisc Read-Only Memory) drive.
[0144] Without loss of generality, computer-readable media can include computer storage media and communication media. Computer storage media includes volatile and non-volatile, removable and non-removable media implemented using any method or technology for storing information such as computer-readable instructions, data structures, program modules, or other data. Computer storage media includes RAM, ROM, EPROM (Erasable Programmable Read Only Memory), EEPROM (Electrically Erasable Programmable Read Only Memory), flash memory or other solid-state storage technologies, CD-ROM, DVD (Digital Video Disc) or other optical storage, magnetic tape cassettes, magnetic tape, disk storage, or other magnetic storage devices. Of course, those skilled in the art will recognize that computer storage media are not limited to the above-mentioned types. The system memory 904 and mass storage device 907 described above can be collectively referred to as memory.
[0145] According to various embodiments of this disclosure, the computer device 900 can also be connected to a remote computer on a network, such as the Internet. That is, the computer device 900 can be connected to a network 912 via a network interface unit 911 connected to a system bus 905, or the network interface unit 911 can be used to connect to other types of networks or remote computer systems (not shown).
[0146] The aforementioned memory also includes a computer program stored in the memory and configured to be executed by one or more processors to implement the aforementioned page interaction method.
[0147] In an exemplary embodiment, a computer-readable storage medium is also provided, which stores at least one instruction, at least one program, code set, or instruction set, wherein the at least one instruction, at least one program, code set, or instruction set is executed by a processor to implement the page interaction method.
[0148] Optionally, the computer-readable storage medium may include: ROM (Read Only Memory), RAM (Random Access Memory), SSD (Solid State Drives), or optical disc, etc. The random access memory may include ReRAM (Resistance Random Access Memory) and DRAM (Dynamic Random Access Memory).
[0149] In an exemplary embodiment, a computer-readable storage medium including program code is also provided, such as a memory including program code, which can be executed by a processor to complete the page interaction method described above. Optionally, the computer-readable storage medium may be read-only memory (ROM), random access memory (RAM), compact-disc read-only memory (CD-ROM), magnetic tape, floppy disk, and optical data storage device, etc.
[0150] In an exemplary embodiment, a computer program product is also provided, including a computer program that, when executed by a processor, implements the page interaction method described above.
[0151] Other embodiments of this disclosure will readily occur to those skilled in the art upon consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of this disclosure that follow the general principles of this disclosure and include common knowledge or customary techniques in the art not disclosed herein. The specification and examples are to be considered exemplary only, and the true scope and spirit of this disclosure are indicated by the following claims.
[0152] It should be understood that this disclosure is not limited to the precise structures described above and shown in the accompanying drawings, and various modifications and changes can be made without departing from its scope. The scope of this disclosure is limited only by the appended claims.
Claims
1. A page interaction method, characterized in that, The method includes: Display the media content corresponding to the target text; When the media content is triggered, the target page corresponding to the target text is displayed. The target page is used to display the content corresponding to the target text, and the target page is in video playback mode. On the target page in the video playback mode, the target video corresponding to the target text is played.
2. The page interaction method according to claim 1, characterized in that, The method further includes: On the target page in the video playback mode, a first prompt message is displayed, which is used to instruct the reader to read the target text; When the first prompt message is triggered, the playback of the target video is stopped and the target page is switched to listening mode; On the target page in the listening mode, the text content in the target text that corresponds to the playback progress of the target video is displayed; Play the audio corresponding to the text content.
3. The page interaction method according to claim 2, characterized in that, When the first prompt message is triggered, the method further includes: On the target page in the listening mode, a second prompt message is displayed. The second prompt message includes an identifier corresponding to the video playback mode and a status control corresponding to the video playback mode. The status control is used to indicate or control the status of the video playback mode. When the identifier or the status control is triggered, the target page is switched to the video playback mode, and playback of the target video is resumed, with the playback progress of the target video corresponding to the playback progress of the audio.
4. The page interaction method according to claim 3, characterized in that, The second prompt message also includes a close control, and the method further includes: When the close control is triggered, the second prompt message is canceled from being displayed; A third prompt message is displayed, which is used to prompt the user to use the video playback mode. The page area occupied by the third prompt message is smaller than that occupied by the second prompt message.
5. A page interaction method according to claim 2, characterized in that, Playing the audio corresponding to the text content includes: Upon receiving a preset control operation, the preset position in the currently displayed text content will be used as the starting point for audio playback.
6. The page interaction method according to claim 1, characterized in that, The method further includes: On the target page in the video playback mode, display settings controls; When the settings control is triggered, the target page is switched from the video playback mode to the video control mode; In the target page of the video control mode, a tone control control is displayed; When the timbre control is triggered, a timbre selection panel is displayed, the timbre selection panel including at least one timbre; When the timbre selection panel is triggered, the target video is played using the selected timbre.
7. A page interaction method according to claim 6, characterized in that, The method further includes: On the target page in the video control mode, display the subtitle control; When the subtitle control is enabled, the corresponding subtitles are displayed on the playback screen of the target video. When the subtitle control is set to off, the corresponding subtitles will not be displayed on the playback screen of the target video.
8. A page interaction method according to claim 6, characterized in that, The method further includes: On the target page in the video control mode, the first control is displayed; When the first control is triggered, the target page of the video control mode is switched to audio playback mode; On the target page in the audio playback mode, audio corresponding to the playback progress of the target video and determined based on the target text is played.
9. A page interaction method according to claim 8, characterized in that, The method further includes: On the target page in the audio playback mode, a first prompt message is displayed, which instructs the reader to read the target text. When the first prompt message is triggered, the target page will be switched to audio-visual mode; On the target page in the listening mode, the text content in the target text that corresponds to the current audio playback progress is displayed; Continue playing the audio.
10. A page interaction method according to claim 8, characterized in that, The method further includes: On the target page in the video control mode, a second control is displayed; When the second control is triggered, the target page of the video control mode is switched to video playback mode.
11. A page interaction method according to claim 1, characterized in that, The method further includes: When the content corresponding to the target text has finished playing, or when an operation to exit the target page is received, a fourth prompt message is displayed, which is used to display at least one recommended text. When any of the recommended texts is triggered, the associated target page corresponding to the triggered recommended text is displayed. The associated target page is used to display the content corresponding to the triggered recommended text, and the associated target page is in video playback mode. In the associated target page that is in the video playback mode, the associated target video corresponding to the triggered recommendation text is played.
12. A page interaction device, characterized in that, The device includes: The display module is configured to display the media content corresponding to the target text; The interaction module is configured to, when the media content is triggered, display a target page corresponding to the target text, the target page being used to display the content corresponding to the target text, and the target page being in video playback mode; and play the target video corresponding to the target text on the target page in video playback mode.
13. An electronic device, characterized in that, include: processor; Memory used to store the processor's executable instructions; The processor is configured to execute the instructions to implement the page interaction method as described in any one of claims 1 to 11.
14. A computer-readable storage medium, characterized in that, When the instructions in the computer-readable storage medium are executed by the processor of the electronic device, the electronic device is enabled to perform the page interaction method as described in any one of claims 1 to 11.
15. A computer program product, characterized in that, The computer program product includes a computer program stored in a readable storage medium, wherein at least one processor of a computer device reads from and executes the computer program, causing the device to perform the page interaction method as described in any one of claims 1 to 11.