Video transmission method and device
By sending a vertical blanking message and video data to the receiving device, which supports fast video transmission capability, along with video data, the video data transmission latency problem is solved, achieving fast transmission and low power consumption.
Patent Information
- Authority / Receiving Office
- WO · WO
- Patent Type
- Applications
- Current Assignee / Owner
- HUAWEI TECH CO LTD
- Filing Date
- 2025-12-05
- Publication Date
- 2026-06-18
AI Technical Summary
Existing technologies suffer from latency issues in video data transmission, especially when high-speed transmission is required, making it difficult to meet the demand for low latency.
By sending vertical blanking messages and video data to the destination device that supports fast video transmission capabilities, carrying fields indicating the activation of fast video transmission mode, pixel clock, and blanking area information, the transmission mode is dynamically controlled to reduce latency.
It enables fast transmission of video data, reduces transmission latency, and improves transmission efficiency and system power consumption management without requiring manual activation of the fast video transmission mode.
Smart Images

Figure CN2025140541_18062026_PF_FP_ABST
Abstract
Description
Video transmission methods and equipment
[0001] This application claims priority to Chinese patent application filed on December 12, 2024, with application number 202411834977.3, entitled "Video Transmission Method and Apparatus", the entire contents of which are incorporated herein by reference. Technical Field
[0002] This application relates to the field of communications, and more particularly to a video transmission method and device. Background Technology
[0003] With the development of big data, artificial intelligence (AI), and cloud computing technologies, various types of multimedia data are constantly emerging, placing higher demands on the transmission performance of multimedia data between different devices. Taking video data as an example, some scenarios require fast transmission of video data between the sending and display ends; therefore, achieving fast transmission faces numerous challenges. Summary of the Invention
[0004] This application provides a video transmission method and device, thereby dynamically enabling a fast video transmission mode for video data and reducing the transmission latency of video data.
[0005] Firstly, a video transmission method is provided, applied to a source device or a chip within a source device. The method includes: obtaining the fast video transmission capability of the destination device through its Device Comprehensive Capability Description (DCCD); and, if the destination device supports fast video transmission capability, transmitting a first vertical blanking packet (VBP) and first video data to the destination device. The first VBP includes a fast video transmission field, a pixel clock in fast video transmission mode, the number of rows in the front shoulder of the vertical blanking area in fast video transmission mode, and the number of rows in the vertical blanking area in fast video transmission mode. The fast video transmission field takes a first value, which indicates that fast video transmission mode is enabled for the first video data.
[0006] The video transmission method provided in this application, when the source device obtains that the destination device supports fast video transmission capability and the main link bandwidth between the source and destination devices is sufficient, sends a vertical blanking packet (VBP) and video data to the destination device in fast video transmission mode. This reduces the transmission time of effective video data and lowers the transmission latency of video data. The source device includes a field in the VBP packet indicating that the destination device should enable fast video transmission mode, the pixel clock in fast video transmission mode, the number of lines before the vertical blanking area in fast video transmission mode, and the number of lines in the vertical blanking area in fast video transmission mode. This allows the video data to be transmitted to the destination device quickly, enabling the destination device to obtain video parameter information and receive the content of each video frame more quickly. The destination device can then immediately start and complete subsequent processing in advance, thereby reducing display latency.
[0007] The source device can shut down video processing modules (such as video compression modules) for a longer period of time to reduce system power consumption.
[0008] When the destination device supports fast video transmission capability and needs to enable fast video transmission mode, this method enables the destination device to enable fast video transmission mode by carrying a field in the VBP that indicates the need to enable fast video transmission mode. This eliminates the need for manual activation, allowing for more convenient and dynamic control of enabling fast video transmission mode and reducing video data transmission latency.
[0009] In one possible implementation, DCCD includes an advanced video feature capability field, which contains flags indicating whether fast video transmission is supported.
[0010] In another possible implementation, the method further includes determining the pixel clock in fast video transmission mode based on the total number of vertical lines in fast video transmission mode, the horizontal parameters of the original format, and the refresh rate of the original format.
[0011] In another possible implementation, the method further includes: determining the number of lines in the vertical blanking area front shoulder under the fast video transmission mode based on the total number of vertical lines in the fast video transmission mode, the total number of vertical lines in the original format, and the number of lines in the vertical blanking area front shoulder of the original format.
[0012] In another possible implementation, the method further includes: determining the number of vertical blanking lines in fast video transmission mode based on the total number of vertical lines and the number of valid lines in a frame.
[0013] In another possible implementation, the method further includes: determining the maximum total number of vertical lines supported under the original standard based on the maximum pixel clock available in the fast video transmission mode, the horizontal parameters of the original standard, and the refresh rate of the original standard; and selecting the total number of vertical lines in the fast video transmission mode from the total number of vertical lines in the original standard to the maximum total number of vertical lines supported under the original standard.
[0014] In another possible implementation, the method further includes: determining the minimum of the maximum video bandwidth supported by the destination device and the maximum effective bandwidth supported by the path as the available maximum video bandwidth, wherein the maximum effective bandwidth supported by the path is the product of the maximum available bandwidth supported by the path and the link bandwidth utilization; and determining the minimum of the pixel clock corresponding to the available maximum video bandwidth and the maximum pixel clock supported by the destination device as the available maximum pixel clock in fast video transmission mode.
[0015] In another possible implementation, the maximum video bandwidth and the maximum pixel clock supported by the host device are obtained through DCCD.
[0016] In another possible implementation, the elongated portion of the blanking region is determined by the number of vertical blanking region lines in the fast video transmission mode and the number of vertical blanking region lines in the original format.
[0017] In another possible implementation, when the fast video transmission mode is enabled, the pixel clock in the fast video transmission mode is N times the pixel clock of the original standard, where N represents the multiplier of the fast video transmission mode, and N>1.
[0018] The pixel clock refers to the frequency of the clock signal used to control pixel data transmission, and it determines the transmission speed of video data. Increasing the pixel clock speed can improve the transmission speed of video data.
[0019] Compared to the frame rate multiplier of High-Definition Multimedia Interface (HDMI), which can only be an integer multiple, the fast video transmission multiplier provided in this application is a non-integer multiple. In calculating the pixel clock and fast video transmission multiplier in fast video transmission mode, the granularity of QVT is at the row level. Fast video transmission is applied to each row of video data; that is, fast video transmission is applied for non-integer multiples of video frames. Therefore, frame-level control of video data can be achieved to enable fast video transmission mode for a specific video frame, offering high flexibility.
[0020] In another possible implementation, transmitting the first VBP and the first video data to the destination device includes: transmitting a first video frame to the destination device, the first video frame including at least one first VBP and a plurality of first valid video packets (AVPs), the plurality of first AVPs being used to carry the first video data.
[0021] In this application, a VBP is sent in the video frame. The VBP includes fields for fast video transmission mode, pixel clock in fast video transmission mode, number of lines in the front shoulder of the vertical blanking area in fast video transmission mode, and number of lines in the vertical blanking area in fast video transmission mode. In this way, frame-level control of video data can be achieved to enable fast video transmission mode for a certain video frame, reduce the transmission latency of video data, and provide high flexibility.
[0022] In another possible implementation, when the fast video transmission mode is enabled, the bandwidth occupied by the first video frame is N times that of the bandwidth in the normal video transmission mode.
[0023] Increasing video bandwidth while keeping the frame rate constant allows for faster transmission of effective video data and reduces transmission latency.
[0024] In another possible implementation, when the fast video transmission mode is enabled, the transmission time of the effective video data of the first video frame is reduced.
[0025] In another possible implementation, when the fast video transmission mode is enabled, the effective video time of one line in the fast video transmission mode is 1 / N times that of the effective video time of one line in the normal video transmission mode.
[0026] In another possible implementation, when the fast video transmission mode is enabled, the horizontal blanking time in the fast video transmission mode is 1 / N times the horizontal blanking time in the normal video transmission mode.
[0027] In another possible implementation, the method further includes transmitting a second VBP and second video data to the destination device. The second VBP includes a fast video transmission field, which takes the value of a second value to indicate that the fast video transmission mode is turned off for the second video data.
[0028] In another possible implementation, transmitting the second VBP and the second video data to the destination device includes: transmitting a second video frame to the destination device, the second video frame including at least one second VBP and a plurality of second valid video packets (AVPs), the plurality of second AVPs being used to carry the second video data.
[0029] Secondly, a video transmission method is provided, applied to a destination device or a chip in the destination device, the method comprising: receiving a first VBP and first video data sent by a source device; and, when the fast video transmission field included in the first VBP is a first value, enabling a fast video transmission mode for the first video data, wherein the first value is used to indicate enabling the fast video transmission mode for the first video data.
[0030] In this application, after the receiving device receives the first VBP and the first video data sent by the source device, it starts the fast video transmission mode according to the fast video transmission instruction in the first VBP sent by the source device. In the fast video transmission mode, the first video data is received quickly, which can reduce the transmission latency of the video data.
[0031] In one possible implementation, when the fast video transmission mode is enabled, the pixel clock in the fast video transmission mode is N times the pixel clock of the original standard, where N represents the multiplier of the fast video transmission mode, and N>1.
[0032] In another possible implementation, receiving a first vertical blanking message (VBP) and first video data sent by a source device includes: receiving a first video frame sent by the source device, the first video frame including at least one first VBP and multiple first valid video messages (AVPs), the multiple first AVPs being used to carry the first video data.
[0033] In another possible implementation, when the fast video transmission mode is enabled, the transmission bandwidth occupied by the first video frame is N times that of the normal video transmission mode.
[0034] In another possible implementation, when the fast video transmission mode is enabled, the transmission time of the effective video data of the first video frame is reduced.
[0035] In another possible implementation, the method further includes: receiving a second vertical blanking message (VBP) and second video data sent by a source device; and disabling the fast video transmission mode for the second video data if the fast video transmission field included in the second VBP is a second value, wherein the second value is used to indicate that the fast video transmission mode is disabled for the second video data.
[0036] In another possible implementation, receiving a second vertical blanking message (VBP) and second video data sent by a source device includes: receiving a second video frame sent by the source device, the second video frame including at least one second VBP and multiple second valid video messages (AVPs), the multiple second AVPs being used to carry the second video data.
[0037] Thirdly, a video transmission apparatus is provided, comprising modules for performing operational steps of the method in the first aspect or any possible implementation thereof. For example, the video transmission apparatus includes a receiving module and a transmitting module. The receiving module is configured to acquire the fast video transmission capability of the destination device via the DCCD of the destination device. The transmitting module is configured to transmit a first VBP and first video data to the destination device, provided that the destination device supports fast video transmission capability. The first VBP includes a fast video transmission field, a pixel clock in fast video transmission mode, the number of lines in the front shoulder of the vertical blanking area in fast video transmission mode, and the number of lines in the vertical blanking area in fast video transmission mode. The fast video transmission field takes the value of a first value, which indicates that fast video transmission mode is enabled for the first video data.
[0038] Fourthly, a video transmission apparatus is provided, comprising modules for performing operational steps of the method in the second aspect or any possible implementation of the second aspect. For example, the video transmission apparatus includes a receiving module and a processing module. The receiving module is configured to receive a first VBP and first video data transmitted by a source device. The processing module is configured to enable a fast video transmission mode for the first video data if a fast video transmission field included in the first VBP is a first value, the first value indicating that the fast video transmission mode is enabled for the first video data.
[0039] Fifthly, a source device is provided, comprising: a memory, a transceiver, and a processor; the memory, transceiver, and processor are configured to cooperatively perform the method described in the first aspect and any of its possible implementations.
[0040] A sixth aspect provides a receiving device comprising: a transceiver and a processor; the transceiver and the processor being configured to cooperatively perform the method described in the second aspect and any of its possible implementations.
[0041] In a seventh aspect, a video transmission system is provided. This video transmission system includes a source device provided in the fifth aspect and a destination device provided in the sixth aspect. The source device can be used to implement the functions of the source device in the first or second aspect, and the destination device can be used to implement the functions of the destination device in the first or second aspect. Therefore, this video transmission system can also achieve the beneficial effects of the methods in the aforementioned first and second aspects.
[0042] Eighthly, a computer-readable storage medium is provided, storing computer instructions that, when executed on a computing device, perform the method of the first aspect and its possible implementations, or the method of the second aspect and its possible implementations. This includes situations where the computing device is a source device or a destination device as described above.
[0043] Ninthly, a computer program product is provided, comprising computer instructions that, when executed on a computing device, perform any one of the methods of the first aspect and its possible implementations. Such computing device may be the aforementioned source device or destination device, etc.
[0044] A tenth aspect provides a chip system comprising: a processor for retrieving and running a computer program from a memory, causing a computing device equipped with the chip system to perform the method described in the first aspect and its possible implementations, or in any one of the second aspect and its possible implementations. The computing device may be a source device or a destination device as described above.
[0045] The beneficial effects achieved by the technical solutions of the second to tenth aspects of this application and their corresponding possible implementations can be found in the above description of the technical effects of the first aspect and its corresponding possible implementations, and will not be repeated here.
[0046] Based on the implementation methods provided in the above aspects, this application can be further combined to provide more implementation methods. Attached Figure Description
[0047] Figure 1 is a schematic diagram of an audio-visual transmission system provided in this application;
[0048] Figure 2 is a schematic diagram of an audio-visual encoding and decoding system provided in this application;
[0049] Figure 3 is a logical diagram of an audio / video transmission adapter provided in this application;
[0050] Figure 4 is a logic diagram of an audio / video receiving adapter provided in this application;
[0051] Figure 5 is a schematic diagram of an advanced video feature capability negotiation process provided in this application;
[0052] Figure 6 is a flowchart illustrating a video transmission method provided in this application;
[0053] Figure 7 is a schematic diagram of the structure of a Vertical Blanking Message (VBP) provided in this application;
[0054] Figure 8 is a schematic diagram of the structure of a video frame provided in this application;
[0055] Figure 9 is a schematic diagram of the structure of an effective video packet AVP provided in this application;
[0056] Figure 10 is a schematic diagram of an elongated portion of a vertical blanking region provided in this application;
[0057] Figure 11 is a timing diagram of a video frame provided in this application;
[0058] Figure 12 is a flowchart illustrating another video transmission method provided in this application;
[0059] Figure 13 is a structural schematic diagram of a video transmission device provided in this application;
[0060] Figure 14 is a structural schematic diagram of a video transmission device provided in this application;
[0061] Figure 15 is a schematic diagram of the structure of a display device provided in this application. Detailed Implementation
[0062] To facilitate understanding, the main terms used in this application will be explained first.
[0063] Quick video transport (QVT) refers to the rapid transmission of video data from the source device to the destination device. Essentially, QVT reduces the transmission time of the effective video data within a video frame, allowing the destination device to receive the video data more quickly. Effective video data refers to the valid pixel data within a video frame.
[0064] Different scenarios have varying requirements for video data transmission latency. For example, in gaming scenarios, the transmission latency of video data should be as low as possible to provide a better user experience. Other scenarios, however, do not have strict requirements for video data transmission latency. To meet the needs of different scenarios, this application provides a video transmission method for transmitting video between a source device and a destination device. When the source device detects that the destination device has fast video transmission capabilities, the source device controls the destination device to enable or disable the fast video transmission mode. When the main link bandwidth between the source and destination devices is sufficient, the source device includes a field in its vertical blanking packet (VBP) indicating that the destination device should enable the fast video transmission mode, the pixel clock in fast video transmission mode, the number of rows before the vertical blanking area in fast video transmission mode, and the number of rows in the vertical blanking area in fast video transmission mode. The source device then sends the vertical blanking packet and video data to the destination device, enabling the video data to be transmitted to the destination device quickly, thereby reducing the transmission latency of the video data.
[0065] The technical solutions involved in the embodiments of this application are not only applicable to current audio-visual transmission technologies or standards, but may also be applied to future audio-visual transmission technologies or standards. The terminology used in the implementation section of this application is only for explaining specific embodiments of this application and is not intended to limit this application.
[0066] In this embodiment, "video" is a general term, referring to a sequence of multiple consecutive frames, with each frame corresponding to one image. "Audiovisual" is an information application technology term referring to video, audio, or multimedia content that includes both video and audio.
[0067] Video streaming refers to the transmission of video data. For example, a video stream can be processed over a network as a stable and continuous stream. The video (or video data) captured by a video capture device consists of multiple frames of images. After the video data is encapsulated and packaged, a video stream is obtained. A video stream consists of multiple video frames, each corresponding to one image frame.
[0068] The Device Comprehensive Capability Description (DCCD) primarily defines a system framework and data structure for declaring the capabilities supported by the device, such as device parameters, performance attributes, and audio / video capabilities. The DCCD uses a variable-length structure; different capabilities supported by the device are encapsulated in different data blocks within the DCCD. The declaration of supported capabilities can be expanded by adding extension data blocks. The DCCD describes the data content structure and is independent of the communication protocols between devices.
[0069] The video transmission method provided in this application supports video formats including but not limited to RGB, YCbCr4:4:4, YCbCr4:2:2, YCbCr4:2:0, ARGB, Y-only, and RAW.
[0070] The video transmission method provided in this application supports video data component bit widths (or color depths) including but not limited to one or more of the following: 8-bit, 10-bit, 12-bit, or 16-bit. For example, for RGB format video data, if the component bit width is 8-bit, then the R component of a pixel occupies 8 bits, the G component occupies 8 bits, and the B component occupies 8 bits. It can also be described as: the video transmission method supports 8-bit per component (bpc), 10-bpc, 12-bpc, and 16-bpc video data.
[0071] The video transmission method provided in this application will be described in detail below with reference to the accompanying drawings.
[0072] Figure 1 is a schematic diagram of an audio-visual transmission system provided in this application. The video processing includes, but is not limited to: video acquisition, video encoding, video transmission, video decoding, and playback.
[0073] The audio-visual transmission system in Figure 1 includes a set-top box 110, a smart TV 120, multiple audio-visual playback devices, and a server 130. The set-top box 110 connects to the network via a network cable and can receive audio-visual streams from the server 130. The network implements the audio-visual transmission function and includes one or more network devices, such as a router or switch (network device 131). In some optional implementations, the set-top box 110 and the server 130 can also communicate wirelessly; this embodiment is not limited to this approach.
[0074] Set-top box 110 is an audio-visual processing device used to receive, process, and push video or audio streams. In some possible cases, set-top box 110 may also be called an internet TV set-top box, network HD media player, or something else. For example, set-top box 110 refers to a TV box provided by a network operator, or a TV box purchased by the user. The hardware implementation of set-top box 110 is described in Figure 14 below.
[0075] The smart TV 120 is a display device with audio-visual processing capabilities, enabling functions such as receiving, processing, pushing, and playing video or audio-visual streams. In some possible cases, the smart TV 120 refers to audio-visual devices such as conference tablets, smart TVs, or projectors; this application embodiment does not limit this. The hardware implementation of the smart TV 120 is described in Figure 14 or Figure 15 below.
[0076] Multiple audio-visual playback devices include audio-visual playback devices 121 to 124. These audio-visual playback devices include, but are not limited to, multimedia control platforms or other devices supporting audio-visual playback functions, such as virtual reality (VR) terminal devices or augmented reality (AR) terminal devices, etc. The hardware of the audio-visual playback devices is described in Figure 15 below.
[0077] In this embodiment, the set-top box 110 and the smart TV 120 are connected via a network, for example, through an audio / video interface network. The set-top box 110 and various audio / video playback devices can also be connected via an audio / video interface network, as can the smart TV 120 and various audio / video playback devices. Exemplarily, this audio / video interface network is a wired network that supports video and audio / video transmission. Optionally, this audio / video interface network is called a Unified Multimedia Interconnect Network. In some implementations, Unified Multimedia Interconnect Network may also have other names. For example, Unified Multimedia Interconnect Network may also be a General Purpose Multimedia Interface (GPMI) network.
[0078] The Unified Multimedia Interconnection Network supports both uncompressed and compressed video transmission, as well as advanced features such as Quick Video Transport (QVT), Auto Low Latency Mode (ALLM), and Dynamic Frame Rate Refresh (DFR). It also supports LPCM format audio and video as defined by IEC 60958, and various HDR protocols, such as those specified in T / UWA005.1-2022, including HDR Vivid. Furthermore, the Unified Multimedia Interconnection Network supports encryption control and protection for audio and video data transmission.
[0079] Server 130 is an application server or authentication / authorization server. Server 130 provides video services, game services, messaging services, music services, authentication / authorization services, etc. In one example, the functions of multiple services are integrated on server 130; for example, game services and music services are deployed on server 130. In another example, the functions of some services are integrated on server 130; for example, parts of the game service and parts of the video service are deployed on server 130. Server 130 also utilizes virtualization technology to provide multiple virtual machines, which provide various services. This application embodiment does not limit the deployment form of the server. Network device 131 is connected to server 130 wirelessly or via a wired connection. The schematic diagram in Figure 1 is only an example; other devices are also included in this network, but are not shown in Figure 1.
[0080] Figure 1 is only a schematic diagram. The audio-visual transmission system also includes other devices, which are not shown in Figure 1. The embodiments of this application do not limit the number and type of the various devices included in the system.
[0081] Based on the audio-visual transmission system shown in Figure 1, Figure 2 is a schematic diagram of an audio-visual encoding and decoding system provided in this application. The audio-visual encoding and decoding system includes a source device 210 and a destination device 220. The source device 210 establishes a communication connection with the destination device 220 through a unified multimedia interconnection network.
[0082] The aforementioned source device 210 performs audio and video encoding functions, as shown in Figure 1. The source device 210 is the set-top box 110 or smart TV 120 shown in Figure 1. The source device 210 is also an audio and video control center with audio and video encoding capabilities. For example, the audio and video control center includes one or more servers.
[0083] The source device 210 includes a data source 211, a preprocessing module 212, an audio / video transmission adapter 213, and a communication interface 214.
[0084] Data source 211 includes or may be any type of electronic device for acquiring audio and video, and / or any type of source audio and video generation device, such as a computer graphics processor for generating computer animation scenes or any type of device for acquiring and / or providing source audio and video, or computer-generated source audio and video. Data source 211 may be any type of memory or storage device for storing the aforementioned source audio and video. The aforementioned source audio and video includes multiple audio and video streams or images acquired by multiple audio and video acquisition devices (such as cameras), such as Ultra High Definition (UHD) video, High Definition (HD) video, 4K video, 8K video, etc.
[0085] The preprocessing module 212 is used to receive source audio and video and preprocess the source audio and video to obtain audio and video or multi-frame images. For example, the preprocessing performed by the preprocessing module 212 may include color format conversion (e.g., from RGB to YCbCr), octree structuring, audio and video splicing, audio track merging and deletion, or channel number adjustment, etc.
[0086] The audio / video transmitter adapter 213 is used to receive audio / video or images and encode them to obtain encoded data. In some optional cases, the encoded bitstream (encoded data) may also be referred to as a bitstream. If the encoded data is obtained by encoding audio / video data, then the bitstream refers to the audio / video stream; if the encoded data is obtained by encoding video data, then the bitstream refers to the video stream.
[0087] The communication interface 214 in the source device 210 can be used to: receive encoded data (such as video streams or audio / video streams) and send encoded data (or a version of the encoded data after any other processing) to another device such as the destination device 220 or any other device through a unified multimedia interconnection network for storage, display, playback or image reconstruction, etc.
[0088] Optionally, the source device 210 includes a bitstream buffer for storing bitstreams corresponding to one or more coding units.
[0089] The aforementioned destination device 220 performs audio and video decoding functions. As shown in Figure 1, when the source device 210 is a set-top box 110, the destination device 220 can be either the smart TV 120 or an audio and video playback device shown in Figure 1. When the source device 210 is a smart TV 120, the destination device 220 can be either an audio and video playback device.
[0090] The receiving device 220 includes an audio / video playback unit 221, a post-processing module 222, an audio / video receiver adapter 223, and a communication interface 224.
[0091] The communication interface 224 in the receiving device 220 is used to receive encoded data (or a version of the encoded data after any other processing) from the source device 210 or from any other source device such as a storage device.
[0092] Communication interfaces 214 and 224 can be used for direct communication links between source device 210 and destination device 220, such as direct wired connections, as shown in the unified multimedia interconnection network in Figure 2. The details of the unified multimedia interconnection network can be found in the description in Figure 1, and will not be repeated here.
[0093] Communication interface 224 corresponds to communication interface 214. For example, it can be used to transmit data and process the data using any type of corresponding transmission decoding or processing and / or decapsulation to obtain audio and video data.
[0094] Both communication interface 224 and communication interface 214 can be configured as a one-way communication interface or a two-way communication interface as indicated by the arrow pointing from source device 210 to destination device 220 in Figure 2. They can be used to send and receive messages to establish connections and transmit other information, such as information related to the communication link, or information related to audio and video data (e.g., descriptive information DIP for audio and video).
[0095] The audio / video receiver adapter 223 is used to receive encoded data and decode the encoded data to obtain decoded data (video or audio / video, etc.).
[0096] The post-processing module 222 is used to post-process the decoded data to obtain post-processed data (such as images to be displayed or audio / video to be played). The post-processing performed by the post-processing module 222 includes, for example, color format conversion (e.g., from YCbCr to RGB), octree reconstruction, audio / video splitting and merging, or any other processing to generate data for output by the audio / video playback unit 221.
[0097] The audio-visual playback unit 221 is used to receive post-processed data for display or playback to users or viewers. The audio-visual playback unit 221 is or includes any type of display for representing the reconstructed image, such as an integrated or external display screen or monitor. For example, the display screen may include a liquid crystal display (LCD), an organic light emitting diode (OLED) display, a plasma display, a projector, a micro-LED display, a liquid crystal on silicon (LCoS), a digital light processor (DLP), or any other type of display screen. The audio-visual playback unit 221 also includes one or more audio-visual playback modules, each of which refers to a speaker, smart speaker, or amplifier, etc.
[0098] As an optional implementation, source device 210 and destination device 220 transmit encoded data through a data forwarding device. For example, the data forwarding device could be a router or switch. It is worth noting that this data forwarding device needs to support a unified multimedia interconnection network.
[0099] Figure 3 is a logical schematic diagram of an audio / video transmission adapter provided in this application. The audio / video transmission adapter is responsible for receiving video data and audio data, as well as descriptive information, such as vertical synchronization (Vsync), horizontal synchronization (Hsync), pixel clock, pixel data, and display enable (DE), and encapsulating them into corresponding packets, such as vertical blanking packet (VBP), horizontal blanking packet (HBP), active video packet (AVP), audio sample packet (ASP), and descriptive information packet (DIP). When content protection is enabled, AVP and ASP need to be sent to the content protection encryption module for encryption, and an encryption description packet (EDP) and a key distribution packet (KDP) are generated. The encrypted AVP is called AVP' and the encrypted ASP is called ASP'. These packets are combined into one audio / video stream by the audio / video stream multiplexer, and then sent to the audio / video receiver adapter of the destination device through the transport layer, logic layer and electrical layer of the unified multimedia interconnection network.
[0100] Figure 4 is a logical diagram of an audio / video receiver adapter provided in this application. The audio / video receiver adapter is responsible for receiving audio / video streams, parsing various messages (VBP, HBP, DIP, AVP (AVP' in encrypted cases), ASP (ASP' in encrypted cases)) from the audio / video streams, and restoring vertical synchronization (Vsync), horizontal synchronization (Hsync), pixel clock, pixel data, display enable (DE), audio clock, and audio data according to the messages.
[0101] In the embodiments of this application, the source device may also be referred to as an audio / video transmitting device, audio / video transmitting end, etc., and the destination device may also be referred to as an audio / video receiving device, audio / video playback device, etc. In this embodiment, the source device and the destination device are connected through a unified multimedia interconnection network.
[0102] In the first possible application scenario, the source device is the set-top box 110 in Figure 1, and the destination device is the smart TV 120 in Figure 1. The set-top box and the smart TV negotiate, meaning the set-top box obtains the smart TV's DCCD, and the set-top box pushes video to the smart TV in a fast video transmission mode.
[0103] In the second possible application scenario, the source device is the set-top box 110 in Figure 1, and the destination device is any of the audio / video playback devices in Figure 1, such as any one of audio / video playback devices 121 to 124. For example, the set-top box pushes audio / video data to the audio / video playback device in a fast video transmission mode.
[0104] In the third possible application scenario, the source device is the smart TV 120 in Figure 1, and the destination device is any of the audio / video playback devices in Figure 1, such as any one of audio / video playback devices 121 to 124. For example, the smart TV pushes audio / video data to the audio / video playback device in a fast video transmission mode.
[0105] The three possible application scenarios described above are merely examples provided in this embodiment and should not be construed as limiting this application. In some other possible examples, the source device is any one of the audio / video playback devices in FIG1 (such as audio / video playback device 121), and the destination device is another audio / video playback device different from the aforementioned audio / video playback devices (such as audio / video playback device 122).
[0106] The process of negotiating DCCD between the source and destination devices is explained below.
[0107] Figure 5 is a schematic flowchart of an advanced video feature capability negotiation method provided in this application. Here, the video transmission method of this application is illustrated using the source device 210 or a chip in the source device 210, and the destination device 220 or a chip in the destination device 220, as shown in Figure 2. Referring to Figure 5, the video transmission method provided in this embodiment includes step 510.
[0108] Step 510: The source device obtains the fast video transmission capability of the destination device through DCCD.
[0109] The destination device provides information indicating its capabilities. The source device's query for the destination device's capabilities includes the source device obtaining the destination device's capability description information. The process of the source device obtaining the destination device's capability information includes: the source device sending a query message to the destination device to query the destination device's capabilities; then, the destination device sending (feedback) its capability description information to the source device; the source device receiving the capability description information sent by the destination device; and the capability description information indicating the destination device's capabilities.
[0110] The destination device's capability description information includes information indicating the destination device's fast video transmission capabilities. For example, fast video transmission capability information includes whether the destination device supports fast video transmission capabilities. In some possible cases, the capability description information is also called a capability descriptor, which includes one or more bits of information.
[0111] In some cases, the capability description information of the destination device is also called the device comprehensive capability description. The area in the destination device that stores the DCCD is called the DCCD area or simply DCCD, which is not limited in this application.
[0112] In this embodiment, the DCCD declares whether it supports fast video transmission capability. For example, the destination device may or may not support fast video transmission capability.
[0113] After the source device queries the destination device's fast video transmission capability through DCCD, the destination device reports its own fast video transmission capability back to the source device.
[0114] In some embodiments, the DCCD includes an advanced video feature capability field, which primarily describes the video-related capabilities supported by the display. This advanced video feature capability field includes flags indicating whether fast video transmission is supported.
[0115] For example, the data structure of the advanced video feature capability field is shown in Table 1.
[0116] Table 1
[0117] As shown in Table 1, the Advanced Video Features Capability field includes flags indicating whether fast video transmission is supported. In some embodiments, different bit values indicate whether fast video transmission is supported.
[0118] For example, a bit value of 1 indicates support for fast video transmission, while a bit value of 0 indicates that fast video transmission is not supported. Similarly, a bit value of 0 indicates support for fast video transmission, while a bit value of 1 indicates that fast video transmission is not supported.
[0119] Optionally, the DCCD also stores content describing the audio and video processing capabilities of the receiving device, such as audio processing capabilities (whether the receiving device supports processing pure audio data packets), HDR display capabilities (whether the receiving device supports displaying specific types of HDR video, such as HDR), or others.
[0120] In one alternative example, the receiving device includes a memory storing the aforementioned capability description information. This memory includes, but is not limited to: random access memory (RAM), flash memory, read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), registers, hard disks, portable hard disks, CD-ROMs, or any other form of storage medium known in the art.
[0121] In another alternative example, the destination device includes a dedicated register whose state indicates the capability description information described above. If the register indicates a value of 1, the capability description information indicates that the destination device has fast video transmission capability; if the register indicates a value of 0, the capability description information indicates that the destination device does not have fast video transmission capability.
[0122] The process of transmitting video between the source and destination devices is explained below.
[0123] The source device receives the DCCD from the destination device, parses the DCCD, and obtains the flag information indicating whether fast video transmission is supported. The source device then transmits video based on this flag information. If the fast video transmission support flag indicates that the destination device supports fast video transmission, the source device transmits vertical blanking messages and video data to the destination device in fast video transmission mode.
[0124] Figure 6 is a schematic flowchart of a video transmission method provided in this application. Here, the video transmission method of this application is illustrated using an example where the source device 210 or a chip in the source device 210, and the destination device 220 or a chip in the destination device 220 are executed. Referring to Figure 6, the video transmission method provided in this embodiment includes steps 610 to 620.
[0125] Step 610: If the destination device supports high-speed video transmission, the source device transmits the first VBP and the first video data to the destination device. Correspondingly, the destination device receives the first VBP and the first video data from the source device.
[0126] This application does not limit the source from which the source device acquires the video. The source device acquires the video from a storage device, or receives video sent by other devices.
[0127] The video data sent from the source device to the destination device is, for example, a frame of video data to be sent by the source device. The first video data corresponds to the valid data (or, valid pixels, or, valid pixel data) of a frame of image in the video.
[0128] In some embodiments, the source device transmits VBP and video data based on a flag indicating whether the destination device supports fast video transmission. If the flag indicates that the destination device has fast video transmission capability, the source device selects whether to instruct the destination device to enable fast video transmission mode based on actual needs. For example, it selects whether to enable fast video transmission mode based on the type of video content. Different types of video content include, but are not limited to, graphics, images, videos, or games. Alternatively, the source device may also select whether to enable fast video transmission mode for all types of video content.
[0129] When the destination device supports fast video transmission capability, and the source device transmits VBP and video data to the destination device in fast video transmission mode, the VBP includes a fast video transmission field, a pixel clock in fast video transmission mode, the number of lines in the front shoulder of the vertical blanking area in fast video transmission mode, and the number of lines in the vertical blanking area in fast video transmission mode. The fast video transmission field takes the first value, which indicates that fast video transmission mode is enabled for the first video data.
[0130] The following describes how the source device transmits VBP and video data to the destination device in the fast video transmission mode.
[0131] The source device receives the DCCD feedback from the destination device, obtaining the maximum video bandwidth and maximum pixel clock supported by the destination device. In other words, the DCCD also includes the maximum video bandwidth and maximum pixel clock supported by the destination device. The source device then uses a bandwidth lookup to obtain the maximum available bandwidth supported by the path between the source and destination devices.
[0132] The source device calculates the pixel clock, the number of lines in the front shoulder of the vertical blanking zone in the fast video transmission mode, and the number of lines in the vertical blanking zone in the fast video transmission mode based on the maximum video bandwidth supported by the destination device, the maximum pixel clock supported by the destination device, and the maximum available bandwidth supported by the path.
[0133] Assuming the destination device supports high-speed video transmission, the maximum available video bandwidth is calculated based on the maximum video bandwidth supported by the destination device, the maximum available bandwidth supported by the path, and the link bandwidth utilization. The maximum effective bandwidth supported by the path is obtained by multiplying the maximum available bandwidth supported by the path by the link bandwidth utilization. The minimum value between the maximum video bandwidth supported by the destination device and the maximum effective bandwidth supported by the path is determined as the maximum available video bandwidth.
[0134] The maximum available video bandwidth satisfies the following formula (1). BMAX video =min(BMAX) sink_video BMAX channel_avail ×α) Formula (1)
[0135] Among them, BMAX video BMAX is the maximum available video bandwidth. sink_video BMAX is the maximum video bandwidth supported by the host device. channel_avail BMAX represents the maximum available bandwidth supported by the path, where α is the link bandwidth utilization. channel_avail ×α represents the maximum effective bandwidth supported by the path. min represents the minimum value.
[0136] Formula (1) represents taking BMAX sink_video and BMAX channel_avail The minimum value of ×α. If BMAX sink_video Less than BMAX channel_avail ×α, then the maximum available video bandwidth BMAX video For BMAX sink_video If BMAX channel_avail ×α is less than BMAX sink_video Then the maximum available video bandwidth BMAX video For BMAX channel_avail ×α.
[0137] Based on the maximum available video bandwidth BMAX video The pixel clock corresponding to the maximum available video bandwidth is calculated.
[0138] The pixel clock corresponding to the maximum available video bandwidth satisfies the following formula (2).
[0139] Where fchannel_pixel_max is the pixel clock corresponding to the maximum available video bandwidth, and BMAX video bpp is the equivalent number of bits per pixel for the maximum available video bandwidth. For example, YUV444, 10-bit bpp = 30, YUV422, 10-bit bpp = 20, YUV420, 10-bit bpp = 15.
[0140] The minimum value between the pixel clock corresponding to the maximum available video bandwidth and the maximum pixel clock supported by the destination device is determined as the maximum available pixel clock in fast video transmission mode. The maximum available pixel clock in fast video transmission mode is calculated based on the pixel clock corresponding to the maximum available video bandwidth and the maximum pixel clock supported by the destination device.
[0141] The maximum available pixel clock in fast video transmission mode satisfies the following formula (3). pixel_max =min(fchannel_pixel_max,fsink_pixel_max) Formula (3)
[0142] Among them, f pixel_max fchannel_pixel_max is the maximum available pixel clock in fast video transmission mode, fsink_pixel_max is the pixel clock corresponding to the maximum available video bandwidth, and fsink_pixel_max is the maximum pixel clock supported by the sink device. min is the minimum value.
[0143] Formula (3) represents taking the minimum value of fchannel_pixel_max and fsink_pixel_max. If fchannel_pixel_max is less than fsink_pixel_max, then the maximum available pixel clock f in fast video transmission mode is... pixel_max For fchannel_pixel_max. If fsink_pixel_max is less than fchannel_pixel_max, then the maximum available pixel clock f in fast video transmission mode is f. pixel_max It is fsink_pixel_max.
[0144] The maximum total number of vertical lines supported in the original format is determined based on the maximum available pixel clock in the fast video transmission mode, the horizontal parameters of the original format, and the refresh rate of the original format.
[0145] The maximum total number of vertical rows that can be supported under the original format satisfies the following formula (4).
[0146] Among them, Vtotal_max is the maximum total number of vertical lines supported in the original format, Floor is the floor function, Htotal is the horizontal parameter of the original format, and Refresh_Rate is the refresh rate of the original format.
[0147] Select the total number of vertical lines in the fast video transmission mode between the total number of vertical lines in the original format and the maximum total number of vertical lines supported in the original format.
[0148] The total number of vertical lines in the fast video transmission mode satisfies the following formula (5). Vtotal < Vtotal_qvt ≤ Vtotal_max Formula (5)
[0149] Among them, Vtotal_qvt is the total number of vertical lines in the fast video transmission mode, Vtotal is the total number of vertical lines in the original format, and Vtotal_max is the maximum total number of vertical lines supported in the original format.
[0150] Formula (5) means selecting the total number of vertical lines Vtotal_qvt in the fast video transmission mode from the total number of vertical lines Vtotal in the original format and the maximum total number of vertical lines Vtotal_max supported in the original format. The total number of vertical lines in the fast video transmission mode is greater than the total number of vertical lines in the original format, and the total number of vertical lines in the fast video transmission mode is less than or equal to the maximum total number of vertical lines supported in the original format.
[0151] Determine the pixel clock in the fast video transmission mode according to the total number of vertical lines in the fast video transmission mode, the horizontal parameter of the original format, and the refresh rate of the original format.
[0152] The pixel clock in the fast video transmission mode satisfies the following formula (6). f pixel_qvt = Htotal × Vtotal_qvt × Refresh_Rate Formula (6)
[0153] Among them, f pixel_qvt is the pixel clock in the fast video transmission mode, Htotal is the horizontal parameter of the original format, Vtotal_qvt is the total number of vertical lines in the fast video transmission mode, and Refresh_Rate is the refresh rate of the original format.
[0154] Determine the number of leading lines in the vertical blanking interval in the fast video transmission mode according to the total number of vertical lines in the fast video transmission mode, the total number of vertical lines in the original format, and the number of leading lines in the vertical blanking interval in the original format.
[0155] In fast video transmission mode, the number of lines in the front shoulder of the vertical blanking area satisfies the following formula (7): Vfront_qvt=Vtotal_qvt-Vtotal+Vfront Formula (7)
[0156] Where Vfront_qvt is the number of lines in the front shoulder of the vertical blanking area in fast video transmission mode, Vtotal is the total number of lines in the vertical format in the original format, Vtotal_qvt is the total number of lines in the vertical format in fast video transmission mode, and Vfront is the number of lines in the front shoulder of the vertical blanking area in the original format.
[0157] The number of vertical blanking lines in fast video transmission mode is determined based on the total number of vertical lines and the number of valid lines in a frame.
[0158] In fast video transmission mode, the number of vertical blanking lines satisfies the following formula (8): Vblank_qvt=Vtotal_qvt-Vactive Formula (8)
[0159] Where Vblank_qvt is the number of vertical blanking lines in fast video transmission mode, Vtotal_qvt is the total number of vertical lines in fast video transmission mode, and Vactive is the number of valid lines in a frame.
[0160] Vertical blanking messages are used to transmit the VBS (Vblank Start) signal, which is the vertical blanking start signal. A vertical blanking message contains a 4-byte header (or tunnel header) and a 28-byte payload (or payload), totaling 32 bytes.
[0161] For example, referring to the VBP structure diagram shown in Figure 7, the tunnel header occupies 4 bytes, the message payload occupies 28 bytes, and a single VBP occupies a total of 32 bytes. The tunnel header and message payload of the VBP are illustrated below with reference to Tables 2 and 3.
[0162] Table 2 VBP Tunnel Headers
[0163] The VBP's message payload includes video frame control (VFC) information, which describes video-related information for the current frame.
[0164] Table 3 VBP message load
[0165] Referring to Figure 7 and Table 2, the message payload of the VBP includes the Fast Video Transmission field, the Pixel Clock field, the Vertical Blanking Front Shoulder Line Digital Segment, and the Vertical Blanking Line Digital Segment.
[0166] In Table 2, QVT stands for Fast Video Transmission. The QVT field occupies 1 bit. A value of 1 indicates that Fast Video Transmission mode is enabled, while a value of 0 indicates that it is not enabled. For example, if the source device determines that Fast Video Transmission mode needs to be enabled, it sets 1 bit of the QVT field in VBP to 1 to instruct the destination device to enable QVT mode. Conversely, if the source device determines that Fast Video Transmission mode does not need to be enabled, it sets 1 bit of the QVT field in VBP to 0 to instruct the destination device not to enable QVT mode.
[0167] In fast video transmission mode, 1 bit of the QVT field is set to 1, the pixel clock field is set to the pixel clock value in fast video transmission mode, the vertical blanking area front shoulder row number field is set to the number of vertical blanking area front shoulder rows in fast video transmission mode, and the vertical blanking area row number field is set to the number of vertical blanking area rows in fast video transmission mode.
[0168] It should be noted that after enabling the fast video transmission mode, although the duration of the blanking zone and the effective video zone of the video frame changes, the timing parameters in the frame-level control information (i.e., the VBP message payload) of the vertical blanking message VBP should remain unchanged except for the fast video transmission field, the pixel clock field, the vertical blanking zone front shoulder line digital segment, and the vertical blanking zone line digital segment.
[0169] The format of video frames in a video stream will be described exemplarily below with reference to Figure 8, which is a schematic diagram of the structure of a video frame provided in this application. In Figure 8, a video frame includes multimedia messages and timing signals.
[0170] Multimedia messages include one or both of the following: Valid Video Messages (AVP) and Audio Sample Messages (ASP). AVP messages are used to transmit video data. ASP messages are used to transmit audio data. As shown in Figure 8, multiple AVP messages are arranged in rows within a video frame; multiple AVP messages belonging to the same row constitute a video line.
[0171] It is worth noting that the structure of the video frame in Figure 8 is only an example provided by this application and should not be construed as a limitation of this application. In some possible implementations, a single video frame may also include only one line of AVP message, which is not limited by this application.
[0172] Optionally, when the link bandwidth is greater than the video bandwidth, an ASP is inserted between the two AVPs, but the ASP has a lower priority than the AVP.
[0173] Timing signals include: horizontal synchronization signal (Hsync), vertical synchronization signal (Vsync), and display enable (DE) signal. These timing signals describe the video timing, that is, when which message or signal should be transmitted. Hsync indicates the start of scanning a line of pixels (e.g., the signal received by the line register in the receiving device is 1), Vsync indicates the start of scanning a video frame, and display enable (DE) indicates whether valid data can be received (e.g., DE=1 indicates that valid data can be received, DE=0 indicates that valid data cannot be received).
[0174] Referring to Figure 8, a simplified explanation of the video timing of a video frame is provided: A video frame includes a vertical blanking region, a horizontal blanking region, and a valid video region. In the blanking region, the video frame cannot transmit AVP packets, but can transmit other packets, such as ASP and DIP packets. In the valid video region, the video frame can transmit AVP packets carrying audio and video data. The start of Hactive is identified by a horizontal blanking packet (HBP), which indicates the end of the line blanking region. For example, the audio / video transmission adapter in the source device sends an HBP for each video line, immediately upon the end of the Hblank.
[0175] In the video frame shown in Figure 8, the line blanking region of the video frame includes a VBP, which indicates the video line where the rising edge (in terms of positive polarity) of a video frame's Vsync occurs. For example, the audio / video transmission adapter in the source device transmits one or more VBPs per frame (three consecutive VBPs in Figure 8), and ASPs and DIPs cannot be inserted between VBPs. For instance, if Vsync is positive polarity, the audio / video transmission adapter replaces the HBP with a VBP on the video line where the rising edge of Vsync occurs, and then transmits two more VBPs immediately afterward; if Vsync is negative polarity, the audio / video transmission adapter replaces the HBP with a VBP on the video line where the falling edge of Vsync occurs, and then transmits two more VBPs immediately afterward.
[0176] Referring to the structural diagram of the valid video packet AVP shown in Figure 9, the AVP consists of two parts: a header and a payload (totaling 512 bytes). The header is 32 bits long, and the payload is used to transmit valid video pixel data (or video data, or valid video data, or valid pixels, or valid pixel data, or valid video pixel data), and its length does not exceed 508 bytes.
[0177] The structure of the header of a valid video packet (AVP) is shown in Table 4 below.
[0178] Table 4 AVP message headers
[0179] As shown in Figure 9, each AVP is encapsulated in 32-bit rows, meaning that each row in the payload of an AVP message contains 32 bits of data. If 32 bits (4 bytes) are defined as one symbol, then the payload of an AVP includes multiple symbols.
[0180] In one possible implementation, the process of the source device transmitting the first VBP and the first video data to the destination device includes: the first video frame transmitted by the source device to the destination device includes at least one first VBP and multiple first valid video packets (AVPs), the multiple first AVPs being used to carry the first video data.
[0181] In this embodiment, after the source device (the audio / video transmission adapter in the source device) obtains the video data, it encapsulates the video data into AVP packets. It can be understood that for one frame of video data, the video data is encapsulated into multiple AVP packets. Optionally, the source device sends three VBPs to the destination device to ensure the success rate of VBP transmission.
[0182] As can be understood, referring to the block diagram of the audio / video transmission adapter shown in Figure 3 above, after the audio / video transmission adapter in the source device encapsulates the video data into AVP packets, the AVP packets and other packets in the source device are combined into a single video stream by the audio / video stream multiplexer. Other packets include vertical blanking packets (VBP), horizontal blanking packets (HBP), valid video packets (AVP), audio sample packets (ASP), and descriptive information packets (DIP). The video stream consists of multiple video frames, each corresponding to one image in the video.
[0183] Based on the above, in this embodiment, a video frame includes an AVP carrying video data and a VBP carrying frame-level control information. The VBP includes fields for the fast video transmission mode, the pixel clock in the fast video transmission mode, the number of lines in the front shoulder of the vertical blanking zone in the fast video transmission mode, and the number of lines in the vertical blanking zone in the fast video transmission mode. Therefore, frame-level control of video data can be achieved to enable the fast video transmission mode for a specific video frame, offering high flexibility.
[0184] Understandably, as can be seen from the above formula, the total number of vertical lines in the fast video transmission mode is greater than the total number of vertical lines in the original format. Since the number of effective lines in a frame remains unchanged, the number of vertical blanking lines is increased.
[0185] The elongation of the vertical blanking region is determined by the number of vertical blanking lines in the fast video transmission mode and the original standard. For example, as shown in Figure 10, the shaded area represents the elongated portion of the vertical blanking region. Vblank is the number of vertical blanking lines in the original standard. After QVT, the number of vertical blanking lines becomes Vblank', where Vblank' is the number of vertical blanking lines in the fast video transmission mode. The difference between Vblank' and Vblank is the elongated portion of the blanking region.
[0186] In high-speed video transmission mode, it is recommended that no packets be transmitted in the extended portion of the vertical blanking region so that the main link can enter a low-power state. Optionally, when audio data needs to be transmitted, the extended portion of the blanking region transmits audio sampling packets (ASPs). When audio switching occurs, audio control information (such as DIPs) is sent. If there is no audio data transmission, the extended portion of the blanking region does not transmit any packets.
[0187] In fast video transmission mode, the pixel clock is N times that of the original standard pixel clock, where N represents the fast video transmission multiplier, and N>1.
[0188] The scaling factor for fast video transmission satisfies the following formula (9).
[0189] Among them, f pixel For the original pixel clock format, f pixel_qvt For pixel clock in fast video transmission mode.
[0190] Since the pixel clock is proportional to the video bandwidth, in fast video transmission mode, the pixel clock is N times that of the original format. Therefore, the bandwidth occupied by each video frame is N times that of the normal video transmission mode.
[0191] Understandably, a video frame consists of a blanking region (including horizontal and vertical blanking regions) and an effective video region. When fast video transmission mode is enabled, the video bandwidth increases, reducing the duration of the effective video region in a video frame. Since the frame rate of the video data remains constant, the transmission time for each video frame remains unchanged. Therefore, the duration of the blanking region in a video frame will increase.
[0192] For example, as shown in Figure 11, the duration of the effective video area and the blanking area in the video frame changes in two modes: normal video transmission (i.e., QVT mode not enabled) and fast video transmission (i.e., QVT mode enabled).
[0193] In normal video transmission mode, the duration of the effective video area is Tactive, and the duration of the blanking area is Tblank. In fast video transmission mode, the duration of the effective video area Tactive in normal video transmission mode is shortened to T′active; the duration of the blanking area Tblank in normal video transmission mode is lengthened to T'blank.
[0194] The effective video time of a line satisfies the following formula (10).
[0195] Where THactive′ represents the effective video time per line in fast video transmission mode, and THactive represents the effective video time per line in normal video transmission mode. N is the multiplier for fast video transmission.
[0196] The horizontal hidden region time satisfies the following formula (11).
[0197] Where THblank′ is the horizontal blanking time in fast video transmission mode, and THblank is the horizontal blanking time in normal video transmission mode.
[0198] Based on the above, it can be seen that when the main link bandwidth between the source device and the destination device is sufficient, and the fast video transmission mode is enabled, for the source device, on the one hand, the source device quickly transmits video data to the destination device, reducing the transmission latency of video data between the source device and the destination device; on the other hand, after the source device quickly completes the transmission of video data, the source device can choose to shut down certain modules in the source device (such as the compression module), reducing the power consumption of the source device.
[0199] The destination device's audio / video receiver adapter receives the video stream. The destination device parses the first video frame (including the first VBP and the first video data) in the video stream to obtain the value of the QVT field in the VBP, the pixel clock in fast video transmission mode, the number of lines in the front shoulder of the vertical blanking zone in fast video transmission mode, and the number of lines in the vertical blanking zone in fast video transmission mode. If the fast video transmission field included in the first VBP is a first value (e.g., the value of the QVT field is 1), the destination device executes step 620.
[0200] Step 620: The receiving device enables fast video transmission mode for the first video data.
[0201] In some embodiments, the video transmitting adapter in the source device sends a VBP. After receiving the VBP, the video receiving adapter in the destination device first performs a CRC check. If the check fails, it receives the next VBP until a correct VBP is received. That is, the receiving end takes the first accurate VBP received as the standard.
[0202] In this embodiment of the application, when the fast video transmission mode is enabled, the destination device obtains video parameter information based on VBP, and the transmission time of the effective video data of each video frame is reduced. Therefore, the destination device receives the content of each video frame (e.g., the first video data) faster.
[0203] In one implementation, after receiving a video frame, the receiving device can immediately start and complete subsequent processing in advance, thereby reducing display latency. When the receiving device receives valid video data from a video frame, its internal control devices (such as processing modules) need to be quickly woken up to process the video data in a timely manner.
[0204] In another implementation, after the receiving device receives a video frame, if the video enhancement processing module is in a turned-off state, the receiving device will activate one or more video enhancement processing modules to improve the video quality.
[0205] When the Quick Video Transmission (QVT) mode is enabled, the source device increases the transmission bandwidth (i.e., the bandwidth of the virtual path used for video transmission) per frame. After the bandwidth adjustment, the transmission bandwidth of the first video data is the second bandwidth, which is the bandwidth adjusted by the source device from the first bandwidth of the currently transmitted video data, and the second bandwidth is greater than the first bandwidth. With the increased video transmission bandwidth, the transmission time of the effective video data for each video frame (such as the first video data mentioned above) is reduced. In other words, the duration of the effective video area of the first video frame is reduced, thereby quickly sending the first video frame (containing the first VBP and the first video data mentioned above) to the destination device. The destination device will then receive the content of each video frame more quickly.
[0206] In summary, in the video transmission method provided in this application embodiment, when the source device finds that the destination device has fast video transmission capability, the source device carries a field in the vertical blanking message that instructs the destination device to enable fast video transmission mode, and sends the vertical blanking message and video data to the destination device. Thus, the destination device enables fast video transmission mode according to the instruction of the fast video transmission field. This method can dynamically enable fast video transmission mode for video data and reduce the transmission latency of video data.
[0207] Understandably, in fast video transmission mode, the timing information of the video format only increases the number of lines in the front shoulder of the vertical blanking zone, while the rest remain unchanged. From vertical synchronization (vsync) to the last valid pixel, the time is shorter at the same magnification.
[0208] Optionally, compared to HDMI, the multiplier provided in this application for fast video transmission is a non-integer multiple. In calculating the pixel clock and the multiplier for fast video transmission in fast video transmission mode, QVT granularity is at the row level, using fast video transmission for each line of video data. That is, fast video transmission is used for non-integer multiples of video frames, while HDMI can only be at the frame level, and the multiplier can only be an integer multiple.
[0209] Compared to HDMI, the video transmission method provided in this application does not require the receiving device to recover the pixel clock according to the multiplier and original format of the fast video transmission. The VBP in the video transmission method provided in this application carries the pixel clock in the fast video transmission mode.
[0210] When the host device supports fast video transmission capability and needs to enable fast video transmission mode, by including a field in the VBP indicating whether to enable fast video transmission mode, it is possible to control the enabling of fast video transmission mode more conveniently and dynamically without manual activation.
[0211] In one possible implementation, the video transmission method provided in this application embodiment disables the fast video transmission mode during video data transmission. Referring to Figure 6, as shown in Figure 12, after step 620, the video transmission method provided in this application embodiment further includes steps 630 and 640.
[0212] Step 630: The source device transmits a second VBP and second video data to the destination device. The second VBP includes a second value for the Fast Video Transmission field, which indicates that Fast Video Transmission mode is disabled for the second video data. Correspondingly, the destination device receives the second VBP and second video data sent by the source device.
[0213] In one possible implementation, the process of the source device transmitting the second VBP and the second video data to the destination device includes the source device transmitting a second video frame to the destination device. The second video frame includes at least one second VBP and multiple second valid video packets (AVPs), with the multiple second AVPs used to carry the second video data.
[0214] Optionally, when the source device determines that the fast video transmission mode needs to be turned off, the source device adjusts the current video bandwidth (i.e., the increased bandwidth when the fast video transmission mode is turned on), that is, reduces the video bandwidth.
[0215] In some examples, the video bandwidth is reduced to the bandwidth before enabling the fast video transmission mode, or the video bandwidth can be reduced to other values, depending on the actual needs. This application does not limit the specific values.
[0216] After the audio / video receiving adapter of the destination device receives the video stream, for the second video frame in the video stream (including the second VBP and the second video data), if the Fast Video Transmission field included in the second VBP is the second value (i.e., the value of the QVT field is 0), the destination device executes step 640.
[0217] Step 640: The destination device disables the fast video transmission mode for the second video data.
[0218] Based on the description of the process of enabling fast video transmission mode on the receiving device in the above embodiments, and referring to the setting of the video processing mode of the receiving device in step 620 above, the receiving device will perform the opposite processing. For example, in one possible implementation, when the receiving device disables fast video transmission mode for the second video data, the receiving device reduces the bandwidth between the receiving device's interface and the receiving device's processing module. In another possible implementation, if the receiving device enables one or more video quality enhancement processing modules when it enables fast video transmission mode, then when the receiving device disables fast video transmission mode for the second video data, the receiving device will set the one or more video quality enhancement processing modules to a disabled state.
[0219] Based on the above, it can be seen that the source device sends a VBP in each video frame. The VBP includes a field for the fast video transmission mode. In this way, frame-level control of video data can be achieved to dynamically enable or disable the fast video transmission mode for each video frame.
[0220] With sufficient main link bandwidth, in fast video transmission mode, increasing video bandwidth allows the source device to quickly transmit video frames. The transmission time of effective video data for each frame is reduced, allowing the source device to shut down video processing modules (such as video compression modules) for a longer period, thus reducing system power consumption. The destination device receives each video frame faster, enabling it to start and complete subsequent processing immediately, reducing display latency. Since the video frame rate remains constant, the destination device has more time to process each video frame.
[0221] It is understood that, in order to achieve the functions in the above embodiments, the source device and the destination device include hardware structures and / or software modules corresponding to perform each function. Those skilled in the art should readily recognize that, based on the units and method steps of the various examples described in conjunction with the embodiments disclosed in this application, this application can be implemented in hardware or a combination of hardware and computer software. Whether a function is executed in hardware or by computer software driving hardware depends on the specific application scenario and design constraints of the technical solution.
[0222] The video transmission method provided according to this embodiment has been described in detail above with reference to Figures 1 to 12. The video transmission device provided according to this embodiment will be described below with reference to Figure 13.
[0223] Figure 13 is a schematic diagram of a video transmission device provided in an embodiment of this application. This video transmission device can be used to implement the function of any one of the devices in the above method embodiments, and therefore can also achieve the beneficial effects of the above method embodiments. In this embodiment, the video transmission device can be a set-top box 110, a smart TV 120, or any display device as shown in Figure 1; it can also be a source device 210 or a destination device 220 as shown in Figure 2; or it can be a source device or a destination device provided in subsequent embodiments. It should be understood that the video transmission device can also be a module (such as a chip) applied to any of the aforementioned devices.
[0224] As shown in Figure 13, the video transmission device includes a transceiver module 1310 and a processing module 1320. The transceiver module 1310 and the processing module 1320 can work together to implement the various steps in the above method embodiments. A more detailed description of the transceiver module 1310 and the processing module 1320 can be obtained directly from the relevant description of the device in the method embodiments shown in the foregoing figures, and will not be repeated here.
[0225] For example, transceiver module 1310 is used to execute steps 510, 610, and 630. Transceiver module 1310 is used to acquire the DCCD of the destination device, transmit VBP, and video data. Processing module 1320 is used to parse the DCCD and obtain the fast video transmission capability of the destination device through the DCCD. For example, the DCCD contains an advanced video feature capability field, which contains flag information indicating whether fast video transmission is supported.
[0226] Optionally, the video transmission device may also include a storage module 1330 for storing DCCD and video, etc.
[0227] When a video transmission device implements any of the video transmission methods shown in the foregoing figures through software, the video transmission device and its various units can also be software modules. The video transmission method described above is implemented by a processor calling this software module. This processor can be a central processing unit (CPU), an application-specific integrated circuit (ASIC), or a programmable logic device (PLD). The PLD can be a complex programmable logical device (CPLD), a field-programmable gate array (FPGA), a generic array logic (GAL), or any combination thereof.
[0228] It is understood that the video transmission device shown in Figure 13 is only an example provided in this embodiment. Depending on the different video transmission processes, the video transmission device may include more or fewer units, and this application does not limit it in this regard.
[0229] When the video transmission device is implemented in hardware, the hardware can be implemented using a processor or a chip system. The chip system includes one or more chips, each chip including interface circuitry and control circuitry. The interface circuitry is used to receive data from other devices outside the chip and transmit it to the control circuitry, or to send data from the control circuitry to other devices outside the chip. The control circuitry and interface circuitry are used through logic circuitry or executable code instructions to implement the method of any of the possible implementations in the above embodiments. The beneficial effects can be found in the description of any aspect of the above embodiments, and will not be repeated here.
[0230] It is understood that the processor in the embodiments of this application can be a CPU, or other general-purpose processors, digital signal processors (DSPs), ASICs, FPGAs, or other programmable logic devices, transistor logic devices, hardware components, or any combination thereof. A general-purpose processor can be a microprocessor or any conventional processor.
[0231] The video transmission device shown in Figure 13 can also be implemented by a video transmission equipment. Figure 14 is a structural schematic diagram of a video transmission equipment provided in this application. The video transmission equipment includes: a memory 1401 and at least one processor 1402. The processor 1402 can implement the video transmission method provided in the above embodiments. The memory 1401 is used to store the software instructions corresponding to the above video transmission method.
[0232] As an optional implementation, in hardware implementation, the video transmission device can refer to a chip or chip system that encapsulates one or more processors 1402. For example, when the video transmission device is used to implement the method steps in the above embodiments, the processor 1402 included in the video transmission device executes the steps of the source device and its possible sub-steps in the above method. In an optional case, the video transmission device may also include a communication interface 1403, which can be used to send and receive data. For example, the communication interface 1403 is used to receive DCCD, audio / video data, or send audio / video streams, etc.; the communication interface 1403 can be implemented through interface circuitry included in the video transmission device. Therefore, in some examples, the communication interface 1403 can also be referred to as the transceiver of the video transmission device. In this embodiment, the communication interface 1403 supports the use of a unified multimedia interconnection network.
[0233] In the embodiments of this application, the communication interface 1403, the processor 1402, and the memory 1401 can be connected via a bus 1404. The bus 1404 can be divided into an address bus, a data bus, a control bus, etc. The bus 1404 can be a Peripheral Component Interconnect Express (PCIe) bus, or an extended industry standard architecture (EISA) bus, a unified bus (Ubus or UB), a compute express link (CXL), a cache coherent interconnect for accelerators (CCIX), or other types of buses, etc.
[0234] It is worth noting that the video transmission device can also perform the functions of the video transmission device shown in Figure 13, which will not be elaborated here.
[0235] The video transmission device provided in this embodiment can be the set-top box 110, smart TV 120, source device 210, etc., or other devices with video processing functions. This application does not limit this. For example, when the aforementioned display device also has video processing functions, the video transmission device can refer to any of the aforementioned display devices.
[0236] Alternatively, the video transmission device shown in Figure 13 can also be implemented through a display device. When the video transmission device is implemented through a display device, this embodiment provides a possible example, as shown in Figure 15. Figure 15 is a schematic diagram of the structure of a display device provided in this application. The display device includes: a processor 1510, an external memory interface 1520, an internal memory 1521, a universal serial bus (USB) interface 1530, a unified multimedia interconnect interface 1531, antenna 1, antenna 2, a mobile communication module 1550, a wireless communication module 1560, an audio module 1570, a speaker 1570A, a receiver 1570B, a microphone 1570C, a sensor module 1580, buttons 1590, an indicator 1592, a camera 1593, a display screen 1594, and a subscriber identification module (SIM) card interface 1-N 1595, etc.
[0237] The aforementioned sensor module 1580 may include sensors such as pressure sensors, gyroscope sensors, barometric pressure sensors, magnetic sensors, accelerometers, distance sensors, proximity sensors, fingerprint sensors, temperature sensors, touch sensors, ambient light sensors, and bone conduction sensors.
[0238] It is understood that the structure illustrated in this embodiment does not constitute a specific limitation on the display device. In other embodiments, the display device may include more or fewer components than illustrated, or combine some components, or split some components, or have different component arrangements. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
[0239] Processor 1510 may include one or more processing units, such as: application processor (AP), modem processor, graphics processing unit (GPU), image signal processor (ISP), controller, memory, video codec, digital signal processor (DSP), baseband processor, and / or neural network processing unit (NPU), etc. The different processing units may be independent devices or integrated into one or more processors.
[0240] The controller can serve as the nerve center and command center of the display device. Based on the instruction opcode and timing signals, the controller generates operation control signals to control the fetching and execution of instructions.
[0241] The processor 1510 may also include a memory for storing instructions and data. In some embodiments, the memory in the processor 1510 is a cache memory. This memory can store instructions or data that the processor 1510 has just used or that are used repeatedly. If the processor 1510 needs to use the instruction or data again, it can retrieve it directly from the memory. This avoids repeated accesses, reduces the waiting time of the processor 1510, and thus improves the efficiency of the system.
[0242] In some embodiments, the processor 1510 may include one or more interfaces. Interfaces may include an inter-integrated circuit (I2C) interface, an inter-integrated circuit sound (I2S) interface, a pulse code modulation (PCM) interface, a universal asynchronous receiver / transmitter (UART) interface, a mobile industry processor interface (MIPI), a general-purpose input / output (GPIO) interface, a subscriber identity module (SIM) interface, and / or a USB interface, a unified multimedia interconnect interface, etc.
[0243] It is understood that the interface connection relationships between the modules illustrated in this embodiment are merely illustrative and do not constitute a structural limitation on the display device. In other embodiments, the display device may also employ different interface connection methods or a combination of multiple interface connection methods as described in the above embodiments.
[0244] The wireless communication function of the display device can be implemented through antenna 1, antenna 2, mobile communication module 1550, wireless communication module 1560, modem processor, and baseband processor. In some embodiments, antenna 1 and mobile communication module 1550 of the display device are coupled, and antenna 2 and wireless communication module 1560 are coupled, enabling the display device to communicate with networks and other devices through wireless communication technology.
[0245] Wired communication functionality of the display device can be achieved through the USB interface 1530 or the Unified Multimedia Interconnect Interface 1531. For example, the display device can receive or send DCCDs, video streams, etc., through a bus connected via the Unified Multimedia Interconnect Interface 1531.
[0246] The display device implements display functions through a GPU, a display screen 1594, and an application processor. The GPU is a microprocessor for image processing, connecting the display screen 1594 and the application processor. The GPU performs mathematical and geometric calculations for graphics rendering. The processor 1510 may include one or more GPUs, which execute program instructions to generate or modify display information.
[0247] The display screen 1594 is used to display images, videos, etc. The display screen 1594 includes a display panel.
[0248] The display device can implement shooting functions through an ISP, camera 1593, video codec, GPU, display screen 1594, and application processor. The ISP is used to process the data fed back by the camera 1593. The camera 1593 is used to capture still images or videos. In some embodiments, the display device may include one or N cameras 1593, where N is a positive integer greater than 1.
[0249] In this embodiment, the above-mentioned display screen 1594, video codec, GPU, display screen 1594 and application processor can also be collectively referred to as the display unit of the receiving device, which is used to process and display the received video stream.
[0250] The external storage interface 1520 can be used to connect an external storage card, such as a Micro SD card, to expand the storage capacity of the display device. The external storage card communicates with the processor 1510 through the external storage interface 1520 to perform data storage functions. For example, music, video, and other files can be saved on the external storage card.
[0251] Internal memory 1521 can be used to store computer executable program code, which includes instructions. Processor 1510 executes various functional applications and data processing of the display device by running the instructions stored in internal memory 1521. For example, in this embodiment, processor 1510 can execute instructions stored in internal memory 1521, which may include a program storage area and a data storage area.
[0252] The program storage area can store the operating system, at least one application program required for a function (such as sound playback, image playback, etc.). The data storage area can store data created during the use of the display device (such as audio and video data, phonebook, etc.). Furthermore, the internal memory 1521 can include high-speed random access memory, and may also include non-volatile memory, such as at least one disk storage device, flash memory device, universal flash storage (UFS), etc.
[0253] The display device can implement audio functions such as music playback and recording through an audio module 1570, a speaker 1570A, a receiver 1570B, a microphone 1570C, and an application processor.
[0254] Buttons 1590 include a power button, volume buttons, etc. Buttons 1590 can be mechanical buttons or touch-sensitive buttons. Indicator 1592 can be an indicator light, used to indicate charging status, battery level changes, or to indicate messages, missed calls, notifications, etc.
[0255] In the above embodiments, implementation can be achieved, in whole or in part, through software, hardware, firmware, or any combination thereof. When implemented using software programs, implementation can be, in whole or in part, in the form of a computer program product. This computer program product includes one or more computer instructions. When these computer instructions are loaded and executed on a computer, all or part of the processes or functions described in the embodiments of this application are generated. The computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device. The computer instructions can be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another. For example, the computer instructions can be transmitted from one website, computer, server, or data center to another via wired (e.g., coaxial cable, fiber optic, digital subscriber line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.) means. The computer-readable storage medium can be any available medium accessible to a computer or a data storage device such as a server or data center integrating one or more available media. The available media can be magnetic media (e.g., floppy disks, hard disks, magnetic tapes), optical media (e.g., digital video discs (DVDs)), or semiconductor media (e.g., solid-state drives (SSDs)).
[0256] Through the above description of the embodiments, those skilled in the art will clearly understand that, for the sake of convenience and brevity, only the division of the above functional modules is used as an example. In practical applications, the above functions can be assigned to different functional modules as needed, that is, the internal structure of the device can be divided into different functional modules to complete all or part of the functions described above. The specific working process of the system, device, and unit described above can be referred to the corresponding process in the foregoing method embodiments, and will not be repeated here.
[0257] In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods can be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative; for instance, the division of modules or units is only a logical functional division, and in actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. Furthermore, the coupling or direct coupling or communication connection shown or discussed may be through some interfaces, or indirect coupling or communication connection between apparatuses or units, and may be electrical, mechanical, or other forms.
[0258] The units described as separate components may or may not be physically separate. The components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units can be selected to achieve the purpose of this embodiment according to actual needs.
[0259] Furthermore, the functional units in the various embodiments of this application can be integrated into one processing unit, or each unit can exist physically separately, or two or more units can be integrated into one unit. The integrated unit can be implemented in hardware or as a software functional unit.
[0260] If the integrated unit is implemented as a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of this application, in essence, or the part that contributes to the prior art, or all or part of the technical solution, can be embodied in the form of a software product. This computer software product is stored in a storage medium and includes several instructions to cause a computer device (which may be a personal computer, server, or network device, etc.) or processor to execute all or part of the steps of the methods described in the various embodiments of this application. The aforementioned storage medium includes various media capable of storing program code, such as flash memory, portable hard disk, read-only memory, random access memory, magnetic disk, or optical disk.
[0261] The above description is merely a specific embodiment of this application, but the scope of protection of this application is not limited thereto. Any changes or substitutions within the technical scope disclosed in this application should be included within the scope of protection of this application. Therefore, the scope of protection of this application should be determined by the scope of the claims.
Claims
A video transmission method, characterized in that, The method, applied to a source device or a chip in the source device, includes: The DCCD acquires the fast video transmission capability of the host device by describing the overall capabilities of the equipment. If the destination device supports high-speed video transmission capability, a first vertical blanking message (VBP) and first video data are transmitted to the destination device. The first VBP includes a fast video transmission field, a pixel clock in fast video transmission mode, the number of rows in the front shoulder of the vertical blanking area in fast video transmission mode, and the number of rows in the vertical blanking area in fast video transmission mode. The value of the fast video transmission field is a first value, which is used to indicate that fast video transmission mode is enabled for the first video data. The method according to claim 1, characterized in that, The method further includes: The pixel clock for the fast video transmission mode is determined based on the total number of vertical lines in the fast video transmission mode, the horizontal parameters of the original format, and the refresh rate of the original format. The method according to claim 1 or 2, characterized in that, The method further includes: The number of lines in the front shoulder of the vertical blanking area in the fast video transmission mode is determined based on the total number of vertical lines in the fast video transmission mode, the total number of vertical lines in the original format, and the number of lines in the front shoulder of the vertical blanking area in the original format. The method according to any one of claims 1-3 is characterized in that, The method further includes: The number of vertical blanking lines in the fast video transmission mode is determined based on the total number of vertical lines and the number of valid lines in a frame. The method according to any one of claims 2-4, characterized in that, The method further includes: The maximum total number of vertical lines supported in the original format is determined based on the maximum pixel clock available in the fast video transmission mode, the horizontal parameters of the original format, and the refresh rate of the original format. The total number of vertical lines in the fast video transmission mode is selected from the original format's total number of vertical lines to the maximum total number of vertical lines supported under the original format. The method according to claim 5, characterized in that, The method further includes: The minimum value between the maximum video bandwidth supported by the destination device and the maximum effective bandwidth supported by the path is determined as the maximum available video bandwidth. The maximum effective bandwidth supported by the path is the product of the maximum available bandwidth supported by the path and the link bandwidth utilization. The minimum value between the pixel clock corresponding to the maximum available video bandwidth and the maximum pixel clock supported by the destination device is determined as the maximum available pixel clock in the fast video transmission mode. The method according to claim 6, characterized in that, The maximum video bandwidth and the maximum pixel clock supported by the host device are obtained through the DCCD. The method according to any one of claims 1-7 is characterized in that, The elongated portion of the blanking region is determined by the number of vertical blanking region rows in the fast video transmission mode and the number of vertical blanking region rows in the original format. The method according to any one of claims 1-8, characterized in that, When the fast video transmission mode is enabled, the pixel clock in the fast video transmission mode is N times the pixel clock of the original standard, where N represents the multiplier of the fast video transmission mode, and N>1. The method according to any one of claims 1-9 is characterized in that, Transmitting the first VBP and first video data to the destination device includes: A first video frame is transmitted to the destination device. The first video frame includes at least one first VBP and a plurality of first valid video packets (AVPs), the plurality of first AVPs being used to carry the first video data. The method according to claim 10, characterized in that, When the fast video transmission mode is enabled, the bandwidth occupied by the first video frame is N times that of the bandwidth in the normal video transmission mode. The method according to claim 10 or 11 is characterized in that, When the fast video transmission mode is enabled, the transmission time of the effective video data of the first video frame is reduced. The method according to claim 12, characterized in that, When the fast video transmission mode is enabled, the effective video time for one line in the fast video transmission mode is 1 / N times that of the effective video time for one line in the normal video transmission mode. The method according to any one of claims 9-13 is characterized in that, When the fast video transmission mode is enabled, the horizontal blanking time in the fast video transmission mode is 1 / N times that in the normal video transmission mode. The method according to any one of claims 1-14 is characterized in that, The method further includes: The second VBP and second video data are transmitted to the destination device. The second VBP includes a fast video transmission field, and the value of the fast video transmission field is a second value, which is used to indicate that the fast video transmission mode is turned off for the second video data. The method according to claim 15, characterized in that, Transmitting the second VBP and second video data to the destination device includes: A second video frame is transmitted to the destination device. The second video frame includes at least one second VBP and a plurality of second valid video packets (AVPs), the plurality of second AVPs being used to carry the second video data. A video transmission method, characterized in that, The method, which applies to a receiver device or a chip within the receiver device, includes: Receive the first vertical blanking message (VBP) and the first video data sent by the source device; When the fast video transmission field included in the first VBP is a first value, the fast video transmission mode is enabled for the first video data, and the first value is used to indicate that the fast video transmission mode is enabled for the first video data. The method according to claim 17, characterized in that, When the fast video transmission mode is enabled, the pixel clock in the fast video transmission mode is N times the pixel clock of the original standard, where N represents the multiplier of the fast video transmission mode, and N>1. The method according to claim 17 or 18 is characterized in that, Receive the first vertical blanking message (VBP) and first video data sent by the source device, including: The device receives a first video frame sent by the source device. The first video frame includes at least one first VBP and a plurality of first valid video packets (AVPs), the plurality of first AVPs being used to carry the first video data. The method according to claim 19, characterized in that, When the fast video transmission mode is enabled, the transmission bandwidth occupied by the first video frame is N times that of the normal video transmission mode. The method according to claim 19 or 20 is characterized in that, When the fast video transmission mode is enabled, the transmission time of the effective video data of the first video frame is reduced. The method according to any one of claims 17-21 is characterized in that, The method further includes: Receive the second VBP and second video data sent by the source device; When the Fast Video Transfer field included in the second VBP is a second value, the Fast Video Transfer mode is turned off for the second video data, and the second value is used to indicate that the Fast Video Transfer mode is turned off for the second video data. The method according to claim 22, characterized in that, Receive the second vertical blanking message (VBP) and second video data sent by the source device, including: The device receives a second video frame sent by the source device. The second video frame includes at least one second VBP and a plurality of second valid video packets (AVPs), the plurality of second AVPs being used to carry the second video data. A video transmission device, characterized in that, include: A memory, a transceiver, and a processor; the memory, the transceiver, and the processor are configured to cooperate in performing the method of any one of claims 1-16 or the method of any one of claims 17-23. A chip characterized in that, The device includes one or more interface circuits and one or more processors; the interface circuits are configured to receive signals from the memory of an electronic device and send the signals to the processors, the signals including computer instructions stored in the memory; when the processor executes the computer instructions, the processor performs the operational steps of the method according to any one of claims 1-16 or the method according to any one of claims 17-23. A computer-readable storage medium, characterized in that, The device stores computer instructions that, when executed on a computing device, perform the method as claimed in any one of claims 1-16 or any one of claims 17-23.