Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

31results about How to "Reduce the number of encoded bits" patented technology

Band-width spreading method and system for voice or audio signal

The invention discloses a method and system for speech or audio signal bandwidth expansion, which comprises: A. to simulate spectral envelope of the high-frequency signal components in the speech or audio signal. B. to make a synthesis of the said spectrum envelope and the low-frequency signal components corresponding to the high-frequency signal components in the frequency and spatial domain to obtain the reset high-frequency signal components. The invention also discloses the method and system to realize the said bandwidth expansion, the technical scheme offered by which has the advantage of less bit number of coding that can be adaptively adjusted based on the type features of the signals. Besides, by extracting spectrum envelope of the high-frequency signal components, the invention makes the fine structure acted on the low-frequency signal components corresponding frequency and spatial domain to guarantee the correlation between the reset high-frequency signal spectrum and the harmonization of the high-frequency signal spectrum lopped during coding.
Owner:HUAWEI TECH CO LTD +1

Derivable motion compensation prediction method of decoding terminal

The invention discloses a derivable motion compensation prediction method of a decoding terminal. The method comprises the steps of firstly, obtaining motion vectors and coordinates of encoding blocks on the left side, the upper side and the upper right side of an adjacent encoding block, and working out model parameters; secondly, determining the initial value of a candidate motion vector of the current encoding block according to the model parameters obtained by computing; thirdly, modifying the initial value of the candidate motion vector to obtain candidate motion vector set, and selecting an optimum candidate motion vector from the candidate motion vector muster; fourthly, performing compensation prediction according to the optimum candidate motion vector, and computing a rate-distortion cost when the optimum candidate motion vector is adopted to perform the compensation prediction; and fifthly, performing prediction on the current encoding block by adopting a motion compensation prediction method in H.264 / AVC, and selecting a motion compensation prediction method with the lowest rate-distortion cost to perform prediction. The method can save motion vector information, and improves the compression efficiency of a zoom video; and simultaneously, when an encoding terminal is not required to encode the model parameters, the decoding terminal can directly determine the model parameters.
Owner:SHANDONG UNIV

MMVD mode-based inter-frame prediction method, video coding method, related device and equipment

The invention discloses an MMVD mode-based inter-frame prediction method, a video coding method and related devices and equipment. The MMVD mode-based inter-frame prediction method comprises the following steps: constructing a first candidate list of a current coding block in an MMVD mode, the first candidate list comprising a first preset number of first candidate motion vectors; determining a second preset number of motion search amplitudes and a third preset number of motion search directions; for each first candidate motion vector, performing searching by taking the first candidate motionvector as a starting point and respectively utilizing different motion combinations of the motion search amplitude and the motion search direction to obtain motion vector offset of the first candidatemotion vector under different motion combinations; and determining the optimal motion vector of the current coding block based on the motion vector offset of each first candidate motion vector underdifferent motion combinations. According to the method, the accuracy of inter-frame prediction can be improved or the coding amount of the inter-frame prediction result can be reduced with a large probability, and the coding compression ratio is improved.
Owner:ZHEJIANG DAHUA TECH CO LTD

Method for realizing AMR-WB (adaptive multi-rate wideband) voice coding or decoding by adopting information hiding technology

The invention discloses a method for realizing AMR-WB (adaptive multi-rate wideband) voice coding or decoding by adopting an information hiding technology, capable of reducing coding parameter length and saving voice transmission bandwidth. The AMR-WB voice coder comprises a linear prediction (LPC) analysis module, a fundamental tone search module and a fixed codebook search module (comprising an information hiding sub module). The method comprises the following steps of: (1) extracting ISF (information system finishing) parameter coding by virtue of an LPC analysis module according to a coding rate mode, and dividing the parameter coding into hidden information ST to be embedded in a step (2); and (2) searching a second best codebook according with constraint conditions (I) in a local codebook space by adopting a first depth tree search algorithm by virtue of the fixed codebook search module, and embedding the ST into a fixed codebook index to be transmitted, wherein T is a number of a codebook track and is selected from a codebook track 0, a codebook track 1, a codebook track 2 and a codebook track 3; ST is ISF index information to be embedded in a codebook track T; NT is the number of nonzero pulses contained in the codebook track T; PTk is the kth pulse in the codebook track T; and M is an embedded bit control bit, and log2M is maximum bit number of embeddable information in each track. By applying the method, the coding speed is reduced, the resource utilization factor is improved, the method is simple and feasible, and no additional computation complexity is increased.
Owner:SUZHOU INST FOR ADVANCED STUDY USTC

Video coding method and device, electronic equipment and storage medium

The invention relates to a video coding method and device, electronic equipment and a storage medium, and belongs to the technical field of video processing. The method comprises the steps of obtaining a to-be-coded video, wherein the video comprises a plurality of first video frames; carrying out smoothing processing on the regions except the region of interest in the plurality of first video frames to obtain a plurality of second video frames, the smoothing processing being used for reducing a pixel value difference between each pixel point and a neighborhood pixel point in the region; and encoding the plurality of second video frames to obtain an encoded video. According to the invention, the number of coding bits can be saved, and the network bandwidth consumption during transmission is reduced.
Owner:BEIJING DAJIA INTERNET INFORMATION TECH CO LTD

Method and apparatus for intra-frame prediction in video encoding/decoding

This invention discloses an infrafyne prediction method and a device in video code / decode predicting the pel block value according to the values or the estimated values around the pel block to be processed. The said method can get the estimation values of other surrounding pel values by interpolation of their values the pel block to be processed partly or totally then to be combined with the usable values of surrounding pels to get the prediction value of the pel block to be processed so as to reduce its code bit number and increase video code compression efficiency.
Owner:XFUSION DIGITAL TECH CO LTD

Voice communication method and equipment

The invention discloses a voice communication method and equipment. The method comprises the following steps that a voice signal sending end determines a voice frequency band range; the voice signal sending end determinates frequency band limitation data in the audio frequency band range; the voice signal sending end uses the frequency band limitation data for carrying out coding processing on the voice signals, and sends the voice signals subjected to coding processing to the voice signal receiving end. According to the embodiment of the invention, on the basis of the characteristic of high-degree concentration of the user voice frequency range, the voice frequency band range is determined, in addition, the frequency band limitation data in the voice frequency band range is determined, and in addition, the frequency band limitation data is used for carrying out coding processing on the voice signals, so that the coding bits are put onto the most common frequency band of the user, the coding bit number is saved, the coding efficiency is improved, the redundancy in the coding range is avoided, and the network flow rate of the voice communication is reduced.
Owner:CHINA MOBILE COMM GRP CO LTD

Band-width spreading method and system for voice or audio signal

The invention discloses a method and system for speech or audio signal bandwidth expansion, which comprises: A. to simulate spectral envelope of the high-frequency signal components in the speech or audio signal. B. to make a synthesis of the said spectrum envelope and the low-frequency signal components corresponding to the high-frequency signal components in the frequency and spatial domain to obtain the reset high-frequency signal components. The invention also discloses the method and system to realize the said bandwidth expansion, the technical scheme offered by which has the advantage ofless bit number of coding that can be adaptively adjusted based on the type features of the signals. Besides, by extracting spectrum envelope of the high-frequency signal components, the invention makes the fine structure acted on the low-frequency signal components corresponding frequency and spatial domain to guarantee the correlation between the reset high-frequency signal spectrum and the harmonization of the high-frequency signal spectrum lopped during coding.
Owner:HUAWEI TECH CO LTD +1

Image prediction method, coding tree node division method and devices thereof

The invention provides an image prediction method, a coding tree node division method and devices thereof, and the image prediction method comprises the steps: determining the type of a current codingunit which is a brightness and chrominance coding unit, a brightness coding unit or a chrominance coding unit; determining a prediction mode of the current coding unit according to the type of the current coding unit and / or the prediction mode of an adjacent image block, wherein the image block in the current coding unit and the adjacent image block are spatially adjacent image blocks, and the adjacent image block comprises an adjacent luminance block and / or an adjacent chrominance block; and predicting the image block in the current coding unit according to the prediction mode of the currentcoding unit. According to the method in the embodiment of the invention, the video encoding and decoding efficiency can be improved.
Owner:HUAWEI TECH CO LTD

Encoding and decoding method, device and system for intra-frame prediction mode of chrominance component

An encoding method for an intra-frame prediction mode of a chrominance component provided by the invention comprises the following steps: determining chrominance component intra-frame prediction attribute information of a current image block; for any sub-block in the current image block, selecting a target intra-frame prediction mode for the chrominance component of the sub-block from intra-frameprediction modes matched with the chrominance component intra-frame prediction attribute information, and encoding the target intra-frame prediction mode to obtain encoding information of each sub-block; and according to the encoding information of each sub-block in the current image block, sending an encoding bit stream carrying indication information to a decoding end, the indication informationbeing used for indicating the chrominance component intra-frame prediction attribute information of the current image block. According to the encoding and decoding method, device and system for the intra-frame prediction mode of the chrominance component, the encoding bit number can be reduced, and the decoding throughput can be improved.
Owner:HANGZHOU HIKVISION DIGITAL TECH

Audio arithmetic coding method based on time-frequency plane context

The invention relates to an audio arithmetic coding method based on time frequency plane context and belongs to the field of coding, wherein probability of an audio coding parameter can be self adapted. The audio arithmetic coding method based on the time frequency plane context is characterized in that firstly modified discrete cosine transformation (MDCT) and vector quantization are carried out on sound signals of each frame, and then context distribution and probability modeling are carried out on MDCT factors which are quantified. During the process, frequency domain contexts are firstly used for quantifying the MDCT factors, namely, statistics of first order condition frequency of the MDCT factors to be coded and quantified is carried out, after statistics of the first order condition frequency of each MDCT factor is carried out by the utilization of each frequency domain context, and final first order condition frequency is sent to an arithmetic encoder to be coded. The audio arithmetic coding method based on the time frequency plane context can keep good audio coding quality, meanwhile, reduce average coding rate of audio signals, and then improve coding efficiency of the audio encoder.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Beam measurement reporting method and system in 5g communication

This application discloses a beam measurement reporting method and system in 5G communication, which relates to the field of mobile communication technology. When the corresponding indication bit in the UE-level high-level signaling = 0, the control terminal uses the single-branch differential mode for beam reporting; when When the corresponding indication bit in UE-level high-level signaling = 1, the control terminal uses multi-branch sorting differential mode for beam reporting; when single-branch differential mode is used for beam reporting, it is dynamically configured or semi-statically configured through UE-level high-level signaling Configure at least 1-bit signaling to indicate the first quantization step Δ1 in the single-branch differential mode, where Δ1≥1; when the multi-branch sorting differential mode is used for beam reporting, it is dynamically configured through UE-level high-level signaling or The semi-statically configured signaling of at least 1 bit is used to indicate the number of branches M and the second quantization step size Δ2 in the multi-branch sorting differential mode, where M≥2. Error propagation is reduced while reducing beam reporting overhead.
Owner:BEIJING YUNZHIRUANTONG INFO TECH CO LTD

Crowd image coding method based on quadratic fitting photometric transformation

The present invention proposes a group image coding method based on quadratic fitting photometric transformation, aiming at reducing the number of bits of group image coding, the implementation steps are: obtaining SIFT matching points and SIFT matching distances between two images in the image set to be coded ; Obtain the minimum spanning tree of the image set to be encoded; calculate the perspective transformation matrix H between each pair of reference images and the target encoding image in the minimum spanning tree; perform geometric deformation on the reference image in the minimum spanning tree of the image set to be encoded; After a geometric deformation, the image is subjected to quadratic fitting photometric transformation; the YUV code stream corresponding to the reference image is obtained; and the target coded image is subjected to block-based motion compensation coding. The invention makes full use of redundant information between images, effectively reduces the number of coding bits of group images, improves the coding efficiency of group images, and can be used for personal album compression, cloud image storage, and the like.
Owner:XIDIAN UNIV

Encoding and decoding method, device and system for intra prediction mode of chroma component

The coding method of the intra prediction mode of the chroma component provided by the present application includes: determining the intra prediction attribute information of the chroma component of the current image block; for any sub-block in the current image block, from the Select a target intra prediction mode for the chrominance component of the sub-block in the intra prediction mode matched with the intra prediction attribute information of the degree component, and encode the target intra prediction mode to obtain the encoding information of each sub block ; According to the encoding information of each sub-block in the current image block, send an encoded bit stream carrying indication information to the decoding end, the indication information is used to indicate that the chrominance component in the current image block Predict attribute information. The coding and decoding method, device and system of the intra-frame prediction mode of the chrominance component provided by the present application can reduce the number of coding bits and improve the decoding throughput.
Owner:HANGZHOU HIKVISION DIGITAL TECH

Image filling method, device and equipment

The embodiment of the invention provides an image filling method, a device and equipment. The method includes the steps of: selecting a prediction model for an image block comprising a filling region pixel point; according to the prediction model, determining the prediction value of the filling region pixel point in the image block; and using a decoding restore value of the predication value as afilling value of the filling region pixel point in a corresponding image block of a subsequent image frame. The embodiment uses the decoding restore value as the filling value of the filling region pixel point in the corresponding image block of the subsequent image frame, thus leading the prediction differences of the pixel points in the filling region to be zero and consequently decreasing the coding bit numbers occupied by the filling region. In addition, the embodiment also applies the prediction model for the filling region and fully utilizes the interframe or intra-frame correlation of the filling region, thus improving the interframe or intra-frame predication efficiency.
Owner:MIGU CO LTD

Method and apparatus for intra-frame prediction in video encoding/decoding

This invention discloses an infrafyne prediction method and a device in video code / decode predicting the pel block value according to the values or the estimated values around the pel block to be processed. The said method can get the estimation values of other surrounding pel values by interpolation of their values the pel block to be processed partly or totally then to be combined with the usable values of surrounding pels to get the prediction value of the pel block to be processed so as to reduce its code bit number and increase video code compression efficiency.
Owner:XFUSION DIGITAL TECH CO LTD

Audio arithmetic coding method based on time-frequency plane context

The invention relates to an audio arithmetic coding method based on time frequency plane context and belongs to the field of coding, wherein probability of an audio coding parameter can be self adapted. The audio arithmetic coding method based on the time frequency plane context is characterized in that firstly modified discrete cosine transformation (MDCT) and vector quantization are carried out on sound signals of each frame, and then context distribution and probability modeling are carried out on MDCT factors which are quantified. During the process, frequency domain contexts are firstly used for quantifying the MDCT factors, namely, statistics of first order condition frequency of the MDCT factors to be coded and quantified is carried out, after statistics of the first order condition frequency of each MDCT factor is carried out by the utilization of each frequency domain context, and final first order condition frequency is sent to an arithmetic encoder to be coded. The audio arithmetic coding method based on the time frequency plane context can keep good audio coding quality, meanwhile, reduce average coding rate of audio signals, and then improve coding efficiency of the audio encoder.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Construction method of cloud object library based on deep learning

The invention provides a cloud object library construction method based on deep learning, and aims at solving the technical problem that the constructed cloud object library is large in storage spacein the prior art. The realization step is as follows: selecting multiple images from an ImageNet database, and acquiring a basic cloud object library; initializing the object detection frame; performing object detection on the image with size of X input into the object detection frame by using the object detection frame; performing quality assessment on the n object images P1...Pi...Pn so as to obtain m object images P1...Pj...Pm; extracting an image feature of the object image Pj and screening the object image Pj, and taking the basic cloud object library containing the object image Pj as thecloud object library. The method disclosed by the invention can be applied to similar scenes for the internet cloud-side image compression and the personal photo album image compression.
Owner:XIDIAN UNIV

Group image coding method based on quadratic fitting luminosity transform

The invention provides a group image coding method based on quadratic fitting luminosity transform, aiming at reducing the bit number of group image coding. The group image coding method based on quadratic fitting luminosity transform includes the steps: acquiring SIFT (Scale-invariant feature transform) matching points and SIFT matching distance between every two images in a image-to-be-coded set; acquiring a minimal span tree of the image-to-be-coded set; calculating a perspective transform matrix H between each pair of reference images and a target image to be coded in the minimal span tree; performing geometry deformation on the reference images on the minimal span tree of the image-to-be-coded set; performing quadratic fitting luminosity transform on a plurality of images after geometry deformation; acquiring YUV code streams corresponding to the reference images; and performing block-based motion compensation coding on the target image to be coded. The group image coding method based on quadratic fitting luminosity transform fully utilizes the redundancy information among images, can effectively reduce the group image coding bit number, can improve the group image coding efficiency, and can be used for personal photo album compression, cloud image storage and the like.
Owner:XIDIAN UNIV

Beam measurement reporting method and system in 5G communication

The invention discloses a beam measurement reporting method and system in 5G communication, and relates to the technical field of mobile communication, and the method comprises the steps: controllinga terminal to carry out the beam reporting in a single-branch differential mode when a corresponding indication bit in a UE-level high-level signaling is equal to 0; when the corresponding indicationbit in the UE-level high-level signaling is equal to 1, controlling the terminal to perform beam reporting by adopting a multi-branch sorting differential mode; when a single-branch differential modeis adopted for beam reporting, dynamically configuring at least one bit of signaling or semi-statically configuring at least one bit of signaling through UE-level high-level signaling to indicate a first quantization step length delta 1 in the single-branch differential mode, wherein delta 1 is larger than or equal to 1; when a multi-branch sorting differential mode is adopted for beam reporting,dynamically configuring or semi-statically configuring at least one bit of signaling through UE-level high-level signaling to indicate the branch number M and the second quantization step length delta2 in the multi-branch sorting differential mode, wherein M is greater than or equal to 2. Error propagation is reduced while beam reporting overhead is reduced.
Owner:北京云智软通信息技术有限公司

Cloud object library construction method based on deep learning

The invention provides a cloud object library construction method based on deep learning, and aims at solving the technical problem that the constructed cloud object library is large in storage spacein the prior art. The realization step is as follows: selecting multiple images from an ImageNet database, and acquiring a basic cloud object library; initializing the object detection frame; performing object detection on the image with size of X input into the object detection frame by using the object detection frame; performing quality assessment on the n object images P1...Pi...Pn so as to obtain m object images P1...Pj...Pm; extracting an image feature of the object image Pj and screening the object image Pj, and taking the basic cloud object library containing the object image Pj as thecloud object library. The method disclosed by the invention can be applied to similar scenes for the internet cloud-side image compression and the personal photo album image compression.
Owner:XIDIAN UNIV

Image processing method, method and apparatus for encoding and decoding

An image processing method, encoding / decoding method and device, mainly comprising: obtaining image, camera parameter information and object depth information of at least two views; and carry out updating process on image of at least one video according to the image, camera parameter information and object depth information of at least two videos. Embodiment of this invention can carry out updating process on image of video to be updated according to obtained camera parameter of image and object depth information, in order to obtain expected image during encoding or decoding process, reduce encoding bits of image to be updated, reduce calculation load for encoding and decoding.
Owner:HUAWEI TECH CO LTD +1

An audio code rate control method and system

The embodiment of the present invention discloses an audio code rate control method and system. The method includes: determining a subband for initial coding according to the number of available bits; calculating an initial value of an effective scale factor of the subband according to a masking threshold of the subband; Adjust the effective scale factor so that the quantization noise of the subband frequency domain coefficients is smaller than the masking threshold, quantize the subband according to the effective scale factor, encode the quantized frequency domain coefficients, and determine the encoding Number of bits required; when the number of bits required for encoding is greater than the number of available bits, adjust the number of bits required for encoding by adjusting the effective scale factor of the subband in steps, so that the number of bits required for encoding is less than or equal to the number of available bits number. It can take into account the sound quality on the basis of reducing the complexity of the algorithm.
Owner:GLOBAL INNOVATION AGGREGATORS LLC

A bit rate control method integrating visual perception characteristics

The invention discloses a code rate control method fused with a visual perception characteristic. Code rate distribution of an LCU layer is guided according to influence of brightness and exercise intensity to human visual perception, and furthermore, in order to improve rate-distortion performance, a Lagrange multiplier lambda and a quantization parameter QP are corrected on the basis of the visual perception characteristic. According to the technical scheme of the invention, compared with an international coding standard HEVC (HM15.0), the code rate control method fused with the visual perception characteristic has the advantages that code rate control accuracy is not changed basically, subjective quality of a video is improved, rate-distortion performance can be better, bit resources are saved, contradiction between the quality of a video coding reconstructed image and coding resources is relieved, and coding performance of HEVC is improved.
Owner:BEIJING UNIV OF TECH

Cloud Image Coding Method Based on Object Library

The invention provides a cloud image coding method based on an object library and aims to solve the technical problem that the number of coded bits of a cloud image in the prior art is large. The method comprises the following steps: carrying out object detection on a to-be-coded cloud image; acquiring an object database; searching and matching the cloud object image obtained by carrying out object detection on the to-be-coded cloud image and the object image in the object database to obtain a candidate coded reference object image set (formula); screening the candidate coded reference objectimage set (formula) to obtain a coded reference object image set; finally, using the object image in the coded reference object image set as a coded reference image to carry out HEVC coding on the to-be-coded cloud image. The method can be applied to the fields of internet cloud image compression, personal photo album compression and the like, and compression coding is carried out on the to-be-coded cloud image.
Owner:XIDIAN UNIV

A voice communication method and device

The present invention discloses a voice communication method and equipment. The method comprises: a voice signal sending end determines a voice frequency band range; the voice signal sending end determines frequency band limit data within the voice frequency band range; The frequency band limitation data encodes the voice signal, and sends the encoded voice signal to the voice signal receiving end. In the embodiment of the present invention, based on the feature that the frequency range of the user's voice is highly concentrated, by determining the range of the voice frequency band, and determining the frequency band limit data within the range of the voice frequency band, and using the frequency band limit data to encode the voice signal, thereby effectively encoding The bits are placed in the frequency band most commonly used by users, thereby saving the number of coding bits, improving the coding efficiency, avoiding redundancy in the coding range, and reducing the network traffic of voice communication.
Owner:CHINA MOBILE COMM GRP CO LTD

Derivable motion compensation prediction method of decoding terminal

The invention discloses a derivable motion compensation prediction method of a decoding terminal. The method comprises the steps of firstly, obtaining motion vectors and coordinates of encoding blocks on the left side, the upper side and the upper right side of an adjacent encoding block, and working out model parameters; secondly, determining the initial value of a candidate motion vector of thecurrent encoding block according to the model parameters obtained by computing; thirdly, modifying the initial value of the candidate motion vector to obtain candidate motion vector set, and selecting an optimum candidate motion vector from the candidate motion vector muster; fourthly, performing compensation prediction according to the optimum candidate motion vector, and computing a rate-distortion cost when the optimum candidate motion vector is adopted to perform the compensation prediction; and fifthly, performing prediction on the current encoding block by adopting a motion compensationprediction method in H.264 / AVC, and selecting a motion compensation prediction method with the lowest rate-distortion cost to perform prediction. The method can save motion vector information, and improves the compression efficiency of a zoom video; and simultaneously, when an encoding terminal is not required to encode the model parameters, the decoding terminal can directly determine the model parameters.
Owner:SHANDONG UNIV

Inter prediction method based on mmvd mode, video coding method and related devices and equipment

The present application discloses an MMVD mode-based inter-frame prediction method, a video coding method, and related devices and equipment. Wherein, the inter-frame prediction method based on MMVD mode includes: constructing a first candidate list of the current coding block in MMVD mode, wherein the first candidate list contains a first preset number of first candidate motion vectors; determining a second preset A number of motion search amplitudes and a third preset number of motion search directions; for each first candidate motion vector: starting from the first candidate motion vector, using different motion combinations of motion search amplitudes and motion search directions respectively Searching to obtain motion vector offsets of the first candidate motion vectors under different motion combinations; based on the motion vector offsets of each first candidate motion vector under different motion combinations, determine and obtain the best motion vector of the current coding block. The foregoing method can improve the accuracy of inter-frame prediction or reduce the coding amount of the inter-frame prediction result with a high probability, and improve the coding compression rate.
Owner:ZHEJIANG DAHUA TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products