Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

474 results about "Vector quantization" patented technology

Vector quantization (VQ) is a classical quantization technique from signal processing that allows the modeling of probability density functions by the distribution of prototype vectors. It was originally used for data compression. It works by dividing a large set of points (vectors) into groups having approximately the same number of points closest to them. Each group is represented by its centroid point, as in k-means and some other clustering algorithms.

Object recognizer and detector for two-dimensional images using bayesian network based classifier

A system and method for determining a classifier to discriminate between two classes—object or non-object. The classifier may be used by an object detection program to detect presence of a 3D object in a 2D image (e.g., a photograph or an X-ray image). The overall classifier is constructed of a sequence of classifiers (or “sub-classifiers”), where each such classifier is based on a ratio of two graphical probability models (e.g., Bayesian networks). A discrete-valued variable representation at each node in a Bayesian network by a two-stage process of tree-structured vector quantization is discussed. The overall classifier may be part of an object detector program that is trained to automatically detect many different types of 3D objects (e.g., human faces, airplanes, cars, etc.). Computationally efficient statistical methods to evaluate overall classifiers are disclosed. The Bayesian network-based classifier may also be used to determine if two observations (e.g., two images) belong to the same category. For example, in case of face recognition, the classifier may determine whether two photographs are of the same person. A method to provide lighting correction or adjustment to compensate for differences in various lighting conditions of input images is disclosed as well. As per the rules governing abstracts, the content of this abstract should not be used to construe the claims in this application.
Owner:CARNEGIE MELLON UNIV

Prototype waveform phase modeling for a frequency domain interpolative speech codec system

A system and method is provided that employs a frequency domain interpolative CODEC system for low bit rate coding of speech which comprises a linear prediction (LP) front end adapted to process an input signal that provides LP parameters which are quantized and encoded over predetermined intervals and used to compute a LP residual signal. An open loop pitch estimator adapted to process the LP residual signal, a pitch quantizer, and a pitch interpolator and provide a pitch contour within the predetermined intervals is also provided. Also provided is a signal processor responsive to the LP residual signal and the pitch contour and adapted to perform the following: provide a voicing measure, where the voicing measure characterizes a degree of voicing of the input speech signal and is derived from several input parameters that are correlated to degrees of periodicity of the signal over the predetermined intervals; extract a prototype waveform (PW) from the LP residual and the open loop pitch contour for a number of equal sub-intervals within the predetermined intervals; normalize the PW by a gain value of the PW; encode a magnitude of the PW; and separate stationary and nonstationary components of the PW using a low complexity alignment process and a filtering process that introduce no delay. The ratio of the energy of the nonstationary component of the PW to that of the stationary component of the PW is averaged across 5 subbands to compute the nonstationarity measure as a frequency dependent vector entity. A measure of the degree of voicing of the residual is also computed using openloop pitchgain, pitch variance, relative signal power, PW correlation and PW nonstationarity in low frequency subbands. The nonstationarity measure and voicing measure are encoded using a 6-bit spectrally weighted vector quantization scheme using a codebook partitioned based on a voiced/unvoiced decision. At the decoder, a stationary component of PW is reconstructed as a weighted combination of the previous PW phase vector, a random phase perturbation and a fixed phase vector obtained from a voiced pitch pulse.
Owner:HUGHES NETWORK SYST

Generation method of vector quantization code book

The invention provides a generation method of a vector quantization code book. In the method, a global optimization method based on a random relaxation technology is introduced; and while iteratively updating the code book every time, random disturbance is generated and added to a corresponding code word, thus local convergence is effectively avoided during the process of updating the code book. The method can further rationally optimize a code book structure and bit positions of the code word according to the inherent characteristic of channel statistical distribution of a wireless communication system and the requirement on orthogonality of user scheduling in a base station in an MIMO system on the selected users. In addition, a method for expanding the code book is also introduced in order to improve the robustness of the code book in a multi-channel statistical distribution environment, and the size of the code book can be flexibly adjusted according to specific conditions. As the antenna number of the base station and downlink user equipment is greatly increased based on the demand of the development of the future communication system, the method can generate reserved interfaces for the future code book, and can achieve higher quantization and system performance with lower complexity even though the method is used in high-dimensional vector quantization.
Owner:SHANGHAI JIAO TONG UNIV +1

Object Recognizer and Detector for Two-Dimensional Images Using Bayesian Network Based Classifier

A system and method for determining a classifier to discriminate between two classes—object or non-object. The classifier may be used by an object detection program to detect presence of a 3D object in a 2D image (e.g., a photograph or an X-ray image). The overall classifier is constructed of a sequence of classifiers (or “sub-classifiers”), where each such classifier is based on a ratio of two graphical probability models (e.g., Bayesian networks). A discrete-valued variable representation at each node in a Bayesian network by a two-stage process of tree-structured vector quantization is discussed. The overall classifier may be part of an object detector program that is trained to automatically detect many different types of 3D objects (e.g., human faces, airplanes, ears, etc.). Computationally efficient statistical methods to evaluate overall classifiers are disclosed. The Bayesian network-based classifier may also be used to determine if two observations (e.g., two images) belong to the same category. For example, in case of face recognition, the classifier may determine whether two photographs are of the same person. A method to provide lighting correction or adjustment to compensate for differences in various lighting conditions of input images is disclosed as well. As per the rules governing abstracts, the content of this abstract should not be used to construe the claims in this application.
Owner:GOOGLE LLC

Video key frame extraction method based on color quantization and clusters

InactiveCN103065153ALow type dependencyAvoid redundant selectionCharacter and pattern recognitionTelevision systemsCanonical quantizationFrame difference
The invention discloses a video key frame extraction method based on color quantization and clusters. The method comprises the steps of loading video data flow; conducting single frame scanning on video flow; conducting the color quantization on obtained frame images, and extracting main color features of the frame images going through quantization; calculating similarity of adjacent frames so as to obtain adjacent frame difference; conducting shot boundary detection according to the adjacent frame difference; conducting shot classification on intersected shots and extracting a representative frame of each shot; and conducting compression clustering on the sequence of the representative frames so as to obtain a key frame sequence. According to the method, the color quantization is conducted on the single frame images so that main color of the images is extracted, frame difference calculation is conducted through the cluster feature similarity calculation method based on color features of the clusters so that the shot boundary detection is realized, and finally clustering according to the compression ratio is conducted on the extracted representative frames. Due to the fact that he whole process is low in dependency on video formats and types, the method has good universality and adaptability, is simple in calculation and low in space consumption, and can effectively avoid the phenomenon of key frame selection redundancy, control the number and quality of the key frames, and realize control of the video compression ratio.
Owner:SOUTHWEAT UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products