home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: comp.compression
- Path: sparky!uunet!gatech!destroyer!caen!nic.umass.edu!noc.near.net!gateway!pictel!garys
- From: garys@pictel.com (Gary Sullivan)
- Subject: Re: Comparison: BMA vs. PRA
- Message-ID: <1992Aug14.153217.4146@pictel.com>
- Organization: PictureTel Corporation
- References: <1992Aug12.052132.9017@kum.kaist.ac.kr>
- Date: Fri, 14 Aug 1992 15:32:17 GMT
- Lines: 161
-
- In article <1992Aug12.052132.9017@kum.kaist.ac.kr> hckim@camars.kaist.ac.kr (Hyung Chul Kim) writes:
- >Hi folks,
- >
- >BMA(Block Matching Algorithm) and PRA(Pel Recursive Algorithm) are
- >motion estimation algorithms for motion video compression.
- >Anyone knows whether there is any comparison study about them from the point of
- >view of speed and compression rate?
- >If you have ever studied or have any information about it,
- >please let me know.
- >
-
- Here is an excerpt from the LaTeX source of my motion compensation article in '91 ICASSP.
-
- Reference: @inproceedings{sullivanCGI91conf,
- author = "G. J. Sullivan and R. L. Baker",
- title = "Motion Compensation for Video Compression using Control Grid Interpolation",
- booktitle = icassp,
- pages = "2713--2716",
- month = may,
- year = 1991}
-
- I have included some lines of the original source that were commented out
- (using "%") in the actual paper due to constraints on the length of the
- paper, as I think these lines may be helpful in some cases. However, the
- text is not guaranteed to make sense with these lines included. It is presumed
- to make sense without them.
-
- The summary of what is below is this:
- (1) PRAs are fairly useless for low rate video compression, since they require
- sequential encoding, not block-oriented methods such as DCT or large-block VQ.
- (2) PRAs are more normally more computationally complex than BMAs but can
- produce a better motion vector field estimation under some circumstances.
-
- ----------------- Included description below --------------------------
- The dominant methods of motion compensation are {\em block matching
- algorithms} (BMAs)~[1--3]\nocite{musmann85,rao85,MPEGSM3}.
- In a BMA,
- the input image is decomposed into a regular
- grid of rectangular blocks. One MV is found for each block, and all pels
- in the block are displaced equally. The chosen MVs
- are entropy coded and sent to the receiver.
- %The focus of much of the research to date on BMAs consists of
- %various methods for efficiently searching for the best MV for each block.
- A straightforward but computationally intense way of choosing these MVs is
- called {\em full-search}, which consists of measuring the distortion for every
- allowable MV value and choosing the best directly.
- %Many methods for reducing the
- %number of tested vectors have been proposed, each of which by construction
- %has performance somewhat below that of full-search.
-
- There are several well-known problems with BMAs.
- %The borders of moving objects
- %do not normally coincide with the borders of the blocks used for the BMA.
- When objects undergoing different types of motion are found in the same block,
- the use of the same MV for every pel in the block becomes
- inappropriate. Also, when adjacent blocks use different MVs, they
- form a straight-line discontinuity in the prediction image.
- Since the human visual system is very sensitive to straight lines, these
- errors appear as {\em blocking artifacts}. Since the motion vectors are
- determined independently for each block, the motion vector field
- is discontinuous, difficult to compress, and may have little relation to
- true object motion.
- Further, since each block can
- undergo only translational motion, BMAs have difficulty with other types of
- motion such as rotation and depth changes. These problems
- demonstrate the need for methods which operate on individual pels, rather
- than large blocks.
-
- The next most common class of methods for motion compensation is
- {\em pel-recursive algorithms}
- (PRAs)~\cite{musmann85}.
- These operate on a pel basis and overcome
- many of the problems associated with BMAs.
- %They produce
- %less distortion and have no blocking artifacts.
- %In a PRA each individual pel or small block of pels
- %has its own motion vector, which is
- %calculated by an iterative method. In their preferred implementation,
- %PRAs do not send these motion vectors to the receiver,
- %because to do so would require large
- %amounts of side information. Instead, they use
- %previously decoded data from the current frame
- %to find a MV that is appropriate for the area surrounding
- %the next pel to be decoded.
- %Since this requires that each pel be decoded individually, it
- However, in their preferred implementation, PRAs require that each
- pel be decoded sequentially, thus making them
- incompatible with block coding methods, such as DCT or VQ.
- %PRAs also have high complexity, since they involve an iterative
- %optimization procedure for every pel.
- %
- Several other methods for motion compensation have been proposed.
- Predictive segmentation uses a BMA and data
- from prior frames to divide a block into regions having differing
- MVs~\cite{orchard90}. This method shows promise, but requires
- extra rate to identify the segmentation.
- %Also, since segmentation is
- %a hard (rather than soft) decision, errors can produce large residuals.
- Other methods based on object modeling may be useful in the future, but
- have yet to be developed for real-time application to general, complex
- scenes typical of videoconferencing or broadcast video.
-
- In this paper, we describe a new class of motion compensation methods based on
- control grid interpolation (CGI)~\cite{castle79}.
- CGI is a spatial transformation method which can be used to create a smooth
- interpolated motion field.
- The authors are
- unaware of any prior study of CGI in motion compensation.
- We show that BMA is a special case of CGI, and
- describe a new CGI algorithm that is compatible with nearly any BMA coding
- system, yet
- produces no blocking artifacts. It uses the same or less rate
- than the corresponding BMA, and has about the same mean-square distortion.
- Nearly all of the theory developed for BMA searching and for encoding BMA
- motion vector data is easily applied to the CGI system.
-
- Citations:
-
- @article{musmann85,
- author = "H. G. Musmann and P. Pirsch and H-J. Grallert",
- title = "Advances in Picture Coding",
- journal = procieee,
- volume = "73",
- number = 4,
- pages = "523--548",
- month = apr,
- year = 1985}
-
- @article{rao85,
- author = "R. Srinivasan and K. R. Rao",
- title = "Predictive Coding Based on Efficient Motion Estimation",
- journal = ieeetransCOM,
- volume = "COM-33",
- pages = "888-896",
- month = aug,
- year = 1985}
-
- @unpublished{MPEGSM3,
- author = "{MPEG Video Editorial Group}",
- title = "{MPEG} Video Simulation Model Three",
- month = aug,
- note = "Int. Standards Org.",
- year = 1990}
-
- @inproceedings{orchard90,
- author = "M. T. Orchard",
- title = "Predictive Motion Field Segmentation for Image Sequence Coding",
- booktitle = icassp,
- month = apr,
- volume = "4",
- pages = "1977--1980",
- year = 1990}
-
- @book{castle79,
- author = "K. R. Castleman",
- title = "Digital Image Processing",
- address = "Englewood Cliffs, NJ",
- publisher = "Prentice-Hall",
- year = 1979}
- ---------------------------------
- Gary Sullivan (garys@pictel.com)
-