|
JPEG2000是基於小波變換的圖像壓縮標準,由Joint Photographic Experts Group組織創建和維護。JPEG2000通常被認為是未來取代JPEG(基於離散餘弦變換)的下一代圖像壓縮標準。JPEG2000文件的擴展名通常為.jp2,MIME類型是image/jp2。
- y% z: d) A. d. {4 r5 ^' I `: K* f! Z& {& M
JPEG2000的壓縮比更高,而且不會產生原先的基於離散餘弦變換的JPEG標準產生的'blocky and blurry'artifacts。JPEG2000同時支持破壞性資料壓縮和非破壞性資料壓縮。另外,JPEG2000也支持更複雜的漸進式顯示和下載。; j' J$ W2 @0 i( u9 y
# V% p* D- A/ m0 T5 H* D
JPEG2000是國際標準化組織(ISO)發佈的標準,文檔代碼為ISO/IEC 15444-1:2000。雖然JPEG2000在技術上有一定的優勢,但是到目前為止(2006年),網際網路上採用JPEG2000技術製作的圖像文件數量仍然很少,並且大多數的瀏覽器仍然沒有預設支持JPEG2000圖像文件的顯示。但是,由於JPEG2000在非破壞性壓縮下仍然能有比較好的壓縮率,所以JPEG2000在圖像質量要求比較高的醫學圖像的分析和處理中已經有了一定程度的廣泛應用。$ ^1 y. [2 O* H
- J0 n3 C) N# E; ]- X2 ~
; c- X: Z2 Z1 m相關標準1 i& Y( x6 Q2 Q! F; u
已經發佈的JPEG2000標準包括ISO/IEC 15444-1:2000。另外,和JPEG2000相關的一些額外標準也正在制定和討論中,比如JPEG2000安全圖像傳輸(JPSEC)以及基於連接的JPEG2000圖像瀏覽(JPIP)等。# h, h* R$ ?0 B" o! `
( t( U5 R. V3 J7 H9 V& [6 JSeveral additional parts of the JPEG 2000 standard exist, some of them are not yet officially released. Amongst them are ISO/IEC 15444-2:2000, JPEG 2000 extensions, featuring for example trellis quantization, an extended file format and additional color transformations, ISO/IEC 15444-4:2000, the reference testing and ISO/IEC 15444-6:2000, the compound image file format, allowing compression of compound text/image graphics. Extensions for secure image transfer, JPSEC, and connection-based image browsing, called JPIP are currently under discussion by the ISO.
, x: z0 x6 o X
3 r% R2 A; C! w9 F/ R) T技術探討
. k, Q: K+ [! w2 W# Y/ N. |7 wImage:Jpeg 2000.jpg 0 w& D3 D% t/ e" C; w0 R a
Comparison of JPEG 2000 with the original JPEG format.JPEG 2000的目標不僅僅是性能要超越JPEG,而且增加和增強瞭如可縮放性和可編輯性這樣的特性。. m6 W7 E# h: ]& P, f; O4 c% |: b
5 k, @2 Z: I: R7 q0 W) u
In fact, JPEG 2000's improvement in compression performance relative to the original JPEG standard is actually rather modest and should not ordinarily be the primary consideration for evaluating the design. Moreover, very low and very high compression rates (including lossless compression) are also supported in JPEG 2000. In fact, the graceful ability of the design to handle a very large range of effective bit rates is one of the strengths of JPEG 2000. (For example, to reduce the number of bits for a picture below a certain amount, the advisable thing to do with the first JPEG standard is to reduce the resolution of the input image before encoding it — something that is ordinarily not necessary for that purpose when using JPEG 2000 because of its wavelet scalability properties.)
+ V/ q2 i2 E2 D: Z% P
* o! V: l( [2 \: k1 P2 y# EJPEG 2000, as did the original JPEG 1992 standard, applies a form of transform coding to compress images. However, JPEG 2000 uses a wavelet transform, in contrast to JPEG 1992 which uses an 8x8 block-size discrete cosine transform.
8 A+ p' g0 l, a* z7 |& D/ k+ @- V9 f3 O
Initially, images have to be transformed (from the RGB color space) to the well known YCbCr color space or to the RCT space (reversible component transform) leading to three components. In lossless mode a very rough, but reversible, approximation of this color space conversion is used. The chrominance components can be, but not necessarily have to be, down-scaled in resolution; in fact, since the wavelet transformation already separates images into scales, downsampling is more effectively handled by dropping the finest wavelet scale. This step is called multiple component transformation in the JPEG 2000 language since its usage is not restricted to the RGB color model.3 I3 t4 t+ L& i# Y; |0 n( W7 L6 V
$ d2 N) N$ ?! Y% \After color transformation, the image is split into so-called tiles, rectangular regions of the image that are transformed and encoded separately. The purpose of tiles is to cope with memory limitations more easily. These tiles are then wavelet transformed to an arbitrary depth.; O9 ~7 [5 _0 g. K
) t" s- V3 S9 ^: n' ? [The result is a collection of sub-bands which represent several approximation scales. A sub-band is a set of coefficients — real numbers which represent aspects of the image associated with a certain frequency range as well as a spatial area of the image. These coefficients are scalar-quantized, giving a set of integer numbers which have to be encoded bit-by-bit.
3 z6 B6 I! ?3 A. x! A/ a
- r" F! y4 \( B5 o1 W; X% IThe quantized sub-bands are split further into precincts, rectangular regions in the wavelet domain. They are typically selected in a way that the coefficients within them across the sub-bands form approximately spatial blocks in the (reconstructed) image domain, though this is not a requirement./ Z& P; c2 X6 f1 _9 x; o& ^
4 \# ?9 W1 [ [5 a' G9 C
Precincts are split further into code-blocks. Code-blocks are located in a single sub-band and have equal sizes — except those located at the edges of the image. The encoder has to encode the bits of all quantized coefficients of a code-block, starting with the most significant bits and progressing to less significant bits by a process called the EBCOT scheme. EBCOT here stands for Embedded Block Coding with Optimal Truncation. In this encoding process, each bit-plane of the codeblock gets encoded in three so called coding passes, first encoding bits (and signs) of insignificant coefficients with significant neighbors (i.e. with 1-bits in higher bit-planes), then refinement bits of significant coefficients and finally coefficients without significant neighbours. The three passes are called Significance Propagation, Magnitude Refinement and Cleanup Pass, respectively.
8 Q9 v1 M: R! a4 W3 j t, Z+ W, |+ T% D* B8 ]& S# v( s" u
The bits selected by these coding passes then get encoded by a context-driven binary arithmetic coder, namely the binary MQ-coder. The context of a coefficient is formed by the state of its nine neighbours in the codeblock./ d! j" i. P, D5 x+ _7 L9 i
2 _( O {0 c, f8 V3 Z
The result is a bit-stream that is split into packets where a packet groups selected passes of all codeblocks from a precinct into one indivisible unit. Packets are the key to quality scalability (i.e. packets containing less significant bits can be discarded to achieve lower bit-rates and higher distortion).
6 M; ]' q1 [3 W' l/ j. E. W. R% G3 ?9 D/ ?% l+ T5 h8 D; E
Packets from all sub-bands are then collected in so-called layers. The way the packets are built up from the code-block coding passes, and thus which packets a layer shall contain is not defined by the JPEG 2000 standard, but in general a codec will try to build layers in such a way that the image quality will increase monotonically with each layer, and the image distortion will shrink from layer to layer. Thus, layers define the progression by image quality within the codestream.1 M$ s9 |, v0 Y7 i
+ j0 w: Z$ G9 l2 z8 ?8 `The problem is now to find the optimal packet length for all code-blocks which minimizes the overall distortion in a way that the generated target bitrate equals the demanded bitrate. While the standard does not define a procedure as to how to perform this so-called rate-distortion optimization, the general outline is given in one of its many appendices: For each bit encoded by the EBCOT coder, the improvement in image quality, defined as mean square error, gets measured; this can be implemented by an easy table-lookup algorithm. Furthermore, the length of the resulting codestream gets measured. This forms for each codeblock a graph in the rate-distortion plane, giving image quality over bitstream length. The optimal selection for the truncation points, thus for the packet-build-up points is then given by defining critical slopes of these curves, and picking all those coding passes whose curve in the rate-distortion graph is steeper than the given critical slope. This method can be seen as a special application of the method of Lagrange multiplier which is used for optimization problems under constraints. The Lagrange multiplier, typically denoted by λ, turns out to be the critical slope, the constraint is the demanded target bitrate, and the value to optimize is the overall distortion.
/ [0 y* Y3 d7 H% L. f8 s6 Z- d; I t1 N0 t4 g
Packets can be reordered almost arbitrarily in the JPEG 2000 bit-stream; this gives the encoder as well as image servers a high degree of freedom., @% r- `+ ?+ r9 I" Q; V
! Z' G Z9 S8 e4 a/ X% UAlready encoded images can be sent over networks with arbitrary bit-rates by using a layer-progressive encoding order. On the other hand, color components can be moved back in the bit-stream; lower resolutions (corresponding to low-frequency sub-bands) could be sent first for image previewing. Finally, spatial browsing of large images is possible through appropriate tile- and/or partition selection. All these operations do not require any re-encoding but only byte-wise copy operations.
6 B, \* I9 E) q0 K* }; X
2 `' j4 I6 B& j4 w1 q* l. pJPEG 2000 gains up to about 20% compression performance for medium compression rates in comparison to the first JPEG standard. For lower or higher compression rates, the improvement can be somewhat greater (especially if altering the input resolution to the codec is not considered as a technique for effective use of the older JPEG standard). Good applications for JPEG 2000 are large images, images with low-contrast edges — e.g. medical images.0 ^7 X2 C( Z; y- r
8 N \" e6 ]: x9 ` W
It has, however, notably higher computational and memory demands.
+ O4 Y6 x; @) y3 v; i7 u) l3 }3 I+ O1 H, x6 K8 w2 v9 J" _
Lossless compression is achieved through the use of a special integer wavelet filter (biorthogonal 3/5 instead of Daubechies biorthogonal 7/9) and a quantization step size of 1. Clearly, in lossless mode all bitplanes have to be encoded by the EBCOT, and no bitplanes can be dropped.: O8 F/ Y S5 w9 h2 t( h: V
0 m3 {. {9 T( z6 Y* b
[ 本帖最後由 masonchung 於 2007-1-20 02:42 PM 編輯 ] |
|