- 著者
- Edward R. Fiala, Daniel H. Greene
- タイトル
- Data Compression with Finite Windows
- ページ
- 490-505
- 日時
- April 1989
- コメント
- LZFGとして知られる手法である。データ構造として
PATRICIA Trie\cite{Morrison:PATRICIA}を使っている。
- 感想
- The research of Ziv and Lempel led to a family of
compression methods based on the idea of replacing a
group of characters by a reference to an earlier
occurrence of that same group. The authors explore one
particular form of this compression method which
restricts the references so that they can only point
into a finite sliding window of text. Their aim is
clearly to develop a practical compression algorithm.
In particular, the authors attempt to achieve good
compression performance, but at the same time they
want fast compression and decompression algorithms
with reasonable storage requirements. As far as
compression performance is concerned, the algorithms
are definitely successful; the figures reported in the
paper indicate that they achieve compression similar
to or better than competing methods. The compression
performance of these algorithms is also good on
relatively small files, whereas many other methods
achieve their best performance only on very large
files. Unfortunately, the authors do not provide all
the data needed to compare execution time
requirements. I was particularly disappointed to find
that they make no speed comparison with the ZLW
(Ziv-Lempel-Welch) method, which is provided on most
UNIX systems as the compress command and is in
widespread use.
\\
The quality of the paper is uneven. The first half
gives a readable introduction to the use of suffix
trees in implementing the window-based compression
algorithm and provides theorems to show that they have
linear-time update costs. But pragmatic considerations
then take over, and all the details of the implemented
algorithms will likely overwhelm the reader. Although
such details are important when extracting the last
drop of compression performance, they definitely
detract from the paper's readability.
\\
Finally, I cannot conclude this review without
complaining about the three graphs reproduced in the
paper. Each graph includes a curve labeled H1 that
purports to show the entropy of the file as determined
by first-order Markov statistics. The curves are not
corrected to compensate for small sample sizes,
however, and are therefore meaningless. It would have
made more sense for the authors to use the
Cleary-Witten method to generate the data for a
first-order Markov model. Another problem with these
graphs is that the numbers marked on the vertical axes
do not agree at all well with the numbers printed
alongside the curves.
\\
-R. N. Horspool, Victoria, B.C., Canada
- カテゴリ
- Compress
Category: Compress
Journal: cacm
Comment: LZFGとして知られる手法である。データ構造として
PATRICIA Trie\cite{Morrison:PATRICIA}を使っている。
Number: 4
Bibtype: Article
Author: Edward R. Fiala
Daniel H. Greene
Pages: 490-505
Month: apr
Review: The research of Ziv and Lempel led to a family of
compression methods based on the idea of replacing a
group of characters by a reference to an earlier
occurrence of that same group. The authors explore one
particular form of this compression method which
restricts the references so that they can only point
into a finite sliding window of text. Their aim is
clearly to develop a practical compression algorithm.
In particular, the authors attempt to achieve good
compression performance, but at the same time they
want fast compression and decompression algorithms
with reasonable storage requirements. As far as
compression performance is concerned, the algorithms
are definitely successful; the figures reported in the
paper indicate that they achieve compression similar
to or better than competing methods. The compression
performance of these algorithms is also good on
relatively small files, whereas many other methods
achieve their best performance only on very large
files. Unfortunately, the authors do not provide all
the data needed to compare execution time
requirements. I was particularly disappointed to find
that they make no speed comparison with the ZLW
(Ziv-Lempel-Welch) method, which is provided on most
UNIX systems as the compress command and is in
widespread use.
\\
The quality of the paper is uneven. The first half
gives a readable introduction to the use of suffix
trees in implementing the window-based compression
algorithm and provides theorems to show that they have
linear-time update costs. But pragmatic considerations
then take over, and all the details of the implemented
algorithms will likely overwhelm the reader. Although
such details are important when extracting the last
drop of compression performance, they definitely
detract from the paper's readability.
\\
Finally, I cannot conclude this review without
complaining about the three graphs reproduced in the
paper. Each graph includes a curve labeled H1 that
purports to show the entropy of the file as determined
by first-order Markov statistics. The curves are not
corrected to compensate for small sample sizes,
however, and are therefore meaningless. It would have
made more sense for the authors to use the
Cleary-Witten method to generate the data for a
first-order Markov model. Another problem with these
graphs is that the numbers marked on the vertical axes
do not agree at all well with the numbers printed
alongside the curves.
\\
-R. N. Horspool, Victoria, B.C., Canada
Title: Data Compression with Finite Windows
Year: 1989
Volume: 32