About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
DCC 2004
Conference paper
An approximation to the greedy algorithm for differential compression of very large files
Abstract
A new diffrential compression algorithm that combines the hash value and suffix array techniques of Manber and Myers, was discussed. Differential compression algorithms did not find the longest matches for every offset of the version file. The performance of Greedy algorithm was based on the block hash table, the quick index array and the pointer array. Computing the suffix arrray of block hash reduced the storage and computing requirements by a factor equal to the block size used. It was found that the differential compression method was used in compression applications, web crawling as well as computational biology.