Understanding dedupe ratios

This Content Component encountered an error
This article can also be found in the Premium Editorial Download: Storage magazine: New rules change data retention game:

Data deduplication ratios are related to the number of changes occurring to the data. Each percentage increase in data change drops the ratio; the commonly cited 20:1 ratio is based on average data change rates of approximately 5%.

Vendors assume that compression will reduce deduplicated data by a factor of 2:1. If the deduplication ratio were 15:1, for example, compression could increase that ratio to 30:1. But users with large amounts of data stored in compressed formats, such as jpeg, mpeg or zip, aren't likely to realize the extra bump compression provides.

The length of time data is retained will affect reduction rates. To achieve a ratio of 10:1 or 30:1, you may need to retain and deduplicate a single data set over a 20-week period. If you don't have the capacity to store the data for that long, the data-reduction rate will be lower.

Lastly, full backups give deduplication software a more granular view into the backup, so more frequent full backups will achieve higher ratios.

--Jerome M. Wendt


For the full feature go to http://searchstorage.techtarget.com/tips

This was first published in September 2007

Dig deeper on Storage Resources

Pro+

Features

Enjoy the benefits of Pro+ membership, learn more and join.

0 comments

Oldest 

Forgot Password?

No problem! Submit your e-mail address below. We'll send you an email containing your password.

Your password has been sent to:

-ADS BY GOOGLE

SearchSolidStateStorage

SearchVirtualStorage

SearchCloudStorage

SearchDisasterRecovery

SearchDataBackup

Close