This article can also be found in the Premium Editorial Download "Storage magazine: Storage Products of the Year 2010."
Download it now to read this article plus other related content.
The buzz around data deduplication continues unabated. Based on our experience, many IT users have already deployed data deduplication to some extent in their data centers. And many of those who haven't taken the dedupe plunge are planning to in 2011. Still, we estimate that only approximately 20% of worldwide enterprise data is protected on disk, with another 50% still to be added to that number. The rest will stay on tape or remain unprotected.
But even as shops enjoy the benefits of dedupe, some are beginning to worry about the proliferation of secondary disks. And many of those are wondering if global deduplication might actually deliver more efficiency across large amounts of data. But before we get into global data deduplication, let me define data deduplication itself, especially in contrast to primary storage optimization (PSO). The term data deduplication has become so overused that vendors often include PSO technologies within its meaning.
At Taneja Group, PSO is defined as the technologies that reduce capacity requirements for primary storage, while secondary capacity optimization (SCO) refers to solutions that reduce required disk capacity for protecting data on secondary storage (including long-term archiving). We differentiate between the two because the technologies behind each are drastically different.
This was first published in February 2011