Home > Tools and techniques for reducing your enterprise data storage footprint
FAQ:
EMAIL THIS

Tools and techniques for reducing your enterprise data storage footprint

13 Nov 2009 | SearchStorage.com

Storage technology learning materials
Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google

Even though many enterprise data storage environments are coping with tightened budgets and reduced spending, overall net storage capacity is increasing. In this interview, Greg Shulz, founder and senior analyst at Stillwater, Minn.-based StorageIO Group, discusses how storage managers can reduce their data footprint. Schulz touches on the importance of managing your data footprint on both online and offline storage, as well as the various tools for doing so, including data archiving, thin provisioning and data deduplication.

His answers can be read below or downloaded as an MP3.

Table of contents:

>>Why should you be aware of your data footprint?
>>How can you determine your data footprint?
>>How can you reduce your data footprint?
>>How can you address the power consumption of your storage equipment?
>>How can you evaluate a data footprint reduction solution?
>>How do reduction technologies differ for online and offline storage?
>>How can information lifecycle management reduce your data footprint?

What is your data footprint and why should you be aware of it?

Your data footprint is the result of all the information that you have both online and offline. Not only is it your file servers, email systems and databases, but also the backup copies of that data that you have on tape, removable disk or in the cloud.

So your footprint is the copies and the result of all the data you have, not just what's online. What's referred to as the data footprint impact is the result of keeping this information for longer periods of time, and the impact of having to manage, secure and protect it.

What are some techniques for determining your data footprint?

Some of the techniques include tools for storage resource management (SRM), data classification and search. In other words, tools that give you insight into what information and resources you have, where that data is situated and located, and basically how much of your data is being used and where.

Then you need to get insight into how much of that information is changing: how much of it is static, where your backup copies are located, how effective your data protection tools are, what copies of data you have and what your retention policies are. Determining this really comes back to looking at what you currently have, how it's being used and its impact.

What are some of the technologies that can reduce your data footprint?

There's one simple technique, which is deleting all of your data. It's a basic premise, but it's important to have data management processes in place. In addition to simply managing your data, things like archiving are important. There's this notion that archiving is only important for regulatory compliance, but for decades it's been used in highly optimized environments for reducing costs and complexity.

So archiving, compression, data deduplication, thin provisioning, space-saving snapshots and different RAID levels are all techniques you can use to minimize the impact of an expanding data footprint.

How can you address the power consumed by the storage equipment you're housing?

It certainly comes back to archiving. By archiving, you reduce the amount of data that you have in your systems by moving that static data that you've discovered and classified off your primary expensive storage and onto technology that consumes less power. Dedupe also gets a lot of attention for its ability to focus on backup and achieve higher densities of the amount of data that's put on storage when you start to measure on a capacity per-watt basis.

There are also some other things that come into play. Certainly using intelligent power management techniques, such as second-generation massive array of idle disks (MAID), which power devices down when they're not in use. But you need to do so in a way that doesn't compromise the quality of service.

What should you be looking for when selecting a data footprint reduction solution?

Take a step back and determine if you're looking for an online, offline or near-line solution. Is this for your database, email system or file system, or is this for online active archives or for backups?

Having that in mind determines what types of tools you'll be looking for. If you're looking to do this for backup, then you'll be looking at tools for deduplication, whether it's immediate or post-processing, source or target-based. If you're looking to reduce the impact on your online running databases, then you'll be looking at things like database archiving or compression, where within the database software, you can turn compression on.

Is there a difference in the data footprint reduction technologies used for online storage and offline storage?

Generally speaking there is, but where it's becoming blurred is where there are some tools that can work online, offline or in a near-line mode. There are some tools that will work online for files, but can't work for block data. The big differentiator is that with a lot of tools for reducing data footprints, you're trading time for space.

In other words, if you can take time to reduce the data, you can achieve space savings. When you're looking at online data, the primary premise for performance is time. So it's a balancing act -- you trade time for space. If you need time, that's performance. If you need space, that's capacity reduction. That's where the significant differences lie in the tools for working with primary databases vs. a file system.

How can information lifecycle management (ILM) help you reduce your data footprint?

It depends on how you view information lifecycle management. If you view ILM as a product -- as in archiving, shelving or hierarchal storage management (HSM) -- than that's a data footprint technique. But if you think of it as more of a technique for managing your information and aligning the right tiers instead of just simply deleting everything, then ILM is part of the overall umbrella of data management. If you think of it in terms of a specific product or technique, than it's just one of the tools in the tool box.

BROWSE BY TAG
Data Backup,   Data reduction and deduplication,   Data Storage Management,   Data management tools,   Primary storage capacity optimization,   VIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google



RELATED CONTENT
Data reduction and deduplication
Hot storage technologies for 2010
Is data deduplication right for your primary storage infrastructure?
Backup in a snap: A guide to snapshot technologies
Tips for an effective data deduplication implementation
EMC completes acquisition of Data Domain; fate of data deduplication partner Quantum unknown
EMC wraps up data deduplication vendor Data Domain; what's next for EMC, NetApp?
EMC acquires Data Domain for $2.1B after NetApp drops bid
EMC raises its acquisition offer for data deduplication vendor Data Domain to $2.1B
Choosing a storage system for data archiving
Storage Decisions Chicago 2009 Session Downloads

Data management tools
Green storage essentials: Addressing power, cooling and space issues
Performance metrics: Evaluating your data storage efficiency
School district maintains uptime with Xiotech, DataCore
Tools for using your enterprise data storage resources more efficiently
Enterprise data storage technologies rise from the dead
SAN sales boosted by need for storage efficiency
Thin provisioning brings utilization and capacity benefits to data storage, but with a caveat
Improving storage utilization with thin provisioning
Managing capacity planning with thin provisioning
Improved enterprise storage management tools are needed, especially for data backup
Data management tools Research

Primary storage capacity optimization
Green storage essentials: Addressing power, cooling and space issues
Performance metrics: Evaluating your data storage efficiency
Tools for using your enterprise data storage resources more efficiently
Optimizing enterprise data storage capacity and performance to reduce your data footprint
Thin provisioning brings utilization and capacity benefits to data storage, but with a caveat
Improving storage utilization with thin provisioning
Managing capacity planning with thin provisioning
Is data deduplication right for your primary storage infrastructure?
3PAR fattens its thin provisioning arsenal
Storage virtualization essentials: Increasing flexibility and utilization with virtualization

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
data deduplication  (SearchStorage.com)
delta differencing  (SearchStorage.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary




Find Data Reduction
TechTarget Storage Media
Storage Magazine View this month\\'s issue and subscribe today.
Storage Decisions Apply online for free conference admission.
SearchStorage.com
HomeNewsMagazineTopicsLearningMultimediaWhite PapersBlogsEventsAbout Us

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2000 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts