Home > Storage Technology Tips > Data storage management > Data classification best practices for tiered storage
Storage Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

Data classification best practices for tiered storage


Rick Cook
Rating: -3.50- (out of 5)

The concept behind tiered storage is simple: Divide data according to its value, access needs and retention requirements, and then handle each class of data accordingly. While this may sound simple, it may not be so easy to accomplish.

According to the Storage Networking Industry Association (SNIA), the hardest part of implementing a tiered storage system is classifying the data.

"The biggest challenge we hear from customers as they begin implementing ILM-based tiered storage is in getting agreement on the information and data classification requirements," reads a quote on the SNIA home page. "This is the crux of establishing successful ILM practices."

Information lifecycle management or ILM is basical...


BROWSE BY TAG
Data storage management,   Data Storage Management,   Tiered storage,   VIEW ALL TAGS

RELATED CONTENT
Data storage management
Solid-state drives vs. hard disk drives: How to justify the cost of an SSD
Tiered storage: A look at internal and external tiered storage models
How to use flash-based SSDs in your environment
Use MAID, intelligent power management as green storage options to control energy consumption
Cloud storage pricing: The cost of a hypothetical month of cloud data storage
Cloud storage pricing revealed: Hidden costs include data migration and access fees
Creating a data center migration plan
Top 10 enterprise data storage tips of 2009
Building a private storage cloud: Essential components
How to add solid-state storage to your enterprise data storage systems

Tiered storage
3PAR adds SSDs, sub-volume automated tiered storage to InServ arrays
No more wasted tiers
Tiered storage: A look at internal and external tiered storage models
Automated tiered storage startup Avere adds support for flash SSDs
EMC releases first version of FAST for automated tiered storage
Storspeed comes out of stealth with SP5000 NAS caching and monitoring appliance
Avere looks to optimize performance of tiered storage with FXT Series
Podcast: How file virtualization can benefit your storage infrastructure
The expanding tiered storage model
Maximizing your enterprise data storage capacity: Improve efficiency and utilization

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
information life cycle management  (SearchStorage.com)
tiered storage  (SearchStorage.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary


ly the goal-directed adjunct to tiered storage. While tiered storage plans deal with how the data is to be divided and stored, ILM deals with the same issue from the user's perspective. In other words, why the data needs to be divided and stored in that manner.

A successful tiered storage implementation has to balance the cost of storage, the life expectancy of data, how often data will be retrieved, and how fast it will be needed against cost and available technology.

There are only three basic tiers of storage -- archival, backup and immediate use -- but a typical business is likely to end up with more than a dozen categories in their tiered storage system. Because the categories have to be composed for specific businesses, there are literally hundreds of rules, often differing in only tiny details.

One variable, for example, is the amount of time a record is retained in each tier and if the record should be moved to the next tier or destroyed.

However, establishing categories for a tiered storage system isn't primarily a storage management problem. It's also a user problem, and each group of users in the business needs to help in deciding the classifications. Storage managers typically get the job of refining the categories and picking the technologies to support them.

User input on data is necessary

You'll need input from each group of users regarding the data they generate or handle. This will include how long to keep the data, how fast the data needs to be available when needed, how likely it is to be needed, when it can be moved between tiers, and when and if it can be destroyed.

Setting up a data classification committee will likely be necessary because data is typically used by more than one department. For example, data that's useless after three months to one department may be useful to another department for a few years.

Whenever possible, the classification itself should be done automatically. That is, the system should be able to determine where to pigeonhole each document without asking the user to classify it. This is usually done based on the type of document (spreadsheet or word processing), time of creation, who created it and what folder it's stored in. This means the classifications need to be easy enough for the system to handle.

The next step is to rationalize these categories and combine them when possible. This involves questioning users about their classification characteristics. For example, a user may only need a document for three months, but if retaining it for six months eliminates a category and doesn't cause problems, it might be worth doing.

Now it's time to start thinking about technology. How much of the hardware and software that you have now can you use in your tiered storage system? What new technology will you need? Can you slipstream it in or will you have to install it in large chunks?

Capacity planning becomes more complicated with tiering

One result of tiering is that capacity planning becomes more complicated. Instead of just needing more hard drives, you need to decide which kinds of hard drives you need (i.e. fast SCSI, medium speed SCSI, SATA, RAID 10, RAID 5, etc.). Don't assume that the storage device categories will grow in lockstep. Tiered storage typically has some categories that grow rapidly, some that hardly grow at all and others that will actually shrink.

For example, archival needs often shrink as data that was once stored permanently is reclassified into categories that are kept for only a limited time.

Once the data is classified, the business rules are set and the technology is in place, dividing the data can be a straightforward process with storage management software and, if needed, data archiving programs.

Some types of data in tiered storage are best handled by specialized data archiving programs, if the amount of data is large enough. This is particularly true with email because of the nature of the backups created by email programs (archival rather than easily searchable) and the large number of small data files.

Once a tiered storage system is established, its classifications should be rigidly adhered to. Let's say records are subpoenaed -- it's crucial that you can deliver all of the records asked for and that all the records you believe to be destroyed have actually been destroyed.

The user who's kept a forgotten copy of a supposedly destroyed email on his/her personal hard drive can be a serious impediment in a court case. You'll need a company policy establishing what kinds of information can be stored where, and you'll need to provide employee education.

Rate this Tip
To rate tips, you must be a member of SearchStorage.com.
Register now to start rating these tips. Log in if you are already a member.




DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



Find Data Backup Analysis
TechTarget Storage Media
Storage Magazine View this month\\'s issue and subscribe today.
Storage Decisions Apply online for free conference admission.
SearchStorage.com
HomeNewsMagazineTopicsLearningMultimediaWhite PapersBlogsEventsAbout Us

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2000 - 2010, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts