Home > Storage Technology Tips > Data storage management > Why you should perform data classification
Storage Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

DATA STORAGE MANAGEMENT

Why you should perform data classification


Pierre Dorion
07.13.2007
Rating: -3.50- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


What you will learn: Regardless of how daunting a task it may be, understanding the data we store is the first step in addressing data management issues. This tip outlines some of the main reasons why data classification is essential.

Data classification is an exercise every organization will have to consider at some point. The economics of data growth go far beyond simply adding storage capacity. Getting a good understanding of the type of data being stored is at the root of many IT projects that are causing most companies growing pains and headaches. Data classification can be based on type, application, owner, age, legal aspect, criticality, value, etc., and a growing number of software tools are available to assist with this task. But essentially, data must be siloed so it can be better managed. Data classification must be a collaborative effort between IT and business operations since the business units rarely know what data they need, and IT typically only has limited knowledge of who uses the data and why.

Consider the following:

How do you decide what data to retain?

We have all heard of information lifecycle management (ILM), and opinions differ as to what it is, what it is not, and whose responsibility it is. Regardless of opinions and what it is called, at some point or another data might be disposed of, moved offline (i.e., archived) or migrated to lower cost storage (tiered storage). But, before a decision is made regarding where data will reside next, there must be an understanding of what the data is.

What data needs to be backed up?

You are likely backing up data that has not been modified or even accessed in months -- if not years. This practice unnecessarily uses up time in a shrinking backup window, network bandwidth and capacity on your backup storage infrastructure. How long does it take to complete a full backup of your file servers? How much of that data is actually "production" data?



>What data should be archived?

Too often, backup products are used as archival tools simply because a decision cannot be made as to when data should be taken out of production and out of the backup loop. It is not rare to see a financial database backup retained for seven years, only to find out that the same database tables are also part of last night's backup.

What data should be restored first for disaster recovery?

When planning for disaster recovery, it is seldom clear which data must be restored first. Recovery time objectives (RTO) are typically driven by the criticality of a business process or application, but the planning often falls short of clearly identifying the associated data.

How is data migrated in tiered storage?

Regardless of whether storage tiers are implemented based on performance requirements, criticality or functionality, data migration across tiers can only take place once data is classified.

What data is subject to regulatory compliance laws?

Regulatory compliance targeting availability of records has sent many companies running in all directions, because they had never taken the time to examine what data is stored. The answer has unfortunately been to increase capacity until there is a better understanding of what is subject to the rules. This understanding is unlikely until the data is inventoried and classified.

How do you develop a chargeback model for storage?

When the time comes to obtain funding for IT, knowing which departments or functional areas are the biggest storage consumers can help build a business case for IT. Data classification can assist with developing a "chargeback" model for storage.

In the heydays of paper records, there were records managers. These people knew what the records were, where they were stored and when they should be archived and disposed of. Nowadays, records managers have mostly been replaced by data storage administrators -- who cannot be as close to the data as their predecessors could.

For many large organizations, classifying existing data may never be fully addressed due to the massive amounts of records accumulated over the years. For some, the only answer might be to draw a line in the sand and develop data management policies going forward that include categorization as soon as data is generated.

About the author: Pierre Dorion is a certified business continuity professional for Mainland Information Systems Inc.


Rate this Tip
To rate tips, you must be a member of SearchStorage.com.
Register now to start rating these tips. Log in if you are already a member.




BROWSE BY TAG
Data storage management,   Data Storage Management,   Tiered storage,   Data storage compliance and archiving,   E-discovery,   Additional e-discovery information,   Related information,   Tiered Storage,   Tiered storage data classification tools,   VIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


RELATED CONTENT
Data storage management
Choosing a storage system for data archiving
Green storage best practices control costs, increase energy efficiency
Best practices for using server virtualization in your storage environment
Best practices for effective thin provisioning
Three tips for ensuring a user-friendly email archiving system
Top tips when evaluating a storage automation product
Guidelines for implementing virtualization in your storage infrastructure
The value of easy-to-use SAN storage
Pros and cons of storage capacity management tools
What are the differences between SATA II vs. SATA I?

Tiered storage
Green storage best practices control costs, increase energy efficiency
Storage Decisions Chicago 2009 Session Downloads
Storage Decisions Session Downloads: Storage Systems & Storage Management Track (Chicago 2009)
Majority of users opt for tiered storage
A dynamic four-tier storage design
MAID technology remains underutilized
MAID product roundup
The state of MAID in data centers
Data classification fueled by e-discovery, storage tiering
Permabit makes secondary storage its primary goal

Data storage compliance and archiving
Choosing a storage system for data archiving
Mimosa Systems adds case management tool to NearPoint 4.0 data archiving software
Mimosa NearPoint, LiveOffice Mail Archive offer hybrid SaaS email archiving approach
HP resizes its ExDS9100 scale-out NAS system; finds market broader than original Web 2.0 target
New data archiving products focus on software-only delivery, cloud integration
Email archiving strategies: Five best practices
Email archiving needs soar as e-discovery requests rise
Storage Decisions Chicago 2009 Session Downloads
Storage Decisions Session Downloads: Data Retention & Retrieval Track (Chicago 2009)
Storage Decisions Session Downloads: Storage Systems & Storage Management Track (Chicago 2009)
Data storage compliance and archiving Research

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
information life cycle management  (SearchStorage.com)
tiered storage  (SearchStorage.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



Find Data Backup Analysis and Server Storage Channel Solutions

The Data Domain Data DeDuplication Center - Data Retention, Replication and Recovery

TechTarget Storage Media
Storage Magazine View this month\\'s issue and subscribe today.
Storage Decisions Apply online for free conference admission.
SearchStorage.com
HomeNewsMagazineTopicsLearningMultimediaWhite PapersBlogsEventsAbout Us

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2000 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts