Home > Storage Technology Tips > Data storage management > Checklist: Fight these storage fires -- automatically
Storage Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

DATA STORAGE MANAGEMENT

Checklist: Fight these storage fires -- automatically


Brett P. Cooper
03.02.2005
Rating: --- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


What you will learn from this tip: How to identify storage events and resolve them programmatically. Need a downloadable copy? Go here.


Storage managers who are looking to automate problem determination and resolution have two choices: increase the budget and purchase a suite of software and services to deliver these capabilities, or take advantage of the methodology behind the tools to deliver services that can automatically address the issues before they become problematic. Here are eight steps to get you there:

 Fight these storage fires -- automatically
Step 1: Identify the problem -- if there is one.
You may choose to undertake a comprehensive evaluation of your data center, or you may start on one site with one database, or anywhere in-between. Once you've determined the scope of your audit, you can begin. You need to know what hardware is out there, including hardware across the SAN and hardware that's directly connected. You will likely be pleasantly surprised at how much storage you actually have.
Step 2: Research the event.
When the switch reports a high port utilization event, a help desk ticket is created. It's usually 2:00 a.m., and there's no one at the data center to help you diagnose the issue. So, you hop into your car, or dial into your VPN, and research the issue. Then you realize that the time of the event seems familiar. It seems that the backup job kicks off at this time each night. When you look at the backup reporting tool, the tapes are spinning and everything looks OK.

If the backup reporting tool shows a failed backup job at this time, then you may have taken the traces from the event and made an alert to call for action to resolve this in the future.

Step 3: Define a corrective action.
The next morning, after a few hours of sleep and a well-deserved cup of coffee, you are back in the office explaining the situation to your team. If there was an actual alarm, this is where the team would get together and determine a corrective action. This could take the form of adding more throughput capability on the switches, breaking up the backup job, changing the schedule, or simply raising the threshold for throughput on the switch port. Both time and events can be part of an automated response to an issue.
Step 4: Document the solution and the traces of the problem into a knowledgebase.
The best way to define a storage management policy is to look at the events that occur in the day-to-day operations of a storage environment from the disk, tape, switch, host and application perspectives. Then you can then look at how the events and alarms are created and how the team processes each event -- from problem determination to resolution. I look at these events and alarms as the traces of a problem, and the actions that are taken to resolve them as a policy in its infancy. I have a saying that has served me well: "From practice, comes process. From process comes policy."/td>
Step 5: The problem repeats. Now what?
Look at the knowledgebase where you documented the solution the first time and follow the steps to resolve the issue. If the traces from the event are different, then research the issue and provide a resolution.
Step 6: Go back into problem resolution mode.
The next morning, you should check the problem traces (events) to make sure that it is the same situation. This may be a recurring event, which calls for filtering technologies to remove the spurious event from the notification system, or changing the thresholds on the switch or infrastructure that generates the event(s).
Step 7: If the problem occurs repeatedly, automate a response with a simple script.
Now that you are a master of this issue, you can script a response or filter out for this type of event in the monitoring and alerting solution that you use.
Step 8: Continual monitoring and training.
Train all of your staff, including new hires, on this process.

In the end, if this allows you to sleep an additional few hours and focus your precious time on what is truly important, than you are doing the right thing and can move on to the next big thing.

The goal: To keep your team from working on problems that aren't real problems so they can focus on the truly large issues at hand.

For more information:

Checklist: Ten steps to troubleshooting SAN/NAS performance problems

Tip: Know your port errors when troubleshooting SANs

Tip: Troubleshoot SANs from the center out

About the author: Brett P. Cooper is a frequent speaker at storage industry events. During his recent tenure at Veritas Software, he was responsible for developing and delivering the first release of Veritas SANPoint Control, one of the industry's first storage management solutions. Brett was also one of the founders of the Veritas Press, where he acted as technical advisor for the well-known storage reference book, "Storage area network essentials: A complete guide to understanding & implementing SANs," by Paul Massiglia and Richard Barker. In Brett's current role for Network Appliance, he is responsible for delivering Fibre Channel Protocol (FCP) and Internet SCSI (iSCSI) solutions to the market.

Rate this Tip
To rate tips, you must be a member of SearchStorage.com.
Register now to start rating these tips. Log in if you are already a member.




BROWSE BY TAG
SAN management,   SAN (storage area network),   Data storage management,   VIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   



RELATED CONTENT
SAN management
Unilever maintains 5 PB Fibre Channel SAN storage performance with Virtual Instruments' NetWisdom
Storage Decisions Chicago 2009 Session Downloads
Storage Decisions Session Downloads: Managing Storage Networks Track (Chicago 2009)
Storage-area networks to become increasingly object based
Data storage management in virtual server environments
10 Gb Ethernet bodes well for iSCSI
Mellanox builds bridge to consolidation
Best storage Products of the Year 2008
Wide stripe before you dive into SSD
How your SAN will evolve
SAN management Research

Data storage management
How to add solid-state storage to your enterprise data storage systems
Is cloud data storage right for your IT infrastructure?
Optimizing enterprise data storage capacity and performance to reduce your data footprint
Is data deduplication right for your primary storage infrastructure?
Fail-in-place systems: Avoiding hard disk drive failures
Data storage resources needed to implement a virtual desktop infrastructure
Storage encryption essentials
Addressing storage performance bottlenecks in enterprise data storage
Data archiving: Three key elements
Archiving data to cloud storage: How to choose the right cloud storage provider

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
Fast Guide to Storage Technologies  (WhatIs.com)
fat provisioning  (SearchStorage.com)
oversubscription  (SearchStorage.com)
RAID  (SearchStorage.com)
storage area management  (SearchStorage.com)
storage area network  (SearchStorage.com)
thin provisioning  (SearchStorage.com)
unified storage  (SearchStorage.com)
virtual provisioning  (SearchStorage.com)
zoned-bit recording  (SearchStorage.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



Find Data Backup Analysis
TechTarget Storage Media
Storage Magazine View this month\\'s issue and subscribe today.
Storage Decisions Apply online for free conference admission.
SearchStorage.com
HomeNewsMagazineTopicsLearningMultimediaWhite PapersBlogsEventsAbout Us

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2000 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts