Home > Storage Technology Tips > SAN/NAS Update > Troubleshooting your SAN (more or less) painlessly
Storage Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

SAN/NAS UPDATE

Troubleshooting your SAN (more or less) painlessly


Rick Cook
02.09.2004
Rating: -2.75- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


SAN problems are generally divided into two areas: connectivity and performance. Connectivity problems happen when something in the SAN, from a port to the whole SAN, simply doesn't work. With a performance problem, everything still works, it just doesn't work very well. In the IBM SAN Survival Guide Redbook (available from the IBM storage site at http://www.storage.ibm.com), IBM offers a number of suggestions for solving both kinds of problems.

Unfortunately connectivity problems are the most common. That's unfortunate because they are the hardest to troubleshoot. When an entire storage array suddenly drops off the SAN you have a pretty clear and fairly shallow troubleshooting tree. When things just aren't working very well it can be a lot more complex to identify what's happening.

This is exacerbated by the information problem. Usually the problem with troubleshooting a SAN isn't that you don't have enough information, it's that you've got too much.

From HBAs to switches to storage arrays, most devices on a SAN keep event and alarm logs, often copious logs. When a SAN starts to go flaky all these logs tend to fill up rapidly as the problem causes errors in other components. One of the first problems is to sort out the information you need from what you don't.

The best strategy is to concentrate and correlate. That is concentrate on the recent errors that seem related to the problem and try to correlate the information in the error and performance logs to get an overall picture of what's happening. In doing so it's important to get data from all these sources at the same instant in time so you can meaningfully compare what's happening. Ideally you can trace each device's log from when things were performing normally to when they went wrong. Since the information tends to drop off the logs in the storm of alarms, it's important to get those logs quickly.

While you're checking the status of the SAN components, make sure you've got current information on the operating system version, maintain level, driver version and versions of the firmware. This information is invaluable when you're dealing with other support personnel. (On a related issue, you should take regular snapshots of SAN performance at the same time of day and under the same load conditions. Not only will this give you a baseline to work from, trends in this data can also help you detect problems before they become serious.)

Usually the best place to start looking is at the switch. Often you can divide the problem tree in half and quickly determine if the problem is on the server side or the storage side of the switch. In the case of multiple switches or director and switch topology, start with the central point and work outwards.

Check the switch's event and alarm logs and monitor the performance on the ports. You can compare the values with the previously stored snapshot to see how performance has changed. If you haven't been taking regular snapshots, you should take two snapshots a few minutes apart and look for changes in the parameters. Check the status of the switch and the integrity of the fabric and check the integrity of the zoning as well.

On the server side, check the event and alarm log and, if multi-path software is installed, check the virtual paths and adapters.

On the storage side check the event and alarm log and the status of the Fibre Channel ports.

Given this information, and some thought, you should get some idea if the problem is in the components (both hardware and software) or in the design of the SAN. Remember it is possible to run up against a design limitation quite suddenly if, for example, an application needs more SAN bandwidth than was allowed for.

For more information:

Tip: Troubleshoot SANs from the center out

Tip: Troubleshooting Fibre Channel HBAs in PCs

Tip: Fast guide to SAN management



About the author: Rick Cook has been writing about mass storage since the days when the term meant an 80K floppy disk. The computers he learned on used ferrite cores and magnetic drums. For the last twenty years he has been a freelance writer specializing in storage and other computer issues.


Rate this Tip
To rate tips, you must be a member of SearchStorage.com.
Register now to start rating these tips. Log in if you are already a member.




BROWSE BY TAG
SAN/NAS Technical Tips,   EMC,   Tips,   General,   Storage Area Network (SAN),   SAN management,   SAN (storage area network),   SAN/NAS Update,   SAN Management,   VIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   



RELATED CONTENT
SAN/NAS Technical Tips
Troubleshooting your NAS system
Storage 101: Using a director switch
Better performance -- RAID striping or concatenation?
Crash Course: Snapshots
Storage Smarts Quiz: RAID
Zoning 101: Why zone?
Tech Roundup: Intelligent switches
Tech Roundup: Disaster recovery tools
SATA II arrives -- sort of
Learning Guide: NAS

General
SAN School: Final quiz:
Configuring your first SAN: Usable versus raw storage
Configuring your first SAN: Creating your LUNs
Preparing for SNIA's certification exam
How to assign LUNs in a SAN
Servers and SANs, how they work
How WWNs are associated with LUNs
How do I know if I'm ready for a SAN?
Does each server require a separate OS?
Is SAN implementation more expensive than NAS?

SAN management
Unilever maintains 5 PB Fibre Channel SAN storage performance with Virtual Instruments' NetWisdom
Storage Decisions Chicago 2009 Session Downloads
Storage Decisions Session Downloads: Managing Storage Networks Track (Chicago 2009)
Storage-area networks to become increasingly object based
Data storage management in virtual server environments
10 Gb Ethernet bodes well for iSCSI
Mellanox builds bridge to consolidation
Best storage Products of the Year 2008
Wide stripe before you dive into SSD
How your SAN will evolve
SAN management Research

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
Fast Guide to Storage Technologies  (WhatIs.com)
fat provisioning  (SearchStorage.com)
oversubscription  (SearchStorage.com)
RAID  (SearchStorage.com)
storage area management  (SearchStorage.com)
storage area network  (SearchStorage.com)
thin provisioning  (SearchStorage.com)
unified storage  (SearchStorage.com)
virtual provisioning  (SearchStorage.com)
zoned-bit recording  (SearchStorage.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



Find Data Backup Analysis
TechTarget Storage Media
Storage Magazine View this month\\'s issue and subscribe today.
Storage Decisions Apply online for free conference admission.
SearchStorage.com
HomeNewsMagazineTopicsLearningMultimediaWhite PapersBlogsEventsAbout Us

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2000 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts