Home > Storage Technology Tips > Data storage management > Checklist: Ten steps to troubleshooting SAN/NAS performance problems
Storage Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

DATA STORAGE MANAGEMENT

Checklist: Ten steps to troubleshooting SAN/NAS performance problems


Greg Schulz
12.21.2004
Rating: --- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


What you will learn from this tip: How to isolate the cause of performance problems in your storage system, fix what's broken and learn from your mistakes. Need a downloadable copy? Go here.


Both SAN and NAS architectures experience performance problems for a variety of reasons, including increased workloads or the addition of new applications or tools. Some issues are specific to a given storage environment, but many can be isolated using this step-by-step approach:
 Ten steps to troubleshooting SAN/NAS performance problems
1. Do you actually have a SAN/NAS related performance problem?
In order to understand whether or not you really have a performance issue, you have to identify the precise nature of the problem. Are you able to access data and applications -- just very slowly? Or are you unable to access any data or applications, and receive an error message instead?
2. What is the normal expected behavior of the SAN/NAS environment?
While not ideal, it may be normal for performance to slow down at certain times of the day, similar to how performance on your home cable or DSL modem slows down in the late afternoon shortly after school lets out for the day. While known slow-downs may be accepted, ultimately you will want to know where and why these occur and have a plan to address them. Having a baseline performance summary helps to know what is normal and what is not.
3. Can the performance problem be reproduced?
Is it a transitory performance issue, or is it consistent and capable of being reproduced? Can you access data and performance work, albeit at a slower pace, or has everything come to a screeching halt? Is this a first-time occurrence, or have the symptoms been seen before? Is this a seasonal performance problem, for example, handling more transactions during the holiday season, that can be addressed by spending some money for more equipment, or is something that can be dealt with as an occasional nuisance?
4. Is everything functioning as it should, or has something failed?
Has any hardware failed or exhibited signs that it might be about to fail? What type of error log and event log activity has taken place? Is the performance problem isolated to specific users, applications, servers, files and data, or storage resources? Have any disk drives failed, triggering automatic hot spare disk rebuilds, or has a controller or adapter failed over? Some tools for monitoring and collecting performance data include: iostat, NetStat, nfstat, PerfMan, NTSMF from Demand Technologies, ITR client, and Intel Iometer, among other standard and vendor provided products.
5. What has changed in the SAN/NAS environment since the problem started?
Have any changes been made to storage subsystems (expansion, reconfiguration, and other changes), NAS appliances or gateways, network and storage interfaces, servers, volume managers, applications or databases? Do you have a change control process to help determine what will be changed? Do you have fall-back procedures in case something does not work correctly? Have any new security polices or access controls been applied? Have file system eminence or virus detection scans been initiated prior to the performance problem being reported? Is any maintenance on data or hardware/software components being performed?
6. What other applications and workload are running?
Have any new applications been added or changed? Has new workload been added? Have any applications changed that subsequently require more storage and I/O resources? Are any applications misbehaving, for example, a database query taking out and excessively holding locks on resources? Are any virus, spyware, security auditing, disk defragmentation, backup, data classification tools or performance monitoring tools running and performing I/O to storage devices where performance is being impacted?
7. What does a quick scan of your SAN/NAS environment show?
Do you have health statuses monitors that you can look at to determine the general health and well-being of your environment? What is the status of memory and CPU resources on servers? What are the busiest processes and what resources are they consuming? What are the busiest storage volumes and which adapter and I/O paths do they use? What is the status and performance of interfaces, including Ethernet for IP and Fibre Channel for open systems and FICON mainframe attachment? What is the performance of the storage subsystem including cache hits, cache utilization, cache effectiveness, and device activity?
8. Is it a local or remote performance problem?
Can you determine that there are problems with your local LAN or SAN segments, by using a ping to check network connectivity, or by performing an I/O command to a storage device? Determining if the performance problem is local or remote can be done by verifying performance to local storage and then comparing that to remote. Things to look at for remote performance would be the network interface using ping, NetStat or nfstat to look at link errors, response time, timeouts, re-transmits and packet loss. What is the status of inter-switch links (ISLs), routers, bridges and gateways? Are they functioning normally?
9. Do you need outside help to determine and correct the problem?
Do you need to enlist the support of your vendors (hardware, software, networks) to provide diagnostic and test tools or hands-on assistance? Your vendors may have knowledge bases with information on troubleshooting performance and other problems that you can use as a source of information and education.
10. Have you learned from the incident?
Have you documented the findings, resolution and symptoms to help others troubleshoot the same problem in the future?

For more information:

Tip: How to isolate and solve performance bottlenecks

Tip: How to troubleshoot SANs using WWN zoning

Tip: Top 10 tips on storage management

About the author: Greg Schulz is a senior analyst with the independent storage analysis firm, The Evaluator Group Inc. Greg has 25 years of IT experience as a consultant, end user, storage and storage networking vendor, and industry analyst. Greg has worked with Unix, Windows, IBM Mainframe, OpenVMS and other hardware/software environments. In addition to being an analyst, Greg is also the author and illustrator of Resilient Storage Networks", Greg has contributed material to Storage Magazine. Greg holds both a computer science and software engineering degree from the University of St. Thomas.

Rate this Tip
To rate tips, you must be a member of SearchStorage.com.
Register now to start rating these tips. Log in if you are already a member.




Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


RELATED CONTENT
SAN management
Does SAP eat more than its share of storage?
Solid-state drives quiz
Windows Server 2008 steps on some storage toes
Storage Decisions Chicago 2008 Session Downloads
Storage Decisions Session Downloads: Storage Systems & Networking Track (Chicago 2008)
Determining the number of drives in a RAID group
Learn all about SANs
How to determine the protocol for your SAN
Automate storage management
HBAs explained
SAN management Research

Data storage management
Server virtualization may have big disaster recovery payoff
SAN vs. NAS: What's the difference?
How to make your storage greener
RAID 6 vs. RAID 10
Top 5 storage management tips of 2007
How to mitigate the performance penalties of data encryption software
Tutorial: Creating a tiered SAN architecture
Avoiding storage-related bottlenecks in virtualized environments
Ten reasons storage security is critical
How to reduce risk with storage security policies

NAS management
Johns Hopkins selects Caringo CAS software for data archiving
NetApp fleshes out plan for OnTap GX clustering convergence
Storage Decisions Chicago 2008 Session Downloads
What you need to know before implementing NAS clusters
Learn all about NAS
How to create an RFP for NAS
Planning for NAS
Understanding NAS interconnects: Fast Ethernet and Gigabit Ethernet
Why NAS protocols are the language of NAS
Dispelling myths about clustering NAS and file servers
NAS management Research

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
Fast Guide to Storage Technologies  (WhatIs.com)
fat provisioning  (SearchStorage.com)
oversubscription  (SearchStorage.com)
SAN file system  (SearchStorage.com)
storage area management  (SearchStorage.com)
storage area network  (SearchStorage.com)
storage consolidation  (SearchStorage.com)
thin provisioning  (SearchStorage.com)
unified storage  (SearchStorage.com)
zoned-bit recording  (SearchStorage.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.

TechTarget Storage Media
Storage Magazine View this month\\'s issue and subscribe today.
Storage Decisions Apply online for free conference admission.
SearchStorage.com
HomeNewsMagazineTopicsLearningWebcastsWhite PapersBlogsEventsAbout Us

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




All Rights Reserved, Copyright 2000 - 2008, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts