Home > Storage Technology Tips > Backup and disaster recovery > Dealing with backup server instability
Storage Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

BACKUP AND DISASTER RECOVERY

Dealing with backup server instability


Rick Cook
10.10.2006
Rating: --- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


What you will learn from this tip: Rick Cook outlines common causes of backup failure and suggests tips for avoiding backup server instability.

"Instability" is a polite term for the stuff that drives storage managers nuts. When a backup server becomes erratic or undependable, it doesn't just fail… it remains failed. That makes it hard to track down the cause. The problem is particularly acute with a backup server, because problems with the server can be caused by anything the server touches -- from network to the file systems being backed up. That considerably complicates diagnosis.

Backup information
Users cautious in deploying CDP

Five musts for NetBackup restores

Backup your LAN without the performance impact
While the details of backup server instability are usually very (some would say 'maddeningly') configuration-specific, there are some general principles you can apply to help.

Probably the most common cause of backup server instability is changes to the system parameters to increase performance or for other reasons. There are a lot of things you can do to a server to enhance performance. However, at some point those tweaks and modifications are likely to make the system unstable in some way. Since the instabilities don't always show up in the application or server that is being tuned, it's important to keep a careful record of changes. That record should be easily accessible to all the administrators.

Another common cause of server instability is changes made in the system, such as upgrades or additions. Microsoft's Service Pack 2 (SP2) for Windows XP caused a lot of remote backup servers to fail because it turned on the XP firewall by default and the backup software couldn't get through the firewall. These are usually easy to identify and can generally be alleviated by rolling the system back to an earlier, stable state, while a more permanent fix is found. In the case of upgrades or new hardware or software, your primary resource is the vendor or manufacturer.

The third main cause for backup server instability is that something changed by itself. This can be the result of a natural process, accumulating hardware failure or changes in the use pattern of the system. For example, in Microsoft Small Business Server, two common causes of backup failure are:

  • An SBS log file that has grown to more than 64 MB
  • Timed-out backups because of excessive read/write activity to the device
  • In dealing with any backup server failure, your log files are your friends. You should make sure you're logging the relevant backup events and give those logs at least a cursory once-over every day. When your backup server starts having problems, you need to go over them with a fine-tooth comb, and possibly consider logging additional events to help pin down what's happening.

    Pay particular attention to error messages generated by the backup, even if the backup was completed successfully. Modern backup systems are amazingly resilient and often will continue to work -- for a while -- even if something in the backup sequence is reporting an error.

    In checking your logs, pay special attention to unexplained timeouts. An unexplained timeout almost always means something is running inefficiently -- at the very least -- and can mean you've got much bigger problems elsewhere in the system. This is especially true in architectures where backup processes run in the background with lower priorities than regular jobs. Since the backup gets a smaller slice of system resources, it is likely to be the first to be starved into a timeout when something starts to go wrong.

    (Of course, just to complicate matters, there are some errors that truly don't matter. Check with your vendor to see what they can tell you about these error messages.)

    A fruitful question to ask yourself when you have a flaky backup server: What else is using those same resources when the server acts up? Obviously this is a pretty wide-ranging question because a backup server touches so much of the rest of the IT infrastructure, interacting with hardware, software, storage devices and just about everything else.

    Often the first priority is to get the backup system stable again as quickly as possible while the real problem is tracked down. Frequently, the server can be stabilized by reducing the load on the backup system by throttling back I/O or devoting more resources to the backup.

    Do you know…

    How to optimize your backups?

    About the author: Rick Cook has been writing about mass storage since the days when the term meant an 80 K floppy disk. The computers he learned on used ferrite cores and magnetic drums. For the last 20 years, he has been a freelance writer specializing in storage and other computer issues.


    Rate this Tip
    To rate tips, you must be a member of SearchStorage.com.
    Register now to start rating these tips. Log in if you are already a member.




    Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


    RELATED CONTENT
    Backup and disaster recovery
    Disaster recovery planning in a virtualized environment
    Leveraging storage replication for VM disaster recovery
    Four disaster recovery strategies to consider when using data deduplication
    Comparing different backup strategies
    Troubleshooting automated tape libraries
    How to choose a Web-based email archiving vendor
    How to choose an e-discovery tool
    How to conduct a disaster recovery test
    Outsourcing backup: Get the right service level agreement
    Using WAN clustering for disaster recovery

    RELATED RESOURCES
    2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
    Search Bitpipe.com for the latest white papers and business webcasts
    Whatis.com, the online computer dictionary

    DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



    Find Data Backup Analysis and Server Storage Channel Solutions
    TechTarget Storage Media
    Storage Magazine View this month\\'s issue and subscribe today.
    Storage Decisions Apply online for free conference admission.
    SearchStorage.com
    HomeNewsMagazineTopicsLearningMultimediaWhite PapersBlogsEventsAbout Us

    About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
    TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

    TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




    All Rights Reserved, Copyright 2000 - 2008, TechTarget | Read our Privacy Policy
      TechTarget - The IT Media ROI Experts