Home
Remote Data Replication Explained:
EMAIL THIS

Three approaches to remote replication

01 Sep 2006 | Stephen J. Bigelow, Features Writer

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   

It's possible to recover from hardware failures, data corruption and even malicious activity by copying data to another disk. But what about natural disasters or even terrorist attacks?

In 2005, Hurricane Katrina underscored the vulnerability of local data and the importance of sending data offsite. Remote data replication duplicates data between remote servers or storage platforms across a wide area network (WAN).

Remote replication has been around for years, but advances in technology have made it mainstream. "Today I can buy replication for just about any application that I may have in my environment, and I don't have to rob a bank to do that," says Arun Taneja, consulting analyst and founder of the Taneja Group in Hopkinton, Mass. This article will take a look at the elements of remote replication, its pitfalls and its impact on the storage organization.

Remote Replication Information
Tech Report: Protect remote-office data

Take full advantage of the remote replication process

Stars align for remote replication

Narrow down your replication or CDP options

Making remote replication work

There are three approaches to remote replication: host-based, array-based and fabric-based. A host-based architecture uses software running on a server or dedicated appliance to pass data across a WAN to a target system. One example is the KBX5000 Data Protection Platform from Kashya Inc. (now part of EMC Corp.), which connects directly into the storage area network (SAN). Host-based replication is usually the least expensive of the three approaches, but doesn't always offer the same performance.

Remote replication can also be accomplished between compatible storage arrays using application software accompanying the arrays themselves. This array-based approach used to be inflexible because the replication depended on identical arrays (e.g., Symmetrix to Symmetrix), but this is changing as replication software supports greater heterogeneity. For example, EMC's Clariion AX150 array ships with EMC SAN Copy software that can create remote point-in-time data copies between Clariion, Symmetrix, IBM, Sun and Hitachi Data Systems storage arrays.

Recently, remote data replication has started appearing in the network fabric, usually as software running on switches installed in the SAN. Topio Inc's Data Protection Suite is an example of fabric-based remote replication. Kashya and FalconStor Software Inc. also offer similar products. Replication in the fabric is particularly appealing because switches support a broad range of devices and there is no significant performance impact. "Obviously, intelligent switches can be brought to bear," Taneja says. "But many of these examples didn't even use intelligent switches yet."

The growth of remote replication is attributed to the convergence of lower disk costs, lower bandwidth costs and the emergence of bandwidth optimization techniques. Data deduplication is one element of bandwidth optimization, along with compression, delta differential and improved data flow methods. "It's also about optimizing the data flow -- creating larger packets so that when data does get transmitted, it's transmitted more efficiently, says Greg Schulz, founder and senior analyst at the StorageIO Group in Stillwater, Minn.

Synchronous and asynchronous remote data replication

Remote data replication can be accomplished synchronously or asynchronously. Synchronous data replication occurs in realtime where data is sent from the source disk all the way to a destination disk before the data transfer is acknowledged, e.g., the remote disk must "catch up" to the local disk. This ensures synchronization but latency in the acknowledgement limits synchronous distances to a nearby floor or building on campus. WAN interruptions can also wreak havoc with synchronous replication schemes.

With asynchronous data replication, data from one disk is passed only to a local server before an acknowledgement is received. The local disk will then pass data across the WAN to the remote disk as time and bandwidth allow. In many cases, the replicated disk content will lag behind the local data, by as much as several hours. However, asynchronous behavior works well over long distances (because latency isn't a factor) and inexpensive low-bandwidth WAN links. Asynchronous replication is also tolerant of WAN disruptions, maintaining a local copy of data until WAN service is restored.

Synchronous and asynchronous techniques can be employed together. For example, synchronous replication may serve a role in local backups, while asynchronous replication may duplicate the data to a distant disaster site. Still, remote replication should never be the sole means of data protection. "Both technologies can replicate viruses or other forms of corruption," says Heidi Biggar, analyst at the Enterprise Strategy Group in Milford, Mass. "Because of this, end users may want to couple replication with some type of snapshot -- continuous data protection (CDP) or near-CDP -- technology."

Replication management and tools

Tools for remote data replication are gaining features to work with other network management tools. The trend is clear: Simplify tools and automate management processes by improving integration with other backup and recovery products. Look for a gradual convergence of storage, network and application tasks. Analysts expect that users will eventually manage remote replication products through a single management interface that also embraces disk-to-disk (D2D), CDP and other technologies. "CommVault [Systems Inc.], Symantec [Corp.] and EMC have also been very vocal lately about their plans to provide integrated recovery focused platforms," Biggar says.

Roadblocks to remote replication

Although remote replication is more affordable and robust than ever before, serious deployment issues exist. Cost is the first concern. While the hardware and bandwidth are considerably less expensive than years past, organizations must still carry the cost of a second site along with the recurring costs of bandwidth. But with data volumes constantly spiraling upward, even with the benefit of bandwidth optimization there is always more data to move in less time. "Any gains that are made are zapped right away by consumption," Schulz says.

Many enterprise applications, such as databases and customer relationship management (CRM) software, frequently operate across multiple volumes. As a consequence, administrators must replicate each related volume to ensure a complete environment for the application. Not only does this demand more storage, taking more time and bandwidth for a complete replication, it also requires administrators to track the replication consistency across all of those volumes. Replication tools like Kashya's KBX5000 are designed to help ensure application data consistency.

Since so many products include remote replication software today, administrators must often juggle multiple replication platforms as more storage platforms are brought online -- a management nightmare for even the most skilled IT staff. Analysts note that fabric-based replication products can eliminate the growing glut of replication applets.

Multiple management applets can also have an adverse impact on remote replication performance, and simultaneous replication can choke the available WAN bandwidth. This invariably causes all of the replication tasks to fall behind and can be a serious problem for mission-critical replication tasks. Select remote replication products that can throttle their bandwidth utilization based on task priority. For example, the main corporate database may take priority and use most of the available bandwidth, while other data replication may only use a small part of the bandwidth, until the high-priority task is finished.

Finally, analysts highlight the importance of bandwidth optimization and encourage the selection of remote replication products that include strong features, like delta differential or data deduplication. For example, it is far more efficient to transfer only the changed bytes of a block, or the entire block, rather than to retransmit the entire file. "Not every replication product in the industry has that level of sophistication right now and the IT users need to be looking for that massive efficiency difference," Taneja says.

Go to the next page of this article for the impact and future direction of remote replication



BROWSE BY TAG
Data Backup,   Data storage backup tools,   Backup Products,   Related information,   WAN optimization and WAFS platforms,   Resource Library,   Storage explained,   Data Protection,   Disaster recovery and planning,   VIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   



RELATED CONTENT
Data storage backup tools
Hot storage technologies for 2010
Improved enterprise storage management tools are needed, especially for data backup
Enterprise data storage capacity management tools still found lacking
Storage configuration needs go beyond SRM tools
Remote site monitoring often missing from disaster recovery plan
Enterprise data storage performance monitoring goes beyond storage arrays
Backup in a snap: A guide to snapshot technologies
Growing need for email archiving
Addressing storage performance bottlenecks in enterprise data storage
Software vendors expand data deduplication offerings
Data storage backup tools Research

Storage explained
Data storage management in virtual server environments
Data storage and wide-area networks in 2009
Tiered storage tutorial
Buying storage capacity in 2009
The evolution of RAID data protection
Applying ITIL best practices to storage explained
Content-addressed storage (CAS) explained
Self-healing storage explained
NAS virtualization explained
How to create Tier 0 storage by leveraging solid-state drive technology

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
backup  (SearchStorage.com)
Backup and recovery: Do you speak geek?  (WhatIs.com)
backup robot  (SearchStorage.com)
backup storage  (SearchStorage.com)
cold backup  (SearchStorage.com)
continuous data protection  (SearchStorage.com)
replication  (SearchSQLServer.com)
storage snapshot  (SearchStorage.com)
virtual tape  (SearchStorage.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary




Find Data Reduction
TechTarget Storage Media
Storage Magazine View this month\\'s issue and subscribe today.
Storage Decisions Apply online for free conference admission.
SearchStorage.com
HomeNewsMagazineTopicsLearningMultimediaWhite PapersBlogsEventsAbout Us

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2000 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts