How should I factor data deduplication technology into my virtual machines?
The interesting thing about dedupe technology for virtual machines (VMs) is that it can play a different role there than it does in other settings, such as backup.
Deduplication technology is commonly thought of as a method for improving overall capacity. In the case of backups, deduplication can be used to reduce backup media consumption. Similarly, performing deduplication prior to writing data to the cloud can reduce the total amount of data sent across the wire, thereby improving the effective capacity of your WAN link.
When it comes to VMs, however, deduplication can be just as much about performance as it is about capacity. Dedupe technology is important for VMs because of the commonalities that exist between virtual machines.
Suppose an organization has a host server running 10 VMs, all of them running Windows Server 2012. Because every VM runs the same operating system, much of the data is redundant across the VMs, and deduplication can reduce their storage footprint.
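To make the storage savings concrete, here is a minimal sketch of block-level deduplication across 10 VM images. It is a toy model rather than how any particular product works: the 4 KB block size, the SHA-256 fingerprints, the `dedupe` helper, and the made-up image contents (8 shared "OS" blocks plus 2 unique blocks per VM) are all assumptions chosen for illustration.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed block size in bytes


def split_blocks(data, size=BLOCK_SIZE):
    """Cut a byte string into fixed-size blocks."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def dedupe(images):
    """Return (logical_blocks, physical_blocks) for a list of disk images.

    Each distinct block is stored once, keyed by its SHA-256 fingerprint.
    """
    store = {}   # fingerprint -> block contents
    logical = 0  # blocks the VMs believe they own
    for image in images:
        for block in split_blocks(image):
            logical += 1
            store.setdefault(hashlib.sha256(block).digest(), block)
    return logical, len(store)


# Hypothetical data: 8 distinct blocks standing in for the common OS files.
os_blocks = b"".join(bytes([i]) * BLOCK_SIZE for i in range(8))

# Each of the 10 VMs adds 2 blocks that no other VM has.
images = [
    os_blocks
    + bytes([100 + 2 * vm]) * BLOCK_SIZE
    + bytes([101 + 2 * vm]) * BLOCK_SIZE
    for vm in range(10)
]

logical, physical = dedupe(images)
print(f"logical blocks: {logical}, physical blocks: {physical}")
```

In this toy case the 10 VMs reference 100 blocks in total, but only 28 need to be stored: the 8 shared OS blocks once, plus 20 unique blocks.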
At the same time, deduplication can also improve performance for those VMs. If the volume containing the VM files has been deduplicated, storage blocks will be shared among most, if not all, of the virtual machines. If the storage blocks are cached to memory, Windows can access them much more quickly than if the storage blocks all resided solely on disk.
Caching storage blocks to memory can happen even in non-deduplicated systems. On a deduplicated host server, however, block caching can yield a much greater overall performance gain than if the OS had to cache storage blocks individually for each VM, because each cached storage block can benefit multiple virtual machines.
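The caching effect can be illustrated with a small simulation. This is only a sketch under stated assumptions: a simple LRU cache holding 16 blocks, 10 VMs that each read the same 8 OS blocks once, and a hypothetical `count_hits` helper. Real hypervisor and storage caches are far more sophisticated.

```python
from collections import OrderedDict


def count_hits(reads, capacity):
    """Replay a sequence of block reads through an LRU cache; return the hit count."""
    cache = OrderedDict()
    hits = 0
    for key in reads:
        if key in cache:
            hits += 1
            cache.move_to_end(key)         # mark as most recently used
        else:
            cache[key] = True
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used block
    return hits


VMS, OS_BLOCKS, CAPACITY = 10, 8, 16  # assumed sizes for illustration

# Without deduplication, each VM's copy of an OS block is a separate
# block on disk, so it is a separate cache entry: key = (vm, block).
separate = [(vm, b) for vm in range(VMS) for b in range(OS_BLOCKS)]

# With deduplication, every VM reads the same underlying block: key = block.
shared = [b for vm in range(VMS) for b in range(OS_BLOCKS)]

separate_hits = count_hits(separate, CAPACITY)
shared_hits = count_hits(shared, CAPACITY)
print(f"hits without dedupe: {separate_hits}, with dedupe: {shared_hits}")
```

In this model the non-deduplicated reads never hit the cache, because no VM re-reads its own private copies. The deduplicated reads hit 72 times out of 80: only the first VM has to fetch the 8 shared blocks from disk.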
Related Q&A from Brien Posey