Ask The Storage Expert: Questions & Answers

Why does my EMC 45 TB drive yield only 41 TB of usable storage?

QUESTION:
When is a 300 GB drive not a 300 GB drive? I had known about the importance of dealing with "usable" vs. "raw," but what I did not know was that every vendor uses base-10 to generate their quotes while the field engineers use base-2. I was particularly surprised to find that the Hitachi GUI tool "Resource Manager" also works on base-2, thus misleading the storage guys to think they have more gigabytes than they do! A recent 45 TB purchase from EMC actually yielded 41.3 TB of usable storage. Comments?


EXPERT RESPONSE FROM: Ashley D'Costa

ANSWERED September 2007:
Yeah, I can just imagine the frustration when you're trying to cram in those last few downloaded bytes from Nickelback's latest, only to discover that you've been short-changed those last few vestiges of your post-grunge melody. And not by a few bits of rounding error, but by several percent!

To be a bit more precise, you would get about 93% of what you were expecting if we're talking gigabytes, or roughly 7% less. Why 93%? That's the ratio between a gigabyte expressed in base-10 (1,000,000,000 bytes) and a gigabyte expressed in base-2 (1,073,741,824 bytes). But then that's your basic question: Which is it, base-10 or base-2?

What it boils down to is how a gigabyte (or megabyte or yottabyte or whatever-byte) is defined, not just by storage manufacturers and industry experts (ahem), but also by computer scientists (ahem, again), who ultimately started all of this. It's a complex and sordid story. Now that you're riveted, here goes.

Although the number of bits that make up a byte changed through the early history of computer science, today it is generally accepted that a byte is 8 bits (a convention that began with the influence of IBM's System/360 architecture, back in the day when everyone else was distracted by flower power).

However, as computing evolved and memory capacities grew to thousands and millions of bytes, the need arose to express these larger byte capacities more conveniently. As in all other fields of science, the system chosen was the metric (or SI) prefix (kilo, mega, giga, etc.). This logical and seemingly harmless decision was the start of all the problems, for memory capacity was (and still is to this day) built in powers of two, and the metric system represents numbers in powers of ten.

How could my fellow computer science predecessors not know this? In truth, they did. However, the difference between a kilobyte base-2 and kilobyte base-10 is only 24 bytes. No one at the time actually believed that in a few short years scientists would be using the word gigabyte. But Moore's Law had its way, and here we are with the discrepancy between base-2 and base-10 getting bigger and bigger.
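To put numbers on how the gap widens with each prefix, here is a minimal Python sketch. The prefix list and the function name `shortfall_percent` are illustrative choices, not anything from a vendor tool; the arithmetic is simply 1000^n compared with 1024^n.

```python
# Compare the decimal (SI) and binary readings of each prefix.
# PREFIXES and shortfall_percent are illustrative names only.
PREFIXES = ["kilo", "mega", "giga", "tera", "peta"]


def shortfall_percent(n: int) -> float:
    """Percent by which 1000**n bytes falls short of 1024**n bytes."""
    return (1 - 1000**n / 1024**n) * 100


if __name__ == "__main__":
    for n, name in enumerate(PREFIXES, start=1):
        gap_bytes = 1024**n - 1000**n
        print(f"{name}byte: {gap_bytes:,} bytes apart "
              f"({shortfall_percent(n):.1f}% shortfall)")
```

At the kilobyte level the gap is the 24 bytes mentioned above (about 2.3%); at the gigabyte level it is about 6.9%, and it keeps growing with every step up.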

So, history lesson aside, what is the definition of a gigabyte? As defined by the IEEE, it is 1 billion bytes (1,000,000,000), as per the correct usage of the metric (SI) prefix. But in practice, it depends. If you're talking computer memory or, more crucially, file space, then for reasons that will soon become apparent it's considered to be 2^30 bytes (1,073,741,824 bytes).

If you're talking storage (hard disks, flash drives, etc.), it's considered to be 10^9 bytes (1,000,000,000). So technically, storage manufacturers are using the term "gigabyte" in the correct way as defined by standards bodies, and a 300 GB hard drive is, in fact, 300 billion bytes exactly.

So why does it not appear that you have 300 billion bytes available when you try to save your latest tunes or grow that SAP database? The reason is that your computer's operating system calculates file consumption and available space in base-2, not base-10. So a kilobyte, megabyte or gigabyte as reported by most, if not all, operating systems is a power of 1024, not a power of 1000. Operating systems do this because computer CPU and memory architectures are built around base-2 math. Thus computers store information in base-2 sized segments, not base-10 sized segments.
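As a rough illustration of that reporting difference, the sketch below takes a capacity quoted in decimal bytes and formats it the way a base-2 tool would, dividing by 1024 at each step while keeping the familiar KB/MB/GB labels. The function name `as_binary_units` is hypothetical, and real operating systems and array management tools differ in rounding and labeling.

```python
def as_binary_units(decimal_bytes: int) -> str:
    """Format a byte count using 1024-based steps, as a base-2 tool would."""
    units = ["bytes", "KB", "MB", "GB", "TB", "PB"]
    value = float(decimal_bytes)
    for unit in units:
        # Stop once the value fits in this unit, or we run out of units.
        if value < 1024 or unit == units[-1]:
            return f"{value:.2f} {unit}"
        value /= 1024


if __name__ == "__main__":
    # A "300 GB" drive as quoted by the vendor: 300 * 10**9 bytes.
    print(as_binary_units(300 * 10**9))   # 279.40 GB
    # A "45 TB" purchase as quoted by the vendor: 45 * 10**12 bytes.
    print(as_binary_units(45 * 10**12))   # 40.93 TB
```

The 45 TB case lands in the neighborhood of the 41.3 TB reported in the question; the exact figure on a real array depends on configuration details the question does not give.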

In other words, computers save files onto their storage in 1024-byte chunks because they process the information in 1024-byte chunks, and operating systems report information the same way. If they reported kilobytes in 1000-byte chunks rather than the 1024-byte chunks they actually save, you'd end up with fractional answers that could be subject to rounding errors, and with even more confusion as the operating system tries to report values in a conveniently readable form.

In the end, it looks like the storage manufacturer here is not at fault, and you can abandon any thoughts of financial compensation for being ripped off. Your 300 GB hard drive does, in fact, hold 300 billion bytes, and when it gets full, you are truly consuming 300 billion bytes. It's just counted in bigger chunks than you originally thought. Sadly, it's your own computer that's taken that huge "byte" out of your 300 gigabytes. (Sorry, I couldn't resist.)



