Security flaw could put EMC Centera users at risk

Article

Security flaw could put EMC Centera users at risk

Jo Maitland, News Director

At the Crypto 2004 conference in Santa Barbara, Calif. last week, several papers were presented that demonstrated vulnerabilities in a popular algorithm used to create digital signatures.

The flaw in the MD5 algorithm could put EMC Corp.'s Centera customers who use its single-instance storage function at risk of data loss, experts say. Consequently, the product might not meet government regulations for compliance.

"The results are preliminary, but users subject to SEC 17 a-4 should check this out carefully," said Peter Gerr, analyst with the Milford, Mass.-based Enterprise Strategy Group. Rule 17 a-4 states that data must be stored in a non-rewritable, non-erasable form. Gerr advised users to wait until the National Institute of Standards and Technology (NIST) officially confirms that all use of MD5 for single-instance storage systems are non-compliant, before taking any action. "Until then it tends to remain FUD," he said.

Single-instance storage features, like the one used by the Centera device, purport to store only one copy of any file with a unique content address, regardless of how many times duplicate objects are submitted for storage. Until now this seemed like a great idea from the point of view of storage savings.

However, this assumption that if two files have the same content address then they must have identical contents is incorrect

    Requires Free Membership to View

    When you register for SearchStorage.com, you’ll also receive targeted emails from my team of award-winning editorial writers. Our goal is to keep you informed on the hottest topics, the latest news and the biggest challenges you face as a storage professional today.

    Rich Castagna, Editorial Director

    By submitting your registration information to SearchStorage.com you agree to receive email communications from TechTarget and TechTarget partners. We encourage you to read our Privacy Policy which contains important disclosures about how we collect and use your registration and other information. If you reside outside of the United States, by submitting this registration information you consent to having your personal data transferred to and processed in the United States. Your use of SearchStorage.com is governed by our Terms of Use. You may contact us at webmaster@TechTarget.com.

when the single-instancing storage feature is enabled and the content address is computed using the MD5 cryptographic algorithm.

EMC responded that Centera uses two different types of naming schemes -- one based on MD5 and another based on MD5 plus, an EMC-developed algorithm, which also incorporates time and date stamps to the content address. Regular background checks run the algorithm across the object to make sure that it is the same as the original. Centera stores a second copy with the same content address for these verfication purposes. In addition, MD5 plus gives users the option to up the encryption to 128-bit or 256-bit and turn off the single-instance storage feature.

Roy Sanford, a vice president in the Centera division, likened the probability of someone creating two files at the same exact time, on the same exact entry node, with exactly the same content, to someone running the 100 meter dash in two seconds. "At some point they could, but is it something that people should lose sleep over today? No," he said.

Sanford also pointed out that the Centera is responsible for the integrity of the data stored on the system, which he said is different to the security of the information, which falls to network security and application security.

Competitive spin

"Vendors of products that rely on MD5 will say that there are more stars in the universe, than there are chances of a unique address being created twice, but that's only true if there isn't a weakness in the algorithm," said Will McGovern, chief architect of Network Appliance Inc.'s SnapLock product, which does not use MD5. "Cracking algorithms is like an arms race," he said, "someone will always get around existing measures." He noted that one way to address this vulnerability is to turn off the single-instancing storage feature.

NetApp's file system works like a regular C drive in that users cannot use the same file name twice to store a file. The company claims its SnapLock software prevents users from erasing files that already exist.

Permabit Inc, another provider of single-instance storage for archival purposes uses the SHA-256 algorithm instead of MD5. "This is the only acceptable and recommended algorithm for use in federal information processing," said a spokesman for the company.

To see how the Centera system works, users can check out cascommunity.org.

An MD5 archive corruption scenario

Join the conversationComment

Share
Comments

    Results

    Contribute to the conversation

    All fields are required. Comments will appear at the bottom of the article.