What is unstructured data and how is it different from structured data in the enterprise?

What is unstructured data and how is it different from structured data in the enterprise?

What we're really doing is designating our data as structured or unstructured. Let's start with structured data, which is really data that is organized in a structure so that it is identifiable. The most universal form of structured data is a database like SQL or Access. For example, SQL (Structured Query Language) allows you to select specific pieces of information based on columns and rows in a field. You might look for all the rows containing a particular date or ZIP code or name -- this is structured data, and it is organized and searchable by data type within the actual content.

    Requires Free Membership to View

    When you register for SearchStorage.com, you’ll also receive targeted emails from my team of award-winning editorial writers. Our goal is to keep you informed on the hottest topics, the latest news and the biggest challenges you face as a storage professional today.

    Rich Castagna, Editorial Director

    By submitting your registration information to SearchStorage.com you agree to receive email communications from TechTarget and TechTarget partners. We encourage you to read our Privacy Policy which contains important disclosures about how we collect and use your registration and other information. If you reside outside of the United States, by submitting this registration information you consent to having your personal data transferred to and processed in the United States. Your use of SearchStorage.com is governed by our Terms of Use. You may contact us at webmaster@TechTarget.com.

By comparison, unstructured data has no identifiable structure. Unstructured data typically includes bitmap images/objects, text and other data types that are not part of a database. Most enterprise data today can actually be considered unstructured. An email is considered unstructured data. Even though the email messages themselves are organized in a database, such as Microsoft Exchange or Lotus Notes, the body of the message is really freeform text without any structure at all -- the data is considered raw. Documents are another example of unstructured data. Although a Word document has some formatting attached to it, the content of the document is completely free form.

The nature of some data types, such as spreadsheets, is still a matter of debate. The spreadsheet itself has some structure, but the data you put into each cell of a spreadsheet, like Excel, is not regulated by the application.

Listen to the Unstructured data FAQ audiocast.

Go to the beginning of the Unstructured Data FAQ Guide.

 


This was first published in March 2007