By comparison, indexing will help searches and legal discovery issues, but there are limitations to indexing. Indexing will rely on a text string of some sort; it could be within the file, or in the case of audio or images, rely upon the file name. This implies a naming convention that needs to be consistent with your data and the way that you work with it. This is something
Requires Free Membership to View
When you register for SearchStorage.com, you’ll also receive targeted emails from my team of award-winning editorial writers. Our goal is to keep you informed on the hottest topics, the latest news and the biggest challenges you face as a storage professional today.
Rich Castagna, Editorial DirectorEmail searches are also hindered by the lack of policies around the "Subject" line. People have a tendency to simply respond to a previous message as a means of starting a totally different message thread -- you can have 15 different conversations all with the same title because it was just convenient to reply. This can really muddy up subject-based searches. Again, policies and a little discipline in working with your data can have an important effect on data management and searchability.
Metadata can help to identify and categorize unstructured data, so you should absolutely pay attention to the file's metadata structure and content. This also relates to content-addressed storage (CAS), which relies on metadata to really organize and understand what files you have. Metadata can identify dates, size, permissions, title and other details. But again, searches rely on text strings, so metadata must be complete and consistent for it to be of any use.
Listen to the Unstructured data FAQ audiocast.
Go to the beginning of the Unstructured Data FAQ Guide.
This was first published in March 2007
Storage Management Strategies for the CIO