Gimmal Blog

What is Dark Data and How Can It Be Cleaned Up?

I recently came across a great blog post on Dark Data and the importance of cleaning it up. According to Gartner, Dark Data is the information assets organizations collect, process and store during regular business activities, but generally fail to use for other purposes. This well-written post by Rick Delgado (@ricknotdelgado) covers several reasons why cleaning up dark data matters. My post summarizes the key points, but I highly recommend reading the original post directly.

Dealing with Dark Data

To deal with dark data, Rick says you’ll first need to identify it.

Often, dark data will sit unused for years, taking up valuable space in a data center while your company continues to collect even more data. What can start off as a small problem can grow rapidly as unused information continues to pile up.
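As a rough illustration of that identification step, the sketch below flags items that have gone unused past an idle threshold. It is only a minimal example under stated assumptions: the `Asset` record, the `find_dark_candidates` function, the three-year default, and the sample inventory are all hypothetical and do not come from Rick's post or from any Gimmal product.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Asset:
    # Hypothetical record for one stored item (assumption, not from the post).
    name: str
    last_used: datetime


def find_dark_candidates(assets, now, max_idle_days=365 * 3):
    """Return the assets whose last use is older than max_idle_days before `now`."""
    # The three-year default is an arbitrary illustrative threshold.
    cutoff = now - timedelta(days=max_idle_days)
    return [asset for asset in assets if asset.last_used < cutoff]


# Made-up sample inventory, for illustration only.
now = datetime(2020, 1, 1)
inventory = [
    Asset("q3-sales.csv", datetime(2019, 11, 5)),
    Asset("old-web-logs.tar", datetime(2014, 2, 17)),
    Asset("survey-2015.xlsx", datetime(2015, 6, 30)),
]

candidates = find_dark_candidates(inventory, now)
print([asset.name for asset in candidates])
```

Items flagged this way are only candidates for review. Whether they can actually be destroyed still depends on the retention and compliance considerations discussed in the rest of this post.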

Why does Dark Data stay around?

He also answers why companies tend to keep this data around.

The truth is many organizations prefer to store all the information they collect to ensure they are in compliance with all laws and regulations. At the same time, businesses are reluctant to just toss out unused data because they never know if they might need it at some time in the future. Big data analytics can yield some promising solutions to problems, and to come to those solutions, organizations need the relevant data. As the usual mindset goes, just because you don’t need it now doesn’t mean it won’t prove valuable in the future.

Cleaning it up

What about cleaning up this dark data?

It’s true that a thorough cleanup of dark data can be time-consuming, but the results are well worth the effort. The main challenge is to get rid of dark data while still holding onto any necessary data. There are several ways you can do this at your organization. One of the most effective methods is filtering your data. When gathering data generated by machines and the internet, you’ll find a lot of valuable information along with data that is largely useless. By identifying and isolating the data you need, you can keep it separated from all the other noise. This helps prevent unneeded data from piling up in the first place.

This is where Gimmal can help. Our technology cleans up dark data for you by classifying both unstructured and structured information and determining what should be kept and what should be properly destroyed.

By Chris Caplinger
