In this day and age, every business generates and harvests data. Google, Amazon, Facebook, Netflix, Nvidia, and Tesla are the kind of companies that are leaders of the data economy. They are harvesting data to its full extent and continue to find ways of growing their business via data. Worldwide data expected to hit 175 zettabytes by 2025, representing a 61 percent compound annual growth rate according to IDC
People often forget that a lot of this data is simply made up of copies of existing data. Why do copies of data exist? For multiple reasons, the most common reasons include:
- Backup and Compliance
- Disaster Recovery
- Reporting
- Analytics
- Test & Development
- DevOps &
- Training
To put the scope of how much data is copy data in context, IDC has found that 60 percent of storage is dedicated to managing copies of data--at $55 billion per year collectively.
The most easily observable problem with copy data is there is too much of it. Data reduction capabilities, such as compression and deduplication, are helping reduce the burden of copies, copy data still causes many organizations unwanted stress and costs - the core reasons for these being:
- Storage growth
- Security vulnerabilities
- Compliance risks and
- Shadow IT
So, what can companies do to manage and harvest data better to reduce the stress and costs associated with Copy Data?