Data Analytics - Overview

Table of Contents

The Challenge of Managing "Big Data"

Companies today generate a lot of data files. These files are created by multiple sources and exist in various locations across a company's storage infrastructure. Most of these files contain unstructured data in the form of emails, documents, and digital media. Unlike structured data, which reside in organized databases, unstructured data files are notoriously difficult to manage. Often, the amount of unstructured data created within a company can put a strain on storage resources. In fact, this exponential growth of unstructured data common to many corporate environments has been given a name: "Big Data."

From a storage-resource perspective, managing Big Data means locating and removing files that are:

  • Duplicated
  • Obsolete
  • Nonessential

It takes an enormous amount of time and energy to search through each storage location for data that should be archived or deleted. But ignoring the growth of Big Data allows files to consume valuable and limited storage resources. And, from a compliance perspective, risk increases when confidential or other sensitive files are disseminated throughout an organization because a proper solution is not in place.

Data Analytics

Data Analytics allows you to view statistical information about the unstructured, Big Data in your environment, such as files and emails. With this information, you can quickly assess the current state of your Big Data, take actionable steps to retrieve valuable storage space, and mitigate the risk of compliance-related issues.

Some examples of how the information provided by Data Analytics can help you manage Big Data include:

  • Locating older files or emails that can be moved to less expensive archival storage
  • Identifying and deleting extra copies of large files and databases
  • Removing unauthorized file types (such as multimedia and personal photos)
  • Deleting outdated emails to reduce compliance risk

You must log in to the Web Console to access this report.

Analytics Job and Analytics Workflow

There are two options when running data analytics on the clients in your environment: running analytics as a job and running the Analytics Workflow.

Refer to the following chart to decide which option fits your needs:

If you need to analyze data... Then... Refer to...
In a single CommCell group on a regular basis
  • Run Analytics as a job.
  1. Configuring the Analytics Engine
  2. Running an Analytics Job
  • Across multiple version 10.0 or 9.0 CommCell groups
  • On computers without a Simpana Agent installed
  • To provide a one-time report of the data in your environment
  • Run the Analytics Workflow.
  1. Configuring the Analytics Engine
  2. Running the Analytics Workflow