Data Debt Calculator

Source Data (TB)

Enter the total amount of data you have in non-relational systems, including Hadoop, NoSQL, Amazon S3, and third party applications.

Source Systems

The data in the prior row is spread across multiple systems. Enter the number of systems rather than servers (eg, a 50 node Hadoop cluster would count as 1 system).

Data Analysts

How many data analysts use this data? One way to estimate is the number of users of Tableau, Power BI, Qlik, Cognos, BusinessObjects, and other BI or SQL tools.

Data Scientists

How many data scientists use this data? One way to estimate is the number of users of Python, R, SAS, SPSS, or SQL tools.

Other Factors


Liability costs. Big data involves tools and protocols that are less mature than traditional approaches. These systems pose a greater liability risk that must be considered in understanding your total debt. Liability costs include potential losses that can result from unsecured or ungoverned data moving through pipelines to make it compatible with the tools used by analysts and data scientists.

Opportunity costs. Moving application data into analytical environments can take significant time. Reducing this time can have very high costs. Opportunity costs include unrealized value as a result of prolonged time to insight as data moves through pipelines to reach the tools used by analysts and data scientists.

Learn about Dremio's beta program.