Cloudera DataFlow (CDF)

The answer to all your real-time streaming data problems.

Manage your data from edge to enterprise with a no-code approach to developing sophisticated streaming applications easily

The biggest challenge in getting streaming data insights is acquiring the data—quickly, securely, and prioritized for analysis with clear traceability.

Cloudera DataFlow (CDF), formerly Hortonworks DataFlow (HDF), is a scalable, real-time streaming analytics platform that ingests, curates, and analyzes data for key insights and immediate actionable intelligence.

DataFlow addresses the key challenges enterprises face with data-in-motion:

  • Processing real-time data streaming at high volume and high scale

  • Tracking data provenance and lineage of streaming data

  • Managing and monitoring edge applications and streaming sources

The Cloudera DataFlow Platform

Now available, as part of Cloudera's open-source data-in-motion platform for streaming analytics, are Cloudera Edge Management and Cloudera Flow Management.

Key benefits


Imagine a no-code approach to building complex data pipelines with minimal effort. CDF offers a simple visual UI for building sophisticated data flows to accomplish major data ingestions, transformations, and enrichment from a variety of streaming sources. Powered by Apache NiFi, CDF ingests data from devices, enterprise applications, partner systems, and edge applications generating real-time streaming data.


Real-time insights and actionable intelligence mean you can act sooner. Using the powerful streaming platform Apache Kafka, CDF can process several million transactions per second, identify key patterns, compare against machine learning models, and offer predictive or prescriptive analytics to help business leadership make key decisions and seize opportunities.


CDF enables high volume data collection at the edge, even from edge devices using Minifi. Now you can set up widely distributed IoT deployment models for regional data collection with ease using NiFi with Minifi to stream data from the edge. Tight integration with Apache Ranger gives CDF the unique advantage of seamless security across all your data-in-motion and data-at-rest.


CDF is the only product in the industry offering data provenance and edge-to-enterprise data governance out of the box. In the age of GDPR and other regulatory compliance, it’s important to track data lineage, even for streaming data. NiFi within CDF offers data provenance tracking without any extra configuration or setup. With tight integration of Apache Atlas, you have a complete governance of data from the edge to the enterprise.

Contact Us

Location :

L'AVENUE Office 20th floor, Unit 20A
Jl. Raya Pasar Minggu Kav 16
Jakarta Selatan 12780

(+6221) 8066 7065, (+6221) 8066 7064 (fax)

  • LinkedIn - Black Circle
  • Instagram
  • Facebook - Black Circle
  • Twitter - Black Circle
  • YouTube

© 2021 copyright by Duta Sarana Inovasi