Cloudera Data Science Workbench

Accelerate machine learning from research to production with a secure, self-service data science platform built for the enterprise.

A platform for collaborative data science at scale

For data scientists

  • Experiment faster. Use R, Python, or Scala with on-demand compute and secure access to Apache Spark™ and Apache Impala™

  • Work together. Share reproducible research with your whole team

  • Deploy with confidence. Get to production repeatably and without recoding

For IT professionals

  • Bring data science to your data. Give your team more freedom while reducing the risk and cost of silos

  • Secure by default. Leverage common security and governance across workloads

  • Flexible deployment. Run on-premises or in the cloud

Self-service data science

With Python, R, and Scala directly in the web browser, Cloudera Data Science Workbench (CDSW) delivers a self-service experience data scientists will love. Download and experiment with the latest libraries and frameworks in customizable project environments that work just like your laptop. Access any data, anywhere, from cloud object storage to data warehouses. Cloudera Data Science Workbench provides connectivity not only to CDH and HDP but also to the systems your data science teams rely on for analysis.

Automated data and analytics pipelines

Cloudera Data Science Workbench lets data scientists manage their own analytics pipelines, with built-in scheduling, monitoring, and email alerting. Quickly develop and prototype new machine learning projects and easily deploy them to production.

Quickly deploy models, with confidence

A single, unified workflow lets you build, train, and deploy your own machine learning models. Experiments track each training run, for easy reproducibility. Share models as REST APIs with a few clicks, without expensive rewrites or complex DevOps knowledge.
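To make "models as REST APIs" concrete, here is a minimal Python sketch of how a client might call a deployed model. Everything specific in it is an assumption for illustration, not something stated on this page: the endpoint URL, the access key, the `accessKey`/`request` payload shape, and the helper names `build_model_request` and `call_model` are all hypothetical placeholders. Check your own deployment for the actual endpoint and request format.

```python
import json
import urllib.request

# Hypothetical placeholders: substitute the endpoint and key of your own deployment.
MODEL_URL = "https://modelservice.cdsw.example.com/model"
ACCESS_KEY = "your-model-access-key"


def build_model_request(features, url=MODEL_URL, access_key=ACCESS_KEY):
    """Build (but do not send) a JSON POST request for a deployed model.

    The payload layout (an access key plus a "request" object holding the
    model inputs) is an assumed convention for this sketch.
    """
    payload = {"accessKey": access_key, "request": features}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def call_model(features, timeout=10):
    """Send the request and decode the JSON response body."""
    request = build_model_request(features)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Building the request separately from sending it keeps the payload easy to inspect and test without network access. A call such as `call_model({"petal_length": 1.4})` would then return whatever JSON the deployed model responds with.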