Declarative Infrastructure with the Jsonnet Templating Language

06/26/17

How Databricks Provisions Infrastructure Services at Scale with Kubernetes

Parallelizing Large Simulations with Apache SparkR on Databricks

06/23/17

Details of a use case carried out jointly by Shell Oil Company and Databricks.

Managing and Securing Credentials in Databricks for Apache Spark Jobs

06/20/17

This blog post describes how to use an IAM role to map to any set of credentials.

Managing and Securing Credentials in Databricks for Apache Spark Jobs

06/20/17

An easy recipe to secure your credentials

IBM Ends Hadoop Distribution, Hortonworks Expands Hybrid Open Source

06/21/17

IBM has followed Intel and EMC/Pivotal in abandoning efforts to make a business of Hadoop distributions.

Analysing Metro Operations Using Apache Spark on Databricks

06/14/17

The Metro transports more than 200,000 passengers daily and is a critical part of the city’s infrastructure.

Apache Arrow 0.4.1 Release

06/14/17

This is a bug-fix release that addresses a regression with Decimal types in the Java implementation introduced in 0.4.0.

Five Spark SQL Utility Functions to Extract and Explore Complex Data Types

06/13/17

This blog post is a preamble to the how-to notebook tutorial.

Five Spark SQL Utility Functions to Extract and Explore Complex Data Types

06/12/17

Tutorial on how to do ETL on data from Nest and IoT Devices.

10th Spark Summit Sets Another Record of Attendance

06/09/17

A selected collage of highlights from Databricks’ speakers at our 10th Spark Summit.

Databricks Serverless: Next Generation Resource Management for Apache Spark

06/07/17

IT teams constantly struggle to find a way to allocate big data infrastructure.

A Vision for Making Deep Learning Simple

06/06/17

From Machine Learning Practitioners to Business Analysts.

A Vision for Making Deep Learning Simple

06/06/17

Engineers at Silicon Valley tech companies could analyze the entire Internet.

Making Apache Spark the Fastest Open Source Streaming Engine

06/06/17

How to make it easy to build end-to-end streaming applications by exposing a single API to write streaming queries as you would write batch queries.

Sharing Knowledge with the Community in a Preview of Apache Spark: The Definitive Guide

06/05/17

Hundreds of contributors working collectively have made Spark an amazing piece of technology.

Databricks Runtime 3.0 Beta Delivers Cloud Optimized Apache Spark

05/24/17

We are also making the beta of Databricks Runtime 3.0, which includes the latest release candidate build of Apache Spark 2.2

Working with Nested Data Using Higher Order Functions in SQL on Databricks

05/24/17

Nested data types offer Databricks customers and Apache Spark users powerful ways to manipulate structured data

On-Demand Webinar and FAQ: Deep Learning and Apache Spark: Workflows and Best Practices

05/23/17

Deep Learning and Apache Spark is focused on issues that are common to deep learning frameworks when running on an Apache Spark cluster

Kafka Streams – Part 2

05/07/17

Branch creates multiple branches from a single stream

Event-time Aggregation and Watermarking in Apache Spark’s Structured Streaming

05/08/17

This is the fourth post in a multi-part series about how you can perform complex streaming analytics using Apache Spark

Introducing Uno and Muchos

04/21/17

Uno and Muchos are tools that ease the burden on developers of installing Accumulo and its dependencies

Cloudera Enterprise 5.11 is Now Available

04/18/17

Cloudera Enterprise 5.11 is now generally available (GA)

Going to ApacheCon? Check out TomcatCon, a Mini-Conference Featuring Apache Tomcat

04/14/17

There is a series of mini-conferences running in and around ApacheCon that you will not want to miss.

Events

AWS re:Invent

Nov 28, 2022 Las Vegas, NV

Join us again this year in Las Vegas for our biggest, most comprehensive, and most vibrant event in cloud computing.

AI & Big Data Expo 2022

Oct 05, 2022 Santa Clara, CA

This technology event is for ambitious enterprise technology professionals seeking to explore the latest innovations, implementations, and strategies to drive businesses forward.

Current 2022: The Next Generation of Kafka Summit

Oct 04, 2022 Austin, TX

Join the first-ever data streaming industry event at Current 2022: The Next Generation of Kafka Summit. You’ll be able to immerse yourself in all things real-time data with peers.

O’Reilly Open Source Convention

May 16, 2016 Austin

OSCON covers FLOSS in its entirety. Not just one language, tool, or philosophy, but all the moving parts integrated and working together.

99U 2016

May 05, 2016 New York

The goal of the 99U Conference is to shift the focus from idea generation to idea execution. Providing road-tested insights

Consensus 2016: Making Blockchain Real

May 02, 2016 New York

Consensus 2016 will define what is “real” in blockchain technology and focus on how to mainstream real-world applications for consumers.