How Databricks Provisions Infrastructure Services at Scale with Kubernetes
Details of a use case carried out jointly by Shell Oil Company and Databricks.
This blog post describes how to use an IAM role to map to any set of credentials.
An easy recipe to secure your credentials
IBM has followed Intel and EMC/Pivotal in abandoning efforts to make a business of Hadoop distributions.
The Metro transports more than 200,000 passengers daily and is a critical part of the city’s infrastructure.
This is a bug fix release that addresses a regression with Decimal types in the Java implementation introduced in 0.4.0
This blog post is a preamble to the how-to, which is presented as a notebook tutorial.
A tutorial on how to perform ETL on data from Nest and IoT devices.
A selected collage of highlights from Databricks’ speakers at our 10th Spark Summit.
IT teams constantly struggle to find a way to allocate big data infrastructure.
From Machine Learning Practitioners to Business Analysts.
Engineers at Silicon Valley tech companies could analyze the entire Internet.
How to make it easy to build end-to-end streaming applications by exposing a single API to write streaming queries as you would write batch queries.
Hundreds of contributors working collectively have made Spark an amazing piece of technology.
We are also making the beta of Databricks Runtime 3.0, which includes the latest release candidate build of Apache Spark 2.2
Nested data types offer Databricks customers and Apache Spark users powerful ways to manipulate structured data
Deep Learning and Apache Spark is focused on issues that are common to deep learning frameworks when running on an Apache Spark cluster
Branch creates multiple branches from a single stream
This is the fourth post in a multi-part series about how you can perform complex streaming analytics using Apache Spark
Uno and Muchos are tools that ease the burden on developers of installing Accumulo and its dependencies
Cloudera Enterprise 5.11 is now generally available (GA)
There is a series of mini-conferences running in and around ApacheCon that you will not want to miss.
Join us again this year in Las Vegas for our biggest, most comprehensive, and most vibrant event in cloud computing.
This technology event is for ambitious enterprise technology professionals seeking to explore the latest innovations, implementations, and strategies to drive their businesses forward.
Join the first-ever data streaming industry event at Current 2022: The Next Generation of Kafka Summit. You’ll be able to immerse yourself in all things real-time data with peers.
OSCON covers FLOSS in its entirety. Not just one language, tool, or philosophy, but all the moving parts integrated and working together.
The goal of the 99U Conference is to shift the focus from idea generation to idea execution. Providing road-tested insights
Consensus 2016 will define what is “real” in blockchain technology and focus on how to mainstream real-world applications for consumers.