Top Stories

Understanding The Next Generation Of Analytics

Feb 22, 2017

Technological innovation compounds over time, with new advances building on the foundations laid by the inventions of the past. At the Spark East summit this past week, it was fascinating to see this dynamic play out in practice. Many of the higher level services in the spotlight at the conference are evolutionary next steps from lower level services created in...Go To Article

Processing trillion rows per second on a single machine: how can nested loop joins be this fast?

Feb 16, 2017

This blog post describes our experience debugging a failing test case caused by a cross join query running “too fast.” Because the root cause of fail test case spans across multiple layers—from Apache Spark to the JVM JIT compiler— we wanted to share our analysis in this post. Spark as a compiler The vast majority of big data SQL or...Go To Article

 

Working with Complex Data Formats with Structured Streaming in Apache Spark 2.1

Feb 23, 2017

How Apache Spark SQL’s built-in functions can be used to solve all data transformation challenges

Styles of Deep Learning: What You Need to Know — Upside

Feb 23, 2017

The market for deep learning solutions continues to expand.

Understanding The Next Generation Of Analytics

Feb 22, 2017

Combining lower level services into higher order solutions creates true innovative acceleration.

Google’s TensorFlow Reaches 1.0

Feb 22, 2017

TensorFlow - Google's machine learning software library - has reached 1.0 status

3D Data Visualisation using Datascape 2.0

Feb 18, 2017

How we use Datascape to get data into a 3D environment

28 Free Internet of Things Classes You Can Take Right Now

Feb 18, 2017

For engineers or for entrepreneurs. For software or for hardware. Any course you might need for IoT.

Splunking Kafka with Kafka Connect

Feb 16, 2017

How to use Kafka Connect along with a Splunk Heavy Forwarder to stream data

Processing a Trillion Rows Per Second on a Single Machine: How Can Nested Loop Joins be this Fast?

Feb 16, 2017

Debugging a failing test case caused by query running “too fast”

Apache Kafka: The Cornerstone of an Internet-of-Things Data Platform

Feb 15, 2017

If you are a developer considering IoT as a career option it is time for you to start investing in Apache Kafka

7 familiar myths regarding Big Data analytics

Feb 15, 2017

Let’s have a look on the common myths about Big Data

Crossing the Streams – Joins in Apache Kafka

Feb 15, 2017

Version 0.10.0 of the popular message broker Apache Kafka saw the introduction of Kafka Streams

Google releases TensorFlow 1.0 with new machine learning tools

Feb 15, 2017

Google announced the release of version 1.0 of its TensorFlow open source framework for deep learning

Why Open Data Science Matters

Feb 14, 2017

Open Data Science is eating the world. Why?

An HDFS Tutorial for Data Analysts Stuck With Relational Databases

Feb 13, 2017

What are the benefits that HDFS has over relational databases?

Getting the business of Big Data right

Feb 13, 2017

The pool of skilled people has grown and vendors have created ways for less skilled analysts to interrogate Big Data

When Businesses Go Around IT for Analytics — Upside

Feb 13, 2017

Going behind IT's back is one of the best-known tropes in IT and business management

Anonymizing Datasets at Scale Leveraging Databricks Interoperability

Feb 13, 2017

Data anonymization is often the first step performed when preparing data for analysis

Running Top-N Aggregation grouped by Dimension

Feb 12, 2017

How to implement a streaming analytics application using Kafka Streams

The Greatest Public Datasets for AI

Feb 11, 2017

Public data sets are ideal for testing AI

Building a streaming analytics Java application against a Kafka Topic

Feb 11, 2017

In this article I will show you my first steps with Kafka Streams

The AWS Deep Learning AMI, Now with Ubuntu

Feb 10, 2017

AWS Deep Learning AMI for Ubuntu is now available in the AWS Marketplace

Data Is the new currency

Feb 10, 2017

The focus on data requires a universe of services that can work together to solve critical problems of all types

Microsoft adds patent suit protections for cloud customers

Feb 09, 2017

The patent protection can provide an edge for Microsoft as the company competes with Amazon and Google in the cloud

Events

AWS re:Invent

Nov 28, 2022 Las Vegas, NV

Join us again this year in Las Vegas for our biggest, most comprehensive, and most vibrant event in cloud computing.

AI & Big Data Expo 2022

Oct 05, 2022 Santa Clara, CA

This technology event is for the ambitious enterprise technology professional, seeking to explore the latest innovations, implementations and strategies to drive businesses forward.

Current 2022: The Next Generation of Kafka Summit

Oct 04, 2022 Austin, TX

Join the first-ever data streaming industry event at Current 2022: The Next Generation of Kafka Summit. You’ll be able to immerse yourself in all things real-time data with peers.

O’Reilly Open Source Convention

May 16, 2016 Austin

OSCON covers FLOSS in its entirety. Not just one language, tool, or philosophy, but all the moving parts integrated and working together.

99U 2016

May 05, 2016 New York

The goal of the 99U Conference is to shift the focus from idea generation to idea execution. Providing road-tested insights

Consensus 2016: Making Blockchain Real

May 02, 2016 New York

Consensus 2016 will define what is “real” in blockchain technology and focus on how to mainstream real-world applications for consumers.