Top Stories

Intel’s BigDL on Databricks

Feb 09, 2017

Intel recently released its BigDL project for distributed deep learning on Apache Spark. BigDL has native Spark integration, allowing it to leverage Spark during model training, prediction, and tuning. This blog post gives highlights of BigDL and a tutorial showing how to get started with BigDL on Databricks. Intel’s BigDL project: BigDL is an open source deep learning library from...

Build a super fast deep learning machine for under $1,000

Feb 01, 2017

For more practical techniques for getting started with deep learning, check out the deep learning sessions at Strata + Hadoop World San Jose, March 13-16, 2017. Yes, you can run TensorFlow on a $39 Raspberry Pi, and yes, you can run TensorFlow on a GPU-powered EC2 node for about $1 per hour. And yes, those...

Big Results From Big Data Still Elude Most Organizations

Jan 23, 2017

Investments in big data are getting bigger, but big results are still eluding most organizations, according to a new survey of big data professionals. Conducted by business analytics firm SAS, the survey reports that 83 percent of respondents describe their company’s big data investment as “moderate” or “significant.” However, many of these projects are still in the early stages: Just...

The Amazing Ways Big Data and Predictive Analytics Can Reduce Traffic Casualties

Jan 23, 2017

Lots of people are scared of flying, but it’s rare that you find someone who is scared of getting into a car. This is strange because statistically it’s one of the most dangerous things you can do – in the US alone, 35,000 people were killed in traffic accidents last year. While this figure has been steadily declining from its...

Intel’s BigDL on Databricks

Feb 09, 2017

BigDL is an open source deep learning library from Intel

Spark Summit East 2017: Another Record-Setting Spark Summit

Feb 09, 2017

We’ve put together a short recap of the keynotes and highlights from Databricks’ speakers for Apache Spark enthusiasts who could not attend the summit

Monitoring Kafka Streams Metrics via JMX

Feb 08, 2017

I'll present a way to access the metrics using the command-line application jmxterm

How Predictive Maintenance Saves Millions Of Dollars

Feb 07, 2017

When it comes to big data and Internet of Things (IoT) initiatives, most companies are still in the design or early adoption phases, which makes it hard to get solid return on investment (ROI) figures

Hadoop Fundamentals and Key Technologies in the Evolving Hadoop Ecosystem

Feb 03, 2017

Reliance on open standards such as NFS and POSIX is the best way to leverage data integration into a big data platform

Use a Hadoop-based Data Lake to Empower New Best Practices for Business Analytics

Feb 02, 2017

The end goal for savvy managers is to gain business value and drive organizational effectiveness from the data

Three Paths to Value in Data Science

Feb 02, 2017

The open source model is a collaborative development model where code is freely available

How to Solve IoT’s Big Data Challenge with Machine Learning

Feb 02, 2017

Machine learning may also help us with a challenge from one of last year’s most buzzed-about technology developments: the Internet of Things.

Build a super fast deep learning machine for under $1,000

Feb 01, 2017

The age of super-machines has arrived!

Organizations counter rising malware variants with more vigilance

Feb 01, 2017

Hackers are constantly creating new forms of malware

Kafka – Rewind Consumer Offsets

Jan 31, 2017

One of the most important features of Apache Kafka is how it manages multiple consumers.

Announcing the Spark Live 2017 World Tour

Jan 31, 2017

We will be hitting the road again in 2017 to continue our mission of bringing Apache Spark and Databricks to the masses

Once More into the Data Lake

Jan 31, 2017

Detailed and usable architectural definitions of a data lake are somewhat rare on the Web.

Integrating Your Central Hive Metastore with Apache Spark on Databricks

Jan 30, 2017

The Databricks platform provides a fully managed Hive Metastore that allows users to share a data catalog across multiple Spark clusters

Keeping Data Scientists Happy: The Rise of the Cloud Data Lab

Jan 27, 2017

Within the datasets available to organizations lie answers to some of the most pertinent questions and ways to drive and validate important decisions.

How MTV And Nickelodeon Use Real-Time Big Data Analytics To Improve Customer Experience

Jan 26, 2017

Monitoring of the digital networks which are used to pump their content into millions of homes gives them access to a huge amount of data

Are Hadoop’s Best Days Behind It — Or Still Ahead?

Jan 26, 2017

Data Warehouse Modernization shows that 17 percent of data warehouse programs surveyed already have Hadoop in production in their data warehouse environment.

Delivering Exceptional Care Through Data-Driven Medicine

Jan 25, 2017

An emerging theme among providers that are reporting early wins is the central role of big data technologies

Big Results From Big Data Still Elude Most Organizations

Jan 23, 2017

Respondents generally believe that big data will provide their company with a competitive advantage.

Big Results From Big Data Still Elude Most Organizations

Jan 23, 2017

Just building a cluster is never enough - you need a business case that is well defined

See this simple introduction to Natural Language Processing (NLP)

Jan 21, 2017

Organizations are turning to natural language processing (NLP) technology to derive understanding from the myriad unstructured data available online and in call logs

Analytics Strategies for the Internet of Things – Getting the Most Out of IoT Data

Jan 20, 2017

The Internet of Things (IoT) enables us to measure processes and react more quickly to ever-evolving conditions

Events

AWS re:Invent

Nov 28, 2022 Las Vegas, NV

Join us again this year in Las Vegas for our biggest, most comprehensive, and most vibrant event in cloud computing.

AI & Big Data Expo 2022

Oct 05, 2022 Santa Clara, CA

This technology event is for the ambitious enterprise technology professional, seeking to explore the latest innovations, implementations and strategies to drive businesses forward.

Current 2022: The Next Generation of Kafka Summit

Oct 04, 2022 Austin, TX

Join the first-ever data streaming industry event at Current 2022: The Next Generation of Kafka Summit. You’ll be able to immerse yourself in all things real-time data with peers.

O’Reilly Open Source Convention

May 16, 2016 Austin

OSCON covers FLOSS in its entirety. Not just one language, tool, or philosophy, but all the moving parts integrated and working together.

99U 2016

May 05, 2016 New York

The goal of the 99U Conference is to shift the focus from idea generation to idea execution. Providing road-tested insights

Consensus 2016: Making Blockchain Real

May 02, 2016 New York

Consensus 2016 will define what is “real” in blockchain technology and focus on how to mainstream real-world applications for consumers.