Intel recently released its BigDL project for distributed deep learning on Apache Spark. BigDL has native Spark integration, allowing it to leverage Spark during model training, prediction, and tuning. This blog post gives highlights of BigDL and a tutorial showing how to get started with BigDL on Databricks. Intel’s BigDL project BigDL is an open source deep learning library from...Go To Article
Brain. (source: Pixabay ). For more practical techniques for getting started with deep learning, check out the deep learning sessions at Strata + Hadoop World San Jose, March 13-16, 2017. Yes, you can run TensorFlow on a $39 Raspberry Pi, and yes, you can run TensorFlow on a GPU powered EC2 node for about $1 per hour. And yes, those...Go To Article
Investments in big data are getting bigger, but big results are still eluding most organizations, according to a new survey of big data professionals. Conducted by business analytics firm SAS, the survey reports that 83 percent of respondents call their company’s big data investment as “moderate” or “significant.” However, many of these projects are still in the early stages: Just...Go To Article
Lots of people are scared of flying, but it’s rare that you find someone who is scared of getting into a car. This is strange because statistically it’s one of the most dangerous things you can do – in the US alone, 35,000 people were killed in traffic accidents last year. While this figure has been steadily declining from its...Go To Article
BigDL is an open source deep learning library from Intel
We’ve put together a short recap of the keynotes and highlights from Databricks’ speakers for Apache Spark enthusiasts who could not attend the summit
I'll present a way to access the metrics using the command-line application jmxterm
When it comes to big data and Internet of Things (IoT) initiatives most companies are still in the design or early adoption phases which make it hard to get a solid return on investment (ROI) figures
Reliance on open standards such as NFS and POSIX is the best way to leverage data integration into a big data platform
The end goal for savvy managers is to gain business value and drive organizational effectiveness from the data
The open source model is a collaborative development model where code is freely available
Machine learning may also help us with a challenge from one of last year’s most buzzed about technology developments: the Internet of Things.
The age of super-machines has arrived!
Hackers are constantly creating new forms of malware
One of the most important features from Apache Kafka is how it manages Multiple Consumers.
we will be hitting the road again in 2017 to continue our mission of bringing Apache Spark and Databricks to the masses
Detailed and usable architectural definitions of a data lake are somewhat rare on the Web.
The Databricks platform provides a fully managed Hive Metastore that allows users to share a data catalog across multiple Spark clusters
Within the datasets available to organizations lie answers to some of the most pertinent questions and ways to drive and validate important decisions.
Monitoring of the digital networks which are used to pump their content into millions of homes gives them access to a huge amount of data
Data Warehouse Modernization shows that 17 percent of data warehouse programs surveyed already have Hadoop in production in their data warehouse environment.
An emerging theme among providers that are reporting early wins is the central role of big data technologies
respondents generally believe that big data will provide their company with a competitive advantage.
Just building a cluster is never enough - you need a business case that is well defined
Eschewing ETL seems to be a real theme these days
Organizations are turning to natural language processing (NLP) technology to derive understanding from the myriad of these unstructured data available online and in call-logs
The Internet of Things (IoT) enables us to measure processes and react more quickly to ever-evolving conditions
Join us again this year in Las Vegas for our biggest, most comprehensive, and most vibrant event in cloud computing.
This technology event is for the ambitious enterprise technology professional, seeking to explore the latest innovations, implementations and strategies to drive businesses forward.
Join the first-ever data streaming industry event at Current 2022: The Next Generation of Kafka Summit. You’ll be able to immerse yourself in all things real-time data with peers.
OSCON covers FLOSS in its entirety. Not just one language, tool, or philosophy, but all the moving parts integrated and working together.
The goal of the 99U Conference is to shift the focus from idea generation to idea execution. Providing road-tested insights
Consensus 2016 will define what is “real” in blockchain technology and focus on how to mainstream real-world applications for consumers.