Guest blog post by Vincent Granville In this article, I summarize the components of any data science / machine learning / statistical project, as well as the cross-dependencies between these components. This will give you a general idea of what a data science or other analytic project is about. Components 1. Problem This is the top, fundamental component. I have...Go To Article
It's hard to believe, but it's true. The Apache Hadoop project, the open source implementation of Google's File System (GFS) and MapReduce execution engine, turned 10 this week. The technology, originally part of Apache Nutch, an even older open source project for Web crawling, was separated out into its own project in 2006, when a team at Yahoo was dispatched...Go To Article
2016 marks the 10th Anniversary of Hadoop. This birthday provides us an opportunity to celebrate, and also to reflect on how we got here and where we are going. Hadoop has come to symbolize big data, itself central to this century’s industrial revolution: the digital transformation of business. Ten years ago, digital business was limited to a few sectors, like...Go To Article
A few months ago I became a committer and a PMC Member at an exciting new project in the Apache Software Foundation: Apache Ignite. Ignite is a strong contender of Hazelcast and it exceeds its functionality in many regards, especially in terms of computing, transactionality, co-locality, and "queriability" via plain SQL. As defined by the Apache Ignite community: Apache Ignite...Go To Article
Great insight from a chief analyst at Creative Strategies Inc
A whole new range of information architectures is unfolding
Definitely no shortage of work for CDO
Find out whether it is time to renovate environment and assess technologies
Get a sneak peek of what is coming in the next Apache Spark releases
Good commentary from an industry visionary
Redressing the balance between theoretical and practical dimension of Data Science seems long overdue
Fascinating missive on the evolution of Hadoop and its impact on software development cultures!
An exciting new project in the Apache Software Foundation
another reminder about the importance of online security
Nice write-up about data management best practices
Sinequa makes a strong showing on the market
Smart move by Software AG to grab Terracotta
Congratulations to a cool company doing good stuff!
Curious development by Booz Allen - a little cash-flow generator?
Intel sure seems to be hedging their bets!
Is it buzzworthy? Go into a list of questions future Network Infrastructure Research aims to address
What does the near future of big data hold?
The comprehensive events calendar for data professionals!
Join us again this year in Las Vegas for our biggest, most comprehensive, and most vibrant event in cloud computing.
This technology event is for the ambitious enterprise technology professional, seeking to explore the latest innovations, implementations and strategies to drive businesses forward.
Join the first-ever data streaming industry event at Current 2022: The Next Generation of Kafka Summit. You’ll be able to immerse yourself in all things real-time data with peers.
OSCON covers FLOSS in its entirety. Not just one language, tool, or philosophy, but all the moving parts integrated and working together.
The goal of the 99U Conference is to shift the focus from idea generation to idea execution. Providing road-tested insights
Consensus 2016 will define what is “real” in blockchain technology and focus on how to mainstream real-world applications for consumers.