Disruptive Data Science – Transforming Your Company into a Data Science-Driven Enterprise
Big Data is the latest technology wave impacting C-Level executives across all areas of business, but amid the hype, there remains confusion about what it all means. The name emphasizes the exponential growth of data volumes worldwide (collectively, 2.5 Exabytes/ day in the latest estimate I saw from IDC), but more nuanced definitions of Big Data incorporate the following key tenets: diversification, low latency, and ubiquity.
Read more »
“More Hands Than Our Own”: Greenplum’s Logan Lee on Opening Chorus
Data science is a team sport that thrives upon collaboration, quick iteration, and a healthy amount of collegial competitiveness. These characteristics also drive development in the open source software community. So it’s fitting that Greenplum announced the release of Chorus, its social platform for collaboration on predictive analytics projects, as an open source project last week at the Strata Conference in New York City.
Read more »
OpenChorus and Greenplum’s Kaggle Partnership in the News
This week’s announcement that Greenplum is open-sourcing its collaborative data science platform Chorus and partnering with Kaggle to connect OpenChorus users with the data scientist elite has generated lots of press. Announced at this week’s O’Reilly Strata conference in New York City, OpenChorus and the Kaggle partnership will enable customers, partners, developers, and data scientists to collaboratively realize the predictive potential of Big Data. Here’s a roundup of some of the responses in the media:
The New York Times Bits blog: Creating Big Data’s Talent Mart
Scott Yara, a co-founder of Greenplum and now its senior vice president of products, said his company already has a dedicated staff of 25 data scientists but has more work than it can handle.
Read more »
Delve Into the Deep Blue Sea of Oceanic Data with Marinexplore
It’s widely known that most of the Earth is covered in water; the ocean alone covers 71% of the planet’s surface to be exact. The ocean contains fathoms of data, and with over 90% of it still to be explored, its processing and analysis is the very model of a Big Data problem. Marinexplore is a new open data collaboration platform and community containing 463,447,500 oceanographic measurements collected from 23,422 sensors.
Read more »