As Data Becomes a Service, Will Data Scientists Disappear?

Photo by Paul L Dineen via Flickr.

Even while CIOs tighten belts and demand clear-cut results from technology investments, the value of data-driven insight is evident. Gartner projects investment in Business Intelligence and Analytics software platforms will continue to increase, driven in part by the desire to perform predictive analytics in-house.

Read more »

10 Reasons Why Hadoop Will Dominate in 2013

Hadoop-elephant.png

The Apache Hadoop platform has been ascendant in recent years, thanks to its flexibility, rich developer ecosystem, and ability to suit the analytics needs of developers, web startups and enterprises. Hadoop is on track to become prevalent in 2013, with Gartner predicting it will be embedded in two thirds of Big Data analytics products by 2015.

Read more »

Demonstrating the Future of Data Science at the Strata Conference

the_new_normal

A wise man once said only a fool would attempt a live demonstration (anyone remember Bill Gates and Windows 98?). Apparently I am that fool. Last month at the Strata Conference, my cohort Matt Neglay and I presented a talk, “Demonstrating the Future of Data Science.” As we demonstrated to the standing room-only crowd, data science is changing.

Read more »

OpenChorus and Greenplum’s Kaggle Partnership in the News

thinkingbigtogether

This week’s announcement that Greenplum is open-sourcing its collaborative data science platform Chorus and partnering with Kaggle  to connect OpenChorus users with the data scientist elite has generated lots of press. Announced at this week’s O’Reilly Strata conference in New York City, OpenChorus and the Kaggle partnership will enable customers, partners, developers, and data scientists to collaboratively realize the predictive potential of Big Data. Here’s a roundup of some of the responses in the media:

The New York Times Bits blog: Creating Big Data’s Talent Mart

Scott Yara, a co-founder of Greenplum and now its senior vice president of products, said his company already has a dedicated staff of 25 data scientists but has more work than it can handle.

Read more »

Data Science: Neither Elementary Nor Magic

Photo by Ian-S on Flickr.

A recent GigaOm post, “Why becoming a data scientist might be easier than you think”, has fired up considerable debate in the community. Just from reading the headline, you can probably infer why. Derrick Harris points to the success of a handful of programming neophytes who won Kaggle competitions after taking a class on Coursera, a free online machine learning course co-founded by Stanford professor Andrew Ng.

Read more »

Hadoop MapReduce Can Transform How You Build Top-Ten Lists

Cat icon by Marie Coons via The Noun Project.

It seems like websites, magazines, and TV shows all over the place are building top ten lists (or top-k lists) these days. The top ten science fiction movies of all time, the best places to live, etc. Top-ten lists are not only a lot of fun because of our seemingly primal need to create categories and hierarchies — they can actually be a useful way to analyze your data.

Read more »