All Topics
News & Blogs
Hadoop
EMC Integrates Greenplum DB and Hadoop with Pivotal HD
Wikibon
February 25, 2013 –
Greenplum today announced a new Hadoop distribution called Pivotal HD. It is based on Hadoop 2.0 and integrates the Greenplum database with Apache Hadoop running directly against HDFS. The move addresses a common concern among Big Data practitioners. While Hadoop has proven itself a scalable, cost effective Big Data storage and processing platform, analyzing data loaded in Hadoop is a complex affair.
Hadoop
Live from Hadoop: The Foundation for Change in San Francisco
Greenplum Blog
February 25, 2013 –
Live from San Francisco at 10am, EMC Greenplum will unveil product and technology innovations that will make Hadoop bigger, better, more accessible and meaningful than ever before.
Hadoop
Greenplum Blog
February 25, 2013 –
This week, we’ll have a number of events to introduce Pivotal HD, our new Hadoop distribution. Pivotal HD isn’t just about making Hadoop better, it’s significantly expanding Hadoop capabilities as a data platform. In this first post, I’ll explore the core functionality enabled by the major new component of Pivotal HD: HAWQ , the first, fully functional, high-performance, all-encompassing relational database that runs in Hadoop.
Hadoop
EMC to Hadoop competition: “See ya, wouldn’t wanna be ya.”
GigaOm
February 25, 2013 –
If, like many industry watchers, you’ve been confused about EMC Greenplum’s Hadoop strategy over the past couple years, Scott Yara has a message for you: “We’re all in on Hadoop, period.” Yara, Greenplum’s co-founder and senior vice president of products, has a not-so-coded message for his big data market competitors, too. Put simply, he doesn’t think they stand a chance against his company, and he served notice on Monday morning.
Hadoop
Why Hadoop Is the Future of the Database
Wired
February 25, 2013 –
At the time, a long line of startups were offering a new breed of database designed to store and analyze much larger amounts of data. Greenplum. Vertica. Netezza. Hammerbacher and Facebook tested them all. But they weren’t suited to the task either.
Hadoop
SQL is what’s next for Hadoop: Here’s who’s doing it
GigaOm
February 24, 2013 –
More and more companies and open source projects are trying to let users run SQL queries from inside Hadoop itself. Here’s a list of what’s available and, on a high level, how they work. When we first began putting together the schedule for Structure: Data several months ago, we knew that running SQL queries on Hadoop would be a big deal — we just didn’t know how big a deal it would actually become.
Hadoop
Hadoop 2013 – Part Two: Projects
Gartner
February 21, 2013 –
In Part One of this series, I pointed out that how significant attention is being lavished on performance in 2013. In this installment, the topic is projects, which are proliferating precipitously. One of my most frequent client inquiries is “which of these pieces make Hadoop?” As recently as a year ago, the question was pretty simple for most people: MapReduce, HDFS, maybe Sqoop and even Flume, Hive, Pig, HBase, Lucene/Solr, Oozie, Zookeeper.
Data Science
New York University wants to train the next generation of data scientists
Venture Beat
February 20, 2013 –
Everybody and their mother is on the hunt for a data scientist, dubbed the “sexiest job of the 21st century,” by the Harvard Business Review. To train the next generation, New York University (NYU) has launched a data science and statistics initiative, which is intended to drive breakthrough research in other programs, including medicine and technology. The new website for the program cites research from McKinsey that projects the U.S. alone will need 140,000 to 190,000 people with deep analytic skills by 2018 as well as 1.5 million managers and analysts for “big data” jobs.
Data Science
As Data Becomes a Service, Will Data Scientists Disappear?
Greenplum Blog
February 20, 2013 –
Even while CIOs tighten belts and demand clear-cut results from technology investments, the value of data-driven insight is evident. Gartner projects investment in increasingly accessible Business Intelligence and Analytics software platforms will continue to grow in the years ahead. But where will that leave the data science specialists?
Data Science
The New York Times
February 19, 2013 –
Not long ago, I was at a dinner with the chief executive of a large bank. He had just had to decide whether to pull out of Italy, given the weak economy and the prospect of a future euro crisis. The C.E.O. had his economists project out a series of downside scenarios and calculate what they would mean for his company. But, in the end, he made his decision on the basis of values.