News & Blogs

Blog

Hadoop

The History of Hadoop: From Small Starts to Big Data

Greenplum Blog
March 15, 2013 – Named after a toy elephant belonging to developer Doug Cutting’s son, over the past decade Hadoop has proven to be the little platform that could. From its humble beginnings as an open source search engine project created by Cutting and Mike Cafarella, Hadoop has evolved into a robust platform for Big Data storage and analysis. How did an open source project started by a moonlighting developer and a University of Washington grad student become ubiquitous in so many data-driven settings? In its new four-part series, GigaOm documents Hadoop’s history, its growth, and the promising future of the platform.

Blog

Analytics

Ideas for Solving the 'Data' Problem First, the 'Big' Problem Second: The Pentaho Way

Forbes
March 14, 2013 – Imagine a word cloud to represent discussions about how to make data useful in business right now. Based on what I saw at Strata in late February in Santa Clara, you would see “big” in letters about 4 inches high and the word “data” in regular 12 point type. As my pals at Gawker say, “Thatz Not Okay.”

Industry News

Predictive

Prismatic Architecture - Using Machine Learning On Social Networks To Figure Out What You Should Read On The Web

High Scalability
March 13, 2013 – Jason Wolfe from Prismatic has generously agreed to describe their thoroughly modern solution for answering the “what to read question” using lots of sexy words like Machine Learning, Social Graphs, BigData, functional programming, and in-memory real-time feed processing. The result is possibly even more occult, but this or something very much like it will be how we meet the challenge of finding interesting topics and stories hidden inside infinitely deep pools of information.

Industry News

Analytics

Pinterest launches Web Analytics for brands, takes one giant leap towards monetization

Venture Beat
March 12, 2013 – Pinterest launched Pinterest Web Analytics this morning to help brands and marketers see what Pinterest users are doing with their content. It’s a simple and, yes, very Pinteresty take on what brands want to know, and it’s a big step towards Pinterest actually starting to monetize its massive traffic asset.

Industry News

Big Data

The big data world is operating at 1 percent

Gigaom
March 11, 2013 – We talk a lot about big data, but only analyze 1 percent of what’s available. In order to take advantage of the other 99 percent, we need to reconsider how we do big data. The other 99 percent of the 1 quintillion bytes of data that is collected every day (according to a recent study from IDC) remains untouched.


Blog

Agile

Open Source “Purity,” Hadoop, and Market Realities

Gartner Blog
March 11, 2013 – I don’t often do a pure opinion piece but I feel compelled to weigh in on a queston I’ve been asked several times since EMC released its Pivotal HD recently. The question is whether it is somehow inappropriate, even “evil,” for EMC to enter the market without having “enough” committers to open source Apache projects. More broadly, it’s about whether other people can use, incorporate, add to and profit from Apache Hadoop.

Blog

Agile

Cloud Foundry Open for Business and Commits

Greenplum Blog
March 8, 2013 – Cloud Foundry, the Platform as a Service (PaaS) solution offered by EMC’s newly-minted Pivotal Initiative, opened itself this week to clients and the open source community. The Pivotal Initative will be selling Cloud Foundry as a cloud software service, while opening the project’s code for collaboration and commits through Github.

Blog

Predictive

Learning more from your 23andMe results with Imputation

Genomes Unzipped
March 8, 2013 – Suppose that you’ve had your DNA genotyped by 23andMe or some other DTC genetic testing company. Then an article shows up in your morning newspaper or journal (like this one) and suddenly there’s an additional variant you want to know about. You check your raw genotypes file to see if the variant is present on the chip, but it isn’t! So what next?

Industry News

Predictive

Technology Turns to Tracking People Offline

The New York Times
March 7, 2013 – Following people online, with cookies, tagged pixels and even voluntarily given information, has been a big business. Now much of the same technology is moving into the physical world. A company called Euclid Analytics uses the Wi-Fi antennas inside stores to see how many people are coming into a store, how long they stay and even which aisles they walk. It does this by noting each smartphone that comes near the store, feeding on every signal ping the phone sends.

Industry News

Predictive

Search-Engine Data Gives Early Warnings of Drug Side Effects

Wired
March 7, 2013 – Analyzing queries made to Google, Bing, and other search engines can reveal the potentially dangerous consequences of mixing prescriptions before they are known to the Food and Drug Administration (FDA), according to a new study. Such data mining could even expose medical risks that slip through clinical trials undetected.


Pages