All Topics
News & Blogs
Hadoop
The History of Hadoop: From Small Starts to Big Data
Greenplum Blog
March 15, 2013 –
Named after a toy elephant belonging to developer Doug Cutting’s son, over the past decade Hadoop has proven to be the little platform that could. From its humble beginnings as an open source search engine project created by Cutting and Mike Cafarella, Hadoop has evolved into a robust platform for Big Data storage and analysis. How did an open source project started by a moonlighting developer and a University of Washington grad student become ubiquitous in so many data-driven settings? In its new four-part series, GigaOm documents Hadoop’s history, its growth, and the promising future of the platform.
Analytics
Ideas for Solving the 'Data' Problem First, the 'Big' Problem Second: The Pentaho Way
Forbes
March 14, 2013 –
Imagine a word cloud to represent discussions about how to make data useful in business right now. Based on what I saw at Strata in late February in Santa Clara, you would see “big” in letters about 4 inches high and the word “data” in regular 12 point type. As my pals at Gawker say, “Thatz Not Okay.”
Predictive
High Scalability
March 13, 2013 –
Jason Wolfe from Prismatic has generously agreed to describe their thoroughly modern solution for answering the “what to read question” using lots of sexy words like Machine Learning, Social Graphs, BigData, functional programming, and in-memory real-time feed processing. The result is possibly even more occult, but this or something very much like it will be how we meet the challenge of finding interesting topics and stories hidden inside infinitely deep pools of information.
Analytics
Pinterest launches Web Analytics for brands, takes one giant leap towards monetization
Venture Beat
March 12, 2013 –
Pinterest launched Pinterest Web Analytics this morning to help brands and marketers see what Pinterest users are doing with their content. It’s a simple and, yes, very Pinteresty take on what brands want to know, and it’s a big step towards Pinterest actually starting to monetize its massive traffic asset.
Big Data
The big data world is operating at 1 percent
Gigaom
March 11, 2013 –
We talk a lot about big data, but only analyze 1 percent of what’s available. In order to take advantage of the other 99 percent, we need to reconsider how we do big data. The other 99 percent of the 1 quintillion bytes of data that is collected every day (according to a recent study from IDC) remains untouched.
Agile
Open Source “Purity,” Hadoop, and Market Realities
Gartner Blog
March 11, 2013 –
I don’t often do a pure opinion piece but I feel compelled to weigh in on a queston I’ve been asked several times since EMC released its Pivotal HD recently. The question is whether it is somehow inappropriate, even “evil,” for EMC to enter the market without having “enough” committers to open source Apache projects. More broadly, it’s about whether other people can use, incorporate, add to and profit from Apache Hadoop.
Agile
Cloud Foundry Open for Business and Commits
Greenplum Blog
March 8, 2013 –
Cloud Foundry, the Platform as a Service (PaaS) solution offered by EMC’s newly-minted Pivotal Initiative, opened itself this week to clients and the open source community. The Pivotal Initative will be selling Cloud Foundry as a cloud software service, while opening the project’s code for collaboration and commits through Github.
Predictive
Learning more from your 23andMe results with Imputation
Genomes Unzipped
March 8, 2013 –
Suppose that you’ve had your DNA genotyped by 23andMe or some other DTC genetic testing company. Then an article shows up in your morning newspaper or journal (like this one) and suddenly there’s an additional variant you want to know about. You check your raw genotypes file to see if the variant is present on the chip, but it isn’t! So what next?
Predictive
Technology Turns to Tracking People Offline
The New York Times
March 7, 2013 –
Following people online, with cookies, tagged pixels and even voluntarily given information, has been a big business. Now much of the same technology is moving into the physical world. A company called Euclid Analytics uses the Wi-Fi antennas inside stores to see how many people are coming into a store, how long they stay and even which aisles they walk. It does this by noting each smartphone that comes near the store, feeding on every signal ping the phone sends.
Predictive
Search-Engine Data Gives Early Warnings of Drug Side Effects
Wired
March 7, 2013 –
Analyzing queries made to Google, Bing, and other search engines can reveal the potentially dangerous consequences of mixing prescriptions before they are known to the Food and Drug Administration (FDA), according to a new study. Such data mining could even expose medical risks that slip through clinical trials undetected.