Greenplum HD: Enterprise-Ready Apache Hadoop

Hadoop Support for the Enterprise

Rely on EMC to provide 24x7 worldwide support with the industry’s largest Hadoop support infrastructure

Proven at Scale

Certified by EMC to remove the guesswork associated with Hadoop deployments

Pluggable Storage Options

Leverage best-of-breed storage options with no changes to applications

Delivering Enterprise-Ready Apache Hadoop

Recent computing and business trends have triggered an explosion in the amount of unstructured data companies generate each day. By extracting the knowledge wrapped within unstructured and machine-generated data, your enterprise can make better decisions that drive revenue and reduce costs. Hadoop has rapidly emerged as the preferred solution for big data analytics across unstructured data. But the fast-changing Hadoop ecosystem can present challenges to any company that wants to standardize on core functionality and build repeatable processes.

Deploy Hadoop Faster and Easier

Greenplum HD enables you to take advantage of big data analytics without the overhead and complexity of a project built from scratch. Available as software or in a pre-configured Data Computing Appliance Module, Greenplum HD provides a complete platform, including installation, training, global support, and value-add beyond simple packaging of the Apache Hadoop distribution. The Greenplum HD Module combines Hadoop and the Greenplum Database in a single purpose-built Data Computing Appliance.

EMC Greenplum Named a Leader in Enterprise Hadoop Solutions

Download the Report

The Forrester Wave™: Enterprise Hadoop Solutions Q1 2012



Greenplum HD brings a pluggable storage layer to Hadoop that supports both HDFS and Isilon OneFS Storage

HDFS Storage Option

Greenplum HD is a 100 percent open-source, certified and supported version of the Apache Hadoop stack that includes HDFS, MapReduce, Hive, Pig, HBase, and ZooKeeper.

Isilon OneFS Storage Option

Greenplum HD supports Isilon’s OneFS Scale-Out NAS Storage for Hadoop. EMC Isilon scale-out NAS is the first and only Enterprise NAS solution that can natively integrate with the Hadoop Distributed File System (HDFS) layer.




Greenplum HD with pluggable storage is also supported in a pre-configured appliance option with the Greenplum HD DCA Module

Greenplum HD DCA Module

The Greenplum HD DCA Module is the world’s first high-performance data co-processing Hadoop appliance module. The DCA fuses Hadoop with the Greenplum Database, allowing the co-processing of both structured and unstructured data within a single, seamless solution.

Greenplum HD

Greenplum HD is a 100 percent open-source, certified and supported version of the Apache Hadoop stack that includes HDFS, MapReduce, Hive, Pig, HBase, and ZooKeeper. Backed by the world’s largest Hadoop support organization and tested at scale in Greenplum’s 1,000-node Analytics Workbench, Greenplum HD brings flexible storage options to an enterprise-ready Hadoop stack. Greenplum HD makes Hadoop faster, more dependable, and easier to use.

Isilon Scale Out NAS for Greenplum HD

Greenplum HD supports Isilon’s OneFS Scale-Out NAS Storage for Hadoop. EMC Isilon scale-out NAS is the first and only Enterprise NAS solution that can natively integrate with the Hadoop Distributed File System (HDFS) layer. By treating HDFS as an over-the-wire protocol, you can quickly deploy a comprehensive big data analytics solution that combines Greenplum HD with Isilon scale-out NAS storage systems to provide a powerful, highly efficient, and flexible data storage and analytics ecosystem.

The combined Greenplum HD and Isilon solution enables your organization to avoid the resource-intensive complexity of traditional Hadoop deployments by providing a packaged, yet comprehensive Hadoop system. This approach enables you to focus more on analyzing your business rather than spending valuable resources struggling with the technical complexities of configuring and managing a Hadoop cluster.

Greenplum HD DCA Module

The Greenplum HD DCA Module seamlessly integrates the Greenplum HD software into an appliance, providing an optimized configuration built for performance and reliability. The Greenplum Data Computing Appliance marries the unstructured batch-processing power of Hadoop with the Greenplum Database and the breakthrough Massively Parallel Processing (MPP) architecture. This allows enterprises to extract value from both structured and unstructured data under a single, seamless platform.

Greenplum stands behind its products with rigid SLAs, and carefully tests each component for maturity prior to inclusion in a release. Greenplum’s support standards are enterprise-class and deliver end-to-end service from day one.

Greenplum HD Technologies

Data Sheet

Isilon Scale-Out NAS for Greenplum HD

An EMC Isilon big data storage and analytics solution that natively integrates with the Hadoop Distributed File System (HDFS) layer.

Data Sheet

Greenplum HD

Delivering enterprise-ready Apache Hadoop.

Whitepaper

EMC's Enterprise Solution

An emerging MapReduce platform layered on a distributed file system (Hadoop and HDFS) is one of the solutions that companies have recently been selecting to address their big data analytics needs.


Analyst Report

The Enterprise Use of Hadoop

Hadoop, an Apache Foundation Open Source project, represents a way for enterprise IT to take advantage of Cloud and Internet capabilities sooner.

Video

Hadoop – The Data Scientist's Dream

Everyone is talking about Hadoop. According to many data scientists, it doesn't get better than this. But how is Hadoop being used today?

Press Release

EMC Announces 1000 Node Analytic Platform To Accelerate Industry’s Hadoop Testing and Development

EMC and industry-leading companies including Intel, VMware, Micron, Seagate, Supermicro, Switch, and Mellanox Technologies Partner To Deliver the Greenplum Analytics Workbench™ analytic computing platform


Analyst Report

Hadoop: Revealing Its True Value for Business Intelligence

Despite all the hubbub and hype around Hadoop, few business intelligence (BI) and data warehousing (DW) professionals know much about what Hadoop is, how it does what it does, or in which situations they should deploy it.