Greenplum HD: Enterprise-Ready Apache Hadoop
Hadoop Support for the Enterprise
Rely on EMC to provide 24x7 worldwide support with the industry’s largest Hadoop support infrastructure
Proven at Scale
Certified by EMC to remove the guesswork associated with Hadoop deployments
Pluggable Storage Options
Leverage best of breed storage options with no changes to applications
Delivering Enterprise-Ready Apache Hadoop
Recent computing and business trends have triggered an explosion in the amount of unstructured data companies generate each day. By extracting the knowledge wrapped within unstructured and machine-generated data, your enterprise can make better decisions that drive revenue and reduce costs. Hadoop has rapidly emerged as the preferred solution for big data analytics across unstructured data. But the fast-changing Hadoop ecosystem can present challenges to any company that wants to standardize on core functionality and build repeatable processes.
Deploy Hadoop Faster and Easier
Greenplum HD enables you to take advantage of big data analytics without the overhead and complexity of a project built from scratch. Available as software or in a pre-configured Data Computing Appliance Module, Greenplum HD provides a complete platform, including installation, training, global support, and value-add beyond simple packaging of the Apache Hadoop distribution. The Greenplum HD Module combines Hadoop and the Greenplum Database in a single purpose-built Data Computing Appliance.
Greenplum HD brings a pluggable storage layer to Hadoop that supports both HDFS and Isilon OneFS Storage
HDFS Storage Option
Greenplum HD is a 100 percent open-source certified and supported version of the Apache Hadoop stack that includes HDFS, MapReduce, Hive, Pig, Hbase and Zookeeper.
Isilon OneFS Storage Option
Greenplum HD supports Isilon’s OneFS Scale-Out NAS Storage for Hadoop. EMC Isilon scale-out NAS is the first and only Enterprise NAS solution that can natively integrate with the Hadoop Distributed File System (HDFS) layer.
Greenplum HD with pluggable storage is also supported in a pre-configured appliance option with the Greenplum HD DCA Module
Greenplum HD DCA Module
The Greenplum HD DCA Module is the world’s first high-performance data co-processing Hadoop appliance module. The DCA fuses Hadoop with the Greenplum Database, allowing the co-processing of both structured and unstructured data within a single, seamless solution.
Greenplum HD
Greenplum HD is a 100 percent open-source certified and supported version of the Apache Hadoop stack that includes HDFS, MapReduce, Hive, Pig, Hbase and Zookeeper. Backed by the world’s largest Hadoop support organization and tested at scale in Greenplum’s 1,000 node Analytics Workbench, Greenplum HD brings flexible storage options to an enterprise-ready Hadoop stack. Greenplum HD makes Hadoop faster, more dependable, and easier to use.
Isilon Scale Out NAS for Greenplum HD
Greenplum HD supports Isilon’s OneFS Scale-Out NAS Storage for Hadoop. EMC Isilon scale-out NAS is the first and only Enterprise NAS solution that can natively integrate with the Hadoop Distributed File System (HDFS) layer. By treating HDFS as an over the wire protocol, you can quickly deploy a comprehensive big data analytics solution that combines Greenplum HD with Isilon scale-out NAS storage systems to provide a powerful, highly efficient and flexible data storage and analytics ecosystem.
The combined Greenplum HD and Isilon solution enables your organization to avoid the resource-intensive complexity of traditional Hadoop deployments by providing a packaged, yet comprehensive Hadoop system. This approach enables you to focus more on analyzing your business rather than spending valuable resources struggling with the technical complexities of configuring and managing a Hadoop cluster.
Greenplum HD DCA Module
The Greenplum HD DCA Module seamlessly integrates the Greenplum HD software into an appliance, providing an optimized configuration built for performance and reliability. The Greenplum Data Computing Appliance marries the unstructured batch-processing power of Hadoop with the Greenplum Database and the breakthrough Massively Parallel Processing (MPP) architecture. This allows enterprises to extract value from both structured and unstructured data under a single, seamless platform.
Greenplum stands behind its products with rigid SLAs, and carefully tests each component for maturity prior to inclusion in a release. Greenplum’s support standards are enterprise-class and deliver end-to-end service from day one.
Isilon Scale-Out NAS for Greenplum HD
EMC Isilon Big Data Storage and Analytics Solution that can natively integrate with the Hadoop Distributed File System (HDFS) layer.
An emerging MapReduce platform layered on a distributed file system—Hadoop and HDFS—is one of the solutions more recently being selected by companies to address their big data analytics needs.
Hadoop, an Apache Foundation Open Source project, represents a way for enterprise IT to take advantage of Cloud and Internet capabilities sooner.
Video
Hadoop – The Data Scientist's Dream
Everyone is talking about Hadoop. According to many data scientists, it doesn't get better than this. But how is Hadoop being used today?
Press Release
EMC Announces 1000 Node Analytic Platform To Accelerate Industry’s Hadoop Testing and Development
EMC and industry leading companies including Intel, VMware, Micron, Seagate, Supermicro, Switch, and Mellanox Technologies Partner To Deliver the Greenplum Analytics Workbench ™ analytic computing platform
Data Sheet
Technical Brief: Data Sharing between Database and the Hadoop Distributed File System
How Greenplum Database and Hadoop work together
Analyst Report
Hadoop: Revealing It's True Value for Business Intelligence
Despite all the hubbub and hype around Hadoop, few business intelligence (BI) and data warehousing (DW) professionals know much about what Hadoop is, how it does what it does, or in which situations they should deploy it.