Greenplum Database 3.3 - Features

Greenplum Database is a software solution built to support the next generation of data warehousing and large-scale analytics processing. Supporting SQL and MapReduce parallel processing, Greenplum Database offers industry-leading performance at a low cost for companies managing Terabytes to Petabytes of data.

Core MPP Architecture
Greenplum Database's architecture provides automatic parallelization of data and queries – all data is automatically partitioned across all nodes of the system, and queries are planned and executed using all nodes working together in a highly coordinated fashion. Read more >

Multi-level fault tolerance
Greenplum Database utilizes multiple levels of fault-tolerance and redundency that allow it to automatically continue operation in the face of hardware or software failures. Read more >

Online system expansion
Add servers to increase storage capacity, processing performance and loading performance. The database can remain online and fully available while the expansion process takes place in the background. Performance and capacity increases linearly as servers are added.

Workload management
Provides administrative control over system resources and their allocation to queries. Allows users to be assigned to resource queues that manages the inflow of work to the database. Also allows priority adjustment of running queries.

Petabyte-scale loading
High-performance loading utilizing MPP Scatter/Gather Streaming technology. Loading speeds scale with each additional node to greater than 4TB/hour.

Trickle Micro-Batching
When loading a continuous stream, trickle micro-batching allows data to be loaded at frequent intervals (e.g. every 5 minutes) while maintaining extremely high data ingest rates.

Anywhere data access
Allows queries to be executed from the database against external data sources, returning data in parallel, regardless of their location, format, or storage medium.

Hybrid Storage & Execution (Row- and Column-Oriented)
For each table (or partition of a table), the DBA can select the storage, execution and compression settings that suit the way that table will be accessed. This includes the choice of row- or column-oriented storage & processing for any table or partition. Leverages Greenplum's Polymorphic Data Storage™ technology. Read more >

In-database compression
Utilizes industry-leading compression technology to increase performance and dramatically reduce the space required to store data. Customers can expect to see a 3-10x disk space reduction with a corresponding increase in effective I/O performance.

Multi-level partitioning
Allows flexible partitioning of tables based on date, range or value. Partitioning is specified using DDL and allows an arbitrary number of levels. The query optimizer will automatically prune unneeded partitions from the query plan.

Indexes - Btree, Bitmap, etc.
Greenplum supports a range of index types including B-Tree and Bitmap.

Comprehensive SQL
Comprehensive SQL-92 and SQL-99 support with SQL 2003 OLAP extensions. All queries are parallelized and executed across the entire system.

Native MapReduce.
MapReduce has been proven as a technique for high-scale data analysis by Internet leaders such as Google and Yahoo. Greenplum natively runs MapReduce programs within its parallel engine. Read more >

SQL 2003 OLAP Extensions
Provides a fully-parallelized implementation of SQL recently added OLAP extensions. Full standard support, including window functions, rollup, cube and a wide range of other expressive functionality.

Programmable analytics
Offers a new level of parallel analysis capabilities for mathematicians and statisticians, with support for R, linear algebra and machine learning primitives.

Client Access & 3rd Party Tools
Offers a new level of parallel analysis capabilities for mathematicians and statisticians, with support for R, linear algebra and machine learning primitives.

Greenplum Performance Monitor
View the performance of your Greenplum Database system including system metrics and query details. The dashboard view allows you to monitor the system utilization during query runs. Drill down into a query’s detail and plan to understand its performance. Read more >

pgAdmin3 for GPDB
pgAdmin 3 is the most popular and feature rich Open Source administration and development platform for PostgreSQL. Greenplum Database 3.3 ships with an enhanced version of pgAdmin 3 that has been extended to work with Greenplum Database and provides full support for Greenplum-specific capabilities.

 


What's New:

Greenplum is now accepting appointments for the free ½ day Enterprise Data Cloud Workshop. More >

See the Enterprise Data Cloud Initiative Webcast. Watch it >

Do you have MAD Skills? Find out by giving the MAD Skills white paper a read! More >


Information For: