This document discusses mixing analytic workloads between Greenplum and Apache Spark. It recommends using Greenplum for large-scale data processing and queries over entire datasets, while using Spark for in-memory processing, data exploration, and ETL workloads involving streaming or micro-batches of data. It introduces the Greenplum-Spark connector for high-speed parallel data transfer between the two systems and optimizing performance through column projection and predicate pushdown. The key benefits are faster data transfer and leveraging the strengths of both platforms for mixed workloads.