What is Google Cloud Dataflow?
Cloud Dataflow is a fully-managed service that enables seamless transformation and enrichment of data in both real-time and historical modes. It offers a unified programming model and a managed service for executing a diverse range of data processing patterns, including ETL, batch computation, and continuous computation. Cloud Dataflow frees users from operational tasks such as resource management and performance optimization, providing a serverless approach to resource provisioning and management. This allows access to virtually limitless capacity to tackle even the most complex data processing challenges, with users only paying for the resources they consume
Highlights
- Unified programming model for developing and executing a wide range of data processing patterns
- Fully-managed service for transforming and enriching data in stream (real-time) and batch (historical) modes
- Serverless approach to resource provisioning and management, providing access to virtually limitless capacity
- Eliminates the need for complex workarounds or compromises in data processing
- Frees users from operational tasks like resource management and performance optimization
Features
Combines batch and streaming with a single API
High performance with automatic workload
Fully managed