Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
- Updated
Nov 26, 2025 - Java
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Maestro: Netflix’s Workflow Orchestrator
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Multi-hop declarative data pipelines
Spark-Transformers: Library for exporting Apache Spark MLLIB models to use them in any Java application with no other dependencies.
A Realtime Seismic Logging & Alerts Service with Live Monitoring & Email Alerts made using Kafka Data Pipelines, all Dockerized & Deployment Ready!
Samstraumr is a R&D framework in development to implement systems theory tube-based design concepts in software architecture for experimenting with adaptive systems.
Real-Time Data Pipeline: Postgres CDC → Kafka → Spark Streaming → MySQL → Live Dashboard
JSON Streaming With Mongo Streams
Add a description, image, and links to the data-pipelines topic page so that developers can more easily learn about it.
To associate your repository with the data-pipelines topic, visit your repo's landing page and select "manage topics."