- CAIS
- London, UK
- https://deepumohan.com/
- @deepumohanp
- in/deepumohanp
⭐ Data
re_data - fix data issues before your users & CEO discover them 😊
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
The Metadata Platform for your Data and AI Stack
Immutable database and Datalog query engine for Clojure, ClojureScript and JS
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Data Lake as Code, featuring ChEMBL and OpenTargets
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
This repository has moved into https://github.com/dbt-labs/dbt-adapters
Data Contracts engine for the modern data stack. https://www.soda.io
🔥 🔥 🔥 A Free & Self-hostable Airtable Alternative
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker
Apache Superset is a Data Visualization and Data Exploration Platform
A Clojure dataframe library that runs on Spark
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL
An implementation of differential dataflow using timely dataflow in Rust.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data…
A modular implementation of timely dataflow in Rust
Self-serve BI to 10x your data team ⚡️
This is a collection of AWS CDK projects to show how to directly ingest streaming data from Amazon Managed Streaming for Apache Kafka (MSK) and MSK Serverless into an Apache Iceberg table in S3 with …
Event Driven Orchestration & Scheduling Platform for Mission Critical Applications
🦀 Event stream processing for developers to collect and transform data in motion to power responsive, data-intensive applications.
TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.
Do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies, and build machine learning models.




