treeverse / lakeFS Star 5.2k lakeFS - Data version control for your data lake | Git for data go golang apache-spark aws-s3 google-cloud-storage data-engineering data-lake azure-storage data-version-control object-storage datalake hadoop-filesystem data-quality data-versioning azure-blob-storage apache-sparksql git-for-data lakefs datalakes Updated Mar 20, 2026 Go
dacruzmathis / Datalakes-and-Data-Integration Star 1 Master 2 labs and project for the Datalakes and Data Integration course at Efrei Paris. python azure cloud-computing data-integration datalakes Updated Nov 26, 2023 Python
alehurtadoxo / anvilogic-pmm-assets Star 0 A repo of all the assets PMM maintains, with messaging updates defined. detection datalakes detectionengineering ai-soc Updated Jan 22, 2026
dell-datascience / Data_Engineering Star 0 This repository is dedicated to my participation in Datatalks Mlzoomcamp data streaming spark gcp spark-streaming dbt batch-processing datawarehouse prefect dataanalytics gbq datalakes Updated Sep 30, 2024 Jupyter Notebook