data-pipeline

Here are 13 public repositories matching this topic...

A production-ready data pipeline with a Jupyter notebook, SQLite star-schema modeling, and automated ETL workflows for customer churn analysis and segmentation.

  • Updated Aug 11, 2025
  • Jupyter Notebook

An extension for monitoring Apache Airflow DAGs directly from Jupyter notebooks. Tailored for developers and data scientists, it simplifies tracking specific DAGs, reduces unnecessary friction, and lets users set severity levels for failed DAGs.

  • Updated May 29, 2023
  • Python

📈 A modular, multi-notebook visualization suite built with Plotly and Streamlit, showcasing advanced charting techniques, subplot dashboards, z-score bands, regression overlays, and app-based exploration. Ideal for mastering interactive data storytelling and production-ready Python workflows.

  • Updated Jul 29, 2025
  • HTML

📊 A comprehensive pandas mastery project with 10 modular Jupyter notebooks covering data loading, cleaning, grouping, merging, time series, visualization, and performance profiling. Includes real-world workflows, Docker, Streamlit, and reusable utils. Ideal for data scientists and analysts who want to learn, practice, and keep a reference at hand.

  • Updated Jul 29, 2025
  • Jupyter Notebook

An ELT data pipeline 🚚 for analyzing Brazilian e-commerce data. Processes CSVs and public API data using Python and Pandas, loads it into an SQLite database, and analyzes revenue/delivery metrics with SQL. Visualizations and the final report are created in a Jupyter Notebook.

  • Updated Jun 14, 2023
  • Jupyter Notebook
