Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Updated Nov 21, 2025 - Python
Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digital/.
Business Intelligence (BI) in Python, OLAP
Supercharge BigQuery with BigFunctions
(Finished) Notes for Geek Time's 45-lecture practical data analysis course: detailed Markdown notes with images, mind maps, code, and data that can be read and tested directly
Read data from, write data to, and modify the formatting of Google Sheets
ISP Data Pollution to Protect Private Browsing History with Obfuscation
A data management platform for the web, developed by Kitware
A toolbox for processing and analysing air traffic data
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, and Bitcoin Cash. Available in Google BigQuery: https://goo.gl/oY5BCQ
Airflow DAGs for exporting, loading, and parsing Ethereum blockchain data. How to get any Ethereum smart contract into BigQuery: https://towardsdatascience.com/how-to-get-any-ethereum-smart-contract-into-bigquery-in-8-mins-bab5db1fdeee
An MCP (Model Context Protocol) server for interacting with dbt.
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Snowflake Snowpark Python API
Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
Code and data for the Modern Polars book
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
Python program that rates stocks out of 100 based on valuation, profitability, growth, and price performance metrics, relative to the company's sector.