You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Manage your topic's configuration (partitions, replication factor, parameters), ACLs, quotas, users and get stats, without any effort with this library. It does not use the Kafka scripts and does not require ssh connection to the remote broker.
A scalable architecture for real-time log processing and visualization. Built with a Kafka-Spark ETL pipeline, DynamoDB for storing aggregate real-time metrics, and Python Dash for interactive dashboards. Designed for high-throughput log ingestion, real-time monitoring, and long-term storage.
Real-time data streaming project using Python, Pandas, and Confluent Kafka to publish and consume retail data through Kafka topics, demonstrating scalability and dynamic consumer behavior.
RealTimeIncrementalSync is a real-time data streaming system that fetches product updates from a MySQL database, serializes the data in Avro format, and streams it to Kafka. Kafka consumers deserialize the data and append it to JSON files, enabling real-time monitoring and analysis of product information.
L'objectif c'est de mettre en place des pipelines batch, streaming, et de qualité de données afin d’assurer la fiabilité, la cohérence et la fraîcheur des données pour les analyses.
A complete DataOps + AI project for streaming text analytics using Kafka, Spark Streaming, Hugging Face Transformers, Elasticsearch, Kibana, and Airflow.
A data pipeline demo that streams structured data from Kafka to TimescaleDB using Kafka Connect, ideal for real-time analytics and time-series storage.