Stars
Udacity Data Engineering Nanodegree Program, Data Pipeline with Airflow project using MinIO and Postgresql.
This repo demonstrates how to integrate existing files in object storage into Iceberg files as metadata-only operations using the Iceberg Java API.
🔎 📈 🐍 💰 Backtest trading strategies in Python.
Nyc_Taxi_Data_Pipeline - DE Project
Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAM…
A template repository to create a data project with IAC, CI/CD, Data migrations, & testing
An example repository showing how to leverage Kafka to stream your data
Workshop on optimizing PySpark pipelines.
Answer key for my Kubernetes for Beginners Course on Udemy
📚Open Source Curriculum for CNCF Certification Courses
GenAI + Airflow. Fine-tuning + RAG pipeline for content generation.
This code is associated to the article "6 recommandations pour optimiser un job Spark"
Unlock the potential of Apache Spark, a robust distributed computing framework for large-scale data processing. Dive into the world of efficient data functions with Python decorators, tackling sche…
Bringing Data from MySQL to Kafka Using Debezium, Joining Kafka Topics with Flink, Upserting into a New Kafka Topic, and Ingesting into Hudi Real-Time
Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand
LLM Zoomcamp - a free online course about building a Q&A system
Terraform module for creating Athena views
Big Data Demystified meetup and blog examples
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
Source code accompanying O'Reilly book: Machine Learning Design Patterns
Repository for the dynamic tasks webinar on 2022-10-18.
GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers
Get the course here: https://deeplearningcourses.com/c/ai-finance