- Databricks Inc.
- Belgrade, Serbia
- https://linkedin.com/in/maxgekk/
Stars
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
A tool to get better debug info on Spark's memory usage
Tips for developing Apache Spark, especially in IntelliJ IDEA
All the things about TPC-DS in Apache Spark
Open source platform for the machine learning lifecycle
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Code that'll help you kickstart a personal website that showcases your work as a software developer.
Spark Structured Streaming State Tools
Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
A lightweight library to inject LLVM bitcode into JVMs
Log analyser / visualiser for Java HotSpot JIT compiler. Inspect inlining decisions, hot methods, bytecode, and assembly. View results in the JavaFX user interface.
Run Spark calculations from Ammonite
A Scala library for interacting with the Slack API and real-time messaging interface
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Qubole Sparklens tool for performance tuning Apache Spark
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…
Schema Registry integration for Apache Spark
Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol
Example project showing how to use Hive UDFs in Apache Spark
Apache Spark - A unified analytics engine for large-scale data processing