MaxGekk
🎯
Focusing

Organizations

@apache @databricks


Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,079 907 Updated Oct 2, 2024

A tool to get better debug info on Spark's memory usage

Scala 42 15 Updated Aug 21, 2019

Tips for developing Apache Spark, especially in IntelliJ IDEA

3 1 Updated Jan 24, 2020

Command line history manager for bash

C++ 28 3 Updated Mar 11, 2023

All the things about TPC-DS in Apache Spark

Scala 104 39 Updated Jun 15, 2023

Jupyter Notebook 7 2 Updated Aug 23, 2021

Task Metrics Explorer

Scala 13 9 Updated Apr 2, 2019

Open source platform for the machine learning lifecycle

Python 18,437 4,172 Updated Oct 4, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,491 1,683 Updated Oct 2, 2024

Code that'll help you kickstart a personal website that showcases your work as a software developer.

HTML 7,427 6,702 Updated Dec 21, 2023

Spark Structured Streaming State Tools

Scala 34 9 Updated Jul 3, 2020

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

C++ 19,217 1,017 Updated Oct 5, 2024

Koalas: pandas API on Apache Spark

Python 3,330 356 Updated Mar 20, 2024

A lightweight library to inject LLVM bitcode into JVMs

C++ 81 7 Updated Dec 9, 2019

Log analyser / visualiser for Java HotSpot JIT compiler. Inspect inlining decisions, hot methods, bytecode, and assembly. View results in the JavaFX user interface.

Java 3,079 437 Updated Aug 16, 2024

Run Spark calculations from Ammonite

Scala 118 18 Updated Aug 21, 2024

Spark SQL index for Parquet tables

Scala 132 35 Updated May 6, 2021

A Scala library for interacting with the Slack API and real-time messaging interface

Scala 186 105 Updated Aug 28, 2024

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 183 34 Updated Feb 12, 2023

Qubole Sparklens tool for performance tuning Apache Spark

Scala 562 138 Updated Jun 26, 2024

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala 699 144 Updated Aug 13, 2024

Schema Registry integration for Apache Spark

Scala 39 18 Updated Nov 16, 2022

Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol

Scala 34 19 Updated Sep 8, 2022

Example project showing how to use Hive UDFs in Apache Spark

Scala 55 25 Updated Apr 23, 2019

Scala library for .netrc files

Scala 2 1 Updated Feb 1, 2018

Simple JDBC client for Apache Spark

Scala 7 1 Updated Dec 16, 2017

Apache Spark - A unified analytics engine for large-scale data processing

Scala 39,415 28,223 Updated Oct 5, 2024

Mirror of Apache Kafka

Java 28,512 13,875 Updated Oct 4, 2024

Readings in Databases

7,647 896 Updated Sep 9, 2024