Stars
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Example code for running Spark and Hive jobs on EMR Serverless.
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Examples for running Debezium (Configuration, Docker Compose files etc.)
An adaptive radix tree for efficient indexing in main memory.
A schema-first tool for graphql-java inspired by graphql-tools for JS
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
Contains company-wise questions sorted by frequency and all time.
Demo code showing how to use Java's StructuredTaskScope
Spring demo application to compare controllers using CompletableFuture vs. virtual threads.
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…
A data generator source connector for Flink SQL based on data-faker.
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
A lightweight message queue. Like AWS SQS and RSMQ but on Postgres.
Snowflake Data Source for Apache Spark.
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
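"Hello 算法" (Hello Algo): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports code in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is ongoing.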
A library that provides an embeddable, persistent key-value store for fast storage.