🚄 FASTJSON2 is a Java JSON library with excellent performance.
-
Updated
Jul 7, 2024 - Java
🚄 FASTJSON2 is a Java JSON library with excellent performance.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
ClickHouse® is a real-time analytics DBMS
Leveraging PySpark to analyze the IMDB database, answer various queries, and develop machine learning models to predict a movie's popularity based on its cast
AI + Data, online. https://vespa.ai
Seamless multi-master syncing database with an intuitive HTTP/JSON API, designed for reliability
YTsaurus is a scalable and fault-tolerant open-source big data platform.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Read and write Neuroglancer datasets programmatically.
A collection of my data science journey - projects, code, and notes.
QuestDB is an open source time-series database for fast ingest and SQL queries
Apache DataFusion SQL Query Engine
SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.
High performance data store solution
汐洛彖夲肜矩阵(Sillot T☳Converbenk Matrix),致力于服务智慧新彖乄
Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.
To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."