-
secor Public
Forked from pinterest/secor
Secor is a service implementing Kafka log persistence
Java Apache License 2.0 Updated Jan 26, 2023 -
mosaic Public
Forked from databrickslabs/mosaic
An extension to the Apache Spark framework that allows easy and fast processing of very large geospatial datasets.
Scala Other Updated Aug 1, 2022 -
polars Public
Forked from pola-rs/polars
Fast multi-threaded DataFrame library in Rust | Python | Node.js
Rust MIT License Updated Jul 31, 2022 -
Apache-Kafka-Series---Learn-Apache-Kafka-for-Beginners Public
Forked from PacktPublishing/Apache-Kafka-Series---Learn-Apache-Kafka-for-Beginners-v3
Code Repository for Apache Kafka Series - Learn Apache Kafka for Beginners, published by Packt
MIT License Updated Jun 13, 2022 -
SparkInternals Public
Forked from JerryLead/SparkInternals
Notes talking about the design and implementation of Apache Spark
Updated Apr 5, 2022 -
awesome-spark Public
Forked from awesome-spark/awesome-spark
A curated list of awesome Apache Spark packages and resources.
Shell Creative Commons Zero v1.0 Universal Updated Dec 30, 2021 -
trino-encrypt-udfs Public
Forked from victorcouste/trino-encrypt-udfs
Trino UDFs Plugin to encrypt/decrypt values with a password
Java Updated Dec 24, 2021 -
jmx_exporter Public
Forked from prometheus/jmx_exporter
A process for exposing JMX Beans via HTTP for Prometheus consumption
Java Apache License 2.0 Updated Dec 16, 2021 -
cp-helm-charts Public
Forked from confluentinc/cp-helm-charts
The Confluent Platform Helm charts enable you to deploy Confluent Platform services on Kubernetes for development, test, and proof of concept environments.
Mustache Apache License 2.0 Updated Dec 15, 2021 -
airbyte Public
Forked from airbytehq/airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Java Other Updated Dec 15, 2021 -
quenya-dsl Public
Forked from music-of-the-ainur/quenya-dsl
Quenya DSL (Domain Specific Language) is a language that simplifies the task of parsing complex semi-structured data
Scala Apache License 2.0 Updated Dec 13, 2021 -
gallia-core Public
Forked from galliaproject/gallia-core
A Scala library for data manipulation
Scala Other Updated Dec 6, 2021 -
kubernetes-1 Public
Forked from justmeandopensource/kubernetes
Kubernetes playground
Shell Updated Dec 1, 2021 -
great_expectations Public
Forked from great-expectations/great_expectations
Always know what to expect from your data.
Python Apache License 2.0 Updated Nov 29, 2021 -
marquez Public
Forked from MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
Java Apache License 2.0 Updated Nov 28, 2021 -
spark-flowchart Public
Forked from holdenk/spark-flowchart
Flowchart for debugging Spark applications
Shell Updated Nov 23, 2021 -
kcat Public
Forked from edenhill/kcat
Generic command line non-JVM Apache Kafka producer and consumer
C Other Updated Nov 18, 2021 -
tech-interview-handbook Public
Forked from yangshun/tech-interview-handbook
💯 Curated interview preparation materials for busy engineers
JavaScript MIT License Updated Nov 17, 2021 -
dbt-tips Public
Forked from erika-e/dbt-tips
Collection of dbt Tips and Tricks
GNU General Public License v3.0 Updated Nov 10, 2021 -
druid-operator Public
Forked from druid-io/druid-operator
Druid Kubernetes Operator
Go Other Updated Nov 9, 2021 -
loki Public
Forked from grafana/loki
Like Prometheus, but for logs.
Go GNU Affero General Public License v3.0 Updated Nov 8, 2021 -
turnilo Public
Forked from allegro/turnilo
Business intelligence, data exploration and visualization web application for Druid, formerly known as Swiv and Pivot
TypeScript Apache License 2.0 Updated Nov 8, 2021 -
incubator-sedona Public
Forked from apache/sedona
A cluster computing framework for processing large-scale geospatial data
Java Apache License 2.0 Updated Nov 3, 2021 -
metaflow Public
Forked from Netflix/metaflow
🚀 Build and manage real-life data science projects with ease!
Python Apache License 2.0 Updated Nov 2, 2021 -
lakeFS Public
Forked from treeverse/lakeFS
Git-like capabilities for your object storage
Go Apache License 2.0 Updated Oct 26, 2021 -
EVCache Public
Forked from Netflix/EVCache
A distributed in-memory data store for the cloud
Java Apache License 2.0 Updated Oct 25, 2021 -
singer Public
Forked from pinterest/singer
A high-performance, reliable and extensible logging agent for uploading data to Kafka, Pulsar, etc.
Java Apache License 2.0 Updated Oct 18, 2021 -
Burrow Public
Forked from linkedin/Burrow
Kafka Consumer Lag Checking
Go Apache License 2.0 Updated Oct 15, 2021