-
Databricks
- Beijing, China
-
spark Public
Forked from apache/sparkMirror of Apache Spark
-
polars Public
Forked from pola-rs/polarsDataframes powered by a multithreaded, vectorized query engine, written in Rust
Rust Other UpdatedSep 12, 2024 -
spark-website Public
Forked from apache/spark-websiteApache Spark Website
HTML Apache License 2.0 UpdatedAug 13, 2024 -
pandas Public
Forked from pandas-dev/pandasFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 13, 2024 -
scikit-learn Public
Forked from scikit-learn/scikit-learnscikit-learn: machine learning in Python
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 13, 2024 -
spark-connect-go Public
Forked from apache/spark-connect-goApache Spark Connect Client for Golang
Go Apache License 2.0 UpdatedAug 13, 2024 -
xgboost Public
Forked from dmlc/xgboostScalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
C++ Apache License 2.0 UpdatedAug 13, 2024 -
arrow-datafusion Public
Forked from apache/datafusionApache Arrow DataFusion and Ballista query engines
Rust Apache License 2.0 UpdatedJun 8, 2024 -
aexpy Public
Forked from StardustDL/aexpyAexPy /eɪkspaɪ/ is Api EXplorer in PYthon for detecting API breaking changes in Python packages. (ISSRE'22)
Python Mozilla Public License 2.0 UpdatedMar 28, 2024 -
numpy Public
Forked from numpy/numpyThe fundamental package for scientific computing with Python.
Python Other UpdatedMar 28, 2024 -
ray Public
Forked from ray-project/rayAn open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyp…
Python Apache License 2.0 UpdatedMar 28, 2024 -
arrow Public
Forked from apache/arrowApache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
C++ Apache License 2.0 UpdatedMar 28, 2024 -
scipy Public
Forked from scipy/scipySciPy library main repository
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 7, 2023 -
-
modin Public
Forked from modin-project/modinModin: Scale your Pandas workflows by changing a single line of code
Python Apache License 2.0 UpdatedDec 7, 2023 -
LightGBM Public
Forked from microsoft/LightGBMA fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks…
C++ MIT License UpdatedDec 7, 2023 -
-
langchain Public
Forked from langchain-ai/langchain⚡ Building applications with LLMs through composability ⚡
Python MIT License UpdatedMay 9, 2023 -
Py4J enables Python programs to dynamically access arbitrary Java objects
Java Other UpdatedMar 13, 2023 -
breeze Public
Forked from scalanlp/breezeBreeze is a numerical processing library for Scala.
Scala Apache License 2.0 UpdatedSep 28, 2022 -
dbt-databricks Public
Forked from databricks/dbt-databricksA dbt adapter for Databricks.
Python Apache License 2.0 UpdatedAug 23, 2022 -
-
spark-libFM Public
An implement of Factorization Machines (LibFM)