-
Microsoft
- United States
- https://alexkyllo.com
- @alexkyllo
Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Python packaging and dependency management made easy
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
The fundamental package for scientific computing with Python.
Data Apps & Dashboards for Python. No JavaScript Required.
Free and Open Source Enterprise Resource Planning (ERP)
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Open standard for machine learning interoperability
Zipline, a Pythonic Algorithmic Trading Library
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
🦉 ML Experiments and Data Management with Git
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
Perform data science on data that remains in someone else's server
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
A modern Python package and dependency manager supporting the latest PEP standards
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…
This is a repository for collecting global custom management extensions for the Django Framework.
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data…
Create beautiful, publication-quality books and documents from computational content.
Quickly and accurately render even the largest data.