Stars
TiledKernel is a code generation library based on macro kernels and a memory hierarchy graph data structure.
TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.
Source for Demystifying GPU Microarchitecture through Microbenchmarking
High Performance Inter-Thread Messaging Library
Repository of example Darshan log files of interest
Tutorial to help developers ramp up on UEFI environment and programming.
💎 Amber, the programming language compiled to Bash
Temporary repository for Kind2's refactor based on HVM2
A massively parallel, high-level programming language
Gambit: The package for computation in game theory
Scalable Distributed System Model Checking with Specification-Level State Exploration
Generic model checker for concurrent C programs (mirror repository)
Next generation SPARSE implementation for ROCm platform
Simple OpenACC Fortran Examples
Contains sources related to the lectures and labs for the NVIDIA OpenACC course.
The world's first wait-free Software Transactional Memory
A massively parallel, optimal functional runtime in Rust
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm
The curated list of awesome C++ Coroutine resources.
How to use node-local MPI rank IDs to manually map MPI ranks to GPUs
Implementation of a Tensor Processing Unit for embedded systems and the IoT.
This tutorial demonstrates how to use CUDA-Aware MPI
FlexGripPlus: an open-source GPU model for reliability evaluation and microarchitectural simulation