Skip to content
View astroC86's full-sized avatar

Block or report astroC86

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.

C++ 18 2 Updated May 12, 2024

TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.

C++ 122 9 Updated Sep 10, 2024
C++ 4 Updated Dec 14, 2023

Source for Demystifying GPU Microarchitecture through Microbenchmarking

Cuda 16 4 Updated May 29, 2023

High Performance Inter-Thread Messaging Library

Java 17,363 3,918 Updated Sep 13, 2024

The QuantLib C++ library

C++ 5,230 1,779 Updated Sep 20, 2024

LLM inference in Fortran

Fortran 54 1 Updated May 30, 2024

Repository of example Darshan log files of interest

Shell 1 6 Updated May 3, 2024

Tutorial to help developers ramp up on UEFI environment and programming.

Makefile 96 5 Updated Sep 19, 2017

💎 Amber the programming language compiled to Bash

Rust 3,822 82 Updated Sep 19, 2024

Temporary repository for Kind2's refactor based on HVM2

Rust 278 26 Updated Aug 26, 2024

A massively parallel, high-level programming language

Rust 17,225 424 Updated Sep 17, 2024

Gambit: The package for computation in game theory

C++ 396 149 Updated Sep 19, 2024

Coz: Causal Profiling

C 4,050 160 Updated Jul 16, 2024

Scalable Distributed System Model Checking with Specification-Level State Exploration

TLA 22 Updated Apr 24, 2024

Generic model checker for concurrent C programs (mirror repository)

C++ 104 17 Updated Sep 11, 2024

Next generation SPARSE implementation for ROCm platform

C++ 117 53 Updated Sep 19, 2024

Simple OpenACC Fortran Examples

Fortran 52 10 Updated Aug 1, 2021

Contains sources related to the lectures and labs for the NVIDIA OpenACC course.

C 52 35 Updated Oct 23, 2019

The world's first wait-free Software Transactional Memory

C++ 168 20 Updated Feb 21, 2020

A massively parallel, optimal functional runtime in Rust

Cuda 10,436 396 Updated Sep 4, 2024

BLISlab: A Sandbox for Optimizing GEMM

C 468 99 Updated Jun 17, 2021

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 7,991 291 Updated Aug 31, 2024

The curated list of awesome C++ Coroutine resources.

7 Updated Mar 8, 2024
Python 2 Updated Oct 29, 2020

How to use node-local MPI rank IDs to manually map MPI ranks to GPUs

Cuda 9 3 Updated Apr 22, 2020

Implementation of a Tensor Processing Unit for embedded systems and the IoT.

VHDL 383 62 Updated Jan 5, 2019

This tutorial demonstrates how to use CUDA-Aware MPI

Cuda 33 3 Updated May 16, 2023

FlexGripPlus: an open-source GPU model for reliability evaluation and micro architectural simulation

VHDL 80 18 Updated May 11, 2023
Next