Skip to content
View carey60354's full-sized avatar

Block or report carey60354

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.

537 49 Updated Sep 21, 2024

The official Meta Llama 3 GitHub site

Python 26,466 2,993 Updated Aug 12, 2024

The Triton TensorRT-LLM Backend

Python 667 96 Updated Sep 30, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,334 936 Updated Oct 1, 2024

Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

Python 396 31 Updated Oct 21, 2023

Python bindings for llama.cpp

Python 7,840 938 Updated Oct 3, 2024

LLM inference in C/C++

C++ 65,833 9,452 Updated Oct 5, 2024

Inference code for Llama models

Python 55,848 9,511 Updated Aug 18, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,850 4,109 Updated Oct 5, 2024

Lightning fast C++/CUDA neural network framework

C++ 3,697 451 Updated Aug 26, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,983 4,062 Updated Oct 4, 2024

This is a Chinese translation of the CUDA programming guide

1,201 192 Updated Jan 5, 2023

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 10,614 2,110 Updated Oct 2, 2024

Common HPC interview questions and materials

4 Updated May 2, 2022

Transformer related optimization, including BERT, GPT

C++ 5,808 889 Updated Mar 27, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,455 924 Updated Sep 25, 2024

C++ 资源大全中文版,标准库、Web应用框架、人工智能、数据库、图片处理、机器学习、日志、代码分析等。由「开源前哨」和「CPP开发者」微信公号团队维护更新。

10,016 2,182 Updated Dec 28, 2023

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

59,107 7,783 Updated Oct 1, 2024

C++ Parallel Computing and Asynchronous Networking Framework

C++ 13,015 2,407 Updated Sep 30, 2024

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 6,198 1,782 Updated Jul 26, 2024

🔥 Linux下C++轻量级WebServer服务器

C++ 16,531 3,900 Updated Jul 5, 2024

Create a 3d terrain with WebGL.

JavaScript 57 8 Updated Sep 9, 2020

Lightweight thread library for C/C++ coroutine (similar to goroutine), for high performance network servers.

C++ 719 276 Updated Jul 8, 2024

RTSP/RTP/RTMP/FLV/HLS/MPEG-TS/MPEG-PS/MPEG-DASH/MP4/fMP4/MKV/WebM

C 3,061 1,074 Updated Sep 29, 2024

A QUIC implementation in pure Go

Go 10,012 1,305 Updated Oct 3, 2024

SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.

C++ 25,465 5,354 Updated Sep 28, 2024

Named Entity Recognition using multilayered bidirectional LSTM

Python 539 182 Updated Mar 10, 2019

Expert System with Fuzzy Control to Froth Flotation control

Jupyter Notebook 5 4 Updated Oct 1, 2018

基于机器视觉工况识别系统

Python 6 Updated Nov 16, 2017
Next