carey60354

1 follower · 5 following

Stars

lafmdp / Awesome-Papers-Autonomous-Agent

A collection of recent papers on building autonomous agents. Two topics are included: RL-based and LLM-based agents.

537 49 Updated Sep 21, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 26,466 2,993 Updated Aug 12, 2024

triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend

Python 667 96 Updated Sep 30, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,334 936 Updated Oct 1, 2024

yangjianxin1 / Firefly-LLaMA2-Chinese

Firefly Chinese LLaMA-2 large model, supporting incremental pre-training of large models such as Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, and Bloom

Python 396 31 Updated Oct 21, 2023

abetlen / llama-cpp-python

Python bindings for llama.cpp

Python 7,840 938 Updated Oct 3, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 65,833 9,452 Updated Oct 5, 2024

meta-llama / llama

Inference code for Llama models

Python 55,848 9,511 Updated Aug 18, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,850 4,109 Updated Oct 5, 2024

NVlabs / tiny-cuda-nn

Lightning fast C++/CUDA neural network framework

C++ 3,697 451 Updated Aug 26, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,983 4,062 Updated Oct 4, 2024

HeKun-NVIDIA / CUDA-Programming-Guide-in-Chinese

This is a Chinese translation of the CUDA programming guide

1,201 192 Updated Jan 5, 2023

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 10,614 2,110 Updated Oct 2, 2024

Yanxing-Shi / Awesome-HPC

Common HPC interview questions and materials

4 Updated May 2, 2022

NVIDIA / FasterTransformer

Transformer-related optimization, including BERT and GPT

C++ 5,808 889 Updated Mar 27, 2024

NVIDIA / cutlass

CUDA Templates for Linear Algebra Subroutines

C++ 5,455 924 Updated Sep 25, 2024

jobbole / awesome-cpp-cn

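Chinese edition of the awesome C++ resource list, covering the standard library, web application frameworks, artificial intelligence, databases, image processing, machine learning, logging, code analysis, and more. Maintained and updated by the teams behind the WeChat official accounts 「开源前哨」 and 「CPP开发者」.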

10,016 2,182 Updated Dec 28, 2023

fffaraz / awesome-cpp

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

59,107 7,783 Updated Oct 1, 2024

0xGhost / GraphicsForGames

C 11 2 Updated Nov 25, 2019

sogou / workflow

C++ Parallel Computing and Asynchronous Networking Framework

C++ 13,015 2,407 Updated Sep 30, 2024

NVIDIA / cuda-samples

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 6,198 1,782 Updated Jul 26, 2024

qinguoyi / TinyWebServer

🔥 A lightweight C++ web server for Linux

C++ 16,531 3,900 Updated Jul 5, 2024

wybiral / terrain

Create a 3d terrain with WebGL.

JavaScript 57 8 Updated Sep 9, 2020

ossrs / state-threads

Lightweight thread library for C/C++ coroutine (similar to goroutine), for high performance network servers.

C++ 719 276 Updated Jul 8, 2024

ireader / media-server

RTSP/RTP/RTMP/FLV/HLS/MPEG-TS/MPEG-PS/MPEG-DASH/MP4/fMP4/MKV/WebM

C 3,061 1,074 Updated Sep 29, 2024

quic-go / quic-go

A QUIC implementation in pure Go

Go 10,012 1,305 Updated Oct 3, 2024

ossrs / srs

SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.

C++ 25,465 5,354 Updated Sep 28, 2024

monikkinom / ner-lstm

Named Entity Recognition using multilayered bidirectional LSTM

Python 539 182 Updated Mar 10, 2019

ihmstefanini / Fuzzy-Expert-System

Expert system with fuzzy control for froth flotation control

Jupyter Notebook 5 4 Updated Oct 1, 2018

Kunkakola / Foam-Flotation

Machine-vision-based working condition recognition system

Python 6 Updated Nov 16, 2017