-
Hugging_Face Public
quantization ner finetuning huggingface bert-fine-tuning huggingface-transformers huggingface-datasets
Jupyter Notebook Updated Aug 27, 2024 -
Agent-Long-Term-Memory Public
Agent long term personalized memory
Python MIT License Updated Aug 24, 2024 -
llama_cpp_quantization Public
Efficient Llama Model Quantization: A Python and C++ integrated approach to reduce model size and inference latency while maintaining accuracy, designed for scalable deployment in AI systems
Jupyter Notebook MIT License Updated Aug 11, 2024 -
tree_attention Public
Forked from Zyphra/tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
Python Updated Aug 9, 2024 -
llama.cpp Public
Forked from ggerganov/llama.cpp
LLM inference in C/C++
C++ MIT License Updated Jul 23, 2024 -
acezero Public
Forked from nianticlabs/acezero
ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
Python Other Updated Jul 23, 2024 -
neuralgcm Public
Forked from google-research/neuralgcm
Hybrid ML + physics model of the Earth's atmosphere
Python Apache License 2.0 Updated Jul 20, 2024