-
Fudan University
- Shanghai
- https://github.com/victorShawFan
-
mem0 Public
Forked from mem0ai/mem0
The Memory layer for your AI apps
Python Apache License 2.0 Updated Sep 18, 2024 -
transformer-explainer Public
Forked from poloclub/transformer-explainer
Transformer Explained: Learn How LLM Transformer Models Work with Interactive Visualization
JavaScript MIT License Updated Aug 15, 2024 -
Qwen2 Public
Forked from QwenLM/Qwen2.5
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Shell Updated Jul 16, 2024 -
Online-RLHF Public
Forked from RLHFlow/Online-RLHF
A recipe for online RLHF.
Python Updated Jun 20, 2024 -
OpenRLHF_add_simpo Public
OpenRLHF with the SimPO method added; a personal modification. Original repository: https://github.com/OpenLLMAI/OpenRLHF
-
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
Python Apache License 2.0 Updated Jun 19, 2024 -
SimPO Public
Forked from princeton-nlp/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
Python Updated May 25, 2024 -
llama3-from-scratch Public
Forked from naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Jupyter Notebook MIT License Updated May 23, 2024 -
CLUE Public
Forked from CLUEbenchmark/CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Python Updated May 23, 2024 -
llama3 Public
Forked from meta-llama/llama3
The official Meta Llama 3 GitHub site
Python Other Updated May 10, 2024 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Python Apache License 2.0 Updated May 8, 2024 -
modpo Public
Forked from ZHZisZZ/modpo
[ACL 2024] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization.
Python Updated Apr 16, 2024 -
HALOs Public
Forked from ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, and other human-aware loss functions (HALOs).
Python Apache License 2.0 Updated Mar 26, 2024 -
Chinese-Mixtral-8x7B Public
Forked from HIT-SCIR/Chinese-Mixtral-8x7B
Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)
Python Apache License 2.0 Updated Jan 18, 2024 -
Baichuan2 Public
Forked from baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Python Apache License 2.0 Updated Sep 6, 2023 -
DeepSpeedExamples Public
Forked from microsoft/DeepSpeedExamples
Example models using DeepSpeed
Python Apache License 2.0 Updated Aug 30, 2023 -
LLMs-cookbook Public
Forked from LearnPrompt/LLMs-cookbook
Examples and guides for using the LLMs
Jupyter Notebook Updated Aug 23, 2023 -
huanhuan-chat Public
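Forked from KMnO4-zx/huanhuan-chat
Chat-Zhen Huan (Chat-甄嬛) is a chat language model that imitates Zhen Huan's tone, obtained by LoRA fine-tuning ChatGLM2 on all of Zhen Huan's lines and dialogue from the script of "Empresses in the Palace" (《甄嬛传》).
Python Updated Aug 15, 2023 -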
More_Simple_Reinforcement_Learning Public
Forked from lansinuote/More_Simple_Reinforcement_Learning
Jupyter Notebook Updated Aug 7, 2023 -
MOSS-RLHF Public
Forked from OpenLMLab/MOSS-RLHF
MOSS-RLHF
Python Apache License 2.0 Updated Jul 11, 2023 -
InternLM Public
Forked from InternLM/InternLM
InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.
Python Apache License 2.0 Updated Jul 9, 2023 -
Baichuan-7B Public
Forked from baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Python Apache License 2.0 Updated Jul 8, 2023 -
LangChain-Chinese-Getting-Started-Guide Public
Forked from liaokongVFX/LangChain-Chinese-Getting-Started-Guide
A Chinese-language getting-started tutorial for LangChain
Updated Jul 7, 2023 -
ChatGLM2-6B Public
Forked from THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
Python Other Updated Jun 25, 2023 -
course Public
Forked from huggingface/course
The Hugging Face course on Transformers
Python Apache License 2.0 Updated Jun 1, 2023 -
transformers-code Public
Forked from zyds/transformers-code
A hands-on, step-by-step guide to Transformers in practice
Jupyter Notebook Updated May 29, 2023 -
Linly Public
Forked from CVI-SZU/Linly
Chinese-LLaMA base models; ChatFlow Chinese dialogue models; Chinese OpenLLaMA models; NLP pre-training and instruction fine-tuning datasets
Python Updated May 29, 2023 -