Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 18,386 2,307 Updated Sep 27, 2024

MyIPTV

713 152 Updated Sep 29, 2024

A relatively powerful web mind map.

JavaScript 6,105 860 Updated Sep 29, 2024

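pke_zh: Python keyphrase extraction for Chinese (zh). A tool for extracting Chinese keywords or key sentences that implements KeyBert, PositionRank, TopicRank, TextRank, and other algorithms, ready to use out of the box.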

Python 183 29 Updated Mar 27, 2024

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

2,895 662 Updated Aug 5, 2024

Content Farm Terminator (「終結內容農場」) browser extension

JavaScript 1,313 47 Updated Sep 22, 2024

An opinionated list of awesome Python frameworks, libraries, software and resources.

Python 220,043 24,821 Updated Aug 11, 2024

A summary of existing representative LLM text datasets.

837 83 Updated Sep 4, 2024

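Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the techniques, principles, and applications related to large models.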

Jupyter Notebook 253 32 Updated Jul 21, 2024
Python 254 30 Updated Jun 13, 2024

DataComp for Language Models

HTML 1,119 99 Updated Sep 5, 2024

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models.

Python 4,756 470 Updated Sep 27, 2024
Python 148 13 Updated Nov 13, 2023

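MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche cultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.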

3,411 233 Updated Sep 14, 2024

Chinese large models

5,329 437 Updated Jun 7, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 18,223 1,847 Updated Sep 29, 2024

Brand new TTS solution

Python 12,642 950 Updated Sep 20, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,462 143 Updated Sep 25, 2024

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 17,754 2,165 Updated Feb 4, 2024

System for AI Education Resource.

Python 3,452 430 Updated Jun 21, 2024

leaked prompts of GPTs

28,388 3,831 Updated Sep 27, 2024

The most usable IPTV channel list for Beijing Unicom and Beijing Mobile. https://bjiptv.gq/

HTML 1,637 277 Updated Sep 18, 2024

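✯ A directly accessible TV/radio icon library and related tools project ✯ 🔕 Permanently free, direct access, fully open source, continually improved channel logos, IPv4/IPv6 dual-stack access supported 🔕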

JavaScript 21,743 3,252 Updated Sep 29, 2024

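Configuration files for FongMi TV and tvbox. If you like them, please fork them for your own use. Read the repository notes carefully before use; using the files will be taken to mean you have understood them.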

JavaScript 5,492 2,237 Updated Sep 22, 2024

FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 17,170 4,598 Updated Sep 29, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,187 3,811 Updated Sep 17, 2024

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, Cross Encoder

Python 439 38 Updated Aug 4, 2024

Slides and assignments for Hung-yi Lee's Spring 2021/2022/2023 machine learning courses

Jupyter Notebook 6,072 1,563 Updated Jun 3, 2023

This project aims to share the technical principles behind large models and hands-on experience with them.

HTML 9,361 916 Updated Sep 22, 2024

Deeply customize your own EPG program guide and HD channel logos

3,889 569 Updated Sep 13, 2024