-
Wuhan University
- China
- https://www.cnblogs.com/xuhaoshuai/
- @HaoshuaiXu
Stars
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Zotero plugin to manage your attachments: automatically rename, move, and attach PDFs (or other files) to Zotero items, sync PDFs from your Zotero library to your (mobile) PDF reader (e.g. an iPad,…
Python code for "Probabilistic Machine learning" book by Kevin Murphy
An Obsidian plugin that formats and styles your notes with a focus on configurability and extensibility.
Medical NLP Competition, dataset, large models, paper
一本 GPT4 生成的单词书📚,超过 8000 个单词分析,涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
Switch my calibre library from ascii path to plain Unicode path. 将我的书库从拼音目录切换至非纯英文(中文)命名
Rule Snippet & Rule Set for Surge / Clash Premium / Clash Meta
Precision Medicine Knowledge Graph (PrimeKG)
基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
ACL'2022: Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction
The source code of NeurIPS 2020 paper "CogLTX: Applying BERT to Long Texts"
Code for paper "Document-Level Argument Extraction by Conditional Generation". NAACL 21'
📔 Day One diary entries to markdown files
Early Retirement Extreme 中文翻译 Renaissance & Liberty
Check酱:监测网页内容变化,并发送异动到微信。亦支持http status、json和rss监测。配合自架云端,关电脑后也能运行。
Python3重写了很多常用的开发工具和开发流程,欢迎Star和提新需求,不断完善和更新Alfred Workflow。包含不限于时间戳,编码转换,随机密码,快速打开终端,快速创建文件等
A comprehensive, unified and modular event extraction toolkit.
A PyTorch Library for Meta-learning Research
geekdada / surge-list
Forked from Blankwonder/surge-listRules for Surge. DOMAIN-SET update daily.
sentence embedding by Smooth Inverse Frequency weighting scheme
🦄 🎃 👻 V2Ray 路由规则文件加强版,可代替 V2Ray 官方 geoip.dat 和 geosite.dat,适用于 V2Ray、Xray-core、mihomo(Clash-Meta)、hysteria、Trojan-Go 和 leaf。Enhanced edition of V2Ray rules dat files, applicable to V2Ray, Xray-core…
ShadowsocksR update rss, SSR organization https://github.com/shadowsocksr
Community managed domain list. Generate geosite.dat for V2Ray.