

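MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, and even "Martian script" (火星文). It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.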

3,394 232 Updated Sep 14, 2024

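A curated list of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.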

15,022 1,395 Updated Sep 19, 2024

surge module

JavaScript 811 103 Updated Aug 11, 2024

Zotero plugin to manage your attachments: automatically rename, move, and attach PDFs (or other files) to Zotero items, sync PDFs from your Zotero library to your (mobile) PDF reader (e.g. an iPad,…

Java 3,959 280 Updated Apr 16, 2024

A collection of tools for quickly creating polished readme.md files.

289 3 Updated Jul 7, 2024

Python code for "Probabilistic Machine Learning" book by Kevin Murphy

Jupyter Notebook 6,472 1,519 Updated Aug 5, 2024

An Obsidian plugin that formats and styles your notes with a focus on configurability and extensibility.

TypeScript 1,181 79 Updated Sep 16, 2024

Medical NLP Competition, dataset, large models, paper

2,091 400 Updated Jun 8, 2024

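A vocabulary book 📚 generated by GPT4, with analyses of more than 8,000 words covering meanings, example sentences, roots and affixes, inflections, cultural background, memory tips, and short stories.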

HTML 3,188 214 Updated Aug 17, 2024

A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.

Go 84,615 13,170 Updated Sep 6, 2024

🧡 Next generation information browser.

TypeScript 9,423 395 Updated Sep 22, 2024

Switch my calibre library from ASCII paths to plain Unicode paths, that is, from pinyin directory names to non-English (Chinese) names.

Python 1,015 39 Updated Sep 13, 2024

Rule Snippet & Rule Set for Surge / Clash Premium / Clash Meta

TypeScript 1,746 134 Updated Sep 22, 2024

Precision Medicine Knowledge Graph (PrimeKG)

Jupyter Notebook 380 83 Updated May 27, 2024

Fine-tuning of the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models for specific downstream tasks, covering Freeze, Lora, P-tuning, full-parameter fine-tuning, and more.

Python 2,634 292 Updated Dec 12, 2023

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,369 511 Updated Jul 2, 2024

ACL'2022: Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction

Python 125 15 Updated May 5, 2023

The source code of NeurIPS 2020 paper "CogLTX: Applying BERT to Long Texts"

Python 268 54 Updated May 17, 2022

Code for paper "Document-Level Argument Extraction by Conditional Generation". NAACL 21'

HTML 115 29 Updated Mar 28, 2023

📔 Day One diary entries to markdown files

Python 37 Updated Jan 31, 2022

Chinese translation of Early Retirement Extreme. Renaissance & Liberty

HTML 55 16 Updated Nov 5, 2022

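Check酱: monitors web page content for changes and sends the changes to WeChat. Also supports monitoring of HTTP status, JSON, and RSS. Paired with a self-hosted cloud server, it keeps running even after the computer is shut down.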

JavaScript 1,705 160 Updated Apr 12, 2023

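Many commonly used development tools and workflows rewritten in Python3. Stars and feature requests are welcome; this Alfred Workflow is continually improved and updated. Includes, but is not limited to, timestamps, encoding conversion, random passwords, quickly opening a terminal, and quickly creating files.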

326 11 Updated Oct 9, 2023

A comprehensive, unified and modular event extraction toolkit.

Python 341 33 Updated Oct 15, 2023

A PyTorch Library for Meta-learning Research

Python 2,619 351 Updated Jun 7, 2024

Rules for Surge. DOMAIN-SET update daily.

Smarty 224 26 Updated Sep 22, 2024

sentence embedding by Smooth Inverse Frequency weighting scheme

Python 1,084 306 Updated Jul 23, 2019

🦄 🎃 👻 Enhanced edition of V2Ray routing rule files, which can replace the official V2Ray geoip.dat and geosite.dat. Applicable to V2Ray, Xray-core, mihomo (Clash-Meta), hysteria, Trojan-Go, and leaf.

14,671 1,703 Updated Sep 21, 2024

ShadowsocksR update rss, SSR organization https://github.com/shadowsocksr

4,224 1,068 Updated Sep 3, 2017

Community managed domain list. Generate geosite.dat for V2Ray.

Go 4,693 848 Updated Sep 20, 2024