-
Institute of Computing Technology, Chinese Academy of Sciences
- Singapore
- https://waltbai.github.io
Lists (7)
Sort Name ascending (A-Z)
datasets
datasetsentertainment
entertainmentinformation extraction
entity / relation / event extractioninformation retrieval
information retrievalnlp-toolkits
common NLP toolkitsscript event prediction
experimental codes for script event predictionstructure learning
syntax / semantic parsing and their toolsStars
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
LlamaIndex is a data framework for your LLM applications
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Code and documentation to train Stanford's Alpaca models, and generate the data.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
Core Engine of Singing Voice Conversion & Singing Voice Clone
Demo for the "Talking Head Anime from a Single Image."
brat rapid annotation tool (brat) - for all your textual annotation needs
An Open-sourced Knowledgable Large Language Model Framework.
Library for Knowledge Intensive Language Tasks
Stanford Open Information Extraction made simple!
Dataset and codes for ACL 2019 DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Core Data of HowNet and OpenHowNet Python API
基于法律裁判文书的事件抽取及其应用,包括数据的分词、词性标注、命名实体识别、事件要素抽取和判决结果预测等内容
TuShare是实现对股票/期货等金融数据从数据采集、清洗加工 到 数据存储过程的工具,满足金融量化分析师和学习数据分析的人在数据获取方面的需求,它的特点是数据覆盖范围广,接口调用简单,响应快速。
Recurrent Event Network: Autoregressive Structure Inference over Temporal Knowledge Graphs (EMNLP 2020)
Neural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.
Several data modalities for KBs (visual, numerical, temporal, etc.)
A Large Scale Text Summarization Dataset
[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.