Stars
TruthfulQA: Measuring How Models Imitate Human Falsehoods
A Tool for Navigating LLMs and Prompts for Computational Social Science and Digital Humanities Research
SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese
The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.
A browser extension for collecting social media data.
List of coordinated networks and accounts that circulated news stories on Facebook rated as problematic by third-party fact-checkers
Code and data repository for the "Detecting Coordinated Link Sharing Behaviour with CooRnet" tutorial of the Digital Media Initiative 2023 Winter School
Code and documentation to train Stanford's Alpaca models, and generate the data.
Data and code behind the articles and graphics at FiveThirtyEight
The BBC's Open Source Web Application. Contributions welcome! Used on some of our biggest websites, e.g.
Mecodify tool for Twitter data analysis and visualisation
YTDT is a collection of simple tools for extracting data from the YouTube platform via the YouTube API v3.
Sample files to accompany the FT's Chart Doctor column
This repository has details regarding books read and to be read by the AICN Ethics Bookclub
Persine is an automated tool to study and reverse-engineer algorithmic recommendation systems.
Shiny app dashboard of HK district councillors' information, including their FB pages.
List of data journalism courses and programmes from universities and higher education institutions around the world
How Facebook and Google skew the distribution of advertisements, absent any targeting from the advertiser
Manuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
An overview and exploration of the concept of missing datasets.