Stars
An open-source CRF Reference String Parsing Package
Neuralized version of the Reference String Parser component of the ParsCit package.
[WSDM'2024 Oral] "LLMRec: Large Language Models with Graph Augmentation for Recommendation"
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)
Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.
OpenP5: An Open-Source Platform for Developing, Training, and Evaluating LLM-based Recommender Systems
A course on getting started with the Twitter API v2 for academic research
A library of sklearn compatible categorical variable encoders
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
Using the Gmail API to topic model my recommended Medium reads
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
Python API for Science Parse
Automatically profile dataframes in the Jupyter sidebar
A machine learning software for extracting information from scholarly documents
π π€ Semantic search and workflows for medical/scientific papers
π βοΈ ETL processes for medical and scientific papers
Extract article or news by url or html, parse the title and content, output in markdown format.
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
Science Parse parses scientific papers (in PDF form) and returns them in structured form.
Python PDF parser for scientific publications: content and figures
Python client for GROBID Web services
Resoruce to help you to prepare for your comming data science interviews
πͺ Models' quality and performance metrics (R2, ICC, LOO, AIC, BF, ...)
Linguistic and stylistic complexity measures for (literary) texts
π python package to calculate readability statistics of a text object - paragraphs, sentences, articles.
A Python library to extract tabular data from PDFs
A collection of the most important Github repos for ML, AI & Data science practitioners