Skip to content
View erip's full-sized avatar
  • Fairfax, VA

Block or report erip

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
61 stars written in Python
Clear filter

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 132,487 26,387 Updated Sep 25, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 82,424 22,178 Updated Sep 25, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,213 6,376 Updated Sep 9, 2024

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 29,725 4,360 Updated Sep 14, 2024

🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your da…

Python 16,873 1,846 Updated Sep 24, 2024

State-of-the-Art Text Embeddings

Python 14,907 2,439 Updated Sep 19, 2024

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,389 2,066 Updated Jan 23, 2024

Hydra is a framework for elegantly configuring complex applications

Python 8,639 622 Updated Sep 18, 2024

A PyTorch-based Speech Toolkit

Python 8,605 1,367 Updated Sep 24, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,391 595 Updated Sep 20, 2024

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,232 887 Updated Sep 22, 2024

Python Stream Processing

Python 6,724 535 Updated Jul 27, 2024

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 6,430 657 Updated Aug 29, 2024

A system for quickly generating training data with weak supervision

Python 5,787 858 Updated May 2, 2024

A data augmentations library for audio, image, text, and video.

Python 4,946 299 Updated Sep 20, 2024

A Unified Toolkit for Deep Learning Based Document Image Analysis

Python 4,800 462 Updated Aug 15, 2024

A Python implementation of LightFM, a hybrid recommendation algorithm.

Python 4,728 688 Updated Jul 24, 2024

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,621 423 Updated Aug 29, 2024

Pampy: The Pattern Matching for Python you always dreamed of.

Python 3,514 125 Updated Mar 29, 2022

Models, data loaders and abstractions for language processing, powered by PyTorch

Python 3,498 814 Updated Sep 24, 2024
Python 3,237 145 Updated Apr 9, 2024

Foundation Architecture for (M)LLMs

Python 3,002 202 Updated Apr 11, 2024

Flexible Python configuration system. The last one you will ever need.

Python 1,945 106 Updated May 30, 2024

A fast, efficient universal vector embedding utility package.

Python 1,623 119 Updated Aug 3, 2023

This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"

Python 1,485 427 Updated Aug 27, 2021

Mozilla's Localization Platform

Python 1,455 526 Updated Sep 24, 2024

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

Python 1,340 164 Updated Jun 5, 2024

DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks

Python 1,263 133 Updated Mar 2, 2023

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Python 1,194 143 Updated Jan 16, 2024

A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

Python 1,118 149 Updated Sep 24, 2024
Next