thebabellibrarybot/README.md

Hi there 👋

- 🔭 I’m currently working on ...

Putting ML models into production for page analysis, text-line extraction, object detection, and HOCR of medieval manuscripts.

Here you can find a variety of tools used to annotate data for ML, format data for ML, and run models in a UI. All projects are workspaces for The Babel Public Library.

You can also check out my basic project portfolio website MumbotPorts

Some of my favorite repos are pinned below. They include a dataset I scraped and formatted to mirror MNIST, but built from a collection of 9 characters in Latin textura from medieval texts (provided by paleographers); an annotator that takes a paleographer's approach to transcribing, compiling, and carefully considering language data found in manuscripts; and exporters and an API that convert the object structures I regularly use into standardized ML formats or standardized historic library formats such as PAGE XML, COCO, MARC, or Dublin Core. Last but not least, there is an API that may or may not be available to perform ML-enabled alterations on datasets via Lambda functions and SageMaker endpoints. (The SageMaker endpoints are off more often than not, 'cause that's a whole bill.)
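For a rough idea of what an exporter of this kind does, here is a minimal sketch that is not taken from any of the repos. The input shape (`file`, `width`, `height`, `regions`, `label`, `box`) and the function name `to_coco` are assumptions made up for illustration; only the output keys follow the COCO object-detection layout.

```python
def to_coco(pages):
    """Convert a hypothetical list of page dicts into a COCO-style dict.

    Each page is assumed to look like:
      {"file": str, "width": int, "height": int,
       "regions": [{"label": str, "box": (x0, y0, x1, y1)}, ...]}
    """
    images, annotations, category_ids = [], [], {}
    ann_id = 1
    for image_id, page in enumerate(pages, start=1):
        images.append({
            "id": image_id,
            "file_name": page["file"],
            "width": page["width"],
            "height": page["height"],
        })
        for region in page["regions"]:
            # Assign category ids in order of first appearance, starting at 1.
            cat_id = category_ids.setdefault(region["label"], len(category_ids) + 1)
            x0, y0, x1, y1 = region["box"]
            w, h = x1 - x0, y1 - y0
            annotations.append({
                "id": ann_id,
                "image_id": image_id,
                "category_id": cat_id,
                "bbox": [x0, y0, w, h],  # COCO boxes are [x, y, width, height]
                "area": w * h,
                "iscrowd": 0,
            })
            ann_id += 1
    categories = [{"id": i, "name": name} for name, i in category_ids.items()]
    return {"images": images, "annotations": annotations, "categories": categories}


# Made-up example data: one page with two labeled regions.
example_pages = [{
    "file": "folio_001.jpg",
    "width": 1000,
    "height": 1500,
    "regions": [
        {"label": "text_line", "box": (10, 20, 100, 50)},
        {"label": "initial", "box": (0, 0, 40, 40)},
    ],
}]
coco = to_coco(example_pages)
```

The resulting `coco` dict can be written out with `json.dump` and read by tools that accept COCO annotations. Formats such as PAGE XML, MARC, or Dublin Core would each need their own mapping.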

- 🌱 Stack ...

  • React
  • Node.js
  • AWS
  • Python
  • GitHub
  • Docker

- 👯 I’m looking to collaborate on ...

Historic HOCR ML Pipelines!

Game asset generation!

Making DevOps Cheaper!

- 💬 Ask me about ...

I'm really interested in natural language coding, few-shot learning on depreciated data, unstructured language analysis, and just having fun with tech.

- 📫 How to reach me: ...

[email protected]

- 😄 Pronouns: ...

he/him

- ⚡ Fun fact: ...

I love to bike in NYC, 12 mi a day, baby!

Pinned

  1. BabelAnno-Test Public

    Web app for annotating manuscripts with a paleographic approach to HOCR ground-truth data labeling

    JavaScript 1

  2. autoencoder-scribes Public archive

    Autoencoder and dataloaders for running anomaly detection on a medieval-scribes version of the MNIST dataset

    Python

  3. MorganAPi Public

    Dockerized API and SQL database for browsing records from the Morgan Library's archives. Data visualization views are linked in this repo.

    JavaScript

  4. gitbot-docker Public

    Dockerized project that scrapes your GitHub repos and helps you locate exact files and lines you may be confused about.

    Python 1