Skip to content
View clam004's full-sized avatar

Block or report clam004

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
clam004/README.md

Hello fellow human 👋

I am an AI/ML engineer and former practicing physician working in generative AI Ops. My scientific passion is in deep reinforcement learning of pre-trained neural networks and imbuing these models with long term episodic memory. I enjoy communicating these concepts in an empathetic, yet technically meaningful way.

Tools

Python Python Go HF

Pinned Loading

  1. minichatgpt minichatgpt Public

    annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation

    Jupyter Notebook 18 1

  2. adaptive-computation-time adaptive-computation-time Public

    The notebook connects the formulas used in the paper to the code that implements those formulas by implementing a training pipeline on a small but meaningful dataset

    HTML 2

  3. unsupervised-speech-representation-learning unsupervised-speech-representation-learning Public

    This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that uses CPC to learn representations of sound files for the purpose…

    Jupyter Notebook 10

  4. triton-ft-api triton-ft-api Public

    tutorial on how to deploy a scalable autoregressive causal language model transformer using nvidia triton server

    Python 5