bmax16634/README.md

Ben Maxwell

Data scientist and ML engineer focused on practical AI systems.

I work on small-scale LLM training and evaluation, and on shipping the full application around the model.

Featured — SoloLLM

A decoder-only GPT-style transformer designed and pretrained from scratch in PyTorch on a single RTX 3090.

  • Final v3 model (151.9M parameters) trained on a self-curated 10B-token dataset
  • Beats GPT-2 small overall on a fixed eval suite: held-out perplexity, WikiText-2, LAMBADA, and multiple-choice continuation scoring
  • Open source code, published Hugging Face model cards, and live completion demo

Code · Model collection · Final v3 model · Live demo

What I'm interested in

  • Small / efficient models that punch above their weight
  • Training tricks that matter on real hardware
  • Honest model evaluation and benchmark comparisons
  • End-to-end AI products — model in the middle, real product around it
  • Local-first AI tooling on consumer GPUs

Stack

  • Primary — Python, PyTorch, FastAPI, SQL
  • ML / Data — Hugging Face (Transformers, Datasets), NumPy, pandas, Polars, scikit-learn
  • Also — Node.js, R, Java, JavaScript, Streamlit
  • Environments — Linux, Docker, Git

Background

  • UT Austin — Post Graduate Program in Generative AI for Business Applications (Jan 2026)
  • Arizona State University — B.S. Data Science, summa cum laude (May 2024)
  • SmartRent — Workforce Management Analyst (forecasting, scheduling optimization, KPI pipelines)

Pinned

  1. bmax16634.github.io Public

    JavaScript

  2. soloLLM Public

    GPT-2-class language models trained from scratch in PyTorch on one RTX 3090, with 10B-token data curation and full GPT-2 comparisons.

    Python