Ben Maxwell

I build machine learning systems and AI tools, ranging from language models trained from scratch to shipped full-stack apps.

Projects & Learning

soloLLM

A GPT-2-class language model built and trained from scratch on a single GPU using a self-curated 10B-token dataset. The best model beats GPT-2 small on held-out perplexity, WikiText-2, and LAMBADA.

  • Python
  • PyTorch
  • Hugging Face
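
For readers unfamiliar with the metric, held-out perplexity is the exponential of the average negative log-likelihood a model assigns to unseen tokens, so lower is better. The sketch below is a minimal illustration of that definition only. The function name `perplexity` and the toy inputs are assumptions for illustration, not the project's evaluation code.

```python
import math


def perplexity(token_log_probs):
    """Return exp(mean negative log-likelihood) for per-token log-probs in nats."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# Toy example (made-up numbers): a model that gives every held-out token
# probability 1/4 is as uncertain as a uniform choice among 4 options.
uniform_log_probs = [math.log(0.25)] * 8
print(perplexity(uniform_log_probs))  # close to 4.0
```

In practice the log-probabilities would come from the model's softmax output on a held-out split, and comparisons between models are only meaningful when the tokenization and the evaluation text match.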

Handcrafted Neural Networks

Neural-network logic built by hand: features, weights, and thresholds are chosen manually across a series of notebooks to make the mechanics of a tiny network visible, from 3×3 line detection to an ARC-style task.

  • Python
  • PyTorch
  • From scratch
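
To give a flavor of what hand-chosen weights and thresholds can look like, here is a minimal sketch of a 3×3 line detector written in plain Python. Everything in it is an assumption made for illustration: the weight pattern (+1 on the target line, -1 elsewhere), the threshold of 2.5, and the names `make_line_units`, `fires`, and `detect_lines` are hypothetical and may differ from what the notebooks actually use.

```python
THRESHOLD = 2.5  # hypothetical value: a unit fires only when its weighted sum exceeds this


def make_line_units():
    """Build six hand-set 3x3 weight grids: one per row and one per column."""
    units = {}
    for r in range(3):
        # +1 on the target row, -1 on every other cell
        units[f"row{r}"] = [[1 if i == r else -1 for _ in range(3)] for i in range(3)]
    for c in range(3):
        # +1 on the target column, -1 on every other cell
        units[f"col{c}"] = [[1 if j == c else -1 for j in range(3)] for _ in range(3)]
    return units


def fires(weights, grid):
    """Threshold unit: weighted sum of the binary grid compared with THRESHOLD."""
    total = sum(weights[i][j] * grid[i][j] for i in range(3) for j in range(3))
    return total > THRESHOLD


def detect_lines(grid):
    """Return the names of all units that fire on a 3x3 grid of 0s and 1s."""
    return [name for name, w in make_line_units().items() if fires(w, grid)]


middle_row = [
    [0, 0, 0],
    [1, 1, 1],
    [0, 0, 0],
]
print(detect_lines(middle_row))
```

With these weights a unit's sum reaches 3 only when its own three cells are on and every other cell is off, so any stray pixel pushes the sum below the threshold. That makes each unit a detector for one clean line, and the reason it fires can be read directly from the numbers.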