Marco Haruni
I build AI from the silicon up GPU-obsessed, hardware-aware, latency-conscious, and production-tested.
I work on large language models, mathematical research, world models, GPU and TPU kernels, quantization, and inference systems.
Books
Videos
Blog
Projects
GitHub • X • LinkedIn • Hugging Face • Email