Marco Haruni

I build AI from the silicon up GPU-obsessed, hardware-aware, latency-conscious, and production-tested.

I work on large language models, mathematical research, world models, GPU and TPU kernels, quantization, and inference systems.

Books
Videos
Blog
Projects

GitHubXLinkedInHugging FaceEmail