Shawn David Iuliucci | Air-Gapped AI Operator | 20+ Years Scaling Systems
Running 70B-parameter LLMs air-gapped on NVIDIA DGX Spark. No cloud. Just results. Prototypes include an air-gapped 70B RAG system on DGX Spark and an edge RAG system on an Intel Lunar Lake Getac F120, with a focus on distributed training and model parallelism.
With over 20 years in software engineering, I've scaled systems that handle billions in value, in both cloud and no-cloud environments. I'm now working on large-scale AI frameworks, with a strong interest in contributing to NVIDIA’s Megatron Core and NeMo teams.
Proficient in Python and actively mastering Rust and Kubernetes through hands-on projects. I'm early in my specialized AI career but fully dedicated to scaling LLMs for real-world impact.
I'm building 4 open-source RAG tools, one per domain, with 4 arXiv papers and 4 conference keynotes in the pipeline:
classified-rag: 70B on synthetic CUI, 0.9s latency
rail-rag: 100k containers in 0.2s
fin-rag: 1B trades in <5 min
health-rag: 50 hospitals, federated, no PHI
All domains unified: 4 keynotes, DGX leaderboard