Squint: A peephole optimizer for stack VM compilers
-
Updated
Oct 17, 2025 - C
Squint: A peephole optimizer for stack VM compilers
Production Android AI with ExecuTorch 1.0 - Deploy PyTorch models to mobile with NPU acceleration and 50KB footprint
Official codebase for the MLSys 2026 paper "IntAttention: A Fully Integer Attention Pipeline for Efficient Edge Inference". It enables high-fidelity and high-speed LLM/ViT deployment on ARM CPUs.
Real-time SAM2 segmentation on edge devices - 40x faster C++ inference with ONNX Runtime for iOS/Android deployment
Add a description, image, and links to the arm-optimization topic page so that developers can more easily learn about it.
To associate your repository with the arm-optimization topic, visit your repo's landing page and select "manage topics."