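All NIST FIPS post-quantum algorithms run natively on GPU. The Number Theoretic Transform (NTT) is the shared primitive for the schemes built on polynomial arithmetic: 6 GPU NTT variants accelerate ML-DSA, ML-KEM, and Pulsar simultaneously. SLH-DSA is hash-based, so its GPU path does not depend on the NTT.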
6 NTT variants optimized for different use cases:
- ntt.metal: General-purpose butterfly
- four_step_ntt.metal: Large-degree polynomials
- ntt_unified_memory.metal: Apple Silicon zero-copy
- ntt_kernels.metal: Fused NTT + pointwise multiply
- poly_mul.metal: Polynomial multiplication via NTT
- twiddle_cache.metal: Pre-computed twiddle factors
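
To illustrate the pattern these kernels share (transform, pointwise multiply, inverse transform), here is a minimal CPU sketch in Python. It is not taken from the .metal sources: the function names (`ntt`, `negacyclic_mul`, `schoolbook_negacyclic`) are hypothetical, and the toy parameters n = 8, q = 17, psi = 3 are assumptions chosen so the arithmetic is easy to check by hand. The real schemes use n = 256 with much larger moduli, and a GPU kernel would run the butterflies of each stage in parallel rather than in a loop.

```python
def bit_reverse_permute(values):
    """Reorder values so index i moves to the bit-reversal of i (len is a power of two >= 2)."""
    n = len(values)
    bits = n.bit_length() - 1
    out = [0] * n
    for i, v in enumerate(values):
        out[int(format(i, f"0{bits}b")[::-1], 2)] = v
    return out


def ntt(values, omega, q):
    """Iterative radix-2 Cooley-Tukey NTT; omega must be a primitive n-th root of unity mod q."""
    a = bit_reverse_permute(values)
    n = len(a)
    length = 2
    while length <= n:
        half = length // 2
        w_len = pow(omega, n // length, q)  # twiddle step for this stage
        for start in range(0, n, length):
            w = 1
            for j in range(half):
                # One butterfly: combine the pair (u, v) using twiddle w.
                u = a[start + j]
                v = a[start + j + half] * w % q
                a[start + j] = (u + v) % q
                a[start + j + half] = (u - v) % q
                w = w * w_len % q
        length *= 2
    return a


def negacyclic_mul(a, b, psi, q):
    """Multiply a and b in Z_q[x]/(x^n + 1); psi must be a primitive 2n-th root of unity mod q."""
    n = len(a)
    omega = psi * psi % q
    psi_pows = [pow(psi, i, q) for i in range(n)]
    # Pre-twist by powers of psi so a cyclic NTT yields a negacyclic product.
    a_hat = ntt([x * p % q for x, p in zip(a, psi_pows)], omega, q)
    b_hat = ntt([x * p % q for x, p in zip(b, psi_pows)], omega, q)
    c_hat = [x * y % q for x, y in zip(a_hat, b_hat)]  # pointwise multiply
    # Inverse NTT: forward transform with omega^-1, then scale by n^-1.
    c_twisted = ntt(c_hat, pow(omega, -1, q), q)
    n_inv = pow(n, -1, q)
    psi_inv = pow(psi, -1, q)
    return [c * n_inv * pow(psi_inv, i, q) % q for i, c in enumerate(c_twisted)]


def schoolbook_negacyclic(a, b, q):
    """O(n^2) reference multiplication in Z_q[x]/(x^n + 1), used only to check the NTT path."""
    n = len(a)
    c = [0] * n
    for i in range(n):
        for j in range(n):
            k = i + j
            if k < n:
                c[k] = (c[k] + a[i] * b[j]) % q
            else:
                c[k - n] = (c[k - n] - a[i] * b[j]) % q  # x^n wraps to -1
    return c


# Toy parameters (assumed for illustration): 3 has order 16 modulo 17.
N, Q, PSI = 8, 17, 3
a = [1, 2, 3, 4, 5, 6, 7, 8]
b = [8, 7, 6, 5, 4, 3, 2, 1]
assert negacyclic_mul(a, b, PSI, Q) == schoolbook_negacyclic(a, b, Q)
```

The powers of `psi` and `omega` are recomputed on every call here; caching them once per parameter set is the kind of saving a pre-computed twiddle table provides.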