Adaptive Orchestration for Large-Scale Inference on Heterogeneous Accelerator Systems Balancing Cost, Performance, and Resilience Paper • 2503.20074 • Published Mar 25 • 6
Trained on AWS Trainium Collection Collection of models on Hugging Face that have been trained on AWS Trainium. Learn more here: https://huggingface.co/docs/optimum-neuron/index • 7 items • Updated May 7, 2024 • 10