feat: minimal FastAPI app for Llama via HF Inference Endpoint; Dockerfile + requirements 02a6500 harismlnaslm commited on 19 days ago