API usage limits on free plan

Serega61 · October 30, 2025, 9:44am

Hello, everyone!

I am interested in limits on one single request. I am trying actually simple project: sending request with prompt + text, where I asks to create tests based on the text. So how many symbols max can be in single request? sorry if it is dumb question…
Thanks in advance!

John6666 · October 30, 2025, 10:13am

The count is based on the number of API calls, not the number of tokens passed to the API, so there are almost no restrictions on the number of tokens. However, the Free Plan has very few API calls available…

Here’s the per-call limit on the Free plan:

No plan-specific character cap. One request is bounded by the model’s context window (tokens) and the server HTTP body size. The plan only changes credits and rate limits, not per-request size. (Hugging Face)
HTTP body cap: Hugging Face’s default payload limit is ~2,000,000 bytes on both TGI and TEI. Larger bodies return 413. (Hugging Face)
Token cap: Your input tokens + requested output tokens must fit the model’s context. TGI also exposes a --max-input-tokens guard. (Hugging Face)
Rough “symbols” math: 1 token ≈ 3–4 English characters.
- With a 128k-token model (e.g., Qwen2.5 or Llama-3.1), you can usually fit on the order of ~380k–500k characters of input if you reserve some tokens for output. Always check the model card. (Hugging Face)

Bottom line: On Free, send as much as fits within the model’s token window and under ~2 MB JSON body. If you hit 413, shrink the payload; if you hit context limits, shorten or chunk the text. (Hugging Face)

system · October 31, 2025, 5:03am

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Unlimited API usage for models Beginners	4	5711	May 7, 2021
Default gpt-j output length Beginners	0	368	April 23, 2022
How does the GPT-J inference API work? Beginners	5	764	October 8, 2021
How to increase tokens text generation API Intermediate	1	759	August 28, 2022
How to set minimum length of generated text in hosted API Beginners	2	1603	March 10, 2021

API usage limits on free plan

Related topics