- Jan 2025 · Rung 2 · Soft meter
- Jan 2025 · Rung 3 · Hard allotment: Inference Providers credits bolted onto seats
- Mar 2025 → now · Rung 4 · Metered overage: pay-as-you-go enabled via Billing API above credits
Moved across the continuum this window — pattern: Incumbent bolt-on.
The short version
A usage meter was bolted onto a seat you already pay for, so the subscription is no longer the whole bill.
The verdict
In January 2025, Hugging Face's seat products bundled inference as a soft-metered benefit: PRO at $9/mo was sold on 'higher rate limits for serverless inference' and gated by rate limits rather than a billing meter. On January 28, 2025, it launched Inference Providers and bolted an explicit credit meter onto those seats ($2/mo of credits for PRO), a hard-allotment framing. On March 11, 2025, it enabled pay-as-you-go through its Billing API, so usage above the included credits is charged to the account. That is metered overage, and users reported sharp effective-cost jumps once past the included credits. Today it sits firmly at metered overage: small included credits with auto-recharging overage, plus usage-based Spaces and Endpoints compute that auto-bills above thresholds with no hard cap.
Current pricing snapshot
As published on Hugging Face Hub's own pages · captured Jun 12, 2026. Unit: inference credits (USD).
| Plan | Price / mo | Included credits / mo | At zero credits | Top-up vs plan |
|---|---|---|---|---|
| PRO | $9 | 2 | Auto overage | Yes · price hidden |
| Team | $20 | 2 | Auto overage | Yes · price hidden |
| Enterprise | $50 | 2 | Auto overage | Yes · price hidden |
- Credit ↔ dollar: Pegged (one credit is a fixed $ amount)
- Credit burn rate: Varies by model / action
- Model choice: You pick the model
- Free tier: 0.1 credits per month
- Billing: Monthly only
- Included credits: Unstated
- Purchased credits: Unstated
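To make the snapshot concrete, here is a minimal sketch of how a monthly bill could add up under metered overage. It rests on assumptions the pricing page does not confirm: that one credit equals $1 (the table's unit is "inference credits (USD)", but the top-up price is hidden) and that overage is billed at face value with no markup. The names `PLANS`, `USD_PER_CREDIT` and `monthly_bill` are hypothetical and not part of any Hugging Face API.

```python
# Hypothetical sketch: seat price plus metered overage above included credits.
# Plan prices and included credits are taken from the snapshot table.
PLANS = {
    "PRO": {"seat_usd": 9.0, "included_credits": 2.0},
    "Team": {"seat_usd": 20.0, "included_credits": 2.0},
    "Enterprise": {"seat_usd": 50.0, "included_credits": 2.0},
}

# Assumption: 1 credit == $1 and overage is charged at face value.
USD_PER_CREDIT = 1.0


def monthly_bill(plan: str, credits_used: float) -> float:
    """Return the estimated monthly charge in USD for one seat."""
    p = PLANS[plan]
    # Usage inside the allotment costs nothing extra; only the excess is billed.
    overage = max(0.0, credits_used - p["included_credits"])
    return round(p["seat_usd"] + overage * USD_PER_CREDIT, 2)


if __name__ == "__main__":
    for used in (1.5, 2.0, 6.38):
        print(f"PRO, {used} credits used: ${monthly_bill('PRO', used):.2f}")
```

The point of the sketch is the shape of the bill: it is flat up to the included credits and then grows linearly with usage, with no ceiling, which is what "Auto overage" in the table implies.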
What changed
- Mar 11, 2025 · Policy change · HF enabled pay-as-you-go via its Billing API for providers fal, Novita and HF-Inference: usage beyond the included monthly credits is charged to your HF account, turning the credit allotment into auto-billed metered overage.
- Mar 1, 2025 · Credit burn-rate repricing · After PAYG went live, users reported sharp effective-cost jumps once over the included credits — e.g. ~10x per-request increase (690 requests = $6.38 vs prior 569 requests = $0.54) and image inference rising from ~$0.005 to ~$0.05 each starting March 24, 2025 — surfacing the metered-overage bite.
- Jan 28, 2025 · Meter adopted · Hugging Face launched Inference Providers (fal, Replicate, SambaNova, Together AI) on the Hub, introducing explicit monthly inference credits — PRO users get $2/month of credits — replacing the old rate-limited 'included serverless inference' with a credit-allotment model.
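The "~10x" figure in the user reports above can be checked from the numbers quoted. The sketch below is plain arithmetic on those reported values, not official pricing data; the helper name `per_request_usd` is hypothetical.

```python
# Arithmetic check on the user-reported figures quoted in the changelog.
# These are anecdotal reports, not published prices.

def per_request_usd(total_usd: float, requests: int) -> float:
    """Average cost of one request in USD."""
    return total_usd / requests


before = per_request_usd(0.54, 569)   # prior period: 569 requests for $0.54
after = per_request_usd(6.38, 690)    # later period: 690 requests for $6.38
text_ratio = after / before

# Image inference: roughly $0.005 per image rising to roughly $0.05.
image_ratio = 0.05 / 0.005

if __name__ == "__main__":
    print(f"per request before: ${before:.5f}")
    print(f"per request after:  ${after:.5f}")
    print(f"increase: {text_ratio:.2f}x (text), {image_ratio:.1f}x (image)")
```

Both reports land at about an order of magnitude: the per-request cost works out to a little under 10x, and the image price to 10x.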
Where it's headed
Sourced from Hugging Face Hub's official pricing pages and archived snapshots. Primary: pricing page. Method & limits: Methodology & data.