def:xsr/infer≈ Bedrock / Together
Infer
Host any open model, or call hosted ones. Per-token billing, capability-gated.
Defaults
Model whitelist
Allowed model identifiers (open-weight catalog).
Max tokens
Per-request output ceiling.
Batching
Allow request batching for higher throughput.