def:xsr/infer≈ Bedrock / Together

Infer

Host any open model, or call hosted ones. Per-token billing, capability-gated.
About Infer
Defaults
Model whitelist

Allowed model identifiers (open-weight catalog).

all open
Max tokens

Per-request output ceiling.

4,096
Batching

Allow request batching for higher throughput.

on
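
As a rough illustration of how the three defaults above might be applied to an incoming request, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the names InferDefaults, is_model_allowed, and clamp_max_tokens are hypothetical and are not part of any documented Infer API. The sketch also assumes that "all open" means no identifier is excluded (catalog membership itself is not modeled), and that a request asking for more than the ceiling is clamped rather than rejected.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class InferDefaults:
    """Hypothetical container for the defaults listed above."""

    # None stands in for "all open": no model identifier is excluded.
    model_whitelist: Optional[FrozenSet[str]] = None
    # Per-request output ceiling (4,096 in the defaults above).
    max_tokens: int = 4096
    # Request batching for higher throughput ("on" in the defaults above).
    batching: bool = True


def is_model_allowed(defaults: InferDefaults, model_id: str) -> bool:
    """Return True if model_id passes the whitelist (assumed semantics)."""
    if defaults.model_whitelist is None:
        return True
    return model_id in defaults.model_whitelist


def clamp_max_tokens(defaults: InferDefaults, requested: Optional[int]) -> int:
    """Cap a requested output length at the ceiling.

    Clamping is an assumption; a real service might reject the request instead.
    """
    if requested is None:
        return defaults.max_tokens
    if requested < 1:
        raise ValueError("requested max tokens must be a positive integer")
    return min(requested, defaults.max_tokens)


if __name__ == "__main__":
    defaults = InferDefaults()
    print(is_model_allowed(defaults, "example-open-model"))
    print(clamp_max_tokens(defaults, 10_000))
```

Using None for the unrestricted case keeps "all open" distinct from an explicitly empty whitelist, which in this sketch would allow nothing.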