Skip to content

RemoteGPU Docs

RemoteGPU docs are organized around three product paths: hosted applications, direct inference API calls, and Kubernetes resources.

Choose a starting point

Application

Use Application when you want to run a hosted application from the console and avoid managing Kubernetes resources or direct API calls.

Start with Application overview, or go directly to Comfy to run a hosted workspace.

Inference API

Use Inference API when you want to generate text or images from your application, backend, script, or automation with HTTP requests.

Start with Inference API overview, then use Text or Image for the first request flow.

Kubernetes

Use Kubernetes when your team wants namespace access and Kubernetes resources such as Deployment, StatefulSet, Job, Service, CronJob, and Ingress.

Start with Kubernetes overview. Use API keys when you need kubectl access.

Core documentation paths

Application:

  • Application overview: learn when to use hosted applications instead of API or Kubernetes workflows
  • Comfy: run and manage a hosted Comfy workspace

Inference API:

  • Inference API overview: learn when to use direct API access
  • Text: send OpenAI-compatible chat-completion requests
  • Image: send image-generation requests and track results
  • API keys: create keys for inference and Kubernetes exec access

Kubernetes:

Service endpoints

EndpointPurpose
console.remotegpu.aiCustomer console
api.remotegpu.aiRemoteGPU API host
inference.remotegpu.aiRemoteGPU Inference API host
docs.remotegpu.aiCustomer documentation

RemoteGPU customer documentation