Changelog

All notable changes to Spheron AI will be documented in this file.

Track new features, improvements, bug fixes, and breaking changes across all releases. This changelog is automatically updated with each deployment to ensure you always have the latest information about platform updates and enhancements.

v1.21.6 • 2026-06-04

Out-of-Stock Notify-Me for Spot GPUs & Cleaner GPU Cards

Out-of-stock notify-me now covers spot GPUs too, so a sold-out spot configuration sends you to the Notify CTA instead of a dead form. The same GPU no longer splits into duplicate cards, and the order summary now shows GPU memory.

Notify-Me for Spot
  • Spot Out-of-Stock Cards: When a spot GPU is fully out of stock, the deploy wizard now hides Region and OS and shows the same Notify CTA you already get for dedicated GPUs, instead of dropping you into a half-configured flow that reported "No OS options available for this region"
  • Surfaces Right Away: Spot configurations that had already sold out now show the Notify CTA immediately, with no waiting for the next stock refresh
Cleaner GPU Cards
  • No More Duplicate Cards: The same GPU no longer fragments into duplicate cards or shows a doubled form-factor suffix. Listings like "A100 80G SXM4 SXM4", or "A30" and "A30 PCIE" appearing as two separate cards, are now collapsed into one
  • Sold-Out Counts Stay Actionable: When a specific GPU count on Verda is out of stock, the wizard keeps the Notify-me CTA reachable and hides the storage selector, instead of dropping you into a dead configuration flow
  • GPU Memory on the Order Summary: The order summary now shows each GPU's VRAM next to its name
  • Accurate Availability Header: The available and guaranteed header copy now tracks live capacity, so it agrees with the GPU Count and Region sections

v1.21.5 • 2026-06-02

Accurate GPU Pricing & Broader SSH Key Support

GPU prices in the deploy wizard and the API now reflect the right offer, with no sold-out offer undercutting the best available price. Deploys with ECDSA and FIDO security keys now work on Massed Compute and Verda.

Accurate Pricing
  • Correct Spot and Dedicated Counts: Spot and dedicated offers are no longer double-counted against each other, so the lowest spot price, lowest dedicated price, and availability counts are accurate
  • Best Available Price Ignores Sold-Out Offers: The best-price figure on a GPU card now comes from in-stock offers only, so a stale out-of-stock price can no longer win and mislead you
  • Cleaner Public API: GET /api/gpu-offers no longer returns internal out-of-stock offers. Existing fields are unchanged, so existing integrations keep working
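As a rough illustration of the best-price rule above, the sketch below picks the lowest price from in-stock offers only. The field names `price` and `available` are assumptions made for this example; they are not taken from the GET /api/gpu-offers response.

```python
from typing import Optional


def best_available_price(offers: list[dict]) -> Optional[float]:
    """Return the lowest price among in-stock offers, or None if all are sold out.

    `price` and `available` are hypothetical field names used for illustration.
    """
    in_stock_prices = [o["price"] for o in offers if o.get("available", False)]
    return min(in_stock_prices) if in_stock_prices else None


# Hypothetical sample data: the cheapest offer is sold out and must not win.
sample_offers = [
    {"provider": "a", "price": 1.10, "available": False},
    {"provider": "b", "price": 1.45, "available": True},
    {"provider": "c", "price": 1.60, "available": True},
]
```

With the sample data, the sold-out 1.10 offer is ignored and the in-stock 1.45 offer becomes the best price.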
Broader SSH Key Support
  • ECDSA & FIDO Keys Now Work: Deploying to Massed Compute or Verda with an ECDSA (ecdsa-sha2-...) or FIDO security key (sk-ssh-ed25519@openssh.com, sk-ecdsa-...) now succeeds. Previously only RSA and Ed25519 keys were accepted and other types failed with "SSH Key does not exist". Spheron AI, Spheron ES, Sesterce, and Voltage Park already accepted every key type
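To check which family a public key belongs to before uploading it, a small client-side helper along these lines may help. It only inspects the key-type prefix of an OpenSSH public key line; the function name and the returned labels are hypothetical, and the sketch does not validate the key material itself.

```python
def ssh_key_family(public_key: str) -> str:
    """Classify an OpenSSH public key line by its leading key-type field."""
    parts = public_key.strip().split()
    if not parts:
        raise ValueError("empty public key line")
    key_type = parts[0]
    # FIDO security keys use the "sk-" prefix (for example sk-ssh-ed25519@openssh.com).
    if key_type.startswith("sk-"):
        return "fido"
    if key_type.startswith("ecdsa-sha2-"):
        return "ecdsa"
    if key_type == "ssh-ed25519":
        return "ed25519"
    if key_type == "ssh-rsa":
        return "rsa"
    return "unknown"
```

For example, a line starting with `ecdsa-sha2-nistp256` is classified as `ecdsa`, and one starting with `sk-ssh-ed25519@openssh.com` as `fido`.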

v1.21.4 • 2026-06-02

Billing: Cost-Saving Intelligence & Spend Visibility

The billing page is now an actionable cost dashboard. See where you can save against other providers and the big hyperscalers, spot idle resources, and track spend trends and anomalies at a glance.

Cost-Saving Signals
  • Cross-Provider Price Comparison: Each deployment row shows whether you are on the cheapest provider for that GPU, or how much more you are paying per hour than the cheapest option (+$X/hr vs <provider>)
  • Hyperscaler Comparison: A line under each deployment compares your Spheron rate against AWS, GCP, and Azure list prices, with the percentage you are saving
  • Long-Running and Idle Flags: An amber chip marks deployments running 7+ days, a callout flags volumes left unattached for 30+ days, and volume rows now show the cost accrued so far
Spend Visibility
  • Daily Spend Trend: A sparkline of daily spend across the selected period, so trends are visible at a glance
  • Spend Breakdown: Mini-charts break your spend down by GPU model, by provider, and by spot versus dedicated
  • Blended Rate and Month Projection: New cards show your blended cost per GPU per hour and a projected total for the current month
  • Top Spenders and Reserved Hint: A top-three deployments-by-spend callout, plus a hint when a GPU type's spend suggests a reserved commitment could save you money
Discounts and Anomalies
  • Discount Savings: A running total of what your active discounts have saved you across the period
  • Spend Anomaly Alert: An amber banner when a single day's spend spikes far above your recent average
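The changelog does not state how these figures are computed. The sketch below shows one plausible reading, with every formula an assumption: blended rate as total spend divided by GPU-hours, month projection as a linear extrapolation of spend to date, and an anomaly as a day that exceeds a multiple of the preceding days' average. The `multiplier` default of 3.0 is an illustrative threshold, not a documented one.

```python
import calendar
from datetime import date
from statistics import mean


def blended_rate(total_spend: float, gpu_hours: float) -> float:
    """Assumed definition: cost per GPU per hour = spend / GPU-hours."""
    return total_spend / gpu_hours if gpu_hours else 0.0


def projected_month_total(spend_to_date: float, today: date) -> float:
    """Assumed definition: extrapolate the average daily spend to the full month."""
    days_in_month = calendar.monthrange(today.year, today.month)[1]
    return spend_to_date / today.day * days_in_month


def is_spend_anomaly(daily_spend: list[float], multiplier: float = 3.0) -> bool:
    """Flag the latest day if it exceeds `multiplier` times the earlier average."""
    if len(daily_spend) < 2:
        return False  # not enough history to compare against
    *history, latest = daily_spend
    return latest > multiplier * mean(history)
```

A linear projection is the simplest choice; a real dashboard might weight recent days more heavily or account for running deployments.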

v1.21.3 • 2026-05-29

Out-of-Stock Notify-Me Cards, Interconnect Display & UX Polish

Sold-out GPUs now stay on screen as notify-me cards instead of disappearing, the GPU spec strip shows the interconnect bus, and a few rough edges around volume errors and confirmation dialogs are smoothed out.

Out-of-Stock Notify-Me Cards
  • Sold-Out GPUs Stay Visible: Across Verda, Massed Compute, Spheron ES, Sesterce, and Voltage Park, out-of-stock GPUs now appear as dimmed "Out of stock" cards with a Notify-me CTA, instead of vanishing from the wizard
  • Sold-Out Offers Carry Forward: Offers that briefly disappear upstream are remembered for up to 7 days, so a sold-out GPU still surfaces as a notify-me card rather than dropping off the marketplace entirely
Interconnect Display
  • Bus on the Spec Strip: GPU cards now show the interconnect bus (NVLink, SXM, or PCIe) on the spec strip, so you can pick the interconnect that matches your workload
  • Clearer NVLink Labels: NVLink-bridged GPUs are flagged consistently, and NVL models no longer carry a redundant "PCIE" suffix in their title
  • RTX PRO 6000 Corrected to Blackwell: The RTX PRO 6000 now reads as Blackwell architecture, so you install the right driver
UX Polish
  • Dialogs Behave as Expected: Confirmation dialogs now close when you click outside them, like every other modal in the app. They still stay open while an action is in progress, so you cannot dismiss one mid-operation

v1.21.2 • 2026-05-26

Billing & Volume Reliability

Auto top-up now gets a chance to recover your balance before any instance is terminated for low credit, and a terminated instance's volumes return to Available instead of staying stuck, so you can reattach them.

Billing
  • Auto Top-Up Recovers Before Termination: When your balance runs low, auto top-up is given time to charge your saved card and restore credit before instances are terminated. A momentary dip no longer costs you a running deployment
Volumes
  • Volumes No Longer Stick to Terminated Instances: When an instance is torn down, its volumes return to Available so you can reattach them right away, instead of staying marked Attached to an instance that no longer exists

v1.21.1 • 2026-05-24

Confidential Compute on Verda RTX PRO 6000

Verda's RTX PRO 6000 now offers a Confidential Compute variant that keeps your VM and GPU memory encrypted while in use.

New
  • Confidential Compute Variant: Verda's RTX PRO 6000 now lists a separate Confidential Compute (CC) option alongside the standard card, so you can choose it directly in the deploy wizard
  • Confidential Compute Notice: When you select the CC variant, the order summary explains that VM and GPU memory are encrypted in use and isolated from the hypervisor and the cloud provider itself

v1.21.0 • 2026-05-21

Mobile-Responsive Marketplace: Sidebar Drawer, Fluid Layouts, and Polished Wizards

The marketplace is now usable on a phone. The desktop sidebar collapses behind a hamburger drawer on small screens, the deploy wizard reorders so the order summary stays in view above the form, instance and volume detail panels open full-screen on mobile, and every page (billing, API docs, volumes, settings, top nav) has been retuned so cards, headers, and dialogs reflow without horizontal overflow.

Mobile Navigation
  • Hamburger Sidebar Drawer: On phones and small tablets the persistent sidebar is replaced by a hamburger button in the top nav. Tapping it slides a full nav drawer in from the left with an overlay dim. Tapping the overlay, hitting a nav item, or navigating away closes it. Background scrolling is locked while the drawer is open
  • One Source of Truth for Nav Links: Desktop sidebar and mobile drawer now share the same nav definition, so a route added in one shows up in both
  • Resize-Aware: Resizing the window from mobile to desktop closes the drawer, restores the desktop sidebar, and re-applies the layout without a reload
Deploy Wizard on Mobile
  • Order Summary at the Top: On mobile the order summary now stacks above the configuration form (instead of being pinned to the right rail) so the running price and selected GPU stay visible without scrolling back up
  • Full-Width Step Buttons: Next, Back, and Deploy buttons go full-width on mobile. The Back/Deploy pair stacks vertically with Deploy on top, so the primary action is always under your thumb
  • Truncating GPU Cards: GPU offer cards truncate long titles and the results counter clips its parenthetical detail on narrow screens, so cards no longer push the layout sideways
Instance and Volume Detail Drawers
  • Full-Screen on Mobile: Tapping an instance or volume card now opens its detail panel as a full-screen sheet (instead of a fixed-width side drawer that ran off the viewport). The panel reverts to the familiar right-side drawer on tablet and desktop
  • Safer Height on iOS: Detail drawer height now accounts for Safari's dynamic address bar, so the bottom of the content is no longer covered
Billing, API Docs, and Volumes
  • Billing Page Reflow: Balance cards, deposit packages, and the mini-metrics grid now stack cleanly on small screens, step up to a tighter grid on tablets, and keep the full layout on desktop. The tabs row scrolls horizontally on narrow screens instead of wrapping, and the saved-card section reflows for small viewports
  • API Docs Reflow: Endpoint accordions, curl commands, the base URL section, and the quick-test panel all reflow on mobile without horizontal scroll. Long URLs and code blocks now wrap or scroll inside their cards
  • Volumes and Instances Pages: Page headers, load-more buttons, attach/detach dialogs, and the create-volume wizard reflow for mobile widths. The load-more button is full-width on mobile and capped to a comfortable size on tablets and up
Miscellaneous Polish
  • Top Nav: Logo, balance display, team switcher, deposit dialog, and user menu all retuned for narrow viewports so the bar stays single-line without overflow
  • Feature Announcement Modal: Modal width adapts to the viewport on mobile instead of bleeding off the edge
  • Wizard Headers: Pipe separators between header chips are hidden on narrow screens so you no longer see orphaned characters
  • Reserved GPU Wizard, SSH Keys, Team Overview, GPU Alerts, Usage Analytics: All retuned to flow into a single column on mobile with appropriately stacked actions and headers
Accessibility
  • Drawer Trigger: The hamburger button now announces whether the drawer is open or closed for screen readers
  • Defensive Rendering: The credit-package grid now guards against an empty package list, so an empty config never throws while the page is mid-render

v1.20.0 • 2026-05-15

GPU Availability Alerts, Smarter Stock Signals & Faster Capacity Refresh

Never refresh the marketplace waiting for a GPU again. Subscribe to a sold-out GPU configuration directly from the deploy wizard and Spheron emails you the moment that exact GPU, count, and instance type is back in stock. The wizard now lets you navigate sold-out variants instead of hiding them, Spheron ES regions surface live stock signals before you commit, Spheron AI multi-GPU placements stay hidden while supply is too thin to deploy, and stock-out errors anywhere on the marketplace clear from the catalogue immediately instead of trailing the next user.

New Features
  • GPU Availability Alerts: Subscribe to a sold-out GPU configuration directly from the configure step in the deploy wizard. You receive an email the moment that exact configuration (GPU type, count, provider brand, spot or dedicated) is back in stock. An optional price ceiling keeps the email scoped to your budget so you only get pinged when the offer is actually affordable
  • Alerts Tab in Settings: A new Alerts tab under Settings lists every alert you have ever created. Unsubscribe a pending alert with one click, or Resubscribe a previously notified alert if you missed the window the first time
  • Brand-Wide Alert Coverage: An alert scoped to a brand fires when any region under that brand has matching capacity. You do not have to pick a specific region to subscribe; one alert covers the entire brand
  • Navigate Sold-Out Configurations in the Wizard: Sold-out GPU configurations now appear in the deploy wizard as dimmed cards instead of being suppressed entirely. Click through to inspect the count, networking, and provider options that are temporarily unavailable, then either subscribe to an alert or pick a live alternative without restarting the flow
UX Improvements
  • Per-Region Stock Signals on Spheron ES: The region picker in the deploy wizard now shows a tiered stock pill (high, healthy, low, critical) next to each Spheron ES region, so you can see how thin supply is before committing to a region
  • Spheron ES Spot Listings Hide Low-Stock Configurations: Spheron ES spot offers with too little stock to deploy reliably are no longer listed. This removes the failure path where a listing claimed availability but the deploy refused to allocate
  • Spheron AI Multi-GPU Stock Floor: Spheron AI 4x and 8x configurations are only listed when the region has enough stock to fulfil them. Below the floor the configurations are hidden, so you no longer pick a placement that fails at create time
  • Collapsible Sidebar with Persisted State: The navigation sidebar now collapses to an icon rail and remembers its state across reloads. Tooltips appear on hover when collapsed, so you can reclaim horizontal space for the dashboard, instance cards, and the deploy wizard without losing navigation
  • Reserved GPU Catalogue Updates: The reserved GPU catalogue now includes R100 and GH200, and the GPU memory field is optional on request submissions
Reliability
  • Faster Capacity Detection on Dedicated Offers: Dedicated GPU availability now refreshes every 5 minutes, so the deploy wizard and dashboard reflect real stock within minutes of an inventory change. When a pending alert exists, the marketplace forces an early refresh so the email lands while capacity is still live
  • Cache Refresh on Stock-Out Errors: Deploy failures that signal out-of-stock now invalidate the marketplace's offer cache immediately. The next user no longer sees a just-sold-out GPU listed as available while the cache walks down its TTL
  • Availability Alert Emails: Notification emails include a per-region capacity breakdown with brand labels, use copy tailored to spot or dedicated offers, and ship with both HTML and plain-text bodies so they render correctly in every client
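The alert matching described in this release can be sketched as follows. This is a minimal illustration, not the platform's implementation: the `Alert` and `Offer` classes and all their field names are hypothetical. It encodes the stated rules: an alert matches on GPU type, count, brand, and spot or dedicated; any region under the brand counts; and an optional price ceiling filters out offers that are too expensive.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Alert:
    gpu_type: str
    gpu_count: int
    brand: str
    instance_type: str  # "SPOT" or "DEDICATED"
    max_price: Optional[float] = None  # optional price ceiling


@dataclass(frozen=True)
class Offer:
    gpu_type: str
    gpu_count: int
    brand: str
    instance_type: str
    region: str
    price: float
    in_stock: bool


def alert_fires(alert: Alert, offers: list[Offer]) -> bool:
    """True if any in-stock offer in any region of the brand satisfies the alert."""
    return any(
        o.in_stock
        and o.gpu_type == alert.gpu_type
        and o.gpu_count == alert.gpu_count
        and o.brand == alert.brand
        and o.instance_type == alert.instance_type
        and (alert.max_price is None or o.price <= alert.max_price)
        for o in offers
    )
```

Because the region is never compared, a single alert covers every region under the brand.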

v1.19.0 • 2026-05-11

Spheron ES (Spheron Extra Supply) Provider Launch, Spot Reliability & Faster Deployment Status

Meet Spheron ES (Spheron Extra Supply), a new GPU provider on the marketplace with B200 SXM6, H100 SXM5, H200 SXM5, L40S, and RTX Pro 6000 across Finland, France, Middle East, and the United States. Both on-demand and spot are supported, and persistent volumes ship on day one with multi-instance attach, in-place resize, and a 256 TiB ceiling per volume. Alongside the launch, spot deployments now stay correctly labelled across the dashboard, preempted instances are reaped in seconds instead of minutes, and freshly created deployments flip to "Running" the moment the VM is live.

New Provider
  • Spheron ES (Spheron Extra Supply): A new GPU provider is now live on the marketplace. Inventory currently spans B200 SXM6, H100 SXM5, H200 SXM5, L40S, and RTX Pro 6000
  • On-Demand and Spot: Both instance types are available. Spot offers the same GPU configurations at a lower price with no uptime guarantee
  • Multiple Regions: Spheron ES regions span Finland (EU North 1), France (EU West 1), Middle East (ME West 1), and the United States (US Central 1). They show up in the deployment and volume wizards, and in regions listings on the API
  • Country Filter Picks Up New Regions: The country filter in the deployment wizard correctly groups EU North under Finland, EU West under France, and ME West under Middle East
  • Tier 3 DC Compliance Badge: Spheron ES is flagged with the same Tier 3 DC compliance badge as the rest of the family
Persistent Volumes on Spheron ES
  • Volumes on Spheron ES: Spheron ES is a supported volume provider from day one. Create and manage persistent storage from the /volumes dashboard with the same UX as the existing volume providers
  • Multi-Instance Attach: A single Spheron ES volume can be attached to multiple instances at once, so a dataset can be fanned out across a cluster without duplication
  • Attach at Deploy Time Without Restart: Pick a Spheron ES volume in the deployment wizard and it is wired into the instance before the VM boots. No post-boot mount step, no restart
  • Hot-Attach and Hot-Detach Supported: Attach or detach a volume on a running instance from the dashboard or API. The instance briefly stops and restarts to apply the change; data is preserved
  • In-Place Resize: Grow a Spheron ES volume after creation without recreating it or migrating data (shrinking is not supported)
  • Up to 256 TiB per Volume: Spheron ES volumes can be created up to 256 TiB in a single volume (sizeInGb: 262144 on the API). The size slider in the wizard adapts to match
  • Volume Rules Dialog for Spheron ES: The "Volume Rules" dialog accessible from the create-volume and attach flows now covers Spheron ES, spelling out the at-deploy attach, restart-on-hot-attach behaviour, multi-attach semantics, resize policy, and the no-rename restriction
  • Mount Tag on the Volume Card: Each Spheron ES volume card now shows the mount tag for shared-filesystem volumes with a one-click copy, so you can grab it without opening the detail panel
Reliability & UX
  • Spot Type Preserved End-to-End: Spot deployments created via the API or dashboard are now consistently labelled "Spot" on the instances page and across deployment responses. Previously a spot VM could fall through to "Dedicated" in the dashboard when the client did not pass instanceType. Commission, billing, and the instance card badge all derive the type from the offer catalog now
  • Faster Status Flip on New Deployments: Newly created deployments now flip from "Deploying" to "Running" within seconds of the VM coming online, instead of the previous 30-90 second wait. The runtime counter on the dashboard tracks usage accurately from the first second
  • Faster Preemption Detection: Spot instances that get preempted are now detected within ~10 seconds (one monitor cycle), down from the previous ~30 seconds. This shrinks the window where you might still be charged for a preempted VM's allocated disk and IP
  • Orphan VM Cleanup for Failed Spot Starts: If a spot VM is preempted or never reaches Running (for example when capacity is exhausted mid-create), the upstream VM is now reliably destroyed during cleanup. Previously the deployment was marked terminated on the marketplace side but the VM could linger and continue billing for disk and IP allocation
  • Safer Concurrent Volume Operations: Attaching and detaching volumes back-to-back on the same instance is now race-safe. The instance monitor no longer flips a healthy VM to "Terminated" if it observes a transient stopped state during a hot-attach or hot-detach cycle
API
  • New provider: "spheron-es" Value Accepted: POST /api/volumes and POST /api/deployments now accept spheron-es alongside the existing brand values. Use GET /api/gpu-offers?providers=spheron-es (note: providers is plural and comma-separated) to list GPU offers, and GET /api/volumes/regions?provider=spheron-es to discover regions at runtime
  • Storage Pricing on Spheron ES: GET /api/volumes/pricing?provider=spheron-es returns the current hourly rate per GB. The endpoint accepts an optional region parameter for forward compatibility, but Spheron ES rates are currently flat across all regions
  • Multi-Instance Attach on POST /api/volumes/{volumeId}/attach: Spheron ES volumes can be attached to multiple deployments concurrently. Each successful call adds the deployment to attachedToDeploymentIds; the same volume id can be reused on a different deployment without detaching first
  • volumeIds Wired in at Create: When volumeIds is passed to POST /api/deployments for a Spheron ES deployment, the volumes are mounted as part of the create call rather than after the VM reaches Running. The response returns with the volume already marked attached
  • Volume Size Ceiling for Spheron ES: POST /api/volumes accepts sizeInGb up to 262144 on Spheron ES (256 TiB). Read maxSizeGb from GET /api/volumes/regions?provider=spheron-es for the current cap
  • Correct instanceType on Deployment Responses: GET /api/deployments and GET /api/deployments/{deploymentId} now return the correct instanceType for every deployment even when the client did not pass the field on create. The value is derived from the offer catalog and resolves to one of SPOT, DEDICATED, BARE_METAL, or CLUSTER. Existing fields are unchanged so existing integrations continue to work
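A client might assemble the Spheron ES requests listed above roughly as follows. This sketch only builds URLs and a request body and makes no network calls. `BASE_URL` is a placeholder because the changelog does not state the API host, and the payload is limited to `provider`, `region`, and `sizeInGb`; any other fields the endpoint requires are not covered here.

```python
from urllib.parse import urlencode

# Placeholder host: the changelog does not state the API base URL.
BASE_URL = "https://api.example.invalid"
SPHERON_ES_MAX_SIZE_GB = 262144  # 256 TiB ceiling, per the changelog


def gpu_offers_url(providers: list[str]) -> str:
    """Build the offers URL; `providers` is plural and comma-separated."""
    query = urlencode({"providers": ",".join(providers)}, safe=",")
    return f"{BASE_URL}/api/gpu-offers?{query}"


def volume_regions_url(provider: str) -> str:
    """Build the regions URL; here `provider` is singular."""
    query = urlencode({"provider": provider})
    return f"{BASE_URL}/api/volumes/regions?{query}"


def volume_payload(region: str, size_in_gb: int,
                   max_size_gb: int = SPHERON_ES_MAX_SIZE_GB) -> dict:
    """Build a POST /api/volumes body, rejecting sizes above the cap."""
    if not 0 < size_in_gb <= max_size_gb:
        raise ValueError(f"sizeInGb must be between 1 and {max_size_gb}")
    return {"provider": "spheron-es", "region": region, "sizeInGb": size_in_gb}
```

In practice, pass the `maxSizeGb` value read from the regions endpoint as `max_size_gb` instead of relying on the hard-coded default, since the cap may change.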

v1.18.0 • 2026-05-07

More Spheron AI GPUs, More Regions, Per-Region Storage Pricing

Spheron AI now ships with more GPU choices, more regions, and a larger storage ceiling. The volume wizard adapts size limits and price to the region you pick, region names read clearly across the dashboard, deployment failure messages are easier to act on, and Verda and Voltage Park now show the same post-create rules modal that Sesterce and Spheron AI already had.

Expansion
  • More Spheron AI GPUs: Spheron AI offers now span a wider GPU lineup, including new H100 (SXM5, NVL, PCIE), A100 80GB (NVL, PCIE), A6000, RTX 4090, RTX 5090, L40, and L40S options. They appear in the deployment wizard and on GET /api/gpu-offers automatically with no client changes
  • More Spheron AI Regions: Two new regions are available alongside Canada 1 and Norway 1: Montreal 1 and Oslo 1. They show up in the deployment and volume wizards, and in regions listings on the API
  • Larger Volume Ceiling: Spheron AI volumes can now be created up to 50 TB in the new regions (up from 40 TB). The wizard adjusts the size slider's maximum based on the region you pick
UX Improvements
  • Friendlier Region Names: Region picks across the deployment wizard, volume wizard, instance cards, and volume cards now display as "Canada 1", "Norway 1", "Montreal 1", "Oslo 1" instead of raw region codes
  • Country Filter Recognises New Regions: The country filter in the deployment wizard correctly groups the new Montreal region under Canada and Oslo under Norway
  • Per-Region Storage Pricing in the Wizard: When you change region in the create-volume wizard, the displayed hourly rate and the size slider's range update to match that region. No more clicking through and getting surprised at checkout
  • Volume Created Modal for Verda and Voltage Park: Creating a volume on Verda or Voltage Park now shows the same post-create confirmation modal that Sesterce and Spheron AI already had. Each provider's modal summarises that provider's specific attachment and lifecycle rules and offers a one-click "Deploy GPU" shortcut to jump straight into the deployment wizard
  • Cleaner Failure Messages: Deployment failure reasons and error messages across the dashboard, toast notifications, and deployment-failure emails now surface a single clean line instead of dense JSON payloads and status codes. The full original error is still retained internally for support requests
Bug Fixes & Reliability
  • Clearer Out-of-Stock Errors: When a GPU is sold out in the chosen region, the deployment failure message now says so directly instead of returning a generic provider error
  • Friendlier 409 / Conflict Errors: Volume delete and deployment errors that come back as a 409 (the volume is still attached, or it is being released after recent activity) now show an actionable next step instead of a generic conflict message
  • Region Switch Updates Size Cap: Switching to a region with a different storage ceiling now resets the size slider correctly. Previously a stale cap could carry over from the last selected region
  • Correct Volume Size Cap Per Region: The create-volume slider now reads the maximum size of the region you actually picked. Previously the slider showed the brand's widest possible cap (50 TB) on every region, including regions that only support up to 40 TB
  • Consistent Provider Label in Errors: Volume and deployment error toasts always show the Spheron AI label, including in a handful of rarer error paths where the wording previously slipped
  • Custom Deployment Names No Longer Fail at Create Time: Deployments created via the API with a custom name (containing spaces, dots, emoji, or anything beyond letters and digits) no longer fail with a name-validation error at deploy time. Your name is preserved exactly as you sent it on GET /api/deployments and on the dashboard. See the API note below for details
API
  • Per-Region Size and Pricing on GET /api/volumes/regions: Each region entry now includes minSizeGb, maxSizeGb, and hourlyRatePerGb. If you are already listing regions, you no longer need a second call to GET /api/volumes/pricing to render a complete picker. Existing fields (id, name, country_code, types, hasGpuOffers) are unchanged
  • Region-Aware Pricing on GET /api/volumes/pricing: The pricing endpoint now accepts an optional region parameter. Spheron AI rates can vary by region, so pass the region you intend to use for an accurate rate. Calls without region keep returning the provider's default rate, so existing integrations continue to work
  • New Spheron AI Region IDs Accepted: POST /api/volumes and POST /api/deployments now accept MON1 (Montreal 1) and OSL1 (Oslo 1) as region values, alongside the existing CANADA-1 and NORWAY-1. Use GET /api/volumes/regions?provider=spheron-ai to discover the full list at runtime
  • Higher sizeInGb Ceiling: POST /api/volumes accepts sizeInGb up to 51200 on Spheron AI's Montreal and Oslo regions (40960 on Canada and Norway). Read maxSizeGb from GET /api/volumes/regions for the per-region cap
  • Cleaner error Bodies: Error responses across deployment, volume, SSH key, and Kubernetes endpoints now return a short, human-readable message instead of a dense JSON envelope. Existing fields are unchanged; only the message text is tidier
  • Cleaner failureDetails on Deployments: failureDetails on GET /api/deployments and GET /api/deployments/{deploymentId} is now a single readable line on failed deployments. The classification in failureReason (e.g. out-of-stock, 409 conflict) is also more specific so retries and backoffs can branch on it
  • Deployment name is Now Display-Only: The name you pass to POST /api/deployments is treated as a display label only. It is stored verbatim and returned unchanged on GET /api/deployments, GET /api/deployments/{deploymentId}, and the dashboard. No character or length restriction is enforced at create time any more, so names with spaces, dots, slashes, or emoji, or names longer than 63 characters, are accepted. Existing integrations keep working: no request changes are required, and existing names continue to be returned exactly as stored
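Using the per-region fields now returned by GET /api/volumes/regions (minSizeGb, maxSizeGb, hourlyRatePerGb), a picker could clamp the requested size and price it without a second call. The sketch below works on already-parsed region entries. The `maxSizeGb` values follow the caps stated above, while the `minSizeGb` and `hourlyRatePerGb` numbers are made up for illustration.

```python
def clamp_size(region: dict, requested_gb: int) -> int:
    """Clamp a requested size into the region's [minSizeGb, maxSizeGb] range."""
    return max(region["minSizeGb"], min(requested_gb, region["maxSizeGb"]))


def hourly_cost(region: dict, size_gb: int) -> float:
    """Hourly volume cost at the region's per-GB rate."""
    return size_gb * region["hourlyRatePerGb"]


# Illustrative entries: only id and maxSizeGb come from the changelog.
sample_regions = {
    "MON1": {"id": "MON1", "minSizeGb": 10, "maxSizeGb": 51200, "hourlyRatePerGb": 0.0002},
    "CANADA-1": {"id": "CANADA-1", "minSizeGb": 10, "maxSizeGb": 40960, "hourlyRatePerGb": 0.0001},
}
```

Re-running `clamp_size` whenever the region changes mirrors the slider fix described above: a 50 TB request that is valid in Montreal drops to the 40 TB cap when the region switches to Canada.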

v1.17.2 • 2026-05-04

Deployment Response: First-Class gpuType and offerId Fields

Deployment responses now return the canonical GPU type and the source offer ID as their own fields, so you no longer need to parse the legacy gpuModelId composite to render GPU details or correlate a deployment back to a GPU offer.

API
  • New gpuType Field on Deployments: POST /api/deployments, GET /api/deployments, and GET /api/deployments/{deploymentId} responses now include a gpuType field with the canonical GPU type (for example rtx-4090, h100). Prefer this over splitting gpuModelId when rendering the GPU label. The existing gpuModelId field is unchanged so existing integrations continue to work
  • New offerId Field on Deployments: The same three endpoints now also return the offerId used at create time, so you can correlate a deployment back to the entry from GET /api/gpu-offers directly. Existing fields are unchanged
  • No Request Changes Required: Both additions are response-only fields. Clients can opt in when ready and existing consumers keep working as-is
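A client could adopt the new fields incrementally, along the lines of the sketch below. It prefers `gpuType` and falls back to the raw `gpuModelId` string when a response predates the field. The lookup key on the offer side (`offer_key`, defaulting to `offerId`) is an assumption, since the changelog does not name the identifier field on GET /api/gpu-offers entries.

```python
from typing import Optional


def gpu_label(deployment: dict) -> str:
    """Prefer the canonical gpuType; fall back to the legacy gpuModelId as-is."""
    return deployment.get("gpuType") or deployment.get("gpuModelId", "unknown")


def find_offer(deployment: dict, offers: list[dict],
               offer_key: str = "offerId") -> Optional[dict]:
    """Return the offer a deployment was created from, if it is still listed.

    `offer_key` is a hypothetical name for the identifier on offer entries.
    """
    offer_id = deployment.get("offerId")
    if offer_id is None:
        return None
    return next((o for o in offers if o.get(offer_key) == offer_id), None)
```

The fallback deliberately returns the composite `gpuModelId` unparsed, because its format is not described here.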

v1.17.1 • 2026-04-28

API Reference & Referrals Page Refresh

The API reference and referrals pages have been rebuilt to match the v1.17.0 design refresh, the deployment, volume, and reserved GPU wizards have picked up another round of polish, and Spheron AI spot offers now stay fresher in the marketplace.

Design & UX Refresh
  • New API Reference Page: Endpoints are now grouped by category, with short descriptions, cleaner endpoint cards, refreshed request and response examples, and a tighter quick-test panel for easier scanning and testing
  • Refreshed Referrals Page: Stats, the share link, share buttons, and the "how it works" section have been rebuilt with clearer hierarchy, step cards, and reward callouts, so it is easier to see your rewards and pass on your link
  • More Wizard Polish: The deployment, volume, and reserved GPU wizards have picked up another round of spacing and typography tweaks across cards, selections, and order summaries
Reliability
  • Fresher Spheron AI Spot Pricing: Spheron AI spot offers now refresh on the same fast cadence as other spot providers, so spot availability and pricing stay current without a manual reload

v1.17.0 • 2026-04-27

Marketplace Design Refresh, Volume Discounts & NVLink GPU Flagging

A top-to-bottom visual refresh across billing, teams, volumes, API keys, SSH keys, deployment, and reserved GPU pages. Discounts now apply to volumes the same way they apply to deployments, the reserved GPU contact form captures more useful detail, and Spheron AI NVLink GPUs are clearly flagged in the marketplace.

Design & UX Refresh
  • Unified Visual Language: Billing, teams, volumes, API keys, SSH keys, deployment wizard, and reserved GPU wizard have all been redesigned around a shared header, card, and table style for a calmer, more consistent look
  • New Team Switcher: The team menu has been consolidated into a single switcher in the top navigation. Create, switch, and manage teams from one place instead of several disconnected dialogs
  • Refreshed Billing Page: Balance, deposit history, usage analytics, discount overview, and the add-credits flow (custom amount, saved card, billing details) have all been rebuilt with clearer hierarchy and tighter spacing
  • Smoother Wizards: Deployment, volume, and reserved GPU wizards share a common header treatment and improved step layouts. The order summary sidebar now stays put when dropdowns or dialogs open, so it no longer disappears mid-configuration
  • API Keys & SSH Keys: Empty states, generation dialogs, the whitelist-required state, the quick reference, and the documentation card have all been updated to match the new style
  • Volume Cards & Sidebar: Volume cards, the volume details sidebar, the resize dialog, and the create-volume order summary have been refreshed to read more clearly at a glance
New Features
  • Volume Discounts: Active team discounts now apply to volume hourly rates at creation and at resize time, the same way they already apply to deployments. The discounted rate is locked in when the volume is created. If a team's discount later expires or is revoked, the volume automatically reverts to its standard rate on the next billing cycle
  • Spheron AI NVLink GPUs Flagged: Spheron AI GPU offers that are NVLink-bridged variants (for example H100 NVL) are now clearly marked in the marketplace so you can pick the interconnect that matches your workload
  • Richer Reserved GPU Contact Form: The reserved GPU request form now captures phone country and country code separately (with a country picker), preferred GPU model, requested GPU quantity, and an optional marketing consent, so the team can route inquiries faster
Bug Fixes & Reliability
  • Discount Preview No Longer Fails Silently: The discount-code preview used in billing, the deposit flow, and the deployment and volume wizards now handles non-JSON error responses gracefully instead of failing in the background. Users see a clear error state instead of a stuck preview
  • Create Team Dialog Stays Open Until Done: The create team dialog now stays open and shows its loading state until the team is actually created, matching the behavior of the invite member and add SSH key dialogs
  • Data Crunch Cleanup Safety: The background cleanup that removes orphaned Data Crunch volumes now has additional safeguards so marketplace-tracked, idle, and shared user-owned volumes are never eligible. Only true orphans can be removed
  • Runtime Display: Reserved runtime hours now display as N/A instead of N.A when no expiry is set
API
  • Volume Hourly Rate (all clients): Volumes created or resized while a team has an active discount now bill at the discounted rate. The volume response from GET /api/volumes and GET /api/volumes/:volumeId reflects the locked-in discounted rate; the pre-discount rate is also returned for reference. If the team's discount later expires, the rate automatically reverts on the next billing cycle. No request changes are required for clients to benefit from this; pricing endpoints continue to return public list rates.

  • Spheron AI GPU Offers: GPU offers from Spheron AI now include an nvlink: true flag in their metadata when the GPU model is an NVLink-bridged variant (for example H100 NVL, A100 NVL). Non-NVLink offers do not include the flag. Existing fields are unchanged, so existing integrations continue to work and can opt in to the new flag when ready.
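A minimal sketch of how a client might opt in to the new flag. Only the nvlink: true flag itself comes from the note above. The "metadata" and "name" key names and the sample offers are assumptions for illustration, not the documented response shape.

```python
from typing import Any


def is_nvlink(offer: dict[str, Any]) -> bool:
    """True only when the offer carries nvlink: true in its metadata.

    Non-NVLink offers omit the flag, so a missing key counts as False.
    The "metadata" key name is an assumption about the response shape.
    """
    metadata = offer.get("metadata") or {}
    return metadata.get("nvlink") is True


def split_by_interconnect(offers: list[dict[str, Any]]):
    """Partition offers into (NVLink, everything else)."""
    nvlink = [o for o in offers if is_nvlink(o)]
    other = [o for o in offers if not is_nvlink(o)]
    return nvlink, other


# Hypothetical sample data, not real API output.
sample_offers = [
    {"name": "H100 NVL", "metadata": {"nvlink": True}},
    {"name": "H100 PCIe", "metadata": {}},
    {"name": "A100 SXM4"},
]
nvlink_offers, other_offers = split_by_interconnect(sample_offers)
```

Treating a missing flag as "not NVLink" keeps older integrations working unchanged, which matches the opt-in behavior described above.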


v1.16.0 • 2026-04-24

Spheron AI Persistent Volumes & Provider Rules Dialogs

Persistent storage, now on Spheron AI. Create a volume, attach it to a GPU instance at deploy time or while it's running, and move it between instances without losing data. Every volume provider now has a dedicated rules dialog so you know exactly how attachment and lifecycle work before you commit.

New Features
  • Spheron AI Volumes: Spheron AI is now a supported volume provider. Create and manage persistent storage from the /volumes dashboard, backed by Cloud-SSD storage
  • Attach at Deploy Time or Later: Pick a Spheron AI volume in the deployment wizard, or attach one to an already-running instance from the volume or instance sidebar. No need to tear down and redeploy to add storage
  • Hot-Detach Supported: Detach a volume from a running instance and reattach it to a different instance without terminating either one. Your data moves with it
  • Up to 10 Volumes per Instance: A single Spheron AI instance can have up to 10 volumes attached at once. The UI and API enforce the limit automatically so attach requests fail fast instead of in the middle of a deploy
  • Single-Attach Volumes: Each Spheron AI volume can only be attached to one instance at a time. The dashboard hides incompatible volumes in the deploy wizard and shows a clear error if you try to double-attach
  • Persistent After Termination: Volumes survive instance termination by default. Delete them explicitly from the dashboard when you're done
  • Large Storage Ceiling: Create volumes from 1 GB up to 40 TB in a single volume
  • Data Crunch Multi-Volume Support: Data Crunch instances can now attach up to 10 shared volumes at a time, up from a single volume in v1.14.1. Each Data Crunch volume can still be mounted on multiple instances concurrently, so you can fan out shared data and mix storage pools on the same instance
UX Improvements
  • Provider Rules Dialogs: Every volume provider (Spheron AI, Data Crunch, Sesterce, Voltage Park) now has its own "Volume Rules" dialog accessible from the create-volume and attach flows. Each dialog spells out per-provider limits: attach timing, concurrent-attach rules, resize policy, and lifecycle behavior, so you know what you're getting before you create a volume
  • Volume Created Success Modal: After a Spheron AI volume is created, a confirmation modal summarizes the rules and offers a one-click "Deploy GPU" shortcut to go straight into the deployment wizard with the new volume pre-selected
  • GPU Offer Availability Warning: The create-volume wizard now checks whether the selected provider and region actually have live GPU offers. If there's nothing to attach the volume to, a warning surfaces in the order summary before you confirm
  • Smarter Volume Matching in Deploy Wizard: The attach step in the deployment wizard now respects per-provider constraints. Incompatible volumes are filtered, limit warnings appear when you hit the cap, and single-attach volumes already bound elsewhere are shown as blocked with an explanation
  • Smarter Region Default on Create Volume: The create volume wizard now auto-selects the first region that actually has live GPU offers instead of blindly picking the first region alphabetically. Regions without GPU offers are clearly labeled "No GPU offer present" in the Sesterce, Data Crunch, and Spheron AI dropdowns so you can see which ones to avoid before selecting. The Data Crunch dropdown is also cleaner, with decorative flag graphics removed
  • Consistent Region Name Formatting: Massed Compute regions (for example us-central-3) now display as "Us Central 3" in the deployment wizard, order summary, and instance cards, matching the style used for Sesterce and Spheron AI
Bug Fixes
  • No In-Place Resize (Enforced): Spheron AI volumes cannot be resized once created. The API now returns a clear error on resize attempts instead of silently succeeding. To grow storage, create a new larger volume and migrate your data
  • Accurate Region Codes in UI: The dashboard now shows clean region codes (for example CANADA-1) regardless of the provider's internal naming
  • Stale Deployments No Longer Lock Volumes: If a previous instance terminated without a clean detach, single-attach volumes are no longer permanently locked. Attach attempts skip terminated, failed, stopped, and deleted deployments when checking occupancy, so you can always reclaim the volume onto a new instance. Applies to both Spheron AI and Voltage Park volumes
  • Cleaner Single-Attach Error Messages: When a single-attach volume is genuinely in use, the error message now lists only the live conflicting instances instead of every instance the volume has ever been attached to. Terminated and failed deployments no longer clutter the list
  • Voltage Park Single-Attach Enforced Server-Side: Voltage Park's one-volume-per-instance and one-instance-per-volume rules are now enforced by the API, not just the UI. Double-attach attempts are rejected even when the request bypasses the dashboard
API

Creating and using a Spheron AI volume involves three steps:

  1. Fetch available regions: GET /api/volumes/regions?provider=spheron-ai returns a regions array. Each entry has an id (the region code to pass at creation time) and a name. Each region now also includes a hasGpuOffers boolean, so you can avoid creating a volume in a region that has no GPU offers to attach it to.

  2. Create the volume: POST /api/volumes with provider: "spheron-ai", region: "<region id from step 1>", name (max 50 characters), and sizeInGb (1–40960). The response includes the volume id you will use for all subsequent operations.

  3. Attach to an instance: two options depending on your workflow:

    • At deploy time: include volumeIds: ["<volumeId>"] in your POST /api/deployments request
    • After the instance is running: POST /api/volumes/{volumeId}/attach with { "deploymentId": "<id>" }
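The three steps above can be sketched as plain request builders. This is a minimal sketch with no HTTP client: the endpoints, field names, and limits come from the steps above, while the function names and the sample region list are hypothetical.

```python
from typing import Any


def pick_region(regions: list[dict[str, Any]]) -> str:
    """Return the id of the first region that reports live GPU offers."""
    for region in regions:
        if region.get("hasGpuOffers"):
            return region["id"]
    raise ValueError("no region currently has GPU offers")


def build_create_body(region_id: str, name: str, size_in_gb: int) -> dict[str, Any]:
    """Body for POST /api/volumes, checked against the documented limits."""
    if len(name) > 50:
        raise ValueError("name is limited to 50 characters")
    if not 1 <= size_in_gb <= 40960:
        raise ValueError("sizeInGb must be between 1 and 40960")
    return {
        "provider": "spheron-ai",
        "region": region_id,
        "name": name,
        "sizeInGb": size_in_gb,
    }


def build_deploy_fragment(volume_id: str) -> dict[str, Any]:
    """Field to merge into a POST /api/deployments body (attach at deploy time)."""
    return {"volumeIds": [volume_id]}


def build_attach_request(volume_id: str, deployment_id: str) -> tuple[str, dict[str, Any]]:
    """Path and body for attaching to an already-running instance."""
    return f"/api/volumes/{volume_id}/attach", {"deploymentId": deployment_id}


# Hypothetical region list in the shape step 1 describes.
regions = [
    {"id": "region-a", "name": "Region A", "hasGpuOffers": False},
    {"id": "region-b", "name": "Region B", "hasGpuOffers": True},
]
create_body = build_create_body(pick_region(regions), "training-data", 500)
```

Validating the name length and size range client-side only mirrors the documented limits. The API remains the source of truth.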

Detaching and re-attaching: POST /api/volumes/{volumeId}/detach with { "deploymentId": "<id>" } frees the volume to be attached elsewhere. Data is preserved. Detach and re-attach are supported without instance downtime.
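Moving a volume between two running instances is then a detach followed by an attach. A minimal sketch of that ordering, where plan_move and the ids are hypothetical and only the two endpoints and the deploymentId body come from the text above:

```python
def plan_move(volume_id: str, from_deployment_id: str, to_deployment_id: str):
    """Ordered (method, path, body) requests that move one volume.

    The detach must complete first: a Spheron AI volume can only be
    attached to one instance at a time.
    """
    return [
        ("POST", f"/api/volumes/{volume_id}/detach", {"deploymentId": from_deployment_id}),
        ("POST", f"/api/volumes/{volume_id}/attach", {"deploymentId": to_deployment_id}),
    ]


steps = plan_move("vol-1", "dep-a", "dep-b")
```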

Limits: Each Spheron AI instance supports up to 10 attached volumes. Each volume can be attached to only one instance at a time. Resize is not supported. Rename is not supported, so pick the name you want at creation.

Pricing: GET /api/volumes/pricing?provider=spheron-ai returns the current hourly rate per GB so you can estimate cost before creating a volume.

Data Crunch limits (updated): Each Data Crunch instance now supports up to 10 attached volumes (was 1 in v1.14.1). Multi-instance attach is unchanged. A single Data Crunch volume can still be mounted on many instances at the same time.
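The per-provider limits in this release can be summarized as a small rules table that a client might use to pre-check an attach. This is a sketch, not the server's logic: the numbers come from this changelog, while AttachRules, attach_blocker, and the returned messages are hypothetical. Sesterce is left out because it only attaches at deploy time.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AttachRules:
    max_volumes_per_instance: int
    multi_attach: bool  # can one volume sit on several instances at once?


RULES = {
    "spheron-ai": AttachRules(max_volumes_per_instance=10, multi_attach=False),
    "datacrunch": AttachRules(max_volumes_per_instance=10, multi_attach=True),
    "voltage-park": AttachRules(max_volumes_per_instance=1, multi_attach=False),
}


def attach_blocker(provider: str, volumes_on_instance: int,
                   live_instances_using_volume: int) -> Optional[str]:
    """Return a reason the attach would be rejected, or None if it looks fine.

    Only live instances count toward occupancy; terminated, failed,
    stopped, and deleted deployments are skipped by the API.
    """
    rules = RULES[provider]
    if volumes_on_instance >= rules.max_volumes_per_instance:
        return "instance volume limit reached"
    if not rules.multi_attach and live_instances_using_volume > 0:
        return "volume is already attached to a live instance"
    return None
```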

API Docs correction: The API reference now uses CLUSTER as the instanceType for multi-node deployments, matching the v1.10.0 rename. Any request examples that still showed BARE_METAL have been updated.


v1.15.0 • 2026-04-04

Sesterce Persistent Volume Support

Persistent storage is now available on Sesterce. Create volumes, pick your cloud provider and region, and attach them to instances at deploy time. Your data outlives any single session.

New Features
  • Sesterce Volumes: Sesterce is now a supported volume provider. Create and manage persistent storage volumes from the /volumes dashboard
  • Cloud Provider + Region Picker: When creating a Sesterce volume, choose your cloud provider and region from a dynamic dropdown so volumes are always deployed where your compute is
  • Deploy-time Attachment: Select a compatible Sesterce volume in the deployment wizard. It attaches automatically when the instance starts, no extra steps needed
  • Automatic Compatibility Filtering: The deployment wizard only shows volumes that match the region and cloud provider of your selected GPU offer. No risk of picking an incompatible volume
  • Volume Deletion: Delete a Sesterce volume from the dashboard once the instance it was attached to has been terminated
API

Creating and using a Sesterce volume involves three steps:

  1. Fetch available cloud providers and regions: GET /api/volumes/regions?provider=sesterce returns a regions array where each entry has an id (the region code), a name, a cloudId, and a cloudName. You need both id and cloudId when creating the volume. They must match the cloud provider and region of the instance you plan to deploy.

  2. Create the volume: POST /api/volumes with provider: "sesterce", region: "<id from step 1>", cloudId: "<cloudId from step 1>", name, and sizeInGb. The response includes the volume id you will use when deploying.

  3. Attach at deploy time: include volumeIds: ["<volumeId>"] in your POST /api/deployments request. The volume attaches when the instance is created. Post-creation attach and detach are not supported for Sesterce. The volume is bound at launch.

Compatibility: the cloudId and region of your volume must match the cloud provider and region of your GPU offer. Use GET /api/gpu-offers to find offers and cross-reference the region before creating a volume.
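A minimal sketch of the create body and the compatibility rule. The volume fields come from the steps above. The "region" and "cloudId" keys on a GPU offer are assumed names for illustration, since the offer response shape is not described here, and the sample values are made up.

```python
from typing import Any


def build_sesterce_body(region_entry: dict[str, Any], name: str, size_in_gb: int) -> dict[str, Any]:
    """Body for POST /api/volumes using one entry from the regions array."""
    return {
        "provider": "sesterce",
        "region": region_entry["id"],
        "cloudId": region_entry["cloudId"],
        "name": name,
        "sizeInGb": size_in_gb,
    }


def is_compatible(volume_body: dict[str, Any], offer: dict[str, Any]) -> bool:
    """A volume fits an offer only if both region and cloud provider match.

    The offer keys used here are assumptions, not documented field names.
    """
    return (
        volume_body["region"] == offer.get("region")
        and volume_body["cloudId"] == offer.get("cloudId")
    )


# Hypothetical values for illustration only.
entry = {"id": "region-x", "name": "Region X", "cloudId": "cloud-1", "cloudName": "Cloud One"}
body = build_sesterce_body(entry, "datasets", 200)
```

Because Sesterce binds the volume at launch, running this check before creating the volume avoids ending up with storage that no offer can use.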

Pricing: GET /api/volumes/pricing?provider=sesterce with optional cloudId and region query params returns the current hourly rate per GB for that specific location.


v1.14.6 • 2026-03-17

Spheron AI: Open All Inbound Ports by Default

All ports are open out of the box. Spheron AI provider VMs now have full inbound access (TCP/UDP/ICMP, ports 1–65535, IPv4 and IPv6) applied automatically at creation time, with no manual firewall configuration required.

Improvements
  • Security rules at creation: Six inbound rules (TCP 1–65535, UDP 1–65535, and ICMP for both IPv4 and IPv6) are now passed directly in the VM creation payload via the security_rules field, replacing the previous post-creation polling approach
  • Removed background firewall task: Eliminated the enableSSHFirewallWhenActive polling loop (up to 10-minute wait) and the separate enableSSHFirewall method that made individual POST /sg-rules calls after the VM reached ACTIVE state
  • No more sshFirewallError tracking: Removed the sshFirewallErrorMap, the sshFirewallError field on VMInstance, and all related error-surface logic. Firewall setup is now atomic with VM creation
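To illustrate the six-rule set, here is a sketch that generates it. Only the security_rules field name, the protocols, the port range, and the IPv4/IPv6 split come from the notes above. The keys inside each rule and the surrounding payload are assumptions, not the provider's actual schema.

```python
from itertools import product


def build_open_inbound_rules() -> list[dict]:
    """TCP, UDP, and ICMP for both IP versions: 3 protocols x 2 = 6 rules."""
    rules = []
    for ethertype, protocol in product(("IPv4", "IPv6"), ("tcp", "udp", "icmp")):
        # Rule key names below are hypothetical.
        rule = {"direction": "ingress", "ethertype": ethertype, "protocol": protocol}
        if protocol != "icmp":
            # ICMP has no ports; TCP and UDP open the full 1-65535 range.
            rule["port_range_min"] = 1
            rule["port_range_max"] = 65535
        rules.append(rule)
    return rules


# The rules travel with the creation request instead of a later polling step.
vm_payload = {"name": "example-vm", "security_rules": build_open_inbound_rules()}
```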

v1.14.5 • 2026-03-16

GPU Count Validation Hardening

Stricter GPU count validation. Deployment requests with a mismatched GPU count are now rejected reliably regardless of whether the value arrives as a string or a number, and validation errors are propagated correctly through the error-handling pipeline.

Bug Fixes
  • Type-safe GPU count comparison: The GPU count check now coerces both the requested value and the offer config value to Number before comparing, preventing false validation failures caused by string/number type mismatches (e.g., "2" vs 2)
  • GpuValidationError class: Introduced a dedicated GpuValidationError subclass so that GPU validation errors are identified with instanceof rather than fragile error.message.startsWith(...) string matching, eliminating the risk of silently swallowing errors or misclassifying HTTP status codes if the message text changes
  • Route-level fix: deployment.routes.ts now uses instanceof GpuValidationError to classify the error as an HTTP 400, replacing the previous error.message.includes('Invalid GPU count:') check
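The service itself is TypeScript. The Python sketch below only illustrates the same two ideas: numeric coercion before comparing, and classifying errors by type instead of message text. The function names and the 500 fallback are assumptions.

```python
class GpuValidationError(ValueError):
    """Raised when the requested GPU count does not match the offer."""


def validate_gpu_count(requested, offered) -> None:
    # Coerce both sides to numbers so "2" and 2 compare equal.
    try:
        requested_n, offered_n = float(requested), float(offered)
    except (TypeError, ValueError):
        raise GpuValidationError(f"Invalid GPU count: {requested!r}") from None
    if requested_n != offered_n:
        raise GpuValidationError(
            f"Invalid GPU count: {requested!r} (offer provides {offered!r})"
        )


def status_for(error: Exception) -> int:
    # Classify by type: rewording the message can no longer change the status.
    return 400 if isinstance(error, GpuValidationError) else 500
```

An unrelated error whose message happens to contain "Invalid GPU count:" is no longer mistaken for a validation failure, which is the failure mode string matching invites.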

v1.14.4 • 2026-03-15

Feature Announcement Modal & Sidebar Changelog Link

Stay up to date without leaving the dashboard. A new in-app feature announcement modal greets users with what's new since their last visit, and the sidebar now links directly to the changelog.

New Features
  • Feature Announcement Modal: Automatically shows authenticated users any features released since their last visit. Slides through announcements newest-first with story-style progress bars, smooth slide transitions, and a 25-second auto-advance timer
  • Per-User Tracking: Last-seen feature is persisted. The modal only re-appears when genuinely new features have shipped
  • Learn More Links: Each announcement card links directly to the relevant docs or changelog section
  • Changelog Sidebar Link: A compact "Changelog" entry with a "New" badge now lives in the sidebar bottom dock alongside the API Docs link, opening docs.spheron.network/changelog in a new tab

v1.14.3 • 2026-03-14

Sesterce Deployment Reliability Fix

Sesterce deployments no longer fail on name collisions. A 409 Conflict error that occurred when instance names clashed is now handled automatically with a unique suffix and retry logic.

Bug Fixes
  • 409 Name Collision: Sesterce's /gpu-cloud/instances API rejects duplicate instance names with a 409. Deployment names are now suffixed with a random 6-character hex string (crypto.randomBytes(3).toString('hex')) to prevent collisions
  • Retry Logic: If a 409 still occurs (e.g. from an extremely unlikely suffix collision), the deployment automatically retries with a freshly generated name, up to 3 attempts before failing
  • Status Code on Errors: makeAuthenticatedRequest now attaches statusCode to thrown errors, enabling reliable detection of 409 vs other failure types
  • SSH Key Cleanup: If all retry attempts are exhausted, any provisioned temporary SSH key is deleted to prevent resource leaks
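A Python sketch of the suffix-and-retry pattern described above. secrets.token_hex(3) plays the role of crypto.randomBytes(3).toString('hex'). ProviderError, the "-" separator, the callback signatures, and reading "up to 3 attempts" as three total tries are all assumptions.

```python
import secrets


class ProviderError(Exception):
    """Error carrying the HTTP status, mirroring the statusCode on thrown errors."""

    def __init__(self, message: str, status_code: int):
        super().__init__(message)
        self.status_code = status_code


def unique_name(base: str) -> str:
    # 3 random bytes render as a 6-character hex suffix.
    return f"{base}-{secrets.token_hex(3)}"


def create_with_retry(base_name, create_instance, cleanup_ssh_key, max_attempts=3):
    """Try up to max_attempts names; clean up the temporary key if all collide."""
    last_error = None
    for _ in range(max_attempts):
        try:
            return create_instance(unique_name(base_name))
        except ProviderError as err:
            if err.status_code != 409:
                raise  # only name collisions are retried
            last_error = err
    cleanup_ssh_key()
    raise last_error
```

How non-409 failures handle SSH key cleanup is not covered by the notes above, so the sketch leaves that path to the caller.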

v1.14.2 • 2026-03-14

TensorDock Deployment Fix & Improved Error Reporting

TensorDock deployments now work reliably. A 400 Bad Request error that blocked all TensorDock deployments has been fixed, and deployment failures now show human-readable error messages in the dashboard.

Bug Fixes
  • TensorDock 400 Error: Removed an incorrect JSON wrapper (data.attributes) that was wrapping the request payload. TensorDock's API v2 expects a flat JSON body
  • Field Name Casing: Renamed useDedicatedIp (camelCase) → use_dedicated_ip (snake_case) to match TensorDock's API contract
  • Cloud-Init Format: Converted cloud_init from a JSON object to a YAML string, consistent with how other providers (e.g. the Spheron AI provider) handle startup scripts
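The three fixes amount to a payload transformation, sketched below. The data.attributes wrapper, use_dedicated_ip, and cloud_init names come from the notes above. The other sample fields, the function name, and the "#cloud-config" header line are assumptions. The sketch also leans on JSON being valid YAML so it needs no YAML library, and the real code may serialize differently.

```python
import json
from typing import Any


def to_flat_body(wrapped: dict[str, Any]) -> dict[str, Any]:
    """Convert the old wrapped payload into the flat body described above."""
    body = dict(wrapped["data"]["attributes"])  # drop the data.attributes wrapper
    if "useDedicatedIp" in body:
        body["use_dedicated_ip"] = body.pop("useDedicatedIp")  # camelCase to snake_case
    if isinstance(body.get("cloud_init"), dict):
        # Send cloud-init as a YAML string rather than a nested JSON object.
        body["cloud_init"] = "#cloud-config\n" + json.dumps(body["cloud_init"], indent=2)
    return body


# Hypothetical pre-fix payload for illustration.
old_payload = {
    "data": {
        "attributes": {
            "name": "example",
            "useDedicatedIp": True,
            "cloud_init": {"packages": ["git"]},
        }
    }
}
new_body = to_flat_body(old_payload)
```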
Error Visibility
  • Human-Readable Failure Reasons: TensorDock JSON error responses are now parsed and classified into user-friendly categories (e.g. "Insufficient resources", "Invalid configuration") stored as failureReason on the deployment
  • Failure Details in Sidebar: InstanceDetailsSidebar now surfaces failureReason and failureDetails so users can see exactly why a deployment failed without digging through logs

v1.14.1 • 2026-03-04

Data Crunch Shared Volume Support

Shared NFS volumes on Data Crunch. Create persistent NFS-backed volumes, attach them to multiple instances simultaneously, and keep your data safe after instance termination.

New Provider Support
  • Data Crunch Volumes: Data Crunch is now a supported volume provider. Create and manage shared NFS volumes from the dashboard or via the API
  • Multi-Instance Attachment: A single Data Crunch volume can be attached to multiple GPU instances at the same time, unlike Voltage Park which restricts each volume to one instance
  • Region Selection: Deploy volumes into FIN-01 (Finland 1), FIN-02 (Finland 2), or FIN-03 (Finland 3). Each volume is pinned to the region you pick at creation time
Volume Lifecycle & Safety
  • Persistent Volumes: Data Crunch volumes survive instance termination by default. Only deleted on explicit user action
  • One Volume Per Instance: Each Data Crunch instance supports at most one shared volume at a time, though that same volume can be attached to any number of instances concurrently
API

Creating and using a Data Crunch volume involves three steps:

  1. Fetch available regions: GET /api/volumes/regions?provider=datacrunch returns a regions array where each entry has an id (the region code to pass at creation time), a human-readable name, and a country_code. Use the id value as the region field when creating the volume.

  2. Create the volume: POST /api/volumes with provider: "datacrunch", region: "<region id from step 1>", name, and sizeInGb. The response includes the volume id you will use for all subsequent operations.

  3. Attach to an instance: two options depending on your workflow:

    • At deploy time: include volumeIds: ["<volumeId>"] in your POST /api/deployments request
    • After the instance is running: POST /api/volumes/{volumeId}/attach with { "deploymentId": "<id>" }

After attachment, GET /api/volumes/{volumeId} returns the NFS mount command and mount path in the response. Copy these directly into your instance to mount the volume.

Detaching and re-attaching: POST /api/volumes/{volumeId}/detach with { "deploymentId": "<id>" } frees the volume to be attached elsewhere. Data is preserved.

Pricing: GET /api/volumes/pricing?provider=datacrunch returns the current hourly rate per GB so you can estimate cost before creating a volume.


v1.14.0 • 2026-03-03

New GPU Provider: Europe & North America Regions

More GPUs, more regions. A new provider is now available on the Spheron AI marketplace, adding H200, H100, A100, L40S, A6000, and A4000 inventory across Canada, Norway, and the US.

New GPU Inventory
  • GPU Models: H200 SXM, H100 SXM5 / NVL / PCIe, A100 SXM4 / NVL / PCIe, L40S, L40, A6000, RTX Pro 6000, A4000
  • Regions: Canada (Montreal), Norway (Bergen), United States
  • On-Demand & Spot: Both instance types are available. Spot instances offer the same GPU configurations at a lower price with no uptime guarantee
Deployment Features
  • Startup Scripts: Cloud-init startup scripts are supported. Packages, run commands, and file writes all work on this provider the same as others
  • OS Options: 11 OS images to choose from, including Ubuntu 22.04/24.04, AlmaLinux 9, Debian 12, and CUDA-enabled variants (R535, R550, R570). Defaults to Ubuntu Server 22.04 LTS with CUDA 12.8 and Docker
  • SSH Access: Port 22 is automatically opened on every new instance. No manual firewall configuration needed

v1.13.1 • 2026-02-27

Password Complexity Enforcement on Signup

Stronger account security. Signup now enforces password complexity rules before any Firebase call, giving users immediate, clear feedback.

Bug Fixes
  • Weak Password Allowed: The signup form previously accepted any non-empty string as a password, including the user's own email address. This is now blocked
  • No Complexity Checks: Passwords with no uppercase, no lowercase, or no digits were accepted without warning. All three are now required
Password Rules (enforced client-side before Firebase)
  • Minimum 8 characters
  • Cannot be identical to your full email address (case-insensitive)
  • Cannot be identical to the local part of your email (the portion before @)
  • Must contain at least one uppercase letter
  • Must contain at least one lowercase letter
  • Must contain at least one digit
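The rules above can be expressed as a single validator. This is a sketch rather than the shipped form logic: the check order, the error codes, the ASCII-only character classes, and applying the case-insensitive comparison to the local part as well as the full email are assumptions.

```python
import re
from typing import Optional


def password_error(password: str, email: str) -> Optional[str]:
    """Return a code for the first rule the password breaks, or None if it passes."""
    local_part = email.split("@", 1)[0]
    if len(password) < 8:
        return "too_short"
    if password.lower() == email.lower():
        return "matches_email"
    if password.lower() == local_part.lower():
        return "matches_email_local_part"
    if not re.search(r"[A-Z]", password):
        return "missing_uppercase"
    if not re.search(r"[a-z]", password):
        return "missing_lowercase"
    if not re.search(r"[0-9]", password):
        return "missing_digit"
    return None
```

Returning a specific code per rule is what makes the inline, field-level error messages possible without a round trip to Firebase.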
UX Improvements
  • Inline Error Messages: Validation failures are shown immediately below the password field with a specific explanation. No generic Firebase errors
  • Live Clearing: The error clears on every keystroke in the password field, and also when the email field is changed (prevents a stuck form if a "password too similar to email" error was triggered and the user updates their email instead)
  • Gated Submit: The sign-up button stays disabled while a password validation error is active

v1.13.0 • 2026-02-27

Auto Top-Up Credits via Stripe

Never run out of credits. Configure a balance threshold and Spheron automatically charges your saved card to keep your deployments running.

Saved Card Management
  • Save a Card: Add a payment card via Stripe's hosted setup flow (no card numbers stored; only a Stripe PaymentMethod ID)
  • Card Details: Saved card brand, last 4 digits, and expiry are shown in the billing page at all times
  • Remove Card: Detach your saved card from Stripe entirely. This automatically disables auto top-up if it was enabled
  • Role-Gated Access: Only team owners and admins can save or remove cards; members see card info in read-only mode
Auto Top-Up Configuration
  • Enable / Disable Toggle: Turn auto top-up on or off from the billing page
  • Threshold Setting: Set the balance level (min $5, max $500) that triggers an automatic charge
  • Top-Up Amount: Choose how much to charge each time (min $15, max $5,000)
  • Immediate Check: Enabling auto top-up immediately checks your balance and triggers a charge if already below threshold
  • Failure Banner: A warning banner appears on the billing page if auto top-up was disabled due to a declined charge
Automatic Charging
  • Balance-Triggered Charge: After each credit deduction cycle, if your team balance falls below the configured threshold, a Stripe off-session PaymentIntent is created and charged automatically
  • 5-Minute Cooldown: A distributed lock prevents double-charging. Top-ups cannot fire more than once every 5 minutes per team
  • Race Condition Safe: Balance is re-fetched inside the lock before charging to prevent concurrent monitor cycles from triggering duplicate charges
  • Instant Credit: Team balance is credited immediately on successful charge without waiting for a webhook
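A simplified sketch of the configuration bounds and the charge decision. The limits and the 5-minute cooldown come from the notes above. The function names are hypothetical, and the distributed lock, balance re-fetch, and Stripe call are deliberately left out.

```python
from datetime import datetime, timedelta
from typing import Optional

COOLDOWN = timedelta(minutes=5)


def validate_config(threshold_usd: float, amount_usd: float) -> None:
    """Enforce the documented bounds for threshold and top-up amount."""
    if not 5 <= threshold_usd <= 500:
        raise ValueError("threshold must be between $5 and $500")
    if not 15 <= amount_usd <= 5000:
        raise ValueError("top-up amount must be between $15 and $5,000")


def should_top_up(balance_usd: float, threshold_usd: float,
                  last_top_up_at: Optional[datetime], now: datetime) -> bool:
    """Charge only when below threshold and outside the cooldown window."""
    if balance_usd >= threshold_usd:
        return False
    if last_top_up_at is not None and now - last_top_up_at < COOLDOWN:
        return False
    return True
```

In the real flow this decision would run inside the lock, against a freshly fetched balance, so two concurrent monitor cycles cannot both pass the check.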
Stripe Webhook Handlers
  • payment_intent.succeeded: Idempotent safety net. Credits the team if the service layer missed it (e.g. network failure mid-request)
  • payment_intent.payment_failed: Automatically disables auto top-up and records lastFailedAt when a card is declined
Transaction History
  • Auto top-up charges appear in Deposit History labeled "Auto Top-Up" with a distinct badge, clearly separated from manual credit purchases
Email Notifications
  • Success Email: Confirms the charge amount and shows the new account balance
  • Failure Email: Notifies the configuring user that their card was declined and auto top-up has been disabled, with a direct link to re-enable

v1.12.2 • 2026-02-27

Mandatory Phone Number on Reserved GPU Form

Required contact info. Phone number is now enforced end-to-end so reserved GPU requests always include reachable contact details.

Bug Fixes
  • Mandatory Phone Field: Phone number is now a required field on the Reserved GPU request form. The label, frontend validation, and server-side validation all enforce it
  • Format Validation: Phone input is validated against a format check (supports +, digits, spaces, dashes, parentheses, e.g. +1 555-123-4567 or (555) 123-4567) with a minimum of 7 digits, on both client and server
  • Review Step Gate: If contact info (including phone) is incomplete when the user reaches the review step, the wizard redirects back to the contact form with a clear "Contact Info Required" toast notification
  • Backwards Compatibility: Existing reservations without a phone number are unaffected. The schema change is non-breaking for legacy documents
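A minimal sketch of a format check matching the description above: an allowed character set plus a minimum digit count. The exact pattern used by the form is not shown in these notes, so the regular expression here is an assumption. For instance, it does not restrict where "+" may appear.

```python
import re

# Allowed characters: +, digits, spaces, dashes, parentheses.
ALLOWED = re.compile(r"[+0-9 ()\-]+")
MIN_DIGITS = 7


def is_valid_phone(value: str) -> bool:
    value = value.strip()
    if not ALLOWED.fullmatch(value):
        return False
    return sum(ch.isdigit() for ch in value) >= MIN_DIGITS
```

Running the same function on both client and server is the simplest way to keep the two validations from drifting apart.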

v1.12.1 • 2026-02-23

Zero-Downtime Deployment & Faster Container Restarts

Instant restarts, no build delays. App builds are now baked into the Docker image so containers spin up in seconds instead of minutes.

Deployment Scripts
  • Zero-Downtime Deploy: A new script/deploy.sh is available for every app. It builds the new image while the current container keeps serving traffic, then hot-swaps to the new container with no manual downtime
  • Health-Gated Rollout: The deploy script waits for Docker's health check to pass (up to 120s) before declaring success, ensuring the new container is fully ready before traffic is handed off
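The health gate can be sketched as a polling loop with a deadline. The actual deploy script is a shell script and is not shown in these notes. This Python version only illustrates the logic, and the 2-second poll interval and the injectable clock and sleep parameters are assumptions.

```python
import time
from typing import Callable


def wait_until_healthy(get_status: Callable[[], str],
                       timeout_s: float = 120.0,
                       interval_s: float = 2.0,
                       sleep: Callable[[float], None] = time.sleep,
                       clock: Callable[[], float] = time.monotonic) -> bool:
    """Poll until the container reports "healthy" or the timeout elapses.

    get_status could wrap, for example:
      docker inspect --format '{{.State.Health.Status}}' <container>
    which reports starting, healthy, or unhealthy.
    """
    deadline = clock() + timeout_s
    while True:
        if get_status() == "healthy":
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

Only when this returns True would the script hand traffic to the new container. On False, the old container keeps serving.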
Docker Image Optimizations
  • Build at Image Time: npm run build now runs during docker build (image layer) instead of at container startup. Containers start in seconds rather than waiting for a full compile
  • Health Checks Added: All apps' Dockerfiles now include HEALTHCHECK directives with 10s intervals and 30s start periods so Docker and the deploy script can reliably detect readiness

v1.12.0 • 2026-02-23

Volume Management UI for Voltage Park

Full persistent storage experience. Create, browse, attach, detach, resize, and monitor volumes directly from the dashboard.

New Pages
  • Volumes Page (/volumes): Dedicated page to view and manage all your persistent storage volumes in one place, with status badges, provider/region info, hourly rates, and quick actions
  • Create Volume Page (/create-volume): Step-by-step wizard to create a new volume. Select a provider, configure name, size, region, and lifecycle settings before reviewing the cost summary
Volume Management
  • Volume Cards: Each volume displays its name, size, provider, region, status (available, attached, creating, deleting, error), and current cost, with one-click copy for Volume ID
  • Volume Details Sidebar: Click any volume to open a detailed side panel showing overview, configuration, storage usage metrics, and attached instances
  • Attach to Deployment: During the deployment wizard, an "Attach Volumes" step surfaces compatible volumes (same provider and region) and lets you select one to attach before launch
    • Voltage Park instances support 1 volume per instance. The UI enforces this limit automatically
  • Detach Volume: Detach a volume from a running instance directly from the instance detail sidebar or the volumes page, without losing data
  • Resize Volume: Expand a volume's storage size on-demand from the volumes page (shrinking is not supported)
  • Delete Volume: Delete a volume (must be detached first) with a confirmation dialog
Instance Detail Sidebar
  • New Attached Volumes card visible on each running instance showing all volumes currently mounted, their size, hourly cost, and a detach action
Navigation
  • Volumes and Create Volume entries added to the sidebar for quick access
Smart Behaviors
  • Volume selection resets automatically when you switch providers or networking type in the deployment wizard
  • Volume size input clamps to the provider's supported min/max range and rounds to a whole number
  • Region matching is normalized so minor formatting differences (e.g. us-east-1 vs US East 1) don't prevent valid volumes from appearing
  • Provider pricing is validated before allowing volume creation. A clear error is shown if pricing is unavailable for the selected provider/region
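Two of these behaviors are easy to show in code. This is a sketch under stated assumptions: the function names are hypothetical, the wizard's real normalization may differ, and Python's round uses banker's rounding for exact .5 values.

```python
import re


def clamp_size(value: float, min_gb: int, max_gb: int) -> int:
    """Round to a whole number, then clamp into the provider's range."""
    return max(min_gb, min(max_gb, round(value)))


def normalize_region(region: str) -> str:
    """Lowercase and strip separators so formatting differences compare equal."""
    return re.sub(r"[^a-z0-9]", "", region.lower())


def same_region(a: str, b: str) -> bool:
    return normalize_region(a) == normalize_region(b)
```

With this normalization, "us-east-1" and "US East 1" both reduce to "useast1", so a valid volume is not filtered out by formatting alone.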
Pricing & Cost Display
  • Hourly and monthly cost estimates are shown in real time while you configure the volume size
  • Order summary in the create wizard shows total hourly cost and projected 30-day cost before confirming
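The estimate is simple arithmetic, assuming billing is a flat hourly rate per GB as described for volume pricing elsewhere in this changelog. The rate in the example is a made-up number, not a real price.

```python
from decimal import Decimal

HOURS_PER_30_DAYS = 24 * 30  # 720


def estimate_cost(size_in_gb: int, hourly_rate_per_gb: Decimal) -> tuple[Decimal, Decimal]:
    """Return (hourly cost, projected 30-day cost) for a volume."""
    hourly = size_in_gb * hourly_rate_per_gb
    return hourly, hourly * HOURS_PER_30_DAYS


# Hypothetical rate: $0.0001 per GB per hour for a 500 GB volume.
hourly, thirty_day = estimate_cost(500, Decimal("0.0001"))
```

Decimal avoids the rounding drift floats introduce when small per-GB rates are multiplied out over 720 hours.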
API Endpoints (unchanged from v1.11.0, now fully surfaced in UI)
  • GET /api/volumes: list volumes (filterable by team, status, provider)
  • POST /api/volumes: create volume
  • GET /api/volumes/:volumeId: volume details with live usage data
  • PATCH /api/volumes/:volumeId: rename, resize, or update lifecycle settings
  • DELETE /api/volumes/:volumeId: delete volume
  • POST /api/volumes/:volumeId/attach: attach to a deployment
  • POST /api/volumes/:volumeId/detach: detach from a deployment
  • GET /api/volumes/pricing: storage pricing by provider

v1.11.1 • 2026-02-13

Kubernetes Add-on for Voltage Park

Managed Kubernetes clusters on bare-metal GPU instances with automatic provisioning, kubeconfig access, and health monitoring.

New Features
  • Kubernetes Cluster Deployment: Deploy fully managed Kubernetes clusters on Voltage Park bare-metal GPU instances
  • Version Selection: Choose from multiple Kubernetes versions (1.35, 1.34, 1.33, 1.32) with automatic validation
  • Automatic Provisioning: Kubernetes clusters are automatically provisioned and configured during instance deployment
  • Kubeconfig Access: Download kubeconfig files directly from deployments for immediate kubectl access
  • Cluster Health Monitoring: Real-time health tracking for control plane nodes, worker nodes, and network components
  • Grafana Integration: Built-in Grafana monitoring links for cluster observability and metrics
  • Custom Authentication: Optional support for custom Kubernetes AuthenticationConfiguration YAML
  • Persistent Cluster State: Cluster configurations persist independently from instance lifecycle
Technical Implementation
  • Provider Validation: Automatic validation ensures Kubernetes add-ons are only used with Voltage Park bare-metal instances
  • Version Validation: Server-side version checking against provider-supported Kubernetes releases
  • Base64 Encoding: Secure transmission of authentication configurations via base64 encoding
  • Monitoring Service: Background service updates cluster health and configuration data automatically
API Endpoints
  • GET /api/kubernetes/versions?provider=voltage-park: List available Kubernetes versions
  • GET /api/kubernetes/:clusterId/health: Get cluster health status with node and network component metrics
  • POST /api/deployments with kubernetesAddon object: Create deployment with Kubernetes cluster
Deployment Requirements
  • Provider: Voltage Park only (other providers not yet supported)
  • Instance Type: CLUSTER deployments only for H100 SXM5
  • Operating System: Compatible with TensorML and other Voltage Park OS images

v1.11.0 • 2026-02-07

Persistent Storage Volumes

Persistent storage for GPU instances with flexible volume management and deployment attachment.

New Features
  • Volume Management: Create, update, and delete persistent storage volumes independent of GPU instances
  • Flexible Attachment: Attach and detach volumes to/from running deployments without losing data
  • Multi-Provider Support: Volume storage available across supported providers (Voltage Park, etc.)
  • Size Flexibility: Configure volumes from 1 GB to 10 TB with dynamic expansion support
  • Automatic Billing: Hourly billing based on storage size with transparent pricing per GB
  • Usage Tracking: Real-time monitoring of used capacity and storage metrics
  • Smart Lifecycle Management: Choose whether volumes persist after instance termination or delete automatically
  • Credit Validation: Automatic validation ensures sufficient credits before volume creation (24-hour minimum)
Volume Features
  • Independent Lifecycle: Volumes persist independently from GPU instances by default
  • Cross-Instance Sharing: Attach a single volume to multiple instances (provider-dependent)
  • Capacity Expansion: Expand volume size on-demand (shrinking not supported for data safety)
  • Name Validation: Names may contain letters, digits, hyphens, and underscores (max 60 characters)
  • Status Tracking: Real-time status monitoring (available, attached, deleting, deleted)
  • Automatic Cleanup: Orphaned volume references automatically cleaned from deployments
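As a rough illustration of the naming, sizing, and expansion rules listed above, a client-side pre-check might look like this. The function names are hypothetical, and treating 10TB as 10 × 1024 GB is an assumption; the server-side rules remain authoritative.

```python
import re

MIN_SIZE_GB = 1
MAX_SIZE_GB = 10 * 1024  # assumption: 10TB counted as 10 * 1024 GB

# Letters, digits, hyphens, and underscores, 1 to 60 characters.
NAME_PATTERN = re.compile(r"[A-Za-z0-9_-]{1,60}")


def is_valid_volume_name(name):
    return NAME_PATTERN.fullmatch(name) is not None


def is_valid_volume_size(size_gb):
    return MIN_SIZE_GB <= size_gb <= MAX_SIZE_GB


def can_resize(current_gb, requested_gb):
    # Volumes can only grow; shrinking is not supported.
    return requested_gb > current_gb and is_valid_volume_size(requested_gb)


print(is_valid_volume_name("training-data_v2"))  # True
print(can_resize(500, 250))                      # False
```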
API Endpoints
  • GET /api/volumes: List all volumes for a team with pagination and filtering
  • POST /api/volumes: Create new persistent storage volume
  • GET /api/volumes/:volumeId: Get volume details with fresh usage data
  • PATCH /api/volumes/:volumeId: Update volume name, size, or lifecycle settings
  • DELETE /api/volumes/:volumeId: Delete volume (must be detached first)
  • POST /api/volumes/:volumeId/attach: Attach volume to a deployment
  • POST /api/volumes/:volumeId/detach: Detach volume from deployment
  • GET /api/volumes/pricing: Get storage pricing by provider
Technical Improvements
  • Provider Consistency: Volumes and deployments must be from the same provider for attachment
  • Atomic Operations: Provider updates happen before database updates to maintain consistency
  • Graceful Cleanup: Automatic cleanup of volume references when deployments are deleted
  • Audit Logging: Complete audit trail for all volume lifecycle events
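The attach and delete preconditions above (same provider, detach before delete) can be pictured with a small sketch. The Volume class and both helpers are hypothetical, and it assumes that the "available" status means the volume is not attached to anything.

```python
from dataclasses import dataclass


@dataclass
class Volume:
    provider: str
    status: str  # "available", "attached", "deleting", or "deleted"


def can_attach(volume, deployment_provider):
    # Volume and deployment must come from the same provider.
    # An already attached volume is not rejected here, because sharing
    # across instances is provider-dependent.
    if volume.status in ("deleting", "deleted"):
        return False
    return volume.provider == deployment_provider


def can_delete(volume):
    # Assumption: "available" means detached, so deletion is allowed.
    return volume.status == "available"


vol = Volume(provider="voltage-park", status="attached")
print(can_attach(vol, "voltage-park"), can_delete(vol))
```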

v1.10.1 • 2026-01-14

Reserved Instance Management

Dedicated GPU leasing for users who have contacted the team and secured reserved GPU nodes.

New Features
  • Reserved Instance Display: Users who have leased GPU nodes directly with Spheron AI can now view their reserved instances in the marketplace
  • Direct Leasing Support: Seamless integration for customers who have contacted the team for bulk GPU reservations
  • Dedicated Node Access: Reserved instances provide exclusive access to pre-allocated GPU nodes with guaranteed availability
  • Custom Configurations: Support for custom GPU configurations negotiated directly with the Spheron AI team
Improvements
  • Enhanced Visibility: Reserved instances are clearly labeled and distinguished from on-demand offerings
  • Better Management: Improved interface for tracking and managing leased GPU resources
  • Streamlined Experience: Integrated workflow for users with both reserved and on-demand instances

v1.10.0 • 2026-01-08

Cluster Deployments & Baremetal GPU Offers

Multi-node cluster support with enhanced baremetal GPU offerings and clearer deployment terminology.

New Features
  • Cluster Deployment Support: Multi-node GPU clusters are now clearly labeled and supported throughout the platform
  • Baremetal GPU Offers: New baremetal GPU configurations added to the marketplace for dedicated hardware performance
  • Improved Nomenclature: Voltage Park's H100 SXM5 offerings are now labeled "H100 SXM5 Cluster" instead of "Baremetal" for better clarity
  • Better GPU Classification: Enhanced distinction between single-node baremetal instances and multi-node cluster deployments
Improvements
  • Clearer Labels: Deployment types are now more accurately labeled to help users understand exactly what they're deploying
  • Enhanced Discovery: Cluster and baremetal options are now easier to find and understand in the GPU marketplace
  • Consistent Terminology: Standardized naming conventions across all providers for better user experience

v1.9.7 • 2026-01-06

Intercom Live Chat Support

Get instant help with integrated Intercom messenger for real-time customer support and assistance.

New Features
  • Live Chat Widget: Intercom messenger widget integrated throughout the platform for instant support access
  • Identity Verification: Secure user authentication with HMAC-SHA256 hash-based identity verification to prevent user impersonation
  • User Context: Automatic user data synchronization (name, email, account creation date) for personalized support experience
  • Anonymous Visitor Support: Non-authenticated visitors can still access support through anonymous chat sessions
  • Smart Session Management: Automatic session handling when users log in, log out, or switch accounts
  • Seamless Integration: Support chat available on all pages without disrupting your workflow
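For readers unfamiliar with hash-based identity verification, the general technique is to sign a stable user identifier with a server-side secret using HMAC-SHA256 and send the hex digest to the messenger alongside the user ID. The sketch below shows that technique with Python's standard library; the function name, secret, and user ID are placeholders, not values from the platform.

```python
import hashlib
import hmac


def intercom_user_hash(secret, user_id):
    """Hex HMAC-SHA256 of the user ID, keyed by a server-side secret."""
    return hmac.new(
        secret.encode("utf-8"),
        user_id.encode("utf-8"),
        hashlib.sha256,
    ).hexdigest()


# Placeholder values for illustration only; the secret must never reach the browser.
user_hash = intercom_user_hash("server-side-secret", "user-123")
print(len(user_hash))  # 64 hex characters
```

Because the digest can only be produced by someone holding the secret, a visitor cannot claim another user's ID in the chat widget.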

v1.9.6 • 2025-12-30

Deployment Termination Controls

Critical Fix: Strict termination controls prevent premature instance termination and ensure proper billing accuracy.

Breaking Changes
  • Status-Based Termination: Only instances with running status can now be terminated manually
    • deploying instances cannot be terminated (must wait until running or automatic failure)
    • failed, terminated, and terminated-provider instances are already in final states and cannot be terminated again
  • Minimum Runtime Enforcement: Running instances must meet the 20-minute minimum runtime requirement before termination
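The rules above can be summarized as a small decision function. This is a sketch under stated assumptions: the function name and the reason strings are hypothetical, while the status names and the 20-minute minimum come from this entry.

```python
from datetime import datetime, timedelta, timezone

MIN_RUNTIME = timedelta(minutes=20)
FINAL_STATUSES = {"failed", "terminated", "terminated-provider"}


def can_terminate(status, started_at, now):
    """Return (allowed, reason) for a manual termination request."""
    if status in FINAL_STATUSES:
        return False, f"instance is already in the final state '{status}'"
    if status != "running":
        # Covers "deploying": wait until the instance runs or fails on its own.
        return False, f"instance is '{status}'; only running instances can be terminated"
    remaining = MIN_RUNTIME - (now - started_at)
    if remaining > timedelta(0):
        return False, f"minimum runtime not met, {int(remaining.total_seconds())}s remaining"
    return True, "ok"


started = datetime(2025, 12, 30, 12, 0, tzinfo=timezone.utc)
print(can_terminate("running", started, started + timedelta(minutes=5)))
print(can_terminate("running", started, started + timedelta(minutes=25)))
```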
Bug Fixes
  • Fixed incorrect status semantics: Deployments that never started running are now marked as failed instead of terminated
  • Fixed API key users being able to bypass minimum runtime checks by terminating during deploying status
  • Fixed deployments showing "terminated" with N/A runtime and $0.00 cost (now correctly marked as failed)
  • Added proper error messages for termination attempts on non-running instances
New Features
  • Improved Error Messages: Status-specific error messages guide users on correct actions
    • Clear explanations for why termination is blocked
    • Helpful guidance on what to do next
  • Frontend Error Handling: Enhanced UI feedback for termination failures
    • Status-specific toast messages
    • Better user experience during edge cases
Technical Changes
  • Updated GET /api/deployments/:id/can-terminate endpoint with status validation
  • Updated DELETE /api/deployments/:id endpoint with strict pre-termination checks

v1.9.5 • 2025-12-30

Flexible Storage Options for Data Crunch

Choose your storage size when deploying Data Crunch dedicated instances with new 500GB and 1TB options.

New Features
  • Storage Size Selection: Data Crunch dedicated instances now offer two storage options:
    • 500GB Standard: Default option with $0.137/hr storage pricing
    • 1TB Extended: Expanded storage option with $0.274/hr storage pricing
  • Clear Pricing Display: Storage costs are shown separately in the deployment wizard for transparent pricing
  • Smart Defaults: 500GB storage is automatically selected when configuring Data Crunch dedicated instances
  • Flexible Configuration: Easily switch between storage sizes to match your workload requirements
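To see how the two storage options affect an hourly bill, the arithmetic can be sketched as below. The storage prices are the ones listed above; the GPU price of $2.00/hr is a made-up example, and commission adjustments are left out.

```python
from decimal import Decimal

STORAGE_PRICE_PER_HOUR = {
    "500GB": Decimal("0.137"),
    "1TB": Decimal("0.274"),
}
DEFAULT_STORAGE = "500GB"


def hourly_total(gpu_price_per_hour, storage=DEFAULT_STORAGE):
    """GPU price plus the separately listed storage price, per hour."""
    return gpu_price_per_hour + STORAGE_PRICE_PER_HOUR[storage]


gpu_price = Decimal("2.00")  # hypothetical GPU price, for illustration only
print(hourly_total(gpu_price))         # 2.137
print(hourly_total(gpu_price, "1TB"))  # 2.274
```

Decimal is used instead of float so that sub-cent prices add up exactly.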
Technical Notes
  • Storage property is now included in GPU offer configurations for Data Crunch dedicated instances
  • Pricing calculations properly account for storage costs in commission-adjusted totals

v1.9.4 • 2025-12-25

OTP Verification for Reserved GPU Requests

Enhanced security for bulk GPU reservations with email-based OTP verification.

New Features
  • OTP Verification Flow: Reserved GPU requests now require email verification via one-time password (OTP) for added security
  • Email OTP Delivery: Receive a verification code via email when submitting a reserved GPU request
  • OTP Resend Option: Didn't receive the code? Request a new OTP to be sent to your email
  • Session Management: OTP verification sessions are securely managed with automatic expiration
  • Rate Limiting: OTP requests are rate-limited to prevent abuse and ensure system stability
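A generic email OTP flow with automatic expiration might be sketched as follows. Nothing here reflects the platform's actual implementation: the 6-digit code length, the 10-minute lifetime, and the function names are all assumptions chosen for illustration.

```python
import hmac
import secrets
import time

OTP_TTL_SECONDS = 600  # assumption: codes expire after 10 minutes


def issue_otp(now=None):
    """Create a verification session holding a random 6-digit code."""
    now = time.time() if now is None else now
    code = f"{secrets.randbelow(10**6):06d}"
    return {"code": code, "expires_at": now + OTP_TTL_SECONDS}


def verify_otp(session, submitted, now=None):
    """Accept the code only before expiry, using a constant-time comparison."""
    now = time.time() if now is None else now
    if now >= session["expires_at"]:
        return False
    return hmac.compare_digest(session["code"], submitted)


session = issue_otp()
print(verify_otp(session, session["code"]))  # True while the session is fresh
```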

v1.9.3 • 2025-12-25

API Access Control & Runtime Policies

Controlled API access with whitelist-based key generation and enforced minimum runtime for deployments.

New Features
  • API Key Whitelisting: API key generation is now controlled through a whitelist system. Contact support to request API access
  • Whitelist Status Visibility: Users can now see their API access status clearly in the settings interface
  • Minimum Runtime Enforcement: Deployments now require a 20-minute minimum runtime before termination to prevent accidental early shutdowns
  • Runtime Protection: Destroy action is automatically disabled during the minimum runtime period with a helpful countdown timer
  • System Configuration API: New endpoint (GET /api/config/system) exposes system configuration including minimum runtime requirements
  • Termination Validation: New endpoint (GET /api/deployments/:id/can-terminate) checks if an instance can be safely terminated
Improvements
  • Enhanced Rate Limiting: Deployment creation now limited to 5 deployments per hour per user for better platform stability and abuse prevention
  • Better API Key Handling: Improved handling of empty or undefined API keys in the UI
  • Deployment Wizard Updates: SSH key step now displays minimum runtime requirements upfront
  • Clear User Feedback: Disabled actions now include tooltips explaining why they're unavailable and when they'll be enabled
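One common way to enforce a limit such as 5 deployments per hour per user is a sliding-window counter. The sketch below shows that technique in memory; it is not a description of the platform's limiter, and the class name is hypothetical.

```python
from collections import defaultdict, deque

LIMIT = 5
WINDOW_SECONDS = 3600


class DeploymentRateLimiter:
    """Sliding-window limiter: at most LIMIT events per user per window."""

    def __init__(self):
        self._events = defaultdict(deque)  # user_id -> timestamps in seconds

    def allow(self, user_id, now):
        events = self._events[user_id]
        # Drop timestamps that have left the one-hour window.
        while events and now - events[0] >= WINDOW_SECONDS:
            events.popleft()
        if len(events) >= LIMIT:
            return False
        events.append(now)
        return True


limiter = DeploymentRateLimiter()
results = [limiter.allow("user-1", t) for t in range(6)]
print(results)  # the sixth request inside the same hour is rejected
```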
API Changes
  • GET /api/config/system: Returns system configuration including minimum runtime settings
  • GET /api/deployments/:id/can-terminate: Validates if deployment can be terminated
  • DELETE /api/deployments/:id: Enforces minimum runtime checks before allowing termination
  • POST /api/deployments: Enhanced with rate limiting

v1.9.2 • 2025-12-22

Christmas Promo & Active Discount Display

Time-limited promotional discounts with automatic activation and clear discount visibility.

New Features
  • Christmas Promo: Special time-limited discount offers appear automatically to eligible users with easy one-click activation
  • Active Discount Display: Your current discount percentage and expiration date are now prominently shown on the billing page
  • Clear Savings: See exactly how much you're saving with active discounts applied to your usage

v1.9.1 • 2025-12-20

Reserved GPU Offers

Smarter GPU reservation options with detailed specifications.

Reserved GPU Offers
  • Two Display Modes: Reserved GPU offers now support two presentation styles:
    • Tiers Mode: Compare multiple pricing tiers (up to 3) with flexible commitment options, perfect for understanding pricing at different commitment levels
    • Specs Mode: View detailed technical specifications including vCPUs, RAM, storage, connectivity type, and deployment method (Bare Metal or VM), ideal for understanding exact hardware configurations
  • Regional Availability: GPU offers now display available regions, making it easier to choose deployments close to your users
  • Better Descriptions: Hover over offer descriptions to see full details without cluttering the interface
  • Improved Reservation Flow: When clicking "Reserve" on an offer, relevant details (specs, regions, pricing) are now pre-filled in the reservation form for faster booking

v1.9.0 • 2025-12-18

Major Frontend Refactor

Complete codebase modernization with modular components, centralized hooks, and improved UX.

Routing & Layout
  • Introduce new routing and layout components
  • Split PublicLayout and Sidebar into components/shared
Billing & Payments
  • Refactor AddCreditsTab into modular components
  • Power the billing flow with the new usePayment and usePaymentCallback hooks
  • Move deposit dialog to TopNavBar subcomponents
API Documentation
  • Break ApiDocumentation into composable sections and utils/types
Instances & Deployments
  • Modularize Card and Sidebar into subcomponents
  • Rebuild pages/instances.tsx using new grid/header/empty/error components with infinite loading
Shared Components & UI
  • Extract Sidebar and TopNavBar for reusability
  • Normalize toast hook path (useToast)
UX Improvements
  • All modals now wait for API completion before closing
  • Fixed React Hooks violations with proper hook ordering
  • Team switcher dropdown now works correctly
  • Better loading states and error handling throughout

v1.8.2 • 2025-12-17

Higher Deposit Limits & New Credit Packages

Deposit up to $5,000 with new bulk credit packages for larger workloads.

New Features
  • Increased Deposit Limits: Maximum deposit raised from $1,000 to $5,000 for teams with larger GPU requirements
  • New Credit Packages: Added $500, $1,000, and $5,000 quick purchase options for bulk credits
  • Clear Limit Display: Deposit limits are now prominently displayed in the UI for transparency
  • Package Validation: UI automatically prevents purchases outside allowed limits with helpful error messages
  • Better Formatting: Large numbers now display with proper comma formatting for easier reading
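The limit check and comma formatting described above amount to something like the sketch below. The function names are hypothetical, and because this entry does not state a minimum deposit, the lower bound of zero is only a placeholder.

```python
MAX_DEPOSIT_USD = 5000
NEW_CREDIT_PACKAGES_USD = (500, 1000, 5000)


def is_allowed_deposit(amount_usd):
    # Lower bound is a placeholder; the changelog only states the maximum.
    return 0 < amount_usd <= MAX_DEPOSIT_USD


def format_usd(amount_usd):
    # Thousands separators for easier reading, e.g. 5000 -> "$5,000".
    return f"${amount_usd:,}"


for package in NEW_CREDIT_PACKAGES_USD:
    print(format_usd(package), is_allowed_deposit(package))
```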
Improvements
  • Dynamic Configuration: Deposit limits and referral rewards are now centrally managed for consistency
  • Referral Page Updates: Referral reward amounts and minimum deposit requirements are now displayed accurately throughout
  • Enhanced UX: Improved handling of edge cases when selecting credit packages

v1.8.1 • 2025-12-17

Search Improvements

Enhanced GPU search with better filtering and improved reliability.

Improvements
  • Reserved GPU Search: Search now properly filters reserved GPU offers. When you search for "H100", only H100 models will appear in results
  • More Accurate Results: Improved search matching ensures you see only relevant GPU configurations
  • Better Performance: Enhanced search processing for faster and more reliable results
  • Consistent Experience: Search behavior is now unified across all GPU offer types

v1.8.0 • 2025-12-16

Referral Program Launch

Earn rewards by referring friends! Get a $5 credit when your referral deposits $100.

New Features
  • Referral Dashboard: Access your unique referral link and track your referrals
  • Shareable Links: Easy sharing via Twitter, LinkedIn, and direct link copying
  • Automatic Tracking: Referrals are automatically tracked when someone signs up using your link
  • Instant Rewards: Credits are automatically added to your account when referrals make qualifying deposits
  • Discount Codes: New ability to validate and apply discount codes to your account
  • Enhanced Transaction History: Referral bonus deposits are now clearly labeled in your transaction history

v1.7.0 • 2025-11-20

Email Notifications & Credit Monitoring

Stay informed with professional email notifications and proactive balance alerts.

New Features
  • Enhanced Email Notifications: Improved email design for welcome messages, deployment updates, and account alerts
  • Smart Low Balance Alerts: Receive notifications when your balance reaches $50, $20, $10, or $5 to avoid service interruption
  • Balance Exhaustion Alerts: Get notified when your balance reaches $0 and deployments are automatically stopped
  • Rate-Limited Notifications: Smart notification system prevents email spam with 6-hour intervals between similar alerts
  • Priority Notification Delivery: Account owners and admins receive critical balance alerts first
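The threshold and spacing rules above can be illustrated with two small helpers. This is a sketch, not the notification service itself: the function names are hypothetical, and choosing the lowest crossed threshold when a balance drops past several at once is an assumption.

```python
THRESHOLDS_USD = (50, 20, 10, 5)
MIN_INTERVAL_SECONDS = 6 * 3600  # 6 hours between similar alerts


def crossed_threshold(previous_balance, current_balance):
    """Return the lowest threshold reached by this balance drop, or None."""
    crossed = [t for t in THRESHOLDS_USD if current_balance <= t < previous_balance]
    return min(crossed) if crossed else None


def should_send(last_sent_at, now):
    """Allow an alert if none was sent yet or the 6-hour interval has passed."""
    return last_sent_at is None or now - last_sent_at >= MIN_INTERVAL_SECONDS


print(crossed_threshold(60, 45))  # 50
print(crossed_threshold(45, 30))  # None
print(should_send(0, 3600))       # False
```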

v1.6.0 • 2025-10-29

Transaction Receipts & Improved Navigation

Download payment receipts and enjoy faster page loading with improved pagination.

New Features
  • Receipt Downloads: Download official receipts for all completed deposits (Stripe & CopperX payments)
  • Deposit History: Renamed transaction view to "Deposit History" for better clarity
  • Paginated Views: Faster loading with pagination across deployment history and usage analytics
  • Load More Button: Browse through your instances with easy "Load More" functionality
  • Documentation Access: Quick access to API documentation directly from the navigation menu
Improvements
  • Loading States: Better visual feedback with loading skeletons while data loads
  • Error Handling: Improved error messages and handling throughout billing and instances pages
  • Performance: Faster page loads with optimized data fetching

v1.5.0 • 2025-10-18

Complete UI Redesign

Modern dark theme with sleek blue gradients and enhanced user experience throughout the platform.

Visual Overhaul
  • Modern Dark Theme: New dark interface with blue gradient accents and improved contrast
  • Redesigned Cards: Sleek card designs with unique clipped borders for a futuristic look
  • Enhanced Buttons: Modern call-to-action buttons with hover effects and smooth transitions
  • Improved Typography: Updated fonts and text hierarchy for better readability
Page Improvements
  • Billing Dashboard: Completely redesigned with clear balance overview, streamlined deposit flow, and organized transaction history
  • Instance Management: Improved instance cards with better status indicators and enhanced SSH key management
  • Settings Interface: Modern tabbed navigation with refreshed profile, SSH key, and API key management sections
  • Deployment Wizard: Enhanced GPU browsing experience with improved provider, region, and configuration selection
  • Reserved GPU Portal: New design for bulk GPU reservation requests with better quote workflow
User Experience
  • Faster Loading: Optimized loading and redirect screens
  • Better Navigation: Improved top navigation and sidebar with clearer menu organization
  • Enhanced Dialogs: Modern confirmation dialogs with better visual feedback
  • Consistent Design: Unified design language across all pages and components

v1.4.0 • 2025-09-28

Reserved GPU Pricing & Compliance

New Features
  • Reserved GPU Pricing: Special pricing model for long-term GPU reservations
  • Provider Compliance Status: View compliance and certification status for each provider during deployment
  • Flexible SSH Key Options: Enhanced SSH key support for programmatic deployments
Improvements
  • Better Pricing Display: Improved GPU offer pricing calculations and display
  • Enhanced Security: Stronger SSH key validation and processing
  • System Stability: Improved reward processing reliability

v1.3.0 • 2025-09-02

Onboarding & Discounts

New Features
  • Welcome Onboarding: New user onboarding flow to help you get started quickly
  • Discount Codes: Support for promotional and discount codes
  • Reserved GPU Marketplace: New page for requesting bulk GPU reservations from our provider network
Improvements
  • Accurate Billing: Fixed credit deduction and monitoring for precise billing
  • Better Calculations: Improved accuracy in usage cost calculations

v1.2.0 • 2025-08-12

API Access & New Provider

New Features
  • API Key Management: Create and manage deployments programmatically using API keys
  • Massed Compute Provider: New provider option for more GPU choices and competitive pricing
  • Multi-Node Deployments: Deploy multiple GPU nodes simultaneously with Voltage Park for larger workloads

v1.1.0 • 2025-07-25

Startup Scripts & Discounts

New Features
  • Startup Scripts: Add custom initialization scripts that run automatically when your instance starts
  • Usage Discounts: Discount system for reduced pricing on GPU usage
Improvements
  • Real-Time Pricing: Fixed spot pricing updates for accurate availability and costs
  • Accurate GPU Selection: Improved pricing calculations during GPU selection

v1.0.0 • 2025-07-04

Initial Release

Welcome to Spheron AI GPU Marketplace!

The easiest way to rent high-performance GPUs on-demand from multiple providers.

Core Features
  • Multiple GPU Providers: Access GPUs from Voltage Park, Data Crunch, Sesterce, and TensorDock
  • Simple Authentication: Sign in with Google, GitHub, or email
  • Team Management: Create and manage teams for collaborative work
  • Flexible Billing: Pay with crypto or credit card
  • Usage Analytics: Track your GPU usage and spending in real-time
  • Transaction History: Complete history of all your deposits and usage charges