Skip to content

Reserved GPUs

Request bulk GPU allocations, specific locations, or preferential pricing for long-term commitments.

What are reserved GPUs

Reserved GPUs are for requests requiring:

  • Bulk quantities: Multiple GPUs (8, 16, 32, 64+)
  • Specific locations: Regional compliance or data proximity requirements
  • Long-term commitments: Multi-month reservations with preferential pricing
  • Custom configurations: Specialized hardware or network requirements

How it works

  1. Submit request: Fill out the reservation form with your requirements
  2. Team review: Spheron team reviews your request within 24 hours
  3. Receive quotes: Multiple providers compete to offer the best pricing
  4. Choose option: Select the quote that fits your needs
  5. Book meeting (optional): Schedule a consultation for complex requirements

Competitive bidding across providers ensures optimal pricing.

Benefits

Cost savings:
  • 30-50% lower than on-demand hourly rates
  • Bulk discounts for multiple GPUs
  • Long-term commitment pricing advantages
Guaranteed availability:
  • Reserved capacity ensures GPU access
  • No competition with the spot market
  • Predictable resource allocation
Provider competition:
  • Multiple quotes from different providers
  • Compare pricing and terms
  • Choose the best value for your requirements

Submitting a request

Visit app.spheron.ai > Reserved GPU to access the request form.

Reserved GPU Request Form

Fill out request form

GPU model

Select GPU type: H100, H200, A100, B200, RTX 4090, RTX 5090, L40S, A40, L4, V100

  • H100/H200: Highest performance, large-scale training
  • A100/B200: Production-grade training and inference
  • RTX 4090/5090: Development and medium workloads
  • L40S/A40: Balanced price-performance
  • L4/V100: Cost-effective for inference

Unsure? Book a consultation with the team.

Quantity

Enter the number of GPUs needed (8 to 64+)

Duration

Specify the reservation length:

  • Enter a value (e.g., 6)
  • Select a unit (Months or Years)
  • Choose a start time (ASAP or Within 12 Months)

Longer commitments typically receive better pricing.

Location

Select a region: North America, Europe, Asia Pacific, South America, Middle East, Africa, or Any Location

  • Any Location: Maximum provider competition, best pricing
  • Specific region: Required for data compliance or proximity

Start date

Calendar selection for specific deployment timing (optional)

Additional requirements

Specify custom needs (optional):

  • Network requirements (e.g., InfiniBand, NVLink)
  • Compliance needs (e.g., GDPR, HIPAA)
  • Storage requirements
  • Special configurations

Contact information

Provide your contact details so the team can deliver quotes:

  • Name: your full name
  • Email: where quotes and follow-ups are sent
  • Phone (required): include country code (e.g. +1 555-123-4567). Phone is validated and mandatory; the form will not advance to the review step without a valid number.

Review and submit

Review all details before submission. Edit any field if needed.

Click Submit Request to send to the team.

Receive quotes

Within 24 hours:

  • Multiple provider quotes via email
  • Pricing, hardware specs, and availability
  • Terms and conditions

No obligation to accept. Compare and choose the best option.

Support and consultation

For complex requirements, schedule a 30-minute consultation with the Spheron team to discuss GPU selection, quantity, and received quotes. For general questions, submit a request via the platform; responses are typically within 24 hours.

Common use cases

LLM training: Large language models requiring days or weeks of GPU time

Research projects: Academic and lab projects needing predictable long-term costs

Production inference: AI services requiring guaranteed GPU availability

Data processing: Video processing, simulations, large-scale data analysis

Multi-GPU workloads: Distributed training requiring 8+ GPUs with high-speed interconnects

Best practices

Optimize costs:
  • Request the exact quantity needed (submit additional requests later if needed)
  • Choose "Any Location" for competitive bidding unless a specific region is required
  • Longer commitments (6-12+ months) typically offer better per-month pricing
  • Select the appropriate GPU tier (L40S vs H100) based on actual workload needs
Improve quote quality:
  • Provide detailed requirements in additional notes
  • Specify network, storage, and compliance needs upfront
  • Include realistic timelines
  • Book a consultation for complex configurations

Frequently asked questions

Q: Can I modify my request after submission?

A: Contact the team with your updated requirements. The team sends a revised quote.

Q: What if I need additional GPUs later?

A: Submit a new request. Multiple concurrent reservations are supported.

Q: Am I obligated to accept a quote?

A: No. Quotes are non-binding offers. Choose only if the terms meet your needs.

Q: What are the cancellation policies?

A: Policies vary by provider. Review the specific terms included with each quote.

Q: What if my preferred GPU is unavailable?

A: Providers suggest equivalent alternatives. The large provider network ensures options are available.

Q: How much cheaper are reserved vs on-demand?

A: Typically 30-50% savings depending on duration, quantity, and GPU type.

Q: Can I get a quote without committing?

A: Yes. Request quotes with no commitment. Consultation is also free.

What's next