Reserved GPUs

Request bulk GPU allocations, specific locations, or preferential pricing for long-term commitments.

What are reserved GPUs

Reserved GPUs are for requests requiring:

Bulk quantities: Multiple GPUs (8, 16, 32, 64+)
Specific locations: Regional compliance or data proximity requirements
Long-term commitments: Multi-month reservations with preferential pricing
Custom configurations: Specialized hardware or network requirements

How it works

Submit request: Fill out the reservation form with your requirements
Team review: Spheron team reviews your request within 24 hours
Receive quotes: Multiple providers compete to offer the best pricing
Choose option: Select the quote that fits your needs
Book meeting (optional): Schedule a consultation for complex requirements

Competitive bidding across providers ensures optimal pricing.

Benefits

Cost savings:

30-50% lower than on-demand hourly rates
Bulk discounts for multiple GPUs
Long-term commitment pricing advantages

Guaranteed availability:

Reserved capacity ensures GPU access
No competition with the spot market
Predictable resource allocation

Provider competition:

Multiple quotes from different providers
Compare pricing and terms
Choose the best value for your requirements

Submitting a request

Visit app.spheron.ai > Reserved GPU to access the request form.

Reserved GPU Request Form

Fill out request form

GPU model

Select GPU type: H100, H200, A100, B200, RTX 4090, RTX 5090, L40S, A40, L4, V100

H100/H200: Highest performance, large-scale training
A100/B200: Production-grade training and inference
RTX 4090/5090: Development and medium workloads
L40S/A40: Balanced price-performance
L4/V100: Cost-effective for inference

Unsure? Book a consultation with the team.

Quantity

Enter the number of GPUs needed (8 to 64+)

Duration

Specify the reservation length:

Enter a value (e.g., 6)
Select a unit (Months or Years)
Choose a start time (ASAP or Within 12 Months)

Longer commitments typically receive better pricing.

Location

Select a region: North America, Europe, Asia Pacific, South America, Middle East, Africa, or Any Location

Any Location: Maximum provider competition, best pricing
Specific region: Required for data compliance or proximity

Start date

Calendar selection for specific deployment timing (optional)

Additional requirements

Specify custom needs (optional):

Network requirements (e.g., InfiniBand, NVLink)
Compliance needs (e.g., GDPR, HIPAA)
Storage requirements
Special configurations

Contact information

Provide your contact details so the team can deliver quotes:

Name: your full name
Email: where quotes and follow-ups are sent
Phone (required): include country code (e.g. +1 555-123-4567). Phone is validated and mandatory; the form will not advance to the review step without a valid number.

Review and submit

Review all details before submission. Edit any field if needed.

Click Submit Request to send to the team.

Receive quotes

Within 24 hours:

Multiple provider quotes via email
Pricing, hardware specs, and availability
Terms and conditions

No obligation to accept. Compare and choose the best option.

Support and consultation

For complex requirements, schedule a 30-minute consultation with the Spheron team to discuss GPU selection, quantity, and received quotes. For general questions, submit a request via the platform; responses are typically within 24 hours.

Common use cases

LLM training: Large language models requiring days or weeks of GPU time

Research projects: Academic and lab projects needing predictable long-term costs

Production inference: AI services requiring guaranteed GPU availability

Data processing: Video processing, simulations, large-scale data analysis

Multi-GPU workloads: Distributed training requiring 8+ GPUs with high-speed interconnects

Best practices

Optimize costs:

Request the exact quantity needed (submit additional requests later if needed)
Choose "Any Location" for competitive bidding unless a specific region is required
Longer commitments (6-12+ months) typically offer better per-month pricing
Select the appropriate GPU tier (L40S vs H100) based on actual workload needs

Improve quote quality:

Provide detailed requirements in additional notes
Specify network, storage, and compliance needs upfront
Include realistic timelines
Book a consultation for complex configurations

Frequently asked questions

Q: Can I modify my request after submission?

A: Contact the team with your updated requirements. The team sends a revised quote.

Q: What if I need additional GPUs later?

A: Submit a new request. Multiple concurrent reservations are supported.

Q: Am I obligated to accept a quote?

A: No. Quotes are non-binding offers. Choose only if the terms meet your needs.

Q: What are the cancellation policies?

A: Policies vary by provider. Review the specific terms included with each quote.

Q: What if my preferred GPU is unavailable?

A: Providers suggest equivalent alternatives. The large provider network ensures options are available.

Q: How much cheaper are reserved vs on-demand?

A: Typically 30-50% savings depending on duration, quantity, and GPU type.

Q: Can I get a quote without committing?

A: Yes. Request quotes with no commitment. Consultation is also free.

What's next

Getting Started: Deploy on-demand instances
Quick Start: Fast deployment guide
Cost Optimization: GPU tier selection and spend strategies
Billing: Credit management and pricing
General Info: Support and official channels