Reserved GPUs
Request bulk GPU allocations, specific locations, or preferential pricing for long-term commitments.
What are reserved GPUs
Reserved GPUs are for requests requiring:
- Bulk quantities: Multiple GPUs (8, 16, 32, 64+)
- Specific locations: Regional compliance or data proximity requirements
- Long-term commitments: Multi-month reservations with preferential pricing
- Custom configurations: Specialized hardware or network requirements
How it works
- Submit request: Fill out the reservation form with your requirements
- Team review: Spheron team reviews your request within 24 hours
- Receive quotes: Multiple providers compete to offer the best pricing
- Choose option: Select the quote that fits your needs
- Book meeting (optional): Schedule a consultation for complex requirements
Because multiple providers bid on each request, quotes are competitively priced.
Benefits
Cost savings:
- 30-50% lower than on-demand hourly rates
- Bulk discounts for multiple GPUs
- Long-term commitment pricing advantages
Guaranteed availability:
- Reserved capacity ensures GPU access
- No competition with the spot market
- Predictable resource allocation
Competitive quotes:
- Multiple quotes from different providers
- Compare pricing and terms
- Choose the best value for your requirements
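To make the 30-50% figure concrete, the sketch below estimates a reserved cost range from an on-demand rate. The function name and the $2.00 per GPU-hour rate are hypothetical and used only for illustration; actual pricing comes from the quotes you receive.

```python
def reserved_cost_range(on_demand_hourly, gpu_count, hours):
    """Estimate (low, high) reserved cost, assuming 30-50% savings."""
    on_demand_total = on_demand_hourly * gpu_count * hours
    # 50% savings gives the lowest estimate; 30% savings gives the highest.
    low = on_demand_total * (100 - 50) / 100
    high = on_demand_total * (100 - 30) / 100
    return low, high


# Hypothetical example: 8 GPUs at $2.00 per GPU-hour for 720 hours (30 days).
on_demand = 2.00 * 8 * 720
low, high = reserved_cost_range(2.00, 8, 720)
print(f"On-demand estimate: ${on_demand:,.2f}")
print(f"Reserved estimate:  ${low:,.2f} to ${high:,.2f}")
```

The range is only as good as the on-demand rate you plug in, so treat it as a budgeting aid rather than a quote.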
Submitting a request
Visit app.spheron.ai > Reserved GPU to access the request form.

Fill out request form
GPU model
Select GPU type: H100, H200, A100, B200, RTX 4090, RTX 5090, L40S, A40, L4, V100
- H100/H200: Highest performance, large-scale training
- A100/B200: Production-grade training and inference
- RTX 4090/5090: Development and medium workloads
- L40S/A40: Balanced price-performance
- L4/V100: Cost-effective for inference
Unsure? Book a consultation with the team.
Quantity
Enter the number of GPUs needed (8 to 64+)
Duration
Specify the reservation length:
- Enter a value (e.g., 6)
- Select a unit (Months or Years)
- Choose a start time (ASAP or Within 12 Months)
Longer commitments typically receive better pricing.
Location
Select a region: North America, Europe, Asia Pacific, South America, Middle East, Africa, or Any Location
- Any Location: Maximum provider competition, best pricing
- Specific region: Required for data compliance or proximity
Start date
Calendar selection for specific deployment timing (optional)
Additional requirements
Specify custom needs (optional):
- Network requirements (e.g., InfiniBand, NVLink)
- Compliance needs (e.g., GDPR, HIPAA)
- Storage requirements
- Special configurations
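Before opening the form, it can help to draft the request as structured data and check it against the options listed above. The sketch below is a hypothetical helper, not a Spheron API: the class name, field names, and the minimum of 8 GPUs (inferred from the "8 to 64+" range) are assumptions.

```python
from dataclasses import dataclass, field

# Option lists copied from the form description above.
GPU_MODELS = {"H100", "H200", "A100", "B200", "RTX 4090", "RTX 5090",
              "L40S", "A40", "L4", "V100"}
REGIONS = {"North America", "Europe", "Asia Pacific", "South America",
           "Middle East", "Africa", "Any Location"}
DURATION_UNITS = {"Months", "Years"}
START_TIMES = {"ASAP", "Within 12 Months"}
MIN_QUANTITY = 8  # assumption: "8 to 64+" implies a minimum of 8 GPUs


@dataclass
class ReservationRequest:
    """Hypothetical container mirroring the request form fields."""
    gpu_model: str
    quantity: int
    duration_value: int
    duration_unit: str
    start_time: str = "ASAP"
    location: str = "Any Location"
    additional_requirements: list = field(default_factory=list)

    def problems(self):
        """Return a list of issues; an empty list means the draft looks complete."""
        issues = []
        if self.gpu_model not in GPU_MODELS:
            issues.append(f"unknown GPU model: {self.gpu_model}")
        if self.quantity < MIN_QUANTITY:
            issues.append(f"quantity below {MIN_QUANTITY}: {self.quantity}")
        if self.duration_value <= 0:
            issues.append("duration must be a positive number")
        if self.duration_unit not in DURATION_UNITS:
            issues.append(f"unknown duration unit: {self.duration_unit}")
        if self.start_time not in START_TIMES:
            issues.append(f"unknown start time: {self.start_time}")
        if self.location not in REGIONS:
            issues.append(f"unknown region: {self.location}")
        return issues


draft = ReservationRequest("H100", 16, 6, "Months",
                           additional_requirements=["InfiniBand", "GDPR"])
print(draft.problems() or "draft looks complete")
```

Collecting issues in a list, rather than raising on the first one, lets you fix everything in a single pass before submitting.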
Contact information
Provide your contact details so the team can deliver quotes:
- Name: your full name
- Email: where quotes and follow-ups are sent
- Phone (required): include country code (e.g., +1 555-123-4567). Phone is validated and mandatory; the form will not advance to the review step without a valid number.
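The form's exact validation rules are not documented here. As a rough pre-check, the sketch below tests whether a number looks like an international number with a country code, loosely following the E.164 convention (a leading +, a non-zero first digit, at most 15 digits). The function name and the lower bound of 8 digits are assumptions.

```python
import re


def is_plausible_phone(raw):
    """Rough check for an international phone number with a country code."""
    # Drop common separators: spaces, hyphens, dots, and parentheses.
    stripped = re.sub(r"[\s\-().]", "", raw)
    # Leading '+', non-zero first digit, 8 to 15 digits in total.
    return re.fullmatch(r"\+[1-9]\d{7,14}", stripped) is not None


print(is_plausible_phone("+1 555-123-4567"))  # country code present
print(is_plausible_phone("555-123-4567"))     # country code missing
```

Passing this check does not guarantee the form will accept the number; it only catches the most common omission, a missing country code.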
Review and submit
Review all details before submission. Edit any field if needed.
Click Submit Request to send to the team.
Receive quotes
Within 24 hours, you receive:
- Multiple provider quotes via email
- Pricing, hardware specs, and availability
- Terms and conditions
No obligation to accept. Compare and choose the best option.
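When several quotes arrive, normalizing them to a total cost over the full term makes them easier to compare. The sketch below assumes each quote states a price per GPU-hour; the `Quote` class, the provider names, the prices, and the 730 hours-per-month figure are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Quote:
    """Hypothetical summary of one provider quote."""
    provider: str
    price_per_gpu_hour: float
    gpu_count: int
    months: int

    def total_cost(self, hours_per_month=730):
        # 730 is an approximate average number of hours in a month.
        return (self.price_per_gpu_hour * self.gpu_count
                * self.months * hours_per_month)


def cheapest(quotes):
    """Return the quote with the lowest total cost over its term."""
    return min(quotes, key=lambda q: q.total_cost())


quotes = [
    Quote("Provider A", 2.10, 8, 6),
    Quote("Provider B", 1.95, 8, 6),
]
for q in quotes:
    print(f"{q.provider}: ${q.total_cost():,.2f}")
print("Lowest total:", cheapest(quotes).provider)
```

Price is only one input. Hardware specs, availability, and the terms and conditions in each quote still need a manual read.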
Support and consultation
For complex requirements, schedule a 30-minute consultation with the Spheron team to discuss GPU selection, quantity, and received quotes. For general questions, submit a request via the platform; responses are typically within 24 hours.
Common use cases
LLM training: Large language models requiring days or weeks of GPU time
Research projects: Academic and lab projects needing predictable long-term costs
Production inference: AI services requiring guaranteed GPU availability
Data processing: Video processing, simulations, large-scale data analysis
Multi-GPU workloads: Distributed training requiring 8+ GPUs with high-speed interconnects
Best practices
Optimize costs:
- Request the exact quantity needed (submit additional requests later if needed)
- Choose "Any Location" for competitive bidding unless a specific region is required
- Longer commitments (6-12+ months) typically offer better per-month pricing
- Select the GPU tier that matches the actual workload (for example, L40S rather than H100 when top-end performance is not required)
Get accurate quotes:
- Provide detailed requirements in additional notes
- Specify network, storage, and compliance needs upfront
- Include realistic timelines
- Book a consultation for complex configurations
Frequently asked questions
Q: Can I modify my request after submission?
A: Yes. Contact the team with your updated requirements, and they will send a revised quote.
Q: What if I need additional GPUs later?
A: Submit a new request. Multiple concurrent reservations are supported.
Q: Am I obligated to accept a quote?
A: No. Quotes are non-binding offers. Accept one only if the terms meet your needs.
Q: What are the cancellation policies?
A: Policies vary by provider. Review the specific terms included with each quote.
Q: What if my preferred GPU is unavailable?
A: Providers may suggest equivalent alternatives. With a large provider network, a suitable option is usually available.
Q: How much cheaper are reserved GPUs than on-demand?
A: Savings are typically 30-50%, depending on duration, quantity, and GPU type.
Q: Can I get a quote without committing?
A: Yes. Request quotes with no commitment. Consultation is also free.
What's next
- Getting Started: Deploy on-demand instances
- Quick Start: Fast deployment guide
- Cost Optimization: GPU tier selection and spend strategies
- Billing: Credit management and pricing
- General Info: Support and official channels