Question 1

What GPU hardware does Qube Compute use?

Accepted Answer

We deploy NVIDIA Vera Rubin R100 NVL72 — the most powerful commercially available GPU system with 1,400+ ExaFLOPS FP4 per rack and NVLink 6.0 fabric. We also offer Groq LPX for sub-10ms real-time inference.

Question 2

How much does GPU cloud cost at Qube Compute?

Accepted Answer

Anchor contracts start at $14/GPU-package-hour (6-24 month terms). Cloud On-Demand is $19/hr and Spot/Night is $25/hr. Our energy cost of $0.048/kWh makes us 3x cheaper than AWS/Azure.

Question 3

Is Qube Compute Sharia-compliant?

Accepted Answer

Yes. We are the world's only AFSA-certified halal GPU cloud. Our Mudaraba profit-sharing structure carries zero debt, avoiding riba (interest), and uses no derivatives, avoiding gharar (excessive uncertainty). All payments are held in Sharia-compliant escrow at Al Hilal Bank.

Question 4

Where is the data center located?

Accepted Answer

Our 8 MW Tier III TIA-942 facility is located in SEZ PIT Alatau, Almaty, Kazakhstan. The Special Economic Zone provides 0% corporate tax, 0% VAT, and 0% personal income tax until 2029.

Question 5

How are payments protected?

Accepted Answer

All prepayments are held in escrow at Al Hilal Bank under AIFC English Common Law. Funds are released only upon verified GPU access delivery. If we fail to deliver, you receive an automatic full refund.

Workload	Rubin R100 NVL72	H100 SXM (8x)	A100 SXM (8x)	Speedup vs H100 SXM (8x)
LLaMA 70B Training (1T tokens)	~3 days	~15 days	~38 days	5x faster
Inference throughput (LLaMA 70B)	800 tok/s	350 tok/s	120 tok/s	2.3x faster
Groq LPX Inference (70B)	3,000 tok/s	350 tok/s	120 tok/s	8.6x faster
Stable Diffusion XL (images/sec)	~180	~45	~15	4x faster
Memory per rack	6.5 TB HBM4	640 GB HBM3	640 GB HBM2e	~10x more capacity
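
As a rough check, the speedup figures in the last column can be reproduced from the other columns. The sketch below is illustrative only: the row keys are made-up names, and it assumes each speedup is measured against the H100 SXM (8x) column.

```python
# Figures copied from the table above. Row keys are hypothetical short names.
# Each tuple is (Qube Compute, H100 SXM 8x, A100 SXM 8x).
ROWS = {
    "train_days": (3, 15, 38),           # days, lower is better
    "inference_tok_s": (800, 350, 120),  # tokens/sec, higher is better
    "groq_tok_s": (3000, 350, 120),      # tokens/sec, higher is better
    "sdxl_img_s": (180, 45, 15),         # images/sec, higher is better
}

# Rows where a smaller number means a better result.
LOWER_IS_BETTER = {"train_days"}


def speedup_vs_h100(name):
    """Return the Qube Compute speedup relative to the H100 column."""
    qube, h100, _a100 = ROWS[name]
    if name in LOWER_IS_BETTER:
        return h100 / qube
    return qube / h100


for name in ROWS:
    print(f"{name}: {speedup_vs_h100(name):.1f}x")
```

The same function works for any additional row, as long as rows measured in elapsed time are listed in `LOWER_IS_BETTER`.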

Feature	Qube Compute	AWS	Azure
GPU Orchestration	Kubernetes + Slurm	EKS only	AKS only
Networking	InfiniBand Quantum-X800	EFA (Elastic Fabric Adapter)	InfiniBand NDR
GPU Interconnect	NVLink 6.0 (full rack)	NVLink (per node)	NVLink (per node)
Energy Cost	$0.048/kWh	$0.12-0.18/kWh	$0.10-0.15/kWh
GPU Hardware	Rubin R100 NVL72	H100 / P5	H100 / ND
Real-Time Inference	Groq LPX (<10ms)	Inferentia2 (50ms+)	N/A (GPU only)
Monitoring	DCIM + MLflow + GPU metrics	CloudWatch	Azure Monitor
Egress Fees	None	$0.09/GB	$0.087/GB
Sharia Compliance	AFSA Certified	No	No
Escrow Protection	Al Hilal Bank	None	None
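
The Energy Cost row can be turned into a ratio with a few lines of arithmetic. This is a minimal sketch, assuming the listed figures are electricity prices in USD per kWh. The function and variable names are hypothetical, and the result compares electricity price only, not the per-hour GPU price.

```python
# Energy prices in USD per kWh, copied from the Energy Cost row above.
QUBE_KWH = 0.048
HYPERSCALER_KWH = {
    "AWS": (0.12, 0.18),    # (low, high) end of the listed range
    "Azure": (0.10, 0.15),
}


def energy_ratio_range(provider):
    """Return (low, high) ratio of a provider's energy price to Qube Compute's."""
    low, high = HYPERSCALER_KWH[provider]
    return low / QUBE_KWH, high / QUBE_KWH


for provider in HYPERSCALER_KWH:
    ratio_low, ratio_high = energy_ratio_range(provider)
    print(f"{provider}: {ratio_low:.2f}x to {ratio_high:.2f}x the Qube Compute energy price")
```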

Infrastructure for Every AI Workload

Training

Large Language Model Training

Fine-Tuning & RLHF

Computer Vision & Diffusion

Inference

Real-Time LLM Inference

Batch Inference

Embedding & RAG Pipelines

Performance by Workload

Qube Compute vs Hyperscalers

Built for Your Industry

Financial Services

Oil & Gas

Healthcare & Pharma

Government & Public Sector

How It Works

Choose Your Workload

Deploy in Minutes

Scale & Monitor

Ready to Deploy?