Other recommendations for your business

xFusion 1288H V5 AI Inference 1U Server - Dual Xeon Platinum 8462Y/1TB DDR5/8×3.2TB U.2 NVMe/4×A100 PCIe/200Gb InfiniBand/Inference Optimized Platform

US$1,900 - $2,600
Model:
Secure payments
Every payment you make on SysPartsDirect.com is secured with strict SSL encryption and PCI DSS data protection protocols
Standard refund policy
Claim a refund if your order doesn't ship, is missing, or arrives with product issues
SysPartsDirect.com protects all your orders placed and paid on the platform with Trade Assurance

Specifications

itemvalue
Optimization DirectionAI Model Inference Optimization
Model Number1288H V5-AI
Platform TypeAI Inference Dedicated Platform
Processor SeriesIntel Xeon Platinum Series
Processor ModelXeon Platinum 8462Y 2.8GHz
Core Count32 Cores 64 Threads
Base Frequency2.8GHz
Max Turbo Frequency4.1GHz
AI AccelerationAMX and DL Boost
Memory TechnologyDDR5-4800 ECC RDIMM
Memory Capacity1TB(16×64GB)
Memory Bandwidth307.2GB/s
GPU ModelNVIDIA A100 40GB PCIe
GPU Quantity4 GPUs
Total VRAM160GB GDDR6
Tensor Cores3rd Generation Tensor Cores
Storage TypeU.2 NVMe SSD
Storage Capacity8×3.2TB NVMe SSD
Read/Write PerformanceSequential Read 7000MB/s
IOPS PerformanceRandom Read 1.5M IOPS
Networking Technology200Gb InfiniBand HDR
Network ControllerMellanox ConnectX-6 DX
RDMA SupportRoCE and iWARP
Inference Performance10000fps@ResNet-50
Precision SupportFP16, INT8, FP8
Model DeploymentTriton Inference Server
AI FrameworksTensorRT, OpenVINO
Container SupportNGC Optimized Containers
Orchestration ToolsKubernetes AI Edition
Power Specification3200W Platinum Level
Power ManagementGPU-Aware Power Management
Cooling DesignForced Air Cooling Solution
Thermal DesignGPU-Directed Airflow Ducts
AI ManagementModel Performance Monitoring
Resource SchedulingIntelligent GPU Resource Allocation
ThroughputReal-time Inference Throughput
Latency PerformanceEnd-to-End Inference Latency <5ms
Computer VisionReal-time Image Recognition
NLPIntelligent Dialogue Systems
Recommendation SystemsReal-time Personalized Recommendations
Cloud IntegrationHybrid AI Inference Deployment
Edge ReadinessEdge Inference Optimized

Related Products