eBPF, shared SmartNICs, and smart scheduling have improved reliability and cut costs Chinese web giant Alibaba has reduced network outages by 92 percent, cut load balancing costs by 18.9 percent, and found ways to improve SmartNIC performance by offloading workloads to idle infrastructure....
Articles
Better scheduling and resource-sharing for inferencing workloads using multiple models, not a training breakthrough Chinese tech giant Alibaba has published a paper detailing scheduling tech it has used to achieve impressive utilization improvements across the GPU fleet it uses to power inferencing workloads - which is nice, but not a breakthrough that will worry AI investors....
Chinese giant adds to No AI bubble' babble by citing oversubscribed infrastructure and surging demand China's Alibaba Cloud can't deploy servers fast enough to keep up with demand for AI, so is rationing access to GPUs so that customers who use all of its services enjoy priority access....
1