TransferEngine enables GPU-to-GPU communication across AWS and Nvidia hardware, allowing trillion-parameter models to run on ...
Baseten launches a new AI training infrastructure platform that gives developers full control, slashes inference costs by up ...
Perplexity launches MoE kernels for trillion-parameter AI, lower latency and higher throughput on AWS EFA and ConnectX-7.
The Register on MSN
Perplexity shows how to run monster AI models more efficiently on aging GPUs, AWS networks
Some clever networking hacks open the door AI search provider Perplexity's research wing has developed a new set of software ...
Nebius introduced Token Factory, a platform for deploying open-source AI models efficiently. Supporting over 60 models, it ...
Advanced AI technology will transform handwritten notes and analog footage into a searchable database for scientists to gain ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results