TransferEngine enables GPU-to-GPU communication across AWS and Nvidia hardware, allowing trillion-parameter models to run on ...
Baseten launches a new AI training infrastructure platform that gives developers full control, slashes inference costs by up ...
Perplexity launches MoE kernels for trillion-parameter AI, lower latency and higher throughput on AWS EFA and ConnectX-7.
Some clever networking hacks open the door AI search provider Perplexity's research wing has developed a new set of software ...
Nebius introduced Token Factory, a platform for deploying open-source AI models efficiently. Supporting over 60 models, it ...
Advanced AI technology will transform handwritten notes and analog footage into a searchable database for scientists to gain ...