Weekly updates from our team on topics like large-scale deep learning training, cloud GPU infrastructure, hyperparameter tuning, and more.
FEB 12, 2024
Short summaries of self-composing reasoning structures, LLMs for math and chess, plus other highlights from the week.
FEB 05, 2024
Short summaries of MESA and Co-Designing Model Architectures with Hardware, plus other highlights from the week.
JAN 31, 2024
We are excited to announce the 0.27.1 release of the Determined deep learning training platform!
JAN 31, 2024
How to Finetune a TinyLlama-1.1B Model on Text-to-SQL
JAN 29, 2024
Short summaries of MambaByte, Multimodal Pathway, and CrossMAE, plus other highlights from the week.
JAN 19, 2024
Visual Mamba for image processing, deceptive LLMs, and a geometry model from DeepMind caught our eye last week.
JAN 12, 2024
A new open-source library for faster LLM finetuning, a multimodal guided visual search algorithm, and a new unlearning task for LLMs caught our eye last week.
JAN 08, 2024
Here’s what happened in AI over the past few weeks.
DEC 19, 2023
What we took away from attending NeurIPS ’23 last week.