Blogs

SEP 11, 2024

Finding the best LoRA parameters

By Sze Wai Yuen, Liam Li, Kevin Musgrave

How alpha, rank, and learning rate affect model accuracy, and whether rank-stabilized LoRA helps.

LEARN MORE

AUG 12, 2024

Summer '24 Conference Recap

By Isha Ghodgaonkar, Kevin Musgrave

Highlights from CVPR and ICML 2024.

LEARN MORE

JUL 17, 2024

How does Video Generation work?

By Isha Ghodgaonkar

Sora, Make-a-Video and Imagen Video explained.

LEARN MORE

JUL 10, 2024

Tensor Parallelism in Three Levels of Difficulty

By Garrett Goon, Kevin Musgrave

Tensor parallelism, from beginner to expert using PyTorch.

LEARN MORE

JUL 02, 2024

RAG for an LLM-powered hologram

By Kevin Musgrave

How we implemented Retrieval Augmented Generation for the Antonio Nearly hologram, including the data sources and ingestion, vector DB, embedding model, reranker, and query rephrasing.

LEARN MORE

JUN 17, 2024

GenAI studio is now available!

By Isha Ghodgaonkar

Announcing GenAI studio general availability for Determined Enterprise users.

LEARN MORE

JUN 12, 2024

Activation Memory: A Deep Dive using PyTorch

By Garrett Goon, Kevin Musgrave

How to measure activation memory in PyTorch, and why the choice of activation function matters.

LEARN MORE

MAY 22, 2024

Mistral 7B vs. Llama-2 7B: Lightning Round using GenAI studio

By Isha Ghodgaonkar

Let’s compare how Mistral-7b-instruct-v0.2 and Llama 2-7B-chat perform on some basic LLM prompts.

LEARN MORE

MAY 15, 2024

Activation Memory: What is it?

By Garrett Goon, Kevin Musgrave

An introduction to activation memory, and how it affects GPU memory consumption during model training.

LEARN MORE

MAY 01, 2024

3D Diffuse Glioma Segmentation for Early Cancer Detection: Spotlight Demo

By Isha Ghodgaonkar, Alejandro Morales Martinez

Using HPE’s AI platform to develop an early brain cancer detection machine learning model.

LEARN MORE