Weekly updates from our team on topics like large-scale deep learning training, cloud GPU infrastructure, hyperparameter tuning, and more.
SEP 11, 2024
How LoRA's alpha, rank, and learning rate affect model accuracy, and whether rank-stabilized LoRA helps.
JUL 10, 2024
Tensor parallelism in PyTorch, from beginner to expert.
JUN 12, 2024
How to measure activation memory in PyTorch, and why the choice of activation function matters.
MAY 22, 2024
Let’s compare how Mistral-7B-Instruct-v0.2 and Llama-2-7B-chat perform on some basic LLM prompts.
MAY 15, 2024
An introduction to activation memory, and how it affects GPU memory consumption during model training.
MAY 01, 2024
Using HPE’s AI platform to develop a machine learning model for early detection of brain cancer.
APR 24, 2024
LLM alignment using Direct Preference Optimization, an alternative to RLHF.
FEB 28, 2024
How to finetune Mistral-7B using HuggingFace and Determined.
OCT 30, 2023
A gentle introduction to large language model prompting methods and terminology.