## Einops: ViT-Style Patch Embedding

### Description
Vision Transformers (ViT) process images by first breaking them down into a sequence of flattened patches. The `einops` library is perfectly suited for this task, offering a...
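
A minimal sketch of the patch rearrangement, assuming a 224×224 RGB input and a patch size of 16 (all names and sizes here are illustrative, not part of the exercise statement):

```python
import torch
from einops import rearrange

x = torch.randn(2, 3, 224, 224)   # (B, C, H, W); sizes are illustrative
p = 16                            # assumed patch size

# Split H and W into a 14x14 grid of 16x16 patches, then flatten each
# patch together with its channels: (B, C, H, W) -> (B, num_patches, patch_dim)
patches = rearrange(x, 'b c (h p1) (w p2) -> b (h w) (p1 p2 c)', p1=p, p2=p)
print(patches.shape)              # torch.Size([2, 196, 768])
```

A full ViT would follow this with a learnable linear projection from `patch_dim` to the model dimension, omitted here.
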
## Tensor Manipulation: Implement Layer Normalization

### Description
Layer Normalization is a key component in many modern deep learning models, especially Transformers. It normalizes the inputs across the feature dimension. Your task is to...
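
One possible sketch, deliberately omitting the learnable scale and shift (`gamma`/`beta`) that a full `nn.LayerNorm` adds:

```python
import torch

def layer_norm(x: torch.Tensor, eps: float = 1e-5) -> torch.Tensor:
    # Normalize over the last (feature) dimension
    mean = x.mean(dim=-1, keepdim=True)
    var = x.var(dim=-1, keepdim=True, unbiased=False)  # biased variance, as LayerNorm uses
    return (x - mean) / torch.sqrt(var + eps)

x = torch.randn(4, 10, 64)
out = layer_norm(x)
# Sanity check against the built-in (no affine parameters)
print(torch.allclose(out, torch.nn.functional.layer_norm(x, (64,)), atol=1e-5))  # True
```
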
## Tensor Manipulation: Numerically Stable Softmax

### Description
Implement the softmax function, which converts a vector of numbers into a probability distribution. A naive implementation can be numerically unstable if the input values are very...
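
A sketch of the standard max-subtraction trick: shifting by the row maximum leaves the result mathematically unchanged but keeps `exp()` from overflowing:

```python
import torch

def stable_softmax(x: torch.Tensor, dim: int = -1) -> torch.Tensor:
    shifted = x - x.max(dim=dim, keepdim=True).values  # largest value becomes 0
    exp = shifted.exp()
    return exp / exp.sum(dim=dim, keepdim=True)

x = torch.tensor([1000.0, 1001.0, 1002.0])  # a naive exp() here returns inf
print(stable_softmax(x))                    # tensor([0.0900, 0.2447, 0.6652])
```
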
## Einops: Space-to-Depth Transformation

### Description
Space-to-depth is an operation that rearranges blocks of spatial data into the channel dimension. For an input of shape `(B, C, H, W)` and a block size `S`, the output will be `(B,...
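
A sketch with illustrative sizes (`B=1, C=3, H=W=8`, block size 2):

```python
import torch
from einops import rearrange

x = torch.randn(1, 3, 8, 8)   # (B, C, H, W)
s = 2                         # block size

# Fold each SxS spatial block into the channel dimension:
# (B, C, H, W) -> (B, C*S*S, H/S, W/S)
out = rearrange(x, 'b c (h s1) (w s2) -> b (c s1 s2) h w', s1=s, s2=s)
print(out.shape)              # torch.Size([1, 12, 4, 4])
```
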
## Tensor Manipulation: Causal Mask for Transformers

### Description
In decoder-style Transformers (like GPT), we need a "causal" or "look-ahead" mask to prevent positions from attending to subsequent positions. This is typically a lower-triangular...
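
A minimal sketch using `torch.tril`; the sequence length is illustrative:

```python
import torch

seq_len = 5
# Lower-triangular boolean mask: position i may attend to positions j <= i
mask = torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))

# Typical use: block disallowed positions before the softmax
scores = torch.randn(seq_len, seq_len)
scores = scores.masked_fill(~mask, float('-inf'))
```
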
## Einops: Reversing a Sequence

### Description
Reversing the order of elements in a sequence is a common operation. While it can be done with `torch.flip`, let's practice doing it with `einops` for a different...
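
The exercise does not state which `einops` trick it has in mind, and `rearrange` patterns by themselves only reshape and transpose axes, so as a hedge this sketch shows the `torch.flip` baseline next to an equivalent explicit index-based reverse:

```python
import torch

x = torch.arange(5)                        # tensor([0, 1, 2, 3, 4])

# Baseline mentioned in the exercise
print(torch.flip(x, dims=[0]))             # tensor([4, 3, 2, 1, 0])

# Equivalent explicit-index version
idx = torch.arange(x.size(0) - 1, -1, -1)
print(x[idx])                              # tensor([4, 3, 2, 1, 0])
```
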
## Tensor Manipulation: Dropout Layer

### Description
Implement the dropout layer from scratch. During training, dropout randomly zeroes some of the elements of the input tensor with probability `p`. The remaining elements are scaled...
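
A sketch of "inverted dropout", where survivors are scaled at training time so no rescaling is needed at inference:

```python
import torch

def dropout(x: torch.Tensor, p: float = 0.5, training: bool = True) -> torch.Tensor:
    if not training or p == 0.0:
        return x
    # Bernoulli keep-mask; each element survives with probability 1 - p
    keep = (torch.rand_like(x) >= p).to(x.dtype)
    return x * keep / (1.0 - p)  # scale survivors by 1/(1-p)

x = torch.ones(2, 5)
print(dropout(x, p=0.5))  # a mix of 0.0 and 2.0, positions vary per run
```
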
## Tensor Manipulation: One-Hot Encoding

### Description
Implement one-hot encoding for a batch of class indices. Given a 1D tensor of integer labels, create a 2D tensor where each row is a vector of zeros except for a `1` at the index...
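
One way to do it with `scatter_`; the labels here are illustrative:

```python
import torch

labels = torch.tensor([2, 0, 3, 1])   # (N,) integer class indices
num_classes = 4

one_hot = torch.zeros(labels.size(0), num_classes)
# Write a 1.0 into column labels[i] of row i
one_hot.scatter_(1, labels.unsqueeze(1), 1.0)
print(one_hot)
# Built-in equivalent: torch.nn.functional.one_hot(labels, num_classes)
```
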
## Einops: Depth-to-Space Transformation

### Description
Depth-to-space is the inverse of the space-to-depth operation. It rearranges features from the channel dimension into spatial blocks, increasing spatial resolution and decreasing...
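
A sketch that exactly inverts the space-to-depth pattern shown earlier:

```python
import torch
from einops import rearrange

x = torch.randn(1, 12, 4, 4)   # (B, C*S*S, H, W)
s = 2                          # block size

# Unfold channels back into SxS spatial blocks:
# (B, C*S*S, H, W) -> (B, C, H*S, W*S)
out = rearrange(x, 'b (c s1 s2) h w -> b c (h s1) (w s2)', s1=s, s2=s)
print(out.shape)               # torch.Size([1, 3, 8, 8])
```
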
## Einops: Batched Matrix Multiplication

### Description
Perform a batched matrix multiplication `(B, N, D) @ (B, D, M) -> (B, N, M)` using `einops.einsum`. While `torch.bmm` is the standard, this is a good exercise to understand how...
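
A sketch with illustrative shapes, checked against `torch.bmm` (`einops.einsum` is available in recent einops versions):

```python
import torch
from einops import einsum

a = torch.randn(8, 10, 32)   # (B, N, D)
b = torch.randn(8, 32, 16)   # (B, D, M)

# 'b' appears on both inputs and the output, so it is batched;
# 'd' is absent from the output, so it is summed over
out = einsum(a, b, 'b n d, b d m -> b n m')
print(out.shape)                                         # torch.Size([8, 10, 16])
print(torch.allclose(out, torch.bmm(a, b), atol=1e-5))   # True
```
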
## Tensor Manipulation: Sinusoidal Positional Encoding

### Description
Implement the sinusoidal positional encoding from the "Attention Is All You Need" paper. This technique adds information about the position of tokens in a sequence by injecting a...
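
A sketch following the paper's formulation, where even dimensions use sine and odd dimensions use cosine:

```python
import torch

def sinusoidal_encoding(seq_len: int, d_model: int) -> torch.Tensor:
    pos = torch.arange(seq_len, dtype=torch.float32).unsqueeze(1)  # (L, 1)
    two_i = torch.arange(0, d_model, 2, dtype=torch.float32)       # even indices 2i
    angle = pos / (10000.0 ** (two_i / d_model))                   # (L, d_model/2)
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(angle)   # PE(pos, 2i)   = sin(pos / 10000^(2i/d))
    pe[:, 1::2] = torch.cos(angle)   # PE(pos, 2i+1) = cos(pos / 10000^(2i/d))
    return pe

print(sinusoidal_encoding(50, 128).shape)   # torch.Size([50, 128])
```
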
## Einops: Squeeze and Unsqueeze

### Description
`torch.squeeze` and `torch.unsqueeze` are commonly used for removing or adding dimensions of size one. `einops.rearrange` can do this as well, often with more clarity by explicitly naming...
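
A short sketch; naming the size-1 axes as literal `1`s makes the intent explicit and errors out if a dimension is not actually 1:

```python
import torch
from einops import rearrange

x = torch.randn(1, 3, 1, 5)

# Drop both size-1 axes (fails loudly if they are not size 1)
squeezed = rearrange(x, '1 c 1 w -> c w')
print(squeezed.shape)     # torch.Size([3, 5])

# Add a new leading size-1 axis
unsqueezed = rearrange(squeezed, 'c w -> 1 c w')
print(unsqueezed.shape)   # torch.Size([1, 3, 5])
```
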
## Tensor Manipulation: Manual Convolutional Filter

### Description
Understand the mechanics of a 2D convolution by manually applying a 3x3 filter to a small, single-channel image. This exercise helps demystify what happens inside a...
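
A sketch on a toy 5x5 image with an assumed Laplacian-style kernel; like `nn.Conv2d`, it computes cross-correlation (the kernel is not flipped):

```python
import torch

image = torch.arange(25, dtype=torch.float32).reshape(5, 5)
kernel = torch.tensor([[ 0., -1.,  0.],
                       [-1.,  4., -1.],
                       [ 0., -1.,  0.]])

h, w = image.shape
out = torch.zeros(h - 2, w - 2)   # 'valid' output: no padding, stride 1
for i in range(h - 2):
    for j in range(w - 2):
        # Multiply the 3x3 window element-wise with the kernel and sum
        out[i, j] = (image[i:i+3, j:j+3] * kernel).sum()
print(out)
```
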
## Tensor Manipulation: Using `gather` for selection

### Description
`torch.gather` is a powerful but sometimes confusing function for selecting elements from a tensor based on an index tensor. Your task is to use it to select specific elements...
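
A small sketch: pick one element per row, where `index` has the same number of dimensions as the input:

```python
import torch

x = torch.tensor([[10, 11, 12],
                  [20, 21, 22],
                  [30, 31, 32]])
idx = torch.tensor([[2], [0], [1]])   # column to take from each row

# With dim=1: out[i, j] = x[i, idx[i, j]]
picked = torch.gather(x, dim=1, index=idx)
print(picked.squeeze(1))              # tensor([12, 20, 31])
```
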
## Tensor Manipulation: Creating `unfold` with `as_strided`

### Description
**Warning: `as_strided` is an advanced and potentially unsafe operation that can crash your program if used incorrectly, as it creates a view on memory without checks.** With that...
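
A sketch of 1D sliding windows, verified against the built-in `Tensor.unfold`; the window size and step are illustrative:

```python
import torch

x = torch.arange(8)
window, step = 3, 1

n_windows = (x.numel() - window) // step + 1
s = x.stride(0)
# Overlapping windows as a zero-copy view; strides are in elements.
# Getting size/stride wrong here reads out-of-bounds memory, hence the warning.
windows = x.as_strided(size=(n_windows, window), stride=(step * s, s))
print(windows)
print(torch.equal(windows, x.unfold(0, window, step)))  # True
```
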
## Einops: Repeat for Tiling/Broadcasting

### Description
The `einops.repeat` function is a powerful and readable alternative to `Tensor.expand` or `torch.tile` for broadcasting or repeating a tensor along new or existing dimensions.
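
Two common uses, sketched with a tiny tensor; note how the pattern distinguishes tiling along a new axis from repeating elements in place:

```python
import torch
from einops import repeat

x = torch.tensor([1, 2, 3])

# Tile along a new leading axis
print(repeat(x, 'w -> h w', h=2))    # [[1, 2, 3], [1, 2, 3]]

# Repeat each element in place
print(repeat(x, 'w -> (w r)', r=2))  # [1, 1, 2, 2, 3, 3]
```
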
## Tensor Manipulation: Using `scatter_add_`

### Description
`torch.scatter_add_` is used to add values into a tensor at specified indices. It's useful in cases like converting an edge list in a graph to an adjacency matrix or pooling...
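
A pooling-style sketch: values with the same target index accumulate rather than overwrite. For the adjacency-matrix case, the same call works on a flattened matrix with indices `src * num_nodes + dst`:

```python
import torch

values = torch.tensor([1.0, 2.0, 3.0, 4.0])
bins = torch.tensor([0, 1, 0, 2])   # target bucket for each value

out = torch.zeros(3)
# out[bins[i]] += values[i]; repeated indices sum
out.scatter_add_(0, bins, values)
print(out)                          # tensor([4., 2., 4.])
```
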