PyTorch Tutorial: ImageFolder with Code Examples

July 24, 2023

In this tutorial, we'll explore how to use the ImageFolder dataset in PyTorch, a popular deep learning library, to load and preprocess image data for training a neural network. The ImageFolder dataset is useful when dealing with image data organized in a specific folder structure, where each class has its own folder containing images.

Dataset Structure

Before diving into code examples, let's understand the required folder structure for using ImageFolder:

data/
  ├── train/
  |    ├── class_1/
  |    |    ├── image_1.jpg
  |    |    └── image_2.jpg
  |    ├── class_2/
  |    |    ├── image_3.jpg
  |    |    └── image_4.jpg
  |    └── ...
  ├── val/
  |    ├── class_1/
  |    |    ├── image_5.jpg
  |    |    └── image_6.jpg
  |    ├── class_2/
  |    |    ├── image_7.jpg
  |    |    └── image_8.jpg
  |    └── ...

In this example, the images are categorized into classes, and the train and validation sets are organized in separate folders.

Code Examples

1. Importing Libraries

import torch
import torchvision.transforms as transforms
import torchvision.datasets as datasets
import matplotlib.pyplot as plt

2. Data Loading and Preprocessing

# Define the path to your data directory
data_dir = 'path/to/your/data'

# Data augmentation and normalization for training
# You can customize the transformations as needed
train_transform = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

# Normalize the validation set without augmentation
val_transform = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

# Load the data using ImageFolder
train_dataset = datasets.ImageFolder(root=data_dir + '/train', transform=train_transform)
val_dataset = datasets.ImageFolder(root=data_dir + '/val', transform=val_transform)

3. Data Visualization

# Helper function to show an image
def imshow(image):
    image = image.permute(1, 2, 0)  # Reorder dimensions for matplotlib
    plt.imshow(image)
    plt.axis('off')
    plt.show()

# Show a sample image from the training set
sample_image, sample_label = train_dataset[0]
imshow(sample_image)
print(f"Label: {sample_label}")

Conclusion

In this tutorial, we learned how to use the ImageFolder dataset in PyTorch to load and preprocess image data efficiently. By organizing the data into class-specific folders, we can easily create datasets for training deep learning models.

Feel free to experiment with different data augmentations and model architectures to improve the performance of your models. Happy coding!

Search This Blog

The Linux Helpline