Skip to content

Latest commit

 

History

History
22 lines (13 loc) · 2.36 KB

few_notes.md

File metadata and controls

22 lines (13 loc) · 2.36 KB

The DataLoader class from the torch.utils.data module is a utility class in PyTorch that combines a dataset and a sampler, providing an iterable over the given dataset. It is commonly used for loading and iterating over batches of data during the training or evaluation of machine learning models.

The DataLoader class supports both map-style and iterable-style datasets. Map-style datasets are classes that implement the getitem and len methods, allowing random access to individual samples. Iterable-style datasets are classes that implement the iter method, allowing sequential access to samples.

The DataLoader class provides various options and functionalities, including:

Specifying the batch size: You can specify how many samples per batch to load. Shuffling the data: You can choose to shuffle the data at every epoch to introduce randomness. Customizing the loading order: You can provide a sampler object to define the strategy for drawing samples from the dataset. Parallel data loading: You can specify the number of subprocesses to use for loading data in parallel, which can speed up the data loading process. Automatic batching: The DataLoader class can automatically collate a list of samples into a mini-batch of tensors using a collate function. Memory pinning: If enabled, the DataLoader will copy tensors into pinned memory (e.g., CUDA pinned memory) before returning them, which can improve performance when transferring data to devices like GPUs. Handling incomplete batches: You can choose to drop the last incomplete batch if the dataset size is not divisible by the batch size, or keep it as a smaller batch.

  • The DataLoader class is a fundamental component in PyTorch for efficiently loading and processing data in machine learning workflows. It provides a convenient and flexible interface for working with datasets and enables efficient training and evaluation of models.

  • where excatly are the feature maps being flattened in assignment 9?

  • Transformations are commonly used in computer vision tasks to preprocess or augment images. They can include operations such as resizing, cropping, normalization, or applying various image filters. By applying transformations, you can modify the images in a way that is beneficial for your specific task, such as improving the quality, adjusting the size, or enhancing certain features, and the transform class does all that.