Mixing Datasets
Last updated
Last updated
Rendered.ai offers a Dataset Mixing service that allows users to combine images from one or more datasets by sampling a specified number of images from each dataset. This can be used to simply create larger datasets, combine synthetic and real datasets for experimentation, or filter classes out of datasets. In this tutorial, we'll show you how to mix two datasets together.
Navigate to the Datasets Library tab of a Workspace. Select the Datasets you want to mix together by clicking the checkbox on the Dataset's row.
Next, click the Mix icon in the Action Bar on the left.
In the Mix Dataset modal, we have the ability to give our dataset a name and description, set tags for the dataset, and set a seed for the job. Each Dataset being mixed will have additional parameters that can be selected:
Maximum Sample Count - this sets the maximum number of samples that will be taken from the dataset based on the filters applied. By default it will sample from all images in the dataset.
Included Classes - this will only select images that have one or more of the classes selected when sampling from the dataset. By default this is left blank, meaning it will sample from all images in the dataset.
When the job parameters are configured, click the Mix button to start the mixing job.
Once the job is complete, we can see the new dataset that includes classes for submarine and the sailboat classes we specified from the Sailboats HDR dataset.
Notice that the total image count for the new dataset is 393 images. This is because we set a Maximum Sample Count of 10 images for the Submarine HDR dataset without any filter applied, giving us 10 images, and a Maximum Sample Count of 1000 images from the Sailboat HDR dataset with an included class filter applied, yielding us only 383 images.