•
In this project, I trained my own U-Net model to perform image segmentation on the Carvana Image Masking Challenge and the Dubai Aerial Imagery datasets.
What is image segmentation? It is a process of partitioning, or dividing, an image into multiple segments, thus called "image segmentation". The goal of image segmentation is to identify objects in an image.
Figure 1: Side by side visualization of the masked image and its true image
The true images come with their own masked images. These masked images contained the objects that have been manually segmented, or labelled, by humans. The goal of the model is to locate objects in an image given the image and its masked image.
U-Net is a CNN architecture that consists of multiple encoders and encoders that are connected by skip connections. As result, the entire architecture looks like a "U" shape.
U-Net Architecture from Wikipedia (Source: https://en.wikipedia.org/wiki/U-Net)
With 50 iterations, the model was only able to reduce its loss to . In other words, the model still needs more iterations and parameter tuning.
Figure 2: Satellite predicted image vs. true image
Figure 3: Satellite predicted image vs. true image
Unlike the result for Dubai Aerial Imagery, the model was able to reduce its loss to and DICE score to with only 50 iterations.
Figure 4: Predicted mask vs. true image
Figure 5: Predicted mask vs. true image