This Website contains a comprehensive dataset for underwater change detection with thousands of handsegmented groundtruth images. Many of the special difficulties of the underwater environment are depicted in the videos, for example: marine snow, caustics or color attenuation. The moving objects in the videos are always fishes, which swim in swarms or separately.
The evaluation of the set is done in 100 frames per video and each groundtruth images contains three categories.
background = white (value: 255)
foreground = black (value: 0)
unsure = grey (value: 120)


All 5 videos are available here.
Some first results for three different approaches can be seen in the table below. The Algorithms were tested on the Marine Snow video of the Dataset.
Algorithm | True Negatives | True Positives | False Negatives | False Positives | F1-Score | MCC |
GSM | 189,927,339 | 14,378,161 | 3,209,105 | 1,918,995 | 0.8361 | 0.8487 |
MOG | 189,109,247 | 14,516,541 | 3,070,725 | 2,737,087 | 0.8182 | 0.8333 |
KNN | 190,710,197 | 13,326,863 | 4,260,403 | 1,136,137 | 0.8224 | 0.8316 |
MoSeg | 194,489,198 | 9,462,542 | 4,571,069 | 910,791 | 0.7717 | 0.7754 |
Algorithm | True Negatives | True Positives | False Negatives | False Positives | F1-Score | MCC |
GSM |
190,377,856 | 14,980,737 | 2,606,529 | 1,468,478 | 0.8802 | 0.8702 |
MOG |
190,281,349 | 15,181,335 | 2,405,931 | 1,564,985 | 0.8843 | 0.8743 |
KNN |
191,511,427 | 14,006,979 | 3,580,287 | 334,907 | 0.8773 | 0.8726 |
MoSeg | 178,455,951 | 11,185,284 | 2,570,997 | 12,892,596 | 0.5795 | 0.5912 |
GSM:
M. Radolko, E. Gutzeit "Video Segmentation via a Gaussian Switch Background Model and Higher Order Markov Random Fields", VISAPP 2015 - Volume I : Proceedings of the International Conference on Computer Vision Theory and Applications. SciTePress, 2015, pp. 537-544
MOG:
Z. Zivkovic. “Improved adaptive Gausian mixture model for background subtraction”, International Conference Pattern Recognition, UK, August, 2004, http://www.zoranz.net/Publications/zivkovic2004ICPR.pdf. The code is very fast and performs also shadow detection. Number of Gausssian components is adapted per pixel.
KNN:
Z.Zivkovic, F. van der Heijden. “Efficient Adaptive Density Estimation per Image Pixel for the Task of Background Subtraction”, Pattern Recognition Letters, vol. 27, no. 7, pages 773-780, 2006.
MoSeg:
P. Ochs, J. Malik and T. Brox, "Segmentation of Moving Objects by Long Term Video Analysis," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 36, no. 6, pp. 1187-1200, June 2014.