I've noticed that Shotwell has imported many images twice (e.g. from my camera SD card). Apparently the duplicate detection is buggy once a photo is imported, tagged and then re-imported.
I have "write meta data tags" enabled in the settings. If I import a photo test-images.jpg
and add tags to it the photo will not be picked up by the duplicate detection upon another import of the same file.
The second time the file is imported it will be named test-images-1.jpg
and placed in the library folder as per the active rules (not necessarily into the same folder).
test-images.jpg
and test-images-1.jpg
will have the same image data but due to the added tag/metadata the files are not the same anymore and won't be picked up by searching for duplicates (e.g. md5 hash).
My usage scenario that caused multiple duplicate is as follows:
- I take pictures with my phone
- I import the photos from my phone, add tags but leave the images on the phone as I want to keep them for sharing etc.
- I add further tags to the imported photos
- After some weeks I repeat the import step from the phone and old photos that I have already imported will be imported again (with '-1.jpg' or '-2.jpg' added)
How to clean up the duplicates?
Using a file name based search would be possible but I can't exclude that I have not imported a file ending with -1
to which was not imported as a duplicate.
How can I clean up my photo library? I tried to use the search function in Shotwell but with more than 1000 photos there must be a better, more reliable, less error prone an simpler way.
I'm not to worried about tags getting lost, typically the second import (the duplicate) has no tags applied.