Photo De-Duplication
Robert ace574eac9 Made bash array for invalid/size inner file issues. 8 months ago
.gitignore Check for tooSmall <1KB and tooLarge >10MB files. 8 months ago
0.jpg dups 8 months ago
1.jpg dups 8 months ago
2.jpg init 2 8 months ago
3.jpg init 2 8 months ago
4.jpg init 2 8 months ago
5.jpg init 2 8 months ago
BadImageFormat.jpg Check for tooSmall <1KB and tooLarge >10MB files. 8 months ago
LICENSE init 2 8 months ago
README.md Check for tooSmall <1KB and tooLarge >10MB files. 8 months ago
Screenshot2025-04-26.png Screen update 8 months ago
Screenshot2025-04-26Non-DupFound.png updated photos 8 months ago
TooSmall.jpg Check for tooSmall <1KB and tooLarge >10MB files. 8 months ago
a.jpg speed boost round 1 8 months ago
b.jpg speed boost round 1 8 months ago
coordinates.py Camera Make/Model Detection. 8 months ago
dedup.py . 8 months ago
delibs.py ANSI colors: green, yellow, and red added. 8 months ago
do_dups.inc Made bash array for invalid/size inner file issues. 8 months ago
get_dups.bat . 8 months ago
get_dups.sh . 8 months ago
japan-river.jpg geo-data 8 months ago
london-bridge.jpg geo-data 8 months ago

README.md

dedup

Photo De-Duplication

Install Windows

cd dedup
python -m venv C:\Users\$env:USERNAME\Common\env_dedup

Check whether the venv contains a Scripts folder or a bin folder, i.e. C:\Users\$env:USERNAME\Common\env_dedup\Scripts or C:\Users\$env:USERNAME\Common\env_dedup\bin, then activate it:

& C:\Users\$env:USERNAME\Common\env_dedup\Scripts\Activate.ps1

If you get an error like "running scripts is disabled on this system", your PowerShell execution policy is too restrictive. You can temporarily allow scripts for the current PowerShell session by running:

Set-ExecutionPolicy -Scope Process -ExecutionPolicy Bypass

After that, repeat the Activate.ps1 command above.

If that does not work, try the bin folder instead:
& C:\Users\$env:USERNAME\Common\env_dedup\bin\Activate.ps1
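The Scripts-or-bin check can also be scripted. The following is a minimal sketch, not part of the repository: the helper name `find_activate_script` is hypothetical, and it assumes the venv lives under the user's home folder as in the commands above.

```python
from pathlib import Path
from typing import Optional


def find_activate_script(venv_dir: Path) -> Optional[Path]:
    """Return the first Activate.ps1 found under Scripts or bin, else None."""
    for sub in ("Scripts", "bin"):
        candidate = venv_dir / sub / "Activate.ps1"
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    # Assumption: same venv location as in the install commands above.
    venv = Path.home() / "Common" / "env_dedup"
    script = find_activate_script(venv)
    print(script if script else f"No Activate.ps1 found under {venv}")
```

Scripts is checked first because that is the folder the install steps try first.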

Here are the packages that need to be installed by pip:

pip install xxhash opencv-python pillow
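To confirm the install worked inside the activated environment, a quick check along these lines may help. It is only a sketch: `missing_packages` is a hypothetical helper, and the only facts it relies on are the import names of the three packages (`xxhash`, `cv2` for opencv-python, `PIL` for pillow).

```python
from importlib.util import find_spec

# pip package name -> module name used in "import" statements
REQUIRED = {"xxhash": "xxhash", "opencv-python": "cv2", "pillow": "PIL"}


def missing_packages(required=REQUIRED):
    """Return the pip names whose import module cannot be found."""
    return [pip_name for pip_name, module in required.items()
            if find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing, install with: pip install " + " ".join(missing))
    else:
        print("All required packages are importable.")
```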

Usage Windows:

Again, check whether the venv uses the Scripts folder or the bin folder:

& C:\Users\$env:USERNAME\Common\env_dedup\Scripts\Activate.ps1
python dedup.py 0.jpg 1.jpg

Windows Automated Directory use (not tested yet):

get_dups.bat %USERPROFILE%\Pictures

Install Linux

cd dedup
python3 -m venv myenv
source myenv/bin/activate
pip install xxhash opencv-python pillow

Usage Linux:

cd dedup
source myenv/bin/activate
python dedup.py 0.jpg 1.jpg

Get more details on scores:

Adding the scores argument prints more detail: matrix deviation score, decomposed similarity, combined similarity, and general score.

python dedup.py 0.jpg 1.jpg scores

Linux Automated Directory use:

./get_dups.sh .
  OR: 
./get_dups.sh $HOME/Pictures
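For readers who want a feel for what a directory scan involves, here is a rough Python sketch. It is not how get_dups.sh works (the script is not shown here): it only groups byte-identical files, uses `hashlib.blake2b` from the standard library as a stand-in for xxhash, and the extension set and function name are assumptions.

```python
import hashlib
from collections import defaultdict
from pathlib import Path

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}  # assumed set of extensions


def exact_duplicate_groups(root):
    """Group image files under root that have byte-identical content."""
    by_digest = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
            digest = hashlib.blake2b(path.read_bytes()).hexdigest()
            by_digest[digest].append(path)
    # Keep only digests shared by more than one file.
    return [group for group in by_digest.values() if len(group) > 1]


if __name__ == "__main__":
    import sys
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    for group in exact_duplicate_groups(target):
        print(" == ".join(str(p) for p in group))
```

A content hash only catches exact copies; close matches such as resized or re-encoded photos need an image comparison like the one dedup.py performs.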

Files made by get_dups Scripts:

Error levels: 0 = not a duplicate, 1 = duplicate, 2 = close match, 5 = same GPS geolocation, 8 = invalid image, 9 = file too small or too large. Possible output files: dups.txt, alike.txt, sameGPS.txt, invalid.txt, size.txt.
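If you call dedup.py from your own Python code, the error levels can be turned into readable labels. The sketch below assumes that dedup.py reports its error level as the process exit code, which the README does not state outright; `describe` and `compare` are hypothetical helper names.

```python
import subprocess
import sys

# Labels copied from the error-level list above.
ERROR_LEVELS = {
    0: "not a duplicate",
    1: "duplicate",
    2: "close match",
    5: "same GPS geolocation",
    8: "invalid image",
    9: "file too small or too large",
}


def describe(code):
    """Translate an error level into its label."""
    return ERROR_LEVELS.get(code, f"unknown error level {code}")


def compare(first, second):
    """Run dedup.py on two files and return (code, label).

    Assumption: dedup.py reports its error level as the process exit code.
    """
    result = subprocess.run([sys.executable, "dedup.py", first, second])
    return result.returncode, describe(result.returncode)
```

For example, `compare("0.jpg", "1.jpg")` run from the dedup folder would return the exit code together with its label.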

[Image of ScreenShot]