this post was submitted on 10 Nov 2023
3 points (100.0% liked)
Data Hoarder
170 readers
1 users here now
We are digital librarians. Among us are represented the various reasons to keep data -- legal requirements, competitive requirements, uncertainty of permanence of cloud services, distaste for transmitting your data externally (e.g. government or corporate espionage), cultural and familial archivists, internet collapse preppers, and people who do it themselves so they're sure it's done right. Everyone has their reasons for curating the data they have decided to keep (either forever or For A Damn Long Time (tm) ). Along the way we have sought out like-minded individuals to exchange strategies, war stories, and cautionary tales of failures.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
There seem to be a few of these in Github, found this one recently it seems to do a reasonable job. The real problem is its bash script you need WSL2 to run it on windows.
What I like is the output is CSV files that are spreadsheet friendly and can be used to analyze and remove files in bulk.
https://github.com/Jim-JMCD/Duplicate-File-Finder
Czkaaka (CLI version) output file can be used to delete stuff in bulk but doesn't list directories separately you have to through files individually.
Eh?
WSL2 is one way to run a Linux kernel (and thus native Linux executable binaries) in Windows.
And while bash is definitely very common on Linux, it has never by any means been a strictly Linux program.
It can be used on all kinds of operating systems -- mostly unix-like operating systems, but also including Windows using a POSIX compatibility shim like Cygwin.
People were using bash in 1989, years before Linux became the beginning of a thing. And folks have been using it on Windows since at least 1995, or maybe even earlier -- decades before WSL.
WSL2 is still the easiest way to get bash imo.