It's A Digital Disease!

20 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
1
 
 
The original post: /r/datahoarder by /u/Due_Replacement2659 on 2025-03-30 14:52:48.

I have no idea whether this makes sense to post here, so sorry if I'm wrong.

I have a huge library of existing Spectral Power Density Graphs (signal graphs), and I have to convert them into their raw data for storage and using with modern tools.

Is there anyway to automate this process? Does anyone know any tools or has done something similar before?

An example of the graph (This is not we're actually working with, this is way more complex but just to give people an idea).

https://preview.redd.it/yo47siwmbure1.png?width=554&format=png&auto=webp&s=1b70e08c514bd849eedd5ce46c1c5091f973940d

2
1
Cataloging data (zerobytes.monster)
submitted 15 hours ago by [email protected] to c/[email protected]
 
 
The original post: /r/datahoarder by /u/lawanda123 on 2025-03-30 14:30:14.

How do you folks catalog your data and make it searchable and explorable? Im a data engineer currently planning to hoard datasets, llm models and basically a huge variety of random data in different formats- wikipedia dumps, stackoverflow, YouTube videos.

Is there an equivalent to something like Apace Atlas for this?

3
 
 
The original post: /r/datahoarder by /u/bingobango2911 on 2025-03-30 14:19:23.

Hiya,

I've sorted through my photos using Duplicates.dupeguru.

I want to rename them (year / month / date based on the embedded information in the file), but I don't want to move them. I was going to use PhotoMove but it looks as though using that it would move them all into individual folders.

Does anyone know of any free software that will let me bulk rename the individual photo files?

Thanks!

4
 
 
The original post: /r/datahoarder by /u/areyoua1or0 on 2025-03-30 13:44:42.

If I download multiple large games over time—each around >120GB at a slow speed of 5 Mbps, will this cause more wear and tear on my NVMe SSD compared to copying the same amount of data quickly from another drive? Specifically, does the prolonged, small-scale writing from slow downloads impact SSD longevity more than faster, sequential writes?

5
 
 
The original post: /r/datahoarder by /u/spudd01 on 2025-03-30 12:56:22.

Slightly off topic post and apologies if this isn't the right place.

My late grand father was a hoarder in the days before computers (must be where I got it from) and has left a massive collection of cassette tapes with recorded radio shows on. I am yet to go through all of them, but they are a mix of recordings of radio shows like classic FM, Gardner's Question Time and other radio shows / podcasts from radio 4. From the labels of the ones i had a quick look at, some of these date back to the early 90's.

Is there somewhere that I could donate these too that would be interested in digitising them and preserving them? It feels like a massive shame to throw them away

6
 
 
The original post: /r/datahoarder by /u/SuperCiao on 2025-03-30 10:06:07.

Hi everyone,

I recently bought the Japanese Blu-ray box sets of Dragon Ball Kai, but they don’t include Italian subtitles—only the original Japanese audio.

I’m looking for a way to get Italian subtitles for each episode, ideally in .srt format, so I can sync them with the Blu-ray episodes.

Does anyone know of a reliable website or source where I can download Italian subs for the Japanese version of Dragon Ball Kai?

Also, how can I insert into video files without any loss quality?

Any help would be greatly appreciated!

7
 
 
The original post: /r/datahoarder by /u/untranslated_za on 2025-03-30 09:27:36.

Sorry if this has been asked 100 times, I have read over every post I could find and still not sure.

I live in Cape Town South Africa, so options are limited. I have a basic i5 12000 , 16gb setup.

C Drive (Games and New Downloads) - 500gb nvme

2TB Baracuda - Movies, Music, Personal Photos

1TB Baracuda - Game ISOs, Anime, Personal Photo Backup

4TB WD Green - Series

I have run out of space and looking to get a new drive. Options are :

https://preview.redd.it/k7e6qkhxosre1.png?width=1128&format=png&auto=webp&s=04d6783bdcd56fdb62789b441b6a2fcaf1fcb5de

My question is which to buy. The 6tb Ironwolf seems like the best option, but maybe 10 years from now ill regret not taking the extra 2TB from the 8TB drive (essentially for free) or will I regret not getting CMR since as far as I understand all my existing drives are CMR and the performance of SMR is much worse (seemingly only in write though).

It will only be used to store Movies/Series/Anime/(Personal Photo backup2) which I already have, and will add to over time. I dont download, watch, delete, redownload next year. Is SMR less reliable or is that only in RAID setups, same for performance? Will I notice it in a home setup?

8
 
 
The original post: /r/datahoarder by /u/mysticalbuttwizard on 2025-03-30 04:13:53.

I came across a number of 8mm films but have no means to digitise/project them myself. I'd just like to see them scanned and online somewhere for archival purposes, they have no personal meaning to me. This isn't something I can justify spending a whole bunch of money on digitising but I hate the thought of just dumping them and they potentially get ruined, trashed, etc. never to be seen.

Anyone know of who, if anyone, in Australia would take/borrow them to scan so they can be put on Internet Archive?

Thanks.

9
 
 
The original post: /r/datahoarder by /u/Abdel403 on 2025-03-30 03:57:53.

I found a "USED" "T7 Shield" external SSD drive on amazon and would like to know if anyone has bought one or have any experience? The seller is "Warehouse Deals".

New is $550 while the used one is 360, quite a difference.

I'm just a regular consumer using it for personal data.

Thanks!

10
 
 
The original post: /r/datahoarder by /u/DidThisSoICouldPost on 2025-03-30 02:27:07.

apologies if this isn't the right place to post this, if not then please direct me somewhere.

i am not concerned with disk performance, nor may i use other filesystems. i need to store as much data as possible (including many uncompressed small files) in a way that is compatible with MS-DOS, linux, and android.

how may i know what the minimum possible allocation unit size is for large-fat32 volumes of different sizes? is there some table of limits?

11
 
 
The original post: /r/datahoarder by /u/cleuseau on 2025-03-29 23:18:52.

I put my computer in the back room and it goes from -10c to about +5. Never had problems until I moved my unix server out back. I know for solid state it's probably better to be cold - but these SMR/CMR disks whatever they are - could it just be the cold killing the drives?

Long story: I had my computer in the house. moved about 4tb of data to the disks, Moved the computer to the back room for a long time and both drives had click of death after 4 month of no power. So I didn't let them idle with the click of death.

Flipped them over, a trick I learned as a kid in the 80s (long story) and copied my data off but now I wonder what the root cause is.

12
 
 
The original post: /r/datahoarder by /u/Wonton1111 on 2025-03-29 22:36:06.

Does anyone know how to download videos from Freeform?

For example, my daughter likes Switched at Birth:

https://www.freeform.com/episode/1af334f4-a1b8-4bfe-abe4-bd1aa5f03d99

I've tried quite a few downloaders, but none seem to work.

13
 
 
The original post: /r/datahoarder by /u/kaiser1025 on 2025-03-29 22:24:51.

I 100% know for a fact I uploaded / saved / backed them up. Infact, most things are uploaded twice. The cloud services I've used / still use, in order of most to least:

  1. Google Drive

  2. pCloud

  3. OneDrive

  4. Samsung Notes (I own a Samsung laptop and phone, but the PDFs I'm looking for would also show up in the above platforms)

*) I also have a total of 10TB of local storage, with a strong liklihood of also being on local storage. During the times when I've needed storage, PDFs are at the very bottom of the priority list of items to delete. Even duplicate PDFs don't get deleted. I've completed indexing of all 10TB inside of Windows 11, but there's far too many documents to search though. Adobe Reader freezes then crashes when attempting to search.

I've manually looked. I've searched "checking account statements from ". I have my paystubs from that time period and used them to determine the routing number(s) I had direct deposit. This was a period where I was churning for bank bonus signups, so there will be multiple banks.

I don't mind paying for whatever I need, whether it's software or an AI subscription. I already have Gemini Advanced and Copilot Pro. Perhaps there's a specific prompt that I could use to help achieve my goal? Time is limited; they're required in another week or so.

I've already contacted every financial institution from that time. The only financial institution that hadn't purged my records from 4 years ago (Is that even legal? I thought the retention period was at least 5 years?) was Wells Fargo.

Thank you for any help.

14
 
 
The original post: /r/datahoarder by /u/ExtendedPlay7 on 2025-03-29 22:19:19.

I'm very passionate about archiving and new to data hoarding, but this is something tailor made for how my brain works. With the concerning trends of data disappearing in the US I feel panicked like I need to start grabbing everything, but I don't know where to start. What is in danger? Where are people needed? Can I get hooked up with other people doing the same thing so that I can work efficiently and not just duplicate someone else's efforts?

I'd appreciate a little crash course on how to get started on this.

15
 
 
The original post: /r/datahoarder by /u/Master0fMuppets on 2025-03-29 21:44:18.
16
 
 
The original post: /r/datahoarder by /u/outlawaol on 2025-03-29 21:06:33.

Hello friends. Wondering about if I understand how an expander works in a jbod \ server setup and limitations. If I understand it correctly you can use an expander on any SAS (example I have a LSI 9207 8i) and could use an expander (let's say a backplane I've found that has 24 drive capacity LSI 2X36). From information I've gathered looks like if you use one port of the SAS it won't be as fast but if use 2 to the SAS card it'll be faster. I'm going to assume the speeds will be limited to the SAS capabilities?

On the same vein of connectivity. Can you take two separate expanders and run them to the same SAS? Or is it better practice to run separate SAS for each expander. Also I see some specifications for the cables being mini SFF and I guess regular SFF? Also seems the standard is SFF 8087? Is that the port on the cards or the cable standard?

Also it seems that expander cards only need power but are usually PCIE. So technically you can run an adapter from PSU to a PCIE power adapter and avoid the mobo altogether.

At current I'm looking to upgrade to a 4u 24 drive bay server rack and maybe at some point add either another 4u 24 bay or a smaller 12 bay jbod. But this would be way down the road as the 24 bay will keep my needs up plenty for awhile.

Thanks all for the clarity and information.

17
 
 
The original post: /r/datahoarder by /u/Playful-Bank8870 on 2025-03-29 20:10:53.

Hey all! I collect vintage magazines and want to digitize them before turning them into collage pieces. I'd love to upload them to Pinterest/Internet Archive so others can enjoy them too.

The catch is—I'm a teenager and don’t have the time to scan them myself. 😅

Would anyone be willing to help me scan them, or know of someone who offers affordable or even community-based scanning services? I’m totally open to mailing them (if you’re trusted or have a portfolio.)

Thanks so much in advance—I'd really love to preserve and share these before they become part of my art!

18
 
 
The original post: /r/datahoarder by /u/Imaginos9 on 2025-03-29 18:51:53.

Hi there. I'm trying to download all the replies in a single tweet on twitter/x but all gallery dl is doing is grabbing the main post's image and I want to grab all the images/videos in replies.

I don't have a config file and find those confusing. I'm just doing a command line in the command window.

gallery-dl -o "username=" -o "password=" "URL"

So what do I need to add to get all the replies to a single tweet? HELP!

Thank you

19
 
 
The original post: /r/datahoarder by /u/Stormy1956 on 2025-03-29 18:30:56.

I’m an organized digital hoarder and also have OCD. What has helped you overcome your digital hoarding?

20
 
 
The original post: /r/datahoarder by /u/chineke14 on 2025-03-29 18:14:32.

Hi I have a decent volume of media files and also a decent volume of files and other data. I do "software raid"/sync across a pair of 24 TB Hdds and a pair of 14 TB Hdds on my main desktop which also acts as my Plex server for the time being.

Backup wise, I am limited in means so I have 1 external 18TB Hdd which i want to act as the offline backup for the 24TB pair for the time being since I'm not close to 18TB data on the 24TB yet. And I do have a 14TB external drive to act as offline backup for the 14TB mirror.

QUESTION:

For this offline data, is it better to just use macrium to image the drives/folders and this way allows me to have multiple images of the same drive/folder as a sort of time machine, storing different instances of thse drives (I assume this is possible because macrium compresses) image files? If not is there an app that creates compressed backups of folder/drive images?

OR is it better to just have these offline drives be an exact mirror of the drives inside my desktop?

21
 
 
The original post: /r/datahoarder by /u/Blackwater_7 on 2025-03-29 18:10:08.

I have several 8TB external drives at home, was using Windows for years. Today I bought a MAC Mini and was trying to make the switch. Just for testing I connected all my drives onto MAC via powered USB Hub. Power should be enough bec this is how I was using it with Windows PC.

Anyway later on I had to connect external drives to PC again. Then I realised there is a huge "3TB free out of 8TB" label on the drive. The disk was almost full, I know it. In the root of the drive I see a folder called "Spotlight" , also some MAC related folders.

For the deleted files: Some are completely disappeared and some are showing as 0KB or 2MB, (normally they are much bigger)

I don't know what the hell happened but I can't see these files now, they are gone. I didn't even do anything. All I did was plugging it into mac and thats it. Now is there a way I can recover this data? Maybe the files are still there but its just my Windows showing the incorrect info (my windows also has issues)

should i just run recuva? or maybe i should check the files in mac now, maybe they will appear there.

22
 
 
The original post: /r/datahoarder by /u/Funnyman959 on 2025-03-29 16:19:27.

I have all the information of nearly hundreds of lost media YouTube videos with all the information archived but I wonder if there’s a chance if I can find them by using the description,like count, view count, name, thumbnail,date of creation, and links. It’s just that I don’t have the video I’m looking for itself. (I originally posted this on r/Archiveteam but they suggested me post it here for more answers.) and no they aren’t archived anywhere like on the web archive

23
 
 
The original post: /r/datahoarder by /u/ElectionOk60 on 2025-03-29 16:15:10.

So I've been banging my head with this for the last three days and I'm coming at a bit of an impasse. My goal is to start moving to linux, and have a data pool/raid with my personal/game files being able to be freely used between a Linux and Windows installation on a DualBoot system.

Things that I have ruled out for the following reasons/asumptions.

Motherboard RAID: RAID may not be able to be read by another motherboard if current board fails.

Snap RAID: This was the most promising, however, it all fell apart when i found there isn't a cross platform Merge/UnionFS solution to pool all the drives into one. You either have to use MergeFS/UnionFS on linux, or DrivePool on Windows.

ZFS: This also looked promising, However, it looks like the Windows version of Open ZFS is not considered stable.

BTRFS: Again, also looked promising. However, the Windows BTRFS driver is also not considered stable.

Nas: I tried this route with my NAS server that I use for backups. iscsi was promising, However, i only have Gigabit So not very performant. It would also mean that I need a backup for my backup server.

These are my current viable routes

Have all data handled by Linux, Then accessing that data via WSL. But It seems a little heavy and convoluted to constantly run a VM in the Background to act as a data handler.

It's also my understanding that Linux can read and wright to Windows Dynamic discs (Virtual volumes), Windows answer to LVM, formatted to NTFS. But my preferred solution would be RAID 10, Which I'm not sure if Linux would handle that sort of nested implementation.

A lot of data just sits, and is years old, So the ability to detect and correct latent corruption Is a must. All data is currently being held in a Windows Storage Spaces array, And backups of course.

If anyone can point me in the right direction, or let me know if any of my assumptions above are incorrect, It would be a massive help.

24
 
 
The original post: /r/datahoarder by /u/aimforsilence on 2025-03-29 15:52:26.

I'm in the market to buy a new NAS for mainly storage and PLEX use. I know I want a 6-bay model (using 6 x 10TB drives) but am not sure which brand/model to go with. I'm currently looking at the following;

QNAP TS-664-8G - I like that this model supports QuTS which allows me the ability to use the ZFS filesystem and have my drives in a Z2 array. I like the expansion options for memory, M.2 and PCI-E. I also like the inclusion of 2 10Gb/s USB ports and the 2.5G ethernet ports. I'm less a fan of the older Celeron chip powering this NAS

TerraMaster F6-424 Max - I like that this model has much more modern hardware including a 12th gen Intel Core i5 CPU. I like all the expansion options and also that it's future proof with having 10G ethernet. Honestly, this is the model I'd most likely buy for the hardware alone but I'm not familiar with TerraMaster's TOS software. I assume it's more or less similar to QNAP and Synology's OS's?

Synology DS1621+ - This NAS I tossed on here because I like Synology's DSM OS. Beyond that this model is very lacking in hardware compared to the other 2 options here.

Some things to note:

  • I live in Canada so I'm only able to get whatever I can find here for MSRP so my options aren't as wide as what someone living in the US would have

  • I don't use PLEX transcoding. I have DVD, Blu-ray, and 4K UHD Blu-ray's ripped in their native quality and play them direct from my NAS to my Apple TV 4K box. That's the PLEX setup. Very basic.

  • I have two 22TB external drives that I use to make backups of my data already, and critical data is also saved to cloud storage.

  • I'm currently using a Lenovo ThinkStation as my home NAS. It's running Windows 11 with the drives connected together using Windows Storage Spaces (yes, I know, not the greatest solution). I have tried Unraid and TrueNAS and honestly just got to frustrated with them both. I don't want to spend the amount of time needed to learn those OSs and instead just want something that essentially is easy and just works...thus why I'm looking at a new off the shelf NAS... that and also to save space and potentially be quieter too.

  • I'm not a pro when it comes to drive filesystems by any means, but I do understand how RAID and ZFS work as far as how the drives are split up depending on what type you use. I think for me I'd likely either use RAID 6 or ZFS Z2 (if I had a NAS that supported it)... not sure if for TerraMaster's TOS I'd want to use "TRAID" or "TRAID+" and the same can be said for Synology's DSM's "SHR" and "SHR2" as I'm not familiar with them so any info about them would be great.

If I missed anything please let me know, I tried to give as much detail as I could. Thanks folks

25
 
 
The original post: /r/datahoarder by /u/Separate-Lobster-806 on 2025-03-29 15:24:01.

Is there literally any way to possible to recover the media from old, deleted tumblrs? Are there any archives online I could search? Any info is helpful.

I’m not looking for the whole posts, simply any images or videos posted to any given deleted tumblr.

view more: next ›