Hey friends, looking for advice on how I can possibly get the best of both worlds out of this situation...
Some Background:
I've been running a Plex server for about 2 years now. 2018 Mac Mini, DAS: Pegasus R4 with 4x 12TB drives configured in RAID5. It's worked great, no complaints. I'm nearing the 36TB capacity of my array and have been researching the best course to upgrade for a couple months. Very interested in the idea of building a proper server and moving to Proxmox so I can start doing some other homelab stuff with the machine as well.
Yesterday, I got a new job which starts in February, so I have a little bit of time to figure things out.
For the new job, I will need to build a RAID with 70-100TB of usable space. I'll be offloading 1TB and 2TB NVMe's to 2 destinations. Destinations are the HDD RAID I'm building as well as to a 4TB NVMe external drive. Everything must be run through an xxhash64 checksum. One downside, is I'm required to use MacOs for the job. The offload has to be pretty fast, without breaking the bank.
The cheapest/easiest solution I've come up with is:
- Buy Mac Studio M1 Max ($1,579 refurbished from Apple)
- Buy OWC Thunderbay 8 enclosure ($899.99)
-Fill the Thunderbay with 20TB IronWolf Pro drives ($1800 for 6x 20TB)
Pros to this setup: Relatively cheap given the parameters. No building anything or relying on my (lack-of) skill in building a DAS/NAS. Good warranties on the computer and enclosure if something goes wrong and needs replaced fast.
Downsides to this setup: I really don't like the idea of using Softraid. I would much prefer a hardware raid. The Thunderbay is big and power hungry, and I'd prefer something rack mounted.
I considered the idea of building a NAS...But then to move files from the source drives to the NAS, would I just need a 10Gb switch to establish a local network on my work truck? (I've never done this, only used DAS for work) I also have no idea what kind of read/write speeds I could expect from a NAS spinning the IronWolf Pros.
What I'm looking for is advice on possibly building a DAS or NAS that I could use for the job and, when the job is over, reuse as much of it as possible in my new Plex server / Homelab build. Ideally, it would be compatible with both MacOS and Linux, have hardware RAID control, minimum 8 bays, be rack mounted and power efficient.
Based on the price of the above setup, I'll say my budget is $2,500 excluding the cost of HDDs.
Things I already own that may or may not be helpful: 2018 Mac Mini i7 w/ 16GB DDR4 RAM, Pegasus R4 with 4x 14TB IronWolf Pro Drives
Thanks for your time and I appreciate any feedback you can give!
It's a whole world, especially with fast transfer speeds. I recommend you seek paid advice
Need a lot more context such as types of files you are transferring. Is it video files, or word docs
checksums are typically calculated client side. The server will do its own thing re raid/parity.
Eg if you were doing video files I'd recommend Hedge for Mac. If you are windows I'd recommend Terracopy Pro but avoid on Mac
You'll want a separate network, 10G or more for fast transfer speeds, ofc speeds are limited by your raid. How many people need to be connected to it?
If just you can have a separate 10G network card on your server, and then an external Sonnet 10G Thunderbolt 3 adapter for your MacBook or however you are connected to the serve
If more then you need a network switch
I wouldn't use a Mac as a host with drivers I would build a separate Linux system for your server with the drives connected. You can then share the storage pools over the network via smb or nfs
Truenas is a good start if you have time to tinker
If you want something really easy I'm selling a Synology disk shelf and expansion, holds 12 disks each, super easy gui
Thanks for the response and all the information! Sorry for not being clear about several things. I'll try to clarify.
I will be offloading Arri Alexa 35 Capture Drives (1TB and/or 2TB) to a "Shuttle drive" as well as a "master backup". The capture drives (source) are NVMe. The "Shuttle drive" (destination 1) is NVMe, the "master backup" (destination 2) is the HDD RAID I would like to build.
It's all video files. Specifically, 4K ProRes 4444XQ, LogC4 files in .mxf's.
For checksums, I meant that I will be generating the xxhash64 checksums from the codex compact drives to the 4TB NVMe shuttle drives and my RAID.
For checksum and transfer software, I've used Silverstack and ShotPutPro in the past.
I've always avoided Hedge because of what an assistant editor told me years ago (and this part is WELL beyond my understanding, so please correct me if I'm way out of line) to paraphrase what I was told:
"Hedge does not create a legitimate checksum. Hedge begins the checksum process while files are still moving between the RAM and the destination drive. A legitimate checksum must be between the destination and the source. While unlikely, there is a possibility that information could be lost between the RAM and destination making the checksum Hedge performs illegitimate."
This is what I was told right when Hedge first came out and anytime I've seen Hedge asked about it since, they've ignored the question or skated around it. If this is incorrect, I would love to know. Hedge is incredibly affordable and faster than ShotPutPro. I just always assumed it was faster because they were cheating the checksum (based on the info above).
Back to your questions/variables.
I will be the only person that needs to access the "master backup"/RAID.
Based on what you said, I think my best bet is to build my Linux based server with a 10G network card and then network directly into my 2018 mac mini. At that point, the bottleneck is the 260-285MB/s speed of my 7200 RPM drives, right? Is there any reason why my 2018 i7 6-core with 16GB of DDR4 would slow the process down?
Originally, I thought I was going to be transcoding ArriRaw to HDE files, so I was going to get the Mac Studio for the 24-core GPU...but now that there's no transcode I'm hoping maybe my 2018 mac mini will be sufficient?
Those 6 7200RPM drives will be able to saturate your 1Gbps link and will even give your 10 gig link a pretty decent workout. Remember, when working with RAID, you’re dealing with multiple IO, so you can take throughput of one of those drives and multiply it by six, and that’s effectively what you’re going have available. I would personally recommend using a good set of SAS drives, over SATA, to maximize your bandwidth server side.
Thanks for commenting! I'm not sure if I can afford to go with 6x 20TB SAS drives over the 6x 20TB SATA drives, but will have to consider that...
Can you explain this for me or point me to a resource to learn what you mean? My knowledge of RAID is limited to how many drive failures it can handle and the boosts in read/write speeds. I will be using RAID 5, possibly RAID 6 for my situation, so no write speed gains either way.
Let me know if I'm missing something here. My single mac computer will be the only client able to access the server.
Where I need the most speed is writing from the NVMe source to the RAID. The only time I'd be reading the data on the RAID is when it's generating the checksum after the data transfer is complete, right?
I don't need to access any of the files kept on the RAID while I'm offloading the footage. So the write speed from the source to the RAID should be 260-285Mbps, I think?
He is wrong about Hedge, it's fantastic software. Copied petabytes through it without error
Hedge does source verification which is legit, it checksums in ram, but so does all xx64 that's why faster ram helps re your question about using a Mac mini
The chance of a non ecc ram error is incredibly low. And worthit for the speed advantage, you don't need to generate another destination checksum afterwards, it does it at the same time so won't reallllyyy be reading from the array.
If you want pure speed I would get a large nvme cache. 2x 4TB ssds in raid 0 means it can hold 2/3 large arri mags before it starts to transfer to the disks
sent from mobile