this post was submitted on 10 Nov 2023
1 points (100.0% liked)

Homelab

380 readers
9 users here now

Rules

founded 1 year ago
MODERATORS
 

Hello everyone .

Im up against a very strange situation on my setup :
(HP ML350 G6 , with dual Xeon X5670 @ 2.93GHz & 48GB of ram )

runing Proxmox on 2 Samsung Evo 870 512GB ssds in a zfs array for reduduncy .

One VM is runing TrueNas core with my HBA ( A LSI card) PCI-passed through that manages my files
and gives out SMB , FTP & NFS endpoints that i use to accses my files from other (physical or virtual) macines on my network &
one other VM that is runing debian , that has NextCloud on top , i connected the FTP from trueNas to the nextcloud as extrenal storage and its working like a treat for over 1 year . (Personal uptime record 5.5 months)
(plus some other vms , pbxs etc)

I have my music on the trunas share , and i listen to it via nextcloud , but during the last month i had 2 VERY weird crashes .

the first time , NC hanged , then my other VMs one by one started going offline , ending with my proxmox crashing with no error to either its VGA out or to the logs , in fact while this situation was unfolding i want able to use my VGA terminal , the system was just not listening to any of my inputs . A hard reboot later and a Through check of SMART & other reports on both truenas & proxmox i has again up for quite a while , doing heavy file transfers , listening to music watching movies out of the server (all IO heavy tasks)

Fast forward to tonight, i was about to finish my nightly music sesion ... and the whole thing happened again! NextCloud , gone , Freenas crashing , proxmox inaccessible... but , this time i had a clue . right before freenas "went" , it sent to my logger

Device /dev/da0p2 is causing slow I/O on pool boot-pool.

And the whole setup crashed again , a reboot later , im up and imidetly checked for Drive status .

On the proxmox side on both SSDs SMART was passed and Wearout was on 14% and 15% on the 2 ssds , on the freenas side , all my Spinning Rust had its SMART passing . and with

glabel status

i couldt find da0p2 , but da0 -propably- is the virtual drive that proxmox created for truenas to boot from ..

Please Help me out with this one , its almost cursed & i have no clues to run of from to troubleshot

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here