this post was submitted on 21 Dec 2023
55 points (91.0% liked)

Linux

46775 readers
1690 users here now

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

founded 5 years ago
MODERATORS
 

I'm not sure if this is the best community to post in, but I just bought a used computer and slotted in an RX480 as the GPU. I installed KDE Neon 5.27 on it, and it worked flawlessly for 2 days.

Then, even though it was working earlier today, it slept and then would not wake up. So I turned off the power and turned it back on again, and was greeted with this error screen:

The only prior error message I'd gotten from the system was when I tried to install wine for one application, it told me some packages weren't up to date, without a way to fix it. I can enter the BIOS just fine.

What is going on? How do I fix this?

all 38 comments
sorted by: hot top controversial new old
[–] [email protected] 19 points 8 months ago* (last edited 8 months ago) (2 children)

None of what's visible helps identifying the error. Try journalctl -xb as suggested it might show more relevant information

Edit: oops should've been joirnalctl instead of journal

[–] [email protected] 9 points 8 months ago (2 children)

I tried to do that, and it couldn't find the journal package. So I tried to install it, but apt, flatpak, nor snap could find the package to install.

[–] [email protected] 25 points 8 months ago (2 children)

This was probably supposed to say "journalctl -xb"

[–] [email protected] 5 points 8 months ago (1 children)

Okay, that command works for me. The last line says that /etc/hosts:7: hostname "SuperSpruce_Iron_3900X" is not valid, ignoring.

[–] [email protected] 13 points 8 months ago

Not sure if this is the root cause of your boot failure, but underscores in hostnames are not allowed. A- Z, 0-9 and - are the only allowed characters.

[–] [email protected] 1 points 8 months ago
[–] [email protected] 5 points 8 months ago

Read the error again. It's journalctl.

[–] [email protected] 12 points 8 months ago

Shit's broke yo.

Sleep/wake issues with AMD gpu and platform drivers are super, super, super common. Fish back through your kernel journal after a reboot (journalctl -kb -1 should do it) and look for the driver errors immediately after the wake event. If this has been fixed in a later kernel release then update your kernel, if not go report it to either the Ubuntu folks or on the amdgpu gitlab.

[–] [email protected] 5 points 8 months ago (1 children)

Can you remove the GPU and use onboard?

It is mentioning gpu in the errors, so it would be the first thing I would try, to see if the errors change, because I have no idea what's going on here

[–] [email protected] 7 points 8 months ago

The computer is running a Ryzen 9 3900X, which does not have onboard graphics unfortunately.

[–] NinePeedles 5 points 8 months ago

That monitor just keeps going

[–] [email protected] 3 points 8 months ago* (last edited 8 months ago)

Can you please post the output of journalctl -xb

[–] [email protected] 3 points 8 months ago

Before doing anything, if your screen allows it, swap DP to HDMI or HDMI to DP as output, that may fix this to the point of being able to actually boot and further fix the issue.

I've had this before with drivers where suddenly it would fail on either port but would still run on one of the others.

[–] [email protected] 2 points 8 months ago

Have you tried booting in with a live usb? You might be able to do some sort of recovery from there.

Having said, I’m still very much a Linux noob.

[–] [email protected] 2 points 8 months ago (1 children)

Try updating your DM / your entire system from the emergency mode.

[–] [email protected] 4 points 8 months ago (1 children)

How do I get into emergency mode?

[–] [email protected] 3 points 8 months ago (1 children)

The Shell where you typed "systemctl reboot" and "exit".

If you are running KDE neon, try "apt update" and "apt upgrade". If It doesn't work do "sudo apt update" and "sudo apt upgrade"

[–] [email protected] 4 points 8 months ago (2 children)

I ran apt update and some index files failed to download. It was just a warning though.

But systemctl reboot and exit still fail the same way.

[–] [email protected] 3 points 8 months ago (5 children)

Run "journalctl --lines 200" and send photos of output.

NOTE: This is all of the logging of the computer, and it's long (that command select the last 200 entries), so you might have to scroll down using the PageDown key (or arrow down) in order make the photos of everything

[–] [email protected] 3 points 8 months ago

Note: The computer has an SSD where the OS lives and two HDDs, sda and sdb, set up in RAID 1 because the computer is 3.5 years old.

[–] [email protected] 3 points 8 months ago (1 children)
[–] [email protected] 4 points 8 months ago* (last edited 8 months ago) (1 children)

The RAID1 seems to be failing according to that screenshot. That breaks the "Local File Systems" task and since quite a lot of things tend to depend on that, many things usually end up failing in an annoying cascade failure. It's also failing with a timeout instead of a strict error, which is odd.

Either way, I'd try commenting that line for /mnt/raid in /etc/fstab for now and seeing if that makes the system boot. It's possible that journalctl -u dev-md0.service or systemctl status dev-md0.service might tell you more, but it's 50/50 if it'll be anything useful.

[–] [email protected] 1 points 8 months ago (1 children)

How do I edit /etc/fstab if I'm not even able to boot the system? Or am I already booted in the system, just in a CLI environment?

[–] [email protected] 10 points 8 months ago (1 children)

You're most likely booted, otherwise you might need a live USB. Hopefully, the system isn't in read-only mode. What I'd recommend doing is:

cp /etc/fstab /etc/fstab.backup

To make a copy once. Then, nano /etc/fstab to run nano, a basic CLI editor. You can use the arrow keys to navigate and type freely in it. The hints like ^O shown on the bottom mean ctrl+o.

You'd use the arrow keys to go down to the line that probably says /dev/md0 /mnt/raid morecrap, put a # in front of it, press ctrl+w then enter to save. If that worked, ctrl+x to exit and try a reboot again.

Obviously can't promise this is "the" error preventing the system from booting, but it's generally a good idea to disable broken stuff like this to get the system working again, then fix it from there. Hopefully, this does the trick. Your RAID setup will not be activated on reboot after you do this but it's not going to permanently delete data or anything.

[–] [email protected] 2 points 8 months ago (1 children)

I used nano to edit /etc/fstab and commented out the last line and the system booted into GUI mode!

This leaves me with some questions:

  1. Why does fstab fail to mount the NTFS raid array?
  2. Why does the raid array failing to mount block the EDID signal? It's not like the OS lives on the raid array.
  3. How do I properly mount the raid array and how do I automate it every boot if I can't use fstab?
[–] [email protected] 1 points 8 months ago

Looks like you need to look for messages about /dev/md0 and why it may be timing out. Also maybe add nofail to the raid entry in fstab so you can still boot if the root fs is not on it and it fails ( is root on NTFS possible or good?)

I don't think the edid message is a problem, just an artifact of your monitor not talking to your video card?

Maybe NTFS is the problem, I think it needs special options to automatically remove the dirty bit and replay the journal

[–] [email protected] 3 points 8 months ago

Before the "systemctl" command: try removing the GPU and booting it up without the GPU If it works, you can skip the "systemctl" commands

[–] [email protected] 0 points 8 months ago

Read the messages on the screen. It's telling you how to check the logs for the error