I use KODI as a music video player.
I manually fill XML .nfo files with the information for each music video. Then I have KODI rescan the music video directory and the new ones are added to the library.
It might sound like a lot of manual work but it only takes a few seconds per song. Considering you say you listen to the same songs a lot, it might not be a problem for you.
You have no experience with Linux so you need to learn something. It might as well be Docker.
Have you looked at Linuxserver.io ? They've got Docker Compose configurations for a bunch of media applications, among others.
Linuxserver also have descriptions and Docker configs on Dockerhub.
So maybe that can help you start.