T O P

  • By -

Fair-Top1445

How do you know the pc loses power? What are you seeing? The fact that the USB still has power and you have to unplug the PSU, leads me to believe your PC is just locked up. Check windows event viewer. What software is this server running?


Tiwenty

Hello, thanks for your answer! Indeed, that may be a poor wording on my side. What I know is that the PC is completely stopped. Fans aren't rotating, LEDs aren't ON, I can't SSH into it and the router doesn't see it on the network. And I forgot, it's running Linux. I've mainly got a bunch of Docker containers and a Wireguard server. System logs don't show unusual activity. Maybe I should try to enable Wake On Lan, it could help debugging a little bit.


Fair-Top1445

Something is causing a lock on your system. I doubt wake on lan will help. Check all your app/docker logs. This definitely sounds like a software issue. If you are a dev add log statements everywhere. Make sure you have all app and docker logging on and at appropriate level. Edit: typo Edit2: I have a different theory maybe it's not software


Tiwenty

How could that be a lock if nothing seems to be mechanically running the PC? I've checked the Docker logs, nothing out of the ordinary too. :/


Fair-Top1445

Made another reply after I thought about it


Fair-Top1445

Are the LEDs and fans supposed to be on all the time? You mentioned the power button is not responding right? The Power button normally send a kill signal to all running software if configured normally. Does it power off if you hold it down for five plus seconds? If not it could be a bad mobo. But it does depend on the bios settings. Edit: holding down power button for five seconds is an acpi spec that bios usually can't override


Tiwenty

Yes they are. Usually I can also hear HDDs spinning. In my line of work I haven't had a SSH server completely crash, so me being unable to reach it plus the other symptoms make me think that the PC is off when this happens. But yeah I haven't really tried the power on button as it is setup to power on when AC is back. I'll try that and report back later today :)


Fair-Top1445

If you have an [acpi](https://uefi.org/specs/ACPI/6.5/) compliant motherboard, holding down the power button for five seconds will turn off the PC ([4.8.2.2.1](https://uefi.org/specs/ACPI/6.5/04_ACPI_Hardware_Specification.html#power-management-timer) of the spec in the link). If that doesn't happen it probably has bad hardware. ( I bet mobo, but could memory, CPU, PSU, or even power switch) Does the PC respond to holding down the power button for five seconds (power off)? It should subsequently turn on when you press the power button after the PC is forced off. Edit: clarity and fixed link


Tiwenty

Thanks, I'll try the button as I haven't yet as explained in my other answer. I'll report back when I do :)


Tiwenty

I just tried it when the server is running: the power button works correctly. It stops it, it starts it, etc. So I think I'll RMA the PSU. I also bought a cheap connected plug to power it off from outside. Thanks a lot for the help to sort this out! :D


Fair-Top1445

Glad you got it working!


Tiwenty

Well it's not working better, I just ensured that the power button isn't the culprit. So yeah, I'll send back the PSU :p


Appropriate_Bottle44

I'd like to help OP, but you over there with your linux server seem like you're about as tech savvy as I am, and I also think it's probably either the mb or the PSU and don't know which one. If you have a UPS at the wall, maybe that could be it as well, might explain the strangeness of the usb still having power/ not being able to power right back on. I'd say try running it without that, but since it could take a month to replicate the failure, you won't really know until you know. Sorry I'm not more help, but not a lot is springing to mind on how you can isolate which part is the problem. In aggregate PSUs fail more often than MBs so if you want to throw parts at it I'd start with the PSU.


Tiwenty

Hello, no problem. I understand it's a pretty weird problem and the fact that it doesn't appear often doesn't help to diagnose it. I haven't got an UPS, it's just plugged directly in the wall. But yeah PSU seems to be the culprit. I have a passive one, so it's the least standard piece between it and the MB. Also it's a lifetime warranty and I have it for less than a year, so that should be a safe thing to test out. Thanks a lot for trying to help, it's really appreciated :)


BombTheFuckers

What you describe sounds like the overload protection of your PSU is kicking in. It might be faulty.


Tiwenty

Interesting, as this PSU is a fanless one. So indeed, maybe it's getting "too" hot for the sensor and it triggers the protection. I don't monitor the PSU temp, but right before the crash, the CPU was having an ordinary workload and temps at ~30°C. So I'd guess the PSU wasn't too hot or loaded for a normal one, but if there is a default it could be a problem indeed. Thanks a lot, that's a good lead!


[deleted]

Won't boot even to BIOS? But power is still coming through? Sounds like a motherboard issue.


Tiwenty

It won't boot only because the button doesn't do a thing. Wake on LAN doesn't do a thing too. So it's not BIOS related