Everything works great, login (Tor, Firefox) with LAN and remote.
Running Bitcoin Node, NextCloud, Core Lightning
Every 1 to 2 days the server turns off or stays running but no one home (cannot connect with Kiosk or headless). If I do a hard shutdown and restart it is back and going like nothing happened.
Are you leaving a monitor connected full time? You may try disconnecting the monitor, and reboot the server with no monitor attached. Give that a try, and let us know if it makes a difference.
Other things to check here are: SSH into the server and capture sudo journalctl -xefa and see the latest thing that happens before it becomes unresponsive.
Is it actually completely shutting down?
Check that the output on your power supply matches the input on the plug of your machine.
To make sure it’s not overheating, is a good place to start. If it does it again, power-cycle the device, and pull that log George suggested. That will give you some clues as to what’s going on.
Well it did it again…its about 24 hrs and goes dead (server appears to stays on). No ability to log on. setting up kiosk does not work (no power to usb or hdmi).
I did a hard reboot with kiosk set up and during startup the graphics card stopped running and was unable to use kiosk. I tried this several times with same result. However I can log in remotely.
I obviously have a bad graphics card.
Any recommendations on a simple card.
Can you open a terminal to SSH into server via remote access? If so, how?
I really appreciate yalls help. Overall, this has been a lot of fun…especially since I knew nothing about servers or really operating systems before this.
First off, when this happens, check start.local or the server’s IP address in your browser to see if it’s in diagnostic mode. If you still get nothing, try SSH’ing in and run:
sudo journalctl -xefa
Make sure to do this before rebooting the server—you might get more info on what’s going on.
That sounds like a good possibility. Not sure about all the possibilities with the logs beyond just reading them. If you scroll back through your system logs, look to see if there is any evidence from the previous crashes
Update. Due the the instability in my system, I decided to take it apart, reformat all drives, cleaned everything, put it all back together and reload StartOS from scratch.
Everything seems to be good to go.
If I have a failure at this point, I am going to throw it all in a dumpster and buy a Start9 Server.
System still becomes unresponsive anywhere from 12 to 24 hours.
Since my previous statement of throwing it in the round file, I have become curious as to the cause. I will still most likely buy a Start9 system but in the mean time I am till troubleshooting which may help someone else in the future.
Pretty obvious to me that it is an issue with hardware or a Dell Inspirion BIOS problem (like timeout or sleep that does not like the StartOS processing)
cleaned and new thermal paste CPU temp stable at 40 deg C (still became unresponsive)
Replaced my Crucial 500GB M.2 NVMe with a WD 1TB NVMe for the StartOS and reflashed (still became unresponsive)…out $65
4 I completely disconnected everything but bare bones. (still became unresponsive)
a. led lights (yes it used to be gaming system)
b. wifi card
c. monitor, keyboard, mouse
c. all front usb, card readers, optical drives, audio inputs, etc.
Dell diagnostics telling me my Teams 4TB SDD (my data drive, not OS drive) is not completing self test. So something going on there. So I ran DiskGenius from USB boot and ran disk sector scan (4 hours) = all clear.
Finally, I did notice that my 32GB (16GB x 2) RAM is DDR4 @ 3400 MHZ but my motherboard can only handle DDR4 @ 2400 MHZ. I do remember that someone said even if the mhz is over the system specs on the RAM, the motherboard will only run it at its limit so it should not be an issue.
Bottom line is I guess I replace the RAM. Found 32GB that meet the UDIMM specs on Amazon for only $48.
After all that and if the RAM replacement does not work, I think the only things it could be are:
1. AMD CPU glitch
2. Dell BIOS glitch
3. Dell Power source unstable