WelcomeWelcome | FAQFAQ | DownloadsDownloads | WikiWiki

Author Topic: [Solved] anything in base that automatically reboots the system?  (Read 5348 times)

Offline curaga

  • Administrator
  • Hero Member
  • *****
  • Posts: 10957
Re: anything in base that automatically reboots the system?
« Reply #30 on: June 17, 2022, 10:55:33 PM »
HDD health can be checked with "smartctl -a /dev/sda" (from smartmontools.tcz), but that's not likely to cause a reboot.
« Last Edit: June 17, 2022, 10:57:16 PM by curaga »
The only barriers that can stop you are the ones you create yourself.

Offline CNK

  • Full Member
  • ***
  • Posts: 234
Re: anything in base that automatically reboots the system?
« Reply #31 on: June 17, 2022, 11:59:49 PM »
I have two wireless USB adapters connected to the laptop (wlan1 and wlan2).

If you used them with both laptops then it's also (slightly) possible that one is failing and that's exposing a bug in the wireless driver which causes a kernel crash.

Offline tacpilot

  • Newbie
  • *
  • Posts: 30
Re: anything in base that automatically reboots the system?
« Reply #32 on: June 18, 2022, 06:03:13 AM »
Quote
A kernel panic would reboot in one minute by default (configurable)
Since rebooting is not configured for any power related settings, and other hardware
have been removed from the equation, the only thing seems able to force a reboot
is the kernel panic setting.

Going on the assumption that a (failing drive / data corruption) is causing kernel panic.

Quote
If you used them with both laptops then it's also (slightly) possible that one is failing and that's exposing a bug in the wireless driver which causes a kernel crash.
Following previous suggestion to ensure are running solely from RAM, would
help expose such issues if reboots continue.

Could go as far as to set the reboot time for kernel panic to extreme high number
so as to visually verify that a panic is what's happening before it gets a chance to reboot.

« Last Edit: June 18, 2022, 06:16:15 AM by tacpilot »
Never limit your creativity by the imagination of others.

Offline GNUser

  • Hero Member
  • *****
  • Posts: 1343
Re: anything in base that automatically reboots the system?
« Reply #33 on: June 18, 2022, 07:01:10 AM »
smartmontools shows nothing concerning and total bytes written = 0.172373 TiB (well below the drive's expected useful life).

No reboots for over 12 hours, which is a record since this issue came to my attention :) The only difference between current setup and the previous iteration (which did have an unexpected reboot) is the TCL downgrade to version 12.

I'm chalking this up to a linux kernel regression between 5.10.3 and 5.15.10 that affects this specific use case (wireless router with multiple wireless interfaces). But with the other small changes between TCL versions it's hard to definitively exclude some other possibility.

Topic may be marked as solved. If I have bad news (another unexpected reboot) or good news (a week or more without unexpected reboots), I'll let you all know.

You are a sharp and generous bunch. Thank you very much for all your help.

P.S. CNK, in one of the several iterations of my setup I did change the wireless USB adapters to different ones with identical chipset (Ralink RT5370) and the issue recurred. So it was not a hardware problem with the adapters.
« Last Edit: June 18, 2022, 07:04:29 AM by GNUser »

Offline GNUser

  • Hero Member
  • *****
  • Posts: 1343
Re: anything in base that automatically reboots the system?
« Reply #34 on: June 20, 2022, 05:46:40 PM »
Downgrading to TCL12 x86_64 seems to have done the trick: No more unexpected reboots for 72 hours and counting! Maybe the new kernel version in TCL13 x86_64 had a regression that affected this particular use case.

Topic may be marked as solved :D

P.S. The laptop has been running without a battery at all. So my hypothesis that a missing or broken battery may have been responsible for the reboots seems to have been disproven.

Offline Rich

  • Administrator
  • Hero Member
  • *****
  • Posts: 11178
Re: [Solved] anything in base that automatically reboots the system?
« Reply #35 on: June 20, 2022, 06:18:42 PM »
Hi GNUser
... Maybe the new kernel version in TCL13 x86_64 had a regression that affected this particular use case. ...
Here are 2 other possible causes:
1. One or more extensions were updated.
2. One or more extensions were not updated but should have been.

Topic marked as solved.  :)

Offline GNUser

  • Hero Member
  • *****
  • Posts: 1343
Re: [Solved] anything in base that automatically reboots the system?
« Reply #36 on: June 21, 2022, 08:43:48 AM »
Hi, Rich. If I were to do any more testing now and cause additional router reboots at bad times, wife would kill me.

To exclude your hypothesis #2, when TCL14 comes out I'll submit new versions of the two workhorse extensions (hostapd.tcz and dnsmasq.tcz) and will test it out.

Offline Rich

  • Administrator
  • Hero Member
  • *****
  • Posts: 11178
Re: [Solved] anything in base that automatically reboots the system?
« Reply #37 on: June 21, 2022, 10:30:26 AM »
Hi GNUser
It was merely an observation, not a suggestion to do more testing.

Offline GNUser

  • Hero Member
  • *****
  • Posts: 1343
Re: [Solved] anything in base that automatically reboots the system?
« Reply #38 on: June 22, 2022, 10:28:54 AM »
Hi, Rich. I know--that was my bad conscience putting me on the defensive because I am lazy and didn't want to do what I knew needed to be done.

I went ahead and compiled latest hostapd and dnsmasq on TCL13 x86_64 and am testing everything on my router. Worst that can happen is that I have one more unexpected reboot, I have to downgrade to TCL12 on the router (again), and wife makes me sleep on the sofa for one night ;)

Juanito, I will submit refreshed hostapd and dnsmasq extensions for TCL13 x86_64 now.

Offline Rich

  • Administrator
  • Hero Member
  • *****
  • Posts: 11178
Re: [Solved] anything in base that automatically reboots the system?
« Reply #39 on: June 22, 2022, 12:43:23 PM »
Hi GNUser
... and didn't want to do what I knew needed to be done. ...
Once again, It didn't need to get done. You solved the issue in a way that met your needs, and
there's nothing wrong with that.

Allow me to play the role of PITA:
... Maybe the new kernel version in TCL13 x86_64 had a regression that affected this particular use case. ...
Here are 2 other possible causes:
1. One or more extensions were updated.
2. One or more extensions were not updated but should have been.
3. An existing bug was made more visible by the new kernel version. The issue will recur in the older
   kernel, but not for 53 days, You will not make the connection because you will have forgotten
   about this thread by then.  ;D

Offline GNUser

  • Hero Member
  • *****
  • Posts: 1343
Re: [Solved] anything in base that automatically reboots the system?
« Reply #40 on: June 24, 2022, 04:23:20 AM »
Good news: I've had no unexpected reboots while running the router with TCL13.1 Pure64 and the updated hostapd.tcz and dnsmasq.tcz extensions. It's been 48 hours and counting.

I'm glad I pressed on. Refreshing the two extensions seems like a more proper fix (and more likely to help the community) than just downgrading to TCL12.

Thank you all, especially Rich, for your help in coming up with the list of possible root causes. This was a tough one!

Offline Rich

  • Administrator
  • Hero Member
  • *****
  • Posts: 11178
Re: [Solved] anything in base that automatically reboots the system?
« Reply #41 on: June 24, 2022, 04:37:54 AM »
Hi GNUser
Congratulations on your success and avoiding nights on the couch. ::)