• Register
    • Login
    • Search
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Search

    Neuron unstable when accessing /sys

    UniPi Neuron Series
    2
    23
    3921
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • T
      TomasKnot @knebb last edited by TomasKnot

      @knebb

      I'm running a larger testing script on 2 devices with 4 threads in parallel, so this may take a little while. Unfortunately it's difficult to know what goes wrong without the device kernel log. I mentioned the electrical side as we did have customers who had issues wth it, but usually it was more on the order of factories and such.

      If you can wait a little while longer I'll see where my testing takes me, I'll post here again in a few hours.

      K 1 Reply Last reply Reply Quote 0
      • K
        knebb @TomasKnot last edited by knebb

        @TomasKnot
        I found some kernel related issues... see kernel.log I attached.
        Unfortunately it appears I have permission issues uploading. You can download the file here.

        K 1 Reply Last reply Reply Quote 0
        • K
          knebb @knebb last edited by

          Another test:

          When I add a "sleep 1s" after every iteration (in while) the system rebooted this time after run 200 (instead of 20100 or so). So it seems to be related to some timing and not the number of accesses.

          T 1 Reply Last reply Reply Quote 0
          • T
            TomasKnot @knebb last edited by TomasKnot

            @knebb

            I'll have a look at it. It does look like there might be a timing/resource starvation issue somewhere, based on the kernel log as well (not a kernel panic, but scheduled thread fails to run in allotted time).

            I have gone over all resource allocations again, so at the very least we can rule out a memory leak.

            1 Reply Last reply Reply Quote 0
            • T
              TomasKnot last edited by

              It looks like the issue is with the invalidation thread stalling out if consecutive reads are done before it can be performed. I've switched it to use mutexes instead of spinlocks, which seems to solve the issue.

              I seem to recall I have already sent you a modified binary - would you be willing to accept one again? I would send it via a private message as before.

              K 1 Reply Last reply Reply Quote 0
              • K
                knebb @TomasKnot last edited by

                @TomasKnot

                Yes, you already send one. It is fine.

                Looking forward to have a stable system soon. Luckily it is not a hardware fault.

                Thanks for great support!

                1 Reply Last reply Reply Quote 0
                • K
                  knebb last edited by

                  I ran the script and up to now it is at 43400- so far nearly 50% more than before. No crash or reboot up to now.

                  I will start my monitoring system and see if it will stay stable.

                  THANKS a lot!

                  1 Reply Last reply Reply Quote 0
                  • T
                    TomasKnot last edited by

                    Apologies for the trouble, we did not encounter this particular issue before.

                    I hope your project goes well!

                    K 1 Reply Last reply Reply Quote 0
                    • K
                      knebb @TomasKnot last edited by

                      @TomasKnot

                      Thanks again! currently set to minutely and uptime is at 15hrs.

                      Looks it is really stable now.

                      thanks again for the great support!

                      T 1 Reply Last reply Reply Quote 0
                      • T
                        TomasKnot @knebb last edited by TomasKnot

                        @knebb
                        If you need faster response times on the SYSFS I can make that change specifically for you, but the limiting factor will be snmp anyhow. Currently SYSFS is set to refresh at a rate of 50Hz. Rates up to 1000Hz are possible in theory, at a cost of higher CPU use.

                        1 Reply Last reply Reply Quote 0
                        • K
                          knebb last edited by knebb

                          Ah, well. No I am absolutely fine with this.

                          My Cacti monitors the system every 5 minutes. So no need for faster- I am fine with a minute.

                          Thanks again!

                          Oh, and it is working stable. Now running for nearly 2days without a reboot.

                          1 Reply Last reply Reply Quote 0
                          • First post
                            Last post