XCP-ng
    aflons

    @aflons

    Best posts made by aflons

    • RE: CPU pegged at 100% in several Rocky Linux 8 VMs without workload in guest

      @laszlobortel we've seen far less of this issue since my last message, though I'm not sure what made it better or when. We still make sure to reboot monthly (during patching, as we normally do anyway) and after live migration, and that helps. We don't use load balancing, so once a VM stays put on one hypervisor, there is no issue. Live migration and time are what trigger the issue for us.

      What changed in our infra is the upgrade to XCP-ng 8.3 and the move to XOSTOR as shared storage. We've seen no issues with AlmaLinux 9 and CloudLinux 9 at all. They also perform better I/O-wise.

      posted in Compute
      aflons

    Latest posts made by aflons

    • RE: CPU pegged at 100% in several Rocky Linux 8 VMs without workload in guest

      @laszlobortel yes, I definitely think load balancing is the issue for you, since live migration is the biggest trigger.

      posted in Compute
      aflons
    • RE: CPU pegged at 100% in several Rocky Linux 8 VMs without workload in guest

      We experience the exact same issue with CloudLinux OS 8, seemingly at random after live migration. This has been ongoing for years. It seems to happen far less now with shared storage.

      My theory is that the kernel and/or PVE module somehow doesn't handle the freeze during live migration: the longer the freeze, the higher the risk of this happening.

      VMs start to crash a random amount of time after live migration, never immediately. It could be hours, or even days, which makes it hard to diagnose. No crash dump, nothing, just 100% CPU on all cores and a frozen console.

      One consistent thing we see, almost every time, is that top and other tools stop working: they are frozen in a state where no CPU load etc. is reported, but there is load on the server.

      We've been going back and forth with CloudLinux support, and they made some changes to the tuned profile regarding disk buffers/cache that made things a bit more stable, but the issue is not 100% gone.

      We don't see the same error in AlmaLinux 9 and CloudLinux OS 9.

      The busier the VM, the higher the chance of it happening. Uptime may be a factor, too.

      posted in Compute
      aflons
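
Since the post above describes hangs that appear hours or days after a live migration, with no crash dump, one way to narrow down the timing might be a small heartbeat logger running inside the guest. The following is only a sketch under assumptions, not something from the thread: `find_stalls` and `heartbeat` are hypothetical helper names, the log path is up to you, and whether the wall clock or the monotonic clock reflects a migration pause depends on the guest kernel, so the sketch records the larger of the two gaps.

```python
import os
import time


def find_stalls(timestamps, interval, tolerance=0.5):
    """Return (index, overrun_seconds) for every gap longer than interval + tolerance."""
    stalls = []
    for i in range(1, len(timestamps)):
        gap = timestamps[i] - timestamps[i - 1]
        if gap > interval + tolerance:
            stalls.append((i, gap - interval))
    return stalls


def heartbeat(log_path, interval=1.0, tolerance=0.5, beats=None):
    """Append one line per beat and mark beats that arrived late.

    beats=None runs until interrupted; a number limits the loop.
    """
    last_wall = time.time()
    last_mono = time.monotonic()
    count = 0
    with open(log_path, "a") as log:
        while beats is None or count < beats:
            time.sleep(interval)
            wall, mono = time.time(), time.monotonic()
            # Assumption: a guest pause shows up as an oversized gap in at
            # least one of the two clocks, so take the larger gap.
            late = max(wall - last_wall, mono - last_mono) - interval
            load1 = os.getloadavg()[0]  # 1-minute load average
            flag = " STALL" if late > tolerance else ""
            log.write(f"{wall:.0f} load1={load1:.2f} late={late:.2f}s{flag}\n")
            # Flush and sync so the last line survives a hard hang.
            log.flush()
            os.fsync(log.fileno())
            last_wall, last_mono = wall, mono
            count += 1
```

After a hang, the last line in the log gives a rough time for when the guest stopped making progress, and any `STALL` lines show pauses (such as a migration freeze) that preceded it. `find_stalls` applies the same gap check to a list of timestamps collected some other way.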