Share via

New node I create are missing AKSLinuxExtension

Greg Toews 0 Reputation points
2026-03-13T19:35:03.45+00:00

I can't scale up node pool. Scaling to 1 silently fails. VMSS is missing AKSLinuxExtension. All new nodes system created are missing this extension. VMSS never attaches to AKS

Azure Virtual Machine Scale Sets
Azure Virtual Machine Scale Sets

Azure compute resources that are used to create and manage groups of heterogeneous load-balanced virtual machines.


2 answers

Sort by: Most helpful
  1. SUNOJ KUMAR YELURU 18,166 Reputation points MVP Volunteer Moderator
    2026-03-15T15:05:16.0666667+00:00

    Hello @Greg Toews,

    One common issue is that the AKS cluster is using Virtual Machine Scale Sets (VMSS), which have specific requirements for scaling. In VMSS-based AKS clusters, you cannot resize the existing nodes directly; instead, you must create a new node pool with the desired configuration and then manage the existing nodes accordingly.

    Additionally, if the VMSS is missing the AKSLinuxExtension, it could indicate that the AKS cluster is not properly configured to manage the nodes. This extension is essential for the AKS infrastructure as it allows the AKS service to manage the lifecycle of the nodes effectively.

    In summary, to resolve these issues, you should check the configuration of your AKS cluster, ensure that the VMSS is set up correctly, and consider recreating the node pool with the correct specifications.


    If this answers your query, do click Accept Answer and Up-Vote for the same. And, if you have any further query do let us know.

    0 comments No comments

  2. Jilakara Hemalatha 11,520 Reputation points Microsoft External Staff Moderator
    2026-03-13T21:06:16.1433333+00:00

    Hello Greg,

    From what you’ve described, the behavior where new nodes are getting created but are missing the AKSLinuxExtension and not joining the cluster usually points to a failure during the node provisioning stage in AKS. This extension is a required part of the bootstrap process, and if it doesn’t install successfully, the node will not attach to the cluster, which is why scaling appears to fail.

    To understand what’s going wrong, I’d suggest starting with a quick check on the VMSS instances to see whether the extension is missing, failed, or stuck:

     az vmss list-instances --resource-group <rg> --vmss-name <vmss-name> --query "[].{id:instanceId,extState:resources[].provisioningState,extName:resources[].name}" -o table
    

    It would also help to validate connectivity from one of the new instances. If the node is unable to reach the required endpoints, the extension installation can fail due to network or DNS restrictions:

    az vmss run-command invoke -g <MC_resource-group> -n <vmss-name> --command-id RunShellScript --instance-id <instance-id> --scripts "nc -vz <cluster-fqdn> 443"
    

    In parallel, please check the Activity Logs for the VMSS resource around the time the scale operation was attempted. If there are any extension-related failures (for example, VMExtensionProvisioningError), they usually give a clear indication of the root cause.

    From similar cases, one common reason we see is a package manager lock (dpkg) during the extension installation. This can happen when background OS updates are running at the same time, which prevents the AKSLinuxExtension from completing within the expected time. This tends to occur more often on older node images. Upgrading the node image usually resolves this:

    az aks nodepool upgrade --resource-group <resource-group> --cluster-name <cluster-name> --name <nodepool-name> --node-image-only
    

    If your cluster version is also on the older side, it would be a good idea to upgrade that as well to a supported version.

    We’ve also recently seen a platform-side issue where the extension installation was unintentionally tied to the cluster’s maintenance window. In those cases, nodes created outside the maintenance window would come up without the extension.

    A fix has already been rolled out, but if your cluster was impacted earlier, it would be good to confirm whether the issue still persists.

    As a temporary workaround, If your cluster has a Maintenance Window configured and the above diagnostic steps do not reveal another cause, please try the following workaround to confirm whether this is the issue you are hitting:

    1. Temporarily remove the maintenance window
    2. Create a new node pool
    3. Verify whether the AKSLinuxExtension gets installed

    Additionally, could you please check the private message and provide necessary details

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.