If you have used the xCAT IBM HPC Integration support to initially install HPC software on your cluster nodes, you can apply product updates to your cluster in several different ways. The method you choose will depend upon many factors:
Some of the ways you can apply software updates to your cluster using xCAT are:
In all cases, you will need to start by downloading the product updates to your xCAT management node and putting them into the same directories you placed the original product software:
/install/post/otherpkgs/<osver>/<arch>/<product>
where <product> is: </product>
gpfs
loadl
pe
essl
compilers
rsct
Note: For GPFS on Linux, only the base GPFS rpms can be placed in the above directories. If you have GPFS update rpms, copy them to the following location:
/install/post/otherpkgs/gpfs_updates
And for AIX, you will also need to add the updates to your NIM lpp_source. For each <product> that you have updates for:
inutoc /install/post/otherpkgs/aix/ppc64/<product>
nim -o update -a packages=all -a source=/install/post/otherpkgs/aix/ppc64/<product> <lpp_source_name>
As with all product updates, you will need to be aware of any new operating system dependencies, any changes required to the product configuration, or any changes to operational procedures for administering or using the product. The default sample xCAT HPC Integration files may not support all the product updates as soon as they are available. Be sure to review all package lists, bundle files, scripts, and other files that you are using to ensure they will work correctly for the product updates you are installing.
Note: For Linux statelite clusters, a problem exists when updating the LoadLeveler rpms(e.g. PTF6 to PTF7). Currently, a new license rpm is shipped and must be installed and accepted before the other LL rpms will install correctly. This will be fixed in future LL PTFs, so that customers will only need to accept the license when installing the base LL rpms. For LL updates, no LL license rpm will be updated and customers will not need to accept the license a second time. Currently, the xCAT HPC integration does not support LoadLeveler upgrades until a fix becomes available.
Use this method for updating your HPC software if you are confident that you are applying stable, minor updates that will not impact an active cluster. You can use updatenode to apply software to Linux stateless nodes that have the operating system fully loaded into memory, and for all stateful nodes that have the operating system installed on a writeable disk. This method will not work if the products are currently installed in read-only directories in statelite or AIX diskless clusters.
updatenode <noderange>
This will run all three options for updating software, running postscripts, and synchronizing files (in that order). If you only wish to do one or two of those options, or run the operations in a different order, specify the correct flags to the updatenode command. See the [updatenode man page](http://xcat.sourceforge.net/man1/updatenode.1.html) for more details.
Use this method for updating your HPC software if you are confident that you are applying stable updates that you wish to commit to your existing compute node image and you have a scheduled maintenance window for your cluster. You can use this approach for all diskless nodes, either stateless or statelite, Linux or AIX.
Update the existing diskless image and reboot your nodes.
For Linux:
For AIX, follow the detailed documentation provided in [Updating_AIX_Software_on_xCAT_Nodes].
When the nodes are up, verify that all the HPC software updates are correctly installed.
Use this method for updating your HPC software when there are extensive changes, you wish to test the changes on a small number of nodes first, you have a limited maintenance window or can do a rolling upgrade, or if you need to keep a copy of your existing image to revert back to if the updates fail. You can use this approach for all diskless nodes, either stateless or statelite, Linux or AIX.
For Linux:
/install/custom/netboot/<ostype>/<profile>.pkglist
/install/custom/netboot/<ostype>/<profile>.otherpkgs.pkglist
/install/custom/netboot/<ostype>/<profile>.exlist
/install/custom/netboot/<ostype>/<profile>.postinstall
/install/custom/netboot/<ostype>/<profile>.synclist
Review and edit these files, making changes as required for your new image.
chdef <noderange> profile=<new-profile>
For AIX, follow the detailed documentation provided in: [Updating_AIX_software_in_xCAT_nodes]
When the nodes are up, verify that all the HPC software updates are correctly installed.
For Linux statelite or stateless clusters, a problem exists when updating the LoadLeveler rpms(e.g. PTF6 to PTF7). Currently, a new license rpm is shipped and must be installed and accepted before the other LL rpms will install correctly. This will be fixed in future LL PTFs, so that customers will only need to accept the license when installing the base LL rpms. Once fixed, when LL is updated, no LL license rpm will be updated and customers will not need to accept the license a second time.
Until a fix is available, please follow this procedure for updating LoadLeveler:
mv /install/post/otherpkgs/<OSVER>/<ARCH>/loadl /install/post/otherpkgs/<OSVER>/<ARCH>/loadl_base
/install/post/otherpkgs/<OSVER>/<ARCH>/loadl
rm /install/post/otherpkgs/<OSVER>/<ARCH>/loadl/LoadL-full-license-*
Wiki: IBM_HPC_Stack_in_an_xCAT_Cluster
Wiki: Setting_up_LoadLeveler_in_a_Statelite_or_Stateless_Cluster
Wiki: Updating_AIX_Software_on_xCAT_Nodes