Skip to content

Releases: NVIDIA/ais-k8s

v2.0.1

10 Feb 22:19
Compare
Choose a tag to compare
  • Fixed a bug where certain annotations could cause an infinite loop in reconciliation

See v2.0.0 Release Notes

v2.0.0

07 Feb 23:05
Compare
Choose a tag to compare

AIS Operator v2.0

THIS UPDATE WILL CAUSE A RESTART OF AIS CLUSTERS DEPLOYED WITH OPERATOR <v2.0

See Compatibility doc

Added/Changed

  • Environment variables and annotations provided in spec will now sync to AIS pods
  • Fixed a bug where AIS global rebalance would not properly disable before target upgrades
  • Changed the default discovery URL in config to use the proxy headless service instead of always using proxy-0
  • Added logSidecarImage spec option to provide control over included sidecar.
    • By default we suggest aistorage/ais-logs:v1.0. This reads INFO logs from the AIS daemon's file output and redirects to stdout for k8s to read. If left empty in spec, no sidecar will be included.

Removed

  • Removed default config generation based on image tag, now managed by init container only
  • Removed deprecated spec options
    • EnablePromExporter
    • DisablePodAntiAffinity
    • TargetSpec.AllowSharedOrNoDisks

Full Changelog: v1.7.0...v2.0.0

v1.7.0

14 Jan 05:57
Compare
Choose a tag to compare

AIS Operator v1.7.0

  • Fixed bug with shutdown that could cause a cluster to be stuck in "Shutting Down" state. Operator no longer makes a separate API call to specifically shut down AIS cluster before scaling down.
  • Optimize rebalance condition to patch only when changed
  • Removed several unused environment variables from the statefulset spec. Refactored construction of the set of ENV vars to use.
  • Minor updates to tests, linting, proxy statefulset update status
  • Updated all minor dependencies including AIS

Deprecated

  • EnablePromExporter option. On all recent AIS releases this is always enabled and the associated environment variable has been removed.

Helm

  • Updated CA duration and renewal option in TLS charts
  • Added cloud cert secrets generation chart
  • Added config for internal test cluster and internal deployment
  • Added pod resource values option

Full Changelog: v1.6.1...v1.7.0

v1.6.1

09 Dec 21:17
Compare
Choose a tag to compare

See https://github.com/NVIDIA/ais-k8s/releases/tag/v1.6.0

AIS Operator v1.6.1

  • Added reconciliation of target and proxy container resources spec

Full Changelog: v1.6.0...v1.6.1

v1.6.0

06 Dec 05:39
Compare
Choose a tag to compare

IMPORTANT Please see compatibility docs for information on deploying clusters with this new version. It requires a new aisinit container >= v3.25 to generate configs for AIS pods.

AIS Operator v1.6.0

  • Added support for init container managed configs. See compatibility docs. This will improve compatibility between versions and help with upgrade paths.
  • Operator will now reconcile the entire pod spec for aisnode when image changes
  • Operator will now reconcile the entire init pod spec when init image changes
  • Added resource management options to AIS spec
  • Added MY_NODE env var to aisnode container
  • Added support for deployments with distributed tracing

Full Changelog: v1.5.0...v1.6.0

v1.5.0

22 Oct 18:31
Compare
Choose a tag to compare

AIS Operator v1.5.0

  • Updated to go 1.23 and latest dependencies
  • Added support for custom annotations passed from spec to aisnode containers via Annotations spec option
  • Added support for custom environment variables passed from spec to aisnode containers via Env spec option
  • Fixed a bug where rebalance would not properly disable and re-enable for upgrades if it had been modified manually
  • Removed the option for the operator manager to run external to the k8s cluster
  • Internal logic refactoring of AIS API and AuthN clients
  • Added Sync option to version config
  • Changed net.http.UseHttps option to solely control whether aisnode expects to use HTTPS rather than relying on presence of TLS secrets or cert manager issuer
  • Improved logging and requeue logic to make it easier to follow deployment progress and debug issues

Helm

Full Changelog: v1.4.1...v1.5.0

v1.4.1

20 Sep 15:32
Compare
Choose a tag to compare

AIS Operator v1.4.1

  • Fixed an issue where the operator would modify the rebalance config in the provided spec and not restore previous config after upgrades
  • Cleaned up logging and handling of DNS resolution on proxy startup

Major release v1.4.0: https://github.com/NVIDIA/ais-k8s/releases/tag/v1.4.0
Full Changelog: v1.4.0...v1.4.1

v1.4.0

10 Sep 16:05
Compare
Choose a tag to compare

AIS Operator v1.4.0

  • Improved state management to reconcile based on state rather than using blocking waits
  • Disabled rebalance at the AIS level before cluster modifications -- scaling, rolling upgrades, cluster re-creation
  • Added a watch on AIS spec configToUpdate for changes and keep those in sync with the cluster
  • Added ability to reconcile statefulset status
  • Updated default AIS config generation and improved compatibility through version changes
  • Added new AIS states for the following:
    • Scaling
    • HostCleanup
    • Finalized
  • Bug fixes
    • Fixed deep equal comparison with spec
    • Fixed cleanup jobs with proper status and termination
    • Improved wait behavior when waiting for AIS cluster readiness or decommissioning
  • QOL improvements -- Cleaned up logging, Added unit testing

API Changes

  • New options
    • cleanupMetadata -- Allows for cluster decommission while preserving cluster metadata for future deployments
    • tlsCertManagerIssuerName -- Specifies a cert-manager CSI issuer

Full Changelog: v1.3.0...v1.4.0

v1.3.0

01 Aug 20:54
Compare
Choose a tag to compare

AIS Operator v1.3.0

  • Added sidecar container for accessing stdout logs via k8s
  • Test improvements including unit tests for controller
  • Improved state management including new states for in-progress shutdown, in-progress decommission, and cleanup. See ClusterCondition list in aistore_types.go
  • Improved state logging and event recording
  • Remove unused "env-mount" volume mount
  • Added AuthN support

API changes

  • New cleanupMetadata option. Previous behavior matches cleanupMetadata=true. This option can now be disabled to allow preservation of cluster metadata (such as buckets) when decommissioning and transitioning to an entirely new cluster (new AIS custom resource).
  • New authNSecretName option to add secret signing key for JWT tokens in AIStore.

Full Changelog: v1.2.0...v1.3.0

v1.2.0

11 Jul 17:43
Compare
Choose a tag to compare

AIS Operator v1.2.0

Operator:

  • Breaking Change

    • Deployments with Operator versions >= 1.2.0 must specify an ais-init image >= 1.2.0
  • Changes

    • Added stateStorageClass field to AIS spec for dynamic state storage
    • Handle destroying statefulsets in unready state
    • Wait for cleanup job success before continuing decommission
    • Added internal shutdown status
    • Fixed duration type in AIS config
    • Added ais-init docker build (moved from aistore repo)
    • Move bash script logic into the init image
    • Use proper HTTP probes for liveness/readiness
  • Deprecated

Full Changelog: v1.1.1...v1.2.0