Skip to content

Conversation

@mchmarny
Copy link
Member

@mchmarny mchmarny commented Nov 17, 2025

Summary

Automate UAT test on GCP with OIDC auth

Type of Change

  • πŸ› Bug fix
  • ✨ New feature
  • πŸ’₯ Breaking change
  • πŸ“š Documentation
  • πŸ”§ Refactoring
  • πŸ”¨ Build/CI

Component(s) Affected

  • Core Services
  • Documentation/CI
  • Fault Management
  • Health Monitors
  • Janitor
  • Other: ____________

Testing

  • Tests pass locally
  • Manual testing completed
  • No breaking changes (or documented)

Checklist

  • Self-review completed
  • Documentation updated (if needed)
  • Ready for review

Summary by CodeRabbit

  • New Features

    • GCP integration workflow to provision GKE GPU clusters and run end-to-end UATs.
  • Tests

    • New GCP UATs for GPU Operator and NVSentinel with updated GPU/SXID flows (one SXID path disabled); added GPU event verifications and reboot/quarantine handling.
    • Reduced cert-manager and Prometheus footprints to speed test startup; added cluster resource quota for GPU tests.
  • Documentation

    • Added GKE cluster deployment and usage guide for testing.
  • Chores

    • Terraform-based GCP project/cluster and GitHub federation setup; NVIDIA runtime enabled for GPU collectors; GCP-specific Helm value tweaks.

The || { ... } syntax was causing a syntax error with set -euo pipefail.
Changed to standard if-then-fi structure for better compatibility.

Signed-off-by: Mark Chmarny <[email protected]>
Signed-off-by: Mark Chmarny <[email protected]>
coderabbitai[bot]

This comment was marked as resolved.

coderabbitai[bot]

This comment was marked as resolved.

coderabbitai[bot]

This comment was marked as resolved.

coderabbitai[bot]

This comment was marked as resolved.

coderabbitai[bot]

This comment was marked as resolved.

coderabbitai[bot]

This comment was marked as resolved.

coderabbitai[bot]

This comment was marked as resolved.

coderabbitai[bot]

This comment was marked as resolved.

@mchmarny mchmarny enabled auto-merge (squash) November 17, 2025 18:59
coderabbitai[bot]

This comment was marked as resolved.

coderabbitai[bot]

This comment was marked as resolved.

@mchmarny mchmarny merged commit 1584015 into main Nov 17, 2025
34 checks passed
@mchmarny mchmarny deleted the feature/oidc-gcp branch November 17, 2025 21:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

chore Repo maintenance

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Feature]: end to end tests during UAT functional validation run on real GPU cluster in GCP

3 participants