Skip to content

Releases: red-hat-data-services/ods-ci

3.0

17 Nov 10:32
85c42da

Choose a tag to compare

What's Changed

Enhancements

  • fix(upgrade ide): clean eventual existing Jupyter workbench before test by @jstourac in #2631
  • [workbenches] fix name of a runtime image used in tests by @jstourac in #2710
  • Update the way we get RHOAI dashboard URL with 3.0 release by @jstourac in #2731
  • test(workbenches): fix jupyterhub/workbenches related tests for RHOAI3.0 by @jstourac in #2730

Other Changes

  • Added deprecatedTests Tag to Model Serving Runtimes Testcases by @Raghul-M in #2639
  • fix: add a rework in the dsci creation logic to wait for the operator to create it by @CFSNM in #2657
  • fix: minor typo in the auth detection call by @CFSNM in #2659
  • fix: add rolebinding setup/ teardown for distributed workloads tests by @pawelpaszki in #2664
  • Update codeflare sdk version by @kryanbeane in #2656
  • Bump workbenches versions and default Python version by @jstourac in #2667
  • Add ARM installation support for AWS OCP deployments. by @sefroberg in #2624
  • update kueue channel to 1.1, as this version has ARM support by @CFSNM in #2675
  • fix wait for pods condition to make it more robust by @CFSNM in #2677
  • Remove all the references to kserve Serverless deployment mode, the references to ServiceMesh and Serverless operators and CRs, and all the tests that were using them (all of them deprecated) by @CFSNM in #2643
  • fix: remove kueue managed mode automations for rhoai 3.0 by @CFSNM in #2652
  • [RHOAIENG-34042,RHOAIENG-34453] Remove Model Mesh and CodeFlare related tests and automations by @asmigala in #2644
  • revert dependent operators removal by @CFSNM in #2684
  • [RHOAIENG-34318] Adding Leader Worker Set Operator to E2E install by @robotmaxtron in #2683
  • fix some lws parameters by @CFSNM in #2686
  • fix some lws parameters by @CFSNM in #2687
  • fix installation method for lws operator by @CFSNM in #2689
  • fix (master): pre/post upgrade tests distributed workloads (ray) by @pawelpaszki in #2671
  • Updated typos, subcription_name vs subscription_name by @robotmaxtron in #2695
  • Remove AppWrapper and Codeflare Operator references by @kryanbeane in #2690
  • rename datasciencepipelines stuff to aipipelines by @CFSNM in #2698
  • [RHOAIENG-34318] removing leader worker set operator from kserve depdendencies by @robotmaxtron in #2704
  • Improving the stability of the 3.0 tests by @CFSNM in #2696
  • bump Python version from 3.11 to 3.12 for RHOAI 2.25 by @jstourac in #2699
  • Use openldap multi-arch image designed for ocp by @apodhrad in #2705
  • adding clause to wait 90s when checking the kueue pods by @CFSNM in #2706
  • Remove Serverless and ServiceMesh as dependency operator by @aloganat in #2692
  • RHOAIENG-33408: add rayjob tests and bump codeflare-sdk version by @pawelpaszki in #2693
  • fix typo for dsp by @CFSNM in #2709
  • fix typo for dsp by @CFSNM in #2712
  • add return output to have a better explanation of test failures by @CFSNM in #2713
  • Remove Kueue e2e tests by @sutaakar in #2722
  • [3.0] conf: change codeflare-sdk version for 3.0 RHOAI by @pawelpaszki in #2719
  • adding checks for all the resources managed by ODH/RHOAI 3.0 by @CFSNM in #2720
  • fix: handle empty saved_state in DSC component teardown by @asanzgom in #2723
  • add workaround to search by datasciencepipelines in the manifests by @CFSNM in #2724
  • fix: update auth crd test to check cluster-level resources for allowedGroups by @asanzgom in #2727
  • Update RHOAI dashboard URL to work both in 2.x and 3.x releases by @jstourac in #2728
  • add workaround to search by datasciencepipelines in the manifests by @CFSNM in #2726
  • Refactor Network Policies Test to Support Multi-Platform (RHOAI/ODH) with Programmatic Approach by @asanzgom in #2729
  • Changes for BYOIDC by @asmigala in #2725
  • Byoidc login revert by @asmigala in #2733
  • increase sleep to fix flaky test about controller patching by @CFSNM in #2735
  • BYOIDC clusters support by @asmigala in #2734
  • conf: change codeflare-sdk for rhoai 3.0 to include test fixes by @pawelpaszki in #2736
  • Fix cluster login when not using external OIDC by @bdattoma in #2737
  • remove all the stuff related to devflags since they are being deprecated by @CFSNM in #2740
  • [RHOAIENG-36045] Update the Must Gather image to support 3.0 and multi-arch execution. by @sefroberg in #2738
  • Update Feature store version and image details for rhoai 3.0 by @Srihari1192 in #2741
  • DW Components test: Kueue is now removed by default by @MarianMacik in #2742
  • conf: set kueue to Unmanaged for 3.0 by @pawelpaszki in #2746
  • Update DSPA to v1 by @mprahl in #2747
  • Remove dependency on kubelogin for BYOIDC by @asmigala in #2744
  • conf: bump codeflare-sdk version to latest for rhoai 3.0 by @pawelpaszki in #2750
  • Deploy SR-IOV operator and update ClusterPolicy for RDMA by @pierDipi in #2668

New Contributors

Full Changelog: 2.25...3.0

2.25

24 Oct 11:50
9974e93

Choose a tag to compare

What's Changed

Enhancements

  • [2.25] [workbenches] fix name of a runtime image used in tests by @jstourac in #2711

Other Changes

  • task(RHOAIENG-34228): Update ray cuda image sha for multi-arch by @chipspeak in #2635
  • fix: update new monitoring payload for the dsci by @CFSNM in #2623
  • fix servicemesh negative condition by @CFSNM in #2607
  • conf: deprecate e2e kuberay tests by @pawelpaszki in #2636
  • fix: Correct ServiceMesh operator missing message by @MarianMacik in #2637
  • Added deprecatedTests Tag to Model Serving Runtimes Testcases [Backport PR] by @Raghul-M in #2640
  • fix: add rolebinding setup/ teardown for distributed workloads tests by @pawelpaszki in #2663
  • Update codeflare sdk version by @kryanbeane in #2655
  • [2.25] fix(upgrade ide): clean eventual existing Jupyter workbench before test by @jstourac in #2680
  • [2.25] Bump workbenches versions and default Python version by @jstourac in #2681
  • install kueue 1.1 as a dependent operator [release-2.25] by @CFSNM in #2682
  • Update Feast Release version for RHOAI 2.25 verification by @Srihari1192 in #2685
  • fix (2.25): pre/post upgrade tests distributed workloads (ray) by @pawelpaszki in #2670
  • [RHOAIENG-36112]Backport the ARM provisioning to the release-2.25 branch. by @sefroberg in #2694
  • [release-2.25] Add DSCI rework logic by @CFSNM in #2697
  • [2.25] bump Python version from 3.11 to 3.12 for RHOAI 2.25 by @jstourac in #2700
  • [2.25] Add deprecatedTest tag to model server tests by @threcc in #2708
  • [2.25] conf: change codeflare-sdk version for 2.25 RHOAI by @pawelpaszki in #2718

Full Changelog: 2.24...2.25

2.24

24 Oct 11:47
a446c9a

Choose a tag to compare

What's Changed

Enhancements

  • [2.24] fix JupyterLab popup button XPath selector by @jiridanek in #2601
  • chore(guidedTour): disable guided tour via CLI if possible by @jstourac in #2604
  • fix(ide): small fixes for the tests stability by @jstourac in #2606
  • fix(platform): add a teardown in case test fails by @jstourac in #2608
  • fix(workbench start): waits until workbench app is truly available by @jstourac in #2611
  • chore(elyra): update pipeline name to match recent restrictions by @jstourac in #2591
  • fix(ide: culler): update of the xpaths by @jstourac in #2612

Other Changes

  • enhancement: add support for kueue and observability stuff in ods-ci by @CFSNM in #2559
  • Make controller resources configuration test more stable by @asmigala in #2571
  • enhancement: support deletion of cma dependent operator by @CFSNM in #2573
  • conf: update kuberay version to 1.4.0 by @pawelpaszki in #2576
  • Use full machineset CRD name when provisioning GPUs by @bdattoma in #2574
  • increase ROCm driver version + extend sleep for cs check by @bdattoma in #2575
  • add logic to enable disable the monitoring stuff in the dsci by @CFSNM in #2577
  • Omit image check for Unmanaged Kueue by @MarianMacik in #2584
  • add logic to enable disable the monitoring stuff in the dsci when the dsci is created by the operator by @CFSNM in #2583
  • Update Feast Release version tag to incorporate 2.23 Release changes by @Srihari1192 in #2585
  • RHOAIENG-9632 - Add validity check for the pipeline run link by @ada333 in #2587
  • enhancement: simplifying the logic behing the new observability stack by @CFSNM in #2588
  • Update Feast Notebook test image and version by @Srihari1192 in #2590
  • Remove model registry tests from robot framework by @dbasunag in #2593
  • Fix Component Versions test by @asmigala in #2595
  • fix: Pass ENABLE_NEW_OBSERVABILITY_STACK to tests by @MarianMacik in #2597
  • enhancement: include variable to support the installation of Authorino by @CFSNM in #2594
  • enhancement: add new observability parameters to the DSCI payload when the new stack is enabled by @CFSNM in #2598
  • enhance dsci deletion failure detection by @CFSNM in #2599
  • Add Llamastackoperator integration tests by @asanzgom in #2600
  • Update Feature Store Release version tag for 2.24 Release by @Srihari1192 in #2603
  • Add NFS provisioning task by @bdattoma in #2602
  • fix(ide): bump the expected versions of JupyterLab and git plugin by @jstourac in #2605
  • RHOAIENG-33248: Add GCP_DIS to ods-ci test environments by @apodhrad in #2609
  • fix uninstall in managed clusters by @CFSNM in #2610
  • fix: add needed keyword for uninstall variables by @CFSNM in #2613
  • fix: minor fix about unknown output variable by @CFSNM in #2615
  • add the upgrade option to our tests by @kobihk in #2518
  • test: ray python 3.12 by @pawelpaszki in #2614
  • adding minor timeout to avoid race condition in the unmanaged servicemesh mode by @CFSNM in #2616
  • conf: unify timeout with the e2e source code by @pawelpaszki in #2617
  • fix: add additional check for the install plan to be available in the subscription CR by @CFSNM in #2619
  • RHOAIENG-26686 - [QE] Enable the Monitoring tests to run in the conne… by @MarianMacik in #2547
  • fix: incorrect string in the dashboard by @pawelpaszki in #2625
  • fix ods-ci Upgrade stuff version comparison by @CFSNM in #2626
  • task(RHOAIENG-33492): Add CUDA 128 Ray image by @chipspeak in #2627
  • fix: add back csv name statement for odh and rhoai by @CFSNM in #2629
  • Automate ods-ci test for llamastack operator by @asanzgom in #2618
  • fix: add rolebinding setup/ teardown for distributed workloads tests by @pawelpaszki in #2660
  • conf: deprecate e2e kuberay tests for 2.24 (replaced by shift left) by @pawelpaszki in #2665
  • Add deprecatedTest tag for model server upgrade by @mwaykole in #2658
  • fix: pre/post upgrade tests distributed workloads (ray) by @pawelpaszki in #2669

New Contributors

Full Changelog: 2.23...2.24

2.23

25 Jul 14:33
fe6a1ff

Choose a tag to compare

What's Changed

Enhancements

  • rename notebook-stop-action to state-action-toggle by @jiridanek in #2545
  • RHOAIENG-26513: update test suite to use new rhoai-2.23 image naming convention with Python 3.11 by @jiridanek in #2557

Other Changes

  • update the instructions on how to create an ods-ci release by @kobihk in #2552
  • Kueue followup: including state transition tests by @CFSNM in #2540
  • Add 'deprecatedTest' tag to TrainingKubeflow smoke test by @ChughShilpa in #2555
  • Add deprecatedTest tag for Training operator upgrade tests by @sutaakar in #2558
  • Add NFD CR image for 4.19 by @bdattoma in #2544
  • Add Feast test for Feast Instance Image Reference is correct by @Srihari1192 in #2553
  • Update Ray image references by @kryanbeane in #2534
  • conf: bump codeflare-sdk version to match the release available for 2.23 RHOAI by @pawelpaszki in #2560
  • Move to OCM post + add support for team label and fips by @bdattoma in #2554
  • Add Image Digest check for Feast instance Image verification test by @Srihari1192 in #2561

Full Changelog: 2.22...2.23

2.22

16 Jul 12:40

Choose a tag to compare

What's Changed

Enhancements

  • Update xpath for RHOAI 2.22 regarding the workbench image version selection by @jstourac in #2511
  • Fix the user access test for jupyter hub by @jstourac in #2512
  • Fix the check of the workbench size GiB vs Gi by @jstourac in #2513
  • Update login page code for OpenShift 4.19+ by @jstourac in #2526
  • [ide] Fix the failing elyra tests for RHOAI 2.22 by @jstourac in #2531

Other Changes

  • Adapt the dsc components test suite to 3 versions of the product by @asanzgom in #2486
  • fix: change operator name to opendatahub-operator when ODH-nigthlies by @CFSNM in #2506
  • enhancement: revert auth hack as the bug is not present anymore in devel by @CFSNM in #2507
  • Fix Auth CR expected values to be in array by @MarianMacik in #2509
  • Specify correct saved state for Feast Operator by @MarianMacik in #2508
  • [RHOAI-13065 RHOAI-13070] Add OCM test for email and addon validation by @asmigala in #2488
  • Follow-up PR to improve the check image pull path keyword and robustness by @asanzgom in #2505
  • add support to install the Kueue operator if OCP >= 4.18 by @CFSNM in #2482
  • [RHOAIENG-12811] Verify allowlist for component customization by @asmigala in #2449
  • Add user defined tags or label for cloud resources when provisioning self-managed AWS, GCP and Azure clusters by @bdattoma in #2510
  • Added Tags for Runtime Testcases (Ksereve-caikit and Deprecated tests) by @Raghul-M in #2452
  • Update notebook image to 2.22 digest by @sutaakar in #2520
  • Add feast upgrade tests by @Srihari1192 in #2514
  • RHOAIENG-13268:Add verification of deadmansnitch configuration by @RomanFilip in #2519
  • NO-JIRA: fix obvious typos in comments, test names, and documentation by @jiridanek in #2522
  • Add OPENAI_KEY for running Feast tests with OpenAI dependencies by @Srihari1192 in #2525
  • Add Feast Notebook Milvus Test by @Srihari1192 in #2532
  • Enable tech preview features set for GCP in order to set userLabels on 4.15 and 4.16 by @bdattoma in #2533
  • Deprecate Dashboard API tests by @manosnoam in #2539
  • RHOAIENG-27699 - Add CMA operator dependency by @jstetina in #2524
  • enhancement: install only observability operators if the jenkins var is enbled by @CFSNM in #2542
  • remove kueue operator stuff from master to make not available for 2.22 by @CFSNM in #2546
  • Deprecate KFTO tests migrated to the shift left by @sutaakar in #2548
  • pre/post upgrade tests for MR set to deprecated by @lugi0 in #2550
  • Delete FMS tests in ODS CI by @sutaakar in #2549

New Contributors

Full Changelog: 2.21...2.22

2.21

17 Jun 11:17

Choose a tag to compare

What's Changed

Enhancements

  • Add RHOAI startingCSV after RHOAI SM installation by @bdattoma in #2439
  • Refactor TensorBoard frame selection in JupyterHub tests by @jiridanek in #2443
  • Update base DSP images to UBI9 for FIPS compatibility. Update KFP version to 2.13.0 by @jgarciao in #2464
  • Repace kw checking the status of the Ray DSP job by @jgarciao in #2469
  • Rename the Jupyter tile in Dashboard enabled applications by @jiridanek in #2472
  • Update runtime images used in DSP GPU testing for RHOAI 2.21 by @jgarciao in #2468
  • Rename the Workbench settings section in Dashboard by @jiridanek in #2473
  • Modify sample pipeline used for kfp test for compatibility with disconnected clusters by @jgarciao in #2499
  • Update the way we handle the Guided Tour modal window for OCP 4.19+ by @jstourac in #2504

Other Changes

  • Update Kuberay release assets by @kryanbeane in #2424
  • Disable Update notification email test by @asmigala in #2427
  • Fix DSC Component version verification by @asmigala in #2430
  • Adjust Training images to latest ones by @sutaakar in #2432
  • Workbenches fixes for RHOAI 2.21 by @jiridanek in #2428
  • post_install: Remove Tier1 and Monitoring tags from DeadManSnitch sce… by @MarianMacik in #2434
  • Adapt the Operator Subscription label to ODH by @asanzgom in #2422
  • fix: use opendatahub as default ns for workbenches in odh by @CFSNM in #2437
  • fix: add new test to check pagerduty secret by @CFSNM in #2438
  • Remove PyTorch 2.4.1 upgrade tests for Training component by @sutaakar in #2440
  • Changed admin and allowed groups to lists and check post upgrade matc… by @asanzgom in #2278
  • RHOSi.resource: Include rhaiseng domain as a selfmanaged cluster by @MarianMacik in #2444
  • Get original value of defaultDeploymentMode from status by @asmigala in #2445
  • [RHOAIENG-13262] Verify Data Science Pipelines Operator Alert by @asmigala in #2448
  • Added Deprecated tag For Triton UI Testcases by @Raghul-M in #2436
  • Rename CodeFlare env variables to reflect midstream changes by @sutaakar in #2453
  • Update odh-generic notebook image to latest one in distributed worklo… by @abhijeet-dhumal in #2454
  • rhods_operator_logs: Check return code for easier failure analysis by @MarianMacik in #2456
  • Update training ROCm images to latest one by @abhijeet-dhumal in #2459
  • Remove odhdashboardconfig checks from the Verify Auth CRD Test Case by @asanzgom in #2455
  • Update Codeflare SDK release tag for 2.21 release by @jiripetrlik in #2460
  • Update interop testing related files by @omrirh in #2462
  • Upgrade KFTO test names and env variables by @sutaakar in #2463
  • Upgrade Ray images to version 2.44.1 by @sutaakar in #2466
  • Add debugging kws to randomly failig trusted ca bundles unmanaged tc by @asanzgom in #2467
  • Add kueue VAP test for pytorchjob by @abhijeet-dhumal in #2458
  • Use latest Ray images in pipeline tests by @sutaakar in #2470
  • RHOAIENG-26136: Automatically clean dependent operators by @apodhrad in #2465
  • RHOAIENG-26222: ODS-CI installs other operators even if KSERVE_RAW_DEPLOYMENT = true by @apodhrad in #2474
  • add whole gateway name to differentiate between the other gateway CR by @CFSNM in #2475
  • Update Feast release tag and new tests for 2.21 release by @Srihari1192 in #2479
  • Remove nfd sub from nvidia namespace by @bdattoma in #2477
  • ODS.robot: Check Auth CR instead of OdhDashboardConfig for groups by @MarianMacik in #2480
  • create different upgrade configmap for user groups to avoid collisions by @asanzgom in #2483
  • RHOAIENG-13073 - ODS-558: Verify alerts have links to the triage guide by @MarianMacik in #2484
  • Exclude FeatureStore tests on ODH by @apodhrad in #2485
  • fix: Remove wait for NFD in nvidia ns by @bdattoma in #2487
  • Update notebook image for 2.21 by @ChughShilpa in #2489
  • RHOAIENG-23490 - Create test to verify images used by RHOAI are corre… by @MarianMacik in #2490
  • Adjust KFTO MNIST SDK test for env variable change by @sutaakar in #2491
  • Training operator upgrade - remove version limit by @sutaakar in #2493
  • Fix DistributedWorkloads UI tests in RHOAI 2.21 by @jiripetrlik in #2498
  • enhancement: add option to install new Observability dependent operators by @CFSNM in #2495
  • Deprecate MR tests by @dbasunag in #2497
  • fix: add hack to add the default groups in the auth cr by @CFSNM in #2501
  • conf: bump ray image sha to match latest version by @pawelpaszki in #2496
  • Deprecate 'Verify Dashboard Is Shipped And Enabled Within ODS' by @MarianMacik in #2478

New Contributors

Full Changelog: 2.20...2.21

2.20

06 May 08:57
3259a98

Choose a tag to compare

What's Changed

Enhancements

  • Update the connection creation based on the UI changes in RHOAI 2.19 by @jstourac in #2358
  • Fix pipeline ui test for changes introduced in RHOAI 2.19 by @jgarciao in #2377
  • fix the data id selector for the connection selection for RHOAI 2.20+ by @jstourac in #2392
  • RHOAIENG-17979: feat(tests/ide): assert that "authorize access" oauth page is not present after RHOAIENG-11155 got into the product by @jiridanek in #2190
  • Skip pipeline tests on ROSA HCP and deprecate pipeline UI test by @jgarciao in #2426

Other Changes

  • RHOAIENG-22958 - Remove DW downstream tests for Python 3.9 by @jiripetrlik in #2355
  • Install Authorino from stable channel for both RHOAI (>= 2.20) and ODH by @aloganat in #2352
  • Use latest CUDA Training image by @sutaakar in #2362
  • Add Auth CRD Test Case by @asanzgom in #2360
  • Add separate tags for Training Ray and Training kubeflow team by @ChughShilpa in #2361
  • fix: add some fixes to the kserve serving remove test to match new conditions by @CFSNM in #2356
  • RHOAIENG-13079 - ODS-514: Verify that no false alerts are fired while… by @MarianMacik in #2350
  • Add Kubeflow training image env variable in KFTO-SDK test by @abhijeet-dhumal in #2364
  • Add new images for ovms and caikit tgis runtime for 2.19 by @Raghul-M in #2368
  • Improve Component Should Be Enabled robustness by @asanzgom in #2367
  • Add debugging kws to randomly failing trusted ca bundles managed tc by @asanzgom in #2373
  • Added Vllm image for RHOAI 2.19 by @Raghul-M in #2375
  • fix: Verify Dashbord has no message with NO Component Found by @asmigala in #2384
  • enhancement: support custom namespace for workbenches by @CFSNM in #2321
  • [model server]Fix test for upgrade by @mwaykole in #2395
  • enhancement: add new feastoperator component as part of ODH and rhoai 2.20 by @CFSNM in #2274
  • RHOAIENG-23478:Fix 'Verify Auth Crd' test to pass on ODH by @RomanFilip in #2394
  • [RHOAIENG-12693] Validate DSC Component Names and Versions by @asmigala in #2386
  • Add Feature Store smoke tests by @Srihari1192 in #2388
  • fix MR UI tests, add RoleBinding due to RBAC bug fix by @lugi0 in #2405
  • Added tests for Feature Store component by @ntkathole in #2408
  • RHOAIENG-13263: add new monitoring test case to check DSP Operator rules by @CFSNM in #2409
  • Update Feast Smoke Test Image Validation reference condition check for ROSA HCP clusters by @Srihari1192 in #2410
  • Add AutomationBug to failing Momnitoring TCs by @asanzgom in #2411
  • fix: add tag to skip test if selfmanaged cluster by @CFSNM in #2412
  • Use v1.2.2-odh-2 for e2e kuberay tests by @jiripetrlik in #2415
  • Update training tests and images for different pytorch versions by @ChughShilpa in #2414
  • Use 4.16 catalog source for installing AMD Certified Operator on yet uncertified OCP versions by @bdattoma in #2401
  • Fix DistributedWorkloads UI failures in 2.20 by @jiripetrlik in #2417
  • fix: add automationbug tag to a monitoring case that takes 1h to finish by @CFSNM in #2420
  • Add missing Alerts Verifications by @asanzgom in #2421
  • Add Feature Store upgrade test by @Srihari1192 in #2416
  • RHOAIENG-12886 - ODS-2170: Verify Data Science Pipelines Application … by @MarianMacik in #2425

New Contributors

Full Changelog: 2.19...2.20

2.19

03 Apr 11:53

Choose a tag to compare

What's Changed

Enhancements

  • fix the jupyter workbench start for RHOAI 2.19+ by @jstourac in #2303
  • Modify DSP backend tests not to use the UI to assign user permissions to projects by @jgarciao in #2312
  • update the references with regards the new IDE image versions by @jstourac in #2324

Enhancements in scripts and CI/CD

  • bump poetry to 2.1.1 version including deprecation warn fix by @jstourac in #2302

Other Changes

  • RHOAIENG-15892 - Create automation to check if any RHOAI containers a… by @MarianMacik in #2286
  • Add missing PATH for poetry in Dockerfile - Interop testing by @omrirh in #2287
  • Update Alert names on Verify Alerts Severity Test Case by @asanzgom in #2294
  • Install Authorino Stable when using ODH by @lugi0 in #2295
  • fix: adapt disruptive tests to ODH/RHOAI 2.19 by @CFSNM in #2293
  • don't check for webhook pod when installing Authorino from stable channel by @lugi0 in #2300
  • Update DW UI tests to align with the 2.19 release Dashboard changes by @Srihari1192 in #2307
  • Adjust tests for Training operator GA by @sutaakar in #2306
  • Run KFTO upgrade tests for RHOAI 2.19 onwards by @sutaakar in #2309
  • Upgrade latest CUDA and ROCm Training and Ray images by @sutaakar in #2310
  • fix the 503 service unavailable by @kobihk in #2311
  • fix: delete auth when deleting dsci by @CFSNM in #2313
  • Create DSCI and DSC for Kserve raw deployment in ODS-CI by @apodhrad in #2280
  • Handle if KSERVE_RAW_DEPLOYMENT is unset by @apodhrad in #2315
  • Use latest fms-hf-tuning image - 2.7.1 by @sutaakar in #2314
  • Add deprecatedTest tag to model server tests by @rnetser in #2305
  • Use latest CUDA training image by @sutaakar in #2320
  • Add Monitoring Tag to Managed TCs by @asanzgom in #2325
  • Skip 2 monitoring-related tests on self-managed clusters by @MarianMacik in #2323
  • Update Codeflare SDK release tag for 2.19 release by @jiripetrlik in #2331
  • fix: add v3 sarif version because v2 is now deprecated by @CFSNM in #2339
  • fix: add minor fixes to make the scmp test work by @CFSNM in #2338
  • Add AutomationBug Tag to SMCP Already created TC by @asanzgom in #2343
  • model server: move to backend tests in pre/post upgrade by @rnetser in #2333
  • Skip Training operator smoke check for RHOAI < 2.19.0 by @sutaakar in #2351
  • Use latest Standard data science notebook for Distributed training by @sutaakar in #2353
  • Implement additional waiting for Kueue deployment by @sutaakar in #2354

Full Changelog: 2.18...2.19

2.18

11 Mar 17:11

Choose a tag to compare

What's Changed

Enhancements

  • Fix DSP dashboard test for xpath changes introduced in RHOAI 2.17 by @jgarciao in #2224
  • Fix Post Upgrade Test: Verify RHODS User Groups by @manosnoam in #2226
  • Add vllm image for 218 by @tarukumar in #2236
  • Fix DSP UI test in RHOAI 2.18 by @jgarciao in #2258
  • make login to OCP console and ODH/RHOAI dashboard slightly more stable by @jstourac in #2269
  • fix the xpath for the Application launcher button in OCP4.18+ by @jstourac in #2271
  • In DSP UI test, shorten project title if longer than 30 chars by @jgarciao in #2277

Other Changes

  • RHOAIENG-18519 - Add KFTO SDK test to downstream automation by @jiripetrlik in #2220
  • KubeRay: Sleep for 3 seconds after disabling Kueue by @sutaakar in #2227
  • Add new image for tgis and caikit tgis runtime for 2.18 by @tarukumar in #2229
  • Use OCP_API_URL parameter for Login To OCP function by @sutaakar in #2230
  • Add ProductBug Tag to Validate SMCP Already created and negative TC by @asanzgom in #2234
  • Update Ray and Training images with rocm 6.2.4 version by @ChughShilpa in #2237
  • Add new logic to support custom namespaces for both operator and applications by @CFSNM in #2216
  • RHOAIENG-14731 - Run ValidatingAdmissionPolicy test downstream by @jiripetrlik in #2192
  • Adjust DW_UI tests to accomodate UI changes in 2.18 by @ChughShilpa in #2240
  • Wait for Training operator Deployment to be available by @sutaakar in #2233
  • Update Ray cluster upgrade tests to align with the latest UI changes by @Srihari1192 in #2241
  • Update ods-ci branch name by @liswang89 in #2239
  • Update KFTO tags for manual execution by @ChughShilpa in #2243
  • Fix locators on Verify RHODS Accept Multiple Admin Groups And CRD Get… by @asanzgom in #2242
  • Add RHOAI Upgrade Testing on OpenShift CI by @liswang89 in #2213
  • Update Codeflare SDK release tag for 2.18 release by @jiripetrlik in #2248
  • Update notebook images for 2.18 release by @ChughShilpa in #2250
  • Increase KFTO SDK test timeout by @jiripetrlik in #2249
  • add teams tag for upgrade tests by @kobihk in #2252
  • Fix PreUpgrade testing return code to always return 0 by @liswang89 in #2251
  • RHOAIENG-20768: the Anaconda support was decided to be removed since RHOAI 2.18 by @jstourac in #2255
  • Upgrade fms-hf-tuning image to 2.6.0 by @ChughShilpa in #2259
  • Adjust fms-hf-tuning tests to use Granite model by @sutaakar in #2260
  • Adjust KFTO upgrade test to reflect upstream changes by @sutaakar in #2262
  • Remove fms tests from QG GPU run by @sutaakar in #2263
  • Update timeout for smoke cli TC, remove product bug tag for Model Registry by @lugi0 in #2265
  • Fix race condition in MR cli test case by @lugi0 in #2267
  • Remove Tier1 tag from multinode singleGPU tests by @sutaakar in #2268
  • Add Smoke tests to Upgrade by @liswang89 in #2261
  • enhancement: add new monitoring test to check RHOAI Metrics parameters by @CFSNM in #2266
  • Add NGC_API_KEY for executing NIM specific E2E Dashboard tests by @amalykhi in #2270
  • fix model registry teardown keywords failing if files/resources do not exist by @lugi0 in #2275
  • Fix Dockerfile_interop env var usage by @omrirh in #2257
  • Update Verify RHODS User Groups Test Case to new Auth CRD by @asanzgom in #2222
  • Fix RayCluster upgrade tests permission issue for OCP by @Srihari1192 in #2254
  • must-gather cleanup for manual tests by @govindjadhav3 in #2284
  • fix: remove the -k option in the cleanup command by @CFSNM in #2285
  • ModelRegistry: remove space in log level by @MarianMacik in #2281

New Contributors

Full Changelog: 2.17.0...2.18

2.17.0

10 Feb 14:35

Choose a tag to compare

What's Changed

New Tests

  • Added Python model REST protocol test on triton for Modelmesh by @Raghul-M in #2070
  • Added Python model gRPC protocol test on triton for Kserve by @Raghul-M in #2025
  • Added Python model REST protocol test on triton for Kserve by @Raghul-M in #2009
  • Added FIL model REST protocol test on triton for Kserve by @rpancham in #2018
  • Added FIL model GRPC protocol test on triton for Kserve by @rpancham in #2028
  • Added Tensorflow model REST protocol test on triton for Kserve by @Raghul-M in #1846

Enhancements

  • RHOAIENG-15919 Refactor to reflect 'create workbench' UI changes (#2059) by @FedeAlonso in #2077
  • [fix] This bumps the expected CUDA version for the workbench images by @jstourac in #2098
  • [fix] of the GPU tests for the IDE module by @jstourac in #2102
  • Fix Dashboard Smoke Tests for RHOAI 2.17 by @manosnoam in #2142
  • Fix Dashboard Suites 0408, 0409, 0410 Tests and Keywords by @manosnoam in #2152
  • Update AMD Operator and NFD install scripts by @bdattoma in #2139
  • [fix] the IDE Elyra tests based on the RHOAI 2.17 Dashboard changes by @jstourac in #2166
  • [fix] tests for the BYON feature by @jstourac in #2177
  • Default GPU node replicas to 1 avoiding 0 nodes in SNO clusters by @bdattoma in #2167
  • [fix] the Verify Notebook Has Not Restarted upgrade test by @jstourac in #2173
  • Fix dsp tests on interop 2.17 & fix variable conflict in Permissions provoking xpath issue by @jgarciao in #2219

Other Changes

  • Fix model mesh tests and update runtime images by @rnetser in #2060
  • Add changes to handle RHOAI deployment from stage for SelfManaged by @aloganat in #2080
  • Increase timeout for kuberay tests by @ChughShilpa in #2097
  • Update kfp dependency to 2.10.1. Rebuilds pipeline samples by @jgarciao in #2088
  • Increase codeflare-sdk tests timeout by @Srihari1192 in #2093
  • Remove Suite variables to avoid overriding values provided by tests in pre/post upgrade tests by @lugi0 in #2078
  • Moved fetch cluster type to RHOSi.resources and assign correct subscr… by @asanzgom in #2066
  • Add image references to the Distributed Workload image digests by @sutaakar in #2089
  • Add tag to disruptive tests so that they can be exclution from Operator suite runs by @mattmahoneyrh in #2079
  • update caikit runtime image and runtime validate param by @tarukumar in #2106
  • Smoke Test failure - Name fix for Runtime template by @Raghul-M in #2103
  • Update images used in nvidia and rocm pipeline testing for 2.16 (master) by @jgarciao in #2086
  • Chores on the upgrade test suites by @asanzgom in #2110
  • Fix Verify Model Can Be Deployed Via UI For Upgrade by @rnetser in #2112
  • Update chmod dir path in download model to pvc and increase timeout in non-admin use case by @rnetser in #2109
  • Added AutomationBug Tag to RHOAIENG-14306 by @asanzgom in #2116
  • add AutomationBug tag to RHOAIENG-14840 by @kobihk in #2121
  • DW: Add test coverage for custom ROCm Ray image by @ChughShilpa in #2124
  • Add AutomationBug Tag to Verify RHODS User Groups by @asanzgom in #2128
  • Remove ROCm tag from CPU tests by @ChughShilpa in #2127
  • Updated locators for Verify RHODS Accept Multiple Admin Groups And CR… by @asanzgom in #2130
  • [model server] Update post upgrade expected inference response by @rnetser in #2135
  • Replace AMD GPU community operator with RH certified one by @bdattoma in #2134
  • Add DW UI test to Verify CPU and Memory resource usage Exceeds warning threshold by @Srihari1192 in #2138
  • RHOAIENG-12192 - Extend DSP e2e tests by @jiripetrlik in #2129
  • Replace all Xpaths with pf-v5 to pf-v6 by @manosnoam in #2140
  • update the install type to Cli instead of CLi by @kobihk in #2144
  • Add keyword to find ROSA_HCP environment by @ChughShilpa in #2143
  • update path for KFTO and FMS tests by @ChughShilpa in #2156
  • Add KFTO_MNIST training operator tests by @ChughShilpa in #2159
  • Add keyword to find cluster type based on cluster infrastructure by @ChughShilpa in #2154
  • Skip FMS Training operator tests for version less than 2.12.0 by @sutaakar in #2158
  • Add Monitoring Tag to Test Cases by @asanzgom in #2163
  • enhancement: add operator integration tag to test that interacts with the component status by @CFSNM in #2162
  • Migrating Python model REST protocol test on triton for Kserve ( UI -> API ) by @Raghul-M in #2133
  • enhancement: add monitoring test to check targets are up and running by @CFSNM in #2165
  • Update DW UI tests to align with the latest Dashboard UI improvements by @Srihari1192 in #2160
  • Migration of Python model kserve grpc testcase UI -> API by @Raghul-M in #2155
  • Migration of Onnx model kserve Rest testcase UI -> API by @Raghul-M in #2176
  • Migration of Pytorch Rest Protocol test on triton for Kserve (UI -> API) by @rpancham in #2172
  • Migration of Onnx model kserve Grpc testcase UI -> API by @Raghul-M in #2178
  • enhancement: add monitoring test to check rhoai dashboard metrics by code by @CFSNM in #2175
  • Add new runtime images by @tarukumar in #2181
  • Migration of Keras Rest Protocol test on triton for Kserve (UI -> API) by @rpancham in #2182
  • Update Codeflare SDK release tag for 2.17 release by @jiripetrlik in #2184
  • Update KFTO multi-node test names according to recent updates in orig… by @abhijeet-dhumal in #2164
  • Migration of Dali model kserve Rest testcase UI -> API by @Raghul-M in #2193
  • Onboard ODS-CI testing on OpenShift CI by @liswang89 in #2189
  • Upgrade fms-hf-tuning image to 2.5.0 by @sutaakar in #2195
  • Add option in ocm.py script to just check if the cluster exists by @bdattoma in #2196
  • update KFTO tests to utilise storage bucket in case of disconnected e… by @abhijeet-dhumal in #2198
  • Migration of Tensort Rest Protocol test on triton for Kserve (UI -> API) by @rpancham in #2197
  • Update dry_run.yml upload artifact action from version v3 -> v4 by @Raghul-M in #2206
  • RHOAIENG-13260 - ODS-194: Verify RHOAI Dashboard metrics are defined by @MarianMacik in #2205
  • Add KFTO huggingface trainer tests by @abhijeet-dhumal in #2199
  • fix: solving disruptive tests issues due to operator reconciliation refactor changes by @CFSNM in ht...
Read more