Feature/ifbench #113

Jianshu1only · 2025-07-12T19:19:04Z

Checklist Before Starting

Search for similar PR(s).

What does this PR do?

Add a verification function for the IFBench dataset, plus the data preprocessing code.

High-Level Design

Add ifbench to reward_score.

Specific Changes

Add the ifbench verification function to reward_score, add ood_ifbench to util, and add a data preprocessing function.
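For readers unfamiliar with this kind of reward, here is a minimal sketch of what a rule-based verification function can look like. It is not the code in this PR: the `compute_score` name, the `ground_truth` layout, and the two checker ids are assumptions made up for illustration.

```python
# Hypothetical checkers keyed by made-up constraint ids (not IFBench's real ids).
def _max_words(response, limit):
    # Passes when the response has at most `limit` whitespace-separated words.
    return len(response.split()) <= limit


def _contains_keyword(response, keyword):
    # Passes when `keyword` appears in the response, ignoring case.
    return keyword.lower() in response.lower()


CHECKERS = {
    "max_words": _max_words,
    "contains_keyword": _contains_keyword,
}


def compute_score(solution_str, ground_truth):
    """Return 1.0 if the response satisfies every constraint, else 0.0.

    `ground_truth` is assumed to be a list of
    {"id": <checker id>, "kwargs": {...}} dicts.
    """
    for constraint in ground_truth:
        checker = CHECKERS[constraint["id"]]
        if not checker(solution_str, **constraint["kwargs"]):
            return 0.0
    return 1.0
```

An all-or-nothing score keeps the reward unambiguous; a per-constraint average is a possible alternative if partial credit is wanted.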

API

NA

Usage Example

NA

Test

python verl/utils/reward_score/ifbench/test_ifbench.py
sbatch verl/scripts/train/ifbench_test.sh

Additional Info.

NA

Checklist Before Submitting

[✓] Read the Contribute Guide.
[✓] Apply pre-commit checks.
[✓] Add [BREAKING] to the PR title if it breaks any API.
[✓] Update the documentation about your changes in the docs.
[✓] New CI unit test(s) are added to cover the code path.
[✓] Rely on existing unit tests on CI that cover the code path.

BlankCheng · 2025-07-17T22:08:42Z

verl/utils/reward_score/ifbench/split_fixed_data.py

+    """Split the fixed IFBench data into 90% train and 10% test."""
+
+    # Input file (fixed data)
+    input_file = "/mnt/sharefs/users/jianshu.she/ood__ifbench_95.1k_fixed.parquet"


There are absolute paths here. Is this file necessary for production?
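If the split script were kept, one way to avoid the hard-coded path would be to take the locations from the command line. The sketch below is an assumption, not the PR's code: the flag names, the `train.parquet`/`test.parquet` output names, and the seeded shuffle are illustrative choices.

```python
import argparse
import os
import random


def split_indices(n_rows, test_fraction=0.1, seed=0):
    """Return (train_idx, test_idx); together they cover range(n_rows) exactly once."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)  # seeded so the split is reproducible
    n_test = int(n_rows * test_fraction)
    return sorted(indices[n_test:]), sorted(indices[:n_test])


def parse_args(argv=None):
    parser = argparse.ArgumentParser(description="Split a parquet file into train/test.")
    parser.add_argument("--input-file", required=True)
    parser.add_argument("--output-dir", required=True)
    parser.add_argument("--test-fraction", type=float, default=0.1)
    parser.add_argument("--seed", type=int, default=0)
    return parser.parse_args(argv)


def main(argv=None):
    # Imported here so the helpers above stay usable without pandas installed.
    import pandas as pd

    args = parse_args(argv)
    df = pd.read_parquet(args.input_file)
    train_idx, test_idx = split_indices(len(df), args.test_fraction, args.seed)
    os.makedirs(args.output_dir, exist_ok=True)
    df.iloc[train_idx].to_parquet(os.path.join(args.output_dir, "train.parquet"))
    df.iloc[test_idx].to_parquet(os.path.join(args.output_dir, "test.parquet"))
```

Calling `main(["--input-file", "<input>.parquet", "--output-dir", "<dir>"])` would then write the two splits without any user-specific path in the repository.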

BlankCheng · 2025-07-17T22:09:05Z

verl/utils/reward_score/ifbench/split_fixed_data.py

+    """Split the fixed IFBench data into 90% train and 10% test."""
+
+    # Input file (fixed data)
+    input_file = "/mnt/sharefs/users/jianshu.she/ood__ifbench_95.1k_fixed.parquet"


BlankCheng · 2025-07-17T22:09:29Z

verl/utils/reward_score/ifbench/check_ifbench_data.py

+
+    # Fix the data
+    print("\n=== Fixing data ===")
+    fix_ifbench_data(original_file, "/mnt/sharefs/users/jianshu.she/ood__ifbench_95.1k_fixed.parquet")


BlankCheng · 2025-07-17T22:14:34Z

Hi @Jianshu1only, this looks awesome! One issue: some files, e.g., check_ifbench_data.py, are not used directly in RL training and don't need to be included. We can keep just the minimal set of files needed to run RL here. Please correct me if I've misunderstood. Also, let's remove the absolute paths.
Overall, the IFBench would be great to have. Thanks for the contribution!

Jianshu1only · 2025-07-22T20:12:01Z

Hi @BlankCheng, I’ve deleted all the unnecessary files in the IFBench folder. Please review the PR again when you have a chance. Thanks!

BlankCheng · 2025-07-25T04:22:15Z

Thanks @Jianshu1only! I deleted the bash files, since we only keep the template there. Let's merge.

Jianshu She added 2 commits July 12, 2025 17:40

ifbench

1aa3bcd

ifbench training shell

f180dfc

BlankCheng reviewed Jul 17, 2025

Delete unused ifbench scripts

c87e990

Delete scripts/train/ifbench_test.sh

1105633

BlankCheng self-requested a review July 25, 2025 04:22

BlankCheng approved these changes Jul 25, 2025

BlankCheng merged commit bf5c3dc into main Jul 25, 2025

BlankCheng deleted the feature/ifbench branch July 25, 2025 04:28

BlankCheng pushed a commit that referenced this pull request Aug 12, 2025

[data] Add IFBench dataset (#113)

01c4492
