Feature/ifbench #113
"""Split the fixed IFBench data into 90% train and 10% test."""

# Input file (fixed data)
input_file = "/mnt/sharefs/users/jianshu.she/ood__ifbench_95.1k_fixed.parquet"
There are absolute paths here. Is this file necessary for production?
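One way to address this comment is to take the paths from the command line rather than committing a user-specific location. The following is only a minimal sketch, not the script from this PR: the flag names, the `build_parser` and `split_indices` helpers, the default seed, and the `train.parquet` / `test.parquet` output names are all assumptions made for illustration. Only the 90/10 split ratio comes from the docstring above.

```python
import argparse
import random


def build_parser() -> argparse.ArgumentParser:
    """CLI so that no user-specific absolute path is committed to the repo."""
    parser = argparse.ArgumentParser(description="Split a parquet file into train/test.")
    # Hypothetical flag names; the PR does not define a CLI.
    parser.add_argument("--input-file", required=True, help="Path to the fixed parquet file")
    parser.add_argument("--output-dir", required=True, help="Directory for the split files")
    parser.add_argument("--test-fraction", type=float, default=0.1)
    parser.add_argument("--seed", type=int, default=42)
    return parser


def split_indices(n_rows: int, test_fraction: float, seed: int) -> tuple[list[int], list[int]]:
    """Shuffle row indices deterministically and cut them into (train, test)."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    n_test = int(n_rows * test_fraction)
    return indices[n_test:], indices[:n_test]


def main() -> None:
    import os

    import pandas as pd  # imported lazily; only needed for parquet I/O

    args = build_parser().parse_args()
    df = pd.read_parquet(args.input_file)
    train_idx, test_idx = split_indices(len(df), args.test_fraction, args.seed)
    os.makedirs(args.output_dir, exist_ok=True)
    # Assumed output file names, chosen for this sketch only.
    df.iloc[train_idx].to_parquet(os.path.join(args.output_dir, "train.parquet"))
    df.iloc[test_idx].to_parquet(os.path.join(args.output_dir, "test.parquet"))
```

With something like this, the script's entry point would call `main()`, and the cluster path would be passed at invocation time via `--input-file` and `--output-dir` instead of living in the source.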
"""Split the fixed IFBench data into 90% train and 10% test."""

# Input file (fixed data)
input_file = "/mnt/sharefs/users/jianshu.she/ood__ifbench_95.1k_fixed.parquet"
Same issue here.
# Fix the data
print("\n=== Fixing data ===")
fix_ifbench_data(original_file, "/mnt/sharefs/users/jianshu.she/ood__ifbench_95.1k_fixed.parquet")
Same issue here.
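For the hardcoded output path in the `fix_ifbench_data` call, one option is to derive the destination from the input file so that nothing user-specific is written into the code. This is a sketch under assumptions: `fixed_output_path` is a hypothetical helper that is not part of the PR, and the `_fixed` suffix simply mirrors the file name seen in the snippet.

```python
from pathlib import Path


def fixed_output_path(original_file: str, suffix: str = "_fixed") -> Path:
    """Place the fixed file next to the original, keeping its extension.

    Hypothetical helper: only the last extension (e.g. ".parquet") is treated
    as the file suffix, so dots earlier in the name are preserved.
    """
    original = Path(original_file)
    return original.with_name(f"{original.stem}{suffix}{original.suffix}")
```

The call in the snippet could then read `fix_ifbench_data(original_file, str(fixed_output_path(original_file)))`, assuming `fix_ifbench_data` accepts an input path and an output path as it appears to in the code under review.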
Hi @Jianshu1only, this looks awesome! One issue: some files are not used directly in RL training and don't need to be included, e.g.,
Hi @BlankCheng, I've deleted all unnecessary files in the IFBench folder. Please review the PR again when you have a chance. Thanks!
Thanks @Jianshu1only! I deleted the bash files, as we only keep the template there. Let's merge.
Checklist Before Starting
What does this PR do?
Add an IFBench dataset verification function and data preprocessing code.
High-Level Design
Add ifbench in reward_score
Specific Changes
Add the ifbench verification function to reward_score, add ood_ifbench to util, and add a data preprocessing function.
API
NA
Usage Example
NA
Test
python verl/utils/reward_score/ifbench/test_ifbench.py
sbatch verl/scripts/train/ifbench_test.sh
Additional Info.
NA
Checklist Before Submitting
Add [BREAKING] to the PR title if it breaks any API.