Hello 👋
First of all, thank you for the great work and the evaluation results!
As I understand it, in many cases you predict the answer to each question by picking the choice that minimizes the loss under the model being evaluated. Was there any difference between the evaluation of VLMs and LLMs? If so, how did you put the results on the same scale?
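To make the question concrete, here is a minimal sketch of how I picture the loss-based selection. The function name `pick_choice`, the made-up log-probabilities, and the per-token averaging are all my own assumptions for illustration, not something taken from your code.

```python
def pick_choice(choice_token_logprobs):
    """Return (index of the lowest-loss choice, list of per-choice losses).

    choice_token_logprobs: one list per answer choice, holding the
    log-probabilities the model assigns to that choice's tokens.
    Assumption: loss is the mean negative log-likelihood per token;
    a summed (unnormalized) loss is an equally plausible reading.
    """
    losses = [-sum(lps) / len(lps) for lps in choice_token_logprobs]
    best = min(range(len(losses)), key=losses.__getitem__)
    return best, losses


# Made-up log-probabilities for three answer choices (illustrative only).
example = [
    [-2.0, -1.0],
    [-0.5, -0.25, -0.75],
    [-3.0],
]
best, losses = pick_choice(example)
```

If VLMs and LLMs were scored with different normalizations (or different prompts), the per-choice losses would not be directly comparable, which is what I am trying to understand.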
Many thanks,
Idan