Invalid Questions #5

@zmtomorrow

Hi, thanks for developing this benchmark.

I've noticed that some forecasting questions are not meaningful without the context of the current time.

For example:

  • 'Is growth in J&J's adjusted EPS expected to accelerate in FY2023?'
  • 'What production rate changes is Boeing forecasting for FY2023?'

A meaningful question should state the reference point in time it is asked from, e.g.,

  • 'Is growth in J&J's adjusted EPS expected to accelerate in FY2023 from 2022?'

However, questions with such an explicit time reference are less common in practical scenarios.

In practice, people usually ask about the future relative to their current time. For example, once the 10-K for FY2023 is released, the question 'Is growth in J&J's adjusted EPS expected to accelerate in FY2023?' is no longer a forecast; it becomes meaningless and can lower the measured accuracy of the QA system.
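
To make the concern concrete, here is a minimal sketch of how such questions could be flagged. It is only an illustration, not part of the benchmark: the function names, the `FY<year>` regex, and the rule that a fiscal year ends on 31 December are all my own assumptions.

```python
import re
from datetime import date
from typing import Optional

# Hypothetical pattern: matches fiscal-year mentions such as "FY2023".
FY_PATTERN = re.compile(r"\bFY(\d{4})\b")


def target_fiscal_year(question: str) -> Optional[int]:
    """Return the first fiscal year mentioned in the question, if any."""
    match = FY_PATTERN.search(question)
    return int(match.group(1)) if match else None


def is_forward_looking(question: str, as_of: date) -> Optional[bool]:
    """Check whether the question is still a forecast at time `as_of`.

    Returns None when the question mentions no fiscal year.
    Assumption: the fiscal year ends on 31 December of that year,
    which is a simplification and not true for every company.
    """
    year = target_fiscal_year(question)
    if year is None:
        return None
    return as_of < date(year, 12, 31)


question = "Is growth in J&J's adjusted EPS expected to accelerate in FY2023?"
print(is_forward_looking(question, date(2023, 2, 1)))  # asked early in FY2023
print(is_forward_looking(question, date(2024, 3, 1)))  # asked after FY2023 ended
```

A check like this only works if every question carries an "as of" date, which is the piece of context I think is currently missing.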

I am looking forward to your thoughts on this potential issue.

Thanks
Mingtian
