Now that multiple erroneous spans have been found, it remains currently unclear how the task organization will handle distribution of correct(ed) data.
- Are participants themselves responsible for cleaning dirty data?
- Wouldn't it be easier to use this Github repo to allow for regular correction updates of the data?
- Can we expect the final train data to be checked for invalid annotations?
I wouldn't mind some clarification on these topics.