Training on labeled only can achieve much better results than what is reported

Interesting idea of using an adversarial method for leveraging unlabeled data. I am trying to see how much unlabeled data can actually help.

In the plot below, I am comparing GanBert (Orange) that trains on both labeled and unlabeled data, and a basic model that uses Bert+Classifier (blue) that trains on the 109 labeled data only of Trec Data.

The paper reports that the basic model should achieve around 40%, but I am getting 60% which is very close to GanBert's. Are you sure that the baseline discussed in the paper is a reasonable one?
 
<img width="1244" alt="image" src="https://user-images.githubusercontent.com/3382128/154621685-334a28f4-d2f1-4106-8909-acbdcc10e0eb.png">

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Training on labeled only can achieve much better results than what is reported #21

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Training on labeled only can achieve much better results than what is reported #21

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions