Use 1/0 labels for binary classification instead of 1/-1

The loss function this library uses for binary classification is a hinge-style loss that assumes labels of +1 or -1:

```
case 1 =>
  1 - Math.signum(pred * label)
```

However, predictions are passed through a sigmoid, so they fall in the range 0 to 1:

```
case 1 =>
  1.0 / (1.0 + Math.exp(-pred))
```

The 1/0 convention used for predictions should be preferred over the 1/-1 convention expected by the loss function, because [the negative label is represented by 0 in spark.mllib instead of −1, to be consistent with multiclass labeling](http://spark.apache.org/docs/latest/mllib-linear-methods.html#classification).
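To make the mismatch concrete, here is a minimal sketch that feeds sigmoid outputs into the sign-based loss quoted above. The object name `LabelMismatch` and the helper names are hypothetical, and it is an assumption that the loss receives the sigmoid output rather than the raw score; the issue does not say which.

```scala
object LabelMismatch {
  // Sign-based loss as quoted above; it expects labels in {+1, -1}.
  def signLoss(pred: Double, label: Double): Double =
    1 - Math.signum(pred * label)

  // Sigmoid as quoted above; its output always lies strictly between 0 and 1.
  def sigmoid(x: Double): Double =
    1.0 / (1.0 + Math.exp(-x))

  def main(args: Array[String]): Unit = {
    val confidentNegative = sigmoid(-5.0) // close to 0
    val confidentPositive = sigmoid(5.0)  // close to 1

    // With a 0 label the product pred * label is 0, so the loss is the same
    // whether the prediction is confidently right or confidently wrong.
    println(signLoss(confidentNegative, 0.0))
    println(signLoss(confidentPositive, 0.0))

    // With a -1 label every sigmoid output is positive, so the product is
    // always negative and the loss never rewards a correct negative prediction.
    println(signLoss(confidentNegative, -1.0))
  }
}
```

Under that assumption, neither labeling scheme lets this loss distinguish good negative predictions from bad ones, which is why the loss and the prediction need to agree on one convention.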

The loss function should be changed to follow [the way Spark does it](https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/optimization/Gradient.scala#L312), with labels of 1 and 0.
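As a rough illustration of what a loss over 1/0 labels could look like, the sketch below implements a standard logistic (log) loss on the raw score. It is not a copy of Spark's code or of this library's API; `LogisticLoss01`, `log1pExp`, `loss`, and `gradient` are hypothetical names chosen for the example.

```scala
object LogisticLoss01 {
  // Numerically stable log(1 + exp(x)); avoids overflow for large positive x.
  def log1pExp(x: Double): Double =
    if (x > 0) x + Math.log1p(Math.exp(-x)) else Math.log1p(Math.exp(x))

  // Logistic loss for a raw score `margin` (before the sigmoid) and a label in {0, 1}.
  def loss(margin: Double, label: Double): Double = {
    require(label == 0.0 || label == 1.0, "label must be 0 or 1")
    // label = 1: log(1 + exp(-margin)); label = 0: log(1 + exp(margin))
    if (label > 0) log1pExp(-margin) else log1pExp(margin)
  }

  // Derivative of the loss with respect to the margin: sigmoid(margin) - label.
  def gradient(margin: Double, label: Double): Double =
    1.0 / (1.0 + Math.exp(-margin)) - label
}
```

With this formulation the same sigmoid that produces the 0 to 1 prediction also appears in the gradient, so the loss and the prediction share one label convention.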


Use 1/0 labels for binary classification instead of 1/-1 #9
