Experimental: Allow shortlists in marian-scorer (browsermt) #3
Description

NOTE! `--gemm-precision int8shiftAlphaAll` currently doesn't work with `marian-scorer`; `int8` and `int8shift` do, however. The problem is related to the loading of the precomputed alphas. Work is ongoing here.

This PR adds the possibility to use a shortlist during (re)scoring in marian-scorer. Its aim is to obtain word-scores from marian-scorer which are comparable to those produced during decoding.

Motivation

This is a port of PR #2 onto the browsermt fork of marian. This is necessary to use models that require intgemm.

Caveats
During decoding, tensor indices corresponding to non-shortlist tokens are discarded. This reduction in tensor size lowers the computational cost of subsequent operations and improves decoder performance. As a result, the softmax + cross-entropy operation only ever sees shortlisted tokens. To imitate this in marian-scorer, we perform a modified softmax whose normalisation factor is computed from the sum over the subset defined by the shortlist. The probabilities of shortlist tokens correctly sum to unity, while the sum over the full vocabulary is greater than (or equal to) unity. Consequently, when scoring encounters tokens that are not in the shortlist, their log-scores are not bounded above by 0 and may be positive.
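As a rough illustration, here is a minimal standalone sketch of such a shortlist-normalised log-softmax (not the actual Marian implementation; all names are invented):

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

// Log-softmax whose normalisation factor sums only over shortlisted indices:
//   log p(w) = s_w - log( sum over v in shortlist of exp(s_v) ).
// Shortlisted probabilities sum to unity; tokens outside the shortlist are
// not bounded above by 0 and may receive positive log-scores.
std::vector<float> shortlistLogSoftmax(const std::vector<float>& logits,
                                       const std::vector<int>& shortlist) {
  float denom = 0.f;
  for (int idx : shortlist)
    denom += std::exp(logits[idx]);  // real code would subtract max(logits) first
  const float logDenom = std::log(denom);
  std::vector<float> scores(logits.size());
  for (std::size_t i = 0; i < logits.size(); ++i)
    scores[i] = logits[i] - logDenom;
  return scores;
}

int main() {
  const std::vector<float> logits = {2.f, 1.f, 0.5f, 3.f};  // full vocabulary
  const std::vector<int> shortlist = {0, 1};                // shortlisted ids
  for (float s : shortlistLogSoftmax(logits, shortlist))
    std::printf("%f\n", s);
  // Index 3 is outside the shortlist and scores 3 - log(e^2 + e^1) = +0.69.
}
```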
You must maintain the same batching as used in decoding! The size of the generated shortlist depends on the contents of a particular batch, specifically the set of distinct tokens it contains; see the sketch below.
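To see why, consider this hypothetical sketch (names invented, not Marian's shortlist code) of how a lexical shortlist is assembled from a batch: the candidate set is the union of the top-k target tokens for every source token present, so different batches produce different shortlists:

```cpp
#include <map>
#include <set>
#include <vector>

// Union of the top-k target candidates for every source token in the batch.
// A different batch therefore yields a different (and differently sized)
// shortlist, which changes what the modified softmax normalises over.
std::set<int> batchShortlist(const std::vector<int>& batchSourceTokens,
                             const std::map<int, std::vector<int>>& topKTargets) {
  std::set<int> candidates;
  for (int src : batchSourceTokens) {
    auto it = topKTargets.find(src);
    if (it != topKTargets.end())
      candidates.insert(it->second.begin(), it->second.end());
  }
  return candidates;
}

int main() {
  const std::map<int, std::vector<int>> topK = {{1, {10, 11}}, {2, {11, 12}}};
  auto small = batchShortlist({1}, topK);     // candidates {10, 11}
  auto large = batchShortlist({1, 2}, topK);  // candidates {10, 11, 12}
  (void)small; (void)large;
}
```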
For performance, the cross-entropy operation in Marian implements the softmax sum as part of its node operation. This implementation differs: it uses several node operations to accomplish the same result.
Finally, decoding and scoring are two distinct modes of operation, utilising different code paths and therefore different expression graphs: the decoder generates tokens sequentially, while the scorer has them provided ahead of time. As such, floating-point errors propagate differently, and results may differ numerically.
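To illustrate the kind of discrepancy involved, the following standalone sketch (illustrative only, not Marian code) compares a log-sum-exp computed as several separate steps against a fused, max-subtracted evaluation; the results can differ in the final bits, and such differences accumulate differently along the two code paths:

```cpp
#include <algorithm>
#include <cmath>
#include <cstdio>
#include <vector>

// "Composed" evaluation: separate exp, sum and log steps, as a graph built
// from several small node operations would compute them.
float composedLogSumExp(const std::vector<float>& v) {
  float sum = 0.f;
  for (float x : v) sum += std::exp(x);
  return std::log(sum);
}

// "Fused"-style evaluation with the usual max-subtraction for stability,
// as a single softmax/cross-entropy kernel would typically compute it.
float fusedLogSumExp(const std::vector<float>& v) {
  float m = *std::max_element(v.begin(), v.end());
  float sum = 0.f;
  for (float x : v) sum += std::exp(x - m);
  return m + std::log(sum);
}

int main() {
  const std::vector<float> logits = {10.1f, 9.7f, 0.2f, -3.5f};
  // The two values agree closely but need not match bit-for-bit; small
  // discrepancies like this accumulate differently when decoding and scoring.
  std::printf("composed: %.9f\nfused:    %.9f\n",
              composedLogSumExp(logits), fusedLogSumExp(logits));
}
```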
Added dependencies: none
How to test
Using the same shortlist settings as during decoding (e.g. `--shortlist lex.s2t.gz 100 100`), you should receive roughly similar word-scores when rescoring decoder output.

Checklist