oneDNN + MKL #967

graemenail · 2022-09-14T09:23:03Z

TODO

Manual check of training / scoring

Description

Adds oneDNN. This PR should enable a completely open-source compilations of Marian.

In contrast to #937, this PR also allows MKL-based builds. When compiled with -DUSE_DNNL=ON, oneDNN is used for sgemm, even if -DUSE_MKL=ON is requested.

This PR also includes a caching of boost, and a cleaning of the debug build directory. These can be broken off into a separate PR if necessary. During testing, windows builds would fail from running out of space building both debug and release. Since disabling oneDNN JIT profiling the build sizes are smaller.

Related: marian-nmt/marian-regression-tests#86
Closes: #706
Supersedes: #937

List of changes:

Adds oneDNN as a submodule
Use oneDNN for sgemm
MKL has been retained for backwards compatibility
LSH (with rotation) now explicitly requires MKL. This is to avoid silently falling back to less performant codepaths. To document, BLAS_FOUND is sufficient here and the MKL_FOUND condition could be omitted by a user expecting a performance degradation.

Added dependencies: Intel oneDNN

How to test

Ran the 1 million sentence testset of WNGT21 through the MKL and oneDNN versions.

The static binaries are now larger.

Regression test results

Skipped:
- tests/server/test_ende_cpu.sh
Failed:
- tests/decoder/intgemm/test_intgemm_16bit.sh
- tests/decoder/intgemm/test_intgemm_16bit_avx2.sh
- tests/decoder/intgemm/test_intgemm_16bit_sse2.sh
- tests/decoder/intgemm/test_intgemm_8bit.sh
- tests/decoder/intgemm/test_intgemm_8bit_avx2.sh
- tests/decoder/intgemm/test_intgemm_8bit_ssse3.sh
- tests/models/wngt19/test_model_base_fbgemm_packed8.sh
Logs:
- /home/gnail/projects/mkl-onednn/marian-dev/regression-tests/tests/decoder/intgemm/test_intgemm_16bit.sh.log
- /home/gnail/projects/mkl-onednn/marian-dev/regression-tests/tests/decoder/intgemm/test_intgemm_16bit_avx2.sh.log
- /home/gnail/projects/mkl-onednn/marian-dev/regression-tests/tests/decoder/intgemm/test_intgemm_16bit_sse2.sh.log
- /home/gnail/projects/mkl-onednn/marian-dev/regression-tests/tests/decoder/intgemm/test_intgemm_8bit.sh.log
- /home/gnail/projects/mkl-onednn/marian-dev/regression-tests/tests/decoder/intgemm/test_intgemm_8bit_avx2.sh.log
- /home/gnail/projects/mkl-onednn/marian-dev/regression-tests/tests/decoder/intgemm/test_intgemm_8bit_ssse3.sh.log
- /home/gnail/projects/mkl-onednn/marian-dev/regression-tests/tests/models/wngt19/test_model_base_fbgemm_packed8.sh.log
---------------------
Ran 19 tests in 00:00:0.000s, 11 passed, 1 skipped, 7 failed
FAILED

Checklist

I have tested the code manually
I have run regression tests
I have read and followed CONTRIBUTING.md
I have updated CHANGELOG.md

Compile time preference: 1. MKL 2. oneDNN 3. Other BLAS

The BLAS_FOUND ifdef guards are removed as sgemm already has an abort.

This reverts commit 735ef42.

This change was requested to avoid silently falling back to slower BLAS implementations.

graemenail mentioned this pull request Sep 14, 2022

Add oneDNN #937

Closed

4 tasks

graemenail added 28 commits November 2, 2022 13:19

Add oneDNN submodule to 3rd_party

36bf47e

Add oneDNN to CMake

ebd1559

Don't build DNNL examples

0b9cb99

Allow static builds of DNNL

e344dbc

Remove MKL include from config parser

b89f5d0

Add oneDNN sgemm

dc5c48f

Compile time preference: 1. MKL 2. oneDNN 3. Other BLAS

Improve oneDNN CMake

28b0eb3

Add oneDNN in prod

d454985

The BLAS_FOUND ifdef guards are removed as sgemm already has an abort.

Use int in loop for ProdBatched

01aba42

oneDNN only use OMP runtime when specified

9d6437f

Move MSVC unicode flags out of global flags

6675ac3

Disable DNNL JIT Profiling

cc7cb75

Cache Boost

35d302a

Clean up after debug build

4e69c5a

Update CHANGELOG

d44fa4d

Mention oneDNN in documentation

a975a1a

Fix comments mentioning MKL

215eec1

oneDNN GH actions

ed9fa14

Warn if no BLAS for FBGEMM

04053c0

Windows needs MKL for FBGEMM blas

79da945

Try Ubuntu with openblas for FBGEMM

a424614

Fix typo

714bdc8

Set BLAS_FOUND for Apple Accelerate

5f96baa

Revert "Try Ubuntu with openblas for FBGEMM"

fdf6c59

This reverts commit 735ef42.

Prefer DNNL codepaths at compile time when requested

4a29f6d

Update oneDNN compilation for clang CI

55478ce

Require MKL for LSH (with rotation)

7426ebb

This change was requested to avoid silently falling back to slower BLAS implementations.

Fix abort message to require MKL rather than generic BLAS

3e1a550

graemenail force-pushed the mkl-onednn branch from d691b98 to 3e1a550 Compare November 2, 2022 13:22

graemenail marked this pull request as ready for review November 14, 2022 17:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

oneDNN + MKL #967

oneDNN + MKL #967

Uh oh!

graemenail commented Sep 14, 2022 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

oneDNN + MKL #967

Are you sure you want to change the base?

oneDNN + MKL #967

Uh oh!

Conversation

graemenail commented Sep 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TODO

Description

How to test

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

graemenail commented Sep 14, 2022 •

edited

Loading