Releases · allozaur/llama.cpp

01 Sep 14:45

02c1813

b6344

Vulkan: Add Integer Dot Product mul_mat_vec shader for legacy quants …

Assets 15

01 Sep 07:30

github-actions

b6341

b66df9d

b6341

CUDA: fix build error from ambiguous __half conversions in conv2d (#1…

Assets 15

24 Aug 00:23

github-actions

b6259

710dfc4

b6259

CUDA: fix half2 -> half conversion for HIP (#15529)

Assets 15

19 Aug 23:22

github-actions

b6209

fb22dd0

b6209

opencl: mark `argsort` unsupported if cols exceed workgroup limit (#1…

Assets 15

14 Aug 13:03

github-actions

b6160

d32e03f

b6160

server : add SWA checkpoints (#15293)

* server : add SWA checkpoints

ggml-ci

* cont : server clean-up

* server : handle state restore fails

* llama : add extended llama_state_seq_ API

* server : do not make checkpoints if --swa-full

ggml-ci

* llama : remove flags value for NONE

* server : configure number of SWA checkpoints with CLI arg

ggml-ci

* args : fix scope of new argument

Assets 15

02 Aug 22:57

github-actions

b6075

5c0eb5e

b6075

opencl: fix adreno compiler detection logic (#15029)

Assets 15

01 Aug 19:17

github-actions

b6059

c76b420

b6059

vendor : update vendored copy of google/minja (#15011)

* vendor : update vendored copy of google/minja

Signed-off-by: Lennart Austenfeld <[email protected]>

* Re-remove trailing whitespace

Signed-off-by: Lennart Austenfeld <[email protected]>

* Remove another trailing whitespace

Signed-off-by: Lennart Austenfeld <[email protected]>

---------

Signed-off-by: Lennart Austenfeld <[email protected]>

Assets 15

29 Jul 19:24

github-actions

b6027

aa79524

b6027

HIP: remove the use of __HIP_PLATFORM_AMD__, explicitly support only …

Assets 15

29 Jul 18:02

github-actions

b6026

b77d111

b6026

HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. …

Assets 15

Releases: allozaur/llama.cpp

b6344

Uh oh!

b6341

Uh oh!

b6259

Uh oh!

b6209

Uh oh!

b6160

Uh oh!

b6075

Uh oh!

b6059

Uh oh!

b6027

Uh oh!

b6026

Uh oh!