Skip to content

Releases: leixy76/llama.cpp

b3454

24 Jul 16:19
f19bf99

Choose a tag to compare

Build Llama SYCL Intel with static libs (#8668)

Ensure SYCL CI builds both static & dynamic libs for testing purposes

Signed-off-by: Joe Todd <joe.todd@codeplay.com>

b3439

22 Jul 15:35
566daa5

Choose a tag to compare

*.py: Stylistic adjustments for python (#8233)

* Superflous parens in conditionals were removed.
* Unused args in function were removed.
* Replaced unused `idx` var with `_`
* Initializing file_format and format_version attributes
* Renaming constant to capitals
* Preventing redefinition of the `f` var

Signed-off-by: Jiri Podivin <jpodivin@redhat.com>

b3432

22 Jul 02:08
45f2c19

Choose a tag to compare

flake.lock: Update (#8610)

b3414

18 Jul 13:32
0d2c732

Choose a tag to compare

server: use relative routes for static files in new UI (#8552)

* server: public: fix api_url on non-index pages

* server: public: use relative routes for static files in new UI

b3409

17 Jul 16:08
30f80ca

Choose a tag to compare

CONTRIBUTING.md : remove mention of noci (#8541)

b3405

17 Jul 01:52
5e116e8

Choose a tag to compare

make/cmake: add missing force MMQ/cuBLAS for HIP (#8515)

b3389

14 Jul 13:42
73cf442

Choose a tag to compare

llama : fix Gemma-2 Query scaling factors (#8473)

* 9B - query_pre_attn_scalar = 256 not 224

See https://github.com/google/gemma_pytorch/commit/03e657582d17cb5a8617ebf333c1c16f3694670e

Gemma 9b should use 256 and not 224 (self.config.hidden_size // self.config.num_attention_heads)

* llama : fix Gemma-2 Query scaling factor

ggml-ci

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>

b3384

12 Jul 15:03
4e24cff

Choose a tag to compare

server : handle content array in chat API (#8449)

* server : handle content array in chat API

* Update examples/server/utils.hpp

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

---------

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

b3372

11 Jul 14:36
a977c11

Choose a tag to compare

gitignore : deprecated binaries

b3369

11 Jul 01:48
278d0e1

Choose a tag to compare

Initialize default slot sampling parameters from the global context. …