Commit Graph

106 Commits (872c365a9176a011b13d31269bb3121fa89c37e1)

Author SHA1 Message Date
Georgi Gerganov 543c57e991
Ammend to previous commit - forgot to update non-QRDMX branch 2 years ago
Georgi Gerganov 113a9e83eb
10% performance boost on ARM 2 years ago
Sebastián A eb062bb012
Windows fixes (#31)
* Apply fixes suggested to build on windows

Issue: https://github.com/ggerganov/llama.cpp/issues/22

* Remove unsupported VLAs

* MSVC: Remove features that are only available on MSVC C++20.

* Fix zero initialization of the other fields.

* Change the use of vector for stack allocations.
2 years ago
Georgi Gerganov f1eaff4721 Add AVX2 support for x86 architectures thanks to @Const-me ! 2 years ago
Georgi Gerganov 007a8f6f45
Support all LLaMA models + change Q4_0 quantization storage 2 years ago
Georgi Gerganov 26c0846629
Initial release 2 years ago