You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
574406dc7e
* ggml : add Q5_0 quantization (cuBLAS only) * ggml : fix Q5_0 qh -> uint32_t * ggml : fix q5_0 histogram stats * ggml : q5_0 scalar dot product * ggml : q5_0 ARM NEON dot * ggml : q5_0 more efficient ARM NEON using uint64_t masks * ggml : rename Q5_0 -> Q5_1 * ggml : adding Q5_0 mode * quantize : add Q5_0 and Q5_1 to map * ggml : AVX2 optimizations for Q5_0, Q5_1 (#1195) --------- Co-authored-by: Stephan Walter <stephan@walter.name> |
1 year ago | |
---|---|---|
.. | ||
benchmark | 1 year ago | |
embedding | 1 year ago | |
main | 1 year ago | |
perplexity | 1 year ago | |
quantize | 1 year ago | |
quantize-stats | 1 year ago | |
save-load-state | 1 year ago | |
CMakeLists.txt | 1 year ago | |
Miku.sh | 1 year ago | |
alpaca.sh | 1 year ago | |
chat-13B.bat | 1 year ago | |
chat-13B.sh | 1 year ago | |
chat.sh | 1 year ago | |
common.cpp | 1 year ago | |
common.h | 1 year ago | |
gpt4all.sh | 1 year ago | |
reason-act.sh | 1 year ago |