Commit Graph

29 Commits (317fb12fbd7cef5d86476574bffe0e904af884ca)

Author SHA1 Message Date
unbounded 62cfc54f77
Add quantize-stats command for testing quantization (#728)
Command that calculates some statistics over the errors introduced by
quantization, like mean square error, max error and some percentile errors for layer
weights. Should be useful for testing quantization improvements.

Exposes some internal state from ggml and llama for testing
2 years ago
Sergey Alirzaev cc9cee8e9e
Do not crash when it has nothing to say. (#796)
Otherwise observing this in the interactive mode:
/usr/lib/gcc/x86_64-pc-linux-gnu/12/include/g++-v12/bits/stl_vector.h:1230: reference std::vector<int>::back() [_Tp = int, _Alloc = std::allocator<int>]: Assertion '!this->empty()' failed.
2 years ago
at8u ff05d05c96
miku.sh : add executable bit (#780) 2 years ago
at8u 88ed5761b8
examples : add Miku.sh (#724)
* Add Miku.sh to examples

* Add missing line to prompt in Miku.sh

* Add --keep param to Miku.sh

* Remove '[end_of_conversation]' line from Miku.sh

No longer is necessary.
2 years ago
mgroeber9110 53dbba7695
Windows: reactive sigint handler after each Ctrl-C (#736) 2 years ago
Leonardo Neumann 6e7801d08d
examples : add gpt4all script (#658) 2 years ago
Murilo Santana 5b70e7de4c
fix default params for examples/main (#697) 2 years ago
Slaren 0d054e292e Show error message when -f fails 2 years ago
Slaren 64bde3ffd4 Fix ggml_init_params in quantize 2 years ago
Thérence d9ad104440
Create chat-13B.bat (#592)
* Create chat-13B.bat

Same script than chat-13B.sh, but for windows users.
Tested and working on windows 10/11 v 22H2

* Apply suggestions from code review

---------

Co-authored-by: anzz1 <anzz1@live.com>
2 years ago
Tobias Lütke a6956b25a1
add example of re-act pattern (#583)
* add example of re-act pattern

* spelling...

* fixed whitespace in reverse prompt issue
2 years ago
anzz1 7f4c5c6651
llama : fix linkage with mingw (#551)
* Revert 7e53955 (#542)

Still needs to be fixed properly

* Fix linking on mingw32
2 years ago
Stephan Walter 436e561931
all : be more strict about converting float to double (#458)
* Be more strict about converting float to double

* Test equivalence of round, SILU implementations

Test module is commented out in CMakeLists.txt because the tests may
take a long time, depending on how much the compiler optimizes.

* Fix softmax in perplexity.cpp

* all : prefer float over double where appropriate

* perplexity : add <cmath>

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
Stephan Walter c1f885067c
ggml : introduce structs for the q4 data blocks (#356)
* Introduce structs for the q4 data blocks

* ggml : rename quant struct variables + fix ARM_NEON

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2 years ago
anzz1 7b8dbcb78b
main.cpp fixes, refactoring (#571)
- main: entering empty line passes back control without new input in interactive/instruct modes
- instruct mode: keep prompt fix
- instruct mode: duplicate instruct prompt fix
- refactor: move common console code from main->common
2 years ago
Marco Matthies 7e5395575a
Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542) 2 years ago
Stephan Walter b391579db9
Update README and comments for standalone perplexity tool (#525) 2 years ago
anzz1 7a87d31f4f
[main] fix infinite generation (-n == -1) (#523) 2 years ago
Harald Fernengel 33e35b8fe8
Exit from interactive mode if input stream is bad (#491)
Allow exiting the interactive prompt also with CTRL-D on Unix and CTRL-Z
on Windows.
2 years ago
anzz1 34ab526843
(Windows) Set console to UTF-8 on init (#420)
Sets console codepage to 65001 (CP_UTF8) on start for both input and output, should fix problems with UTF-8 characters.
2 years ago
Georgi Gerganov c2b25b6912
Fix colors enabling on WIN32 2 years ago
Georgi Gerganov 79b2b266db
If n_predict == -1, generate forever 2 years ago
Georgi Gerganov e2d490dafd
Inifinite generation via context swapping (#71) 2 years ago
Georgi Gerganov 03f7e33560
Cleanup STL headers + fix embedding examples + minor stuff 2 years ago
Georgi Gerganov 55ad42af84
Move chat scripts into "./examples" 2 years ago
Georgi Gerganov a316a425d0
Overhaul the examples structure
- main -> examples
- utils -> examples (renamed to "common")
- quantize -> examples
- separate tools for "perplexity" and "embedding"

Hope I didn't break something !
2 years ago
Georgi Gerganov 04c6f5ed6f
Immediately start processing the prompt before user input has been provided (#476) 2 years ago
Mathieu Nayrolles 3f9c6135e4
fix typo in chatLLaMa (#368)
The prompt contains a typo where 'alound' is used instead of 'aloud'.
2 years ago
Jean-Christophe Hoelt 3ab3e6582f
Add chatLLaMa script (#198)
* Add chatLLaMa script

* Fix shellcheck errors and do some cleanup

* Move chatLLaMa script to `examples` directory

* Reduce chatLLaMa context size to 2048

Ref d7def1a752

* Include n_predict to 2048 in examples/chatLLaMa
2 years ago