تاریخچه Commit ها

نویسنده SHA1 پیام تاریخ
  slaren b0bc9f4a9d llama-bench : use random tokens to improve accuracy with mixtral (#6069) 1 سال پیش
  Georgi Gerganov 4755afd1cb llama : fix integer overflow during quantization (#6063) 1 سال پیش
  Steve Grubb 6e0438da3c gguf : fix resource leaks (#6061) 1 سال پیش
  Ondřej Čertík 727107707a gguf-py : bump version to 0.8.0 (#6060) 1 سال پیش
  Michael Podvitskiy 69ff61397d llama : support models without vocabulary (#5798) 1 سال پیش
  Georgi Gerganov 044ec4b2a5 embedding : add EOS token if not present (#899) 1 سال پیش
  Georgi Gerganov 77178eedc8 gguf-py : fix dtype check (#6045) 1 سال پیش
  Jian Liao 15a333260a readme : improve readme for Llava-1.6 example (#6044) 1 سال پیش
  Pierrick Hymbert 43241adf22 server: disable debug release type sanitizer, simplify trigger (#6047) 1 سال پیش
  Georgi Gerganov a44bc969e4 llama : fix typo 1 سال پیش
  Michael Podvitskiy 2c4fb69246 llama : optimize defrag moves + fix fragmentation calculation (#6037) 1 سال پیش
  Ondřej Čertík 3ca23481dd gguf-py : add support for I8, I16 and I32 (#6045) 1 سال پیش
  Georgi Gerganov 3fe8d7a17f ggml : designate enum vals for integer types (#6050) 1 سال پیش
  Georgi Gerganov 68265ebfc6 embedding : print all resulting embeddings (#899) 1 سال پیش
  Georgi Gerganov 381da2d9f0 metal : build metallib + fix embed path (#6015) 1 سال پیش
  Georgi Gerganov 0fd6c1f015 embedding : print cosine similarity (#899) 1 سال پیش
  Linwei Wang 19885d205e readme : update details about running llama in Termux on Android (#6039) 1 سال پیش
  Georgi Gerganov 76a936c893 readme : update API changes and hot topics 1 سال پیش
  Clint Herron 463628372d grammar : handle missing "root" node (#6004) 1 سال پیش
  slaren f30ea47a87 llama : add pipeline parallelism support (#6017) 1 سال پیش
  slaren d8fd0ccf6a test-backend-ops : skip CPU backend by default (#6028) 1 سال پیش
  AidanBeltonS b3d978600f Update get version (#6025) 1 سال پیش
  Xuan Son Nguyen 99b71c068f Server: Use multi-task for embeddings endpoint (#6001) 1 سال پیش
  slaren 306d34be7a ci : remove tidy-review (#6021) 1 سال پیش
  Georgi Gerganov 8030da7afe ggml : reuse quantum structs across backends (#5943) 1 سال پیش
  Georgi Gerganov 184215e783 ggml : fix UB in IQ2_S and IQ3_S (#6012) 1 سال پیش
  Georgi Gerganov 48358b2e5b sycl : update IQ1_S kernels (WIP - not working!) (#5995) 1 سال پیش
  gliptic 5cdb371731 grammar : fix unnecessarily retained pointer to rules (#6003) 1 سال پیش
  Kawrakow 44ca159faf 1.5 bit: we can do even better (#5999) 1 سال پیش
  Georgi Gerganov 05b06210c9 llama : more consistent names of count variables (#5994) 1 سال پیش