Historique des commits

Auteur SHA1 Message Date
  Yann Follet 646ef4a9cf embedding : more cli arguments (#7458) il y a 1 an
  Douglas Hanley 80ea089d77 llama : allow pooled embeddings on any model (#7477) il y a 1 an
  Georgi Gerganov 1442677f92 common : refactor cli arg parsing (#7675) il y a 1 an
  Georgi Gerganov 6ff13987ad common : normalize naming style (#7462) il y a 1 an
  dm4 ea3b0590ee embedding : free the batch after execution (#7297) il y a 1 an
  Joan Fontanals b83cc3f5b3 llama : add Jina Embeddings architecture (#6826) il y a 1 an
  Jared Van Bortel 1b67731e18 BERT tokenizer fixes (#6498) il y a 1 an
  howlger 1e13987fba embedding : show full embedding for single prompt (#6342) il y a 1 an
  Minsoo Cheong deb7240100 embedding : adjust `n_ubatch` value (#6296) il y a 1 an
  Georgi Gerganov 044ec4b2a5 embedding : add EOS token if not present (#899) il y a 1 an
  Georgi Gerganov 68265ebfc6 embedding : print all resulting embeddings (#899) il y a 1 an
  Georgi Gerganov 0fd6c1f015 embedding : print cosine similarity (#899) il y a 1 an
  slaren f30ea47a87 llama : add pipeline parallelism support (#6017) il y a 1 an
  SeungWon Jeong fb215c3832 server : normalize embeddings (#5956) il y a 1 an
  Georgi Gerganov 29ae62d2ae llama : fix embeddings (#5796) il y a 1 an
  bmwl f486f6e1e5 ggml : add numa options (#5377) il y a 1 an
  Douglas Hanley 03bf161eb6 llama : support batched embeddings (#5466) il y a 1 an
  Douglas Hanley 2891c8aa9a Add support for BERT embedding models (#5423) il y a 1 an
  cebtenzzre b12fa0d1c1 build : link against build info instead of compiling against it (#3879) il y a 2 ans
  slaren 16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301) il y a 2 ans
  Georgi Gerganov ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) il y a 2 ans
  Cebtenzzre 8781013ef6 make : restore build-info.h dependency for several targets (#3205) il y a 2 ans
  Cebtenzzre e6616cf0db examples : add compiler version and target to build info (#2998) il y a 2 ans
  Cebtenzzre e64f5b5578 examples : make n_ctx warning work again (#3066) il y a 2 ans
  Cebtenzzre 00d62adb79 fix some warnings from gcc and clang-tidy (#3038) il y a 2 ans
  Georgi Gerganov edd4c14817 llama : more tokenizer fixes (#2810) il y a 2 ans
  slaren 519c981f8b embedding : evaluate prompt in batches (#2713) il y a 2 ans
  Georgi Gerganov 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) il y a 2 ans
  Evan Miller 5656d10599 mpi : add support for distributed inference via MPI (#2099) il y a 2 ans
  Judd 36680f6e40 convert : update for baichuan (#2081) il y a 2 ans