Douglas Hanley
|
cdd1889de6
convert : add support for XLMRoberta embedding models (#8658)
|
1 год назад |
fairydreaming
|
d3f0c7166a
Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858)
|
1 год назад |
slaren
|
2b1f616b20
ggml : reduce hash table reset cost (#8698)
|
1 год назад |
Georgi Gerganov
|
938943cdbf
llama : move vocab, grammar and sampling into separate files (#8508)
|
1 год назад |