Sigbjørn Skjæret
|
55f6b9fa65
convert : fix duplicate key DeepSeek-R1 conversion error (#14103)
|
7 months ago |
Sigbjørn Skjæret
|
3678b838bb
llama : support GEGLU for jina-bert-v2 (#14090)
|
7 months ago |
Sigbjørn Skjæret
|
1caae7fc6c
gguf-py : add add_classifier_output_labels method to writer (#14031)
|
7 months ago |
Sigbjørn Skjæret
|
5e1c3aed40
convert : fix nomic-bert-moe mask token (#13757)
|
8 months ago |
Sigbjørn Skjæret
|
c496fe0b1d
convert : fix vocab padding code for bert models (#13954)
|
8 months ago |
Sigbjørn Skjæret
|
db38704f01
convert : fix rwkv bos/eos token (#13844)
|
8 months ago |
Xuan-Son Nguyen
|
07e4351ce6
convert : allow partial update to the chkhsh pre-tokenizer list (#13847)
|
8 months ago |
Đinh Trọng Huy
|
291f2b6913
llama : add support for DistilBert (#13907)
|
8 months ago |
Sigbjørn Skjæret
|
e83ba3e460
llama : add support for jina-reranker-v2 (#13900)
|
8 months ago |
Sigbjørn Skjæret
|
5ca82fc1d7
convert : workaround for AutoConfig dummy labels (#13881)
|
8 months ago |
Sigbjørn Skjæret
|
6385b843a8
llama : add RobertaForSequenceClassification reranker support (#13875)
|
8 months ago |
Đinh Trọng Huy
|
e0e3aa231d
llama : add support for BertForSequenceClassification reranker (#13858)
|
8 months ago |
Đinh Trọng Huy
|
aa6dff05be
convert: small addition to support LlamaModel (#13838)
|
8 months ago |
Xuan-Son Nguyen
|
a3938fb53d
convert : fix qwen omni conversion (#13859)
|
8 months ago |
Xuan-Son Nguyen
|
26b79b6cb3
convert : fix tensor naming conflict for llama 4 vision (#13836)
|
8 months ago |
Xuan-Son Nguyen
|
bc583e3c63
mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#13784)
|
8 months ago |
Xuan-Son Nguyen
|
40aaa8a403
mtmd : add support for Qwen2-Audio and SeaLLM-Audio (#13760)
|
8 months ago |
Xuan-Son Nguyen
|
797990c4bc
mtmd : add ultravox audio input (#13623)
|
8 months ago |
antichristHater
|
c76532e7ba
convert : add qwen2vl support for unsloth merges (#13686)
|
8 months ago |
Xuan-Son Nguyen
|
92ecdcc06a
mtmd : add vision support for llama 4 (#13282)
|
8 months ago |
Xuan-Son Nguyen
|
c531edfa34
convert : fix conversion for llama 4 (#13567)
|
8 months ago |
Gabe Goodhart
|
d590cd4c24
model : Granite MoE shared (#13269)
|
8 months ago |
Sigbjørn Skjæret
|
d2a4ef05c6
vocab : add ByteDance-Seed/Seed-Coder (#13423)
|
8 months ago |
Xuan-Son Nguyen
|
053367d149
mtmd : support InternVL 2.5 and 3 (#13422)
|
8 months ago |
Sigbjørn Skjæret
|
1a844be132
convert : support rope_scaling type and rope_type (#13349)
|
8 months ago |
Xuan-Son Nguyen
|
32916a4907
clip : refactor graph builder (#13321)
|
8 months ago |
Sigbjørn Skjæret
|
764b85627b
convert : qwen2/3moe : set yarn metadata if present (#13331)
|
9 months ago |
Xuan-Son Nguyen
|
5215b91e93
clip : fix confused naming ffn_up and ffn_down (#13290)
|
9 months ago |
Sigbjørn Skjæret
|
ae803bfc3d
convert : bailingmoe : set yarn metadata if present (#13312)
|
9 months ago |
ymcki
|
3bf785f3ef
llama : Llama-3_1-Nemotron-Ultra-253B-v1 support (#12843)
|
9 months ago |