Историја ревизија

Аутор SHA1 Порука Датум
  Piotr Wilkin 9014feadfa Change RoPE to NeoX пре 4 месеци
  Piotr Wilkin f020baa466 Normal attention: apply gate before output пре 4 месеци
  Piotr Wilkin 27fa5f335d Correct convolution state dimension calculations пре 4 месеци
  Piotr Wilkin e24c9dfa60 Remove OP_DELTA_NET, fix flake8 and editorchecker because why not пре 4 месеци
  Piotr Wilkin 6e3abeb6c0 Exclude MTP layers in conversion пре 4 месеци
  Piotr Wilkin 43eb7a7757 Now that eval's running move delta net stuff back to llama-model, add cbs пре 4 месеци
  Piotr Wilkin 890fa2c1e3 WE HAVE OUTPUT! пре 4 месеци
  Piotr Wilkin e590a75905 Cleanup complete, now for the recurrent memory management... пре 4 месеци
  Piotr Wilkin 2b0673c315 Cleanup ggml_delta_net пре 4 месеци
  Piotr Wilkin (ilintar) 72c98b0c7d Merge pull request #1 from ggml-org/xsn/qwen3next_experiment пре 4 месеци
  Xuan Son Nguyen e83ef74733 one less magic number пре 4 месеци
  Xuan Son Nguyen f643b957f4 refactor softplus fn пре 4 месеци
  Xuan Son Nguyen 46110e0630 split q_proj/gate пре 4 месеци
  Piotr Wilkin 9832f2934a Remove comments as half of them are wrong anyways пре 4 месеци
  Piotr Wilkin 8152df60f3 Getting closer (graph builds for bs=1 but tensor shaping is still wrong for bigger sizes) пре 4 месеци
  Piotr Wilkin e0c5dff2a7 Rewrite to tensor ops пре 4 месеци
  Piotr Wilkin 178230ee21 Getting to decode stage... пре 4 месеци
  Piotr Wilkin (ilintar) c78f9fce68 Merge branch 'ggml-org:master' into qwen3_next пре 4 месеци
  Radoslav Gerganov 2b6b55a59f server : include usage statistics only when user request them (#16052) пре 4 месеци
  Georgi Gerganov e58174cecb llama : bump max seq limit from 64 to 256 (#15916) пре 4 месеци
  Georgi Gerganov b213fce89b metal : improve F32, F16 and BF16 mat-vec multiplication (#16057) пре 4 месеци
  Jhen-Jie Hong e00f3fd8ff metal : avoid call free for non-owned buffer (#16067) пре 4 месеци
  Georgi Gerganov f2f28380ea metal : handle nil cv during pipeline creation (#16065) пре 4 месеци
  Chenguang Li 62c3b645c5 CANN: Remove print (#16044) пре 4 месеци
  Piotr Wilkin 344331c2b6 First draft пре 4 месеци
  Reese Levine d304f459d8 GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators (#16018) пре 4 месеци
  Georgi Gerganov 0320ac5264 metal : refactor + optimize v2 (#15995) пре 4 месеци
  Aleksander Grygier a7a98e0fff SvelteKit-based WebUI (#14839) пре 4 месеци
  Xuan-Son Nguyen 8f8f2274ee convert : add Llama4ForCausalLM (#16042) пре 4 месеци
  Johannes Gäßler c959b676be CUDA: fix FA occupancy, optimize tile kernel (#15982) пре 4 месеци