Pull requests: codewithdark-git/QuantLLM

Pull requests list

Refactor: Improve Quantization Suite & Benchmarking
#3 by codewithdark-git was merged May 21, 2025, updated May 21, 2025
Fix/awq quantized linear device issue
#6 by codewithdark-git was merged May 25, 2025, updated May 25, 2025
feat: Isolate and optimize GGUF quantization
#10 by codewithdark-git was merged May 26, 2025, updated May 27, 2025
Hi there! I've made some improvements to the AWQ quantization impleme…
#9 by codewithdark-git was merged May 26, 2025, updated May 27, 2025
Fix: Unify nn.Module device placement across all quantizers and base …
#7 by codewithdark-git was merged May 25, 2025, updated May 27, 2025
Feat: Introduce QuantizerFactory API and Refactor Quantization Workflow
#4 by codewithdark-git was merged May 21, 2025, updated May 27, 2025
Fix: Handle nn.Module in move_to_device
#8 by codewithdark-git was merged May 25, 2025, updated May 27, 2025
Add the GGUF for Quantization
#11 by codewithdark-git was merged May 27, 2025, updated May 27, 2025
Add the GGUF for Quantization
#12 by codewithdark-git was merged May 27, 2025, updated May 27, 2025
Add the GGUF for Quantization
#13 by codewithdark-git was merged May 27, 2025, updated May 27, 2025
Add the GGUF for Quantization
#14 by codewithdark-git was merged May 27, 2025, updated May 27, 2025
Feature/add gguf
#15 by codewithdark-git was merged May 28, 2025, updated Jul 16, 2025