codewithdark-git/QuantLLM
Commits
Branch: main
Author: google-labs-jules[bot]
Date range: All time
Commits on May 26, 2025
feat: Isolate and optimize GGUF quantization
google-labs-jules[bot] committed 1cdf6e9
Hi there! I've made some improvements to the AWQ quantization implementation.
google-labs-jules[bot] committed 23f331e
Commits on May 25, 2025
Fix: Handle nn.Module in move_to_device
google-labs-jules[bot] committed 07d1028
Fix: Unify nn.Module device placement across all quantizers and base class
google-labs-jules[bot] committed 33e21ba
Fix: Correct device placement for QuantizedLinear across all quantizers
google-labs-jules[bot] committed 5b434ed
Fix: Correct device placement for QuantizedLinear in AWQ
google-labs-jules[bot] committed bfb5167
Commits on May 21, 2025
Feat: Introduce QuantizerFactory API and Refactor Quantization Workflow
google-labs-jules[bot] committed 082196c
Refactor: Improve Quantization Suite & Benchmarking
google-labs-jules[bot] committed d2666f3