Why Hugging Face Shows the “Wrong” Parameter Count for AWQ Models
If you’ve loaded an AWQ-quantized model from Hugging Face (like Qwen2.5-VL-3B-Instruct-AWQ), you might have noticed something confusing: 👉 Hugging Face says the model has ~0.9B parameters, but the architecture is…
