Showing 120 of 120on this page. Filters & sort apply to loaded results; URL updates for sharing.120 of 120 on this page
Extremely Low Bit Transformer Quantization for On-Device NMT | PDF
VPTQ - Extreme Low Bit LLM Quantization - Compress 405B, 70B Models ...
Paper page - Mixed-Precision Graph Neural Quantization for Low Bit ...
[논문 리뷰] LUQ: Layerwise Ultra-Low Bit Quantization for Multimodal Large ...
Figure 1 from Residual Quantization for Low Bit-Width Neural Networks ...
(PDF) Pitch quantization in low bit-rate speech coding
Figure 5 from Residual Quantization for Low Bit-Width Neural Networks ...
A Hardware-Friendly Low-Bit Power-of-Two Quantization Method for CNNs ...
QuantSR: Accurate Low-bit Quantization for Efficient Image Super ...
This AI Research Introduces Atom: A Low-Bit Quantization Technique for ...
ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization – PyTorch
ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization | Zechun Liu
Advances to low-bit quantization enable LLMs on edge devices ...
Low-bit Model Quantization for Deep Neural Networks: A Survey | AI ...
Figure 1 from Distribution Matched Low-bit Post-Training Quantization ...
Honey, I shrunk the LLM! A beginner's guide to quantization • The Register
Figure 1 from A Simple Low-bit Quantization Framework for Video ...
Figure 1 from Low-bit Quantization for Deep Graph Neural Networks with ...
A Visual Guide to Quantization - by Maarten Grootendorst
SplitQuant: Layer Splitting for Low-bit Neural Network Quantization for ...
ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization | Hung-Yueh ...
Figure 2 from Improving Extreme Low-Bit Quantization With Soft ...
Figure 1 from Improving Extreme Low-Bit Quantization With Soft ...
Figure 4 from Low-bit Quantization for Deep Graph Neural Networks with ...
Figure 1 from Atom: Low-bit Quantization for Efficient and Accurate LLM ...
Figure 4 from Improving Extreme Low-Bit Quantization With Soft ...
Figure 13 from M-ANT: Efficient Low-bit Group Quantization for LLMs via ...
Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for ...
Figure 2 from Low-Bit Quantization and Quantization-Aware Training for ...
How to Achieve Extreme Low-bit Quantization for LLMs - YouTube
LittleBit: Ultra Low-Bit Quantization via Latent Factorization | AI ...
Quant-dLLM: Post-Training Extreme Low-Bit Quantization for Diffusion ...
(PDF) Low-Bit Quantization Methods for Modulated Wideband Converter ...
Navigating the Quantization Frontier: Achieving Ultra-Low-Bit Model ...
(PDF) A Novel Low-Bit Quantization Strategy for Compressing Deep Neural ...
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving | AI ...
(PDF) Low-bit Model Quantization for Deep Neural Networks: A Survey
Low-Bit Quantization Favors Undertrained LLMs
[PDF] DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization ...
Figure 1 from Outlier-Aware Training for Low-Bit Quantization of ...
Figure 1 from Low-bit Quantization Needs Good Distribution | Semantic ...
Figure 6 from Improving Extreme Low-Bit Quantization With Soft ...
Figure 3 from Improving Extreme Low-Bit Quantization With Soft ...
ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable ...
Figure 2 from Outlier-Aware Training for Low-Bit Quantization of ...
A Simple Low-bit Quantization Framework for Video Snapshot Compressive ...
LSQ+: Improving low-bit quantization through learnable offsets and ...
(PDF) Mixed Precision Low-bit Quantization of Neural Network Language ...
Table 1 from Low-bit Model Quantization for Deep Neural Networks: A ...
Low-bit Quantization of Neural Networks for Efficient Inference | DeepAI
(PDF) Direct Target Localization With Low-Bit Quantization in Wireless ...
2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution ...
Figure 1 from Distribution-aware Low-bit Quantization for 3D Point ...
A Visual Guide to Quantization - Maarten Grootendorst
(PDF) Low-bit Quantization of Recurrent Neural Network Language Models ...
Figure 1 from A Novel Low-Bit Quantization Strategy for Compressing ...
Guide to Audio Quality: Sample Rate, Bit Depth, and Bit Rate Explained
(PDF) A Hardware-Friendly Low-Bit Power-of-Two Quantization Method for ...
(PDF) Low-bit Quantization for Deep Graph Neural Networks with ...
Neuron-by-Neuron Quantization for Efficient Low-Bit QNN Training
Figure 1 from Towards Low-Bit Quantization of Deep Neural Networks with ...
Paper page - Low-Bit Quantization Favors Undertrained LLMs: Scaling ...
Figure 1 from Rethinking the Importance of Quantization Bias, Toward ...
A Novel Low‐Bit Quantization Strategy for Compressing Deep Neural ...
Transformer Quantization at Darlene Stinson blog
(PDF) LSQ+: Improving low-bit quantization through learnable offsets ...
Comparison with the state-of-the-art low-bit quantization methods on ...
Edge-ASR: Towards Low-Bit Quantization of Automatic Speech Recognition ...
Figure 1 from Unlocking Data-free Low-bit Quantization with Matrix ...
PB-LLM: a cutting-edge technique for extreme low-bit quantization in ...
MBCQ: Mixed-bias compensation quantization for extremely low-bit post ...
Figure 1 from Low-Bit Quantization and Quantization-Aware Training for ...
Figure 1 from LSQ+: Improving low-bit quantization through learnable ...
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving ...
A Survey of Quantization Methods for Efficient Neural Network Inference ...
Paper page - Atom: Low-bit Quantization for Efficient and Accurate LLM ...
Table 1 from Lowbit Neural Network Quantization for Speaker ...
Table 1 from Why Do Some Inputs Break Low-Bit LLM Quantization ...
Quantify and fine-tune shared weights 4.2 Low-bit representation A ...
Advances in the Neural Network Quantization: A Comprehensive Review
Figure 5 from Regularizing Activation Distribution for Ultra Low-bit ...
GitHub - pksvision/Low-Bit-Quantization-Pytorch · GitHub
How low-bit inference enables efficient AI - Dropbox
Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight ...
(PDF) INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit ...
[PDF] What Makes Low-Bit Quantization-Aware Training Work for Reasoning ...
(PDF) Metis: Training Large Language Models with Advanced Low-Bit ...
Metis: Training Large Language Models with Advanced Low-Bit ...
(PDF) QuantFace: Towards Lightweight Face Recognition by Synthetic Data ...
Bit-depth explained: Everything you need to know - SoundGuys
(PDF) Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Figure 1 from Regularizing Activation Distribution for Ultra Low-bit ...