Ngai Wong
11 papers · 7 citations
Most-cited papers
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies · 2024 · 112 citations
- Rethinking Kullback-Leibler Divergence In Knowledge Distillation For Large Language Models · 2024 · 45 citations
- APTQ: Attention-aware Post-training Mixed-precision Quantization For Large Language Models · 2024 · 37 citations
- LoRETTA: Low-rank Economic Tensor-train Adaptation For Ultra-low-parameter Fine-tuning Of Large Language Models · 2024 · 19 citations
- UNComp: Can Matrix Entropy Uncover Sparsity? A Compressor Design From An Uncertainty-aware Perspective · 2024 · 14 citations
- Enhancing Robustness Of Implicit Neural Representations Against Weight Perturbations · 2025 · 3 citations
- Binary Weight Multi-bit Activation Quantization For Compute-in-memory CNN Accelerators · 2025 · 1 citation
- Distribution-aware Hadamard Quantization For Hardware-efficient Implicit Neural Representations · 2025 · 1 citation
- MINR: Efficient Implicit Neural Representations For Multi-image Encoding · 2025 · 1 citation
- QuadINR: Hardware-efficient Implicit Neural Representations Through Quadratic Activation · 2025 · 1 citation
Topics