cs.LG
50 papers tagged cs.LG (ordered by heat_score)
Papers
- Rethinking Memory as Continuously Evolving Connectivity (2026)Jizhan Fang et al.13.31
- JLT: Clean-Latent Prediction in Latent Diffusion Transformers (2026)Funing Fu et al.11.74
- OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration (2026)Xinchen Zhang et al.11.20
- Recursive Flow Matching (2026)Jiahe Huang et al.11.02
- Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders (2026)Yi Jing et al.10.61
- Negligible in Size, Significant in Effect: On Scale Vectors in Large Language Models (2026)Mingze Wang et al.10.48
- MobileMoE: Scaling On-Device Mixture of Experts (2026)Yanbei Chen et al.9.24
- Less is More: Early Stopping Rollout for On-Policy Distillation (2026)Zhou Ziheng et al.8.54
- RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models (2026)Xing Cong et al.8.41
- PRISM: Position-encoded Regressive Inverse Spectral Model for Multilayer Thin-Film Design (2026)Runtian Wang et al.8.11
- Adversarial Speaker Distillation for Countermeasure Model on Automatic
Speaker Verification (2025)Yen-Lun Liao et al.7.81
- Squeezing Capacity from Multimodal Large Language Models for Subject-driven Generation (2026)Shuhong Zheng et al.7.39
- MEMS and ECM Sensor Technologies for Cardiorespiratory Sound Monitoring
- A Comprehensive Review (2025)Yasaman Torabi et al.7.16
- Acoustics-specific Piano Velocity Estimation (2026)Federico Simonetta et al.6.81
- On the Push-Based Asynchronous Federated Learning: A Bias-Correction Aggregation Approach (2026)Jiahui Bai et al.6.77
- Real-time Speech Summarization for Medical Conversations (2025)Khai Le-Duc et al.5.24
- Advancing Creative Physical Intelligence in Large Multimodal Models (2026)Cheng Qian et al.5.04
- Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases (2026)Dongyoon Hahm et al.5.04
- A Sharper Picture of Generalization in Transformers (2026)Paul Lintilhac et al.4.54
- Graph Navier Stokes Networks (2026)Zexing Zhao et al.4.54
- Variance Reduction for Expectations with Diffusion Teachers (2026)Jesse Bettencourt et al.4.54
- Probabilistic Data-Driven Modelling of Astrophysical Transients: The Neural Process Family for Ultrafast and Class-Agnostic Light Curve Reconstruction with NightLANP (2026)Siddharth Chaini et al.4.54
- Pruning and Distilling Mixture-of-Experts into Dense Language Models (2026)Junhyuck Kim et al.4.54
- Analyzing Quality-Latency-Resource Trade-offs in a Technical Documentation RAG Assistant Using LoRA Adaptation (2026)Evgenii Palnikov et al.4.54
- AdaDPO: Self-Adaptive Direct Preference Optimization with Balanced Gradient Updates (2026)Shaolong Chen et al.4.54
- Soft-SVeRL: Self-Verified Reinforcement Learning with Soft Rewards (2026)Saurabh Dash et al.4.54
- Dark Quest II: A Wide-Coverage Neural Network Emulator of the Nonlinear Matter Power Spectrum Across Extended Cosmologies (2026)Satoshi Tanaka et al.4.54
- Trading Devil: Robust backdoor attack via Stochastic investment models
and Bayesian approach (2025)Orson Mengara4.52
- AgentAtlas: Beyond Outcome Leaderboards for LLM Agents (2026)Parsa Mazaheri et al.3.91
- GenSBI: Generative Methods for Simulation-Based Inference in JAX (2026)Aurelio Amerio3.91
- FLUIDSPLAT: Reconstructing Physical Fields from Sparse Sensors via Gaussian Primitives (2026)Huaxi Huang et al.3.10
- Multi-Agent Reinforcement Learning for Safe Autonomous Driving Under Pedestrian Behavioral Uncertainty (2026)Prakash Aryan et al.3.10
- Matryoshka Concept Bottleneck Models (2026)Ziye Chen et al.3.10
- "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization (2026)Eldar Kurtic et al.2.94
- Reevaluating Policy Gradient Methods for Imperfect-Information Games (2026)Max Rudolph et al.1.96
- Reformulation of RBM to Unify Linear and Nonlinear Dimensionality Reduction (2026)Jiangsheng You et al.0.00
- Medical Spoken Named Entity Recognition (2025)Khai Le-Duc et al.0.00
- CktGen: Automated Analog Circuit Design with Generative Artificial Intelligence (2026)Yuxuan Hou et al.0.00
- Yes, Q-learning Helps Offline In-Context RL (2026)Denis Tarasov et al.0.00
- OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning (2026)Mingxin Huang et al.0.00
- Semantic-Aware Interpretable Multimodal Music Auto-Tagging (2025)Andreas Patakis et al.0.00
- Learnable Kernel Density Estimation for Graphs and Its Application to Graph-Level Anomaly Detection (2026)Xudong Wang et al.0.00
- Interpretability and Generalization Bounds for Learning Spatial Physics (2026)Alejandro Francisco Queiruga et al.0.00
- A Physics-Informed Hierarchical Neural Network for Microwave Scattering Analysis of 3D PEC Targets (2026)Rui Zhu et al.0.00
- Error Analysis of Discrete Flow with Generator Matching (2026)Zhengyan Wan et al.0.00
- HiSpec: Hierarchical Speculative Decoding for LLMs (2026)Avinash Kumar et al.0.00
- Cross-Receiver Generalization for RF Fingerprint Identification via Feature Disentanglement and Adversarial Training (2026)Yuhao Pan et al.0.00
- SWAP: Towards Copyright Auditing of Soft Prompts via Sequential Watermarking (2026)Wenyuan Yang et al.0.00
- Mechanistic Interpretability of Antibody Language Models Using SAEs (2026)Rebonto Haque et al.0.00
- TinyD\'ej\`aVu: Smaller RAM and Faster Inference with Neural Networks on MCUs for Sensor Data Streams (2026)Zhaolan Huang et al.0.00