cs.AR
30 papers tagged cs.AR (ordered by heat_score)
Papers
- GauS: Differentiable Scheduling Optimization via Gaussian Reparameterization (2026) Yaohui Cai et al. 0.00
- GrateTile: Efficient Sparse Tensor Tiling for CNN Processing (2020) Yu-Sheng Lin et al.
- Scaling up HBM Efficiency of Top-K SpMV for Approximate Embedding Similarity on FPGAs (2021) Alberto Parravicini et al.
- A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators (2022) Dan Zhang et al.
- G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency (2021) Yongan Zhang et al.
- Accelerating Large-Scale Graph-based Nearest Neighbor Search on a Computational Storage Platform (2022) Ji-Hoon Kim et al.
- PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration (2024) Ahmed F. AbouElhamayed et al.
- Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models (2025) Wenqi Jiang et al.
- Efficient Data Access Paths for Mixed Vector-Relational Search (2024) Viktor Sanca and Anastasia Ailamaki
- HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator (2024) Zhewen Yu et al.
- Efficient and Reliable Vector Similarity Search Using Asymmetric Encoding with NAND-Flash for Many-Class Few-Shot Learning (2024) Hao-Wei Chiang et al.
- Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices (2024) Ali Ganbarov et al.
- Accelerating Retrieval-Augmented Generation (2024) Derrick Quinn et al.
- MEMHD: Memory-Efficient Multi-Centroid Hyperdimensional Computing for Fully-Utilized In-Memory Computing Architectures (2025) Do Yeong Kang et al.
- Harmonia: Enhancing Data Placement and Migration in Hybrid Storage Systems via Multi-Agent Reinforcement Learning (2026) Rakesh Nadig et al.
- CrossNAS: A Cross-Layer Neural Architecture Search Framework for PIM Systems (2025) Md Hasibul Amin et al.
- REIS: A High-Performance and Energy-Efficient Retrieval System with In-Storage Processing (2025) Kangqi Chen et al.
- Clo-HDnn: A 4.66 TFLOPS/W and 3.78 TOPS/W Continual On-Device Learning Accelerator with Energy-efficient Hyperdimensional Computing via Progressive Search (2025) Chang Eun Song et al.
- JSPIM: A Skew-Aware PIM Accelerator for High-Performance Databases Join and Select Operations (2025) Sabiha Tajdari et al.
- DCC: Data-Centric Compilation of Machine Learning Kernels for Processing-In-Memory Architectures (2026) Peiming Yang et al.
- HDDB: Efficient In-Storage SQL Database Search Using Hyperdimensional Computing on Ferroelectric NAND Flash (2025) Quanling Zhao et al.
- CAMformer: Associative Memory is All You Need (2025) Tergel Molom-Ochir et al.
- SpANNS: Optimizing Approximate Nearest Neighbor Search for Sparse Vectors Using Near Memory Processing (2026) Tianqi Zhang et al.
- FaTRQ: Tiered Residual Quantization for LLM Vector Search in Far-Memory-Aware ANNS Systems (2026) Tianqi Zhang et al.
- Physical Analogue Kolmogorov-Arnold Networks based on Reconfigurable Nonlinear-Processing Units (2026) Manuel Escudero et al.
- bsort: A theoretically efficient non-comparison-based sorting algorithm for integer and floating-point numbers (2026) Benjamín Guzmán
- An FPGA Implementation of Displacement Vector Search for Intra Pattern Copy in JPEG XS (2026) Qiyue Chen et al.
- Hardware-Software Co-Design of Scalable, Energy-Efficient Analog Recurrent Computations (2026) Arthur Fyon et al.
- EVA: Accelerating LLM Decoding via an Efficient Vector Quantization Architecture (2026) Bowen Duan et al.
- RouteScan: A Non-Intrusive Approach to Auditing MoE LLMs Safety via Expert Routing Telemetry (2026) Bo Lv et al.