ARC
Canonical16papers using it
2024first seen
Papers using ARC (16)
- Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMsData-Free Pruning of Self-Attention Layers in LLMsUncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM SystemsDr.LLM: Dynamic Layer Routing in LLMsTurning the Spell Around: Lightweight Alignment Amplification via Rank-One Safety InjectionUncovering Cross-Linguistic Disparities in LLMs using Sparse AutoencodersText-to-LoRA: Instant Transformer AdaptionFrom Threat to Tool: Leveraging Refusal-Aware Injection Attacks for Safety AlignmentTemporal Self-Rewarding Language Models: Decoupling Chosen-Rejected via
Past-FutureTurning the Spell Around: Lightweight Alignment Amplification via
Rank-One Safety InjectionLLMs Can Get "Brain Rot"!More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety AlignmentOrder Independence With FinetuningORI: O Routing IntelligenceTeuken-7B-Base & Teuken-7B-Instruct: Towards European LLMsLanguage Models are Hidden Reasoners: Unlocking Latent Reasoning
Capabilities via Self-Rewarding