Soujanya Poria
13 papers · 634 citations
Most-cited papers
- Red-teaming Large Language Models Using Chain Of Utterances For Safety-alignment2023 · 244 citations
- Language Models Are Homer Simpson! Safety Re-alignment Of Fine-tuned Language Models Through Task Arithmetic2024 · 97 citations
- Understanding The Capabilities And Limitations Of Large Language Models For Cultural Commonsense2024 · 71 citations
- Della-merging: Reducing Interference In Model Merging Through Magnitude-based Sampling2024 · 63 citations
- Consistency Guided Knowledge Retrieval And Denoising In Llms For Zero-shot Document-level Relation Triplet Extraction2024 · 40 citations
Topics