← authors · overview

Xiting Wang

13 papers · 371 citations

Most-cited papers

Uncovering Safety Risks Of Large Language Models Through Concept Activation Vector
2024 · 69 citations
From Instructions To Intrinsic Human Values -- A Survey Of Alignment Goals For Big Models
2023 · 64 citations
Value FULCRA: Mapping Large Language Models To The Multidimensional Spectrum Of Basic Human Values
2023 · 53 citations
RATT: A Thought Structure For Coherent And Correct LLM Reasoning
2024 · 48 citations

Topics

Safety & Alignment Survey Paper Prompting In-Context Learning RAG Model Architecture Evaluation