LLama-3-8B
Emerging8papers using it
2024first seen
Papers using LLama-3-8B (8)
- Theory-optimal Quantization Based on FlatnessRECAP: A Resource-Efficient Method for Adversarial Prompting in Large Language ModelsSparse Autoencoders Trained on the Same Data Learn Different FeaturesPrune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude CompensationHeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingA Simple Linear Patch Revives Layer-Pruned Large Language ModelsRoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for
Rank AdaptationTODO: Enhancing LLM Alignment with Ternary Preferences