Llama-2
Emerging12papers using it
2024first seen
Papers using Llama-2 (12)
- Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter LevelsIntraSlice: Towards High-Performance Structural Pruning with Block-Intra PCA for LLMsCompressing LLMs with MoP: Mixture of PrunersLeveraging KV Similarity for Online Structured Pruning in LLMsTRIM: Achieving Extreme Sparsity with Targeted Row-wise Iterative Metric-driven PruningPrecision Where It Matters: A Novel Spike Aware Mixed-Precision
Quantization Strategy for LLaMA-based Language ModelsMaximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMsBeyond One-Size-Fits-All Pruning via Evolutionary Metric Search for Large Language ModelsModel-GLUE: Democratized LLM Scaling for A Large Model Zoo in the WildAttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention
ManipulationDAQ: Density-Aware Post-Training Weight-Only Quantization For LLMsLLM-NEO: Parameter Efficient Knowledge Distillation for Large Language
Models