Flashcheck: Exploration Of Efficient Evidence Retrieval For Fast Fact-checking
2025 Β· Kevin Nanekhan, Venktesh V, Erik Martin, et al.
Abstract
The advances in digital tools have led to the rampant spread of misinformation. While fact-checking aims to combat this, manual fact-checking is cumbersome and not scalable. It is essential for automated fact-checking to be efficient for aiding in combating misinformation in real-time and at the source. Fact-checking pipelines primarily comprise a knowledge retrieval component which extracts relevant knowledge to fact-check a claim from large knowledge sources like Wikipedia and a verification component. The existing works primarily focus on the fact-verification part rather than evidence retrieval from large data collections, which often face scalability issues for practical applications such as live fact-checking. In this study, we address this gap by exploring various methods for indexing a succinct set of factual statements from large collections like Wikipedia to enhance the retrieval phase of the fact-checking pipeline. We also explore the impact of vector quantization to further
Authors
(none)
Tags
Stats
Related papers
- That Is A Known Lie: Detecting Previously Fact-checked Claims (2020)14.02
- Airwaves At Checkthat! 2025: Retrieving Scientific Sources For Implicit Claims On Social Media With Dual Encoders And Neural Re-ranking (2025)0.00
- Fact Or Facsimile? Evaluating The Factual Robustness Of Modern Retrievers (2025)0.00
- Dimension Vs. Precision: A Comparative Analysis Of Autoencoders And Quantization For Efficient Vector Retrieval On BEIR Scifact (2025)0.00
- Webfaq: A Multilingual Collection Of Natural Q&A Datasets For Dense Retrieval (2025)0.00
- Investigating The Scalability Of Approximate Sparse Retrieval Algorithms To Massive Datasets (2025)5.84
- Vectorsearch: Enhancing Document Retrieval With Semantic Embeddings And Optimized Search (2024)0.00
- Corect: A Framework For Evaluating Embedding Compression Techniques At Scale (2025)0.00