SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval
Abstract
Retrieval-Augmented Generation (RAG) systems have become pivotal in leveraging vast corpora to generate informed and contextually relevant responses, notably reducing hallucinations in Large Language Models. Despite significant advancements, these systems struggle to efficiently process and retrieve information from large datasets while maintaining a comprehensive understanding of the context. This paper introduces SKETCH, a novel methodology that enhances the RAG retrieval process by integrating semantic text retrieval with knowledge graphs, thereby merging structured and unstructured data for a more holistic comprehension. SKETCH, demonstrates substantial improvements in retrieval performance and maintains superior context integrity compared to traditional methods. Evaluated across four diverse datasets: QuALITY, QASPER, NarrativeQA, and Italian Cuisine-SKETCH consistently outperforms baseline approaches on key RAGAS metrics such as answer_relevancy, faithfulness, context_precision and context_recall. Notably, on the Italian Cuisine dataset, SKETCH achieved an answer relevancy of 0.94 and a context precision of 0.99, representing the highest performance across all evaluated metrics. These results highlight SKETCH's capability in delivering more accurate and contextually relevant responses, setting new benchmarks for future retrieval systems.
Community
The paper introduces SKETCH, a novel methodology that combines semantic chunking and knowledge graphs to enhance Retrieval-Augmented Generation (RAG) systems, achieving superior performance in multi-context retrieval and context-rich responses.
- Methodology: SKETCH innovatively integrates semantic chunking and knowledge graphs, addressing the limitations of traditional RAG systems, including context loss and multi-hop reasoning inefficiencies.
- Performance: Evaluated across diverse datasets, SKETCH consistently outperforms baselines in answer relevancy, context precision, and retrieval accuracy, setting new benchmarks for RAG systems.
- Applications: By enabling holistic comprehension and accurate synthesis of information from both structured and unstructured data, SKETCH demonstrates robust applicability across domains like long-form narratives, scientific papers, and domain-specific texts.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Towards Knowledge Checking in Retrieval-augmented Generation: A Representation Perspective (2024)
- Towards Understanding Retrieval Accuracy and Prompt Quality in RAG Systems (2024)
- Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks (2024)
- Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models (2024)
- Context Awareness Gate For Retrieval Augmented Generation (2024)
- SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation (2024)
- Semantic Tokens in Retrieval Augmented Generation (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper