i'll try to update this as much as possible. for the most up to date content go here
Ilya 30u30
click here to view the full list
papers
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding |
- Case-Based Reasoning for Natural Language Queries over Knowledge Bases |
- KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning |
- NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models |
- Dense Passage Retrieval for Open-Domain Question Answering |
- Retrieval-Augmented Generation for AI-Generated Content: A Survey write notes Corrective Retrieval Augmented Generation |
- TIARA: Multi-grained Retrieval for Robust Question Answering over Large Knowledge Bases |
- Retrieval-Enhanced Generative Model for Large-Scale Knowledge Graph |
great resources
Essay from Dario - Founder of Antrhopic
Chunking Strategies for LLM Applications - this has been helpful for identifying a strategy to accurately chunk text for a retriever
What are embeddings? - really…what are they?
Deconstructing RAG - dumbed down