The ultimate open-source RAG framework
Updated Jun 12, 2024 - HTML
MTEB: Massive Text Embedding Benchmark
Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.
⚡ Build your chatbot within minutes on your favorite device; offers SOTA compression techniques for LLMs; runs LLMs efficiently on Intel platforms ⚡
Cottontail DB is a column store vector database aimed at multimedia retrieval. It allows for classical boolean as well as vector-space retrieval (nearest neighbour search) used in similarity search using a unified data and query model.
MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.
A real-time indexing and structured extraction engine for unstructured data, used to build generative AI applications
Domain Adapted Language Modeling Toolkit - E2E RAG
Fast, accurate, lightweight Python library for generating state-of-the-art embeddings
Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.
GePI (GEne - Protein Interactions) is a web portal for quick and convenient access to gene - protein interaction mentions automatically extracted from the biomedical literature, i.e. PubMed and PubMed Central (Open Access Subset).
Atmospheric data Community Toolkit - A Python-based toolkit for exploring and analyzing time series atmospheric datasets
All-in-One: Text Embedding, Retrieval, Rerank and RAG
An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers"
Generative Representational Instruction Tuning