# retrieval

Here are 304 public repositories matching this topic...

SciPhi-AI / R2R

The ultimate open-source RAG framework

search pdf machine-learning ocr deep-learning retrieval chatbot artificial-intelligence question-answering data-pipelines retrieval-systems large-language-models llm langchain llama-index retrieval-augmented-generation

Updated Jun 12, 2024
HTML

embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark

benchmark information-retrieval retrieval text-classification clustering sts semantic-search reranking text-embedding sgpt neural-search sentence-transformers sbert multilingual-nlp bitext-mining

Updated Jun 12, 2024
Python

palladian / palladian

Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.

text-mining retrieval information-extraction classification

Updated Jun 12, 2024
Java

intel / intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

retrieval chatbot rag habana large-language-model chatpdf llm-inference 4-bits speculative-decoding llm-cpu streamingllm intel-optimized-llamacpp neural-chat neural-chat-7b autoround gaudi3

Updated Jun 12, 2024
Python

vitrivr / cottontaildb

Cottontail DB is a column store vector database aimed at multimedia retrieval. It allows for classical boolean as well as vector-space retrieval (nearest neighbour search) used in similarity search using a unified data and query model.

database multimedia retrieval similarity-search multimedia-retrieval vector-database vector-search-engine cottontail-db vector-space-retrieval knearest-neighbours-lookup cottontaildb embedding-similarity

Updated Jun 12, 2024
Kotlin

gentaiscool / miners

MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.

multilingual nlp benchmark machine-learning deep-learning retrieval ml transformers efficient miner classification generation language-model miners deep-learning-models sentence-transformers semantic-retrieval large-language-models llm

Updated Jun 12, 2024
Python

tensorlakeai / indexify

A realtime indexing and structured extraction engine for Unstructured Data to build Generative AI Applications

machine-learning retrieval llm

Updated Jun 12, 2024
Rust

apache / lucenenet

Apache Lucene.NET

search query analysis retrieval text information apache lucene index hacktoberfest lucenenet

Updated Jun 11, 2024
C#

arcee-ai / DALM

Domain Adapted Language Modeling Toolkit - E2E RAG

retrieval language-model llm retrieval-augmented-generation

Updated Jun 11, 2024
Python

qdrant / fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

retrieval embeddings openai rag vector-search retrieval-augmented-generation

Updated Jun 11, 2024
Python

wi2trier / cbrkit

Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.

nlp api cli library tool retrieval similarity cbr case-based-reasoning

Updated Jun 11, 2024
Python

JULIELab / gepi

GePI (GEne - Protein Interactions) is a web portal for quick and convenient access to gene - protein interaction mentions automatically extracted from the biomedical literature, i.e. PubMed and PubMed Central (Open Access Subset).

retrieval interactions molecular ppi bionlp webapplication

Updated Jun 11, 2024
JavaScript

ARM-DOE / ACT

Atmospheric data Community Toolkit - A Python-based toolkit for exploring and analyzing time series atmospheric datasets

visualization time-series retrieval meteorology atmospheric-science corrections meteorological-data

Updated Jun 10, 2024
Python

NewStrangeWorlds / BeAR

A Bayesian Nested-Sampling Retrieval Code

retrieval cuda exoplanets

Updated Jun 10, 2024
C

LongxingTan / open-retrievals

All-in-One: Text Embedding, Retrieval, Rerank and RAG

nlp information-retrieval retrieval semantic-search triplet-loss contrastive-loss rag text-embeddings dense-retrieval tranformers llm retrieval-augmented-generation rerank

Updated Jun 12, 2024
Python

Chirayu-Tripathi / MongoDB-Querifier

Improving LLMs' MongoDB query generation capability with the help of advanced retrieval-augmented generation.

database mongodb retrieval rag llms retrieval-augmented-generation

Updated Jun 9, 2024
Jupyter Notebook

louisbrulenaudet / tax-retrieval-benchmark

An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.

benchmark information-retrieval retrieval tax embeddings taxation semantic-search fiscal sentence-embeddings stp rag droit sentence-transformers sbert fiscalite retrieval-augmented-generation mteb

Updated Jun 7, 2024
Jupyter Notebook

TIGER-AI-Lab / UniIR

Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers"

retrieval language-model

Updated Jun 7, 2024
Python

ContextualAI / gritlm

Generative Representational Instruction Tuning

information-retrieval retrieval embeddings embedding-models embedding text-embedding sgpt grit sbert llm llms instruction-tuning mteb

Updated Jun 6, 2024
Jupyter Notebook

intel / document-automation

Document Automation Reference Kit

nlp ocr retrieval semantic-search dpr odqa

Updated Jun 5, 2024
Python
