# attention-mechanism

Here are 1,500 public repositories matching this topic...

swarms

This project uses PyTorch to classify bone fractures. In addition to fine-tuning well-known CNN architectures (VGG-19, MobileNetV3, RegNet, among others), we designed our own architecture and also applied Transformer architectures such as Vision Transformer and Swin Transformer. The dataset is Bone Fracture Multi-Region X-ray, available on Kaggle.

  • Updated Jun 12, 2024
  • Jupyter Notebook

RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.

  • Updated Jun 11, 2024
  • Python
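The "RNN that trains like a GPT" claim rests on a recurrence that can also be written as an attention-like weighted sum. Below is a deliberately simplified, single-channel sketch of a WKV-style average with a scalar exponential decay `w`; real RWKV uses learned per-channel decays plus a bonus term for the current token, both omitted here. The point of the sketch is that the O(T²) direct sum and the O(T) recurrent state update compute the same thing.

```python
import math

def wkv_direct(w, k, v):
    # Direct O(T^2) form: each output is a decay-weighted softmax-style
    # average of all values up to time t.
    out = []
    for t in range(len(k)):
        num = sum(math.exp(-(t - i) * w + k[i]) * v[i] for i in range(t + 1))
        den = sum(math.exp(-(t - i) * w + k[i]) for i in range(t + 1))
        out.append(num / den)
    return out

def wkv_recurrent(w, k, v):
    # O(T) recurrence: carry a numerator/denominator state and decay
    # both by e^{-w} at every step — this is the RNN-style inference path.
    out, num, den = [], 0.0, 0.0
    for kt, vt in zip(k, v):
        num = num * math.exp(-w) + math.exp(kt) * vt
        den = den * math.exp(-w) + math.exp(kt)
        out.append(num / den)
    return out
```

Because the two forms agree, training can use the parallel-friendly direct form while inference uses the constant-memory recurrence.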

Researching causal relationships in time series data using Temporal Convolutional Networks (TCNs) combined with attention mechanisms. This approach aims to identify complex temporal interactions. Additionally, we're incorporating uncertainty quantification to enhance the reliability of our causal predictions.

  • Updated Jun 10, 2024
  • Jupyter Notebook
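The TCN side of this approach is built from causal (optionally dilated) 1-D convolutions, where the output at time t depends only on inputs at or before t — the property that lets learned interactions be read as temporally directed. A minimal dependency-free sketch with hand-set weights (the function name and scalar-series framing are illustrative):

```python
def causal_conv1d(x, weights, dilation=1):
    # y[t] mixes x[t], x[t - d], x[t - 2d], ... with implicit left
    # zero-padding, so no output ever depends on a future input.
    out = []
    for t in range(len(x)):
        s = 0.0
        for j, w in enumerate(weights):
            idx = t - j * dilation
            if idx >= 0:
                s += w * x[idx]
        out.append(s)
    return out
```

Stacking such layers with dilations 1, 2, 4, ... grows the receptive field exponentially while keeping every path strictly backward in time.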

QuillGPT is a PyTorch implementation of the GPT decoder block, following the architecture from the "Attention Is All You Need" paper by Vaswani et al. The repository also includes two pre-trained models (Shakespearean GPT and Harpoon GPT), a Streamlit playground, a containerized FastAPI microservice, and training and inference scripts and notebooks.

  • Updated Jun 7, 2024
  • Jupyter Notebook
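The core of the decoder block described here is scaled dot-product attention with a causal mask, per the Attention Is All You Need formulation: softmax over QKᵀ/√d, with each position attending only to itself and earlier positions. A dependency-free sketch over lists of vectors (names are illustrative; a real implementation would be batched and tensorized):

```python
import math

def causal_attention(q, k, v):
    # q, k, v: lists of equal-length vectors, one per position.
    d = len(q[0])
    out = []
    for t in range(len(q)):
        # Scores against positions 0..t only — the causal mask.
        scores = [sum(qi * ki for qi, ki in zip(q[t], k[s])) / math.sqrt(d)
                  for s in range(t + 1)]
        # Numerically stable softmax over the unmasked scores.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        weights = [e / z for e in exps]
        # Output is the attention-weighted mix of values.
        out.append([sum(w * v[s][j] for s, w in enumerate(weights))
                    for j in range(len(v[0]))])
    return out
```

At t = 0 the mask leaves a single weight of 1, so the first output is exactly v[0] — a quick sanity check that no future token leaks in.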
