vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
Next generation LAPACK implementation for ROCm platform
PygmalionAI's large-scale inference engine
HPC solver for nonlinear optimization problems
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.