cublas
Here are 90 public repositories matching this topic...
SCUDA is a GPU-over-IP bridge that allows GPUs on remote machines to be attached to CPU-only machines.
Updated Jan 8, 2025 - C++
Safe Rust wrapper around the CUDA toolkit
Updated Jan 8, 2025 - Rust
Several optimization methods for half-precision general matrix multiplication (HGEMM) using tensor cores with the WMMA API and MMA PTX instructions.
Updated Sep 8, 2024 - Cuda
A collection of public CUDA, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR and High Performance Computing (HPC) projects.
Updated Jan 7, 2025
Hooks for CUDA-related dynamic libraries, generated with automated code generation tools.
Updated Dec 12, 2023 - C
Deep learning library using the GPU (CUDA/cuBLAS)
Updated Sep 18, 2021 - Elixir
A Deep Learning Framework Written in Rust
Updated Jan 6, 2025 - Rust
Algorithms implemented in CUDA + resources about GPGPU
Updated Jan 18, 2022 - Cuda
Code for benchmarking GPU performance based on cublasSgemm and cublasHgemm
Updated May 20, 2022 - Cuda
Bandicoot: C++ library for GPU linear algebra & scientific computing - https://coot.sourceforge.io
Updated Jul 19, 2023
Companion code for the bilibili video series "Introduction to CUDA 12.x Parallel Programming (C++ Edition)"
Updated Aug 12, 2024 - Cuda
GPU-accelerated fusion of visual odometry and IMU data using an Unscented Kalman Filter (UKF). Written in C++ with CUDA, cuBLAS, and cuSOLVER, it targets real-time state and covariance estimation for robotics and autonomous systems.
Updated Mar 21, 2024 - Cuda