- Sunnyvale, CA
-
13:18
(UTC -08:00) - in/pivovaal
-
raft Public
Forked from rapidsai/raftRAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …
Cuda Apache License 2.0 UpdatedSep 3, 2025 -
RTopK Public
Forked from xiexi51/RTopKOfficial Implementation of "RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs"
Cuda UpdatedJul 23, 2025 -
openxla-xla Public
Forked from openxla/xlaA machine learning compiler for GPUs, CPUs, and ML accelerators
C++ Apache License 2.0 UpdatedJun 9, 2025 -
axlearn Public
Forked from apple/axlearnAn Extensible Deep Learning Library
Python Apache License 2.0 UpdatedMay 22, 2025 -
shardy Public
Forked from openxla/shardyMLIR-based partitioning system
C++ Apache License 2.0 UpdatedFeb 26, 2025 -
-
-
stablehlo Public
Forked from openxla/stablehloBackward compatible ML compute opset inspired by HLO/MHLO
MLIR Apache License 2.0 UpdatedNov 28, 2024 -
jax Public
Forked from jax-ml/jaxComposable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Python Apache License 2.0 UpdatedNov 8, 2024 -
llvm-project Public
Forked from llvm/llvm-projectThe LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
-
ml_dtypes Public
Forked from jax-ml/ml_dtypesA stand-alone implementation of several NumPy dtype extensions used in machine learning.
-
uYouPlus Public
Forked from qnblackcat/uYouPlusuYou+ is a modified version of uYou (made by @MiRO92) with additional features and mainly made for non jailbroken users!
Logos UpdatedJun 1, 2024 -
-
xla Public
Forked from pytorch/xlaEnabling PyTorch on XLA Devices (e.g. Google TPU)
C++ Other UpdatedOct 21, 2023 -
tensorflow Public
Forked from tensorflow/tensorflowAn Open Source Machine Learning Framework for Everyone
C++ Apache License 2.0 UpdatedAug 30, 2023 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
C++ Other UpdatedAug 17, 2023 -
sagemaker-python-sdk Public
Forked from aws/sagemaker-python-sdkA library for training and deploying machine learning models on Amazon SageMaker
Python Apache License 2.0 UpdatedAug 16, 2023 -
AITemplate Public
Forked from facebookincubator/AITemplateAITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Python Apache License 2.0 UpdatedAug 1, 2023 -
hunter Public
Forked from cpp-pm/hunterCMake driven cross-platform package manager for C/C++.
CMake BSD 2-Clause "Simplified" License UpdatedJun 28, 2023 -
-
lldb-mi Public
Forked from lldb-tools/lldb-miLLDB's machine interface driver
C++ Other UpdatedJun 14, 2023 -
-
deploy-stable-diffusion-model-on-amazon-sagemaker-endpoint Public
Forked from aws-samples/deploy-stable-diffusion-model-on-amazon-sagemaker-endpointDeploy Stable Diffusion Model on Amazon SageMaker Endpont
-
-
-
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedMay 17, 2023 -
-
diffusers Public
Forked from huggingface/diffusers🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Python Apache License 2.0 UpdatedMar 21, 2023 -
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedMar 10, 2023 -
addons Public
Forked from tensorflow/addonsUseful extra functionality for TensorFlow 2.x maintained by SIG-addons
Python Apache License 2.0 UpdatedJan 31, 2023



