-
optimum Public
Forked from huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
Python Apache License 2.0 Updated Nov 14, 2024 -
TensorRT-Model-Optimizer Public
Forked from NVIDIA/TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…
Python Other Updated Oct 23, 2024 -
libflash_attn Public
Forked from tlc-pack/libflash_attn
Standalone Flash Attention v2 kernel without libtorch dependency
C++ BSD 3-Clause "New" or "Revised" License Updated May 21, 2024 -
Stable-Diffusion-WebUI-OnnxRuntime Public
Forked from microsoft/Stable-Diffusion-WebUI-DirectML
Extension for Automatic1111's Stable Diffusion WebUI, using the OnnxRuntime CUDA execution provider to deliver high-performance results on NVIDIA GPUs.
-
unsloth Public
Forked from unslothai/unsloth
2-5X faster, 70% less memory QLoRA & LoRA finetuning
Python Apache License 2.0 Updated Apr 2, 2024 -
ByteTransformer Public
Forked from bytedance/ByteTransformer
Optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
C++ Apache License 2.0 Updated Mar 15, 2024 -
gdrivedl Public
Forked from matthuisman/gdrivedl
Google Drive Download Python Script
Python GNU General Public License v3.0 Updated Mar 2, 2024 -
onnxruntime Public
Forked from microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License Updated Feb 14, 2024 -
Amuse Public
.NET application for Stable Diffusion. Leveraging OnnxStack, Amuse seamlessly integrates many Stable Diffusion capabilities within the .NET ecosystem.
-
onnx-modifier Public
Forked from ZhangGe6/onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
JavaScript MIT License Updated Dec 17, 2023 -
Faster-Diffusion Public
Forked from hutaiHang/Faster-Diffusion
Python Apache License 2.0 Updated Dec 16, 2023 -
diffusers Public
Forked from huggingface/diffusers
🤗 Diffusers: experiment of diffusion ONNX models
Python Apache License 2.0 Updated Dec 15, 2023 -
DemoFusion Public
Forked from PRIS-CV/DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
Jupyter Notebook Updated Dec 15, 2023 -
segment-anything Public
Forked from OroChippw/segment-anything
ONNX Runtime support for SAM
Jupyter Notebook Apache License 2.0 Updated Jun 30, 2023 -
TensorRT Public
Forked from NVIDIA/TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…
C++ Apache License 2.0 Updated May 3, 2023 -
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Python Apache License 2.0 Updated Jul 11, 2022 -
Open Neural Network Exchange
PureBasic MIT License Updated May 3, 2022 -
inference Public
Forked from mlcommons/inference
Reference implementations of inference benchmarks
Python Apache License 2.0 Updated Sep 24, 2020 -
tutorials Public
Forked from onnx/tutorials
Tutorials for creating and using ONNX models
Jupyter Notebook MIT License Updated May 15, 2020 -
bert Public
Forked from google-research/bert
TensorFlow code and pre-trained models for BERT
Python Apache License 2.0 Updated Jun 11, 2019 -
CNTK Public
Forked from microsoft/CNTK
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
C++ Other Updated Feb 15, 2018