Skip to content
View tianleiwu's full-sized avatar

Block or report tianleiwu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • optimum Public

    Forked from huggingface/optimum

    🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

    Python Apache License 2.0 Updated Nov 14, 2024
  • TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…

    Python Other Updated Oct 23, 2024
  • Standalone Flash Attention v2 kernel without libtorch dependency

    C++ BSD 3-Clause "New" or "Revised" License Updated May 21, 2024
  • Extension for Automatic1111's Stable Diffusion WebUI, using OnnxRuntime CUDA execution provider to deliver high performance result on Nvidia GPU.

    Python 7 MIT License Updated May 9, 2024
  • Test ORT with multiple threading

    C# MIT License Updated Apr 3, 2024
  • unsloth Public

    Forked from unslothai/unsloth

    2-5X faster 70% less memory QLoRA & LoRA finetuning

    Python Apache License 2.0 Updated Apr 2, 2024
  • optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

    C++ Apache License 2.0 Updated Mar 15, 2024
  • gdrivedl Public

    Forked from matthuisman/gdrivedl

    Google Drive Download Python Script

    Python GNU General Public License v3.0 Updated Mar 2, 2024
  • ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

    C++ MIT License Updated Feb 14, 2024
  • Amuse Public

    .NET application for stable diffusion, Leveraging OnnxStack, Amuse seamlessly integrates many StableDiffusion capabilities all within the .NET eco-system

    C# 11 9 Apache License 2.0 Updated Dec 29, 2023
  • A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

    JavaScript MIT License Updated Dec 17, 2023
  • Python Apache License 2.0 Updated Dec 16, 2023
  • diffusers Public

    Forked from huggingface/diffusers

    🤗 Diffusers: experiment of diffusion ONNX models

    Python Apache License 2.0 Updated Dec 15, 2023
  • DemoFusion Public

    Forked from PRIS-CV/DemoFusion

    Let us democratise high-resolution generation! (arXiv 2023)

    Jupyter Notebook Updated Dec 15, 2023
  • ONNX Runtime support for SAM

    Jupyter Notebook Apache License 2.0 Updated Jun 30, 2023
  • TensorRT Public

    Forked from NVIDIA/TensorRT

    NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…

    C++ Apache License 2.0 Updated May 3, 2023
  • 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

    Python Apache License 2.0 Updated Jul 11, 2022
  • onnx Public

    Forked from onnx/onnx

    Open Neural Network Exchange

    PureBasic MIT License Updated May 3, 2022
  • inference Public

    Forked from mlcommons/inference

    Reference implementations of inference benchmarks

    Python Apache License 2.0 Updated Sep 24, 2020
  • tutorials Public

    Forked from onnx/tutorials

    Tutorials for creating and using ONNX models

    Jupyter Notebook MIT License Updated May 15, 2020
  • bert Public

    Forked from google-research/bert

    TensorFlow code and pre-trained models for BERT

    Python Apache License 2.0 Updated Jun 11, 2019
  • CNTK Public

    Forked from microsoft/CNTK

    Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

    C++ Other Updated Feb 15, 2018