Skip to content
View XinDongol's full-sized avatar
🏁
Loading...
🏁
Loading...

Block or report XinDongol

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

LLM reads a paper and produce a working prototype

Python 44 18 Updated Dec 30, 2024
Jupyter Notebook 10 Updated Jan 3, 2025

Train, tune, and infer Bamba model

Python 71 11 Updated Jan 8, 2025

Code for Quiet-STaR

Python 689 89 Updated Aug 21, 2024

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker

Python 14,213 1,032 Updated Jan 7, 2025

Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467

Python 274 25 Updated Aug 5, 2023
Python 54 4 Updated Apr 27, 2024

The website for PyTorch

HTML 234 295 Updated Jan 7, 2025

Puzzles for learning Triton, play it with minimal environment configuration!

Python 190 8 Updated Dec 3, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,996 519 Updated Jan 7, 2025

Material for gpu-mode lectures

Jupyter Notebook 3,425 347 Updated Jan 6, 2025
Python 144 10 Updated Dec 11, 2024

Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024

Python 256 19 Updated Jan 7, 2025

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 186 14 Updated Jan 1, 2025

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 18,407 1,021 Updated Jan 8, 2025

Tiny PyTorch library for maintaining a moving average of a collection of parameters.

Python 414 26 Updated Oct 2, 2024

Implementation of the proposed minGRU in Pytorch

Python 265 21 Updated Dec 18, 2024

build ai agents that have the full context, open source, runs locally, developer friendly. 24/7 screen, mic, keyboard recording and control

TypeScript 11,462 743 Updated Jan 8, 2025

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 291 22 Updated Sep 9, 2024
Python 16 2 Updated Dec 2, 2024

Using FlexAttention to compute attention with different masking patterns

Python 40 Updated Sep 22, 2024

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 755 170 Updated Jan 7, 2025

Source code of Telegram for macos on Swift 5.0

Swift 5,102 872 Updated Aug 11, 2024

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 14,121 1,482 Updated Nov 20, 2024

NVIDIA Cosmos Nemotron is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,420 191 Updated Jan 7, 2025

An automated pipeline for evaluating LLMs for role-playing.

Python 150 8 Updated Sep 14, 2024

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 683 40 Updated Apr 10, 2024
Next