Xin (Simon) Dong XinDongol

@NVIDIA Research @harvard CS Ph.D. Working on LLM, Multi-Modal LM, and computing efficiency.

143 followers · 134 following

Harvard University
Cambridge
https://simonxin.com

Starred repositories

phunterlau / paper_without_code

LLM reads a paper and produces a working prototype

Python 44 18 Updated Dec 30, 2024

vicksEmmanuel / latent-gemma

Jupyter Notebook 10 Updated Jan 3, 2025

foundation-model-stack / bamba

Train, tune, and infer Bamba model

Python 71 11 Updated Jan 8, 2025

ezelikman / quiet-star

Code for Quiet-STaR

Python 689 89 Updated Aug 21, 2024

Byaidu / PDFMathTranslate

PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves the layout; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/Docker

Python 14,213 1,032 Updated Jan 7, 2025

jayelm / gisting

Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467

Python 274 25 Updated Aug 5, 2023

JacobPfau / fillerTokens

Python 54 4 Updated Apr 27, 2024

pytorch / pytorch.github.io

The website for PyTorch

HTML 234 295 Updated Jan 7, 2025

da03 / Internalize_CoT_Step_by_Step

Python 132 14 Updated Sep 29, 2024

da03 / implicit_chain_of_thought

Python 113 25 Updated Nov 11, 2024

SiriusNEO / Triton-Puzzles-Lite

Puzzles for learning Triton; play with minimal environment configuration!

Python 190 8 Updated Dec 3, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,996 519 Updated Jan 7, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 3,425 347 Updated Jan 6, 2025

NVlabs / hymba

Python 144 10 Updated Dec 11, 2024

facebookresearch / LayerSkip

Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024

Python 256 19 Updated Jan 7, 2025

jxiw / MambaInLlama

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 186 14 Updated Jan 1, 2025

exo-explore / exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 18,407 1,021 Updated Jan 8, 2025

fadel / pytorch_ema

Tiny PyTorch library for maintaining a moving average of a collection of parameters.

Python 414 26 Updated Oct 2, 2024

lucidrains / minGRU-pytorch

Implementation of the proposed minGRU in PyTorch

Python 265 21 Updated Dec 18, 2024

mediar-ai / screenpipe

Build AI agents that have the full context. Open source, runs locally, developer friendly. 24/7 screen, mic, and keyboard recording and control

TypeScript 11,462 743 Updated Jan 8, 2025

princeton-nlp / AutoCompressors

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 291 22 Updated Sep 9, 2024

namespace-Pt / UltraGist

Python 16 2 Updated Dec 2, 2024

shreyansh26 / Attention-Mask-Patterns

Using FlexAttention to compute attention with different masking patterns

Python 40 Updated Sep 22, 2024

pytorch / kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 755 170 Updated Jan 7, 2025

cloneofsimo / karras-power-ema-tutorial

Python 51 1 Updated Jan 6, 2024

overtake / TelegramSwift

Source code of Telegram for macOS, written in Swift 5.0

Swift 5,102 872 Updated Aug 11, 2024

Zeyi-Lin / HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool. A lightweight AI algorithm for making ID photos.

Python 14,121 1,482 Updated Nov 20, 2024

NVlabs / Cosmos-Nemotron

NVIDIA Cosmos Nemotron is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,420 191 Updated Jan 7, 2025

boson-ai / RPBench-Auto

An automated pipeline for evaluating LLMs for role-playing.

Python 150 8 Updated Sep 14, 2024

tomaarsen / attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 683 40 Updated Apr 10, 2024

Starred topics

Machine learning