Skip to content
View gradientwolf's full-sized avatar
💭
Busy in compilation wars
💭
Busy in compilation wars
  • Tokyo
  • 11:58 (UTC -12:00)

Block or report gradientwolf

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

34 repositories

A collection of 150+ surveys on LLMs

343 26 Updated Feb 19, 2025

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,169 568 Updated Aug 22, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,233 356 Updated Dec 22, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,744 463 Updated Oct 14, 2025

Mamba SSM architecture

Python 16,805 1,548 Updated Dec 23, 2025

RNN Encoder-Decoder in PyTorch

Python 45 3 Updated Aug 7, 2024

TensorFlow code and pre-trained models for BERT

Python 39,763 9,708 Updated Jul 23, 2024

Library for fast text representation and classification.

HTML 26,464 4,815 Updated Mar 22, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,046 6,639 Updated Sep 30, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 38,530 4,163 Updated Dec 23, 2025

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,605 4,951 Updated Aug 1, 2024

A natural language modeling framework based on PyTorch

Python 6,316 795 Updated Oct 17, 2022

Language-Agnostic SEntence Representations

Jupyter Notebook 3,660 464 Updated May 2, 2024

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,922 497 Updated Feb 14, 2023

Foundation Architecture for (M)LLMs

Python 3,128 222 Updated Apr 11, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,341 2,136 Updated Dec 18, 2025

An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pdf/2106.04718.pdf

Python 433 37 Updated Aug 17, 2022

LLM inference in C/C++

C++ 91,993 14,246 Updated Dec 25, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,750 270 Updated Jul 18, 2025

PyTorch building blocks for the OLMo ecosystem

Python 620 113 Updated Dec 25, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,066 1,278 Updated Oct 11, 2025
Python 213 20 Updated Dec 23, 2025

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 18,890 4,494 Updated Dec 17, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,084 524 Updated May 5, 2025

A resource repository for machine unlearning in large language models

516 31 Updated Dec 17, 2025

Everything about the SmolLM and SmolVLM family of models

Python 3,499 245 Updated Nov 20, 2025

MCP server for integrating OpenAI's Deep Research APIs and Hugging Face's Open Deep Research with Claude Code and other AI assistants

Python 38 3 Updated Oct 20, 2025

Friends of OLMo and their links.

356 31 Updated Sep 15, 2025