LLM
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
General technology for enabling AI capabilities with LLMs and MLLMs
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
RNN Encoder-Decoder in PyTorch
TensorFlow code and pre-trained models for BERT
Library for fast text representation and classification.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A library for efficient similarity search and clustering of dense vectors.
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
A natural language modeling framework based on PyTorch
Language-Agnostic SEntence Representations
PyTorch original implementation of Cross-lingual Language Model Pretraining.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pdf/2106.04718.pdf
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
PyTorch building blocks for the OLMo ecosystem
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Official code repo for the O'Reilly book "Hands-On Large Language Models"
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
A resource repository for machine unlearning in large language models
Everything about the SmolLM and SmolVLM family of models
MCP server for integrating OpenAI's Deep Research APIs and Hugging Face's Open Deep Research with Claude Code and other AI assistants