jamesliu (James)

Pinned

nanoDPO Public

A nimble and innovative implementation of the Direct Preference Optimization (DPO) algorithm with Causal Transformer and LSTM models, inspired by the DPO paper on fine-tuning unsupervised Languag…

Python 6
nanoPPO Public

An efficient implementation of the Proximal Policy Optimization (PPO) algorithm with linear and attention policies for reinforcement learning.

Python 10
nChain Public

A flexible and efficient implementation for creating LLM bots over extensible datasets.

Python 2
microsoft/autogen Public

A programming framework for agentic AI

Python 51.6k 7.9k
nanoTransformer Public

A PyTorch-based, efficiently implemented Transformer model. The core of our attention mechanisms is powered by torch.einsum, ensuring clean, readable, and highly optimized tensor operat…

Python 2 1
xgboost Public

Forked from dmlc/xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow

C++ 1