Skip to content
View jamesliu's full-sized avatar

Highlights

  • Pro

Block or report jamesliu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. nanoDPO nanoDPO Public

    A nimble and innovative implementation of the Direct Preference Optimization (DPO) algorithm with Causal Transformer and LSTM model, inspired by the paper of DPO in fine-tuning unsupervised Languag…

    Python 6

  2. nanoPPO nanoPPO Public

    An efficient implementation of the Proximal Policy Optimization (PPO) algorithm with linear and attention policy for reinforcement learning.

    Python 10

  3. nChain nChain Public

    a flexible and efficient implementation to create LLM bots over extensible dataset.

    Python 2

  4. microsoft/autogen microsoft/autogen Public

    A programming framework for agentic AI

    Python 51.6k 7.9k

  5. nanoTransformer nanoTransformer Public

    A PyTorch-based featuring an efficiently implemented Transformer model. The core of our attention mechanisms is powered by torch.einsum, ensuring clean, readable, and highly optimized tensor operat…

    Python 2 1

  6. xgboost xgboost Public

    Forked from dmlc/xgboost

    Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow

    C++ 1