Skip to content
View imoneoi's full-sized avatar
🎯
Tuning PPO
🎯
Tuning PPO

Organizations

@OpenOrca @FastEval

Block or report imoneoi

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. openchat openchat Public

    OpenChat: Advancing Open-source Language Models with Imperfect Data

    Python 5.5k 433

  2. multipack multipack Public

    Multipack distributed sampler for fast padding-free training of LLMs

    Python 202 16

  3. EvolvingConnectivity EvolvingConnectivity Public

    Code for paper Evolving Connectivity for Spiking Neural Networks

    Python 23 4

  4. RSP_JAX RSP_JAX Public

    [AAAI'25 Oral] Are Expressive Models Truly Necessary for Offline RL?

    Python 13 4