Skip to content
View holarissun's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report holarissun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Prompt-OIRL Prompt-OIRL Public

    code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning

    Python 32 5

  2. PanelGPT PanelGPT Public

    We introduce new zero-shot prompting magic words that improves the reasoning ability of language models: panel discussion!

    Python 130 11

  3. RewardModelingBeyondBradleyTerry RewardModelingBeyondBradleyTerry Public

    8

  4. RewardShifting RewardShifting Public

    Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL

    Python 27 2

  5. YangRui2015/AWGCSL YangRui2015/AWGCSL Public

    Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.

    Python 26 2

  6. PCHID_code PCHID_code Public

    Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics

    Jupyter Notebook 15