Skip to content
View MiaoLu3's full-sized avatar

Block or report MiaoLu3

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Regularized-Preference-Optimization Regularized-Preference-Optimization Public

    Forked from YSLIU627/Regularized-Preference-Optimization

    Code for: [NeurIPS 2024] Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer

    Python

  2. MEX MEX Public

    Forked from agentification/MEX

    Code for: [NeurIPS 2023] Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration

    Python

  3. YSLIU627/RL-for-Markov-Exchange-Economy YSLIU627/RL-for-Markov-Exchange-Economy Public

    Codes for the ICML 2022 accepted paper: *Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy*.

    Jupyter Notebook 6

  4. Learning-Pruning-Friendly-Networks-via-Frank-Wolfe-One-Shot-Any-Sparsity-and-No-Retraining Learning-Pruning-Friendly-Networks-via-Frank-Wolfe-One-Shot-Any-Sparsity-and-No-Retraining Public

    Code for: [ICLR 2022] Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining

    Python 2 1

  5. RL-SCPO RL-SCPO Public

    Forked from MIRALab-USTC/RL-SCPO

    Code for: [AAAI 2022] Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization

    Python

  6. MiaoLu3.github.io MiaoLu3.github.io Public

    Personal Website

    PostScript 1