Bill Wang billhhh

Coding

AI research scientist

224 followers · 390 following

UAE
https://huwang01.github.io

Pinned

KRPO_LLMs_RL Public

The code repository for the paper "Kalman Filter Enhanced Group Relative Policy Optimization for Language Model Reasoning".

Python · 12 stars · 1 fork

ShaSpec Public

The official code repository for the ShaSpec model from the CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missing Modality via Shared-Specific Feature Modelling".

Python · 92 stars · 10 forks

TrafficOptim_RL Public

The code repository for the paper "Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline".

Python · 11 stars

MetaKD Public

The code repository for the MetaKD model from the [paper](https://arxiv.org/pdf/2405.07155) "Meta-Learned Modality-Weighted Knowledge Distillation for Robust Multi-Modal Learning with Missing Data".

Python · 9 stars · 3 forks

Rethink-Merge Public

The code repository for the [paper](https://arxiv.org/abs/2411.09263) "Rethinking Weight-Averaged Model-merging".

Python · 4 stars · 2 forks

RDP Public

Code for the IJCAI 2020 paper "Unsupervised Representation Learning by Predicting Random Distances" (https://arxiv.org/abs/1912.12186).

Python · 29 stars · 8 forks