FLOW

Upweighting Easy Samples in Fine-tuning Mitigates Forgeting
Sunny Sanyal*, Hayden Prairie*, Rudrajit Das*, Ali Kavis*, Sujay Sanghavi
Paper: https://arxiv.org/abs/2502.02797

Installing Locally

Our language and vision experiments were run in seperate environments, and thus we have two different installations.

Vision Installation

cd vision
conda crate --name flow python=3.9.12
conda activate flow_vision
pip install -r requirements.txt

Language Installation

Run the following script to create the environment necissary to run all of the language model experiments.

conda crate --name flow python=3.10
conda activate flow
pip install -r requirements.txt

To install evaluation functionality, please also run the following:

git clone --depth 1 https://github.com/EleutherAI/lm-evaluation-harness
cd lm-evaluation-harness
pip install -e .

Vision Experiments

The datasets for vision experiments are downloaded using torchvision.datasets, except stanford cars.

FLOW fine-tuning for vision models are performed as follows.

Download a pre-trained model to be fine-tuned.
Perform linear probing on model from step 1, using a target dataset to develop a linear probe (lp) model.
Using the lp model, evaluate the temperature (median lp loss) for a given dataset-model pair.
Re-weight every sample of a target dataset using lp loss and temperature.
Finetune the model using sample-wise weighted loss.

One can finetune ResNet-18/ResNet-50 on 6 image classification datasets based on the following steps.

To run full finetuning, you can run the following script:

bash run_standardfinetune.sh

To linear probe a ImageNet-1K pre-trained model, you can run the following script:

bash run_linearprobing.sh

Next we can evaluate the temperature of the dataset model pair using the following script:

python compute_temp.py --dataset cifar10 --model resnet18 --checkpoint-dir ./checkpoint/linear/resnet18 --loss-save-dir ./logs/ours/train_loss

Finetune the full model with sample-wise weighted loss using the following script:

bash run_flow_round1.sh

Next we finetune only the task specific head with regular loss. This is done using the following script:

bash run_flow_round2.sh

Language Experiments

We have three stages to our language experiment pipeline.

Evaluate the temperature for a given dataset-model pair
Re-weight a dataset with a given temperature
Fine-tune a model with a re-weighted dataset

To evaluate the temperature of a model (once cd into the language folder), you can simply run the following script:

bash scripts/launch_get_temperature.slurm

or if using slurm then:

sbatch scripts/launch_get_temperature.slurm

To re-weight a dataset, you can run the following script with (bash/sbatch):

(bash/sbatch) scripts/launch_weight_dataset.slurm

Finally, to train a model you can run the following script:

(bash/sbatch) scripts/launch_ft_arithmetic.slurm

In each of these scripts, you can set the run configuration at the top of the script. In order to evaluate an run use the following script:

(bash/sbatch) scripts/launch_eval.slurm

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
language		language
vision		vision
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FLOW

Installing Locally

Vision Installation

Language Installation

Vision Experiments

Language Experiments

About

Releases

Packages

Contributors 2

Languages

sanyalsunny111/FLOW_finetuning

Folders and files

Latest commit

History

Repository files navigation

FLOW

Installing Locally

Vision Installation

Language Installation

Vision Experiments

Language Experiments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages