Paper: https://arxiv.org/abs/2410.11840
Efficient scaling laws and collaborative pretraining.
To load the data, use:

```python
from util.read_data import get_data

all_df = get_data()
```

The columns you can expect in the resulting DataFrame are listed in `DATA_AWARE_DF_COLS` and `ARCH_AWARE_DF_COLS` in `read_data.py`.
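As a quick sanity check, the loaded frame can be compared against those column lists. A minimal sketch, assuming the repo root is on `PYTHONPATH` and that `DATA_AWARE_DF_COLS` and `ARCH_AWARE_DF_COLS` are importable collections of column names:

```python
# Minimal sketch: verify the loaded DataFrame has the expected columns.
# Assumes DATA_AWARE_DF_COLS and ARCH_AWARE_DF_COLS are collections of
# column-name strings.
from util.read_data import get_data, DATA_AWARE_DF_COLS, ARCH_AWARE_DF_COLS

all_df = get_data()
expected = set(DATA_AWARE_DF_COLS) | set(ARCH_AWARE_DF_COLS)
missing = expected - set(all_df.columns)
print(f"{len(all_df)} rows; missing columns: {sorted(missing)}")
```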
To restrict the performance data to particular loss types:

```python
# choose losses (the import path below is an assumption; see read_data.py
# for where LossType and get_perf_df actually live)
from util.read_data import LossType, get_perf_df

loss_types = (LossType.PERP, LossType.LOSS)
perf_df = get_perf_df(all_df, loss_types)
```
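The returned frame can then be inspected per loss type. A hedged sketch; the column name `loss_type` is an assumption, not confirmed by the repo:

```python
# Hypothetical inspection of the filtered frame; "loss_type" is an assumed
# column name, check read_data.py for the actual schema.
print(perf_df.groupby("loss_type").size())
```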
If you want an initial fit that makes predictions, use `fit.py`. It provides `fit_per_model()`, which fits a scaling law for each model separately using only the beginning of its training run, and `data_aware_fit()`, which tries to fit a single function to all of the data.
Note that the current functional form being fit is not a reasonable one: it does not account for the model family (e.g., OPT vs. GPT) or for the loss type (e.g., training loss vs. an aggregation over several tasks).
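For intuition, here is a hedged sketch of the kind of per-model fit `fit_per_model()` performs: a saturating power law in training tokens, fit on the early part of each curve. This is not the repo's actual implementation; the functional form, the early-fraction cutoff, and the helper name `fit_one_curve` are illustrative assumptions.

```python
# Illustrative sketch only, NOT the repo's fit_per_model() implementation.
# Fits loss(tokens) = E + A * tokens**(-alpha) on the early part of training.
import numpy as np
from scipy.optimize import curve_fit

def power_law(tokens, E, A, alpha):
    return E + A * np.power(tokens, -alpha)

def fit_one_curve(tokens, losses, frac=0.3):
    """Fit a scaling law using only the first `frac` of the curve (assumed
    setup; the real code's cutoff and parameterization may differ)."""
    n = max(3, int(len(tokens) * frac))
    params, _ = curve_fit(power_law, tokens[:n], losses[:n],
                          p0=(1.5, 10.0, 0.5), maxfev=10_000)
    return params  # (E, A, alpha)
```

Extrapolating with `power_law(larger_token_budget, *params)` then gives the predicted loss beyond the observed prefix of training, which is the basic use case such per-model fits serve.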