Skip to content

lnsmith54/hyperParam1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

hyperParam1

Here are some of the Caffe files of my recent NRL Technical Report : Smith, Leslie N. "A disciplined approach to neural network hyper-parameters: Part 1 - learning rate, batch size, momentum, and weight decay" arXiv preprint arXiv:1803.09820 (2018). Please read the report for details. The report is based on over a thousand experiments, with each experiment consisting of four runs to determine the mean and standard deviation of the reported results. Due to the large number of experiments, all the files are not included but the basic files that enable replication of the results should be present.

This repository contains two folders: Cifar and Imagenet. By far, most of the experiments were with Cifar and there are results for both Cifar-10 and Cifar-100. In this folder are a few versions of the train.sh script, solver.prototxt file, and the scripts used for modifying and submitting the jobs on my system. While the submission scripts are tailored only for my system/environment, they can provide guidance as to the jobs run. In this folder is the architectures folder that contains the architectures used for the experiments. Also included is a few of the output files in the folder called results. These output files might help in replicating my work.

The Imagenet folder contains 4 folders: architectures, inception, resnet, and results. The resnet and inception prototxt files are in the architectures folder. The inception folder contains my system's submission files, the train.sh scripts, and the solver.prototxt files for running the inception experiments. The resnet folder contains my system's submission files, the train.sh scripts, and the solver.prototxt files for running the resnet experiments. Some of the outputs are contained in the results folder.

The files in this repository are not sufficient to replicate all of the results in the report but they do contain the information that makes replication possible.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages