Add more options to `ViT` #184

theabhirath · 2022-07-24T09:51:23Z

This modifies the current ViT API to add more options. Notably, there is now an optional prenorm/postnorm toggle. There's also an option to omit class tokens completely, and another to place class tokens before the positional embedding, as in DeiT-III. This makes some other cleanup changes as well. The API is more congested for now, but I thought I'd get this in before I start working on the other ViTs; maybe there's some potential for extracting common code out there.
Needs #174 to land before this makes sense. Also, documentation is pending.
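
For readers unfamiliar with the options described above, here is a minimal, framework-free sketch of what a prenorm/postnorm toggle and the class-token placement choices typically mean. It is written in plain Python for illustration only and is not the Metalhead.jl implementation: the function names and the `prenorm`, `class_token`, and `class_token_first` parameters are hypothetical, and the layer norm has no learnable scale or bias.

```python
import math


def layer_norm(x, eps=1e-5):
    """Normalise one token vector to zero mean and roughly unit variance."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def add(a, b):
    """Element-wise sum of two equal-length vectors."""
    return [p + q for p, q in zip(a, b)]


def residual_block(x, sublayer, prenorm=True):
    """Apply `sublayer` with a skip connection around it.

    prenorm=True  computes x + sublayer(norm(x))
    prenorm=False computes norm(x + sublayer(x))
    `prenorm` is a hypothetical flag name used for this sketch.
    """
    if prenorm:
        return add(x, sublayer(layer_norm(x)))
    return layer_norm(add(x, sublayer(x)))


def embed_tokens(patches, pos_embed, class_token=None, class_token_first=True):
    """Combine patch tokens, positional embeddings and an optional class token.

    class_token=None         : no class token at all.
    class_token_first=True   : prepend the class token, then add positional
                               embeddings to every token (needs n + 1 entries).
    class_token_first=False  : add positional embeddings to the patches only,
                               then prepend the class token (needs n entries).
    Both keyword names are hypothetical.
    """
    if class_token is None:
        tokens, prefix = patches, []
    elif class_token_first:
        tokens, prefix = [class_token] + patches, []
    else:
        tokens, prefix = patches, [class_token]
    if len(pos_embed) != len(tokens):
        raise ValueError("pos_embed needs one entry per embedded token")
    return prefix + [add(t, e) for t, e in zip(tokens, pos_embed)]
```

With a sublayer that returns zeros, the prenorm block leaves its input unchanged while the postnorm block returns the normalised input, which makes the difference between the two orderings easy to see. The class-token flag only changes whether the class token receives a positional embedding of its own.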

zsz00 · 2023-01-08T12:49:18Z

What's the status now?

theabhirath · 2023-01-08T16:14:56Z

I've had less time to work on this recently, but I'm going to try to push some of these refactors through in the next few months. However, there's also work happening on the attention implementations around the Flux ecosystem, and I suspect any changes to the ViT will wait on that work to land.

theabhirath and others added 30 commits June 27, 2022 06:38

- Add DropBlock (cd0edef)
- Initial commit for new ResNet API (271b430)
- Cleanup (866dbcc)
- Get some stuff to work (a038ff8): 1. Some docs 2. Basic tests for ResNet and ResNeXt now pass
- Tweaks - I (de079bc)
- Make pretrain condition explicit (4fa28d4)
- More declarative interface for ResNet (7846f8b): 1. Less keywords for the user to worry about 2. Delete `ResNeXt` just for now
- Make DropBlock really work (a1d5ddc)
- Construct the stem outside and pass it into resnet (3be1d81): `downsample_args` is actually redundant
- Add ResNeXt back (16cbcd0): Also add tests. A lot of tests
- Enable CI for Windows (e5294ec)
- Add more general implementation of SE layer (a439bdf): Also 1. Tweaks - II : Formatting + some docs 2. Groundwork for abstracting out the classifier
- Tweaks III + Some more docs (441ade8): 1. Reorganise layer imports for easy access 2. Get pooling to work
- Fix DropBlock on the GPU (5d059f5)
- Add SEResNet and SEResNeXt (226e96a): So much GC, might as well have a function for it
- More docs, more tweaks (3a4ffbf)
- More aggressive GC (2f755cf): Co-authored-by: Brian Chen
- Tweaks don't stop (5ba4b84): Neither does formatting, unfortunately. Also refactor `classifier` to separate out FC-layer creation and pooling
- Reorganisation and formatting (aaf2abb): It really does never stop. Co-Authored-By: Kyle Daruwalla
- Refactor shortcut connections (326f36c)
- Generalise resnet further (4e01443)
- Documentation (e8d3488): And make `downsample_opts` a smidge easier to work with. Also, a wee bit o' formatting and cleanup.
- Add classifier and backbone methods (92ed4fa)
- Refactor of resnet core (96a7d31)
- Add DropBlock (9540299)
- Initial commit for new ResNet API (588d703)
- Cleanup (2a5d0cc)
- Get some stuff to work (07c1e95): 1. Some docs 2. Basic tests for ResNet and ResNeXt now pass
- Tweaks - I (2e88201)
- Make pretrain condition explicit (01eaa8b)

theabhirath and others added 23 commits July 22, 2022 06:30

- More declarative interface for ResNet (546b131): 1. Less keywords for the user to worry about 2. Delete `ResNeXt` just for now
- Make DropBlock really work (3f45f27)
- Construct the stem outside and pass it into resnet (f373f45): `downsample_args` is actually redundant
- Add ResNeXt back (51d0757): Also add tests. A lot of tests
- Add more general implementation of SE layer (106f260): Also 1. Tweaks - II : Formatting + some docs 2. Groundwork for abstracting out the classifier
- Tweaks III + Some more docs (7147309): 1. Reorganise layer imports for easy access 2. Get pooling to work
- Fix DropBlock on the GPU (7ed20d4)
- Add SEResNet and SEResNeXt (f0051b7): So much GC, might as well have a function for it
- More docs, more tweaks (e5d2295)
- More aggressive GC (4a91fc4): Co-authored-by: Brian Chen
- Tweaks don't stop (cf538bb): Neither does formatting, unfortunately. Also refactor `classifier` to separate out FC-layer creation and pooling
- Reorganisation and formatting (5be45ef): It really does never stop. Co-Authored-By: Kyle Daruwalla
- Refactor shortcut connections (1e509df)
- Generalise resnet further (e4930f1)
- Documentation (80bdcde): And make `downsample_opts` a smidge easier to work with. Also, a wee bit o' formatting and cleanup.
- Add classifier and backbone methods (ab37901)
- Refactor of resnet core (68abbb7)
- Refactor of resnet core II (7ad362b): Closures is the name of the game
- Merge branch 'resnet-plus' of https://github.com/theabhirath/Metalhea… (93fb500): …d.jl into resnet-plus
- Allow prenorm (13ed5ac): Simplify `conv_bn` to `conv_norm` and use it
- Cleanup (6c005d3)
- Reorganisation (bd443f1): And some more formatting
- Add prenorm/postnorm option to ViT (6a91485): Also 1. Make class tokens optional 2. Allow class tokens to be before positional embedding as in DeIT-III

theabhirath marked this pull request as draft July 26, 2022 14:27
