Adding ShuffleNet model #258

Open · wants to merge 17 commits into master
Conversation

@RafaelT00 commented Nov 3, 2023

I'm working on this implementation of ShuffleNet from https://arxiv.org/abs/1707.01083.

@RafaelT00 changed the title from "Adding Shuffle" to "Adding ShuffleNet model" Nov 3, 2023
@ToucheSir (Member) left a comment

Thanks for the contribution! This is a great start, and the next steps would be adding tests + better matching the code style of the rest of the repo.

function ChannelShuffle(x::Array{Float32, 4}, g::Int)
width, height, channels, batch = size(x)
channels_per_group = channels÷g
if (channels % g) == 0
@ToucheSir (Member):
Suggested change
if (channels % g) == 0
if channels % g == 0

We have a JuliaFormatter config in this repo, so make sure to run that before pushing your code.
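
For reference, formatting the repository is typically a two-liner from the Julia REPL at the repo root; JuliaFormatter picks up the repo's .JuliaFormatter.toml automatically:

using JuliaFormatter
format(".")   # formats every .jl file under the current directory using the repo config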

@RafaelT00 (Author):
Applied.

- `channels`: number of channels
- `groups`: number of groups
"""
function ChannelShuffle(x::Array{Float32, 4}, g::Int)
@ToucheSir (Member):
Suggested change
function ChannelShuffle(x::Array{Float32, 4}, g::Int)
function channel_shuffle(x::AbstractArray{Float32, 4}, g::Int)

This type constraint is too restrictive. If ChannelShuffle works for all number types, then it should reflect that. Generally, all utility functions in Metalhead need to be GPU-compatible too. The renaming is a suggestion for how to make this function more "Julian", since it's not a callable type (which would be PascalCase) but a plain function. Lastly, how does this handle 3D inputs?
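
For illustration, a rough sketch (not this PR's actual code) of a more generic, GPU-friendly version that accepts any element type and relies only on reshape and permutedims:

function channel_shuffle(x::AbstractArray{T, 4}, g::Int) where {T}
    w, h, c, n = size(x)
    @assert c % g == 0 "number of channels must be divisible by the number of groups"
    # split channels into (channels_per_group, groups), swap the two, and flatten back
    x = reshape(x, w, h, c ÷ g, g, n)
    x = permutedims(x, (1, 2, 4, 3, 5))
    return reshape(x, w, h, c, n)
end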

@RafaelT00 (Author):
I didn't think about that when writing the function, so for 3D inputs (a batch of grayscale images) it would be necessary to artificially create a channel dimension.

Comment on lines 49 to 50
BatchNorm(mid_channels),
NNlib.relu,
@ToucheSir (Member):

Suggested change
BatchNorm(mid_channels),
NNlib.relu,
BatchNorm(mid_channels, relu),

relu is already in scope because of using NNlib, and fusing it into the preceding norm is slightly more efficient. Also, is the activation function not configurable for ShuffleNet?
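
As a sketch of both points (the helper name and signature are assumptions, not this PR's code), the activation can be threaded through as a keyword and fused into the norm layer:

using Flux

function conv_bn(kernel, inplanes, outplanes; activation = relu, kwargs...)
    # BatchNorm(ch, λ) applies λ after normalization, so no standalone relu layer is needed
    return Chain(Conv(kernel, inplanes => outplanes; kwargs...),
                 BatchNorm(outplanes, activation))
end

# e.g. conv_bn((1, 1), 64, 128; groups = 4, pad = SamePad())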

@RafaelT00 (Author):

Changed.

m = Chain(Conv((1,1), in_channels => mid_channels; groups,pad=SamePad()),
BatchNorm(mid_channels),
NNlib.relu,
x -> ChannelShuffle(x, groups),
@ToucheSir (Member):

Suggested change
x -> ChannelShuffle(x, groups),
Base.Fix2(channel_shuffle, groups),

Will be easier on the compiler.
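
A small illustration of the difference (the helper below is a stand-in, not channel_shuffle itself): Base.Fix2(f, y) builds a callable struct with a concrete, reusable type, whereas each anonymous closure gets its own fresh type, which is harder on the compiler inside a Chain.

scale(x, n) = x .* n                # stand-in for channel_shuffle(x, groups)
f_closure = x -> scale(x, 4)        # anonymous function: a new type at every definition site
f_fix2    = Base.Fix2(scale, 4)     # Base.Fix2{typeof(scale), Int}: a concrete, nameable type
f_closure([1, 2, 3]) == f_fix2([1, 2, 3])   # true, both compute scale(x, 4)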

@RafaelT00 (Author):

Changed.

NNlib.relu)

if downsample
m = Parallel((mx, x) -> cat(mx, x, dims=3),m, MeanPool((3,3); pad=SamePad(), stride=2))
@ToucheSir (Member):

Suggested change
m = Parallel((mx, x) -> cat(mx, x, dims=3),m, MeanPool((3,3); pad=SamePad(), stride=2))
m = Parallel(cat_channels, m, MeanPool((3,3); pad=SamePad(), stride=2))

We have cat_channels for this exact case.
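
If it helps, cat_channels is roughly the channel-dimension concatenation written out below (an approximation, not the exact definition), so Parallel can take it directly instead of an anonymous closure:

# roughly what Metalhead's cat_channels does for WHCN arrays
my_cat_channels(xs...) = cat(xs...; dims = 3)

x = rand(Float32, 8, 8, 2, 1)
size(my_cat_channels(x, x))   # (8, 8, 4, 1)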

@RafaelT00 (Author):

Changed.


model = Chain(features...)

return Chain(model, GlobalMeanPool(), Flux.flatten, Dense(in_channels => num_classes))
@ToucheSir (Member):

Suggested change
return Chain(model, GlobalMeanPool(), Flux.flatten, Dense(in_channels => num_classes))
return Chain(model, GlobalMeanPool(), MLUtils.flatten, Dense(in_channels => num_classes))

The general modus operandi of this library has been to create named types for the top-level model and wrap the underlying Chain with them. You can see this pattern in the files for any of the other exported models.

Regarding the suggestion: flatten is only imported into Flux, not defined there. It's preferable to use the symbol from the library that actually defines it when that library is available (which MLUtils is, being a dependency of Metalhead).
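
For illustration, a minimal sketch of the wrapping pattern (type, field, and constructor names are placeholders, not necessarily this PR's final API; the backbone below is a dummy purely to keep the sketch runnable):

using Flux
import MLUtils

struct ShuffleNet
    layers::Chain
end
Flux.@functor ShuffleNet           # register the wrapper so Flux sees the inner parameters

(m::ShuffleNet)(x) = m.layers(x)   # forward pass simply delegates to the wrapped Chain

# keyword constructor builds the Chain and wraps it, as the other exported models do
function ShuffleNet(; inchannels = 3, nclasses = 1000)
    # dummy two-layer backbone; the real model would build the ShuffleNet stages here
    backbone = Chain(Conv((3, 3), inchannels => 16; pad = SamePad()), BatchNorm(16, relu))
    head = Chain(GlobalMeanPool(), MLUtils.flatten, Dense(16 => nclasses))
    return ShuffleNet(Chain(backbone, head))
end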

@RafaelT00 (Author):

Could I see an example? Sorry, I'm still a newbie with Julia. I looked at the rest of the convnets and tried to code in a similar style, but there are still things I haven't fully understood.

@RafaelT00 (Author):

I made the suggested changes.

@ToucheSir (Member):

Thanks for the updates. On a quick skim nothing stands out to me; can you add it to the test suite to finish off the PR?
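
If useful, a rough sketch of the kind of smoke test the other models get (the default constructor and 1000-class output are assumptions; adapt to the test suite's existing groups):

using Test, Flux

@testset "ShuffleNet" begin
    m = ShuffleNet()                           # assumed default constructor from this PR
    x = rand(Float32, 224, 224, 3, 2)          # two 224×224 RGB images in WHCN order
    @test size(m(x)) == (1000, 2)              # assumed 1000 classes per batch element
    @test_nowarn gradient(m -> sum(m(x)), m)   # backward pass runs without error
end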
