SANet-Keras

An unofficial implementation of SANet for crowd counting, built with Keras==2.2.4 and TF==1.14.0.

Paper:

Original_paper: Cao, X., Wang, Z., Zhao, Y., & Su, F. (2018). Scale Aggregation Network for Accurate and Efficient Crowd Counting. The European Conference on Computer Vision (ECCV), 1–17.

Results now:

On the ShanghaiTech Part B dataset:

Still far from the performance reported in the original paper (MAE 8.4).

MAE	MSE	MAPE	Mean DM Distance
12.41	20.33	0.11	4.942
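For readers who want to reproduce the count-based columns of the table, here is a minimal sketch of how MAE, MSE and MAPE are commonly computed in crowd counting. It assumes that "MSE" follows the usual crowd-counting convention of reporting the root of the mean squared count error, and that MAPE is reported as a fraction; the function name `counting_metrics` is hypothetical and not taken from this repository. "Mean DM Distance" is not defined in this README, so it is left out.

```python
import numpy as np

def counting_metrics(pred_counts, true_counts):
    """Return (MAE, MSE, MAPE) over per-image crowd counts."""
    pred = np.asarray(pred_counts, dtype=np.float64)
    true = np.asarray(true_counts, dtype=np.float64)
    err = pred - true
    mae = np.mean(np.abs(err))
    # Assumption: "MSE" is the root of the mean squared error,
    # as is conventional in crowd-counting papers.
    mse = np.sqrt(np.mean(err ** 2))
    # Assumption: every ground-truth count is non-zero.
    mape = np.mean(np.abs(err) / true)
    return mae, mse, mape

# Per-image counts are the sums of the predicted / ground-truth density maps.
mae, mse, mape = counting_metrics([98.0, 210.0, 45.0], [100.0, 200.0, 50.0])
```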

Dataset:

ShanghaiTech dataset: Dropbox (backup on my personal Google Drive) or Baidu Disk.

Env

conda install cudatoolkit=10.0 cudnn=7.6.5

pip install -r requirements.txt

Training Parameters:

Loss = ssim_loss + L2
Optimizer = Adam(lr=1e-4)
Data augmentation: Flip horizontally.
Patch: No patches; the whole image is the input, and the output density map (DM) has the same shape.
Instance normalization: No IN layers at present, since a network with IN layers is very hard to train and IN layers did not improve the network in my experiments.
Output zeros: The density map output may fade to zeros in more than 95% of random initializations. I tried the initialization method from the original paper, but it didn't work. In the past, when this happened, I just restarted the kernel and re-ran. Now I train the different modules (1-5) separately in the first several epochs to get relatively reasonable weights, which greatly decreases the probability of the zero-output phenomenon. For any other questions, feel free to contact me.
Weights: On SHB, the best weights were obtained at the 292nd epoch (300 epochs in total), and here are the loss records:
Prediction example:
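The horizontal-flip augmentation listed under Training Parameters has to be applied to the image and its density map together, otherwise the target no longer lines up with the input. A minimal NumPy sketch, assuming a channels-last image of shape (H, W, C) and a density map of shape (H, W); the helper name `random_horizontal_flip` and the probability `p` are hypothetical and not taken from this repository:

```python
import numpy as np

def random_horizontal_flip(image, density_map, rng, p=0.5):
    """Flip image and density map left-right together with probability p."""
    if rng.random() < p:
        image = image[:, ::-1]              # reverse the width axis
        density_map = density_map[:, ::-1]  # same flip keeps heads aligned
    return image, density_map

rng = np.random.default_rng(0)
image = np.arange(2 * 3 * 3, dtype=np.float32).reshape(2, 3, 3)
density = np.array([[0.0, 0.5, 0.0],
                    [1.0, 0.0, 0.0]], dtype=np.float32)
flipped_img, flipped_dm = random_horizontal_flip(image, density, rng, p=1.0)
```

Flipping only reorders pixels, so the sum of the density map (the crowd count) is unchanged.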

Run:

Download the dataset.
Data generation: run generate_datasets.ipynb.
Run main.ipynb to train the model and test it.

Abstraction:

Network = encoder + decoder, model plot is here:

Network	encoder	decoder
Composition	scale aggregation module	conv2dTranspose
Usage	extract multi-scale features	generate high resolution density map
Loss:

Loss = ssim_loss + L2
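To make the two loss terms concrete, here is a NumPy sketch of an SSIM-based loss added to a pixel-wise L2 loss on single-channel density maps. It is an illustration under stated assumptions, not the repository's Keras code: the 11x11 Gaussian window with sigma 1.5 and the constants `c1`, `c2` are the common SSIM defaults, the two terms are summed with equal weight as the "ssim_loss + L2" wording under Training Parameters suggests, and all function names are hypothetical.

```python
import numpy as np

def gaussian_kernel_1d(size=11, sigma=1.5):
    """Normalized 1-D Gaussian window (assumed SSIM defaults)."""
    x = np.arange(size) - (size - 1) / 2.0
    k = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return k / k.sum()

def gaussian_filter(img, kernel):
    """Separable 'valid' Gaussian filtering: along rows, then along columns."""
    rows = np.apply_along_axis(lambda r: np.convolve(r, kernel, mode="valid"), 1, img)
    return np.apply_along_axis(lambda c: np.convolve(c, kernel, mode="valid"), 0, rows)

def ssim_loss(pred, true, c1=0.01 ** 2, c2=0.03 ** 2):
    """1 - mean local SSIM between two 2-D maps (each side >= 11 pixels)."""
    k = gaussian_kernel_1d()
    mu_p, mu_t = gaussian_filter(pred, k), gaussian_filter(true, k)
    var_p = gaussian_filter(pred * pred, k) - mu_p ** 2
    var_t = gaussian_filter(true * true, k) - mu_t ** 2
    cov = gaussian_filter(pred * true, k) - mu_p * mu_t
    ssim = ((2 * mu_p * mu_t + c1) * (2 * cov + c2)) / (
        (mu_p ** 2 + mu_t ** 2 + c1) * (var_p + var_t + c2))
    return 1.0 - ssim.mean()

def l2_loss(pred, true):
    """Mean squared pixel-wise difference."""
    return np.mean((pred - true) ** 2)

def total_loss(pred, true):
    # Assumption: equal weighting of the two terms.
    return ssim_loss(pred, true) + l2_loss(pred, true)

rng = np.random.default_rng(0)
target = rng.random((32, 32))
noisy = np.clip(target + 0.1 * rng.standard_normal((32, 32)), 0.0, None)
loss_same = total_loss(target, target)   # identical maps give (near) zero loss
loss_noisy = total_loss(noisy, target)   # perturbed prediction gives a larger loss
```

The L2 term penalizes per-pixel density errors, while the SSIM term compares local structure inside each window, so the sum rewards predictions that match the target both in value and in local pattern.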
Normalization layer:
- Ease the training process;
- Reduce 'statistic shift problem'.
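As noted under Training Parameters, this implementation currently has no IN layers, so the following only illustrates what an instance normalization layer computes. It is a NumPy sketch assuming channels-last feature maps of shape (N, H, W, C); the function name `instance_norm` and the optional `gamma`/`beta` arguments are hypothetical.

```python
import numpy as np

def instance_norm(x, gamma=None, beta=None, eps=1e-5):
    """Normalize each (sample, channel) feature map over its spatial axes."""
    mean = x.mean(axis=(1, 2), keepdims=True)  # per-sample, per-channel mean
    var = x.var(axis=(1, 2), keepdims=True)    # per-sample, per-channel variance
    x_hat = (x - mean) / np.sqrt(var + eps)
    if gamma is not None:
        x_hat = x_hat * gamma  # optional learned per-channel scale
    if beta is not None:
        x_hat = x_hat + beta   # optional learned per-channel shift
    return x_hat

rng = np.random.default_rng(0)
features = rng.normal(loc=5.0, scale=3.0, size=(2, 8, 8, 4))
normalized = instance_norm(features)
```

Unlike batch normalization, the statistics come from each sample alone, so they are computed the same way at training and test time.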

ZhengPeng7/SANet-Keras
