Real-time Scene Text Detection with Differentiable Binarization

note: some code is inherited from MhLiao/DB

update

2020-06-07: 添加灰度图训练，训练灰度图时需要在配置里移除dataset.args.transforms.Normalize

Install Using Conda

conda env create -f environment.yml
git clone https://github.com/WenmuZhou/DBNet.pytorch.git
cd DBNet.pytorch/

or

Install Manually

conda create -n dbnet python=3.6
conda activate dbnet

conda install ipython pip

# python dependencies
pip install -r requirement.txt

# install PyTorch with cuda-10.1
# Note that you can change the cudatoolkit version to the version you want.
conda install pytorch torchvision cudatoolkit=10.1 -c pytorch

# clone repo
git clone https://github.com/WenmuZhou/DBNet.pytorch.git
cd DBNet.pytorch/

Requirements

pytorch 1.4+
torchvision 0.5+
gcc 4.9+

Download

TBD

Data Preparation

Training data: prepare a text train.txt in the following format, use '\t' as a separator

./datasets/train/img/001.jpg	./datasets/train/gt/001.txt

Validation data: prepare a text test.txt in the following format, use '\t' as a separator

./datasets/test/img/001.jpg	./datasets/test/gt/001.txt

Store images in the img folder
Store groundtruth in the gt folder

The groundtruth can be .txt files, with the following format:

x1, y1, x2, y2, x3, y3, x4, y4, annotation

Train

config the dataset['train']['dataset'['data_path']',dataset['validate']['dataset'['data_path']in config/icdar2015_resnet18_fpn_DBhead_polyLR.yaml

. single gpu train

bash singlel_gpu_train.sh

. Multi-gpu training

bash multi_gpu_train.sh

Test

eval.py is used to test model on test dataset

config model_path in eval.sh
use following script to test

bash eval.sh

Predict

predict.py Can be used to inference on all images in a folder

config model_path,input_folder,output_folder in predict.sh
use following script to predict

bash predict.sh

You can change the model_path in the predict.sh file to your model location.

tips: if result is not good, you can change thre in predict.sh

The project is still under development.

Performance

ICDAR 2015

only train on ICDAR2015 dataset

Method	image size (short size)	learning rate	Precision (%)	Recall (%)	F-measure (%)	FPS
SynthText-Defrom-ResNet-18(paper)	736	0.007	86.8	78.4	82.3	48
ImageNet-resnet18-FPN-DBHead	736	1e-3	87.03	75.06	80.6	43
ImageNet-Defrom-Resnet18-FPN-DBHead	736	1e-3	88.61	73.84	80.56	36
ImageNet-resnet50-FPN-DBHead	736	1e-3	88.06	77.14	82.24	27
ImageNet-resnest50-FPN-DBHead	736	1e-3	88.18	76.27	81.78	27

examples

TBD

todo

mutil gpu training

reference

If this repository helps you，please star it. Thanks.

Name	Name	Last commit message	Last commit date
Latest commit WenmuZhou add res50 Dec 29, 2022 e03acf0 · Dec 29, 2022 History 149 Commits
base	base	先保存模型，再验证	Jun 17, 2020
config	config	add res50	Dec 29, 2022
data_loader	data_loader	去掉()	Jun 16, 2020
datasets	datasets	Rename train.txt to datasets/train.txt	Jan 14, 2020
imgs/paper	imgs/paper	delete db.png	Dec 9, 2019
models	models	重排列	Jun 19, 2020
post_processing	post_processing	1. fix bug in post_p when output type is polygon	Jan 17, 2020
test	test	update .gitignore	Apr 26, 2020
tools	tools	修复eval和predict加载模型的错误	Jul 10, 2020
trainer	trainer	先保存模型，再验证	Jun 17, 2020
utils	utils	删除路径	Jun 19, 2020
.gitattributes	.gitattributes	语言标记为python	Dec 2, 2019
.gitignore	.gitignore	update .gitignore	Apr 26, 2020
LICENSE.md	LICENSE.md	init commit	Nov 29, 2019
README.MD	README.MD	更新readMe	Jul 2, 2020
environment.yml	environment.yml	更新imgaug版本	Jul 9, 2020
eval.sh	eval.sh	fix a bug on postprocessing	Dec 11, 2019
generate_lists.sh	generate_lists.sh	Create generate_lists.sh	Jan 14, 2020
multi_gpu_train.sh	multi_gpu_train.sh	fix a post_processing bug	Dec 9, 2019
predict.sh	predict.sh	修复eval和predict加载模型的错误	Jul 10, 2020
requirement.txt	requirement.txt	更新imgaug版本	Jul 9, 2020
singlel_gpu_train.sh	singlel_gpu_train.sh	fix a post_processing bug	Dec 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-time Scene Text Detection with Differentiable Binarization

update

Install Using Conda

Install Manually

Requirements

Download

Data Preparation

Train

Test

Predict

Performance

ICDAR 2015

examples

todo

reference

About

Releases

Packages

Languages

License

WenmuZhou/DBNet.pytorch

Folders and files

Latest commit

History

Repository files navigation

Real-time Scene Text Detection with Differentiable Binarization

update

Install Using Conda

Install Manually

Requirements

Download

Data Preparation

Train

Test

Predict

Performance

ICDAR 2015

examples

todo

reference

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages