History

HuangHai a72954a90e 'commit'		1 month ago
..
base	'commit'	1 month ago
config	'commit'	1 month ago
data_loader	'commit'	1 month ago
imgs/paper	'commit'	1 month ago
models	'commit'	1 month ago
post_processing	'commit'	1 month ago
test	'commit'	1 month ago
test_tipc	'commit'	1 month ago
tools	'commit'	1 month ago
trainer	'commit'	1 month ago
utils	'commit'	1 month ago
.gitattributes	'commit'	1 month ago
.gitignore	'commit'	1 month ago
LICENSE.md	'commit'	1 month ago
README.MD	'commit'	1 month ago
environment.yml	'commit'	1 month ago
eval.sh	'commit'	1 month ago
generate_lists.sh	'commit'	1 month ago
multi_gpu_train.sh	'commit'	1 month ago
predict.sh	'commit'	1 month ago
requirement.txt	'commit'	1 month ago
single_gpu_train.sh	'commit'	1 month ago

README.MD

Unescape Escape

This file contains ambiguous Unicode characters that may be confused with others in your current locale. If your use case is intentional and legitimate, you can safely ignore this warning. Use the Escape button to highlight these characters.

Real-time Scene Text Detection with Differentiable Binarization

note: some code is inherited from WenmuZhou/DBNet.pytorch

中文解读

update

2020-06-07: 添加灰度图训练，训练灰度图时需要在配置里移除dataset.args.transforms.Normalize

Install Using Conda

conda env create -f environment.yml
git clone https://github.com/WenmuZhou/DBNet.paddle.git
cd DBNet.paddle/

Install Manually

conda create -n dbnet python=3.6
conda activate dbnet

conda install ipython pip

# python dependencies
pip install -r requirement.txt

# clone repo
git clone https://github.com/WenmuZhou/DBNet.paddle.git
cd DBNet.paddle/

Requirements

paddlepaddle 2.4+

Download

TBD

Data Preparation

Training data: prepare a text train.txt in the following format, use '\t' as a separator

./datasets/train/img/001.jpg	./datasets/train/gt/001.txt

Validation data: prepare a text test.txt in the following format, use '\t' as a separator

./datasets/test/img/001.jpg	./datasets/test/gt/001.txt

Store images in the img folder
Store groundtruth in the gt folder

The groundtruth can be .txt files, with the following format:

x1, y1, x2, y2, x3, y3, x4, y4, annotation

Train

config the dataset['train']['dataset'['data_path']',dataset['validate']['dataset'['data_path']in config/icdar2015_resnet18_fpn_DBhead_polyLR.yaml

. single gpu train

bash single_gpu_train.sh

. Multi-gpu training

bash multi_gpu_train.sh

Test

eval.py is used to test model on test dataset

config model_path in eval.sh
use following script to test

bash eval.sh

Predict

predict.py Can be used to inference on all images in a folder

config model_path,input_folder,output_folder in predict.sh
use following script to predict

bash predict.sh

You can change the model_path in the predict.sh file to your model location.

tips: if result is not good, you can change thre in predict.sh

Export Model

export_model.py Can be used to inference on all images in a folder

use following script to export inference model

python tools/export_model.py --config_file config/icdar2015_resnet50_FPN_DBhead_polyLR.yaml -o trainer.resume_checkpoint=model_best.pth trainer.output_dir=output/infer

Paddle Inference infer

infer.py Can be used to inference on all images in a folder