Publications

Combining convolutional neural networks and lstms for segmentation-free ocr

Abstract

We present a novel end-to-end trainable OCR system combining a CNN for feature extraction with 1-D LSTMs for sequence modeling. We present results on English and Arabic handwriting data, and on English machine print data, showing state-of-the-art performance. We believe that our method is simpler than existing 2D LSTM models, and will make it easier to use techniques borrowed from CNN research in computer vision to improve OCR performance.

Date
November 9, 2017
Authors
Stephen Rawls, Huaigu Cao, Senthil Kumar, Prem Natarajan
Conference
2017 14th IAPR international conference on document analysis and recognition (ICDAR)
Volume
1
Pages
155-160
Publisher
IEEE