Intelligent Media Processing Laboratory Publication Database: Publication Details

Language	English
Authors	Wei Fan, Koichi Kise, and Masakazu Iwamura
Title	Automatic Character Labeling for Camera Captured Document Images
Proceedings	Proc. 2016 The International Conference on Image Processing (ICIP 2016)
Pages	pp.3284-3288
Location	Phoenix, Arizona, USA
Peer reviewed	Yes
Date	September 2016
Abstract	Character groundtruth for camera captured documents is crucial for training and evaluating advanced OCR algorithms. Manually generating character level groundtruth is a time consuming and costly process. This paper proposes a robust groundtruth generation method based on document retrieval and image registration for camera captured documents. We use an elastic non-rigid alignment method to fit the captured document image which relaxes the flat paper assumption made by conventional solutions. The proposed method allows building very large scale labeled camera captured documents dataset, without any human intervention. We construct a large labeled dataset consisting of 1 million camera captured Chinese character images. Evaluation of samples generated by our approach showed that 99.99% of the images were correctly labeled, even with different distortions specific to cameras such as blur, specularity and perspective distortion.
DOI	10.1109/ICIP.2016.7532967

The following file is available.

0003284.pdf (5.6MB)

BibTeX entry

@InProceedings{Fan2016,
  author =	{Wei Fan and Koichi Kise and Masakazu Iwamura},
  title =	{Automatic Character Labeling for Camera Captured Document Images},
  booktitle =	{Proc. 2016 The International Conference on Image Processing (ICIP 2016)},
  year =	2016,
  month =	sep,
  pages =	{3284--3288},
  DOI =		{10.1109/ICIP.2016.7532967},
  location =	{Phoenix, Arizona, USA}
}
