ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술대회 Extraction of Character Areas from Digital Camera Based Color Document Images and OCR System
Cited 3 time in scopus Download 2 time Share share facebook twitter linkedin kakaostory
저자
정연구, 지수영, 배경숙, 김계경, 장대근, 김길천, 최영우
발행일
200508
출처
Optical Information Systems III (SPIE 5908), v.5908, pp.1-12
DOI
https://dx.doi.org/10.1117/12.614174
협약과제
05MI1400, URC를 위한 내장형 컴포넌트 기술개발 및 표준화, 조영조
초록
When document images are obtained from digital cameras, many imaging problems have to be solved for better extraction of characters from the images. Variation of illumination intensity sensitively affects to color values. A simple colored document image could be converted to a monochrome image by a traditional method and then a binarization algorithm is used. But this method is not stably working to the variation of illumination because sensitivity of colors to variation of illumination. For narrowly distributed colors, the conversion is not working well. Secondly, in case that the number of colors is more than two, it is not easy to figure out which color is for character and which others are for background. This paper discusses about an extraction method from a colored document image using a color process algorithm based on characteristics of color features. Variation of intensities and color distribution are used to classify character areas and background areas. A document image is segmented into several color groups and similar color groups are merged. In final step, only two colored groups are left for the character and background. The extracted character areas from the document images are entered into optical character recognition system. This method solves a color problem, which comes from traditional scanner based OCR systems. This paper also describes the OCR system for character conversion of a colored document image. Our method is working for the colored document images of cellular phones and digital cameras in real world.
KSP 제안 키워드
Binarization algorithm, Color features, Color values, Digital camera, Extraction method, Illumination intensity, OCR System, Optical character recognition, Real-world, Recognition System, Traditional methods