      • International Journal of Engineering
      • Volume 33, Issue 7

      Learning Document Image Features With SqueezeNet Convolutional Neural Network

      Author(s)
      Hassanpour, M.; Malek, H.
      File size:
      531.0 KB
      File type (MIME):
      PDF
      Document type
      Text
      Original Article
      Document language
      English
      Abstract
      Classifying document images into their respective classes is an important step towards building a modern digital library or office automation system. Convolutional neural network (CNN) classifiers trained with backpropagation are considered the current state-of-the-art models for this task. However, these classifiers have two major drawbacks: the large computational power needed for training, and their very large number of weights. Previous successful attempts at learning document image features have been based on training very large CNNs. SqueezeNet is a CNN architecture that achieves accuracies comparable to other state-of-the-art CNNs while containing up to 50 times fewer weights, but it has not previously been evaluated on document image classification tasks. In this research we take a novel approach to learning document image features by training a very small CNN, SqueezeNet. We show that an ImageNet-pretrained SqueezeNet achieves an accuracy of approximately 75 percent over 10 classes on the Tobacco-3482 dataset, which is comparable to other state-of-the-art CNNs. We then visualize saliency maps of the gradient of the trained SqueezeNet's output with respect to its input, which show that the network is able to learn meaningful features that are useful for document classification. Previous works in this field have placed no emphasis on visualizing the learned document features. The importance of features such as the existence of handwritten text, document titles, text alignment and tabular structures in the extracted saliency maps shows that the network does not overfit to redundant representations of the rather small Tobacco-3482 dataset, which contains only 3482 document images over 10 classes.
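
      The gradient-based saliency idea mentioned in the abstract can be illustrated on a toy model. The sketch below is not the authors' code and does not use SqueezeNet: it assumes a hypothetical two-layer ReLU classifier with hand-picked weights (`W1`, `W2`) and a hypothetical 2x2 "image", and computes the absolute gradient of the predicted class score with respect to each input pixel by manual backpropagation. In practice this gradient would be obtained from a deep learning framework's automatic differentiation rather than written by hand.

```python
"""Gradient saliency for a tiny two-layer ReLU classifier (illustrative only)."""


def forward(x, w1, w2):
    """Return hidden pre-activations and class scores for input vector x."""
    pre = [sum(wij * xj for wij, xj in zip(row, x)) for row in w1]
    hidden = [max(0.0, p) for p in pre]
    scores = [sum(wkj * hj for wkj, hj in zip(row, hidden)) for row in w2]
    return pre, scores


def saliency(x, w1, w2, target):
    """Absolute gradient of the target class score with respect to each input."""
    pre, _ = forward(x, w1, w2)
    # Backpropagate through the ReLU: only hidden units with a positive
    # pre-activation pass gradient back to the input.
    grad_hidden = [w2[target][j] if pre[j] > 0 else 0.0 for j in range(len(pre))]
    grad_input = [
        sum(grad_hidden[j] * w1[j][i] for j in range(len(w1)))
        for i in range(len(x))
    ]
    return [abs(g) for g in grad_input]


# Hypothetical 2x2 "document image", flattened row by row (intensities in [0, 1]).
image = [0.9, 0.1, 0.8, 0.0]
# Hypothetical weights: 2 hidden units, 2 output classes.
W1 = [[1.0, 0.0, 1.0, 0.0],    # responds to the left column of pixels
      [0.0, -1.0, 0.0, -1.0]]  # inactive (negative pre-activation) for this input
W2 = [[2.0, 1.0],
      [-1.0, 0.5]]

_, scores = forward(image, W1, W2)
predicted = max(range(len(scores)), key=scores.__getitem__)
saliency_map = saliency(image, W1, W2, predicted)
print(predicted, saliency_map)
```

      Pixels with a large saliency value are those where a small change would most affect the predicted class score. Here only the left-column pixels receive nonzero saliency, because they alone feed the active hidden unit.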
      Keywords
      SqueezeNet
      Convolutional neural network
      Document image classification

      Issue number
      7
      Publication date
      2020-07-01
      1399-04-11
      Publisher
      Materials and Energy Research Center
      Author affiliation
      Department of Computer Science Engineering, Shahid Beheshti University, Tehran, Iran
      Department of Computer Science Engineering, Shahid Beheshti University, Tehran, Iran

      ISSN
      1025-2495
      1735-9244
      URI
      https://dx.doi.org/10.5829/ije.2020.33.07a.05
      http://www.ije.ir/article_108484.html
      https://iranjournals.nlai.ir/handle/123456789/337403
