• ورود به سامانه
      مشاهده مورد 
      •   صفحهٔ اصلی
      • نشریات انگلیسی
      • International Journal of Engineering
      • Volume 38, Issue 10
      • مشاهده مورد
      •   صفحهٔ اصلی
      • نشریات انگلیسی
      • International Journal of Engineering
      • Volume 38, Issue 10
      • مشاهده مورد
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Advanced Multi-Task Learning with Lightweight Networks and Multi-Head Attention for Efficient Facial Attribute Estimation

      (ندگان)پدیدآور
      Rohani, M.Farsi, H.Mohamadzadeh, S.
      Thumbnail
      دریافت مدرک مشاهده
      FullText
      اندازه فایل: 
      1.599 مگابایت
      نوع فايل (MIME): 
      PDF
      نوع مدرک
      Text
      Original Article
      زبان مدرک
      English
      نمایش کامل رکورد
      چکیده
      The rapid advancement of computer vision algorithms demands efficient computational resource utilization for practical applications. This study proposes a novel framework that integrates multi-task learning (MTL) with MobileNetV3-Large networks and multi-head attention (MHA) mechanisms to simultaneously estimate facial attributes, including age, gender, race, and emotions. By employing MHA, the model enhances feature extraction and representation by focusing on multiple regions of the input image, thereby reducing computational complexity while significantly improving accuracy. The Receptive Field Enhanced Multi-Task Cascaded (RFEMTC) technique is utilized for effective preprocessing of the input data. Our methodology is rigorously evaluated on the UTKFace, FairFace, and RAF-DB datasets. We introduce a weighted loss function to balance task contributions, enhancing overall performance. Through refinement of the network architecture by analyzing branching points and optimizing the balance between shared and task-specific layers, our experimental results demonstrate significant improvements: a 7% reduction in parameters, a 3% increase in gender detection accuracy, a 5% improvement in race detection accuracy, and a 6% enhancement in emotion detection accuracy compared to single-task methods. Additionally, our proposed architecture reduces age estimation error by approximately one year on the UTKFace dataset and improves age estimation accuracy on the FairFace dataset by 5% compared to state-of-the-art approaches.
      کلید واژگان
      Facial Attribute Estimation
      convolutional neural network
      Multi-task Learning
      preprocessing
      Multi-Head Attention
      Computer vision

      شماره نشریه
      10
      تاریخ نشر
      2025-10-01
      1404-07-09
      ناشر
      Materials and Energy Research Center
      سازمان پدید آورنده
      Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran
      Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran
      Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran

      شاپا
      1025-2495
      1735-9244
      URI
      https://dx.doi.org/10.5829/ije.2025.38.10a.05
      https://www.ije.ir/article_208536.html
      https://iranjournals.nlai.ir/handle/123456789/1161553

      مرور

      همه جای سامانهپایگاه‌ها و مجموعه‌ها بر اساس تاریخ انتشارپدیدآورانعناوینموضوع‌‌هااین مجموعه بر اساس تاریخ انتشارپدیدآورانعناوینموضوع‌‌ها

      حساب من

      ورود به سامانهثبت نام

      تازه ترین ها

      تازه ترین مدارک
      © کليه حقوق اين سامانه برای سازمان اسناد و کتابخانه ملی ایران محفوظ است
      تماس با ما | ارسال بازخورد
      قدرت یافته توسطسیناوب