Deep Learning Based Image Aesthetic Quality Assessment- A Review

None, None; None, None; None, None; None, None

Deep Learning Based Image Aesthetic Quality Assessment- A Review

Review (2025)

Author(s)

Maedeh Daryanavard Chounchenani (University Medical Center Groningen)

A. Shahbahrami (TU Delft - Computer Engineering, University of Guilan)

Reza Hassanpour (University Medical Center Groningen)

G. Gaydadjiev (TU Delft - Computer Engineering)

Research Group

Computer Engineering

DOI related publication

https://doi.org/10.1145/3716820

Deep learning Computer vision Image aesthetic Image aesthetic quality assessment

To reference this document use:

https://resolver.tudelft.nl/uuid:cc598b2f-a284-4044-945b-79a535374266

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Computer Engineering

Issue number

7

Volume number

57

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Image Aesthetic Quality Assessment (IAQA) spans applications such as the fashion industry, AI-generated content, product design, and e-commerce. Recent deep learning advancements have been employed to evaluate image aesthetic quality. A few surveys have been conducted on IAQA models; however, details of recent deep learning models and challenges have not been fully mentioned. This article aims to fill these gaps by providing a review of deep learning IAQA over the past decade, based on input, process, and output phases. Methodologies for deep learning-based IAQA can be categorized into general and task-specific approaches, depending on the type and diversity of input images. The processing phase involves considerations related to network architecture, learning structures, and feature extraction methods. The output phase generates results such as scoring, distribution, attributes, and description. Despite achieving a maximum accuracy of 91.5%, further improvements in deep learning models are still required. Our study highlights several challenges, including adapting models for task-specific methodology, accounting for environmental factors influencing aesthetics, the lack of substantial datasets with appropriate labels, imbalanced data, preserving image aspect ratio and integrity in network architecture design, and the need for explainable AI to understand the causative factors behind aesthetic judgments.

Files

3716820.pdf

(pdf | 3.4 Mb)