TY - GEN
T1 - A machine learning approach to predicting the Oscars
AU - Butcher, Sophie
AU - Abbass, Jad
PY - 2026/5/13
Y1 - 2026/5/13
N2 - The Academy Awards, or Oscars, are the film industry's highest honour. Since debuting in 1929, they have become a global phenomenon, reflecting their importance in entertainment. Employing AI to predict the Oscars is a relatively underexplored topic. This paper investigates the application of machine learning techniques for predicting Academy Award (Oscar) winners in six major categories: Best Picture, Best Director, and the four Acting awards using a dataset spanning 1961-2025 and incorporating both established and newly engineered features. Logistic Regression proved most effective for Best Picture, Best Director and Male Acting categories, with Support Vector Machines performing best for Female Acting categories, suggesting non-linear patterns in how female performances are evaluated. Key predictors included Directors Guild of America (DGA) wins, which contributed most to both Picture and Director outcomes, and a novel Best Editing nomination feature, which strongly factored into Best Picture success. Results showed top pick accuracy of 85% (Picture), 92% (Director), and 69% (Male and Female Acting), meeting or exceeding benchmarks from prior research while addressing methodological flaws. Importantly, gender-specific analysis and modelling revealed distinct factors influencing male and female acting awards, highlighting unconscious bias in Academy voting patterns.
AB - The Academy Awards, or Oscars, are the film industry's highest honour. Since debuting in 1929, they have become a global phenomenon, reflecting their importance in entertainment. Employing AI to predict the Oscars is a relatively underexplored topic. This paper investigates the application of machine learning techniques for predicting Academy Award (Oscar) winners in six major categories: Best Picture, Best Director, and the four Acting awards using a dataset spanning 1961-2025 and incorporating both established and newly engineered features. Logistic Regression proved most effective for Best Picture, Best Director and Male Acting categories, with Support Vector Machines performing best for Female Acting categories, suggesting non-linear patterns in how female performances are evaluated. Key predictors included Directors Guild of America (DGA) wins, which contributed most to both Picture and Director outcomes, and a novel Best Editing nomination feature, which strongly factored into Best Picture success. Results showed top pick accuracy of 85% (Picture), 92% (Director), and 69% (Male and Female Acting), meeting or exceeding benchmarks from prior research while addressing methodological flaws. Importantly, gender-specific analysis and modelling revealed distinct factors influencing male and female acting awards, highlighting unconscious bias in Academy voting patterns.
U2 - 10.1109/MPCON69668.2026.11508236
DO - 10.1109/MPCON69668.2026.11508236
M3 - Conference contribution
SN - 9798331593360
T3 - Madhya Pradesh Section Conference (MPCON)
SP - 226
EP - 232
BT - 2026 IEEE Madhya Pradesh Section Conference (MPCON)
PB - Institute of Electrical and Electronics Engineers Inc.
CY - Piscataway, U.S.
T2 - 2026 IEEE Madhya Pradesh Section Conference (MPCON)
Y2 - 14 March 2026 through 15 March 2026
ER -