SIAM Undergraduate Research Online
Volume 17
In This Volume
-
Accurately Classifying Out-Of-Distribution Data in Facial Recognition
Published electronically November 13, 2024DOI: 10.1137/24S1649848
Authors
Gianluca Barone (Corresponding author – Rowan University), Aashrit Cunchala (University of Pittsburgh), Rudy Nunez (Emory University)
Project Advisors
Nicole Yang (Emory University)
Abstract
Standard classification theory assumes that the distribution of images in the test and training sets are identical. Unfortunately, real-life scenarios typically feature unseen data (“out-of-distribution data”) which is different from data in the training distribution (“in-distribution”). This issue is most prevalent in social justice problems where data from under-represented groups may appear in the test data without representing an equal proportion of the training data. This may result in a model returning confidently wrong decisions and predictions. We are interested in the following question: Can the performance of a neural network improve on facial images of out-of-distribution data when it is trained simultaneously on multiple datasets of in-distribution data? We approach this problem by incorporating the Outlier Exposure model and investigate how the model’s performance changes when other datasets of facial images were implemented. We observe that the accuracy and other metrics of the model can be increased by applying Outlier Exposure, incorporating a trainable weight parameter to increase the machine’s emphasis on outlier images, and by re-weighting the importance of different class labels. We also experimented with whether sorting the images and determining outliers via image features would have more of an effect on the metrics than sorting by average pixel value, and found no conclusive results. Our goal was to make models not only more accurate but also more fair by scanning a more expanded range of images. Utilizing Python and the Pytorch package, we found models utilizing outlier exposure could result in more fair classification.
-
Optimizing Energy Functional in Wave and Heat Equations with Initial Conditions in a Class of Rearrangements
Published electronically October 30, 2024 -
Travelling Waves of the Diffusive Streeter-Phelps Equations with Braun-Berthouex BOD Decay
Published electronically October 24, 2024 -
Machine Learning for Hotel Reservation Prediction
Published electronically October 17, 2024 -
Mixed Precision MINRES
Published electronically October 4, 2024 -
Predicting Market Share: Application of Alternative Lotka-Volterra Competition Model
Published electronically September 23, 2024 -
Mean Imputation and Stochastic Coordinate Descent for Linear Systems with Missing Data
Published electronically August 29, 2024 -
MCRAGE: Synthetic Healthcare Data for Fairness
Published electronically August 13, 2024 -
Statistical Methods Applied to Gene Expression Data to Explore Cancer Features
Published electronically June 27, 2024 -
Using a Smartphone Accelerometer to Classify Longboarding Motion
Published electronically June 20, 2024 -
Fast & Fair: Efficient Second-Order Robust Optimization for Fairness in Machine Learning
Published electronically June 11, 2024 -
Comparative Evaluation and Refinement of Linear Algebra-Based Camera Calibration Algorithms
Published electronically June 7, 2024 -
Pooling Matrix Designs for Group Testing
Published electronically May 30, 2024 -
Predictive Modeling of H5N1 Bird Flu in United States of America: A 2022-2023 Analysis
Published electronically May 17, 2024 -
Quantification of the Effects of Voter Protocols on the Outcome of Approval Voting
Published electronically April 16, 2024 -
Modeling Renewable Electricity Purchasing for Sustainable Management of Clarkson University's Energy Portfolio
Published electronically April 9, 2024 -
An Infectious Disease Model with Asymptomatic Transmission and Waning Immunity
Published electronically March 29, 2024 -
Predator-Prey Oscillations in a Cellular Automaton of Huffaker's Mite Experiment
Published electronically February 13, 2024 -
How to be #1 in the IOI? A Study on Rating Nations Participating in the International Informatics Olympiad
Published electronically February 9, 2024 -
From Pole to Podium: Adjusting Elo Method to Separate Car and Driver in Formula One Racing
Published electronically February 6, 2024 -
Identifying Priority Areas for Expanding Mental Health Facilities with Mixed Integer Linear Programming
Published electronically January 25, 2024
Become a SIURO Author
Publish your undergraduate research with SIAM to experience all aspects of the peer review process — from submission, review and revision, to publication.
Submit a Paper to SIURO