Artificial Intelligence: breaking ground or repeating the past's mistakes?

academics artificial intelligence computer science

Artificial Intelligence (AI) has become embedded in nearly every aspect of our lives. The purchases we make, the people we virtually connect with, even the mechanisms to unlock our phones (if the phone was made in the last four years) are all influenced by AI. That said, should there be a limit to what parts of our lives AI touches? Moreover, how can we be sure AI systems will behave in the way we would expect? 

How did we get here?

While AI has broadly existed for the last 70 years, machine learning, and more specifically deep learning, has taken off in the last 10 years due to access to larger datasets and faster computational resources. While the goal is not to provide an in-depth discussion of deep learning, there are a couple important properties to note:

  1. Deep learning can be used to create models that are extremely accurate on a large number of diverse tasks, such as image recognition, natural language processing, or network analysis. This has led many to adopt deep learning models in settings where they may not always be appropriate. 
  2. Given deep learning’s complexity, deep learning models are notoriously difficult to understand, i.e., you usually cannot confidently argue why a prediction was made. This becomes extremely debilitating when the application domain necessitates understanding, or impacts human lives.
  3. Even more problematic, deep learning models are often highly susceptible to adversarial attacks -- inputs designed to fool the model into believing the input is something that it is not. Many of these attacks are particularly nefarious as they only require a minor change to cripple the model (think only needing to change ~0.1% of the pixels in an image). This leads to an important insight in regards to deep learning models; often they are simply learning spurious correlation, rather than casual relationships, leading to highly unstable predictions.

These three situations lead to a significant problem in deep learning where complex models are applied to sensitive applications, due to their perceived success, with no ability to understand why a decision is made or if the decision making process is in line with human intuition. This practice raises many red flags in the fields of ethics, especially when a model is directly impacting the life of a human. 

The era of AI fairness  

As the misuse of deep learning has become a realization, AI fairness has emerged as a field to understand and critique the use of models in sensitive settings. Some aspects of AI fairness include establishing new metrics and datasets to better assess model performance, developing new models able to better handle biased data, and ultimately arguing for best practices when moving a model into production. All of this said, one of the most important AI fairness tenets is to use AI for problems that make sense. Let us consider facial recognition to solidify the significance of this last point.

Facial recognition has become extremely prominent in computer vision with the creation of more powerful deep learning models able to take advantage of images. In fact, it has become so popular, many have moved past simple recognition and instead considered directly predicting attributes and properties of individuals. Some of the properties considered have included an individual’s sexual orientation, as well as a person’s likelihood to commit a crime. Without mechanisms to properly determine how a decision is made, we often see that models are taking advantage of spurious, unethical, and sometimes blatantly wrong facial attributes to make decisions when post-hoc analysis is performed in response to a harmful prediction. That said, whether the model picks up particular facial attributes or not, it is important to take a step back and consider an important question: “Why would I believe facial features to be indicative of these properties?” 

Repeating the harmful mistakes of the past

The belief that physical attributes are indicative of non-physical properties, such as being more likely to commit crime, alludes to a dark point in history where scientific racism worked to differentiate based on observable characteristics. In modern 21st century science, notions of inherent inferiority based on skin tone, eye shape, cranial size, etc., have largely been relegated to late 1800s pseudo-scientists. If this is the case though, why do deep learning applications creep up that seem to be based on similar hypotheses? One issue stems from the black-box nature of deep learning which has allowed the view of data objectivity, and the model simply extracts insights from that data, to absolve the model creators of harm. However, to make this argument fails to recognize the historical and systematic biases that dictate many of our data generation processes, such as issues of over-sentencing or redlining. 

I believe deep learning does have the power to change the world, hopefully for the better. I advocate for recognizing that the supposed success of deep learning is not without fault, and these faults can produce significant societal harm if not properly vetted. Continuing to push for research that bridges social science, computer science, and political science in regards to AI is, in my opinion, the only way to safely and fairly integrate deep learning into society.

Donald graduated with a BS in Physics at California Polytechnic Polytechnic State University. His research in machine learning led to a staff research scientist position at Lawrence Livermore National Lab. He's currently pursuing a PhD in Computer Science at the University of Michigan.

Comments

topicTopics
academics study skills medical school admissions MCAT SAT college admissions expository writing strategy English MD/PhD admissions writing LSAT physics GMAT GRE chemistry academic advice graduate admissions biology math law school admissions ACT interview prep language learning test anxiety personal statements premed career advice MBA admissions AP exams homework help test prep creative writing MD mathematics study schedules Common Application computer science summer activities history secondary applications philosophy organic chemistry research economics supplements 1L grammar statistics & probability PSAT admissions coaching dental admissions psychology law legal studies ESL reading comprehension CARS PhD admissions SSAT covid-19 logic games calculus engineering USMLE medical school mentorship Latin Spanish parents AMCAS admissions advice biochemistry case coaching verbal reasoning DAT English literature STEM excel political science skills French Linguistics MBA coursework Tutoring Approaches academic integrity astrophysics chinese classics dental school gap year genetics letters of recommendation mechanical engineering units Anki DO Social Advocacy algebra art history artificial intelligence business careers cell biology data science diversity statement first generation student freewriting geometry graphing kinematics linear algebra mental health presentations quantitative reasoning study abroad tech industry technical interviews time management work and activities 2L AAMC DMD IB exams ISEE MD/PhD programs MMI Sentence Correction adjusting to college algorithms amino acids analysis essay athletics business skills cold emails executive function fellowships finance functions genomics information sessions international students internships logic networking office hours poetry pre-dental proofs resume revising scholarships science social sciences software engineering trigonometry writer's block 3L Academic Interest EMT FlexMed Fourier Series Greek Health Professional Shortage Area Italian JD/MBA admissions Lagrange multipliers London MD vs PhD Montessori National Health Service Corps Pythagorean Theorem Python Shakespeare Step 2 TMDSAS Taylor Series Truss Analysis Zoom acids and bases active learning architecture argumentative writing art art and design schools art portfolios bacteriology bibliographies biomedicine brain teaser burnout campus visits cantonese capacitors capital markets central limit theorem centrifugal force chem/phys chemical engineering chess chromatography class participation climate change clinical experience community service constitutional law consulting cover letters curriculum dementia demonstrated interest dimensional analysis distance learning econometrics electric engineering electricity and magnetism embryology entropy escape velocity evolution extracurriculars fundraising harmonics health policy history of medicine history of science hybrid vehicles hydrophobic effect ideal gas law immunology induction infinite institutional actions integrated reasoning intermolecular forces intern investing investment banking lab reports letter of continued interest linear maps mandarin chinese matrices