How algorithms changed my perspective on biology

academics algorithms biology
By Will B.

When I first started studying biology, I thought the discipline was mostly about memorizing facts and figures about different organisms and their characteristics. In high school, I was more interested in physics and chemistry, which seemed to involve learning general principles and laws that could be applied to many problems. In other words, I appreciated the elegance and efficiency of the principles of physics, math, and chemistry.

However, during my first year of undergrad, I took a class titled “Bioinformatics Programming,” which was pitched to me as an introductory programming class in Python mixed with biological application. Intrigued, I took a risk with the class. Early on, we learned about the problem of DNA sequence alignment. Our professor explained how scientists use mathematical algorithms to compare different DNA sequences and understand how they evolved over time. For example, the fact that humans and chimpanzees share a substantial portion of their DNA is based on the principles of comparing the billions of DNA bases in our genomes and how they differ from other species because of accumulated insertions, deletions, substitutions, and rearrangements in the DNA sequence over time. 

I found this concept fascinating, and once I learned more about how these comparison algorithms worked, I was amazed at how simple, elegant principles could reveal so much about the relationships between different species. I was finding in biology the thing I loved most about studying math, physics and chemistry. 

When I took a deeper dive into the topic, I found that the problem of sequence alignment was just the tip of the iceberg. I learned that many of the most exciting discoveries in biology are made by applying mathematical and computational methods to understand the underlying principles of life.

Today, the frontier of innovation in artificial intelligence and machine learning is turning to problems in biology, specifically because the application of computational and mathematical methods to the discipline has been so successful in the past two decades. 

Since its founding, the company 23andMe has sold more than 10 million DNA testing kits. After receiving this material, companies like 23andMe implement similar DNA comparison algorithms to assess genetic ancestry and disease risk. There are now more than 100 trillion DNA sequences that have been deposited into GenBank, the US National Institutes of Health public repository for DNA sequencing data of all organisms. Companies such as Google-owned DeepMind and others have used this DNA sequencing data and machine learning algorithms to predict the structures of nearly every single protein encoded by all known DNA sequences. 

Biology and the core principles underlying genetic variation and evolution are increasingly relevant to our daily lives. My decision to become a biologist really began by recognizing as a first year in college how powerful and intriguing the union of computational and mathematical principles with the study of organisms could be. In truth, these principles also helped me learn the subject because it convinced me that I didn’t have to have a photographic memory to gain a deep understanding—I merely needed to learn how to apply broadly-applicable principles to diverse problems.

Will majored in Integrative Sciences, Molecular Biology & Biochemistry, and Science in Society at Wesleyan. After working at Rockefeller University in New York, he is now a PhD student in Biology at MIT.


academics study skills MCAT medical school admissions SAT expository writing college admissions English MD/PhD admissions writing LSAT GMAT strategy GRE physics chemistry math biology graduate admissions ACT law school admissions test anxiety interview prep language learning academic advice MBA admissions premed personal statements homework help career advice AP exams creative writing MD study schedules Common Application summer activities test prep history computer science philosophy organic chemistry secondary applications supplements economics PSAT admissions coaching grammar mathematics research law statistics & probability psychology 1L ESL CARS SSAT covid-19 dental admissions legal studies logic games reading comprehension USMLE engineering Spanish calculus parents Latin verbal reasoning DAT PhD admissions case coaching excel mentorship political science AMCAS French Linguistics MBA coursework Tutoring Approaches academic integrity chinese medical school Anki DO English literature Social Advocacy admissions advice algebra astrophysics biochemistry business classics diversity statement genetics geometry kinematics mental health presentations quantitative reasoning skills study abroad time management work and activities IB exams ISEE MD/PhD programs STEM adjusting to college algorithms art history artificial intelligence athletics business skills careers cold emails data science functions gap year international students internships letters of recommendation logic mechanical engineering poetry resume science social sciences software engineering tech industry technical interviews trigonometry 2L 3L AAMC Academic Interest DMD EMT FlexMed Fourier Series Greek Health Professional Shortage Area Italian Lagrange multipliers London MD vs PhD MMI Montessori National Health Service Corps Pythagorean Theorem Python Sentence Correction Step 2 TMDSAS Taylor Series Zoom acids and bases amino acids analysis essay architecture argumentative writing art art and design schools art portfolios biomedicine brain teaser campus visits cantonese capacitors capital markets cell biology central limit theorem centrifugal force chemical engineering chess chromatography class participation climate change clinical experience community service constitutional law consulting cover letters curriculum dementia demonstrated interest dental school dimensional analysis distance learning electric engineering electricity and magnetism enrichment escape velocity european history executive function finance first generation student freewriting fun facts genomics graphing harmonics health policy history of medicine history of science hybrid vehicles hydrophobic effect ideal gas law induction infinite information sessions institutional actions integrated reasoning intern investing investment banking lab reports linear algebra linear maps mandarin chinese matrices mba medical physics meiosis microeconomics mitosis music music theory neurology neuroscience office hours operating systems organization pedagogy phrase structure rules plagiarism potential energy pre-dental proofs pseudocode psych/soc qualifying exams quantum mechanics relativity