Statistics; for people #toobusysavinglives to deal with nonsense

statistics .jpg

For the vast majority of humans I’ve met during my time on Earth, statistics is a scary topic, relegated to the brains of nerds. But avoiding this subject altogether may leave you vulnerable to exploitation by the spin doctors who understand how to manipulate a fact into an opinion! Let’s explore simple concepts that can help foster an awareness of what’s real, and what’s conjecture. 

A framework for statistics

Our brains love linear relationships between two variables. For example, the hours spent honing my rockstar guitar skills is (hopefully) positively associated with the amount of money I make busking in Harvard square. For relatively simple, measurable outcomes, statistic offers practical ways to test and iterate towards better results.

But the larger (and more meaningful) the question, the more difficult it is to reduce the problem to just two variables. For example: finding the causal roots behind autism, or depression, or the meteoric rise of demagogue leaders in our free world… these phenomena are too complex to be reliably simplified.

Hence my first point: basic (read: college) statistics is easy. It’s limited in its function and is meant as a tool to understand just a little more than we already know. Expecting beyond this is reaching into the realm of God, or Zeus, or Einstein.

The Line: first step towards statistical wizardry 

And now let’s delve into my favorite building block of statistics.

Many statisticians loves to work with lines because they are:

  1. Easiest to make sense of, both for the statistician and his/her/its audience.
  2. Easiest to create, scrap, and re-create
  3. Actually quite useful.

Furthermore, even if you use more complex, convoluted models, chances are you want to make the final output as linear as possible to reach the widest possible audience. Our lazy brains love digestible lines more than complex curves!

Lingo for the Linear Model

To be fair, y’all may know these terms from middle school. Who knew algebra could come in handy?

Dependent variable: the thing you care about. The outcome that changes in response to your manipulations.

Independent variable: the thing you manipulate to get more or less of the dependent variable. 

Slope: how quickly your desired outcome changes with each change in the manipulated variable. A positive slope means you get more dependent variable by increasing the independent variable; a negative slope means the opposite.

‘r’: a number (from -1 to 1) that expresses how related your variables seem to be. If r is -1 or 1, your two variables can be modeled perfectly using a straight line. If r is 0, your independent variable fails to predict anything about your dependent variable. Real life happens somewhere in between. 

A ‘perfect’ example

Here I remind us all that real-life linear models often cannot prove causality; that is, even if r is close to -1 or 1 and indicates that your model is near-perfect, without strict tests in enclosed systems it’s truly a nightmare to prove that your dependent variable changed because of your independent variable! 

Jeewong_Blog.pngIn the above example, r is close to 1 (near-perfect fit). A misinformed human may conclude that increasing mozzarella cheese consumption in our societies will also reliably spawn more civil engineers! Hmm…

Yeah right. 

Next steps

Want to check out other random correlations? Go here.

Lines are too uninspiring? Check this If you want to explore where your budding obsession with math and data can lead.

Are you interested in working with JC on statistics or on standardized tests?

Contact us!

Want to read more on statistics from our other bloggers?

Data Science and Intuitive Mathematics

Statistics Tutor: The 4 Keys to Conquering Statistics

Comments

topicTopics
academics study skills MCAT medical school admissions SAT college admissions expository writing English MD/PhD admissions strategy writing LSAT GMAT physics GRE chemistry biology math graduate admissions academic advice ACT interview prep law school admissions test anxiety language learning career advice premed MBA admissions personal statements homework help AP exams creative writing MD study schedules test prep computer science Common Application summer activities mathematics history philosophy organic chemistry secondary applications economics supplements research 1L PSAT admissions coaching grammar law psychology statistics & probability legal studies ESL dental admissions CARS SSAT covid-19 logic games reading comprehension engineering USMLE calculus mentorship PhD admissions Spanish parents Latin biochemistry case coaching verbal reasoning DAT English literature STEM excel medical school political science skills AMCAS French Linguistics MBA coursework Tutoring Approaches academic integrity astrophysics chinese genetics letters of recommendation mechanical engineering Anki DO Social Advocacy admissions advice algebra art history artificial intelligence business careers cell biology classics dental school diversity statement gap year geometry kinematics linear algebra mental health presentations quantitative reasoning study abroad tech industry technical interviews time management work and activities 2L DMD IB exams ISEE MD/PhD programs Sentence Correction adjusting to college algorithms amino acids analysis essay athletics business skills cold emails data science finance first generation student functions graphing information sessions international students internships logic networking poetry proofs resume revising science social sciences software engineering trigonometry units writer's block 3L AAMC Academic Interest EMT FlexMed Fourier Series Greek Health Professional Shortage Area Italian Lagrange multipliers London MD vs PhD MMI Montessori National Health Service Corps Pythagorean Theorem Python Shakespeare Step 2 TMDSAS Taylor Series Truss Analysis Zoom acids and bases active learning architecture argumentative writing art art and design schools art portfolios bacteriology bibliographies biomedicine brain teaser campus visits cantonese capacitors capital markets central limit theorem centrifugal force chemical engineering chess chromatography class participation climate change clinical experience community service constitutional law consulting cover letters curriculum dementia demonstrated interest dimensional analysis distance learning econometrics electric engineering electricity and magnetism escape velocity evolution executive function fellowships freewriting genomics harmonics health policy history of medicine history of science hybrid vehicles hydrophobic effect ideal gas law immunology induction infinite institutional actions integrated reasoning intermolecular forces intern investing investment banking lab reports linear maps mandarin chinese matrices mba medical physics meiosis microeconomics mitosis mnemonics music music theory nervous system neurology neuroscience

Related Content