Data science in risk management: value-at-risk and expected shortfall

academics data science economics
By John L.

Have you just started your investment journey and wondered how much to invest? Did you follow promising strategies but still see your account in red? Are you worried about the risk that you cannot foresee in your holdings? If you are nodding along these questions, know that you are not alone. In fact, these are what professionals like quants, traders, and asset managers wonder but fail to solve all the time- one often enters the market with a correct timing at a correct price but walks out with a flat wallet, and worse off, with margin debt. An example was when Ray Dalio’s hedge fund, Bridgewater Associates, lost 12.1billion of investor’s money during the post-COVID19 recovery in 2020. According to the U.S. Securities and Exchange Commission (SEC), 70% of investments are lost to mistreated risks. These risks are often inevitable, yet identifiable with the right statistics techniques. An example using Apple’s stock return is used to illustrate this.

Plotting a Histogram 

Daily stock returns of Apple from January 1st, 2018 to May 21st, 2021 are put on a log-scale for the sake of normalization, and a histogram is plotted to show their distribution. The log-returns seem to range from -0.14 to 0.14, and peak around 0. The distribution seems to be symmetric with moderate spreads on both sides.

Fitting Probability Distribution Models 

Our goal is to find the distribution curve that best fits the histogram. In the graphs below, a Normal Distribution and a Generalized Error Distribution (GED) are fitted. Which distribution curve helps achieve our goal better?

Observantly, one could see that the GED curve fits the histogram better because its line traces the density bars closer throughout all log-return values, particularly at the center of the histogram, when the Normal Distribution curve fails to capture the peak of the histogram.

Having found such a curve means that we have obtained a model function that can simulate Apple’s log-return for the next day, the day after, and so on. For example, according to the GED model, there is around a 7.5% probability that Apple’s investors will have a log-return of either -0.02 or 0.025 on the next trading day.

Finding Value-at-Risk (VaR) and Expected Shortfall (ES)

VaR is a statistic used to quantify the risk of potential losses for a firm or an investment. ES is a statistic used to quantify the average losses for a firm or an investment given the loss is at least the VaR level. Both statistics are widely used by institutions like commercial banks and hedge funds to evaluate market risk and portfolio losses. An example is for a mortgage company to estimate the probability a user defaults on a loan.

To find VaR, we first specify a probability, also known as a risk level (red area), then find the quantile up to which the probability covers from the left tail of the distribution. For example, using the GED curve selected above, we can conclude there is a 25% probability that an investment in Apple will incur a loss of at least 2.5% (-0.025 log-return). Apple’s ES is the average amount at risk, given that its risk exposure is at least 2.5%. This amount may be computed via Monte Carlo simulation, which we will not do here.

Question: After reading Apple’s chart, an investor believes that the company is fundamentally healthy and holds a $25000 position.Using the GED model, what is the probability for this investor to lose at least $1250? (Assume that log-returns represents returns on a 1:1 scale)

Answer: Since $1250 is equivalent to $1250/$25000=5% of the investor’s position in Apple, according to the GED curve above, the probability for the investor to lose at least 5% of his position is represented by the blue shaded area.The exact area can be determined through integration or algorithmic computation, which turns out to be 10%.

Sources

https://www.investopedia.com/terms/v/var.asp

Comments

topicTopics
academics study skills medical school admissions MCAT SAT college admissions expository writing strategy English MD/PhD admissions writing LSAT physics GMAT GRE chemistry graduate admissions biology academic advice math law school admissions ACT interview prep language learning test anxiety personal statements premed career advice MBA admissions AP exams homework help test prep creative writing MD study schedules Common Application computer science mathematics summer activities history secondary applications philosophy organic chemistry research economics supplements 1L grammar statistics & probability PSAT admissions coaching dental admissions psychology law legal studies ESL CARS PhD admissions SSAT covid-19 logic games reading comprehension calculus engineering USMLE medical school mentorship Latin Spanish parents AMCAS admissions advice biochemistry case coaching verbal reasoning DAT English literature STEM excel political science skills French Linguistics MBA coursework Tutoring Approaches academic integrity astrophysics chinese classics dental school gap year genetics letters of recommendation mechanical engineering units Anki DO Social Advocacy algebra art history artificial intelligence business careers cell biology data science diversity statement first generation student freewriting geometry graphing kinematics linear algebra mental health presentations quantitative reasoning study abroad tech industry technical interviews time management work and activities 2L AAMC DMD IB exams ISEE MD/PhD programs MMI Sentence Correction adjusting to college algorithms amino acids analysis essay athletics business skills cold emails executive function fellowships finance functions genomics information sessions international students internships logic networking office hours poetry pre-dental proofs resume revising scholarships science social sciences software engineering trigonometry writer's block 3L Academic Interest EMT FlexMed Fourier Series Greek Health Professional Shortage Area Italian JD/MBA admissions Lagrange multipliers London MD vs PhD Montessori National Health Service Corps Pythagorean Theorem Python Shakespeare Step 2 TMDSAS Taylor Series Truss Analysis Zoom acids and bases active learning architecture argumentative writing art art and design schools art portfolios bacteriology bibliographies biomedicine brain teaser burnout campus visits cantonese capacitors capital markets central limit theorem centrifugal force chem/phys chemical engineering chess chromatography class participation climate change clinical experience community service constitutional law consulting cover letters curriculum dementia demonstrated interest dimensional analysis distance learning econometrics electric engineering electricity and magnetism entropy escape velocity evolution extracurriculars fundraising harmonics health policy history of medicine history of science hybrid vehicles hydrophobic effect ideal gas law immunology induction infinite institutional actions integrated reasoning intermolecular forces intern investing investment banking lab reports letter of continued interest linear maps mandarin chinese matrices mba