Category: Uncategorized
-
Optimism, Anxiety and Bayesian Reasoning
By Jessica Umoren Have you ever jumped to the worst possible conclusion after seeing a piece of bad news? Maybe you got a strange email from a professor and immediately thought you failed the class, or you felt a small symptom and convinced yourself you had a serious illness. Humans are naturally wired to panic…
-
Autocorrelation in Time Series Analysis
By Scarlett Barge In statistics, autocorrelation refers to the correlation of a variable with itself across time. Rather than examining relationships between different variables, autocorrelation measures whether past values of a series help explain its current behavior. This concept is central to time series analysis and plays an important role in regression diagnostics, forecasting, and…
-
Quick Guide to Non-Linear Models
By Scarlett Barge Many relationships we wish to study are not well captured by classic linear regression. In non-linear models, an increase in X can have different effects depending on the starting value of X. These models allow us to capture curvature, diminishing returns, and other patterns that linear models miss. Polynomial Models Polynomial models…
-
Why Should Data Science Students Learn SAS in 2026?
By Jessica Umoren 5 Reasons SAS Still Matters Although Python and R dominate modern data science education, SAS continues to play a major role in industries where data accuracy, compliance, and long-term analysis are critical. For data science students in 2026, learning SAS can provide a helpful advantage. 1. Strong Foundations in Advanced Statistical Analysis…
-
Why Does SAS Hold a Monopoly on the FDA?
By Jessica Umoren SAS (Statistical Analysis System) is a powerful statistical programming platform made for advanced analytics, especially on large administrative datasets. One of its most prominent and established clients is the U.S. Food and Drug Administration (FDA), a relationship that dates back decades and includes a recent 40-year partnership extension worth $49.9 million to…
-
Understanding Two-Way ANOVA in SAS
By Scarlett Barge A two-way Analysis of Variance (ANOVA) is a statistical test used to examine how two categorical independent variables (IVs) affect a quantitative dependent variable (DV). In addition to assessing the individual (main) effects of each independent variable, a two-way ANOVA also allows us to test whether there is an interaction effect between…
-
Mapping Gerrymandering in R: A Data-Driven Approach
By Jessica Umoren What is Gerrymandering? Gerrymandering is Congress redrawing district lines in order to favor certain political parties, normally for a congressperson or political party to win or maintain a particular district, or to gain more electoral votes for a presidential election. Gerrymandering is very present in today’s political climate, and both political parties…
-
John Snow, Cholera, and the Art of Learning from Imperfect Experiments
By Scarlett Barge Before collecting and analyzing data, we must first ensure the design of the experiment is plausible and valid. Certain hypotheses can be easily tested using a typical lab setting, an experimental group, and a control. But what happens when this is simply not possible? Enter John Snow. Long before dragons and direwolves,…
-
The Power of R: A Visual Journey into Data Analysis
By Oluwanifemi Kayode-Alese Welcome aboard the data express! 🚀 Today, we’re diving into the world of R, a statistical programming language that’s here to make data analysis easy. Buckle up as we explore how to unleash the potential of your datasets, uncovering insights and trends in your dataset that might just blow your mind. Setting…
-
Navigating the Data Highway
by Oluwanifemi Kayode-Alese The assumptions of a regression model can be summarized into an acronym known as LINE therefore we can also call it the LINE test. Where: L stands for Linearity I stands for independence N stands for normality E stands for Equal Variance (Homoscedasticity) In the statistical world, understanding a dataset often…
