The importance of data hygiene

New analysis of the ‘mystery’ equid found in 2004 at Pompeii reveals that the initial data was corrupted.  The find and the subsequent DNA analysis generated much excitement about a previously unknown extinct breed of horse, but upon reexamination it was...

Directed searching vs. data-driven research

The Art of Counting project is based on the combination of a custom-built relational database and advanced statistical methods.  The database revolves around a core of variables that are recorded in a binary (yes or no) manner.  In the case of my dissertation project,...

Cluster analysis is NOT scary, I promise

Cluster analysis as defined by a statistician:  A procedure by which subjects, cases, or variables are clustered into groups based on similar characteristics of each.  Hierarchical cluster analysis attempts to identify relatively homogeneous groups of variables (or...
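
To make the definition above a little more concrete, here is a minimal sketch of hierarchical (agglomerative) clustering in plain Python. It is only an illustration, not the procedure used by Art of Counting: the records are invented yes/no data, the function names `hamming` and `single_linkage` are hypothetical, and the choice of single linkage with a simple count of mismatched answers as the distance is an assumption made for brevity.

```python
def hamming(a, b):
    """Count the positions where two yes/no records disagree."""
    return sum(x != y for x, y in zip(a, b))


def single_linkage(records, n_clusters):
    """Merge the two closest clusters repeatedly until n_clusters remain.

    The distance between two clusters is the smallest distance between
    any pair of their members (single linkage). Returns lists of indices
    into `records`.
    """
    if not 1 <= n_clusters <= len(records):
        raise ValueError("n_clusters must be between 1 and len(records)")

    # Start with every record in its own cluster.
    clusters = [[i] for i in range(len(records))]

    while len(clusters) > n_clusters:
        best = None  # (distance, index of first cluster, index of second)
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(
                    hamming(records[a], records[b])
                    for a in clusters[i]
                    for b in clusters[j]
                )
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]

    return clusters


# Invented example: four cases, each scored on four yes/no (1/0) variables.
records = [
    [1, 1, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 0],
]
groups = single_linkage(records, 2)
print(groups)
```

With these invented records, the first two cases differ on only one variable, as do the last two, so each pair ends up in its own group.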

Factor Analysis is NOT scary, I promise

There are two primary advanced statistical techniques that have already been applied to selected data from Medinet Habu—factor analysis and cluster analysis.  This post will endeavor to explain the insights provided by factor analysis in the examination of visual...

Correlation is NOT scary, I promise

Many of the findings that are presented by Art of Counting will require a basic understanding of key statistical techniques.  This will be an exploration into correlation. Correlation, as defined by a statistician: The degree of similarity or difference between two...
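
As a rough illustration of the idea, the sketch below computes a Pearson correlation coefficient by hand for two variables recorded as yes/no (1/0) values. The variable names and the data are invented for the example and do not come from the Art of Counting database; the function name `pearson` is likewise hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of the same length (at least 2)")

    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Sum of co-deviations and of squared deviations from each mean.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)

    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a variable never varies")

    return cov / sqrt(var_x * var_y)


# Invented yes/no observations for six hypothetical figures.
wears_crown = [1, 1, 0, 0, 1, 0]
holds_staff = [1, 1, 0, 0, 1, 0]  # always agrees with wears_crown
is_kneeling = [0, 0, 1, 1, 0, 1]  # always disagrees with wears_crown

print(pearson(wears_crown, holds_staff))
print(pearson(wears_crown, is_kneeling))
```

The coefficient ranges from -1 to 1: two variables that always agree score at the top of that range, and two that always disagree score at the bottom.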