Andriy Koval

Assistant Professor of Health Management and Informatics

Biography

I am a data scientist with background in quantitative methods and interest in data-driven models of human aging. I subscribe to aspirations of open science and reproducible research.

Read my academic biography.

See my work on github at /andkov

Download my CV

Interests

Graph Making
Longitudinal Modeling
Health Informatics

Education

PhD in Quantitative Methods, 2014

Vanderbilt University
MA in Quantitative Psychology, 2008

Middle Tennessee State University
BSc in Mass Communication, 2005

Middle Tennessee State University

Skills

Where I spend my time

R for Data Science

90%

Graph making

100%

Reproducible Research

100%

Statistical Learning

80%

Experience

Assistant Professor

University of Central Florida

Dec 2018 – Present Orlando, Florida

Research and teaching at the department of Health Management and Informatics

Health System Impact Fellow

Observatory for Population and Public Health (UBC)

Aug 2017 – Dec 2018 Vancouver, BC, Canada

Designing reproducible workflows for suppressing small cells before public release

Postdoctoral Fellow

University of Victoria

Aug 2014 – Aug 2017 Victoria, BC, Canada

Harmonization of longitudinal studies of aging with the IALSA network. Developing analytic workflows for longitudinal modeling.

Featured Publications

2021-03-05 HEPL epidemiology

United States Response to the COVID-19 Pandemic, January-November 2020

The paper tracks the response of US government to the unfolding pandemi of COVID-19

Code DOI

2020-11-10 IPDLN suppress-for-release

Using Reproducible Data Visualisations to Augment Decision-Making During Suppression of Small Counts

Demonstrates using reproducible data visualisations for augmenting redaction decisions during small cell supression and creating documentaion transparent for non-technical audit.

PDF Code Slides Source Document

Andriy Koval, Anthony Leamon, Kate Smolina

2019-05-29 CAHSPR suppress-for-release

Suppressing Small Counts for Public Release

Demonstrates the methods of suppressing small counts in a provincial surveillance system in preparation of data for public release.

Code Poster

Emily C Duggan, Andrea M Piccinin, Sean Clouston, Andriy Koval, Annie Robitalle, Andrea R Zammit, Chenkai Wu, Cassandra L Brown, Lewina O Lee, Deborah Finkel, William H. Beasley, Jeffrey Kaye, Graciela Muniz Terrera, Mindy Katz, Richard B Lipton, Dorly Deeg, David A Bennett, Marcus Praetorius Björk, Boo Johansson, Avron Spiro II, Jennifer Weuve, Scott M Hofer

2019-03-02 J Gerontol A Biol Sci Med Sci coordinated analysis, IALSA-Portland

A Multi-Study Coordinated Meta-Analysis of Pulmonary Function and Cognition in Aging

In this second paper of a two-paper series, we conducted a coordinated analysis and summary meta-analysis of new results on the aging-related dynamics linking pulmonary function and cognitive performance.

PDF Code DOI

2018-03-11 CAHSPR MHSU

Severity and Burden of Mental Health

Visualizing the variability in clinical histories of patients with confirmed diagnosis of (1) schizophrenia and (2) bipolar disorders using cross-continuum clinical records.

PDF

Andriy Koval, Kate Smolina, Scott M. Hofer, Ken Moselle

2018-03-08 CAHSPR Health Informatics, Health Services Research, MHSU

Blueprints for Learning

Describes the tools and framework for identifying, describing, and analyzing person- and cohort- level service utilization in complex health service terrain.

PDF

Andriy Koval

2014-08-05 thesis

A Graphical System for Longitudinal Modeling using Dynamic Documents.Application to NLSY97 Religiosity Data

Proposes a graphical analysis and presentation system for fitting, evaluating, and reporting multilevel longitudinal models.

PDF

Andriy Koval

2008-08-08 thesis

Deriving Cut-off Scores for Significance Testing of NCDIF Indices Within the DFIT Framework Through Item Parameter Replication Method

investigated the properties of DIF indices in the differential functioning of items and tests (DFIT) framework and further tested the power of item parameter replication method (IPR) in identifying biased items within the item response theory (IRT) paradigm.

PDF

Andriy Koval

2004-12-05 thesis

Influence of Brand Advertising on the Nature of Interpersonal Communication in Ukrainian Society

Explores the history of brand advertising and its implementation in the emergence market economy of post-soviet Ukraine, specifically the effects of western marketing campaigns on the fabric of interpersonal relations and communication.

PDF

Projects

Gallery

Talks

Recent and Upcoming Events

Andriy Koval

2019-11-26 00:00

Managing Data Analysis with RStudio

The workshop introduces R and RStudio and makes the case for project-oriented workflows for applied data analysis. Using logistic regression on Titanic data as an example, the participants will learn to communicate statistical findings more effectively, and will evaluate the advantages of using computational notebooks in RStudio to disseminate the results

PDF Code Slides

Andriy Koval

2019-11-06 00:00

Implementing Reproducible Visualizations

Visualising results of statistical modeling is a key component of data science workflow. Statistical graphs often is the best means to explain and promote research findings. However,in order to find that one graph that tells the story worth sharing, we sometimes have to try out and sift through many data visualizations. How should we approach such a task? What can we do to make it easier from both production and evaluation perspectives?

PDF Code Dataset Project Slides

Andriy Koval, Ken Moselle

2018-11-28 00:00

Transactional Data of Island Health

Describing the application of Clinical Context Coding Scheme to stratification of Mental Health and Substance Use services on Vancouver Island from 2007 to 2017

PDF Project Slides

Andriy Koval

2018-11-01 16:00

Visualizing Logistic Regression

Visualising results of statistical modeling is a key component of data science workflow. Statistical graphs are often the best means to explain and promote research findings. However, in order to find that one graph that tells the story worth sharing, we sometimes have to try out and sift through many data visualizations. How should we approach such a task? What can we do to make it easier from both production and evaluation perspectives?

PDF Code Dataset Project Slides Video

Andriy Koval

2018-10-31 15:00

When notebooks are not enough

Abstract While computational notebooks offer scientists and engineers many helpful features, the limitations of this medium make it but a starting point in creating software - the practical goal of data science. Where do we go from computational notebooks if our projects require multiple interconnected scripts and dynamic documents? How do we ensure reproducibility amidst growing complexity of analyses and operations? I will use a concrete analytical example to demonstrate how constructing workflows for reproducible analyses can serve as the next step from computational notebooks towards creating an analytical software.

PDF Code Slides

See all talks