Author Archives: Jeff Leek

Creating the field of evidence based data analysis - do people know what a p-value looks like?

In the medical sciences, there is a discipline called "evidence based medicine". The basic idea is to study the actual practice of medicine using experimental techniques. The reason is that while we may have good experimental evidence about specific medicines or … Continue reading

Posted in Uncategorized | 2 Comments

Data science can't be point and click

As data becomes cheaper and cheaper there are more people that want to be able to analyze and interpret that data.  I see more and more that people are creating tools to accommodate folks who aren't trained but who still … Continue reading

Posted in Uncategorized | 4 Comments

The Leek group guide to genomics papers

Leek group guide to genomics papers When I was a student, my advisor, John Storey, made a list of papers for me to read on nights and weekends. That list was incredibly helpful for a couple of reasons. It got me … Continue reading

Posted in Uncategorized | 4 Comments

An economic model for peer review

I saw this tweet the other day: Has anyone applied game theory to the issue of anonymous peer review in academia? — Mick Watson (@BioMickWatson) October 2, 2014 It reminded me that a few years ago I had a paper … Continue reading

Posted in Uncategorized | 1 Comment

The Drake index for academics

I think academic indices are pretty silly; maybe we should introduce so many academic indices that people can't even remember which one is which. There are pretty serious flaws with both citation indices and social media indices that I think … Continue reading

Posted in Uncategorized | 2 Comments

You think P-values are bad? I say show me the data.

Both the scientific community and the popular press are freaking out about reproducibility right now. I think they have good reason to, because even the US Congress is now investigating the transparency of science. It has been driven by the … Continue reading

Posted in Uncategorized | 13 Comments

A non-comprehensive list of awesome female data people on Twitter

I was just talking to a student who mentioned she didn't know Jenny Bryan was on Twitter. She is and she is an awesome person to follow. I also realized that I hadn't seen a good list of women on Twitter … Continue reading

Posted in Uncategorized | 23 Comments

Why the three biggest positive contributions to reproducible research are the iPython Notebook, knitr, and Galaxy

There is a huge amount of interest in reproducible research and replication of results. Part of this is driven by some of the pretty major mistakes in reproducibility we have seen in economics and genomics. This has spurred discussion at a variety … Continue reading

Posted in Uncategorized | 12 Comments

A (very) brief review of published human subjects research conducted with social media companies

As I wrote the other day, more and more human subjects research is being performed by large tech companies. The best way to handle the ethical issues raised by this research is still unclear. The first step is to get some … Continue reading

Posted in Uncategorized | 3 Comments

SwiftKey and Johns Hopkins partner for Data Science Specialization Capstone

I use SwiftKey on my Android phone all the time. So I was super pumped up when they agreed to partner with us on the first Capstone course for the Johns Hopkins Data Science  Specialization to run in October 2014. To enroll … Continue reading

Posted in Uncategorized | 13 Comments