John Wilder Tukey (June 16, 1915 - July 26, 2000) was a statistician. Born New Bedford, Massachusetts, Tukey obtained a Bachelor of Science degree in 1936 and a Master of Science degree in chemistry in 1937 from Brown University before moving to Princeton University to study for his doctorate in mathematics.
During World War II, Tukey worked at the Fire Control Research Office and collaborated with Samuel Wilks and William Cochran .
Among many contributions to civil society, Tukey served on a committee of the American Statistical Association that produced a report challenging the conclusions of the Kinsey Report, Statistical Problems of the Kinsey Report on Sexual Behavior in the Human Male.
His statistical interests were many and varied. He is particularly remembered for his development, with James Cooley of the Cooley-Tukey Fast Fourier transform algorithm. Among his "smaller", but also lasting achievements were the term software and the box plot. In 1977, he introduced the quartile diagram .
He also contributed to statistical practice and articulated the important distinction between exploratory data analysis and confirmatory data analysis , believing that much statistical methodology placed too great an emphasis on the latter. Though he believed in the utility of separating the two types of analysis, he pointed out that sometimes, especially in natural science, this was problematic and termed such situations uncomfortable science.
A D Gordon offered the following summary of Tukey's principles for statistical practice :
... the usefulness and limitation of mathematical statistics; the importance of having methods of statistical analysis that are robust to violations of the assumptions underlying their use; the need to amass experience of the behaviour of specific methods of analysis in order to provide guidance on their use; the importance of allowing the possibility of data's influencing the choice of method by which they are analysed; the need for statisticians to reject the role of 'guardian of proven truth', and to resist attempts to provide once-for-all solutions and tidy over-unifications of the subject; the iterative nature of data analysis; implications of the increasing power, availability and cheapness of computing facilities; the training of statisticians.