Notes – Data Science History

The term “Data Science” may sound modern, but its foundation was laid many decades ago. It evolved gradually through statistics, computing, and the explosion of digital data.


Timeline of Key Events


Year  Milestone
1962  John Tukey introduced the idea of combining statistics with computing
1974  Peter Naur used the term Data Science in his work
1996  First use of Data Science as a discipline in the International Federation of Classification Societies (IFCS) conference
2001  William S. Cleveland formalized Data Science as an independent field
2008  DJ Patil and Jeff Hammerbacher popularized the modern role of “Data Scientist” at LinkedIn and Facebook
2012  Harvard Business Review called Data Scientist the “Sexiest Job of the 21st Century”

How Did It Evolve?

  • Early Days (Before 1980s)
    • Focused on statistics and mathematics
    • Data was limited and mostly offline
  • Rise of Computing (1980s–1990s)
    • Computers enabled faster data processing
    • Relational database systems (queried with SQL) became widespread
  • Internet Era (2000s)
    • Huge increase in data from online activities
    • Need for automated data analysis grew
  • Big Data & AI Era (2010s–Present)
    • Tools like Hadoop, Spark, and the Python data stack became widely used
    • Machine Learning and AI became core parts of data science

What Changed Over Time?


Then (Before 2000s)        Now (Post-2010s)
Manual data analysis       Automated, real-time analytics
Limited, structured data   Huge, diverse (text, image, video) data
Statistical modeling only  ML, AI, and Deep Learning-based models
Excel and basic tools      Python, R, Big Data tools