The most recommended big data books

Who picked these books? Meet our 33 experts.

33 authors created a book list connected to big data, and here are their favorite big data books.
Shepherd is reader supported. When you buy books, we may earn an affiliate commission.

What type of big data book?

Loading...
Loading...

Book cover of Out of the Crisis

Steve Fenton Author Of Web Operations Dashboards, Monitoring, & Alerting

From my list on DevOps from before DevOps was invented.

Why am I passionate about this?

I'm a programmer and technical author at Octopus Deploy and I'm deeply interested in DevOps. Since the 1950s, people have been studying software delivery in search of better ways of working. We’ve seen many revolutions since Lincoln Labs first introduced us to phased delivery, with lightweight methods transforming how we wrote software at the turn of the century. My interest in DevOps goes beyond my enthusiasm for methods in general, because we now have a great body of research that adds to our empirical observations on the ways we work.

Steve's book list on DevOps from before DevOps was invented

Steve Fenton Why did Steve love this book?

Before Agile and Lean had rocked the software development industry, William Deming was busy forging this new world of work.

Out of the Crisis is predominantly a management book, but it’s really the spark that started the lightweight movement in software delivery. A key concept in the book is how to identify the work system's performance, separate from the performance of individuals.

By W. Edwards Deming,

Why should I read it?

3 authors picked Out of the Crisis as one of their favorite books, and they share why you should read it.

What is this book about?

Essential reading for managers and leaders, this is the classic work on management, problem solving, quality control, and more—based on the famous theory, 14 Points for Management

In his classic Out of the Crisis, W. Edwards Deming describes the foundations for a completely new and transformational way to lead and manage people, processes, and resources. Translated into twelve languages and continuously in print since its original publication, it has proved highly influential. Research shows that Deming’s approach has high levels of success and sustainability. Readers today will find Deming’s insights relevant, significant, and effective in business thinking and practice. This…


Book cover of Predict and Surveil: Data, Discretion, and the Future of Policing

Luke Hunt Author Of Police Deception and Dishonesty: The Logic of Lying

From my list on the cluster-f*ck we call policing.

Why am I passionate about this?

I’m an Associate Professor in the University of Alabama’s Department of Philosophy. I worked as an FBI Special Agent before making the natural transition to academic philosophy. Being a professor was always a close second to Quantico, but that scene in Point Break in which Keanu Reeves and Patrick Swayze fight Anthony Kiedis on the beach made it seem like the FBI would be more fun than academia. In my current position as a professor at the University of Alabama, I teach in my department’s Jurisprudence Specialization. My primary research interests are at the intersection of philosophy of law, political philosophy, and criminal justice. I’ve written three books on policing.

Luke's book list on the cluster-f*ck we call policing

Luke Hunt Why did Luke love this book?

I love this book because it reminds us of the many ways that technology can affect justice.

It is tempting to think sophisticated tactics such as “predictive policing” can solve all problems relating to human bias. However, Brayne shows that data and algorithms do not eliminate bias and discretion. Instead, high-tech police tools simply make bias less overt and visible, which erodes the public’s ability to hold the police accountable.

I especially enjoyed how the book flips the script, considering diverse ways to use these tools to help the public. For example, how can municipalities use technology to analyze the underlying factors that contribute to policing problems in the first place?

By Sarah Brayne,

Why should I read it?

1 author picked Predict and Surveil as one of their favorite books, and they share why you should read it.

What is this book about?

The scope of criminal justice surveillance, from the police to the prisons, has expanded rapidly in recent decades. At the same time, the use of big data has spread across a range of fields, including finance, politics, health, and marketing. While law enforcement's use of big data is hotly contested, very little is known about how the police actually use it in daily operations and with what consequences.

In Predict and Surveil, Sarah Brayne offers an unprecedented, inside look at how police use big data and new surveillance technologies, leveraging on-the-ground fieldwork with one of the most technologically advanced law…


Book cover of Good Data: An Optimist's Guide to Our Digital Future

Jamie Steane Author Of The Principles and Processes of Interactive Design

From my list on aspiring UX/UI designers in the digital age.

Why am I passionate about this?

I would like to consider myself an experienced and successful designer, researcher, and educator. I'm an Associate Professor in Communication Design and the Head of Education for the School of Design at Northumbria University in the United Kingdom, where I've taught and researched for the last twenty years so I'm super passionate about this subject and love explaining how design works. Before joining academia, I worked internationally as a designer and creative director for numerous prestigious design and media organizations, including Philips, Time-Warner, Windmill Lane Pictures, and WPP in the UK, Ireland, USA, and Southeast Asia. Working in these different businesses and locations gave me a broad perspective on the role and importance of design.

Jamie's book list on aspiring UX/UI designers in the digital age

Jamie Steane Why did Jamie love this book?

There is so much understandable suspicion about how organisations use or misuse your personal data that it's hard to see the many potential benefits of data sharing. This book restores a little faith in technology and those who develop it for public benefit.

It is a compelling read, learning how data can be used for good and bad, with many references to the author’s personal journey, from working in customer services to being an internet entrepreneur before becoming a researcher.

By Sam Gilbert,

Why should I read it?

1 author picked Good Data as one of their favorite books, and they share why you should read it.

What is this book about?

AN FT BUSINESS BOOK OF THE MONTH

'An essential read' Diane Coyle, University of Cambridge

'We are currently living in a moment of extreme pessimism about data. This book will change your mind.'

It's impossible to escape digital technology. And with that comes fear. But whatever the news has told you about data and technology, think again. Data expert and tech insider turned Cambridge researcher Sam Gilbert shows that, actually, this data revolution could be the best thing that ever happened to us.

Good Data examines the incredible new ways this information explosion is already helping us - whether that's…


Book cover of Counting: How We Use Numbers to Decide What Matters

Carolyn Purnell Author Of The Sensational Past: How the Enlightenment Changed the Way We Use Our Senses

From my list on everyday things we take for granted.

Why am I passionate about this?

I’m a historian who’s spent far too much time thinking about how the color magenta contributed to climate change and why eighteenth-century humanitarians were obsessed with tobacco enemas. My favorite historical topics—like sensation, color, and truth—don’t initially seem historical, but that’s exactly why they need to be explored. I’ve learned that the things that seem like second nature are where our deepest cultural assumptions and unconscious biases hide. In addition to writing nonfiction, I’ve been lucky enough to grow up on a ranch, live in Paris, work as an interior design writer, teach high school and college, and help stray dogs get adopted.

Carolyn's book list on everyday things we take for granted

Carolyn Purnell Why did Carolyn love this book?

I had never really given much thought to counting until I read this book, but in the very first chapter, Stone made me rethink everything I thought I knew about “one fish, two fish, red fish, blue fish.” She shows that every time we count, we’re making cultural assumptions. For example, what counts as a fish? And what makes the color of the fish more relevant than other features? Counting reveals that while these choices may seem intuitive, basic, and meaningless, they have very real impacts on people’s lives. Especially when we use numbers to measure things like merit, poverty, race, and productivity, those fundamental assumptions matter more than we care to admit.  

By Deborah Stone,

Why should I read it?

1 author picked Counting as one of their favorite books, and they share why you should read it.

What is this book about?

Early in her extraordinary career, Deborah Stone wrote Policy Paradox, a landmark work on politics. Now, in Counting, she revolutionises how we approach numbers and shows how counting shapes the way we see the world. Most of us think of counting as a skill so basic that we see numbers as objective, indisputable facts. Not so, says Stone. In this playful-yet-probing work, Stone reveals the inescapable link between quantifying and classifying, and explains how counting determines almost every facet of our lives-from how we are evaluated at work to how our political opinions are polled to whether we get into…


Book cover of The Art of Statistics: How to Learn from Data

Valliappa Lakshmanan Author Of Data Science on the Google Cloud Platform: Implementing End-To-End Real-Time Data Pipelines: From Ingest to Machine Learning

From my list on if you want to become a data scientist.

Why am I passionate about this?

I started my career as a research scientist building machine learning algorithms for weather forecasting. Twenty years later, I found myself at a precision agriculture startup creating models that provided guidance to farmers on when to plant, what to plant, etc. So, I am part of the movement from academia to industry. Now, at Google Cloud, my team builds cross-industry solutions and I see firsthand what our customers need in their data science teams. This set of books is what I suggest when a CTO asks how to upskill their workforce, or when a graduate student asks me how to break into the industry.

Valliappa's book list on if you want to become a data scientist

Valliappa Lakshmanan Why did Valliappa love this book?

What if you are faced with a problem for which a standard approach doesn’t yet exist? In such a case, you will need to be able to figure out the approach from the first principles. This book will help you learn how to derive insights starting from raw data.

By David Spiegelhalter,

Why should I read it?

2 authors picked The Art of Statistics as one of their favorite books, and they share why you should read it.

What is this book about?

'A statistical national treasure' Jeremy Vine, BBC Radio 2

'Required reading for all politicians, journalists, medics and anyone who tries to influence people (or is influenced) by statistics. A tour de force' Popular Science

Do busier hospitals have higher survival rates? How many trees are there on the planet? Why do old men have big ears? David Spiegelhalter reveals the answers to these and many other questions - questions that can only be addressed using statistical science.

Statistics has played a leading role in our scientific understanding of the world for centuries, yet we are all familiar with the way…


Book cover of The College Dropout Scandal

Peter Temin Author Of The Vanishing Middle Class: Prejudice and Power in a Dual Economy

From my list on racial and economic inequality in the USA.

Why am I passionate about this?

Peter Temin is an economist and economic historian, currently a professor at MIT and the former head of the Economics Department. His research interests include macroeconomic history, the Great Depression, industry studies in both the nineteenth and twentieth centuries, and ancient Rome. 

Peter's book list on racial and economic inequality in the USA

Peter Temin Why did Peter love this book?

This is a positive book that shows how education can help Blacks and other minorities get an education that will help them stay out of mass incarceration. It is good to have a positive program as we attempt to deal with American racism.

By David Kirp,

Why should I read it?

1 author picked The College Dropout Scandal as one of their favorite books, and they share why you should read it.

What is this book about?

Higher education today faces numerous challenges, from quality to cost. But the fact that fewer than sixty percent of college freshmen graduate in six years and fewer than forty percent earn an associate degree in three years turns few heads. The dropout problem is especially acute for black and Latino students, those from poor families, and those who are first in their families to go to college. In short, millions of students are leaving college without a degree,
saddled with debt, and little to show for it.

In The College Dropout Scandal, David Kirp outlines the scale of the problem…


Book cover of An Ugly Truth: Inside Facebook's Battle for Domination

Roger Highfield Author Of The Dance of Life: Symmetry, Cells and How We Become Human

From my list on what big data is and how it impacts us.

Why am I passionate about this?

I’m the Science Director of the Science Museum Group, based at the Science Museum in London, and visiting professor at the Dunn School, University of Oxford, and Department of Chemistry, University College London. Every time I write a book I swear that it will be my last and yet I'm now working on my ninth, after earlier forays into the physics of Christmas and the love life of Albert Einstein. Working with Peter Coveney of UCL, we're exploring ideas about computation and complexity we tackled in our two earlier books, along with the revolutionary implications of creating digital twins of people from the colossal amount of patient data now flowing from labs worldwide.

Roger's book list on what big data is and how it impacts us

Roger Highfield Why did Roger love this book?

‘They trust me….dumb f*cks.’ This telling exchange from the Harvard days of Facebook co-founder and CEO, Mark Zuckerberg appears in An Ugly Truth, which shines a harsh light on the tech behemoth that, ultimately, is built on the data of billions of people. As Meta, Zuckerberg’s new business incarnation, wafts into the virtual worlds of the metaverse, the story of Facebook is far from over, which makes this engaging book a tad unsatisfying. Nonetheless, it is a vivid example of how with Big Data comes Big Responsibility.

By Sheera Frenkel, Cecilia Kang,

Why should I read it?

1 author picked An Ugly Truth as one of their favorite books, and they share why you should read it.

What is this book about?

'An explosive new book' Daily Mail

'[A] careful, comprehensive interrogation of every major Facebook scandal. An Ugly Truth provides the kind of satisfaction you might get if you hired a private investigator to track a cheating spouse: it confirms your worst suspicions and then gives you all the dates and details you need to cut through the company's spin' New York Times

__________________________________________

Award-winning New York Times reporters Sheera Frenkel and Cecilia Kang unveil the tech story of our times in this riveting, behind-the-scenes expose that offers the definitive account of Facebook's fall from grace. Once one of Silicon Valley's…


Book cover of Free: Why Science Hasn't Disproved Free Will

S.M. Amadae Author Of Prisoners of Reason: Game Theory and Neoliberal Political Economy

From my list on to move beyond neoliberalism.

Why am I passionate about this?

I have been studying neoliberal political economy and its future transformations since I wrote Rationalizing Capitalist Democracy. One major insight has been the deep entanglement of neoliberal political-economic practices with de facto power relations. The liberal normative bargaining characterizing Adam Smith’s Wealth of Nations yields to coercive bargaining in which threats of harm are the surest and best means to get one’s way. If one seeks to understand how systems will evolve when governed by strategic competition, then orthodox game theory is useful. However, if one seeks to live in a post-scarcity society in which genuine cooperation is possible, then we can enact solidarity, trust-based relationships, and collective moral accountability. 

S.M.'s book list on to move beyond neoliberalism

S.M. Amadae Why did S.M. love this book?

In order to be moral and responsible agents, our will must be free in the sense that we make choices animated by our individual consciences. Much of the neoliberal consumer world uses big data sets and our personalized digital fingerprints in order to cater to our every wish and desire, and to sell merchandise. Research shows that individuals disregard ethical responsibility when they believe that humans are not free, and that we are instead governed by innate drives and biological functions. Mele challenges recent research that uses cognitive science to argue that the human will is not free and instead exists as an illusion. This book provides a deep analysis of why we have grounds to be confident that we can act freely, governed by our internal beliefs, commitments, and goals.

By Alfred R. Mele,

Why should I read it?

1 author picked Free as one of their favorite books, and they share why you should read it.

What is this book about?

Does free will exist? The question has fueled heated debates spanning from philosophy to psychology and religion. The answer has major implications, and the stakes are high. To put it in the simple terms that have come to dominate these debates, if we are free to make our own decisions, we are accountable for what we do, and if we aren't free, we're off the hook.

There are neuroscientists who claim that our decisions are made unconsciously and are therefore outside of our control and social psychologists who argue that myriad imperceptible factors influence even our minor decisions to the…


Book cover of Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale

Tomasz Lelek Author Of Software Mistakes and Tradeoffs: How to make good programming decisions

From my list on big data processing ecosystem.

Why am I passionate about this?

I am motivated by working on products that many people use. I've been a part of companies that deliver products impacting millions of people. To achieve it, I am working in the Big Data ecosystem and striving to simplify it by contributing to Dremio's Data LakeHouse solution. I worked on projects using Spark, HDFS, Cassandra, and Kafka technologies. I have been working in the software engineering industry for ten years now, and I've tried to share my experience and lessons learned in the Software Mistakes and Tradeoffs book, hoping that it will allow current and the next generation of engineers to create better software, leading to more happy users.

Tomasz's book list on big data processing ecosystem

Tomasz Lelek Why did Tomasz love this book?

Apache Kafka is the backbone of almost every streaming-based system today.

The solutions created and implemented in Kafka are the key concepts in every streaming system that you will work with.

This book will allow you to fully understand the Kafka architecture, its internals, and APIs and allow you to become an expert in this technology.

By Neha Narkhede, Gwen Shapira, Todd Palino

Why should I read it?

1 author picked Kafka as one of their favorite books, and they share why you should read it.

What is this book about?

Every enterprise application creates data, whether it's log messages, metrics, user activity, outgoing messages, or something else. And how to move all of this data becomes nearly as important as the data itself. If you're an application architect, developer, or production engineer new to Apache Kafka, this practical guide shows you how to use this open source streaming platform to handle real-time data feeds.

Engineers from Confluent and LinkedIn who are responsible for developing Kafka explain how to deploy production Kafka clusters, write reliable event-driven microservices, and build scalable stream-processing applications with this platform. Through detailed examples, you'll learn Kafka's…


Book cover of Winning the Loser's Game: Timeless Strategies for Successful Investing

Stephen R. Foerster Author Of In Pursuit of the Perfect Portfolio: The Stories, Voices, and Key Insights of the Pioneers Who Shaped the Way We Invest

From my list on developing your investment philosophy.

Why am I passionate about this?

I’ve been interested in investing for over four decades since I started as a finance PhD student at Wharton. Since then my research has focused on understanding the stock market. Early on, I tried applying my research to my investing. For example, I was convinced that a recently listed stock called Google was way overvalued—was I ever wrong! That got me to reflect on my investment philosophy—what did I truly believe about how markets really behaved? That brought me back to understanding and appreciating the contributors to Modern Portfolio Theory, which led to a fun decade-long book project. Currently I enjoy writing about investing through my blog.

Stephen's book list on developing your investment philosophy

Stephen R. Foerster Why did Stephen love this book?

I had the pleasure of interviewing Charley for our book.

He’s a great storyteller. He was probably the first practitioner to advocate for passive index investing. He’s a tennis enthusiast, and his book was inspired by a book he read aimed at amateur tennis players. Ellis learned that to win at tennis, the best strategy is to simply try to not lose, and to not try to act like professional players.

He realized that the same strategy worked for investors as well. That means that investors shouldn’t try to beat the market.

By Charles Ellis,

Why should I read it?

5 authors picked Winning the Loser's Game as one of their favorite books, and they share why you should read it.

What is this book about?

The definitive guide to long-term investing success-fully updated to address the realities of today's markets

Technology, information overload, and increasing market dominance by expert investors and computers make it harder than ever to produce investing results that overcome operating costs and fees. Winning the Loser's Game reveals everything you need to know to reduce costs, fees, and taxes, and focus on long-term policies that are right for you.

Candid, short, and super easy to read, Winning the Loser's Game walks you through the process of developing and implementing a powerful investing strategy that generates solid profits year after year. In…