Data Science Briefing #308


(view in browser)

Mar 5th

Next webinar:
Mar 18, 2026 - Gemini API with VertexAI for Developers

Dear Reader,

Welcome to the 308th issue of our newsletter!

Announcements

The first edition of CrewAI for Production-Ready Multi‑Agent Systems was a great success, and we're already planning the next edition. Meanwhile, if you missed the live session, I've put together a package on Gumroad so you can work through the material at your own pace.

It's five Jupyter notebooks walking through everything from the basics of CrewAI agents to production-grade patterns: configuration-driven agents, memory across sessions, human approval checkpoints, multi-LLM routing, and structured logging. Each module is self-contained, runs against real APIs, and was tested during a live training session.
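To give a flavor of one of those patterns, here's a framework-agnostic sketch of multi-LLM routing: cheap, simple tasks go to a small model and harder ones to a larger model. The model names and the length-based complexity heuristic are illustrative assumptions, not the course material itself.

```python
# Hypothetical multi-LLM router. Model names are placeholders; in a real
# crew you would map them to actual LLM clients.

def route_task(task: str, threshold: int = 50) -> str:
    """Pick a model based on a crude complexity heuristic:
    longer task descriptions go to the bigger model."""
    return "large-model" if len(task) > threshold else "small-model"

def run_task(task: str) -> dict:
    model = route_task(task)
    # This is where you'd actually call the chosen LLM.
    return {"task": task, "model": model}

print(run_task("Summarize this email."))
# Short description, so it's routed to the small model.
```

In practice the heuristic would be something smarter (a classifier, token count, or task type), but the shape of the pattern is the same: a routing function that decides, per task, which backend to pay for.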

Details here!

This week’s newsletter connects “how it works” and “why it matters.” On the delightfully nerdy end, one piece tackles the question of whether data has weight, walking through why solid-state drives can (in theory) get infinitesimally heavier as you write to them, even if it’s the kind of “measurement” that mostly lives in physics-trivia land.
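As a back-of-envelope illustration of why this lives in physics-trivia land (my own numbers, not the article's): if writing data changed the stored energy of a drive's flash cells by, say, a picojoule, the mass equivalent via m = E/c² would be hopelessly below anything a scale can detect.

```python
# Back-of-envelope: mass equivalent of stored energy via E = m * c^2.
# The energy figure below is an illustrative assumption, not a measured value.

C = 299_792_458  # speed of light, m/s

def mass_equivalent(energy_joules: float) -> float:
    """Return the mass (kg) corresponding to a given energy, m = E / c^2."""
    return energy_joules / C**2

# Suppose a full write changes the stored electron energy by ~1e-12 J:
delta_m = mass_equivalent(1e-12)
print(f"{delta_m:.3e} kg")  # on the order of 1e-29 kg, far below any scale
```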

On the sobering end, a deep dive into tens of thousands of kernel issues asks a more human question: who tends to introduce bugs, when they slip in, and what review dynamics seem to help catch them faster. If you’re building with coding agents, another guide usefully reframes “prompting” as engineering: break work into checkable steps, force explicit plans, and design tight feedback loops so the agent can self-correct instead of confidently wandering. For hands-on learning, there are two excellent interactive explainers: one that makes decision trees feel concrete (splits, entropy, overfitting) and another that visually demystifies diffusion by turning noisy static into an image you can see emerge.
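For the decision-tree explainer, the quantity behind "good splits" is entropy: a pure node has zero entropy, a maximally mixed one has one bit, and a split is good if it lowers the weighted entropy of the children. A minimal sketch (the label lists are made up for illustration):

```python
from collections import Counter
from math import log2

def entropy(labels) -> float:
    """Shannon entropy (in bits) of a sequence of class labels."""
    counts = Counter(labels)
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in counts.values())

# A 50/50 node is maximally mixed; a lopsided node is closer to pure.
print(entropy(["a", "a", "b", "b"]))  # 1.0
print(entropy(["a", "b", "b", "b"]))  # ~0.811
```

A tree-building algorithm evaluates candidate splits by exactly this kind of computation, picking the feature threshold that reduces entropy the most, and overfitting is what happens when it keeps splitting until every leaf is trivially pure.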

And finally, a timely reminder that capability cuts both ways: new research shows how LLM-driven pipelines can scale deanonymization by stitching together identity signals scattered across platforms.

On the academic front, the thread running through this week’s papers is that “systems” don’t just enact behaviors, but rather compute, coordinate, and leave fingerprints you can measure. One study uses cause-specific excess mortality to reveal how the pandemic’s toll in rural India didn’t arrive as a single wave, but as a shifting mix of direct COVID deaths and knock-on effects that standard reporting often misses.

Zooming out, another paper asks what it even means for a system to compute, pushing past the metaphor of “everything is information” toward a more careful accounting of who is doing the computing, what counts as input/output, and where the boundaries of inference actually sit. That same boundary-setting shows up in AI: work on discovering multi-agent learning algorithms with large language models treats LLMs less as oracles and more as search engines over strategy space.

But once agents start interacting at scale, their society becomes an object of study too: early social-network analysis of agent communities reveals emergent roles, clustering, and influence patterns that look uncannily familiar, except that the “people” are programs. In parallel, research on the statistical signature of LLMs suggests that generated text carries detectable regularities, which matters for everything from provenance to governance: if synthetic content is measurable, so is its diffusion through a network. And hovering over the whole stack are papers on economic complexity and on bitcoin as a quasi-religious techno-libertarian project, both of which point to a shared lesson: incentives and beliefs are part of the system dynamics, shaping what gets built, what gets adopted, and what narratives become “common sense” long before the metrics catch up.

Our current book recommendation is "Visualizing Generative AI: How AI Paints, Writes, and Assists" by P. Vergadia and V. Lakshmanan. You can find all the previous book reviews on our website. In this week's video, we compare AI Agent frameworks: AutoGen, CrewAI, and LangGraph.

Data shows that the best way for a newsletter to grow is by word of mouth, so if you think one of your friends or colleagues would enjoy this newsletter, go ahead and forward this email to them. This will help us spread the word!

Semper discentes,

The D4S Team


"Visualizing Generative AI: How AI Paints, Writes, and Assists" by P. Vergadia and V. Lakshmanan is a concept-first, diagram-rich guide that makes modern GenAI feel legible. Priyanka Vergadia’s visual explanations are the star: clean mental models for tokens, embeddings, transformers, and “why the model says what it says,” without burying you in math. It’s the kind of book that helps you keep the whole system in your head fast.

For data scientists and ML engineers, the best value is the shared vocabulary it builds for real-world conversations: architecture tradeoffs, where GenAI fits in products, and what it’s actually good at today (assistive workflows, automation, and augmentation more than magic). It also doesn’t dodge the sharp edges, such as hallucinations, security concerns, and practical limitations, so you’re not left with a glossy, hype-only view.

The main drawback is depth: if you want rigorous internals, training dynamics, evaluation deep dives, or extensive code and end-to-end implementation details, this isn’t the book for you. But as a quick, sticky mental map, something you can read in a weekend and keep referencing when you’re designing, reviewing, or educating stakeholders, it’s a very strong pick, and likely to earn a spot on your “worth recommending” shelf.


  1. The largest open-source humanized voice library [github.com/jaymunshi]
  2. Does Data Really Have Weight? [cubiclenate.com]
  3. Who Writes the Bugs? A Deeper Look at 125,000 Kernel Vulnerabilities [pebblebed.com]
  4. Agentic Engineering Patterns [simonwillison.net]
  5. Decision Trees [mlu-explain.github.io]
  6. From Noise to Image - Interactive guide to diffusion [lighthousesoftware.co.uk]
  7. The Complete Guide to Building Skills for Claude [resources.anthropic.com]
  8. An interactive intro to quadtrees [growingswe.com]
  9. Agent of Empires: A terminal session manager for AI coding agents on Linux and macOS. [github.com/njbrake]
  10. Large-Scale Online Deanonymization with LLMs [simonlermen.substack.com]


AutoGen vs CrewAI vs LangGraph


All the videos of the week are now available in our YouTube playlist.

Upcoming Events:

Opportunities to learn from us

On-Demand Videos:

Long-form tutorials

Data For Science, Inc

I'm a maker and blogger who loves to talk about technology. Subscribe and join over 3,000 newsletter readers every week!

Read more from Data For Science, Inc
