Data Science Briefing #307


(view in browser)

Feb 25th

Next webinar:
Mar 18, 2026 - Gemini API with VertexAI for Developers

Dear Reader,

Welcome to the Feb 25th issue of our newsletter!

Announcements

The first edition of CrewAI for Production-Ready Multi‑Agent Systems was a great success, and we're already planning the next edition. Meanwhile, if you missed the live session, I've put together a package on Gumroad so you can work through it at your own pace.

It's five Jupyter notebooks walking through everything from the basics of CrewAI agents to production-grade patterns: configuration-driven agents, memory across sessions, human approval checkpoints, multi-LLM routing, and structured logging. Each module is self-contained, runs against real APIs, and was tested during a live training session.

Details here!

This week’s links trace the steady migration from “AI as a chat box” to “AI as a practical engineering substrate.” One piece sits at the frontier: it explores why math is such a brutal benchmark for generative models, and why even imperfect systems can still be transformative as collaborators that search, check, and remix ideas at scale. On the builder side, a hands-on walkthrough demystifies reinforcement learning from human feedback by spelling out the moving parts, so that “alignment” stops being magic and becomes code you can read.
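To give a taste of those moving parts, the heart of reward modeling in RLHF is a pairwise preference loss. Here's a minimal sketch in plain Python (the scalar rewards are stand-ins for a reward model's outputs, not code from the linked walkthrough):

```python
import math

def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Bradley-Terry pairwise loss used to train RLHF reward models:
    -log(sigmoid(r_chosen - r_rejected)). The loss shrinks as the
    model scores the human-preferred response higher."""
    margin = reward_chosen - reward_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# A correctly ordered pair is cheap; a reversed pair is expensive.
good = preference_loss(2.0, -1.0)   # chosen response scored higher
bad = preference_loss(-1.0, 2.0)    # rejected response scored higher
print(good, bad)
```

Minimizing this loss over many human-labeled pairs is what turns raw preferences into a reward signal the policy can then be optimized against.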

Another guide frames retrieval-augmented generation as a maturity ladder: you start with the toy version, then quickly discover that real reliability comes from unglamorous decisions like chunking strategy, hybrid retrieval, and reranking (plus evaluation that punishes confident nonsense). There’s even a delightfully concrete detour into graph pruning for game maps, showing how simple constraints (stay connected, avoid degenerate paths) and a better search strategy can turn “design intuition” into an algorithm. And finally, the hardware story keeps accelerating: a “thinking” model small enough to run fully on-device under 1GB hints at a near future where private, offline reasoning becomes a default product feature instead of a luxury add-on.
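To make those "unglamorous decisions" concrete, here's a toy hybrid-retrieval sketch in plain Python. The scoring functions are deliberately simplified stand-ins (a production system would use BM25 and learned embeddings), but the blend-then-rerank shape is the same:

```python
from collections import Counter
import math

DOCS = [
    "Chunking strategy determines what the retriever can actually find.",
    "Hybrid retrieval combines keyword search with embedding similarity.",
    "Rerankers reorder candidates so the best passage lands on top.",
]

def keyword_score(query: str, doc: str) -> float:
    # Sparse signal: fraction of query terms present in the document.
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / max(len(q), 1)

def vector_score(query: str, doc: str) -> float:
    # Dense-signal stand-in: cosine similarity over character bigrams.
    def bigrams(s: str) -> Counter:
        s = s.lower()
        return Counter(s[i:i + 2] for i in range(len(s) - 1))
    a, b = bigrams(query), bigrams(doc)
    dot = sum(a[g] * b[g] for g in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def hybrid_search(query: str, alpha: float = 0.5) -> list[str]:
    # Blend sparse and dense scores, then rank by the combined score.
    scored = [(alpha * keyword_score(query, d)
               + (1 - alpha) * vector_score(query, d), d) for d in DOCS]
    return [d for _, d in sorted(scored, reverse=True)]

print(hybrid_search("hybrid keyword retrieval")[0])
```

Swapping in real chunking, BM25, embeddings, and a cross-encoder reranker is exactly the maturity ladder the guide describes.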

This week’s papers sketch a sobering picture of “smart” systems operating inside messy social worlds: even when a model looks like it’s reasoning, its performance can collapse as problems get more complex, suggesting we should treat chain-of-thought fluency as a surface signal, not a guarantee of reliable cognition. That brittleness shows up in the wild too: sustained, multi-turn interaction can quietly derail a model’s internal state, turning helpful assistants into confident wanderers unless we build guardrails, memory, and evaluation suites that stress-test long-horizon coherence. Layer on top the question of moral competence, and the challenge becomes less “did it answer?” and more “did it answer responsibly, consistently, and for the right reasons across contexts?”

Meanwhile, platform dynamics remind us that outputs don’t land in a vacuum: ranking algorithms can reshape political exposure at scale, and coordinated “chaos agents” can exploit attention systems, narrative incentives, and model weaknesses to push confusion as a strategy. Even our favorite fallback, crowdsourcing, isn’t immune: network topology can systematically distort collective perception, amplifying local consensus into global certainty. The emerging picture is clear: if we want trustworthy AI, we need measurement that spans reasoning difficulty, conversational durability, ethical judgment, privacy-preserving data practices (like federated approaches with local differential privacy), and the surrounding information ecosystem that ultimately determines what people see, share, and believe.

Our current book recommendation is "Visualizing Generative AI: How AI Paints, Writes, and Assists" by P. Vergadia and V. Lakshmanan. You can find all the previous book reviews on our website. This week's video is an overview of Hamming codes, the origin of error correction.
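As a companion to the video, here's a minimal Hamming(7,4) sketch in Python (my own illustrative implementation, not code from the video): four data bits gain three parity bits, and the parity checks let you locate and correct any single flipped bit.

```python
def hamming_encode(d: list[int]) -> list[int]:
    """Encode 4 data bits [d1, d2, d3, d4] into 7 bits, with parity
    bits at positions 1, 2, and 4 (classic Hamming(7,4) layout)."""
    d1, d2, d3, d4 = d
    p1 = d1 ^ d2 ^ d4          # covers positions 3, 5, 7
    p2 = d1 ^ d3 ^ d4          # covers positions 3, 6, 7
    p3 = d2 ^ d3 ^ d4          # covers positions 5, 6, 7
    return [p1, p2, d1, p3, d2, d3, d4]

def hamming_correct(code: list[int]) -> list[int]:
    """Recompute the parity checks; their combined binary value (the
    syndrome) is the 1-based position of the error, 0 meaning none."""
    c = list(code)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    syndrome = s1 + 2 * s2 + 4 * s3
    if syndrome:
        c[syndrome - 1] ^= 1   # flip the corrupted bit back
    return [c[2], c[4], c[5], c[6]]   # extract the data bits
```

Flip any one of the seven transmitted bits and `hamming_correct` still recovers the original four data bits, which is exactly the trick the video unpacks.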

Data shows that the best way for a newsletter to grow is by word of mouth, so if you think one of your friends or colleagues would enjoy this newsletter, go ahead and forward this email to them. This will help us spread the word!

Semper discentes,

The D4S Team


"Visualizing Generative AI: How AI Paints, Writes, and Assists" by P. Vergadia and V. Lakshmanan is a concept-first, diagram-rich guide that makes modern GenAI feel legible. Priyanka Vergadia’s visual explanations are the star: clean mental models for tokens, embeddings, transformers, and “why the model says what it says,” without burying you in math. It’s the kind of book that quickly helps you hold the whole system in your head.

For data scientists and ML engineers, the best value is the shared vocabulary it builds for real-world conversations: architecture tradeoffs, where GenAI fits in products, and what it’s actually good at today (assistive workflows, automation, and augmentation more than magic). It also doesn’t dodge the sharp edges, such as hallucinations, security concerns, and practical limitations, so you’re not left with a glossy, hype-only view.

The main drawback is depth: if you want rigorous internals, training dynamics, evaluation deep dives, or extensive code and end-to-end implementation details, this isn’t the book for you. But as a quick, sticky mental map, something you can read in a weekend and keep referencing when you’re designing, reviewing, or educating stakeholders, it’s a very strong pick, and likely to earn a spot on your “worth recommending” shelf.


  1. The Edge of Mathematics [theatlantic.com]
  2. RLHF From Scratch [github.com/ashworks1706]
  3. Graph Topology and Battle Royale Mechanics [blog.lukesalamone.com]
  4. RAG Systems in 5 Levels of Difficulty [medium.com/data-science-collective]
  5. Claude Code in Action [anthropic.skilljar.com]
  6. LFM2.5-1.2B-Thinking: On-Device Reasoning Under 1GB [liquid.ai]


But what are Hamming codes? The origin of error correction


All the videos of the week are now available in our YouTube playlist.

Upcoming Events:

Opportunities to learn from us

On-Demand Videos:

Long-form tutorials

Data For Science, Inc
