Data Science Briefing #256


(view in browser)

Sept 18th

Next webinar:
Sep 19, 2024 - LLMs for Data Science [Register]
Count down to 2024-09-19T17:00:00.000Z

Welcome to the Sep 18th edition of the Data Science Briefing!

We want to remind you that the very first edition of the LLMs for Data Science webinar will be held Tomorrow! There are still a couple of spots left, so Register now!

The most recent post on the Epidemiology series, Epidemiology 303: Metapopulation Models, explores how to connect multiple populations through a travel matrix. In the graph subsection, you'll find all you need to know about k-core Decomposition, while in the Visualization section, you can explore The Effects of Vaccination through a WSJ visualization.

In our regularly scheduled content, we learn how cosine similarity works, How LLMs Got Lost in the Network and Discovered Graph Reasoning and explore B-trees and database indexes.

On the academic front, we ground AI in reality with a little help from Data Commons, review the structure of street networks, and learn to Live with the fact that LLMs Will Always Hallucinate.

This week's book recommendation is "Why Machines Learn: The Elegant Math Behind Modern AI" by A. Ananthaswamy. You can find all the previous book recommendations on our website. In this week's video, we have a lecture by Terence Tao on AI and Mathematics.

Data shows that the best way for a newsletter to grow is by word of mouth, so if you think one of your friends or colleagues would enjoy this newsletter, go ahead and forward this email to them. This will help us spread the word!

Semper discentes,

The D4S Team


This week's book is "Why Machines Learn: The Elegant Math Behind Modern AI" by A. Ananthaswamy. The book introduces the main ideas and developments of Artificial Intelligence clearly and concisely. Starting with the invention of the Perceptron in the 50s, through all the significant developments of the last several decades, such as Support Vector Machines, Hopfield Networks, and Backpropagation, to the latest developments in Large Language Models. Ananthaswamy explains how they fit in the historical development of Computer Science and AI, as well as how they connect to insights originating in biology and psychology.

The book targets a general audience familiar with basic math. Mathematical concepts such as probability and linear algebra are introduced in an intuitive way that provides just enough detail to understand the more technical parts of the text. Overall, a great resource whether your reviewing these concepts or encountering them for the first time.


  1. How does cosine similarity work? [tomhazledine.com]
  2. B-trees and database indexes [planetscale.com]
  3. Learning to Reason with LLMs [openai.com]
  4. Geometric Search Trees [g-trees.github.io]
  5. Novel Architecture Makes Neural Networks More Understandable [quantamagazine.org]
  6. How I Mastered Data Structures and Algorithms [medium.com/algomaster-io]
  7. How the LLM Got Lost in the Network and Discovered Graph Reasoning [towardsdatascience.com]


Terence Tao at IMO 2024: AI and Mathematics

video preview

All the videos of the week are now available in our Youtube playlist.

Upcoming Events:

Opportunities to learn from us

On-Demand Videos:

Long-form tutorials

Data For Science, Inc

I'm a maker and blogger who loves to talk about technology. Subscribe and join over 3,000+ newsletter readers every week!

Read more from Data For Science, Inc

(view in browser) May 13th Next webinar: May 27, 2026 - Code Development with AI Assistants [Register] Dear Reader, Announcements Ever wonder how we can turn thousands of unstructured news articles into structured, actionable insights? In the latest post from Data4Sci, we dive into the fascinating process of transforming raw text from news articles into interconnected networks of information. If you're interested in Natural Language Processing (NLP), entity extraction, and how to connect the...

(view in browser) May 6th Next webinar: May 27, 2026 - Code Development with AI Assistants [Register] Dear Reader, Announcements Ever wonder how we can turn thousands of unstructured news articles into structured, actionable insights? In the latest post from Data4Sci, we dive into the fascinating process of transforming raw text from news articles into interconnected networks of information. If you're interested in Natural Language Processing (NLP), entity extraction, and how to connect the...

(view in browser) Apr 30th Next webinar: May 6, 2026 - Automate the Boring Developer Stuff with LLMs [Register] Dear Reader, Announcements ✈️ Mapping the skies: How do we visualize airline traffic between states? We often think of air travel in terms of airports, but viewing it as a network of state-to-state connections reveals fascinating patterns in how our country moves. Our latest substack uses data visualization to turn raw statistics into a clear story about infrastructure and mobility....