Data Science


  • Intro to Search: Measurement

    Welcome to the third part of our mini-series “Intro to Search.” In my previous posts, I’ve discussed the characteristics of great search results and what a search engine looks like from the inside. But, how do we know if our algorithms actually deliver relevant search results? The answer is, of course, by measurement! There is…

  • This Week in Data Reading

    There’s nothing tastier than a set of links to devour! Check out what we’ve been reading recently and be sure to share your links to thought-provoking articles and discussions on topics in the field of data science.

  • Evolution of a Plot: Better Data Visualization, One Step at a Time

    The goal of data visualization is to transform numbers into insights. However, default data visualization output often disappoints. Sometimes, the graph shows irrelevant data or misses important aspects; sometimes, the graph lacks context; sometimes, it’s difficult to read. Often, data practitioners “feel” that something isn’t right with the graph, but cannot pinpoint the problem. In this…

  • Network Science at Automattic: Mapping the Communities of WP.com — Methodology

    If you have read our analysis on the communities of WordPress.com and would like to know more about the methods behind it, then keep on reading! In this — slightly more technical — post, I will show how we constructed, filtered, projected, and clustered a network around WordPress.com users and blogs. Building the Network of WP.com People…

  • Intro to Search: Anatomy of a search engine

    Welcome to the second post in our “Intro to Search”-series! Today, we’ll dig into the building blocks of search engines to give you an idea of just how we identify what posts to show readers using WordPress.com’s search tool. A (web) search engine connects users with relevant documents. This process generally has five main stages:…