Open Source


  • Organizing data.blog content via NLP and LLM

    The redesign of data.blog aimed to enhance content discoverability by making category and tag pages more prominent. We re-imagined the blog taxonomy developed over ten years. Using NLP and LLM techniques, we analyzed categories and tags to consolidate them and improve their clarity and relevance.

  • Synchronizing Data with Apache Superset: Our Internal Solution

    At Automattic, our deep appreciation for open source software is evident through our active contributions, with a primary focus on WordPress. Recently, we integrated Apache Superset to help support our intricate data visualization needs. One notable achievement includes the automation of dataset creation, a solution that not only resolved an issue but also enhanced our…

  • Women of Datamattic: Madison Swain-Bowden

    Welcome to Women of Datamattic—conversations with some of the remarkable women working all over the world to build, maintain, and explore Automattic’s data landscape and make the web a better place. Today’s interviewee is the Data Pagan, Madison Swain-Bowden.

  • The Data Docs: A WordPress based Data Discoverability Tool

    The Data Docs is Automattic’s new homemade data discoverability solution. It collects and publishes our datasets’ metadata to WordPress so that it is accessible and searchable by our users. Here we describe the idea from its conception and explain how it works.

  • ExPlat: Automattic’s Experimentation Platform

    Over the past 18 months, the Decision Science team has been building the Experimentation Platform (ExPlat): a tool to help our colleagues run experiments to improve customer experiences, inform product […]