Data Engineering


  • The Data Docs: A WordPress based Data Discoverability Tool

    The Data Docs is Automattic’s new home made data discoverability solution. It collects and publishes our datasets’ metadata to Wordpress so it is accessible and searchable by our users. Here we describe the idea from its conception and explain how it works.

  • SQL — a Common Language for the Whole Data Team

    Today, I wanted to share how we’ve empowered colleagues outside the data engineering team to write their own data transformations by overcoming coding language barriers. Within our Data team, there are several specializations: Data Analysts create dashboards and analyze data for business leads and product teams across the company. Data Scientists apply machine learning technology at…

  • Looker NYC Meetup

    Like any company, Automattic is constantly on a journey to get better: sometimes we have the good fortune of finding improvement in leaps and bounds, but most of the time, we move slowly, we make small changes, finding iterative wins and moving down the to‑do list.  I think probably this is how most progress happens:…

  • Reflections From Spark + AI Summit 2018

    Here are our favorite talks from Spark + AI Summit 2018

  • Real-Time Elasticsearch Indexing on WordPress.com

    Love databases, indexing, and Elasticsearch gymnastics? Greg Brown walks us through the indexing sausage factory on WordPress.com.