The Data Docs is Automattic's new home made data discoverability solution. It collects and publishes our datasets' metadata to WordPress so it is accessible and searchable by our users. Here we describe the idea from its conception and explain how it works.
… Continue readingCategory: Data Engineering
SQL — a Common Language for the Whole Data Team
Today, I wanted to share how we’ve empowered colleagues outside the data engineering team to write their own data transformations by overcoming coding language barriers. Within our Data team, there are several specializations: Data Analysts create dashboards and analyze data for business leads and product teams across the company.Data Scientists apply machine learning technology at … Continue reading SQL — a Common Language for the Whole Data Team
Building Thousands of Reproducible ML Models with pipe, the Automattic Machine Learning Pipeline
Demet takes you deep into pipe, a tool that allows anyone at Automattic to build solid machine learning models.
… Continue readingReal-Time Elasticsearch Indexing on WordPress.com
Love databases, indexing, and Elasticsearch gymnastics? Greg Brown walks us through the indexing sausage factory on WordPress.com.
… Continue readingMay the Bot Be With You: How Algorithms are Supporting Happiness at WordPress.com
Charles Earl reports on Elfbot, a machine learning project geared to helping Happiness Engineers provide fast, efficient, wonderfully human support to WordPress.com users.
… Continue reading