Today, I wanted to share how we’ve empowered colleagues outside the data engineering team to write their own data transformations by overcoming coding language barriers. Within our Data team, there are several specializations: Data Analysts create dashboards and analyze data for business leads and product teams across the company. Data Scientists apply machine learning technology at … Continue reading SQL — a Common Language for the Whole Data Team
Category: Data Engineering
Building Thousands of Reproducible ML Models with pipe, the Automattic Machine Learning Pipeline
Demet takes you deep into pipe, a tool that allows anyone at Automattic to build solid machine learning models.
… Continue reading
Real-Time Elasticsearch Indexing on WordPress.com
Love databases, indexing, and Elasticsearch gymnastics? Greg Brown walks us through the indexing sausage factory on WordPress.com.
… Continue reading
May the Bot Be With You: How Algorithms are Supporting Happiness at WordPress.com
Charles Earl reports on Elfbot, a machine learning project geared to helping Happiness Engineers provide fast, efficient, wonderfully human support to WordPress.com users.
… Continue reading
Bulk Log Analytics With Hive
Leveraging the distributed power of MapReduce to run custom log analysis or one-time queries on raw data is fast and easy, and you don't even have to build a complicated ETL process to do it. The data engineering team at WordPress.com recently used this approach to query tens of billions of log lines with just a couple of minutes of work.
… Continue reading