Leveraging the distributed power of MapReduce to perform custom log analysis or one-time queries on raw data is fast and easy, and you don't even have to build a complicated ETL process to do it. The data engineering team at WordPress.com recently used this approach to query tens of billions of log lines with just a couple of minutes of work.
Category: Data Engineering
State of WordPress.com Elasticsearch Systems 2016
We get asked periodically about how extensively we are using Elasticsearch. It has come up twice in the past week, so it is time to write a blog post. We are constantly expanding what we use Elasticsearch for, so although some previous posts have broadly defined what we are doing, they don't really capture …