Data Discoverability


  • Organizing data.blog content via NLP and LLM

    The redesign of data.blog aimed to enhance content discoverability by making category and tag pages more prominent. We re-imagined the blog taxonomy developed over ten years. Using NLP and LLM techniques, we analyzed categories and tags to consolidate and improve on clarity and relevance.

  • New data.blog is here: Designed for Discovery

    The new data.blog has launched with a refreshed visual design and improved user experience. Key enhancements include a prominent search bar, better-organized categories, and an updated logo. The redesign aims to facilitate navigation and inspire community engagement for both new and existing authors while encouraging content exploration.

  • Synchronizing Data with Apache Superset: Our Internal Solution

    At Automattic, our deep appreciation for open source software is evident through our active contributions, with a primary focus on WordPress. Recently, we integrated Apache Superset, to help support our intricate data visualization needs. One notable achievement includes the automation of dataset creation, a solution that not only resolved an issue but also enhanced our…

  • The Data Docs: A WordPress based Data Discoverability Tool

    The Data Docs is Automattic’s new home made data discoverability solution. It collects and publishes our datasets’ metadata to Wordpress so it is accessible and searchable by our users. Here we describe the idea from its conception and explain how it works.