Tag: Structured Contents

Blog articles related to API features and announcements specific to our beta Structured Contents initiative.

← Back to Blog: Latest Articles

Unlock Wikipedia Tables as Structured JSON: Introducing Parsed Tables in Wikimedia Enterprise

10 Sep 2025

Access Wikipedia’s most valuable tables as structured JSON with the new Parsed Tables feature from Wikimedia Enterprise. Instantly convert complex tables into clean, machine-readable data without scraping. Enhance your AI, search, and knowledge graph projects with reliable, human-curated facts that were previously locked away in HTML and wikitext.
Read this article
Wikipedia Kaggle Dataset using Structured Contents Snapshot

16 Apr 2025

Explore Wikipedia content in a clean, structured format with our new beta dataset on Kaggle. Built from our Snapshot API using the Structured Contents beta, it’s ideal for data science, ML training, and experimentation.
Read this article
Parsing Wikipedia References with Quality Scoring Models

19 Mar 2025

The latest API release boosts Wikipedia data integration with parsed references in JSON and two quality scoring models – Reference Need and Reference Risk. These enhancements streamline citation access and improve content reliability for developers.
Read this article
Wikipedia Hugging Face Dataset using Structured Contents Snapshot

19 Sep 2024

We’re releasing an early beta dataset on Hugging Face, offering structured content from English and French Wikipedia. This machine-readable dataset, derived from our Snapshot API’s new Structured Contents beta, opens up new possibilities for AI and machine learning applications.
Read this article
Structured Contents extends to Snapshot API

19 Sep 2024

Snapshot API now includes a beta Structured Contents endpoint, offering bulk access to parsed Wikipedia data for testing partners.
Read this article
Build a Knowledge Panel with Structured Wikipedia API

25 Jun 2024

In this engineering tutorial, we show a simple way to build a working knowledge panel pulling pre-parsed content from Wikipedia articles using Wikimedia Enterprise API Structured Contents endpoint.
Read this article
Parsed Article Sections and Short Descriptions

30 Apr 2024

We’ve expanded the data available in our On-Demand Structured Contents endpoint by introducing two significant features: Article Body Sections and short Descriptions.
Read this article
Wikipedia API Parsed Infobox. Introducing Structured Contents

15 Sep 2023

We’ve heard all your requests for a more machine-readable API for Wikimedia data. We are announcing a new Structured Contents endpoint with the fully parsed contents of Wikipedia article Infoboxes in JSON! Jump into the article to read about it and get started.
Read this article