Structured Data Crawler

Status

Open to develop

Categories

Submitted

This Actor crawls entire websites or individual web pages and extracts Google structured data markup and Schema.org markup in formats such as Microdata, RDFa, or JSON-LD. This is useful for getting structured data about the content of a web page as provided by its author, e.g. products, movies, people, articles, books, datasets, recipes, etc.
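
As a rough illustration of the extraction step for a single page, here is a minimal sketch that covers only JSON-LD (not Microdata or RDFa) and uses only the Python standard library. The names JsonLdExtractor and extract_json_ld are hypothetical and chosen for this example; they are not part of any existing Actor or of the Apify SDK. Crawling, fetching pages, and storing results are left out.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Hypothetical helper: collects parsed <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        # The parser lowercases tag and attribute names; the value may be None.
        script_type = (dict(attrs).get("type") or "").strip().lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Script content may arrive in several chunks, so buffer it.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                parsed = json.loads("".join(self._buffer))
            except json.JSONDecodeError:
                return  # skip malformed blocks instead of failing the whole page
            # A block may hold one object or a list of objects.
            self.items.extend(parsed if isinstance(parsed, list) else [parsed])


def extract_json_ld(html):
    """Return every JSON-LD item found in an HTML string, in document order."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return parser.items


if __name__ == "__main__":
    sample = (
        '<html><head><script type="application/ld+json">'
        '{"@context": "https://schema.org", "@type": "Product", "name": "Widget"}'
        "</script></head><body></body></html>"
    )
    for item in extract_json_ld(sample):
        print(item.get("@type"), item.get("name"))
```

Microdata and RDFa are embedded in element attributes rather than in a script block, so a real implementation would need a separate pass over the DOM or a dedicated parsing library for those formats.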

This is just a starting point. You’re free to adapt it, expand on it, or take it in a different direction. Treat this brief as guidance, not rules.
