
SEO/GEO - Schema Markup Scraper
Pricing
$12.00/month + usage

SEO/GEO - Schema Markup Scraper
This actor to fetches JSON-LD/Schema Markup from Multiple URLs & checks whether the page contains markups for the following types: AggregateRating, Article, Event, FAQPage, LocalBusiness, Organization, Person, Product, & Review. Schema Markup helps search and generative engines find & read webpages.
0.0 (0)
Pricing
$12.00/month + usage
0
Monthly users
3
Runs succeeded
>99%
Last modified
4 days ago
You can access the SEO/GEO - Schema Markup Scraper programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.
1{
2 "openapi": "3.0.1",
3 "info": {
4 "version": "0.0",
5 "x-build-id": "qQg5htAWaihIR9qDs"
6 },
7 "servers": [
8 {
9 "url": "https://api.apify.com/v2"
10 }
11 ],
12 "paths": {
13 "/acts/wisteria_banjo~schema-markup-scraper/run-sync-get-dataset-items": {
14 "post": {
15 "operationId": "run-sync-get-dataset-items-wisteria_banjo-schema-markup-scraper",
16 "x-openai-isConsequential": false,
17 "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
18 "tags": [
19 "Run Actor"
20 ],
21 "requestBody": {
22 "required": true,
23 "content": {
24 "application/json": {
25 "schema": {
26 "$ref": "#/components/schemas/inputSchema"
27 }
28 }
29 }
30 },
31 "parameters": [
32 {
33 "name": "token",
34 "in": "query",
35 "required": true,
36 "schema": {
37 "type": "string"
38 },
39 "description": "Enter your Apify token here"
40 }
41 ],
42 "responses": {
43 "200": {
44 "description": "OK"
45 }
46 }
47 }
48 },
49 "/acts/wisteria_banjo~schema-markup-scraper/runs": {
50 "post": {
51 "operationId": "runs-sync-wisteria_banjo-schema-markup-scraper",
52 "x-openai-isConsequential": false,
53 "summary": "Executes an Actor and returns information about the initiated run in response.",
54 "tags": [
55 "Run Actor"
56 ],
57 "requestBody": {
58 "required": true,
59 "content": {
60 "application/json": {
61 "schema": {
62 "$ref": "#/components/schemas/inputSchema"
63 }
64 }
65 }
66 },
67 "parameters": [
68 {
69 "name": "token",
70 "in": "query",
71 "required": true,
72 "schema": {
73 "type": "string"
74 },
75 "description": "Enter your Apify token here"
76 }
77 ],
78 "responses": {
79 "200": {
80 "description": "OK",
81 "content": {
82 "application/json": {
83 "schema": {
84 "$ref": "#/components/schemas/runsResponseSchema"
85 }
86 }
87 }
88 }
89 }
90 }
91 },
92 "/acts/wisteria_banjo~schema-markup-scraper/run-sync": {
93 "post": {
94 "operationId": "run-sync-wisteria_banjo-schema-markup-scraper",
95 "x-openai-isConsequential": false,
96 "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
97 "tags": [
98 "Run Actor"
99 ],
100 "requestBody": {
101 "required": true,
102 "content": {
103 "application/json": {
104 "schema": {
105 "$ref": "#/components/schemas/inputSchema"
106 }
107 }
108 }
109 },
110 "parameters": [
111 {
112 "name": "token",
113 "in": "query",
114 "required": true,
115 "schema": {
116 "type": "string"
117 },
118 "description": "Enter your Apify token here"
119 }
120 ],
121 "responses": {
122 "200": {
123 "description": "OK"
124 }
125 }
126 }
127 }
128 },
129 "components": {
130 "schemas": {
131 "inputSchema": {
132 "type": "object",
133 "properties": {
134 "startUrls": {
135 "title": "URLs",
136 "type": "array",
137 "description": "One or more URLs of the pages where the crawler will start. Note that the Actor will additionally only crawl sub-pages of these URLs. For example, for the start URL `https://www.example.com/blog`, it will crawl pages like `https://example.com/blog/article-1`, but will skip `https://example.com/docs/something-else`.",
138 "items": {
139 "type": "object",
140 "required": [
141 "url"
142 ],
143 "properties": {
144 "url": {
145 "type": "string",
146 "title": "URL of a web page",
147 "format": "uri"
148 }
149 }
150 }
151 },
152 "crawlerType": {
153 "title": "Crawler type",
154 "enum": [
155 "playwright:chrome",
156 "cheerio",
157 "jsdom"
158 ],
159 "type": "string",
160 "description": "Select the crawling engine:\n- **Headless web browser** (default) - Useful for modern websites with anti-scraping protections and JavaScript rendering. It recognizes common blocking patterns like CAPTCHAs and automatically retries blocked requests through new sessions. However, running web browsers is more expensive as it requires more computing resources and is slower. It is recommended to use at least 8 GB of RAM.\n- **Raw HTTP client** - High-performance crawling mode that uses raw HTTP requests to fetch the pages. It is faster and cheaper, but it might not work on all websites.",
161 "default": "playwright:chrome"
162 },
163 "maxCrawlDepth": {
164 "title": "Max crawling depth",
165 "minimum": 0,
166 "type": "integer",
167 "description": "The maximum number of links starting from the start URL that the crawler will recursively descend. The start URLs have a depth of 0, the pages linked directly from the start URLs have a depth of 1, and so on.\n\nThis setting is useful to prevent accidental crawler runaway. By setting it to 0, the Actor will only crawl start URLs.",
168 "default": 20
169 },
170 "maxCrawlPages": {
171 "title": "Max pages",
172 "minimum": 0,
173 "type": "integer",
174 "description": "The maximum number pages to crawl. It includes the start URLs, pagination pages, pages with no content, etc. The crawler will automatically finish after reaching this number. This setting is useful to prevent accidental crawler runaway.",
175 "default": 9999999
176 }
177 }
178 },
179 "runsResponseSchema": {
180 "type": "object",
181 "properties": {
182 "data": {
183 "type": "object",
184 "properties": {
185 "id": {
186 "type": "string"
187 },
188 "actId": {
189 "type": "string"
190 },
191 "userId": {
192 "type": "string"
193 },
194 "startedAt": {
195 "type": "string",
196 "format": "date-time",
197 "example": "2025-01-08T00:00:00.000Z"
198 },
199 "finishedAt": {
200 "type": "string",
201 "format": "date-time",
202 "example": "2025-01-08T00:00:00.000Z"
203 },
204 "status": {
205 "type": "string",
206 "example": "READY"
207 },
208 "meta": {
209 "type": "object",
210 "properties": {
211 "origin": {
212 "type": "string",
213 "example": "API"
214 },
215 "userAgent": {
216 "type": "string"
217 }
218 }
219 },
220 "stats": {
221 "type": "object",
222 "properties": {
223 "inputBodyLen": {
224 "type": "integer",
225 "example": 2000
226 },
227 "rebootCount": {
228 "type": "integer",
229 "example": 0
230 },
231 "restartCount": {
232 "type": "integer",
233 "example": 0
234 },
235 "resurrectCount": {
236 "type": "integer",
237 "example": 0
238 },
239 "computeUnits": {
240 "type": "integer",
241 "example": 0
242 }
243 }
244 },
245 "options": {
246 "type": "object",
247 "properties": {
248 "build": {
249 "type": "string",
250 "example": "latest"
251 },
252 "timeoutSecs": {
253 "type": "integer",
254 "example": 300
255 },
256 "memoryMbytes": {
257 "type": "integer",
258 "example": 1024
259 },
260 "diskMbytes": {
261 "type": "integer",
262 "example": 2048
263 }
264 }
265 },
266 "buildId": {
267 "type": "string"
268 },
269 "defaultKeyValueStoreId": {
270 "type": "string"
271 },
272 "defaultDatasetId": {
273 "type": "string"
274 },
275 "defaultRequestQueueId": {
276 "type": "string"
277 },
278 "buildNumber": {
279 "type": "string",
280 "example": "1.0.0"
281 },
282 "containerUrl": {
283 "type": "string"
284 },
285 "usage": {
286 "type": "object",
287 "properties": {
288 "ACTOR_COMPUTE_UNITS": {
289 "type": "integer",
290 "example": 0
291 },
292 "DATASET_READS": {
293 "type": "integer",
294 "example": 0
295 },
296 "DATASET_WRITES": {
297 "type": "integer",
298 "example": 0
299 },
300 "KEY_VALUE_STORE_READS": {
301 "type": "integer",
302 "example": 0
303 },
304 "KEY_VALUE_STORE_WRITES": {
305 "type": "integer",
306 "example": 1
307 },
308 "KEY_VALUE_STORE_LISTS": {
309 "type": "integer",
310 "example": 0
311 },
312 "REQUEST_QUEUE_READS": {
313 "type": "integer",
314 "example": 0
315 },
316 "REQUEST_QUEUE_WRITES": {
317 "type": "integer",
318 "example": 0
319 },
320 "DATA_TRANSFER_INTERNAL_GBYTES": {
321 "type": "integer",
322 "example": 0
323 },
324 "DATA_TRANSFER_EXTERNAL_GBYTES": {
325 "type": "integer",
326 "example": 0
327 },
328 "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
329 "type": "integer",
330 "example": 0
331 },
332 "PROXY_SERPS": {
333 "type": "integer",
334 "example": 0
335 }
336 }
337 },
338 "usageTotalUsd": {
339 "type": "number",
340 "example": 0.00005
341 },
342 "usageUsd": {
343 "type": "object",
344 "properties": {
345 "ACTOR_COMPUTE_UNITS": {
346 "type": "integer",
347 "example": 0
348 },
349 "DATASET_READS": {
350 "type": "integer",
351 "example": 0
352 },
353 "DATASET_WRITES": {
354 "type": "integer",
355 "example": 0
356 },
357 "KEY_VALUE_STORE_READS": {
358 "type": "integer",
359 "example": 0
360 },
361 "KEY_VALUE_STORE_WRITES": {
362 "type": "number",
363 "example": 0.00005
364 },
365 "KEY_VALUE_STORE_LISTS": {
366 "type": "integer",
367 "example": 0
368 },
369 "REQUEST_QUEUE_READS": {
370 "type": "integer",
371 "example": 0
372 },
373 "REQUEST_QUEUE_WRITES": {
374 "type": "integer",
375 "example": 0
376 },
377 "DATA_TRANSFER_INTERNAL_GBYTES": {
378 "type": "integer",
379 "example": 0
380 },
381 "DATA_TRANSFER_EXTERNAL_GBYTES": {
382 "type": "integer",
383 "example": 0
384 },
385 "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
386 "type": "integer",
387 "example": 0
388 },
389 "PROXY_SERPS": {
390 "type": "integer",
391 "example": 0
392 }
393 }
394 }
395 }
396 }
397 }
398 }
399 }
400 }
401}
Schema Markup Extractor OpenAPI definition
OpenAPI is a standard for designing and describing RESTful APIs, allowing developers to define API structure, endpoints, and data formats in a machine-readable way. It simplifies API development, integration, and documentation.
OpenAPI is effective when used with AI agents and GPTs by standardizing how these systems interact with various APIs, for reliable integrations and efficient communication.
By defining machine-readable API specifications, OpenAPI allows AI models like GPTs to understand and use varied data sources, improving accuracy. This accelerates development, reduces errors, and provides context-aware responses, making OpenAPI a core component for AI applications.
You can download the OpenAPI definitions for SEO/GEO - Schema Markup Scraper from the options below:
If you’d like to learn more about how OpenAPI powers GPTs, read our blog post.
You can also check out our other API clients:
Pricing
Pricing model
RentalTo use this Actor, you have to pay a monthly rental fee to the developer. The rent is subtracted from your prepaid usage every month after the free trial period. You also pay for the Apify platform usage.
Free trial
1 day
Price
$12.00