# Article Scraper — Extract Clean Text & Markdown

**Use case:** 

Scrape clean article text, title, author, and date from any URL as Markdown, HTML, or text. Bulk-extract for RAG, AI agents, and content monitoring.

## Input

```json
{
  "urls": [
    {
      "url": "https://blog.apify.com/what-is-web-scraping/"
    }
  ],
  "outputFormat": "markdown",
  "maxItems": 10,
  "extractImages": true,
  "extractLinks": false,
  "timeout": 30,
  "maxConcurrency": 5
}
```

## Output

```json
{
  "title": {
    "label": "Title",
    "format": "string"
  },
  "author": {
    "label": "Author",
    "format": "string"
  },
  "publishedDate": {
    "label": "Published Date",
    "format": "string"
  },
  "wordCount": {
    "label": "Word Count",
    "format": "integer"
  },
  "language": {
    "label": "Language",
    "format": "string"
  },
  "siteName": {
    "label": "Site Name",
    "format": "string"
  },
  "url": {
    "label": "URL",
    "format": "string"
  }
}
```

## About this Actor

This example demonstrates how to use [Article Extraction API](https://apify.com/tugelbay/article-extractor) with a specific input configuration. Visit the [Actor detail page](https://apify.com/tugelbay/article-extractor) to learn more, explore other use cases, and run it yourself.