![Article Text Extractor avatar](https://images.apifyusercontent.com/AzZcy5YOUgCalA6CbIcjWm-tIJ44n5qwokMB93aw0Kw/rs:fill:92:92/aHR0cHM6Ly9hcGlmeS1pbWFnZS11cGxvYWRzLXByb2QuczMuYW1hem9uYXdzLmNvbS9STHZoTUtUTkZudWVHQm9hYS9MNE1mRFJSU0xxOE14UTJaUC1hcnRpY2xlX2V4dHJhY3Rvci0wMS5wbmc.webp)
Article Text Extractor
Try for free
No credit card required
View all Actors![Article Text Extractor](https://images.apifyusercontent.com/AzZcy5YOUgCalA6CbIcjWm-tIJ44n5qwokMB93aw0Kw/rs:fill:92:92/aHR0cHM6Ly9hcGlmeS1pbWFnZS11cGxvYWRzLXByb2QuczMuYW1hem9uYXdzLmNvbS9STHZoTUtUTkZudWVHQm9hYS9MNE1mRFJSU0xxOE14UTJaUC1hcnRpY2xlX2V4dHJhY3Rvci0wMS5wbmc.webp)
![Article Text Extractor](https://images.apifyusercontent.com/AzZcy5YOUgCalA6CbIcjWm-tIJ44n5qwokMB93aw0Kw/rs:fill:92:92/aHR0cHM6Ly9hcGlmeS1pbWFnZS11cGxvYWRzLXByb2QuczMuYW1hem9uYXdzLmNvbS9STHZoTUtUTkZudWVHQm9hYS9MNE1mRFJSU0xxOE14UTJaUC1hcnRpY2xlX2V4dHJhY3Rvci0wMS5wbmc.webp)
Article Text Extractor
mtrunkat/article-text-extractor
Try for free
No credit card required
Simply extracts article texts and other meta info from the given URL. Uses https://github.com/ageitgey/node-unfluff which is a NodeJS implementation of https://github.com/grangier/python-goose.
The code examples below show how to run the Actor and get its results. To run the code, you need to have an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token, which you can find under Settings > Integrations in Apify Console. Learn more
1# Set API token
2API_TOKEN=<YOUR_API_TOKEN>
3
4# Prepare Actor input
5cat > input.json <<'EOF'
6{
7 "url": "https://www.bbc.com/news/world-asia-china-48659073"
8}
9EOF
10
11# Run the Actor using an HTTP API
12# See the full API reference at https://docs.apify.com/api/v2
13curl "https://api.apify.com/v2/acts/mtrunkat~article-text-extractor/runs?token=$API_TOKEN" \
14 -X POST \
15 -d @input.json \
16 -H 'Content-Type: application/json'
Developer
Maintained by Community
Actor metrics
- 22 monthly users
- 8 stars
- 99.7% runs succeeded
- 7.3 hours response time
- Created in Mar 2018
- Modified 10 months ago
Categories