TikTok Data Extractor avatar

TikTok Data Extractor

Try for free

Pay $4.00 for 1,000 results

Go to Store
TikTok Data Extractor

TikTok Data Extractor

clockworks/free-tiktok-scraper
Try for free

Pay $4.00 for 1,000 results

Extract data about videos, users, and channels based on hashtags or scrape full user profiles including posts, total likes, name, nickname, numbers of comments, shares, followers, following, and more.

Do you want to learn more about this Actor?

Get a demo
ON

mediaUrls is empty

Open

onionfist opened this issue
15 days ago

The mediaUrl is [], even though I used the option "shouldDownloadVideos: true".

Normally, the mediaUrls contains one Apify CDN link. Only in about 10% of the cases, the mediaUrls is empty.

ruocco-l avatar

Hello, thank you for opening this issue. I've opened an internal ticket for this problem, I'll notify you when progress will be made.

ON

onionfist

6 days ago

Thanks Luigi. How can I access the internal ticket? I'm new to the Apify platform.

ruocco-l avatar

Oh no sorry for the miscommunication, I meant our team was working on it :D

So, in the run you showed when opening the issue is a very rare occurrence. We worked towards preventing it and now should be fixed.

It is not a 10% occurrence though, that makes me think that you probably misused the scraper. If you try to scrape a "video" from TikTok which is a slideshow you won't get the download unless you set the shouldDownloadSlideshowImages property to true. This will download the single images composing the slideshow in the KVStore and you'll have the links to the images in the slideshowImageLinks property of the output.

It is confusing because TikTok allows this kind of urls https://www.tiktok.com/@apifyoffice/video/7442238525333966102 (with video in it) even though they are slideshow (if you click on it you will see that you will be redirected to https://www.tiktok.com/@apifyoffice/photo/7442238525333966102, with photo in the url).

You can check if your older run where in fact scraping of slideshow by looking at the isSlideshow boolean property.

I'll wait for your response before closing this issue to make sure we sorted everything out. Please feel free to ask more question!

Happy scraping!

ON

onionfist

4 days ago

Dear Luigi,

I was aware of Tiktok's carousel / slideshow feature. However, for the run I mentioned in this issue, I provided the following as input:

https://www.tiktok.com/@mediocre_face/video/7447982644203130158

It doesn't seem to be a slideshow / carousel type video / photo. As you can see from the link, it's just a normal video.

Yet, mediaUrls was [] even with shouldDownloadVideos set to true.

Please investigate why the particular run failed. This was my input:

1{
2  "excludePinnedPosts": false,
3  "postURLs": [
4    "https://www.tiktok.com/@mediocre_face/video/7447982644203130158"
5  ],
6  "resultsPerPage": 1,
7  "shouldDownloadCovers": true,
8  "shouldDownloadSlideshowImages": false,
9  "shouldDownloadSubtitles": false,
10  "shouldDownloadVideos": true,
11  "profileScrapeSections": [
12    "videos"
13  ],
14  "profileSorting": "latest",
15  "searchSection": "",
16  "maxProfilesPerQuery": 10
17}
ruocco-l avatar

Yes, we investigated and we find out that the incidence of the mediaUrls being empty from a video (not a slideshow/carousel) is far lower than 10%, as I already said in my last post.

What happened in your run is, according to our analysis, something very very unlikely to happen and we still added new contingency measure to make this possibility even more unlikely.

With my last comment I was trying to make sense of your initial message and what we find out during our study. Can you confirm that what happened in the run you linked is in fact the only occurrence of this problem, while the other runs with mediaUrls were, in fact, slideshows/carousels?

Developer
Maintained by Apify

Actor Metrics

  • 1.7k monthly users

  • 286 stars

  • >99% runs succeeded

  • 1.8 days response time

  • Created in Feb 2022

  • Modified 5 days ago