GPT Scraper

Pay $9.00 for 1,000 pages

drobnikj/gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Only read one column

Closed

Benjaminh opened this issue
4 months ago

I want to integrate the Google Maps Extractor tool with GPT Scraper. The problem is that the Maps extractor provides website URLs and social media URLs (which I want to keep). How can I set up GPT Scraper so that it only reads the website URLs and not the social media URLs?

Hi, I don't have a full grasp of your data flow and I'm having trouble understanding your problem. Could you please provide some more detail? I'm unaware of any social media URL output field on Google Maps Scraper; do you mean the website URL field?

Generally, this seems like a data processing issue on your end. This Actor just takes the input as given; any data filtering in your pipeline should be done outside of this Actor. Please provide me with further details, I would love to help you out with this, thanks!

Benjaminh

4 months ago

https://console.apify.com/view/runs/1BUUA4ApefR0xyq8b

This is an example run that has just finished. As you can see, there are columns such as Instagram URLs, Facebook URLs, and Google Maps URLs, which I want to keep. But I only want GPT Scraper to read the column "website".

Thanks for the details. Yes, I understand now. Still, as I said, this is primarily an issue in your data processing. I don't know how you are starting this Actor in your integrations, but you will probably have to create a new custom orchestrator Actor for this. The new Actor would just take the desired field (website) from the Google Maps Scraper integration and feed it as input into this one.

If you can't set up the custom Actor yourself, you can always find someone in our Discord community. I could also help you out with it ;) Let me know how this works for you, thanks!
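
As a rough illustration of the orchestrator idea above, here is a minimal Python sketch of the filtering step only. It assumes the Google Maps items are plain dictionaries with a "website" field (the column named in this thread) and that GPT Scraper accepts a "startUrls" list of {"url": ...} objects plus an "instructions" string; those input field names, the helper name, and the sample rows are assumptions for illustration and should be checked against the Actor's input schema.

```python
from typing import Any


def build_gpt_scraper_input(maps_items: list[dict[str, Any]], instructions: str) -> dict[str, Any]:
    """Build a run input that contains only the website URLs (hypothetical helper)."""
    seen: set[str] = set()
    start_urls: list[dict[str, str]] = []
    for item in maps_items:
        website = item.get("website")
        # Skip places that have no usable website value.
        if not isinstance(website, str) or not website.strip():
            continue
        url = website.strip()
        # Avoid sending the same site to the scraper twice.
        if url in seen:
            continue
        seen.add(url)
        start_urls.append({"url": url})
    # "startUrls" and "instructions" are assumed input field names.
    return {"startUrls": start_urls, "instructions": instructions}


# Made-up rows shaped like Google Maps results; the social media
# field names here are placeholders, not the real output schema.
maps_items = [
    {"title": "Cafe A", "website": "https://example.com", "instagram": "https://instagram.com/cafe_a"},
    {"title": "Cafe B", "website": None, "facebook": "https://facebook.com/cafe_b"},
    {"title": "Cafe C", "website": "https://example.org "},
    {"title": "Cafe A (2nd branch)", "website": "https://example.com"},
]

gpt_input = build_gpt_scraper_input(maps_items, "Summarize what this business does.")
```

The original items are left untouched, so the social media URLs stay available in the Google Maps dataset; only the resulting dictionary would be passed as the run input when starting GPT Scraper from the orchestrator.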

Benjaminh

4 months ago

But that is the whole problem. I want to keep the other URLs. I only want GPT Scraper to read the 'website' column.

Oh I see. You will still have the other URLs in the original Google Maps Scraper run, but I understand that it might be better to have them in one place. You can modify the custom Actor to collect the dataset IDs of both the Google Maps and GPT Actor runs and merge the parts you want. To make this easier, you can use a utility Actor like the Merge, Dedup & Transform Datasets Actor.
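
If you would rather do the merge yourself, the join could look roughly like the Python sketch below. It assumes both datasets have already been downloaded as lists of dictionaries, that each GPT Scraper item records the page it read in a "url" field and its result in an "answer" field, and that the merged column is called "gptAnswer"; all three names and the sample rows are assumptions for illustration, not confirmed output fields.

```python
from typing import Any, Optional


def merge_by_website(
    maps_items: list[dict[str, Any]],
    gpt_items: list[dict[str, Any]],
    gpt_url_field: str = "url",        # assumed GPT Scraper output field
    answer_field: str = "answer",      # assumed GPT Scraper output field
) -> list[dict[str, Any]]:
    """Attach the GPT result to each Google Maps row, keeping every original column."""
    # Index GPT results by the URL they were scraped from.
    gpt_by_url = {row[gpt_url_field]: row for row in gpt_items if row.get(gpt_url_field)}
    merged: list[dict[str, Any]] = []
    for place in maps_items:
        row = dict(place)  # copy so the input rows are not modified
        gpt_row: Optional[dict[str, Any]] = gpt_by_url.get(place.get("website"))
        # Rows without a website, or without a matching result, get None.
        row["gptAnswer"] = gpt_row.get(answer_field) if gpt_row else None
        merged.append(row)
    return merged


# Made-up sample data for illustration only.
maps_rows = [
    {"title": "Cafe A", "website": "https://example.com", "instagram": "https://instagram.com/cafe_a"},
    {"title": "Cafe B", "website": None, "facebook": "https://facebook.com/cafe_b"},
]
gpt_rows = [{"url": "https://example.com", "answer": "A small coffee shop."}]

merged_rows = merge_by_website(maps_rows, gpt_rows)
```

This matches on the exact URL string, so redirects or trailing slashes may cause rows to miss their result; normalizing URLs before the join may be needed in practice.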

Hi,

I'm closing this due to inactivity.

If you are still having trouble with this, feel free to reopen it, thanks!

Developer
Maintained by Apify
Actor metrics
  • 314 monthly users
  • 97.1% runs succeeded
  • 9.8 days response time
  • Created in Mar 2023
  • Modified 12 days ago