GPT Scraper

Pay $9.00 for 1,000 pages

drobnikj/gpt-scraper
Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Only read one column

Closed

Benjaminh opened this issue
a year ago

I want to integrate the Google Maps extractor tool with the GPT Scraper. The problem is that the Maps extractor provides website URLs and social media URLs (which I want to keep). How can I set up the GPT Scraper so that it only reads the website URLs and not the social media URLs?

lukas.prusa

Hi, I don't have a full grasp of your dataflow and I'm having trouble understanding your problem. Could you please provide some more detail? I'm unaware of any social media URL output field on Google Maps Scraper. Do you mean the website URL field?

Generally, this seems like a data processing issue on your end. This Actor should just take the input as given; any data filtering in your pipeline should be done outside of this Actor. Please provide me with further details. I would love to help you out with this, thanks!

Benjaminh

a year ago

https://console.apify.com/view/runs/1BUUA4ApefR0xyq8b

This is an example run that has just finished. As you can see, there are columns such as Instagram URLs, Facebook URLs, and Google Maps URLs, which I want to keep. But I only want the GPT Scraper to read the "website" column.

lukas.prusa

Thanks for the details. Yes, I understand now. Still, as I said, this is primarily an issue in your data processing part. I don't know how you are starting this Actor in your integrations, but you will probably have to create a new custom orchestrator Actor for this. The new Actor will basically just take the desired field (website) from the Google Maps Scraper's integration and feed it as input into this one.

If you can't set up the custom Actor yourself, you can always find someone in our Discord community. I could also help you out with it ;) Let me know how this works for you, thanks!
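For illustration, the field-picking step of such an orchestrator could look like the sketch below. It is only a minimal example under stated assumptions: the sample items and the social media field names (`instagrams`, `facebooks`) are hypothetical, and the `startUrls` and `instructions` input keys are assumed rather than taken from this thread, so check them against the Actor's input schema. Fetching the dataset items and starting the GPT Scraper run are left out.

```python
from typing import Any


def build_start_urls(
    places: list[dict[str, Any]], field: str = "website"
) -> list[dict[str, str]]:
    """Collect unique, non-empty URLs from a single field of each dataset item."""
    seen: set[str] = set()
    start_urls: list[dict[str, str]] = []
    for place in places:
        url = place.get(field)
        # Skip items where the field is missing, empty, or not a string.
        if not isinstance(url, str) or not url.strip():
            continue
        url = url.strip()
        if url in seen:
            continue
        seen.add(url)
        start_urls.append({"url": url})
    return start_urls


# Hypothetical items shaped like a Google Maps Scraper dataset.
places = [
    {"title": "Cafe A", "website": "https://cafe-a.example",
     "instagrams": ["https://instagram.com/cafea"]},
    {"title": "Cafe B", "website": None,
     "facebooks": ["https://facebook.com/cafeb"]},
    {"title": "Cafe C", "website": "https://cafe-a.example"},  # duplicate URL
    {"title": "Cafe D", "website": " https://cafe-d.example "},
]

start_urls = build_start_urls(places)

# Assumed input shape for the GPT Scraper run; the keys are not confirmed here.
gpt_input = {
    "startUrls": start_urls,
    "instructions": "Summarize what this business offers.",
}
print(gpt_input["startUrls"])
```

Only the "website" values reach the GPT Scraper input, while the original items, including their social media URLs, stay untouched in the Google Maps Scraper dataset.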

Benjaminh

a year ago

But that is the whole problem. I want to keep the other URLs. I only want the GPT Scraper to read the "website" column.

lukas.prusa

Oh, I see. You will still have the other URLs in the original Google Maps Scraper run, but I understand that it might be better to have everything in one place. You can modify the custom Actor to collect the dataset IDs of both the Google Maps Scraper run and the GPT Scraper run, and then merge the parts you want. To make this easier, you can use a utility Actor like the Merge, Dedup & Transform Datasets Actor.
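If you prefer to do the merge inside the custom Actor, the join itself can be sketched as below. This is a rough example, not the utility Actor's behavior: it assumes each GPT Scraper item carries the scraped page address in a `url` field and the model output in an `answer` field, and the `gptAnswer` key and the sample data are hypothetical. URLs are matched after trimming whitespace and a trailing slash, which will not catch redirects.

```python
from typing import Any


def normalize(value: Any) -> str:
    """Trim whitespace and a trailing slash so near-identical URLs match."""
    return value.strip().rstrip("/") if isinstance(value, str) else ""


def merge_by_website(
    places: list[dict[str, Any]],
    gpt_items: list[dict[str, Any]],
    place_key: str = "website",
    gpt_key: str = "url",
) -> list[dict[str, Any]]:
    """Attach the GPT result to each place, keeping all original fields."""
    by_url: dict[str, dict[str, Any]] = {}
    for item in gpt_items:
        key = normalize(item.get(gpt_key))
        if key:
            by_url[key] = item

    merged: list[dict[str, Any]] = []
    for place in places:
        row = dict(place)  # copy, so the social media URLs are preserved
        key = normalize(place.get(place_key))
        match = by_url.get(key) if key else None
        row["gptAnswer"] = match.get("answer") if match else None
        merged.append(row)
    return merged


# Hypothetical items from the two runs' datasets.
places = [
    {"title": "Cafe A", "website": "https://cafe-a.example/",
     "instagrams": ["https://instagram.com/cafea"]},
    {"title": "Cafe B", "website": None,
     "facebooks": ["https://facebook.com/cafeb"]},
]
gpt_items = [
    {"url": "https://cafe-a.example", "answer": "Specialty coffee shop."},
]

merged = merge_by_website(places, gpt_items)
for row in merged:
    print(row["title"], "->", row["gptAnswer"])
```

Places without a website, or without a matching GPT result, keep all their fields and get an empty `gptAnswer`, so nothing from the Google Maps run is lost.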

lukas.prusa

Hi,

I'm closing this due to inactivity.

If you are still having trouble with this, feel free to reopen the issue, thanks!
