Quick Instagram Posts Checker avatar
Quick Instagram Posts Checker

Pricing

Pay per usage

Go to Store
Quick Instagram Posts Checker

Quick Instagram Posts Checker

Developed by

Apify

Apify

Maintained by Apify

Fast Instagram stats (for profiles and post only) for bulk tracking and analytics

4.8 (5)

Pricing

Pay per usage

21

Total users

1.5K

Monthly users

92

Runs succeeded

>99%

Issues response

2.5 days

Last modified

3 months ago

MI

Incorrect Characters / Encoding Issues in Post Captions

Closed

aminfa92 opened this issue
a month ago

The "Quick Instagram Posts Checker" actor appears to be returning Instagram post captions with incorrect characters or encoding issues. Specifically, newline characters and some special punctuation marks (like apostrophes) are being replaced with other character sequences (e.g., nn for newline, u2019 for apostrophe). This makes the retrieved captions difficult to read and process programmatically.

Observed Behavior (Example):

For a post from the BMW Instagram page, the actor returned the following caption:

Getting THE shot has never been easier!nnThe BMW M440i xDrive Convertible.n#THE4 #BMW #4Series nnMandatory information according to german law u2019Pkw-EnVKVu2019 based on WLTP: energy consumption combined: 8,7 l/100 km; COu2082 emissions combined: 197 g/km; CO2-class(es): G

Expected Behavior (Correct Caption):

The correct caption for the post is:

Getting THE shot has never been easier!

The BMW M440i xDrive Convertible. #THE4 #BMW #4Series

Mandatory information according to german law ’Pkw-EnVKV’ based on WLTP: energy consumption combined: 8,7 l/100 km; CO₂ emissions combined: 197 g/km; CO2-class(es): G

Key Discrepancies Noted:

nn is appearing instead of a proper newline character (\n or a visual line break). u2019 is appearing instead of an apostrophe (’). u2082 is appearing instead of the subscript '2' (₂) in CO₂. Impact: This issue degrades the quality of the data retrieved by the actor. Users relying on this actor for accurate caption text will need to implement post-processing steps to clean the data, or they will have to deal with incorrectly formatted text.

Suggested Area of Investigation: Please review the text extraction and encoding/decoding mechanisms within the actor. It's possible that there's an issue with how Unicode characters or specific HTML entities are being handled during the scraping or data processing phase. Ensuring consistent UTF-8 encoding throughout the process might resolve this.

Instagram Post Short Code: DKH9i9vM3e7

alexey avatar

Hi!

The actor is saving internal Instagram data "as is" without any logic for formatting and mapping. If you need to change the output, please use external tools or actors.

I will close the issue now, but if there is anything else we can help with, please let us know.

MI

aminfa92

25 days ago

Hello,

Thank you for your response.

I've found that not only are captions affected, but all text fields appear to have corrupted Unicode characters. In some cases, this corruption prevents the original data from being correctly restored.

For comparison, I tested another Apify actor, apify/instagram-profile-scraper, using the same Instagram profile (kshop_online). The apify/instagram-profile-scraper successfully returns the correct, uncorrupted captions and other text fields.

Here are the links to the dataset responses from both actors for the username kshop_online for your comparison:

Response from apify/quick-instagram-posts-checker (showing corruption): https://api.apify.com/v2/datasets/h1gKO7ZZxDSn9BDjb/items?clean=true&format=json&limit=1000 Response from apify/instagram-profile-scraper (correct data): https://api.apify.com/v2/datasets/3dNOpOwB9jN6c1KLO/items?clean=true&format=json&limit=1000 I hope this information helps clarify the issue.

alexey avatar

Hi!

Sorry for the misunderstanding, this is the experimental actor, not maintained because of low popularity. Not a replacement for the regular Instagram profile scraper.

We just forgot to provide a clear statement, will change the description this week.

I will close the issue now, but if there is anything else we can help with, please let us know.