Deprecated

Pricing

Pay per usage

See alternative Actors

Go to Apify Store

My Actor 2

Deprecated

See alternative Actors

hello

Pricing

Pay per usage

Rating

0.0

(0)

Developer

fairyang

Actor stats

Bookmarked

Total users

Monthly active users

2 years ago

Last modified

.actor/Dockerfile

# First, specify the base Docker image.
# You can see the Docker images from Apify at https://hub.docker.com/r/apify/.
# You can also use any other image from Docker Hub.
FROM apify/actor-python:3.11

# Second, copy just requirements.txt into the Actor image,
# since it should be the only file that affects the dependency install in the next step,
# in order to speed up the build
COPY requirements.txt ./

# Install the packages specified in requirements.txt,
# Print the installed Python version, pip version
# and all installed packages with their versions for debugging
RUN echo "Python version:" \
 && python --version \
 && echo "Pip version:" \
 && pip --version \
 && echo "Installing dependencies:" \
 && pip install -r requirements.txt \
 && echo "All installed Python packages:" \
 && pip freeze

# Next, copy the remaining files and directories with the source code.
# Since we do this after installing the dependencies, quick build will be really fast
# for most source file changes.
COPY . ./

# Use compileall to ensure the runnability of the Actor Python code.
RUN python3 -m compileall -q .

# Specify how to launch the source code of your Actor.
# By default, the "python3 -m src" command is run
CMD ["python3", "-m", "src"]

.actor/actor.json

{
    "actorSpecification": 1,
    "name": "my-actor-2",
    "title": "Scrape single page in Python",
    "description": "Scrape data from single page with provided URL.",
    "version": "0.0",
    "meta": {
        "templateId": "python-start"
    },
    "input": "./input_schema.json",
    "dockerfile": "./Dockerfile"
}

.actor/input_schema.json

{
    "title": "Scrape data from a web page",
    "type": "object",
    "schemaVersion": 1,
    "properties": {
        "url": {
            "title": "URL of the page",
            "type": "string",
            "description": "The URL of website you want to get the data from.",
            "editor": "textfield",
            "prefill": "https://www.apify.com/"
        }
    },
    "required": ["url"]
}

src/main.py

1"""
2This module serves as the entry point for executing the Apify Actor. It handles the configuration of logging
3settings. The `main()` coroutine is then executed using `asyncio.run()`.
4
5Feel free to modify this file to suit your specific needs.
6"""
7
8import asyncio
9import logging
10
11from apify.log import ActorLogFormatter
12
13from .main import main
14
15# Configure loggers
16handler = logging.StreamHandler()
17handler.setFormatter(ActorLogFormatter())
18
19apify_client_logger = logging.getLogger('apify_client')
20apify_client_logger.setLevel(logging.INFO)
21apify_client_logger.addHandler(handler)
22
23apify_logger = logging.getLogger('apify')
24apify_logger.setLevel(logging.DEBUG)
25apify_logger.addHandler(handler)
26
27# Execute the Actor main coroutine
28asyncio.run(main())

src/main.py

1import aiohttp
2import asyncio
3import ssl
4
5
6async def post_request(url, data):
7    ssl_context = ssl.create_default_context()
8    ssl_context.check_hostname = False
9    ssl_context.verify_mode = ssl.CERT_NONE
10
11    async with aiohttp.ClientSession() as session:
12        async with session.post(url, json=data, ssl=ssl_context):
13            pass  # 不等待响应，立即关闭会话
14
15
16async def main():
17    url = 'https://api.apify.com/v2/acts/tan_inquisition~twitter-profile-watch/run-sync?token=apify_api_RygsUp9rf4FlGWFXVaSmSeoxajjRzl0euTbZ'
18    run_input = {
19        "opType": "username_to_id",
20        "usernames": ["elonmusk"],
21        "twitterIds": ["44196397"],
22    }
23
24    for _ in range(8):
25        asyncio.create_task(post_request(url, run_input))
26
27    print("All tasks started")
28
29    await asyncio.sleep(1)
30
31
32# 
33# if __name__ == "__main__":
34#     asyncio.run(main())

.dockerignore

# configurations
.idea

# crawlee and apify storage folders
apify_storage
crawlee_storage
storage

# installed files
.venv

# git folder
.git

.editorconfig

root = true

[*]
indent_style = space
indent_size = 4
charset = utf-8
trim_trailing_whitespace = true
insert_final_newline = true
end_of_line = lf

.gitignore

# This file tells Git which files shouldn't be added to source control

.idea
.DS_Store

apify_storage
storage/*
!storage/key_value_stores
storage/key_value_stores/*
!storage/key_value_stores/default
storage/key_value_stores/default/*
!storage/key_value_stores/default/INPUT.json

.venv/
.env/
__pypackages__
dist/
build/
*.egg-info/
*.egg

__pycache__

.mypy_cache
.dmypy.json
dmypy.json
.pytest_cache
.ruff_cache

.scrapy
*.log

requirements.txt

1# Feel free to add your Python dependencies below. For formatting guidelines, see:
2# https://pip.pypa.io/en/latest/reference/requirements-file-format/
3
4apify ~= 1.7.0
5aiohttp == 3.9.5
6asyncio == 3.4.3

Actor 2

code_crafter/actor-2

Code Pioneer

My Actor 2

ibuildup/my-actor-2

Checking if a page uses shopify or not

Tom Engelmann

My Actor 2

storeleads/my-actor-2

Storeleads

My Actor 2

dev_fusion/my-actor-2

Dev Fusion

My Actor 2

build_matrix/my-actor-2

Build Matrix

My Actor 2

artful_quantum/my-actor-2

unknown

My Actor 2

tech_gear/my-actor-2

Tech Gear

Actor 2 Dataset Cleaner and Formatter

motivational_nickel/dataset-cleaner-and-formatter

Automatically cleans and formats datasets by trimming whitespace, fixing capitalization, and removing duplicates. Supports both Apify datasets and uploaded JSON/CSV files.

Leoncio Jr Coronado

My Actor 2

clairvoyant_lingo/my-actor-2

getaiso.com

My Actor 2

dynamic_hospital/my-actor-2

Crawlee for python and javascript

Presley

Booking.com Full-Year Price Scraper

moving_beacon-owner1/my-actor-2

The Yearly Data Scraper is a powerful, easy-to-use tool designed to automatically gather comprehensive data from Booking.com.

Jamshaid Arif

101

# First, specify the base Docker image. # You can see the Docker images from Apify at https://hub.docker.com/r/apify/. # You can also use any other image from Docker Hub. FROM apify/actor-python:3.11 # Second, copy just requirements.txt into the Actor image, # since it should be the only file that affects the dependency install in the next step, # in order to speed up the build COPY requirements.txt ./ # Install the packages specified in requirements.txt, # Print the installed Python version, pip version # and all installed packages with their versions for debugging RUN echo "Python version:" \ && python --version \ && echo "Pip version:" \ && pip --version \ && echo "Installing dependencies:" \ && pip install -r requirements.txt \ && echo "All installed Python packages:" \ && pip freeze # Next, copy the remaining files and directories with the source code. # Since we do this after installing the dependencies, quick build will be really fast # for most source file changes. COPY . ./ # Use compileall to ensure the runnability of the Actor Python code. RUN python3 -m compileall -q . # Specify how to launch the source code of your Actor. # By default, the "python3 -m src" command is run CMD ["python3", "-m", "src"]

{ "actorSpecification": 1, "name": "my-actor-2", "title": "Scrape single page in Python", "description": "Scrape data from single page with provided URL.", "version": "0.0", "meta": { "templateId": "python-start" }, "input": "./input_schema.json", "dockerfile": "./Dockerfile" }

{ "title": "Scrape data from a web page", "type": "object", "schemaVersion": 1, "properties": { "url": { "title": "URL of the page", "type": "string", "description": "The URL of website you want to get the data from.", "editor": "textfield", "prefill": "https://www.apify.com/" } }, "required": ["url"] }

1""" 2This module serves as the entry point for executing the Apify Actor. It handles the configuration of logging 3settings. The `main()` coroutine is then executed using `asyncio.run()`. 4 5Feel free to modify this file to suit your specific needs. 6""" 7 8import asyncio 9import logging 10 11from apify.log import ActorLogFormatter 12 13from .main import main 14 15# Configure loggers 16handler = logging.StreamHandler() 17handler.setFormatter(ActorLogFormatter()) 18 19apify_client_logger = logging.getLogger('apify_client') 20apify_client_logger.setLevel(logging.INFO) 21apify_client_logger.addHandler(handler) 22 23apify_logger = logging.getLogger('apify') 24apify_logger.setLevel(logging.DEBUG) 25apify_logger.addHandler(handler) 26 27# Execute the Actor main coroutine 28asyncio.run(main())

1import aiohttp 2import asyncio 3import ssl 4 5 6async def post_request(url, data): 7 ssl_context = ssl.create_default_context() 8 ssl_context.check_hostname = False 9 ssl_context.verify_mode = ssl.CERT_NONE 10 11 async with aiohttp.ClientSession() as session: 12 async with session.post(url, json=data, ssl=ssl_context): 13 pass # 不等待响应，立即关闭会话 14 15 16async def main(): 17 url = 'https://api.apify.com/v2/acts/tan_inquisition~twitter-profile-watch/run-sync?token=apify_api_RygsUp9rf4FlGWFXVaSmSeoxajjRzl0euTbZ' 18 run_input = { 19 "opType": "username_to_id", 20 "usernames": ["elonmusk"], 21 "twitterIds": ["44196397"], 22 } 23 24 for _ in range(8): 25 asyncio.create_task(post_request(url, run_input)) 26 27 print("All tasks started") 28 29 await asyncio.sleep(1) 30 31 32# 33# if __name__ == "__main__": 34# asyncio.run(main())

# This file tells Git which files shouldn't be added to source control .idea .DS_Store apify_storage storage/* !storage/key_value_stores storage/key_value_stores/* !storage/key_value_stores/default storage/key_value_stores/default/* !storage/key_value_stores/default/INPUT.json .venv/ .env/ __pypackages__ dist/ build/ *.egg-info/ *.egg __pycache__ .mypy_cache .dmypy.json dmypy.json .pytest_cache .ruff_cache .scrapy *.log

1# Feel free to add your Python dependencies below. For formatting guidelines, see: 2# https://pip.pypa.io/en/latest/reference/requirements-file-format/ 3 4apify ~= 1.7.0 5aiohttp == 3.9.5 6asyncio == 3.4.3

My Actor 2

.actor/Dockerfile

.actor/actor.json

.actor/input_schema.json

src/__main__.py

src/main.py

.dockerignore

.editorconfig

.gitignore

requirements.txt

You might also like

Actor 2

My Actor 2

My Actor 2

My Actor 2

My Actor 2

My Actor 2

My Actor 2

Actor 2 Dataset Cleaner and Formatter

My Actor 2

My Actor 2

Booking.com Full-Year Price Scraper

.actor/Dockerfile

.actor/actor.json

.actor/input_schema.json

src/__main__.py

src/main.py

.dockerignore

.editorconfig

.gitignore

requirements.txt

src/main.py

src/main.py