🐝 BeeAI agent

Example of how to use Bee Agent Framework with Apify Actors to create a social media analysis agent.

src/main.ts

1import { ChatOpenAI } from '@langchain/openai';
2import { Actor, log } from 'apify';
3import { LangChainChatModel } from 'bee-agent-framework/adapters/langchain/backend/chat';
4import { OpenAIChatModel } from 'bee-agent-framework/adapters/openai/backend/chat';
5import { BeeAgent } from 'bee-agent-framework/agents/bee/agent';
6import { UnconstrainedMemory } from 'bee-agent-framework/memory/unconstrainedMemory';
7import { z } from 'zod';
8
9import { StructuredOutputGenerator } from './structured_response_generator.js';
10import { CalculatorSumTool } from './tools/calculator.js';
11import { InstagramScrapeTool } from './tools/instagram.js';
12
13// This is an ESM project, and as such, it requires you to specify extensions in your relative imports.
14// Read more about this here: https://nodejs.org/docs/latest-v18.x/api/esm.html#mandatory-file-extensions
15// Note that we need to use `.js` even when inside TS files
16// import { router } from './routes.js';
17
18// Actor input schema
19interface Input {
20    query: string;
21    modelName: string;
22    debug?: boolean;
23}
24
25// The init() call configures the Actor for its environment. It's recommended to start every Actor with an init().
26await Actor.init();
27
28// Charge for Actor start
29await Actor.charge({ eventName: 'actor-start' });
30
31// Handle input
32const {
33    // The query default value is provided only for template testing purposes.
34    // You can remove it.
35    query,
36    modelName,
37    debug,
38} = (await Actor.getInput()) as Input;
39if (debug) {
40    log.setLevel(log.LEVELS.DEBUG);
41}
42if (!query) {
43    throw new Error('An agent query is required.');
44}
45
46/**
47 * Actor code
48 */
49// Create a ReAct agent that can use tools.
50// See https://i-am-bee.github.io/bee-agent-framework/#/agents?id=bee-agent
51// In order to use PPE, the LangChain adapter must be used
52// otherwise, the token usage is not tracked.
53log.debug(`Using model: ${modelName}`);
54const llm = new LangChainChatModel(new ChatOpenAI({ model: modelName }));
55// The LangChain adapter does not work with the structured output generation
56// for some reason.
57// Create a separate LLM for structured output generation.
58const llmStructured = new OpenAIChatModel(modelName);
59const agent = new BeeAgent({
60    llm,
61    memory: new UnconstrainedMemory(),
62    tools: [new CalculatorSumTool(), new InstagramScrapeTool()],
63});
64
65// Store tool messages for later structured output generation.
66// This can be removed if you don't need structured output.
67const structuredOutputGenerator = new StructuredOutputGenerator(llmStructured);
68
69// Prompt the agent with the query.
70// Debug log agent status updates, e.g., thoughts, tool calls, etc.
71const response = await agent.run({ prompt: query }).observe((emitter) => {
72    emitter.on('update', async ({ update }) => {
73        log.debug(`Agent (${update.key}) 🤖 : ${update.value}`);
74
75        // Save tool messages for later structured output generation.
76        // This can be removed if you don't need structured output.
77        if (['tool_name', 'tool_output', 'tool_input'].includes(update.key as string)) {
78            structuredOutputGenerator.processToolMessage(
79                update.key as 'tool_name' | 'tool_output' | 'tool_input',
80                update.value,
81            );
82        }
83        // End of tool message saving.
84    });
85});
86
87log.info(`Agent 🤖 : ${response.result.text}`);
88
89// Hacky way to get the structured output.
90// Using the stored tool messages and the user query to create a structured output.
91const structuredResponse = await structuredOutputGenerator.generateStructuredOutput(
92    query,
93    z.object({
94        totalLikes: z.number(),
95        totalComments: z.number(),
96        mostPopularPosts: z.array(
97            z.object({
98                url: z.string(),
99                likes: z.number(),
100                comments: z.number(),
101                timestamp: z.string(),
102                caption: z.string().nullable().optional(),
103                alt: z.string().nullable().optional(),
104            }),
105        ),
106    }),
107);
108log.debug(`Structured response: ${JSON.stringify(structuredResponse)}`);
109
110// Charge for task completion
111await Actor.charge({ eventName: 'task-completed' });
112
113// Push results to the dataset.
114await Actor.pushData({
115    query,
116    response: response.result.text,
117    // This can be removed if you don't need structured output.
118    structuredResponse: structuredResponse.object,
119});
120log.info('Pushed the data into the dataset!');
121
122// Gracefully exit the Actor process. It's recommended to quit all Actors with an exit().
123await Actor.exit();

TypeScript BeeAI agent Template

A template for BeeAI agent projects in TypeScript for building AI agents with Apify Actors. This template offers a structured setup and an example ReAct agent utilizing Instagram Scraper and a calculator tool in a workflow context.

How it Works

A ReAct agent is employed, equipped with tools to respond to user queries. The agent processes a user query, decides on the tools to use, and in what sequence, to achieve the desired outcome. Here, the agent leverages an Instagram Scraper to fetch posts from a profile and a calculator tool to compute sums, such as totaling likes or comments. The agent produces textual and structured output, which is saved to a dataset.

How to Use

Add or modify tools in the src/tool_calculator.ts and src/tool_instagram.ts files, and ensure they are included in the agent's tool list in src/main.ts. Additionally, you can update the agent's system prompt or other configurations within src/main.ts. For more information, refer to the BeeAI documentation.

Pay Per Event

This template uses the Pay Per Event (PPE) monetization model, which provides flexible pricing based on defined events.

To charge users, define events in JSON format and save them on the Apify platform. Here is an example schema with the task-completed event:

[
    {
        "task-completed": {
            "eventTitle": "Task completed",
            "eventDescription": "Cost per query answered.",
            "eventPriceUsd": 0.1
        }
    }
]

In the Actor, trigger the event with:

await Actor.charge({ eventName: 'task-completed' });

This approach allows you to programmatically charge users directly from your Actor, covering the costs of execution and related services, such as LLM input/output tokens.

To set up the PPE model for this Actor:

Configure the OpenAI API key environment variable: provide your OpenAI API key to the OPENAI_API_KEY in the Actor's Environment variables.
Configure Pay Per Event: establish the Pay Per Event pricing schema in the Actor's Monetization settings. First, set the Pricing model to Pay per event and add the schema. An example schema can be found in .actor/pay_per_event.json.

Included Features

Apify SDK for JavaScript - a toolkit for building Apify Actors and scrapers in JavaScript
Input schema - define and easily validate a schema for your Actor's input
Dataset - store structured data where each object stored has the same attributes
Key-value store - store any kind of data, such as JSON documents, images, or text files

Resources

Start with TypeScript

Scrape single page with provided URL with Axios and extract data from page's HTML with Cheerio.

Starter

Crawlee + Cheerio

A scraper example that uses Cheerio to parse HTML. It's fast, but it can't run the website's JavaScript or pass JS anti-scraping challenges.

Crawlee + Puppeteer + Chrome

Example of a Puppeteer and headless Chrome web scraper. Headless browsers render JavaScript and are harder to block, but they're slower than plain HTTP.

Crawlee + Playwright + Chrome

Web scraper example with Crawlee, Playwright and headless Chrome. Playwright is more modern, user-friendly and harder to block than Puppeteer.

Crawlee + Playwright + Camoufox

Web scraper example with Crawlee, Playwright and headless Camoufox. Camoufox is a custom stealthy fork of Firefox. Try this template if you're facing anti-scraping challenges.

Playwright + Chrome Test Runner

Example of using the Playwright Test project to run automated website tests in the cloud and display their results. Usable as an API.

Already have a solution in mind?

Sign up for a free Apify account and deploy your code to the platform in just a few minutes! If you want a head start without coding it yourself, browse our Store of existing solutions.

Import your code Go to store