Free Kimi 2.5 Api
Secure, scalable, pay-per-use access to high-performance Large Language Model APIs through a single unified gateway. No infrastructure, no model hosting, no setup complexity.

Pricing: from $0.01 / 1,000 results
Developer: Akash Kumar Naik (Maintained by Community)


Free Kimi 2.5 API — Free LLM API Access to MoonshotAI Kimi K2.5

Get free LLM API access to MoonshotAI Kimi K2.5, a powerful open-source AI model with advanced reasoning capabilities. This free AI API provides instant access to Kimi AI without infrastructure setup, GPU requirements, or complex configuration. Perfect for developers building AI-powered applications, automation workflows, and chatbots.

Key Features

  • Free LLM API Access — No-cost access to Kimi K2.5 through NVIDIA endpoints
  • Advanced Reasoning Mode — Enable thinking capabilities for complex problem-solving
  • 16K Token Context — Large context window for detailed analysis and long documents
  • Streaming Responses — Real-time token generation for interactive applications
  • Zero Infrastructure — No servers, GPUs, or ML ops required
  • Pay-Per-Event Pricing — Only pay for what you use with transparent pricing
  • LangChain Integration — Built on LangChain for seamless AI workflow integration

What is Kimi K2.5?

MoonshotAI Kimi K2.5 is a state-of-the-art open-source large language model (LLM) that rivals commercial AI models like GPT-4 and Claude. With a 16,384-token context window and native reasoning/thinking capabilities, Kimi K2.5 excels at:

  • Code generation and programming assistance
  • Complex reasoning and problem-solving
  • Document analysis and summarization
  • Creative writing and content creation
  • Research and academic tasks
  • AI chatbots and conversational agents

Use Cases

AI-Powered Development

  • Code review and debugging — Get instant feedback on your code
  • API documentation generation — Automatically create docs from code
  • Programming tutorials — Generate step-by-step coding guides
  • Algorithm explanations — Understand complex computer science concepts

Content Creation & Marketing

  • Blog post generation — Create SEO-optimized articles
  • Social media content — Generate engaging posts for Twitter, LinkedIn
  • Email templates — Craft professional business communications
  • Product descriptions — Write compelling marketing copy

Research & Analysis

  • Document summarization — Extract key insights from long texts
  • Data analysis — Interpret complex datasets with reasoning
  • Academic research — Assist with literature reviews and analysis
  • Market research — Analyze trends and generate reports

Automation & Workflows

  • Chatbot development — Build intelligent conversational agents
  • Customer support automation — Create AI-powered helpdesk solutions
  • Workflow automation — Integrate AI into business processes
  • Data extraction — Parse and structure unstructured text

Available Model

| Model | Max Tokens | Reasoning | Best For |
| --- | --- | --- | --- |
| MoonshotAI Kimi K2.5 | 16,384 | ✅ Yes | General purpose, coding, reasoning, analysis |

Input Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | Yes | (none) | Your message or question to the AI |
| systemMessage | string | No | "You are a helpful AI assistant..." | Sets the AI's behavior and persona |
| temperature | number | No | 1.0 | Controls creativity (0.0 = focused, 2.0 = creative) |
| topP | number | No | 1.0 | Nucleus sampling for output diversity |
| maxTokens | integer | No | 16384 | Maximum response length |
| enableThinking | boolean | No | true | Enable reasoning mode for complex tasks |

Parameter Guidelines

Temperature Settings:

  • 0.1 - 0.3 — Precise, deterministic responses (best for coding, facts)
  • 0.7 - 1.0 — Balanced creativity and coherence (general purpose)
  • 1.2 - 2.0 — Highly creative, exploratory responses (brainstorming, creative writing)

Max Tokens:

  • 256 - 512 — Short answers, tweets, brief summaries
  • 1024 - 2048 — Medium responses, blog paragraphs, explanations
  • 4096 - 16384 — Long-form content, detailed analysis, code generation
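The guidelines above can be captured in a small helper that maps a task type to a parameter preset. This is an illustrative sketch, not part of the Actor's API: the task names, preset values, and `build_input` function are hypothetical.

```python
# Hypothetical presets derived from the temperature and maxTokens guidelines
# above. Adjust the values to your own workload; nothing here is mandated by
# the Actor itself.
PRESETS = {
    "coding": {"temperature": 0.2, "maxTokens": 4096},    # precise, deterministic
    "general": {"temperature": 0.8, "maxTokens": 1024},   # balanced
    "creative": {"temperature": 1.3, "maxTokens": 2048},  # exploratory
}

def build_input(prompt: str, task: str = "general") -> dict:
    """Return an Actor run input using the preset for the given task type."""
    preset = PRESETS.get(task, PRESETS["general"])
    return {"prompt": prompt, **preset}
```

For example, `build_input("Refactor this function", "coding")` yields a low-temperature input suited to code tasks.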

Output

The Actor returns a structured JSON response:

{
  "prompt": "Explain quantum computing in simple terms",
  "response": "Quantum computing is a type of computing that uses quantum mechanics principles...",
  "reasoningContent": "Let me break this down step by step. First, I need to explain what makes quantum computers different from regular computers...",
  "model": "moonshotai/kimi-k2.5",
  "timestamp": "2026-01-27T13:30:00+00:00"
}

Output Schema Views

Results are automatically organized in multiple formats:

| View | Description | Best For |
| --- | --- | --- |
| Dataset Results | All responses in structured format | Data export, analysis, bulk processing |
| Overview | Summary table of all interactions | Quick review, monitoring |
| Detailed View | Full responses with reasoning | Debugging, understanding AI decisions |
| Output Data | Single JSON object | API integration, automation |
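Consuming an item in the documented output shape is straightforward. A minimal sketch, using only the field names shown above (the sample payload is the one from the Output section):

```python
import json

# A dataset item in the shape documented above.
raw = """{
  "prompt": "Explain quantum computing in simple terms",
  "response": "Quantum computing is a type of computing that uses quantum mechanics principles...",
  "reasoningContent": "Let me break this down step by step...",
  "model": "moonshotai/kimi-k2.5",
  "timestamp": "2026-01-27T13:30:00+00:00"
}"""

item = json.loads(raw)
answer = item["response"]
# reasoningContent is populated when thinking mode is enabled; fall back safely.
reasoning = item.get("reasoningContent", "")
```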

Usage Examples

Basic Query

{
  "prompt": "What are the benefits of renewable energy?"
}

Programming Help

{
  "prompt": "Write a Python function to calculate the Fibonacci sequence",
  "systemMessage": "You are an expert Python programmer. Provide clean, well-documented code with explanations.",
  "temperature": 0.3
}

Creative Writing

{
  "prompt": "Write a short story about a robot learning to paint",
  "temperature": 1.2,
  "maxTokens": 2048
}

Complex Reasoning

{
  "prompt": "Analyze the pros and cons of remote work for tech companies",
  "enableThinking": true,
  "systemMessage": "You are a business analyst. Provide structured, balanced analysis."
}

Cost & Pricing

This Actor uses Pay-Per-Event (PPE) pricing:

| Event | Price | Description |
| --- | --- | --- |
| Actor Start | $0.00005 | Charged when the Actor begins running |
| LLM Request | $0.01 | Charged per API call to the LLM |

Example Costs:

  • 100 requests: ~$1.00
  • 1,000 requests: ~$10.00
  • 10,000 requests: ~$100.00

Note: Pricing covers platform costs and API usage. No hidden fees or subscriptions.

Integration Options

LangChain Integration

from langchain_nvidia_ai_endpoints import ChatNVIDIA

client = ChatNVIDIA(
    model="moonshotai/kimi-k2.5",
    api_key="your-api-key",
    temperature=1,
    max_tokens=16384,
)

response = client.invoke([{"role": "user", "content": "Hello!"}])

API Integration

Call the Actor programmatically via Apify API:

curl -X POST https://api.apify.com/v2/acts/free-kimi-2-5-api/runs \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Your question here"
  }'
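The same call can be made from Python with only the standard library. A sketch, assuming the Actor endpoint shown in the curl example above (replace the ID with the one from your Apify Console if it differs):

```python
import json
import urllib.request

# Endpoint taken from the curl example above; the Actor ID is assumed.
API_URL = "https://api.apify.com/v2/acts/free-kimi-2-5-api/runs"

def make_request(prompt: str, token: str) -> urllib.request.Request:
    """Build the POST request; the payload mirrors the curl example."""
    body = json.dumps({"prompt": prompt}).encode()
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

if __name__ == "__main__":
    # Sends the request; requires a valid Apify API token.
    with urllib.request.urlopen(make_request("Your question here", "YOUR_API_TOKEN")) as resp:
        print(json.load(resp))
```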

Webhook Integration

Set up webhooks to automatically process results:

{
  "prompt": "Analyze this data",
  "webhook": "https://your-app.com/webhook"
}

Local Development

Prerequisites

  • Python 3.11+
  • pip package manager
  • Apify CLI (npm install -g apify-cli)

Installation

# Clone the repository
git clone <repository-url>
cd free-kimi-2-5-api
# Install dependencies
pip install -r requirements.txt
# Run locally
apify run

Testing

Create a test input file:

{
  "prompt": "Test message",
  "temperature": 0.7
}

Run with test input:

apify run --input-file=test-input.json

Schema Configuration

This Actor implements Apify's schema system for structured output:

Dataset Schema

  • Specification Version: 1
  • Fields: prompt, response, reasoningContent, model, timestamp
  • Views: Overview (tabular), Detail (with reasoning)

Key-Value Store Schema

  • Schema Version: 1
  • Collections: Responses, Configs, Output

Output Schema

  • Schema Version: 1
  • Templates: Dataset links, KV store access, API endpoints

Troubleshooting

Common Issues

"Failed to generate response"

  • Check your prompt is not empty
  • Verify maxTokens is within the valid range (1-16384)
  • Try reducing temperature if responses seem off

Empty or truncated responses

  • Increase maxTokens parameter
  • Check if prompt fits within context window
  • Enable thinking mode for better reasoning

Slow response times

  • Reduce maxTokens for faster generation
  • Disable thinking mode if not needed
  • Check your internet connection

Best Practices

For Best Results

  1. Use clear, specific prompts — The more context you provide, the better the response
  2. Adjust temperature based on task — Lower for facts, higher for creativity
  3. Enable thinking for complex tasks — Helps with reasoning and step-by-step problems
  4. Use system messages — Guide the AI's behavior for consistent results
  5. Start with lower maxTokens — Increase if you need longer responses

Rate Limits & Usage

  • Respect spending limits set in your Apify account
  • Monitor your usage in the Apify Console
  • Use batch processing for multiple requests
  • Implement caching for repeated queries
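Caching repeated queries can be as simple as memoizing your request wrapper. A minimal sketch: `call_llm` below is a hypothetical stand-in for your actual Actor call, with a counter to show that duplicate prompts do not trigger a second (billed) request.

```python
from functools import lru_cache

calls = 0  # counts how many "real" requests were made

def call_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real Actor call; swap in your API wrapper."""
    global calls
    calls += 1
    return f"response to: {prompt}"

@lru_cache(maxsize=256)
def cached_llm(prompt: str) -> str:
    # Identical prompts are served from the cache, avoiding a second charge.
    return call_llm(prompt)

cached_llm("What is Kimi K2.5?")
cached_llm("What is Kimi K2.5?")  # cache hit; call_llm runs only once
```

Note that exact-string caching only helps when prompts repeat verbatim; normalize or hash prompts if your inputs vary in whitespace or casing.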

Changelog

v0.1.1

  • Added proper output schema with multiple view templates
  • Fixed dataset schema with actorSpecification version 1
  • Fixed key-value store schema with proper collections
  • Added schema views: Overview and Detail for dataset
  • Added collections: responses, configs, and output for key-value store
  • Updated actor.json to reference all schema files correctly

v0.1.0

  • Initial release
  • Support for MoonshotAI Kimi K2.5 model
  • Thinking/reasoning mode support
  • Streaming responses
  • Configurable generation parameters
  • Pay-per-event monetization

License

MIT License — feel free to use and modify as needed.

Keywords: Free LLM API, Kimi AI, MoonshotAI, AI API, Free AI API, LLM reasoning, Chat API, NVIDIA API, LangChain, AI automation, GPT alternative, Open source LLM, AI model API, Text generation API, Conversational AI