AI Job Finder
Provide a prompt or a CV and find jobs tailored to you.
Rating: 0.0 (0)
Pricing: Pay per usage
Total users: 4
Monthly users: 3
Runs succeeded: >99%
Last modified: 2 months ago
requirements.txt
apify-client>=1.0.0,<2.0.0
openai>=1.0.0
anthropic>=0.5.0
google-generativeai==0.3.0
pydantic>=2.0.0
test.py
#!/usr/bin/env python3
import asyncio
import json
import sys
import os
from typing import Dict, Any, Optional

# Import from our modules
from src.llm_providers.factory import create_llm_provider
from src.cv_processor import process_cv
from src.prompt_processor import process_prompt
from src.parameter_handler import apply_parameter_defaults

async def test_cv_processing():
    """Test CV processing with a local file"""
    # Check if file path was provided
    if len(sys.argv) < 2:
        print("Usage: python test.py path/to/cv.pdf [prompt]")
        sys.exit(1)

    # Get CV file path and optional prompt
    cv_path = sys.argv[1]
    prompt = sys.argv[2] if len(sys.argv) > 2 else None

    # Check if API key is set
    openai_key = os.environ.get("OPENAI_API_KEY")
    if not openai_key:
        print("ERROR: OPENAI_API_KEY environment variable not set.")
        print("Please set it with: export OPENAI_API_KEY=your-api-key")
        sys.exit(1)

    # Read CV file
    try:
        with open(cv_path, "rb") as f:
            cv_data = f.read()

        # Convert to base64 for testing
        import base64
        import mimetypes
        mime_type, _ = mimetypes.guess_type(cv_path)
        if not mime_type:
            mime_type = "application/octet-stream"

        cv_data_base64 = f"data:{mime_type};base64,{base64.b64encode(cv_data).decode('utf-8')}"
    except Exception as e:
        print(f"Error reading CV file: {str(e)}")
        sys.exit(1)

    # Create LLM provider
    provider = create_llm_provider("openai", openai_key)

    # Process CV
    print("Processing CV...")
    cv_parameters = await process_cv(cv_data_base64, provider, "openai")
    print(f"CV Parameters: {json.dumps(cv_parameters, indent=2)}")

    # Process prompt if provided
    prompt_parameters = {}
    if prompt:
        print("\nProcessing prompt...")
        prompt_parameters = await process_prompt(prompt, provider)
        print(f"Prompt Parameters: {json.dumps(prompt_parameters, indent=2)}")

    # Merge and apply defaults
    parameters = {**cv_parameters, **prompt_parameters}
    final_parameters = apply_parameter_defaults(parameters)

    print("\nFinal LinkedIn Search Parameters:")
    print(json.dumps(final_parameters, indent=2))

    # Note: This test doesn't actually call the LinkedIn scraper
    print("\nTest complete. To perform a real LinkedIn search, upload this Actor to Apify.")

if __name__ == "__main__":
    asyncio.run(test_cv_processing())
.actor/actor.json
{ "actorSpecification": 1, "name": "ai-job-finder", "title": "AI Job Finder", "description": "An AI-powered tool that reads a CV and/or prompt to find relevant jobs on LinkedIn", "version": "0.1", "buildTag": "latest", "meta": { "templateId": "python-apify" }, "input": "./input_schema.json", "dockerfile": "./Dockerfile"}
.actor/Dockerfile
# First specify the base Docker image.
FROM apify/actor-python:3.12

# Copy requirements.txt into the Actor image
COPY requirements.txt ./

# Install the packages specified in requirements.txt
RUN echo "Python version:" \
 && python --version \
 && echo "Pip version:" \
 && pip --version \
 && echo "Installing dependencies:" \
 && pip install -r requirements.txt \
 && echo "All installed Python packages:" \
 && pip freeze

# Copy the remaining files and directories with the source code
COPY . ./

# Use compileall to ensure the runnability of the Actor Python code
RUN python3 -m compileall -q .

# Specify how to launch the source code of your Actor
CMD ["python3", "-m", "src"]
.actor/INPUT_SCHEMA.json
{ "title": "AI Job Finder", "description": "An AI-powered tool that reads a CV and/or prompt to find relevant jobs on LinkedIn", "type": "object", "schemaVersion": 1, "properties": { "cv": { "title": "CV/Resume", "type": "object", "description": "Upload your CV/resume (PDF, DOCX, TXT formats supported) as Base64 encoded string", "editor": "file", "nullable": true }, "prompt": { "title": "Job Search Query", "type": "string", "description": "Describe the job you're looking for (e.g., 'Senior Python Developer in New York')", "editor": "textarea", "default": "I'm looking for remote senior software engineering roles in AI companies. I have 5 years of experience with Python and machine learning.", "nullable": true }, "llm_settings": { "title": "LLM Provider Settings", "type": "object", "description": "Configure which LLM provider to use", "editor": "json", "default": { "provider": "gemini", "model": "gemini-1.5-pro" }, "prefill": { "provider": "gemini", "model": "gemini-1.5-pro" } }, "api_keys": { "title": "API Keys", "type": "object", "description": "API keys for LLM providers (optional - defaults to environment variables)", "editor": "json", "default": {}, "prefill": { "openai": "", "claude": "", "gemini": "" } }, "linkedin_search_params": { "title": "Additional LinkedIn Search Parameters", "type": "object", "description": "Override specific LinkedIn search parameters", "editor": "json", "nullable": true }, "proxy": { "title": "Proxy Configuration", "type": "object", "description": "Configure Apify proxy for LinkedIn scraping", "editor": "proxy", "default": { "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] } } }, "required": []}
src/cv_processor.py
import logging
import base64
import json
import re
from typing import Dict, Any, Optional

logger = logging.getLogger(__name__)

async def process_cv(cv_data: str, llm_provider, provider_name: str) -> Dict[str, Any]:
    """
    Process CV data using the appropriate LLM provider

    Args:
        cv_data: CV data (either base64 encoded file or plain text)
        llm_provider: The LLM provider instance to use
        provider_name: Name of the provider ('openai', 'claude', or 'gemini')

    Returns:
        Dictionary of extracted parameters for LinkedIn job search
    """
    try:
        logger.info(f"Processing CV with {provider_name} provider")

        # Process CV with the provider
        cv_parameters = await llm_provider.process_cv(cv_data)

        # Validate and clean the parameters
        cv_parameters = validate_cv_parameters(cv_parameters)

        logger.info(f"Successfully extracted parameters from CV: {json.dumps(cv_parameters, indent=2)}")
        return cv_parameters

    except Exception as e:
        logger.error(f"Error processing CV: {str(e)}")
        # Return empty parameters, which will use defaults later
        return {}

def validate_cv_parameters(parameters: Dict[str, Any]) -> Dict[str, Any]:
    """
    Validate and clean the parameters extracted from the CV

    Args:
        parameters: Raw parameters extracted by the LLM

    Returns:
        Cleaned and validated parameters
    """
    cleaned = {}

    # Clean and validate title
    if "title" in parameters and parameters["title"]:
        cleaned["title"] = str(parameters["title"]).strip()

    # Clean and validate location
    if "location" in parameters and parameters["location"]:
        cleaned["location"] = str(parameters["location"]).strip()

    # Clean and validate experienceLevel
    if "experienceLevel" in parameters and parameters["experienceLevel"]:
        exp_level = str(parameters["experienceLevel"]).strip()
        # Ensure it's a number from 1-5
        if exp_level in ["1", "2", "3", "4", "5"]:
            cleaned["experienceLevel"] = exp_level

    # Clean and validate workType
    if "workType" in parameters and parameters["workType"]:
        work_type = str(parameters["workType"]).strip()
        # Ensure it's a valid work type (1, 2, or 3)
        if work_type in ["1", "2", "3"]:
            cleaned["workType"] = work_type

    # Clean and validate contractType
    if "contractType" in parameters and parameters["contractType"]:
        contract_type = str(parameters["contractType"]).strip().upper()
        # Ensure it's a valid contract type (F, P, C, T, I, or V)
        if contract_type in ["F", "P", "C", "T", "I", "V"]:
            cleaned["contractType"] = contract_type

    # Clean and validate skills (might be used for custom filtering later)
    if "skills" in parameters and isinstance(parameters["skills"], list):
        cleaned["skills"] = [str(skill).strip() for skill in parameters["skills"] if skill]

    return cleaned
src/main.py
#!/usr/bin/env python3
from apify import Actor
import logging
import json
import base64
import re
import os
from typing import Dict, List, Any, Optional

# Import providers
from .llm_providers.factory import create_llm_provider
from .cv_processor import process_cv
from .prompt_processor import process_prompt
from .parameter_handler import apply_parameter_defaults

# Set up logging
logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

async def main():
    """Main entry point for the Actor"""
    # Initialize the Actor
    await Actor.init()

    # Get input from the actor
    actor_input = await Actor.get_input() or {}

    # Validate input - require at least CV or prompt
    cv_data = actor_input.get("cv")
    prompt = actor_input.get("prompt")

    if not cv_data and not prompt:
        raise ValueError("At least one of CV or prompt must be provided")

    # Get LLM settings
    llm_settings = actor_input.get("llm_settings", {"provider": "gemini", "model": "gemini-1.5-pro"})
    provider_name = llm_settings.get("provider", "gemini")

    # Get API key - first from input, then from environment variables
    api_keys = actor_input.get("api_keys", {})
    api_key = api_keys.get(provider_name)

    # If no API key in input, try to get from environment variables
    if not api_key:
        if provider_name == "openai":
            api_key = os.getenv("OPENAI_API_KEY")
        elif provider_name == "gemini":
            api_key = os.getenv("GEMINI_API_KEY")
        elif provider_name == "claude":
            api_key = os.getenv("CLAUDE_API_KEY")

    # If no API key was found, we can't proceed with LLM processing
    if not api_key:
        logger.warning(f"No API key provided for {provider_name}")
        await Actor.push_data([{
            "title": "LLM API KEY IS NEEDED",
            "description": f"Please provide an API key for {provider_name.upper()} to use this Actor",
            "instructions": f"Set the {provider_name.upper()}_API_KEY environment variable or provide it in the api_keys input parameter",
            "location": "N/A",
            "companyName": "AI Job Finder",
            "experienceLevel": "N/A",
            "workType": "N/A",
            "contractType": "N/A",
            "publishedAt": "N/A",
            "message": f"API key for {provider_name} is required to get real results"
        }])
        logger.info("Returned message indicating API key is needed")
        return

    # Create LLM provider for processing
    model = llm_settings.get("model")
    if provider_name == "gemini" and not model:
        model = "gemini-1.5-pro"

    logger.info(f"Using LLM provider: {provider_name} with model: {model}")
    llm_provider = create_llm_provider(provider_name, api_key, model)

    # Process parameters
    parameters = {}

    # Extract parameters from CV and/or prompt
    if cv_data:
        logger.info("Processing CV...")
        cv_parameters = await process_cv(cv_data, llm_provider, provider_name)
        parameters.update(cv_parameters)

    if prompt:
        logger.info("Processing prompt...")
        try:
            prompt_parameters = await process_prompt(prompt, llm_provider)
            # Prompt parameters override CV parameters
            parameters.update(prompt_parameters)
        except Exception as e:
            logger.error(f"Error processing prompt: {str(e)}")
            # Continue with default parameters

    # Apply any explicit parameters from input
    linkedin_params = actor_input.get("linkedin_search_params", {})
    if linkedin_params:
        parameters.update(linkedin_params)

    # Apply defaults for missing parameters
    parameters = apply_parameter_defaults(parameters)

    # Set proxy configuration
    if "proxy_configuration" in actor_input:
        parameters["proxy"] = actor_input["proxy_configuration"]
    elif "proxy" in actor_input:
        parameters["proxy"] = actor_input["proxy"]

    # Log the parameters we'll use
    logger.info(f"Using LinkedIn search parameters: {json.dumps(parameters, indent=2)}")

    # Call LinkedIn scraper
    logger.info("Calling LinkedIn scraper with parameters")
    try:
        jobs = await call_linkedin_scraper(parameters)

        # Save output
        await Actor.push_data(jobs)
        logger.info(f"Found {len(jobs)} matching jobs")
    except Exception as e:
        logger.error(f"Error calling LinkedIn scraper: {str(e)}")
        # Return a meaningful error to the user
        await Actor.push_data([{
            "title": "Error Connecting to LinkedIn Scraper",
            "description": f"An error occurred while trying to connect to the LinkedIn Jobs Scraper: {str(e)}",
            "error": True,
            "parameters": parameters
        }])

async def call_linkedin_scraper(parameters):
    """Call the LinkedIn scraper with the given parameters"""
    # Prepare the Actor input
    run_input = {
        "title": parameters.get("title", ""),
        "location": parameters.get("location", ""),
        "companyName": parameters.get("companyName", []),
        "companyId": parameters.get("companyId", []),
        "workType": parameters.get("workType", ""),
        "experienceLevel": parameters.get("experienceLevel", ""),
        "contractType": parameters.get("contractType", ""),
        "publishedAt": parameters.get("publishedAt", ""),
        "rows": parameters.get("rows", 10),
        "proxy": parameters.get("proxy", {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"]
        })
    }

    # Run the Actor and wait for it to finish using Actor.apify_client
    # This automatically handles the authentication - no need for explicit API key
    run = await Actor.apify_client.actor("BHzefUZlZRKWxkTck").call(run_input=run_input)

    # Fetch and return the Actor's output
    dataset_items = await Actor.apify_client.dataset(run["defaultDatasetId"]).list_items()
    return dataset_items.items
src/parameter_handler.py
import logging
from typing import Dict, Any

logger = logging.getLogger(__name__)

def apply_parameter_defaults(parameters: Dict[str, Any]) -> Dict[str, Any]:
    """
    Apply default values for missing parameters

    Args:
        parameters: Current set of parameters

    Returns:
        Parameters with defaults applied
    """
    # Create a copy of the parameters to avoid modifying the original
    final_params = parameters.copy()

    # Check for title (required parameter)
    if "title" not in final_params or not final_params["title"]:
        final_params["title"] = "Software Engineer"  # Default job title
        logger.info("Using default job title: 'Software Engineer'")

    # Set default location if not provided
    if "location" not in final_params or not final_params["location"]:
        final_params["location"] = "United States"  # Country is required, default to United States
        logger.info("Using default location: United States")

    # Set default experience level if not provided
    if "experienceLevel" not in final_params or not final_params["experienceLevel"]:
        final_params["experienceLevel"] = "3"  # Associate
        logger.info("Using default experience level: 3 (Associate)")

    # Set default work type if not provided
    if "workType" not in final_params or not final_params["workType"]:
        final_params["workType"] = ""  # Empty string means any work type
        logger.info("Using default work type: any")

    # Set default contract type if not provided
    if "contractType" not in final_params or not final_params["contractType"]:
        final_params["contractType"] = "F"  # Full-time
        logger.info("Using default contract type: F (Full-Time)")

    # Set default published at if not provided
    if "publishedAt" not in final_params or not final_params["publishedAt"]:
        final_params["publishedAt"] = ""  # Empty string means any time
        logger.info("Using default time frame: any time")

    # Set default company name if not provided
    if "companyName" not in final_params or not final_params["companyName"]:
        final_params["companyName"] = []  # Empty list means any company
        logger.info("Using default company name: any company")

    # Set default company ID if not provided
    if "companyId" not in final_params or not final_params["companyId"]:
        final_params["companyId"] = []  # Empty list means any company ID
        logger.info("Using default company ID: any company ID")

    # Set default rows if not provided
    if "rows" not in final_params or not final_params["rows"]:
        final_params["rows"] = 10  # Default to 10 results
        logger.info("Using default rows: 10")

    # Ensure we have proper proxy configuration
    if "proxy" not in final_params or not final_params["proxy"]:
        final_params["proxy"] = {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"]
        }
        logger.info("Using default proxy configuration")

    return final_params
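As an illustration of the defaulting behaviour above, a minimal usage sketch; the printed values follow directly from the defaults defined in this module.

# Illustrative only: shows which defaults fill in around a partial parameter set.
from src.parameter_handler import apply_parameter_defaults

partial = {"title": "Data Engineer", "workType": "2"}  # remote Data Engineer
full = apply_parameter_defaults(partial)

# full now also contains, among others:
#   location="United States", experienceLevel="3", contractType="F",
#   publishedAt="", companyName=[], companyId=[], rows=10,
#   proxy={"useApifyProxy": True, "apifyProxyGroups": ["RESIDENTIAL"]}
print(full["location"], full["rows"])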
src/prompt_processor.py
import logging
import json
from typing import Dict, Any, Optional

logger = logging.getLogger(__name__)

async def process_prompt(prompt: str, llm_provider) -> Dict[str, Any]:
    """
    Process user prompt and extract job search parameters

    Args:
        prompt: User's job search query
        llm_provider: The LLM provider instance to use

    Returns:
        Dictionary of extracted parameters for LinkedIn job search
    """
    try:
        logger.info("Processing user prompt")

        # Process prompt with the provider
        prompt_parameters = await llm_provider.process_prompt(prompt)

        # Validate and clean the parameters
        prompt_parameters = validate_prompt_parameters(prompt_parameters)

        logger.info(f"Successfully extracted parameters from prompt: {json.dumps(prompt_parameters, indent=2)}")
        return prompt_parameters

    except Exception as e:
        logger.error(f"Error processing prompt: {str(e)}")
        # Return empty parameters, which will use defaults later
        return {}

def validate_prompt_parameters(parameters: Dict[str, Any]) -> Dict[str, Any]:
    """
    Validate and clean the parameters extracted from the prompt

    Args:
        parameters: Raw parameters extracted by the LLM

    Returns:
        Cleaned and validated parameters
    """
    cleaned = {}

    # Clean and validate title
    if "title" in parameters and parameters["title"]:
        cleaned["title"] = str(parameters["title"]).strip()

    # Clean and validate location
    if "location" in parameters and parameters["location"]:
        cleaned["location"] = str(parameters["location"]).strip()

    # Clean and validate experienceLevel
    if "experienceLevel" in parameters and parameters["experienceLevel"]:
        exp_level = str(parameters["experienceLevel"]).strip()
        # Ensure it's a number from 1-5
        if exp_level in ["1", "2", "3", "4", "5"]:
            cleaned["experienceLevel"] = exp_level

    # Clean and validate workType
    if "workType" in parameters and parameters["workType"]:
        work_type = str(parameters["workType"]).strip()
        # Ensure it's a valid work type (1, 2, or 3)
        if work_type in ["1", "2", "3"]:
            cleaned["workType"] = work_type

    # Clean and validate contractType
    if "contractType" in parameters and parameters["contractType"]:
        contract_type = str(parameters["contractType"]).strip().upper()
        # Ensure it's a valid contract type (F, P, C, T, I, or V)
        if contract_type in ["F", "P", "C", "T", "I", "V"]:
            cleaned["contractType"] = contract_type

    # Clean and validate publishedAt
    if "publishedAt" in parameters and parameters["publishedAt"]:
        published_at = str(parameters["publishedAt"]).strip()
        # Ensure it's a valid time frame
        if published_at in ["r86400", "r604800", "r2592000", ""]:
            cleaned["publishedAt"] = published_at

    # Clean and validate rows
    if "rows" in parameters and parameters["rows"]:
        try:
            rows = int(parameters["rows"])
            if rows > 0:
                cleaned["rows"] = rows
        except (ValueError, TypeError):
            pass

    # Clean and validate companyName
    if "companyName" in parameters and isinstance(parameters["companyName"], list):
        cleaned["companyName"] = [str(company).strip() for company in parameters["companyName"] if company]

    # Clean and validate companyId
    if "companyId" in parameters and isinstance(parameters["companyId"], list):
        cleaned["companyId"] = [str(company_id).strip() for company_id in parameters["companyId"] if company_id]

    return cleaned
src/__init__.py
# AI Job Finder package
src/__main__.py
import asyncio

from .main import main

# Execute the Actor entrypoint
asyncio.run(main())
example/advanced-reddit-scraper/.dockerignore
.git
.mise.toml
.nvim.lua
storage

# The rest is copied from https://github.com/github/gitignore/blob/main/Python.gitignore

# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
*$py.class

# C extensions
*.so

# Distribution / packaging
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST

# PyInstaller
# Usually these files are written by a python script from a template
# before PyInstaller builds the exe, so as to inject date/other infos into it.
*.manifest
*.spec

# Installer logs
pip-log.txt
pip-delete-this-directory.txt

# Unit test / coverage reports
htmlcov/
.tox/
.nox/
.coverage
.coverage.*
.cache
nosetests.xml
coverage.xml
*.cover
*.py,cover
.hypothesis/
.pytest_cache/
cover/

# Translations
*.mo
*.pot

# Django stuff:
*.log
local_settings.py
db.sqlite3
db.sqlite3-journal

# Flask stuff:
instance/
.webassets-cache

# Scrapy stuff:
.scrapy

# Sphinx documentation
docs/_build/

# PyBuilder
.pybuilder/
target/

# Jupyter Notebook
.ipynb_checkpoints

# IPython
profile_default/
ipython_config.py

# pyenv
# For a library or package, you might want to ignore these files since the code is
# intended to run in multiple environments; otherwise, check them in:
.python-version

# pdm
# Similar to Pipfile.lock, it is generally recommended to include pdm.lock in version control.
#pdm.lock
# pdm stores project-wide configurations in .pdm.toml, but it is recommended to not include it
# in version control.
# https://pdm.fming.dev/latest/usage/project/#working-with-version-control
.pdm.toml
.pdm-python
.pdm-build/

# PEP 582; used by e.g. github.com/David-OConnor/pyflow and github.com/pdm-project/pdm
__pypackages__/

# Celery stuff
celerybeat-schedule
celerybeat.pid

# SageMath parsed files
*.sage.py

# Environments
.env
.venv
env/
venv/
ENV/
env.bak/
venv.bak/

# Spyder project settings
.spyderproject
.spyproject

# Rope project settings
.ropeproject

# mkdocs documentation
/site

# mypy
.mypy_cache/
.dmypy.json
dmypy.json

# Pyre type checker
.pyre/

# pytype static type analyzer
.pytype/

# Cython debug symbols
cython_debug/

# PyCharm
# JetBrains specific template is maintained in a separate JetBrains.gitignore that can
# be found at https://github.com/github/gitignore/blob/main/Global/JetBrains.gitignore
# and can be added to the global gitignore or merged into this file. For a more nuclear
# option (not recommended) you can uncomment the following to ignore the entire idea folder.
.idea/
example/advanced-reddit-scraper/.gitignore
.mise.toml
.nvim.lua
storage

# The rest is copied from https://github.com/github/gitignore/blob/main/Python.gitignore

# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
*$py.class

# C extensions
*.so

# Distribution / packaging
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST

# PyInstaller
# Usually these files are written by a python script from a template
# before PyInstaller builds the exe, so as to inject date/other infos into it.
*.manifest
*.spec

# Installer logs
pip-log.txt
pip-delete-this-directory.txt

# Unit test / coverage reports
htmlcov/
.tox/
.nox/
.coverage
.coverage.*
.cache
nosetests.xml
coverage.xml
*.cover
*.py,cover
.hypothesis/
.pytest_cache/
cover/

# Translations
*.mo
*.pot

# Django stuff:
*.log
local_settings.py
db.sqlite3
db.sqlite3-journal

# Flask stuff:
instance/
.webassets-cache

# Scrapy stuff:
.scrapy

# Sphinx documentation
docs/_build/

# PyBuilder
.pybuilder/
target/

# Jupyter Notebook
.ipynb_checkpoints

# IPython
profile_default/
ipython_config.py

# pyenv
# For a library or package, you might want to ignore these files since the code is
# intended to run in multiple environments; otherwise, check them in:
.python-version

# pdm
# Similar to Pipfile.lock, it is generally recommended to include pdm.lock in version control.
#pdm.lock
# pdm stores project-wide configurations in .pdm.toml, but it is recommended to not include it
# in version control.
# https://pdm.fming.dev/latest/usage/project/#working-with-version-control
.pdm.toml
.pdm-python
.pdm-build/

# PEP 582; used by e.g. github.com/David-OConnor/pyflow and github.com/pdm-project/pdm
__pypackages__/

# Celery stuff
celerybeat-schedule
celerybeat.pid

# SageMath parsed files
*.sage.py

# Environments
.env
.venv
env/
venv/
ENV/
env.bak/
venv.bak/

# Spyder project settings
.spyderproject
.spyproject

# Rope project settings
.ropeproject

# mkdocs documentation
/site

# mypy
.mypy_cache/
.dmypy.json
dmypy.json

# Pyre type checker
.pyre/

# pytype static type analyzer
.pytype/

# Cython debug symbols
cython_debug/

# PyCharm
# JetBrains specific template is maintained in a separate JetBrains.gitignore that can
# be found at https://github.com/github/gitignore/blob/main/Global/JetBrains.gitignore
# and can be added to the global gitignore or merged into this file. For a more nuclear
# option (not recommended) you can uncomment the following to ignore the entire idea folder.
.idea/

# Added by Apify CLI
node_modules
example/advanced-reddit-scraper/README.md
# 🚀 Advanced Reddit Scraper for Apify Actors - Lightning Fast & SEO Optimized

Unlock the full potential of Reddit data with our Advanced Reddit Scraper designed for the Apify platform. This high-performance tool uses lightning-fast requests to extract extensive subreddit information, providing researchers, marketers, and data enthusiasts with unparalleled social media insights.

## 📊 Comprehensive Reddit Data Extraction

Our Reddit Scraper offers a robust set of features that allow you to collect detailed data from any subreddit using Apify's powerful actor architecture. Enjoy rapid scraping with optimal performance while taking advantage of customizable settings tailored to your data requirements.

### 🔥 Key SEO-Optimized Features

- **Full Subreddit Scraping**: Extract every detail from the target subreddits, capturing posts, comments, and metadata.
- **Customizable Data Fields**: Configure exactly what you're after, ensuring that you only get the data that matters.
- **Lightning Fast Performance**: Utilizes Python requests for rapid data retrieval, so you never miss a trending topic.
- **Scalable Data Collection**: Effortlessly scrape multiple subreddits simultaneously, ideal for large-scale data mining.
- **Real-Time Insights**: Obtain the most current Reddit information, perfect for real-time analytics and trend monitoring.
- **Easy Integration with Data Pipelines**: Seamlessly export data in various formats (JSON, CSV, etc.) for immediate analysis.

### 🌟 Use Cases for Maximum Impact

1. **Market Research & Trend Analysis**: Monitor public opinion and identify trending topics across subreddits.
2. **Content Creation & Optimization**: Discover viral posts and themes to inspire your content strategy.
3. **Sentiment Analysis**: Analyze user reactions and sentiments using detailed comment extraction.
4. **Competitive Intelligence**: Stay ahead by tracking competitor mentions and industry-specific discussions.
5. **Academic & Social Media Research**: Gather comprehensive data for scholarly studies and social behavior analysis.

### 🛠️ How It Works

1. **Input Parameters** (see the example run input after this list):
   - **Queries**: Provide one or more subreddit URLs in the format `https://reddit.com/r/<subreddit>`.
   - **Post Sorting**: Choose how posts are sorted (e.g., `hot`, `new`, `top`, or `rising`).
   - **Top Period**: Specify the period for top posts (e.g., `day`, `week`, or `all`).
   - **Max Posts**: Set the maximum number of posts to scrape per subreddit.
   - **Comment Sorting**: Define the method to sort comments (e.g., `best`, `top`, `new`).
   - **Number of Comments**: Determine how many comments (and a few nested replies) to extract per post.
2. **Execution**: Our scraper efficiently navigates Reddit using HTTP requests, ensuring quick and reliable data extraction while strictly following Reddit's guidelines.
3. **Output**: Receive clean, structured data ready for analysis and integration into your existing workflows.
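A minimal run-input sketch assembled from the parameters above. The `<username>` placeholder, the `<APIFY_TOKEN>` value, and the use of `apify-client` are assumptions for illustration, not part of this repository; the field names follow `.actor/input_schema.json`.

```python
# Illustrative sketch: run the scraper with one subreddit and default-like settings.
from apify_client import ApifyClient

client = ApifyClient("<APIFY_TOKEN>")  # placeholder token

run_input = {
    "queries": ["https://reddit.com/r/AskReddit"],
    "postSort": "top",        # hot | new | top | rising
    "topPeriod": "week",      # only used when postSort is "top"
    "limit": 10,              # max posts per subreddit
    "commentSort": "top",     # best | top | new | controversial | old | qa
    "numComments": 0,         # comments (plus a few replies) per post
}

# "<username>/reddit-subreddit-scraper" is a placeholder for the published Actor ID.
run = client.actor("<username>/reddit-subreddit-scraper").call(run_input=run_input)
print(run["defaultDatasetId"])
```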
### 📈 Why Our Reddit Scraper Stands Out

- **Comprehensive Data Collection**: Capture every available piece of information from the subreddits you track.
- **High-Speed Requests**: Leveraging the fastest possible scraping techniques to give you immediate insights.
- **Customizable & Flexible**: Tailor the scraping process to meet diverse and specific data needs.
- **Enterprise-Grade Scalability**: Perfect for both small-scale projects and large-scale data operations.
- **Ethical & Compliant**: Adheres to Reddit’s data usage policies, including respecting robots.txt and API guidelines.

### 🔗 Essential Resources

- [Apify Platform](https://apify.com)
- [Actor Documentation](https://docs.apify.com/actors)
- [API Reference](https://docs.apify.com/api/v2)

### 📞 Expert Support When You Need It

For further assistance or inquiries, feel free to reach out:
- 📧 Email: tnot2652@gmail.com

### 🚀 Ready to Upgrade Your Data Game?

Don't miss out on vital Reddit insights. Enhance your data strategy and make informed decisions with our Advanced Reddit Scraper. Start scraping smarter and faster on Apify today!
example/advanced-reddit-scraper/requirements.txt
# Feel free to add your Python dependencies below. For formatting guidelines, see:
# https://pip.pypa.io/en/latest/reference/requirements-file-format/

apify == 2.2.1
beautifulsoup4[lxml]
httpx
types-beautifulsoup4
src/llm_providers/base_provider.py
from abc import ABC, abstractmethod
from typing import Dict, Any, Optional

class LLMProvider(ABC):
    """Base abstract class for LLM providers"""

    def __init__(self, api_key: str, model: Optional[str] = None):
        """
        Initialize the LLM provider

        Args:
            api_key: API key for the provider
            model: Optional specific model to use
        """
        self.api_key = api_key
        self.model = model

    @abstractmethod
    async def process_cv(self, cv_data: str) -> Dict[str, Any]:
        """
        Process CV data and extract job search parameters

        Args:
            cv_data: CV content (could be base64 encoded file or text)

        Returns:
            Dictionary of extracted parameters
        """
        pass

    @abstractmethod
    async def process_prompt(self, prompt: str) -> Dict[str, Any]:
        """
        Process user prompt and extract job search parameters

        Args:
            prompt: User's job search query

        Returns:
            Dictionary of extracted parameters
        """
        pass

    @abstractmethod
    async def validate_api_key(self) -> bool:
        """
        Validate that the API key is correct

        Returns:
            True if valid, False otherwise
        """
        pass

    def supports_document_processing(self) -> bool:
        """
        Check if the provider and model support direct document processing

        Returns:
            True if document processing is supported, False otherwise
        """
        return False
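To illustrate how a concrete provider plugs into this abstract base, a minimal sketch of a hypothetical extra provider. The EchoProvider class and its trivial logic are illustrative only and are not part of the repository; a real provider would call an actual LLM API in each method.

# Hypothetical example: a stub that satisfies the LLMProvider interface.
from typing import Dict, Any, Optional

from src.llm_providers.base_provider import LLMProvider

class EchoProvider(LLMProvider):
    """Toy provider showing which methods a new provider must implement."""

    def __init__(self, api_key: str, model: Optional[str] = None):
        super().__init__(api_key, model)

    async def validate_api_key(self) -> bool:
        # A real provider would make a cheap authenticated call here.
        return bool(self.api_key)

    async def process_cv(self, cv_data: str) -> Dict[str, Any]:
        # A real provider would prompt an LLM; returning {} lets downstream
        # code fill in values via apply_parameter_defaults().
        return {}

    async def process_prompt(self, prompt: str) -> Dict[str, Any]:
        return {"title": prompt[:50]}  # naive placeholder extraction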
src/llm_providers/claude_provider.py
import json
import logging
import re
import base64
from typing import Dict, Any, Optional, List

from anthropic import AsyncAnthropic
from src.llm_providers.base_provider import LLMProvider

logger = logging.getLogger(__name__)

class ClaudeProvider(LLMProvider):
    """Implementation of LLM provider for Anthropic Claude"""

    def __init__(self, api_key: str, model: Optional[str] = None):
        """Initialize the Claude provider"""
        super().__init__(api_key, model)
        self.client = AsyncAnthropic(api_key=api_key)
        self.model = model or "claude-3-opus-20240229"  # Default to most capable model

    def supports_document_processing(self) -> bool:
        """Check if this provider/model supports direct document processing"""
        return "claude-3" in self.model  # All Claude 3 models support document processing

    async def validate_api_key(self) -> bool:
        """Validate the API key by making a simple models call"""
        try:
            # There's no direct way to validate the key without making a message request
            # Use a minimal request to check if the key works
            await self.client.messages.create(
                model=self.model,
                max_tokens=10,
                messages=[{"role": "user", "content": "Hello"}]
            )
            return True
        except Exception as e:
            logger.error(f"Claude API key validation failed: {str(e)}")
            return False

    async def process_cv(self, cv_data: str) -> Dict[str, Any]:
        """
        Process CV with Claude

        Args:
            cv_data: CV content (could be base64 encoded file or text)

        Returns:
            Dictionary of extracted parameters
        """
        if self.supports_document_processing() and cv_data.startswith("data:"):
            return await self._process_cv_with_document_api(cv_data)
        else:
            # Assume it's already text
            return await self._process_cv_text(cv_data)

    async def _process_cv_with_document_api(self, cv_data: str) -> Dict[str, Any]:
        """Process CV using Claude's document capabilities"""
        try:
            # Extract the mime type and base64 data
            mime_type, encoded_data = cv_data.split(';base64,', 1)
            mime_type = mime_type.replace('data:', '')

            response = await self.client.messages.create(
                model=self.model,
                max_tokens=4000,
                system="Extract job search parameters from this CV/resume.",
                messages=[
                    {"role": "user", "content": [
                        {"type": "text", "text": self._get_cv_prompt()},
                        {"type": "image", "source": {
                            "type": "base64",
                            "media_type": mime_type,
                            "data": encoded_data
                        }}
                    ]}
                ]
            )

            # Extract JSON from response
            content = response.content[0].text
            # Find JSON in the content (handle potential text wrapping)
            json_match = re.search(r'```json\s*(.*?)\s*```', content, re.DOTALL)
            if json_match:
                return json.loads(json_match.group(1))

            # If no JSON block, try to parse the entire content
            return json.loads(content)
        except Exception as e:
            logger.error(f"Claude document processing failed: {str(e)}")
            raise

    async def _process_cv_text(self, cv_text: str) -> Dict[str, Any]:
        """Process CV text with Claude"""
        try:
            response = await self.client.messages.create(
                model=self.model,
                max_tokens=4000,
                system="Extract job search parameters from this CV/resume.",
                messages=[
                    {"role": "user", "content": self._get_cv_prompt() + f"\n\nCV TEXT:\n{cv_text}"}
                ]
            )

            # Extract JSON from response
            content = response.content[0].text
            # Find JSON in the content (handle potential text wrapping)
            json_match = re.search(r'```json\s*(.*?)\s*```', content, re.DOTALL)
            if json_match:
                return json.loads(json_match.group(1))

            # If no JSON block, try to parse the entire content
            return json.loads(content)
        except Exception as e:
            logger.error(f"Claude text processing failed: {str(e)}")
            raise

    async def process_prompt(self, prompt: str) -> Dict[str, Any]:
        """Process user prompt and extract job search parameters"""
        try:
            response = await self.client.messages.create(
                model=self.model,
                max_tokens=4000,
                system="Extract job search parameters from this query.",
                messages=[
                    {"role": "user", "content": self._get_prompt_extraction_prompt() + f"\n\nUSER QUERY:\n{prompt}"}
                ]
            )

            # Extract JSON from response
            content = response.content[0].text
            # Find JSON in the content (handle potential text wrapping)
            json_match = re.search(r'```json\s*(.*?)\s*```', content, re.DOTALL)
            if json_match:
                return json.loads(json_match.group(1))

            # If no JSON block, try to parse the entire content
            return json.loads(content)
        except Exception as e:
            logger.error(f"Claude prompt processing failed: {str(e)}")
            raise

    def _get_cv_prompt(self) -> str:
        """Get the prompt for CV analysis"""
        return """
        Extract the following job search parameters from this CV/resume.

        Return your response as valid JSON object inside ```json code blocks with the following structure:

        ```json
        {
            "title": "The most recent job title or professional role",
            "location": "Current or preferred location",
            "experienceLevel": "A numeric value from 1-5 where:
                1 = Internship
                2 = Entry Level
                3 = Associate
                4 = Mid-Senior Level
                5 = Director",
            "workType": "Either:
                1 = On-Site
                2 = Remote
                3 = Hybrid
                Based on any workstyle preferences found in the CV",
            "contractType": "A single letter representing employment type preference:
                F = Full-Time
                P = Part-Time
                C = Contract
                T = Temporary
                I = Internship
                V = Volunteer",
            "skills": ["list", "of", "key", "technical", "and", "soft", "skills"]
        }
        ```

        If a piece of information is not clearly stated in the CV, make a reasonable inference based on the available information. If inference is not possible, use null.

        IMPORTANT: Your output must be a valid JSON object wrapped in ```json code blocks.
        """

    def _get_prompt_extraction_prompt(self) -> str:
        """Get the prompt for extracting parameters from user query"""
        return """
        Extract LinkedIn job search parameters from this query.

        Return your response as valid JSON object inside ```json code blocks with the following structure:

        ```json
        {
            "title": "Job title or role to search for",
            "location": "Geographic location for job search",
            "companyName": ["array of specific companies mentioned"],
            "companyId": ["array of LinkedIn company IDs if mentioned"],
            "workType": "Either:
                1 = On-Site
                2 = Remote
                3 = Hybrid",
            "experienceLevel": "A numeric value from 1-5 where:
                1 = Internship
                2 = Entry Level
                3 = Associate
                4 = Mid-Senior Level
                5 = Director",
            "contractType": "A single letter representing employment type:
                F = Full-Time
                P = Part-Time
                C = Contract
                T = Temporary
                I = Internship
                V = Volunteer",
            "publishedAt": "Time frame:
                r86400 = Last 24 hours
                r604800 = Last week
                r2592000 = Last month
                empty string = Any time",
            "rows": "Number of job listings to return (integer)"
        }
        ```

        For any parameters not explicitly mentioned in the query, use null.

        IMPORTANT: Your output must be a valid JSON object wrapped in ```json code blocks.
        """
src/llm_providers/factory.py
import logging
from typing import Optional, Any

logger = logging.getLogger(__name__)

def create_llm_provider(provider_name: str, api_key: str, model: Optional[str] = None) -> Any:
    """
    Create and return an instance of the specified LLM provider.

    Args:
        provider_name: Name of the LLM provider ('openai', 'claude', or 'gemini')
        api_key: API key for the provider
        model: Optional specific model to use

    Returns:
        An instance of the appropriate LLM provider

    Raises:
        ValueError: If the provider is not supported
    """
    if provider_name.lower() == "openai":
        from src.llm_providers.openai_provider import OpenAIProvider
        return OpenAIProvider(api_key, model)
    elif provider_name.lower() == "claude":
        from src.llm_providers.claude_provider import ClaudeProvider
        return ClaudeProvider(api_key, model)
    elif provider_name.lower() == "gemini":
        from src.llm_providers.gemini_provider import GeminiProvider
        return GeminiProvider(api_key, model)
    else:
        raise ValueError(f"Unsupported LLM provider: {provider_name}")
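A short usage sketch of the factory; the "sk-..." key is a placeholder, and the model choice is just one of the values the providers accept.

# Illustrative only: create a provider and extract parameters from a prompt.
import asyncio
from src.llm_providers.factory import create_llm_provider

async def demo():
    provider = create_llm_provider("openai", "sk-...", model="gpt-4o")  # placeholder key
    if await provider.validate_api_key():
        params = await provider.process_prompt("Remote data engineer roles in Berlin")
        print(params)

asyncio.run(demo())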
src/llm_providers/gemini_provider.py
import json
import logging
import re
import base64
from typing import Dict, Any, Optional, List

import google.generativeai as genai
from src.llm_providers.base_provider import LLMProvider

logger = logging.getLogger(__name__)

class GeminiProvider(LLMProvider):
    """Implementation of LLM provider for Google Gemini"""

    def __init__(self, api_key: str, model: Optional[str] = None):
        """Initialize the Gemini provider"""
        super().__init__(api_key, model)
        genai.configure(api_key=api_key)
        self.model_name = model or "gemini-1.5-pro"
        self.model = genai.GenerativeModel(self.model_name)

    def supports_document_processing(self) -> bool:
        """Check if this provider/model supports direct document processing"""
        vision_capable_models = ["gemini-pro-vision", "gemini-1.5-pro", "gemini-1.5-flash"]
        return any(model_name in self.model_name for model_name in vision_capable_models)

    async def validate_api_key(self) -> bool:
        """Validate the API key by making a simple models call"""
        try:
            # Gemini doesn't have a dedicated validate endpoint, use a simple generation
            response = self.model.generate_content("Hello")
            return True
        except Exception as e:
            logger.error(f"Gemini API key validation failed: {str(e)}")
            return False

    async def process_cv(self, cv_data: str) -> Dict[str, Any]:
        """
        Process CV with Gemini

        Args:
            cv_data: CV content (could be base64 encoded file or text)

        Returns:
            Dictionary of extracted parameters
        """
        if self.supports_document_processing() and cv_data.startswith("data:"):
            return await self._process_cv_with_vision(cv_data)
        else:
            # Assume it's already text
            return await self._process_cv_text(cv_data)

    async def _process_cv_with_vision(self, cv_data: str) -> Dict[str, Any]:
        """Process CV using Gemini's vision capabilities"""
        try:
            # Extract the mime type and base64 data
            mime_type, encoded_data = cv_data.split(';base64,', 1)
            mime_type = mime_type.replace('data:', '')

            # Create a content parts list with prompt and image
            parts = [
                self._get_cv_prompt(),
                {"mime_type": mime_type, "data": base64.b64decode(encoded_data)}
            ]

            response = self.model.generate_content(
                parts,
                generation_config={
                    "temperature": 0.1
                }
            )

            # Extract JSON from response
            content = response.text

            # Try to parse as JSON directly
            try:
                return json.loads(content)
            except json.JSONDecodeError:
                # If direct parsing fails, look for JSON in code blocks
                json_match = re.search(r'```(?:json)?\s*(.*?)\s*```', content, re.DOTALL)
                if json_match:
                    return json.loads(json_match.group(1))

                # If still no match, try to find anything that looks like JSON
                json_pattern = r'{.*}'
                json_match = re.search(json_pattern, content, re.DOTALL)
                if json_match:
                    return json.loads(json_match.group(0))

                logger.error(f"Could not parse Gemini response as JSON: {content}")
                raise ValueError("Failed to parse Gemini response as JSON")

        except Exception as e:
            logger.error(f"Gemini vision processing failed: {str(e)}")
            raise

    async def _process_cv_text(self, cv_text: str) -> Dict[str, Any]:
        """Process CV text with Gemini"""
        try:
            response = self.model.generate_content(
                self._get_cv_prompt() + f"\n\nCV TEXT:\n{cv_text}",
                generation_config={
                    "temperature": 0.1
                }
            )

            # Extract JSON from response
            content = response.text

            # Try to parse as JSON directly
            try:
                return json.loads(content)
            except json.JSONDecodeError:
                # If direct parsing fails, look for JSON in code blocks
                json_match = re.search(r'```(?:json)?\s*(.*?)\s*```', content, re.DOTALL)
                if json_match:
                    return json.loads(json_match.group(1))

                # If still no match, try to find anything that looks like JSON
                json_pattern = r'{.*}'
                json_match = re.search(json_pattern, content, re.DOTALL)
                if json_match:
                    return json.loads(json_match.group(0))

                logger.error(f"Could not parse Gemini response as JSON: {content}")
                raise ValueError("Failed to parse Gemini response as JSON")

        except Exception as e:
            logger.error(f"Gemini text processing failed: {str(e)}")
            raise

    async def process_prompt(self, prompt: str) -> Dict[str, Any]:
        """Process user prompt and extract job search parameters"""
        try:
            response = self.model.generate_content(
                self._get_prompt_extraction_prompt() + f"\n\nUSER QUERY:\n{prompt}",
                generation_config={
                    "temperature": 0.1
                }
            )

            # Extract JSON from response
            content = response.text

            # Try to parse as JSON directly
            try:
                return json.loads(content)
            except json.JSONDecodeError:
                # If direct parsing fails, look for JSON in code blocks
                json_match = re.search(r'```(?:json)?\s*(.*?)\s*```', content, re.DOTALL)
                if json_match:
                    return json.loads(json_match.group(1))

                # If still no match, try to find anything that looks like JSON
                json_pattern = r'{.*}'
                json_match = re.search(json_pattern, content, re.DOTALL)
                if json_match:
                    return json.loads(json_match.group(0))

                logger.error(f"Could not parse Gemini response as JSON: {content}")
                raise ValueError("Failed to parse Gemini response as JSON")

        except Exception as e:
            logger.error(f"Gemini prompt processing failed: {str(e)}")
            raise

    def _get_cv_prompt(self) -> str:
        """Get the prompt for CV analysis"""
        return """
        Extract the following job search parameters from this CV/resume:

        Follow these steps:
        1. Identify the job title
        2. Determine the location
        3. Assess experience level (1-5)
        4. Identify work type preference (1-3)
        5. Determine contract type (FPCTIV)
        6. List key skills

        Return ONLY a JSON object with this format:
        {
            "title": "The most recent job title or professional role",
            "location": "Current or preferred location",
            "experienceLevel": "A numeric value from 1-5 where:
                1 = Internship
                2 = Entry Level
                3 = Associate
                4 = Mid-Senior Level
                5 = Director",
            "workType": "Either:
                1 = On-Site
                2 = Remote
                3 = Hybrid
                Based on any workstyle preferences found in the CV",
            "contractType": "A single letter representing employment type preference:
                F = Full-Time
                P = Part-Time
                C = Contract
                T = Temporary
                I = Internship
                V = Volunteer",
            "skills": ["list", "of", "key", "technical", "and", "soft", "skills"]
        }

        If a piece of information is not clearly stated in the CV, make a reasonable inference based on the available information. If inference is not possible, use null.

        IMPORTANT: Your output must be ONLY the JSON object with no additional text.
        """

    def _get_prompt_extraction_prompt(self) -> str:
        """Get the prompt for extracting parameters from user query"""
        return """
        Extract LinkedIn job search parameters from this query.

        Follow these steps:
        1. Identify job title or role
        2. Determine geographic location
        3. Note any specific companies mentioned
        4. Assess experience level (1-5)
        5. Identify work type (1-3)
        6. Determine contract type (FPCTIV)
        7. Identify time frame for job postings

        Return ONLY a JSON object with this format:
        {
            "title": "Job title or role to search for",
            "location": "Geographic location for job search",
            "companyName": ["array of specific companies mentioned"],
            "companyId": ["array of LinkedIn company IDs if mentioned"],
            "workType": "Either:
                1 = On-Site
                2 = Remote
                3 = Hybrid",
            "experienceLevel": "A numeric value from 1-5 where:
                1 = Internship
                2 = Entry Level
                3 = Associate
                4 = Mid-Senior Level
                5 = Director",
            "contractType": "A single letter representing employment type:
                F = Full-Time
                P = Part-Time
                C = Contract
                T = Temporary
                I = Internship
                V = Volunteer",
            "publishedAt": "Time frame:
                r86400 = Last 24 hours
                r604800 = Last week
                r2592000 = Last month
                empty string = Any time",
            "rows": "Number of job listings to return (integer)"
        }

        For any parameters not explicitly mentioned in the query, use null.

        IMPORTANT: Your output must be ONLY the JSON object with no additional text.
        """
src/llm_providers/openai_provider.py
import json
import logging
import base64
import re
from typing import Dict, Any, Optional, List

from openai import AsyncOpenAI
from src.llm_providers.base_provider import LLMProvider

logger = logging.getLogger(__name__)

class OpenAIProvider(LLMProvider):
    """Implementation of LLM provider for OpenAI"""

    def __init__(self, api_key: str, model: Optional[str] = None):
        """Initialize the OpenAI provider"""
        super().__init__(api_key, model)
        self.client = AsyncOpenAI(api_key=api_key)
        self.model = model or "gpt-4o"  # Default to most capable model

    def supports_document_processing(self) -> bool:
        """Check if this provider/model supports direct document processing"""
        document_capable_models = ["gpt-4-vision", "gpt-4o"]
        return any(model_name in self.model for model_name in document_capable_models)

    async def validate_api_key(self) -> bool:
        """Validate the API key by making a simple models.list call"""
        try:
            await self.client.models.list()
            return True
        except Exception as e:
            logger.error(f"OpenAI API key validation failed: {str(e)}")
            return False

    async def process_cv(self, cv_data: str) -> Dict[str, Any]:
        """
        Process CV with OpenAI

        Args:
            cv_data: CV content (could be base64 encoded file or text)

        Returns:
            Dictionary of extracted parameters
        """
        if self.supports_document_processing() and cv_data.startswith("data:"):
            return await self._process_cv_with_vision(cv_data)
        else:
            # Assume it's already text
            return await self._process_cv_text(cv_data)

    async def _process_cv_with_vision(self, cv_data: str) -> Dict[str, Any]:
        """Process CV using OpenAI's vision capabilities"""
        try:
            response = await self.client.chat.completions.create(
                model=self.model,
                messages=[
                    {"role": "system", "content": "Extract job search parameters from this CV/resume."},
                    {"role": "user", "content": [
                        {"type": "text", "text": self._get_cv_prompt()},
                        {"type": "image_url", "image_url": {"url": cv_data}}
                    ]}
                ],
                response_format={"type": "json_object"}
            )
            return json.loads(response.choices[0].message.content)
        except Exception as e:
            logger.error(f"OpenAI vision processing failed: {str(e)}")
            raise

    async def _process_cv_text(self, cv_text: str) -> Dict[str, Any]:
        """Process CV text with OpenAI"""
        try:
            response = await self.client.chat.completions.create(
                model=self.model,
                messages=[
                    {"role": "system", "content": "Extract job search parameters from this CV/resume."},
                    {"role": "user", "content": self._get_cv_prompt() + f"\n\nCV TEXT:\n{cv_text}"}
                ],
                response_format={"type": "json_object"}
            )
            return json.loads(response.choices[0].message.content)
        except Exception as e:
            logger.error(f"OpenAI text processing failed: {str(e)}")
            raise

    async def process_prompt(self, prompt: str) -> Dict[str, Any]:
        """Process user prompt and extract job search parameters"""
        try:
            response = await self.client.chat.completions.create(
                model=self.model,
                messages=[
                    {"role": "system", "content": "Extract job search parameters from this query."},
                    {"role": "user", "content": self._get_prompt_extraction_prompt() + f"\n\nUSER QUERY:\n{prompt}"}
                ],
                response_format={"type": "json_object"}
            )
            return json.loads(response.choices[0].message.content)
        except Exception as e:
            logger.error(f"OpenAI prompt processing failed: {str(e)}")
            raise

    def _get_cv_prompt(self) -> str:
        """Get the prompt for CV analysis"""
        return """
        Extract the following job search parameters from this CV/resume in JSON format:

        Required JSON format:
        {
            "title": "The most recent job title or professional role",
            "location": "Current or preferred location",
            "experienceLevel": "A numeric value from 1-5 where:
                1 = Internship
                2 = Entry Level
                3 = Associate
                4 = Mid-Senior Level
                5 = Director",
            "workType": "Either:
                1 = On-Site
                2 = Remote
                3 = Hybrid
                Based on any workstyle preferences found in the CV",
            "contractType": "A single letter representing employment type preference:
                F = Full-Time
                P = Part-Time
                C = Contract
                T = Temporary
                I = Internship
                V = Volunteer",
            "skills": ["list", "of", "key", "technical", "and", "soft", "skills"]
        }

        If a piece of information is not clearly stated in the CV, make a reasonable inference based on the available information. If inference is not possible, use null.
        """

    def _get_prompt_extraction_prompt(self) -> str:
        """Get the prompt for extracting parameters from user query"""
        return """
        Extract LinkedIn job search parameters from this query in JSON format:

        Required JSON format:
        {
            "title": "Job title or role to search for",
            "location": "Geographic location for job search",
            "companyName": ["array of specific companies mentioned"],
            "companyId": ["array of LinkedIn company IDs if mentioned"],
            "workType": "Either:
                1 = On-Site
                2 = Remote
                3 = Hybrid",
            "experienceLevel": "A numeric value from 1-5 where:
                1 = Internship
                2 = Entry Level
                3 = Associate
                4 = Mid-Senior Level
                5 = Director",
            "contractType": "A single letter representing employment type:
                F = Full-Time
                P = Part-Time
                C = Contract
                T = Temporary
                I = Internship
                V = Volunteer",
            "publishedAt": "Time frame:
                r86400 = Last 24 hours
                r604800 = Last week
                r2592000 = Last month
                empty string = Any time",
            "rows": "Number of job listings to return (integer)"
        }

        For any parameters not explicitly mentioned in the query, use null.
        """
src/llm_providers/__init__.py
# LLM Providers package
example/advanced-reddit-scraper/.actor/actor.json
{ "actorSpecification": 1, "name": "reddit-subreddit-scraper", "title": "Reddit Subreddit Scraper", "description": "Scrapes Reddit subreddits.", "version": "0.0", "buildTag": "latest", "meta": { "templateId": "python-beautifulsoup" }, "input": "./input_schema.json", "dockerfile": "./Dockerfile"}
example/advanced-reddit-scraper/.actor/Dockerfile
# First, specify the base Docker image.
# You can see the Docker images from Apify at https://hub.docker.com/r/apify/.
# You can also use any other image from Docker Hub.
FROM apify/actor-python:3.12

# Second, copy just requirements.txt into the Actor image,
# since it should be the only file that affects the dependency install in the next step,
# in order to speed up the build
COPY requirements.txt ./

# Install the packages specified in requirements.txt,
# Print the installed Python version, pip version
# and all installed packages with their versions for debugging
RUN echo "Python version:" \
 && python --version \
 && echo "Pip version:" \
 && pip --version \
 && echo "Installing dependencies:" \
 && pip install -r requirements.txt \
 && echo "All installed Python packages:" \
 && pip freeze

# Next, copy the remaining files and directories with the source code.
# Since we do this after installing the dependencies, quick build will be really fast
# for most source file changes.
COPY . ./

# Use compileall to ensure the runnability of the Actor Python code.
RUN python3 -m compileall -q .

# Specify how to launch the source code of your Actor.
# By default, the "python3 -m src" command is run
CMD ["python3", "-m", "src"]
example/advanced-reddit-scraper/.actor/input_schema.json
{ "title": "Advanced Reddit Scraper", "type": "object", "schemaVersion": 1, "properties": { "queries": { "title": "Start URLs or subreddits", "type": "array", "description": "Subreddits to scrape in the format of https://reddit.com/r/<subreddit>", "prefill": [ "https://reddit.com/r/AskReddit" ], "default": [ "https://reddit.com/r/AskReddit" ], "editor": "stringList" }, "postSort": { "title": "Sorting", "type": "string", "enum": [ "hot", "new", "top", "rising" ], "description": "Sorting of posts in the subreddit - (top, new, rising, hot) - If already given a sorted subreddit link (e.g.: https://reddit.com/r/eli5/top/?t=day) the sorting will be ignored", "default": "top" }, "topPeriod": { "title": "Top posts period", "type": "string", "enum": [ "hour", "day", "week", "month", "year", "all" ], "description": "Top posts period - (only works when sorting is top)", "default": "week" }, "limit": { "title": "Max Posts", "type": "integer", "description": "Maximum number of posts to scrape per URL (default: 10)", "default": 10 },
"commentSort": { "title": "Comment sorting", "description": "Sorting of comments in the post - (best, top, new, controversial, old, qa)", "type": "string", "enum": [ "best", "top", "new", "controversial", "old", "qa" ], "default": "top" }, "numComments": { "title": "Number of comments to scrape", "type": "integer", "description": "A few replies to comments is also returned", "default": 0 } }, "required": ["queries"]}
example/advanced-reddit-scraper/.git/COMMIT_EDITMSG
release
example/advanced-reddit-scraper/.git/config
[core]
	repositoryformatversion = 0
	filemode = false
	bare = false
	logallrefupdates = true
	symlinks = false
	ignorecase = true
[remote "origin"]
	url = https://github.com/deduble/advanced-reddit-scraper.git
	fetch = +refs/heads/*:refs/remotes/origin/*
[branch "main"]
	remote = origin
	merge = refs/heads/main
[gui]
	wmstate = normal
	geometry = 1322x693+228+228 254 315
example/advanced-reddit-scraper/.git/description
Unnamed repository; edit this file 'description' to name the repository.
example/advanced-reddit-scraper/.git/FETCH_HEAD
68034258495c18bfb133b925e51bbce4b07c2cf2 branch 'main' of https://github.com/deduble/advanced-reddit-scraper
example/advanced-reddit-scraper/.git/HEAD
ref: refs/heads/main
example/advanced-reddit-scraper/.git/index
Downloadexample/advanced-reddit-scraper/.git/ORIG_HEAD
d39c6a0e47a92ee8fc7f4baace6c8c2ef406bb45
example/advanced-reddit-scraper/src/cookies.json
[ { "domain": ".reddit.com", "hostOnly": false, "httpOnly": false, "name": "csrf_token", "path": "/", "sameSite": "strict", "secure": true, "session": true, "storeId": "0", "value": "eb3157bc50b012701ac2f7aab49fcc4c", "id": 1 }, { "domain": ".reddit.com", "expirationDate": 1757533137, "hostOnly": false, "httpOnly": false, "name": "csv", "path": "/", "sameSite": "no_restriction", "secure": true, "session": false, "storeId": "0", "value": "2", "id": 2 }, { "domain": ".reddit.com", "expirationDate": 1757533137, "hostOnly": false, "httpOnly": false, "name": "edgebucket", "path": "/", "sameSite": "unspecified", "secure": true, "session": false, "storeId": "0", "value": "lH2AY01VDhdJZrAOeK", "id": 3 }, { "domain": ".reddit.com", "expirationDate": 1761371475, "hostOnly": false, "httpOnly": false, "name": "loid", "path": "/", "sameSite": "no_restriction", "secure": true, "session": false, "storeId": "0", "value": "0000000000927nml2x.2.1606468634160.Z0FBQUFBQm1zN01jTVgwSUpobndnMVlHS2xfcGNXdk1SbXhpMjJtN0NNa2VCOFZBZ3Zlb3loSGFlZWtxWlNkdHk5cUxZVVZtNDdWQWl6M0xOdXhRc3FsWmVob0pfQXdjQjItZ1pkOHFmTWsxVVFQU194SjEwTi10MHI2ay1TU01EYjhDVjdpclUxVFg", "id": 4 }, { "domain": ".reddit.com", "expirationDate": 1759110222, "hostOnly": false, "httpOnly": false, "name": "pc", "path": "/", "sameSite": "unspecified", "secure": true, "session": false, "storeId": "0", "value": "81", "id": 5 }, { "domain": ".reddit.com", "expirationDate": 1757612826, "hostOnly": false, "httpOnly": true, "name": "reddit_session", "path": "/", "sameSite": "unspecified", "secure": true, "session": false, "storeId": "0", "value": "710093989689%2C2024-08-07T17%3A47%3A05%2C62b4116104fbf3597d47b0718c6986d009b6f8c6", "id": 6 }, { "domain": ".reddit.com", "hostOnly": false, "httpOnly": false, "name": "session_tracker", "path": "/", "sameSite": "no_restriction", "secure": true, "session": true, "storeId": "0", "value": "mdmqaqfjmphfropmga.0.1727922697221.Z0FBQUFBQm1fZ0lKOVhiTHFwazVhYXBQa0FSS2VUTllqd2ljRmhuNFozRHVnZmkxU1JOcmZBd1dteXRPSmJxS0x3S2s0YVE2VEVRaGk1M0JMei1TV1Q2RGN4STZ4aHhCWnJhSEtsRDZsdEZveFVxeUhnVjNrSFNjOFpJRmM0bEREdVZfR2UyYTdZM2U", "id": 7 }, { "domain": ".reddit.com", "expirationDate": 1759458549, "hostOnly": false, "httpOnly": false, "name": "t2_927nml2x_recentclicks3", "path": "/", "sameSite": "strict", "secure": false, "session": false, "storeId": "0", "value": "t3_pgyvok%2Ct3_1fsuzj4%2Ct3_1fk6551%2Ct3_eokkto%2Ct3_14x7ys7%2Ct3_17wo9ms%2Ct3_dpcb2z%2Ct3_16fac9r%2Ct3_analu0%2Ct3_142jsph", "id": 8 }, { "domain": ".reddit.com", "expirationDate": 1728008948.6718, "hostOnly": false, "httpOnly": true, "name": "token_v2", "path": "/", "sameSite": "unspecified", "secure": true, "session": false, "storeId": "0", "value": 
"eyJhbGciOiJSUzI1NiIsImtpZCI6IlNIQTI1NjpzS3dsMnlsV0VtMjVmcXhwTU40cWY4MXE2OWFFdWFyMnpLMUdhVGxjdWNZIiwidHlwIjoiSldUIn0.eyJzdWIiOiJ1c2VyIiwiZXhwIjoxNzI4MDA4OTQ4LjE0MDk0MywiaWF0IjoxNzI3OTIyNTQ4LjE0MDk0MiwianRpIjoiNE5wUE5zejMzWkhrWXI0cktxZU9hazJiY0tYMkRRIiwiY2lkIjoiMFItV0FNaHVvby1NeVEiLCJsaWQiOiJ0Ml85MjdubWwyeCIsImFpZCI6InQyXzkyN25tbDJ4IiwibGNhIjoxNjA2NDY4NjM0MTYwLCJzY3AiOiJlSnhra2RHT3REQUloZC1sMXo3Ql95cF9OaHRzY1lhc0xRYW9rM243RFZvY2s3MDdjTDRpSFA4bktJcUZMRTJ1QktHa0tXRUZXdE9VTmlMdjU4eTlPWkVGU3lGVFI4NDN5d29rYVVwUFVtTjVweWxSd1daa0xsZmFzVUtEQjZZcFZTNloyMEtQUzV2UTNJMUZ6MDZNcWx4V0h0VFlvM0pwYkdNSzJ4UGp6Y1pxUXlxdXk2bE1ZRmtvbjhXTGZ2eUctdFktZjdiZmhIWXdyS2dLRF9UT3VGeHdZX0hERkhiX25wcjBiRjJ3cUwzWGc5US0xLU4yN2JObW9kbTVfVnpQdnphU2NUbUc1aWZZdjd0LUNSMTQ1SG1aVVFjd1lnMF95ckFqNl9Ddk9vREtCUVdNSlloUEk1QXJsMl9fSmRpdVRmOGF0eWQtLUdiRVRXXzRyUm1vNXhMRW9VX2o2emNBQVBfX1hEX2U0dyIsInJjaWQiOiJBNjE2cG1hN0taX1R1SzRVOFJlQlJUaXVKV3VBZ3lUY2VRTUpyS01NRk93IiwiZmxvIjoyfQ.GX8N8AYcgK2DWqWPqiclkljcwEawb7GFRw6QMdL9C7lb5FS-_ofuZpR0bx77pgWjWJ9uOczItTUfZvjx9u4CgeS9dK3U8G1apuqUW9YWDrgxfQeFWNMPVd0IjDTEt6Sn8vrdWb5cjv_SsGzxHgtC2RjdDLQYfQu2ud-Qp_1sELlBDPHDfhgOPbuOpzuFz2NJ8ifj623r2a8XOgQi5UaAHEClgleVAdkN2bpMd1kUsYh0PmMZOpN2XqvgdwKJUuyce-9yAqhMLiIPneVJnaytpth0jeRkT5-Fyt-_CgsXYphTG9T9u8Q2Z5JwOrwiosBPEokbhjculNQ78QlUUlC7UA", "id": 9 } ]
example/advanced-reddit-scraper/src/main.py
1"""This module defines the main entry point for the Apify Actor.2
3Feel free to modify this file to suit your specific needs.4
5To build Apify Actors, utilize the Apify SDK toolkit, read more at the official documentation:6https://docs.apify.com/sdk/python7"""8
9import sys10import os11
12from httpx import AsyncClient13
14from apify import Actor, Request15from .redditor import Redditor16
17
18async def main() -> None:19 """Main entry point for the Apify Actor.20
21 This coroutine is executed using `asyncio.run()`, so it must remain an asynchronous function for proper execution.22 Asynchronous execution is required for communication with Apify platform, and it also enhances performance in23 the field of web scraping significantly.24 """25 async with Actor:26 # Retrieve the Actor input, and use default values if not provided.27 actor_input = await Actor.get_input() or {}28 queries = actor_input.get('queries', ['https://reddit.com/r/AskReddit'])29 limit = actor_input.get('limit', 10)30 num_comments = actor_input.get('numComments', 0)31 sorting = actor_input.get('postSort', 'top')32 comment_sort = actor_input.get('commentSort', 'top')33 sorting_period = actor_input.get('topPeriod', 'today')34 if sorting_period not in {'hour', 'day', 'week', 'month', 'year', 'all'}:35 raise ValueError('topPeriod must be one of hour, day, week, month, year, all')36 if sorting not in {'hot', 'new', 'top', 'rising'}:37 raise ValueError('postSort must be one of hot, new, top, rising')38 if comment_sort not in {'best', 'top', 'new','controversial', 'old', 'qa'}:39 raise ValueError('commentSort must be one of hot, new, top, best, top, new,controversial, old, qa')40 reddit_scraper = Redditor(logger=Actor.log)41
42 # Exit if no start URLs are provided.43 if not queries:44 Actor.log.info('No queries specified in Actor input, exiting...')45 await Actor.exit()46
47 # Open the default request queue for handling URLs to be processed.48 request_queue = await Actor.open_request_queue()49
50 # Enqueue the start URLs with an initial crawl depth of 0.51 for query in queries:52 url = reddit_scraper.subreddit_link_from_query(query, sorting=sorting, period=sorting_period)53 Actor.log.info(f'Enqueuing {url} ...')54 request = Request.from_url(url, user_data={'limit': limit, 'numComments': num_comments, 'query': query})55 await request_queue.add_request(request)56
57 # Process the URLs from the request queue.58 while request := await request_queue.fetch_next_request():59 url = request.url60 query = request.user_data['query']61 posts_limit = request.user_data['limit']62 num_comments = request.user_data['numComments']63 Actor.log.info(f'Scraping {request.url} ...')64
65 try:66 # Fetch the HTTP response from the specified URL using HTTPX.67 async with AsyncClient() as client:68 # response = await client.get(url, follow_redirects=True)69 for post in reddit_scraper.get_all_posts(url, posts_limit=posts_limit, comments_limit=num_comments):70 await Actor.push_data(post)71
72 except Exception:73 Actor.log.exception(f'Failed to scrape {url}. Will be tried again.')74 try:75 Actor.log.exception(f"Data failed to be pushed is {post}")76 except:77 pass78 finally:79 # Mark the request as handled to ensure it is not processed again.80 await request_queue.mark_request_as_handled(request)
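Once this example Actor is deployed, it can be called from Python with the apify-client package. The following is a minimal sketch, not taken from the repository; the Actor identifier and token placeholder are illustrative and depend on where you publish it.

from apify_client import ApifyClient

client = ApifyClient("YOUR_APIFY_TOKEN")  # placeholder token

# Start a run and wait for it to finish; the input follows the schema shown earlier.
run_input = {"queries": ["https://reddit.com/r/AskReddit"], "limit": 5}
run = client.actor("your-username/advanced-reddit-scraper").call(run_input=run_input)

# Each scraped post is pushed with Actor.push_data(), so it lands in the default dataset.
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item["title"], item["permalink"])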
example/advanced-reddit-scraper/src/redditor.py
import base64
import functools
import re
import time
import urllib.parse
from typing import Any, Dict, List, Optional, Tuple
from urllib.parse import urlparse, urlunparse, parse_qs, urlencode

import requests
from bs4 import BeautifulSoup

from .session import cookies, headers


def log_execution_time(func):
    @functools.wraps(func)
    def wrapper(self, *args, **kwargs):
        start_time = time.time()
        result = func(self, *args, **kwargs)
        end_time = time.time()
        duration = end_time - start_time
        self.logger.debug(f"{func.__name__} took {duration:.2f} seconds to execute")
        return result
    return wrapper


class Redditor:
    BASE_URL = "https://www.reddit.com"

    def __init__(self, logger):
        self.logger = logger
        self.cookies = cookies
        self.headers = headers
        self.session = requests.Session()
        self.session.cookies.update(cookies)
        self.session.headers.update(headers)

    @log_execution_time
    def get_community_posts(self, url: str, after: Optional[str] = None) -> str:
        try:
            parsed_query = self.parse_url(url)
            sort = parsed_query.get('sort', 'top')
            url = f"{self.BASE_URL}/svc/shreddit/community-more-posts/{sort}/"
            params = {
                "after": after,
                "t": parsed_query['time'] or 'day',
                "name": parsed_query['sub'],
                "navigationSessionId": "a10adc86-f1ec-4221-9179-d9613e4c7d05",
                "feedLength": "28"
            }

            response = self.session.get(url, params=params)
            response.raise_for_status()

            return response.text
        except requests.exceptions.RequestException as e:
            self.logger.error(f"Error fetching community posts: {e}")
            raise
        except Exception as e:
            self.logger.error(f"Unexpected error: {e}")
            raise

    @log_execution_time
    def get_post_content(self, permalink: str) -> str:
        """Get the content of a post using its permalink."""
        try:
            url = f"{self.BASE_URL}{permalink}"
            response = self.session.get(url)
            response.raise_for_status()
            soup = BeautifulSoup(response.text, 'html.parser')

            # Find the post content in the text-body slot
            text_body = soup.find('div', {'slot': 'text-body'})
            if text_body:
                md_div = text_body.find('div', {'class': 'md'})
                if md_div:
                    paragraphs = md_div.find_all('p')
                    return '\n'.join(p.get_text(strip=True) for p in paragraphs)

            # If no text content, check for media content
            shreddit_post = soup.find('shreddit-post')
            if shreddit_post:
                content_href = shreddit_post.get('content-href')
                if content_href:
                    return content_href

            return ''
        except Exception as e:
            self.logger.error(f"Error getting post content: {e}")
            return ''

    @log_execution_time
    def parse_posts(self, html_content: str) -> List[Dict[str, Any]]:
        try:
            soup = BeautifulSoup(html_content, 'html.parser')
            posts = []
            for article in soup.find_all('article'):
                shreddit_post = article.find('shreddit-post')
                if shreddit_post:
                    permalink = shreddit_post.get('permalink')
                    post = {
                        "id": shreddit_post.get('id'),
                        "title": shreddit_post.get('post-title'),
                        "author": shreddit_post.get('author'),
                        "subreddit": shreddit_post.get('subreddit-prefixed-name'),
                        "score": shreddit_post.get('score'),
                        "num_comments": shreddit_post.get('comment-count'),
                        "created_timestamp": shreddit_post.get('created-timestamp'),
                        "permalink": permalink,
                        "content": self.get_post_content(permalink)
                    }
                    posts.append(post)
            return posts
        except Exception as e:
            self.logger.error(f"Error parsing posts: {e}")
            raise

    @log_execution_time
    def get_next_cursor(self, html_content: str) -> Optional[str]:
        try:
            soup = BeautifulSoup(html_content, 'html.parser')
            load_after = soup.find('faceplate-partial', slot='load-after')

            if load_after:
                src = load_after.get('src', '')
                match = re.search(r'after=([^&]+)', src)
                if match:
                    encoded_cursor = match.group(1)
                    decoded_cursor = urllib.parse.unquote(encoded_cursor)
                    padding = '=' * ((4 - len(decoded_cursor) % 4) % 4)
                    padded_cursor = decoded_cursor + padding
                    return base64.b64decode(padded_cursor).decode('utf-8')
            return None
        except Exception as e:
            self.logger.error(f"Error retrieving next cursor: {e}")
            return None

    @log_execution_time
    def get_all_posts(self, subreddit: str, posts_limit: int = 100, comments_limit: int = 0) -> List[Dict[str, Any]]:
        all_posts = []
        after = None

        try:
            while len(all_posts) < posts_limit:
                self.logger.info(f"Fetching posts for subreddit {subreddit}...")
                html_content = self.get_community_posts(subreddit, after)
                new_posts = self.parse_posts(html_content)[:posts_limit - len(all_posts)]

                if not new_posts:
                    break

                for post in new_posts:
                    if comments_limit > 0:
                        post['comments'] = self.get_all_comments(post['subreddit'].split('/')[1], post['id'], comments_limit)

                all_posts.extend(new_posts)
                after = self.get_next_cursor(html_content)

                if not after:
                    break

            self.logger.info(f"Retrieved {len(all_posts[:posts_limit])} posts.")
            return all_posts[:posts_limit]
        except Exception as e:
            self.logger.error(f"Error retrieving posts: {e}")
            raise

    @log_execution_time
    def parse_url(self, url: str) -> Dict[str, str]:
        result = {'sub': '', 'sort': 'none', 'time': None}

        try:
            subreddit_pattern = re.compile(r'(?:/r/|reddit\.com/r/|^)(\w+)')
            sort_pattern = re.compile(r'/(hot|new|top|rising)')
            time_pattern = re.compile(r'[?&]t=(hour|day|week|month|year|all)')

            if not url.startswith('http'):
                match = subreddit_pattern.search(url)
                if match:
                    result['sub'] = match.group(1)
                return result

            path = urlparse(url).path
            query_string = urlparse(url).query

            sub_match = subreddit_pattern.search(path)
            if sub_match:
                result['sub'] = sub_match.group(1)

            sort_match = sort_pattern.search(path)
            if sort_match:
                result['sort'] = sort_match.group(1)

            time_match = time_pattern.search(query_string)
            if time_match:
                result['time'] = time_match.group(1)

            return result
        except Exception as e:
            self.logger.error(f"Error parsing URL: {e}")
            raise

    @log_execution_time
    def get_comments(self, subreddit: str, post_id: str, cursor: Optional[str] = None, sort: str = 'hot') -> Tuple[List[Dict[str, Any]], Optional[str]]:
        try:
            url = f"{self.BASE_URL}/svc/shreddit/more-comments/{subreddit}/t3_{post_id.split('_')[1]}"
            params = {'sort': sort, 'top-level': '1'}
            data = {}

            if cursor:
                params['cursor'] = cursor

            response = self.session.post(url, params=params, data=data)
            response.raise_for_status()

            return self.parse_comments(response.text)
        except requests.exceptions.RequestException as e:
            self.logger.error(f"Error fetching comments: {e}")
            raise
        except Exception as e:
            self.logger.error(f"Unexpected error: {e}")
            raise

    @log_execution_time
    def parse_comments(self, html_content: str) -> Tuple[List[Dict[str, Any]], Optional[str]]:
        try:
            soup = BeautifulSoup(html_content, 'html.parser')
            comments = []

            for comment in soup.find_all('shreddit-comment'):
                content_div = comment.find('div', {'class': 'md'})
                # Extract clean comment text if the content div exists
                if content_div:
                    # Join the content paragraphs with newlines, stripping whitespace
                    paragraphs = content_div.find_all('p')
                    content = '\n'.join(p.get_text(strip=True) for p in paragraphs)
                else:
                    content = ''
                parsed_comment = {
                    "id": comment.get('thingid'),
                    "author": comment.get('author'),
                    "score": comment.get('score'),
                    "depth": comment.get('depth'),
                    "permalink": comment.get('permalink'),
                    "content": content.strip()
                }
                comments.append(parsed_comment)

            next_cursor = self.get_next_comment_cursor(html_content)
            return comments, next_cursor
        except Exception as e:
            self.logger.error(f"Error parsing comments: {e}")
            raise

    @log_execution_time
    def get_next_comment_cursor(self, html_content: str) -> Optional[str]:
        try:
            soup = BeautifulSoup(html_content, 'html.parser')
            faceplate_partial = soup.find('faceplate-partial', attrs={'loading': 'action'})

            if faceplate_partial:
                hidden_input = faceplate_partial.find('input', attrs={'type': 'hidden', 'name': 'cursor'})
                if hidden_input:
                    return hidden_input.get('value')
            return None
        except Exception as e:
            self.logger.error(f"Error retrieving next comment cursor: {e}")
            return None

    @log_execution_time
    def get_all_comments(self, subreddit: str, post_id: str, limit: int = 100) -> List[Dict[str, Any]]:
        all_comments = []
        cursor = None

        try:
            while len(all_comments) < limit:
                comments, next_cursor = self.get_comments(subreddit, post_id, cursor)
                all_comments.extend(comments)

                if not next_cursor:
                    self.logger.info(f"Next cursor not found for post {post_id}.")
                    break

                cursor = next_cursor

            self.logger.info(f"Retrieved {len(all_comments)} comments.")
            return all_comments[:limit]
        except Exception as e:
            self.logger.error(f"Error retrieving comments: {e}")
            raise

    @log_execution_time
    def subreddit_link_from_query(self, query, sorting='top', period='week'):
        try:
            # If the input is just a subreddit name (with or without 'r/'),
            # normalize it to a full https://www.reddit.com/r/<subreddit>/ URL.
            if not query.startswith('http'):
                if query.startswith('r/'):
                    query = f'https://www.reddit.com/{query}/'
                else:
                    query = f'https://www.reddit.com/r/{query}/'

            # Parse the subreddit link
            parsed_url = urlparse(query)

            # Split the path into its components
            path_parts = parsed_url.path.rstrip('/').split('/')

            # Valid sorting options
            valid_sorting = ['hot', 'new', 'rising', 'top']

            # Return the original link if it is already sorted
            if len(path_parts) > 3 and path_parts[3] in valid_sorting:
                return query

            # Otherwise, append the sorting method to the path
            path_parts.append(sorting)

            # Add the 't' parameter only if sorting is 'top'
            query_params = parse_qs(parsed_url.query)
            if sorting == 'top':
                query_params['t'] = [period]

            # Rebuild the URL
            new_path = '/'.join(path_parts) + '/'
            new_query = urlencode(query_params, doseq=True)

            return urlunparse((parsed_url.scheme, parsed_url.netloc, new_path, parsed_url.params, new_query, parsed_url.fragment))
        except Exception as e:
            self.logger.error(f"Error constructing subreddit URL from query: {e}")
            raise
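Because Redditor does all of its HTTP work synchronously through requests, it can also be exercised outside the Actor runtime. A minimal local sketch follows, assuming the example repo root is on sys.path so that src imports as a package, and that the cookies and headers bundled in session.py are still accepted by Reddit (stale session cookies may be rejected).

import logging

from src.redditor import Redditor

logging.basicConfig(level=logging.INFO)
scraper = Redditor(logger=logging.getLogger("redditor"))

# Build a sorted subreddit URL the same way main.py does, then pull a few posts.
url = scraper.subreddit_link_from_query("AskReddit", sorting="top", period="week")
for post in scraper.get_all_posts(url, posts_limit=5, comments_limit=0):
    print(post["title"], post["permalink"])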
example/advanced-reddit-scraper/src/session.py
1cookies = {2 'csv': '2',3 'edgebucket': 'lH2AY01VDhdJZrAOeK',4 'loid': '0000000000927nml2x.2.1606468634160.Z0FBQUFBQm1zN01jTVgwSUpobndnMVlHS2xfcGNXdk1SbXhpMjJtN0NNa2VCOFZBZ3Zlb3loSGFlZWtxWlNkdHk5cUxZVVZtNDdWQWl6M0xOdXhRc3FsWmVob0pfQXdjQjItZ1pkOHFmTWsxVVFQU194SjEwTi10MHI2ay1TU01EYjhDVjdpclUxVFg',5 'pc': '81',6 'reddit_session': '710093989689%2C2024-08-07T17%3A47%3A05%2C62b4116104fbf3597d47b0718c6986d009b6f8c6',7 'token_v2': 'eyJhbGciOiJSUzI1NiIsImtpZCI6IlNIQTI1NjpzS3dsMnlsV0VtMjVmcXhwTU40cWY4MXE2OWFFdWFyMnpLMUdhVGxjdWNZIiwidHlwIjoiSldUIn0.eyJzdWIiOiJ1c2VyIiwiZXhwIjoxNzI5MTk4MTQ0LjY5NDYyMSwiaWF0IjoxNzI5MTExNzQ0LjY5NDYyMSwianRpIjoiZXBsV3k0R1VURHl4aGwtdEhWZnI2U0lxY00xR0lnIiwiY2lkIjoiMFItV0FNaHVvby1NeVEiLCJsaWQiOiJ0Ml85MjdubWwyeCIsImFpZCI6InQyXzkyN25tbDJ4IiwibGNhIjoxNjA2NDY4NjM0MTYwLCJzY3AiOiJlSnhra2RHT3REQUloZC1sMXo3Ql95cF9OaHRzY1lhc0xRYW9rM243RFZvY2s3MDdjTDRpSFA4bktJcUZMRTJ1QktHa0tXRUZXdE9VTmlMdjU4eTlPWkVGU3lGVFI4NDN5d29rYVVwUFVtTjVweWxSd1daa0xsZmFzVUtEQjZZcFZTNloyMEtQUzV2UTNJMUZ6MDZNcWx4V0h0VFlvM0pwYkdNSzJ4UGp6Y1pxUXlxdXk2bE1ZRmtvbjhXTGZ2eUctdFktZjdiZmhIWXdyS2dLRF9UT3VGeHdZX0hERkhiX25wcjBiRjJ3cUwzWGc5US0xLU4yN2JObW9kbTVfVnpQdnphU2NUbUc1aWZZdjd0LUNSMTQ1SG1aVVFjd1lnMF95ckFqNl9Ddk9vREtCUVdNSlloUEk1QXJsMl9fSmRpdVRmOGF0eWQtLUdiRVRXXzRyUm1vNXhMRW9VX2o2emNBQVBfX1hEX2U0dyIsInJjaWQiOiJBNjE2cG1hN0taX1R1SzRVOFJlQlJUaXVKV3VBZ3lUY2VRTUpyS01NRk93IiwiZmxvIjoyfQ.FBxK7Xnbhy-bW3l71YopqqUBpjfkOdz8XBUauNi3o3pScQLvO0sOs72E2kMiaYX6iTfUPyklR5xRnGVF6PjQmurx2vu8XAm3W1IkGIYPZOOvjnWKhbzv1m8bzfOHGSIZg9bOy7RoCce6A-HCKfR6y4nQyMaiv5jCUdLILePHdUYw3kZEC_ASAXEXvv-dyyaO2GCW_Jxq95CU6lYxqLaO73xhPzR9YjNl_RaAC9xMip6d5Xe3n5wuMdY8bQ3dAfqNVNJKI4fkIij0v90-SJT7vKffNSbueqrckCPgDIXQrpJA1_bx-npHLl5gg7-uBLwDUzXpWMO_BTDxgekscFc6fQ',8 'reddit_chat_view': 'closed',9 't2_927nml2x_recentclicks3': 't3_1g57do3%2Ct3_e7ewoo%2Ct3_1fvuu0l%2Ct3_435p6x%2Ct3_d956ag%2Ct3_15svnqa%2Ct3_f2nxzt%2Ct3_e6ryal%2Ct3_79uq5s%2Ct3_7qry4j',10 'csrf_token': 'd7886d7dde33b8ae9f535d8cf19dad8f',11 'session_tracker': 'mifofnihaddjdlkjml.0.1729129739819.Z0FBQUFBQm5FRzBMWUZrSlZycUctVmcwZ25zZm9ZRTV4T1NMNjdQTW45dTI1eFQ1NDVqTWF2N20yQzlXNVFCUkEyNndKazVCbWJ1ZHFoVlFZMEFPS2xGYXpDY2Fxcm4xX1F6UEZfWFpfal92NTVuRDF6Q0EzTWtOT3lZOENQQUVBaFlScWQwMGpqZFk',12}13
14headers = {15 'accept': 'text/vnd.reddit.partial+html, text/html;q=0.9',16 'accept-language': 'en,en-US;q=0.9,tr-TR;q=0.8,tr;q=0.7,de;q=0.6',17 'content-type': 'application/x-www-form-urlencoded',18 # 'cookie': 'csv=2; edgebucket=lH2AY01VDhdJZrAOeK; loid=0000000000927nml2x.2.1606468634160.Z0FBQUFBQm1zN01jTVgwSUpobndnMVlHS2xfcGNXdk1SbXhpMjJtN0NNa2VCOFZBZ3Zlb3loSGFlZWtxWlNkdHk5cUxZVVZtNDdWQWl6M0xOdXhRc3FsWmVob0pfQXdjQjItZ1pkOHFmTWsxVVFQU194SjEwTi10MHI2ay1TU01EYjhDVjdpclUxVFg; pc=81; reddit_session=710093989689%2C2024-08-07T17%3A47%3A05%2C62b4116104fbf3597d47b0718c6986d009b6f8c6; token_v2=eyJhbGciOiJSUzI1NiIsImtpZCI6IlNIQTI1NjpzS3dsMnlsV0VtMjVmcXhwTU40cWY4MXE2OWFFdWFyMnpLMUdhVGxjdWNZIiwidHlwIjoiSldUIn0.eyJzdWIiOiJ1c2VyIiwiZXhwIjoxNzI5MTk4MTQ0LjY5NDYyMSwiaWF0IjoxNzI5MTExNzQ0LjY5NDYyMSwianRpIjoiZXBsV3k0R1VURHl4aGwtdEhWZnI2U0lxY00xR0lnIiwiY2lkIjoiMFItV0FNaHVvby1NeVEiLCJsaWQiOiJ0Ml85MjdubWwyeCIsImFpZCI6InQyXzkyN25tbDJ4IiwibGNhIjoxNjA2NDY4NjM0MTYwLCJzY3AiOiJlSnhra2RHT3REQUloZC1sMXo3Ql95cF9OaHRzY1lhc0xRYW9rM243RFZvY2s3MDdjTDRpSFA4bktJcUZMRTJ1QktHa0tXRUZXdE9VTmlMdjU4eTlPWkVGU3lGVFI4NDN5d29rYVVwUFVtTjVweWxSd1daa0xsZmFzVUtEQjZZcFZTNloyMEtQUzV2UTNJMUZ6MDZNcWx4V0h0VFlvM0pwYkdNSzJ4UGp6Y1pxUXlxdXk2bE1ZRmtvbjhXTGZ2eUctdFktZjdiZmhIWXdyS2dLRF9UT3VGeHdZX0hERkhiX25wcjBiRjJ3cUwzWGc5US0xLU4yN2JObW9kbTVfVnpQdnphU2NUbUc1aWZZdjd0LUNSMTQ1SG1aVVFjd1lnMF95ckFqNl9Ddk9vREtCUVdNSlloUEk1QXJsMl9fSmRpdVRmOGF0eWQtLUdiRVRXXzRyUm1vNXhMRW9VX2o2emNBQVBfX1hEX2U0dyIsInJjaWQiOiJBNjE2cG1hN0taX1R1SzRVOFJlQlJUaXVKV3VBZ3lUY2VRTUpyS01NRk93IiwiZmxvIjoyfQ.FBxK7Xnbhy-bW3l71YopqqUBpjfkOdz8XBUauNi3o3pScQLvO0sOs72E2kMiaYX6iTfUPyklR5xRnGVF6PjQmurx2vu8XAm3W1IkGIYPZOOvjnWKhbzv1m8bzfOHGSIZg9bOy7RoCce6A-HCKfR6y4nQyMaiv5jCUdLILePHdUYw3kZEC_ASAXEXvv-dyyaO2GCW_Jxq95CU6lYxqLaO73xhPzR9YjNl_RaAC9xMip6d5Xe3n5wuMdY8bQ3dAfqNVNJKI4fkIij0v90-SJT7vKffNSbueqrckCPgDIXQrpJA1_bx-npHLl5gg7-uBLwDUzXpWMO_BTDxgekscFc6fQ; reddit_chat_view=closed; t2_927nml2x_recentclicks3=t3_1g57do3%2Ct3_e7ewoo%2Ct3_1fvuu0l%2Ct3_435p6x%2Ct3_d956ag%2Ct3_15svnqa%2Ct3_f2nxzt%2Ct3_e6ryal%2Ct3_79uq5s%2Ct3_7qry4j; csrf_token=d7886d7dde33b8ae9f535d8cf19dad8f; session_tracker=mifofnihaddjdlkjml.0.1729129739819.Z0FBQUFBQm5FRzBMWUZrSlZycUctVmcwZ25zZm9ZRTV4T1NMNjdQTW45dTI1eFQ1NDVqTWF2N20yQzlXNVFCUkEyNndKazVCbWJ1ZHFoVlFZMEFPS2xGYXpDY2Fxcm4xX1F6UEZfWFpfal92NTVuRDF6Q0EzTWtOT3lZOENQQUVBaFlScWQwMGpqZFk',19 'origin': 'https://www.reddit.com',20 'priority': 'u=1, i',21 'referer': 'https://www.reddit.com/r/AskReddit/comments/1g57do3/whats_a_bitter_life_lesson_you_learned_from_your/',22 'sec-ch-ua': '"Google Chrome";v="129", "Not=A?Brand";v="8", "Chromium";v="129"',23 'sec-ch-ua-mobile': '?0',24 'sec-ch-ua-platform': '"Windows"',25 'sec-fetch-dest': 'empty',26 'sec-fetch-mode': 'cors',27 'sec-fetch-site': 'same-origin',28 'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/129.0.0.0 Safari/537.36',29}
example/advanced-reddit-scraper/src/__main__.py
import asyncio

from .main import main

# Execute the Actor entrypoint.
asyncio.run(main())