Vocalize HR Screen

An AI-powered voice screening agent for HR interviews using LiveKit, LangGraph, and Google Gemini. This system conducts automated 15-minute HR screening calls to evaluate candidates across basic qualifications, motivation, logistical fit, and communication skills.

💡 The Story Behind This Project

The Problem: Job seekers often struggle with interview anxiety and lack practice opportunities for screening calls, while startups and small companies can't afford dedicated HR teams for initial candidate screening.

The Solution: Vocalize HR Screen bridges this gap by providing:

🎯 For Job Seekers: A safe environment to practice mock interviews and receive structured feedback on their screening performance
🚀 For Startups: An affordable, consistent screening solution that evaluates candidates professionally without requiring HR expertise
⚖️ For Everyone: Standardized, bias-free initial screening that focuses on qualifications and fit rather than subjective impressions

This project democratizes professional HR screening, making it accessible to both candidates who want to improve and companies that need efficient hiring processes.

🚀 Quick Start

Prerequisites

Python 3.11+
uv (Python package manager)
just (command runner)

Installation

Clone the repository:

git clone git@github.com:0xRichardH/vocalize-hr-screen.git
cd vocalize-hr-screen

Install dependencies:

uv sync

Set up environment variables (see API Keys section below)

🔑 API Keys

Create a .env file in the project root with the following API keys:

Required API Keys

AssemblyAI (Speech-to-Text)
- Sign up: https://www.assemblyai.com/dashboard/signup
- Get your API key from the dashboard
```
ASSEMBLYAI_API_KEY=your_assemblyai_api_key_here
```
Cartesia (Text-to-Speech)
- Sign up: https://play.cartesia.ai/sign-up
- Get your API key from the dashboard
```
CARTESIA_API_KEY=your_cartesia_api_key_here
```

LiveKit Cloud

Sign up: https://cloud.livekit.io/
Create a project and get your credentials

LIVEKIT_URL=wss://your-project.livekit.cloud
LIVEKIT_API_KEY=your_livekit_api_key
LIVEKIT_API_SECRET=your_livekit_api_secret

Google Gemini (LLM)
- Get your API key from Google AI Studio
```
GOOGLE_API_KEY=your_google_api_key_here
```

Interview Configuration

Add these configuration variables to your .env file:

# Interview Settings
CANDIDATE_NAME="John Doe"
COMPANY_NAME="Tech Innovators Inc"
JOB_ROLE="Senior Software Engineer"
INTERVIEW_DURATION_MINUTES=15
WARNING_THRESHOLD_MINUTES=5

# Optional: Model Configuration
CHAT_MODEL="google_genai:gemini-2.5-flash"
GUARDRAIL_MODEL="google_genai:gemini-2.5-flash-lite"
WEB_SEARCH_MODEL="gemini-2.0-flash"

🏃‍♂️ How to Run

Development Mode

just dev

This command runs:

uv run app.py dev

Using uv directly

# Install dependencies
just uv sync

# Run the application
just uv run app.py dev

🧪 Testing with LiveKit Agents Playground

Access the Playground: Visit https://agents-playground.livekit.io/
Configure Connection:
- Enter your LiveKit Cloud URL (from .env)
- Enter your API Key and Secret (from .env)
Connect to Your Agent:
- Make sure your agent is running (just dev)
- In the playground, connect to your room
- Start speaking to begin the HR screening interview
Test Flow:
- The agent (Rachel) will introduce herself
- She'll ask about your background and experience
- Answer naturally as if in a real HR screening
- The interview will automatically end after 15 minutes

🏗️ Architecture Overview

Core Components

Voice Agent (voice_agent/)
- Handles voice interactions via LiveKit
- Integrates STT (AssemblyAI), TTS (Cartesia), and VAD (Silero)
- Bridges voice input/output with the LangGraph agent
HR Screen Agent (hr_screen_agent/)
- LangGraph-based conversational agent
- Conducts structured HR screening interviews
- Uses Google Gemini for reasoning and responses
Tools & Capabilities:
- Document processing (CV/resume reading)
- Web search for company/role research
- Time tracking and management
- Interview summary generation
- Guardrails for safety and relevance

Key Features

Intelligent Conversation Flow: Structured 15-minute interviews with time awareness
Document Analysis: Automatically reads and analyzes uploaded CVs/resumes
Safety Guardrails: Prevents jailbreaking and keeps conversations relevant
Comprehensive Evaluation: Assesses qualifications, motivation, logistics, and communication
Voice Interruption Handling: Natural turn-taking with voice activity detection
Persistent State: SQLite-based conversation checkpoints

📋 Input Files

Place candidate documents in the input/ folder:

CVs/Resumes: PDF format (.pdf)
Job Descriptions: Markdown format (.md)

The agent will automatically:

List available files at interview start
Read and analyze relevant documents
Use document content to inform interview questions

🔄 Workflow Diagram

graph TB
    subgraph "Voice Interface"
        A[User Speech] --> B[AssemblyAI STT]
        B --> C[Voice Agent]
        D[Cartesia TTS] --> E[Agent Speech]
        C --> D
    end

    subgraph "Core Agent"
        C --> F[LLM Adapter]
        F --> G[HR Screen Agent]
        G --> H[LangGraph Executor]
    end

    subgraph "Guardrails"
        I[Pre-Model Hook] --> J[Jailbreak Check]
        I --> K[Relevance Check]
        J --> L[Safety Filter]
        K --> L
    end

    subgraph "Tools"
        M[Document Loader] --> N[CV/JD Analysis]
        O[Web Search] --> P[Company Research]
        Q[Time Tracker] --> R[Interview Management]
        S[Think Tool] --> T[Reasoning]
        U[Summary Tool] --> V[Interview Report]
    end

    subgraph "Data Sources"
        W[Input Folder] --> M
        X[SQLite DB] --> Y[Conversation State]
        Z[Google Gemini] --> G
    end

    F --> I
    L --> H
    H --> M
    H --> O
    H --> Q
    H --> S
    H --> U

    style A fill:#e1f5fe
    style E fill:#e8f5e8
    style G fill:#fff3e0
    style I fill:#fce4ec

🎯 Interview Process

1. Preparation Phase

Initialize 15-minute timer
Read available CVs and job descriptions
Research company/role context via web search

2. Interview Execution

Introduction: Agent introduces herself as Rachel from the company
Qualification Verification: Validates resume claims against job requirements
Motivation Assessment: Explores interest in role and company
Logistics Discussion: Covers salary, availability, work authorization
Communication Evaluation: Assesses throughout the conversation

3. Conclusion & Documentation

Time-aware graceful ending
Comprehensive interview summary generation
Call termination with summary for HR review

🛡️ Safety Features

Jailbreak Prevention: Detects and blocks attempts to extract system prompts
Relevance Filtering: Keeps conversations focused on professional topics
Time Management: Automatic interview duration control
Professional Boundaries: Maintains appropriate HR screening context

📊 Output & Evaluation

The agent generates structured interview summaries including:

Basic Qualifications: Skills/experience alignment with job requirements
Interest & Motivation: Genuine interest evaluation and job search reasons
Logistical Fit: Salary expectations, availability, work authorization
Communication & Professionalism: Overall communication effectiveness
Recommendation: Proceed/Hold/Reject with detailed justification
Key Highlights: Notable points for next interview rounds

🔧 Development

Project Structure

vocalize-hr-screen/
├── app.py                      # Main application entry point
├── hr_screen_agent/            # Core HR screening logic
│   ├── agent.py               # LangGraph agent creation
│   ├── configuration.py       # Environment configuration
│   ├── prompts.py            # Agent instructions & prompts
│   ├── state.py              # Conversation state schema
│   ├── hooks/                # Pre-processing hooks
│   │   ├── guardrail.py      # Safety guardrail implementations
│   │   └── pre_model_hook.py # Request preprocessing
│   └── tools/                # Agent capabilities
│       ├── document_loader.py # CV/JD processing
│       ├── time_tracker.py   # Interview timing
│       ├── web_search.py     # Company research
│       ├── think.py          # Internal reasoning
│       ├── interview_summary.py # Report generation
│       └── end_call.py       # Call termination
├── voice_agent/               # Voice interface
│   ├── agent.py              # LiveKit voice agent
│   └── llm_adapter.py        # Voice-to-LangGraph bridge
└── input/                    # Document storage
    ├── *.pdf                 # Candidate CVs/resumes
    └── *.md                  # Job descriptions

Extending the Agent

To add new capabilities:

Create new tools in hr_screen_agent/tools/
Register tools in hr_screen_agent/tools/__init__.py
Add tool to agent creation in hr_screen_agent/agent.py
Update prompts if needed in hr_screen_agent/prompts.py

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
hr_screen_agent		hr_screen_agent
input		input
voice_agent		voice_agent
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
SUBMISSION.md		SUBMISSION.md
app.py		app.py
justfile		justfile
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vocalize HR Screen

💡 The Story Behind This Project

🚀 Quick Start

Prerequisites

Installation

🔑 API Keys

Required API Keys

Interview Configuration

🏃‍♂️ How to Run

Development Mode

Using uv directly

🧪 Testing with LiveKit Agents Playground

🏗️ Architecture Overview

Core Components

Key Features

📋 Input Files

🔄 Workflow Diagram

🎯 Interview Process

1. Preparation Phase

2. Interview Execution

3. Conclusion & Documentation

🛡️ Safety Features

📊 Output & Evaluation

🔧 Development

Project Structure

Extending the Agent

📝 License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

0xRichardH/vocalize-hr-screen

Folders and files

Latest commit

History

Repository files navigation

Vocalize HR Screen

💡 The Story Behind This Project

🚀 Quick Start

Prerequisites

Installation

🔑 API Keys

Required API Keys

Interview Configuration

🏃‍♂️ How to Run

Development Mode

Using uv directly

🧪 Testing with LiveKit Agents Playground

🏗️ Architecture Overview

Core Components

Key Features

📋 Input Files

🔄 Workflow Diagram

🎯 Interview Process

1. Preparation Phase

2. Interview Execution

3. Conclusion & Documentation

🛡️ Safety Features

📊 Output & Evaluation

🔧 Development

Project Structure

Extending the Agent

📝 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages