Roadmap

Future features and improvements planned for Intervu.

Overview

Intervu is actively developed with a focus on improving the interview experience. This roadmap lists planned and already implemented features, organized by phase.

Source

This roadmap is based on internal development planning.


Phase 1: Core UX Improvements

High-impact improvements to daily usage.

Keyboard Shortcuts

Enable quick actions without mouse interaction.

| Shortcut | Action |
| --- | --- |
| Ctrl+Space | Toggle microphone |
| Ctrl+R | Regenerate answer |
| Ctrl+L | Clear history |
| Ctrl+, | Open settings |
| Ctrl+S | Export session |

Status: Planned


Answer Length Control

Let users choose answer verbosity.

| Option | Description |
| --- | --- |
| Brief | 2-3 sentences |
| Medium | 1-2 paragraphs |
| Detailed | Full comprehensive answer |

Status: Planned
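
Since this feature is still planned, the following is only a sketch of one way the three options could be turned into a prompt instruction. The names `LENGTH_INSTRUCTIONS` and `build_system_prompt`, and the exact instruction wording, are assumptions for illustration and not part of Intervu.

```python
# Hypothetical mapping from the three planned options to prompt instructions.
LENGTH_INSTRUCTIONS = {
    "brief": "Answer in 2-3 sentences.",
    "medium": "Answer in 1-2 paragraphs.",
    "detailed": "Give a full, comprehensive answer.",
}


def build_system_prompt(base_prompt: str, length: str = "medium") -> str:
    """Append the length instruction for the chosen option to the base prompt."""
    try:
        instruction = LENGTH_INSTRUCTIONS[length.lower()]
    except KeyError:
        raise ValueError(f"unknown answer length: {length!r}") from None
    return f"{base_prompt.rstrip()}\n\n{instruction}"
```

Keeping the options in a single table means a new verbosity level would only need one extra entry.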


Session Recording & Export (✅ Implemented)

Save and export interview transcripts.

  • Export transcript and answers as Markdown
  • Copy to clipboard functionality
  • Export button in title bar

Status: Implemented


Phase 2: Enhanced Assistance

Advanced features for better interview support.

STAR Framework Support

Automatic detection and formatting for behavioral questions.

  • Detect "Tell me about a time..." questions
  • Inject STAR prompt template
  • Format answers with Situation/Task/Action/Result sections

Status: Planned
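
The detect-then-inject flow described above could look roughly like this. It is a minimal sketch for a feature that is not yet implemented: the cue phrases beyond "Tell me about a time", the template wording, and the function names are all assumptions.

```python
import re

# Illustrative cue phrases for behavioral questions (assumed, not Intervu's list).
BEHAVIORAL_PATTERN = re.compile(
    r"\b(tell me about a time|describe a situation|give me an example of)\b",
    re.IGNORECASE,
)

# Hypothetical STAR instruction appended to the system prompt.
STAR_TEMPLATE = (
    "Structure the answer with these labeled sections: "
    "Situation, Task, Action, Result."
)


def is_behavioral(question: str) -> bool:
    """Return True if the question contains a behavioral cue phrase."""
    return BEHAVIORAL_PATTERN.search(question) is not None


def with_star_prompt(system_prompt: str, question: str) -> str:
    """Append the STAR template only when the question looks behavioral."""
    if is_behavioral(question):
        return f"{system_prompt}\n\n{STAR_TEMPLATE}"
    return system_prompt
```

A phrase list is easy to audit but will miss paraphrased questions, so a real implementation might combine it with an LLM-based check.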


Interview Question Bank (✅ Implemented)

Pre-prepared Q&A reference bank and company/role context for the LLM.

  • Q&A Bank field for pre-prepared interview answers and preferences
  • Company / Role Context field for company info, job description, team details
  • Both injected into LLM system message alongside resume
  • Edit-in-window support for comfortable long-text editing

Status: Implemented


Real-Time Answer Edits

Edit AI-generated answers while streaming.

  • Click answer to enter edit mode
  • Pause streaming during edit
  • Revert to original answer

Status: Planned


Follow-Up Question Suggestions

Suggest thoughtful questions to ask the interviewer.

  • Generate after each answer
  • Copy-to-clipboard functionality
  • Toggle in settings

Status: Planned


Phase 3: Advanced Features

Professional-grade capabilities.

Multiple Resume Profiles

Quick switching between different resumes.

  • Profile management UI
  • Quick switch dropdown
  • Custom system prompt per profile

Status: Planned


Contextual Answer Tips (✅ Implemented)

Key points extraction and confidence scoring.

  • Bullet-point summary
  • Confidence meter (0-100)
  • Visual highlights in answer

Status: Implemented


Answer Quality Feedback (✅ Implemented)

Rate generated answers for learning.

  • Thumbs up/down
  • In-context learning from ratings
  • Local storage of preferences

Status: Implemented


Advanced Mode (✅ Implemented)

Dual-LLM question extraction.

  • Extract complete questions
  • Filter non-questions
  • Queue mode for multiple questions

Status: Implemented


macOS Support (✅ Implemented)

Full macOS support with AVFoundation audio capture.

  • AVFoundation-based device enumeration
  • BlackHole integration for system audio
  • Multi-output device for audio routing
  • DMG and ZIP builds (x64 + arm64)
  • Dock icon and proper app naming
  • Microphone permission handling

Status: Implemented


Application Branding (✅ Implemented)

App icon and logo for all platforms.

  • Custom icon for Windows, macOS, and Linux
  • Logo in title bar
  • Icon bundled in all builds

Status: Implemented


Speaker Diarization (✅ Implemented)

Identify individual speakers in system audio for panel interviews.

  • Toggle in Settings with on-enable validation against STT backend
  • Dual backend support: auto-detects /asr (whisperx-asr-service) vs OpenAI-compatible endpoints
  • Recommended backend: learnedmachine/whisperx-asr-service (used by Speakr) for reliable speaker embeddings
  • Client-side embedding matching (cosine similarity, weighted moving average 30/70) for cross-chunk speaker consistency
  • Ambiguity check: skips matching when top 2 candidates are within 5% similarity gap
  • Per-speaker transcript entries with unique colors
  • Click-to-rename speaker labels in transcript pane
  • SpeakerMap persisted in settings, min speakers configurable (2-6)
  • Runtime hard error when backend doesn't support diarization
  • Retry-with-fallback on 500 errors (NaN embedding workaround)

Status: Implemented (speaker diarization, phase 1)
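
The client-side matching described above (cosine similarity, a 30/70 weighted moving average, and the ambiguity check) can be sketched as follows. Several details are assumptions rather than facts from this roadmap: that the 30% weight applies to the new chunk and 70% to the stored profile, that "5%" means an absolute cosine gap of 0.05, and the `MIN_SIMILARITY` acceptance threshold, which is purely hypothetical.

```python
import math

NEW_WEIGHT = 0.3       # assumption: 30% new chunk, 70% stored profile
AMBIGUITY_GAP = 0.05   # assumption: "5%" read as an absolute cosine gap
MIN_SIMILARITY = 0.6   # hypothetical acceptance threshold, not from the roadmap


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_speaker(embedding, profiles):
    """Return the best-matching speaker id, or None if there is no confident match."""
    scored = sorted(
        ((cosine_similarity(embedding, stored), speaker_id)
         for speaker_id, stored in profiles.items()),
        reverse=True,
    )
    if not scored or scored[0][0] < MIN_SIMILARITY:
        return None
    if len(scored) > 1 and scored[0][0] - scored[1][0] < AMBIGUITY_GAP:
        return None  # top two candidates are too close, so skip matching
    return scored[0][1]


def update_profile(stored, embedding):
    """Blend the new embedding into the stored one (weighted moving average)."""
    return [(1 - NEW_WEIGHT) * s + NEW_WEIGHT * e for s, e in zip(stored, embedding)]
```

For example, with profiles `{"SPEAKER_00": [1.0, 0.0], "SPEAKER_01": [0.0, 1.0]}`, the embedding `[0.9, 0.1]` matches `SPEAKER_00`, while `[1.0, 1.0]` is equally close to both and is left unmatched.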


Speaker Monitor Panel (✅ Implemented)

Centralized speaker management panel for diarization sessions.

  • Popover panel in title bar (visible when diarization is enabled)
  • Per-speaker info: first seen, last spoke, message count, last 3 messages
  • Click-to-rename speakers directly in monitor
  • Clear speaker data (names + embeddings) without losing transcript
  • Fixed: clear transcript now also resets speaker names and embeddings

Status: Implemented


LLM Speaker-Aware Prompting (✅ Implemented)

Enrich LLM prompts with speaker identity for better question attribution.

  • System prompt includes multi-speaker context with detected speaker names
  • All non-'you' messages prefixed with [DisplayName]: using speakerMap
  • Auto-trigger fires on any non-'you' speaker (not just 'interviewer')
  • Question detection finds latest non-'you' entry for card titles
  • Advanced mode extractor uses display names from speakerMap

Status: Implemented (speaker diarization, phase 2)
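
As a rough illustration of the prefixing and question-detection rules above, the sketch below assumes transcript entries are dictionaries with `speaker` and `text` keys and that `speaker_map` maps raw labels such as `SPEAKER_00` to display names. Both shapes and both function names are hypothetical.

```python
def format_transcript_for_llm(entries, speaker_map):
    """Prefix every non-'you' message with [DisplayName]: using the speaker map."""
    lines = []
    for entry in entries:
        speaker, text = entry["speaker"], entry["text"]
        if speaker == "you":
            lines.append(text)
        else:
            # Fall back to the raw label when the speaker has not been renamed.
            display_name = speaker_map.get(speaker, speaker)
            lines.append(f"[{display_name}]: {text}")
    return lines


def latest_question_entry(entries):
    """Return the most recent non-'you' entry, or None if there is none."""
    for entry in reversed(entries):
        if entry["speaker"] != "you":
            return entry
    return None
```

Treating every speaker other than 'you' the same way is what lets the auto-trigger work in panel interviews, where no single label means "interviewer".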


AI-Assisted Speaker Naming (✅ Implemented)

LLM detects real names from conversational cues and suggests them for unnamed speakers.

  • LLM analyzes transcript for name mentions ("Hi Anna", "John, what do you think?")
  • Sparkle hints appear next to unnamed SPEAKER_XX labels in transcript and speaker monitor
  • Accept/dismiss buttons for each suggestion
  • Uses extractor LLM (advanced mode) or main LLM
  • Debounced analysis (8s after last transcript update, 5+ entries required)
  • Dismissed suggestions remembered per session
  • Accepted suggestions persist to speakerMap

Status: Implemented (speaker diarization, phase 3)
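
The debounce rule above (8 seconds after the last transcript update, at least 5 entries) can be modeled without real timers by passing timestamps in explicitly. This is a sketch under that assumption; the class name `NameAnalysisDebouncer` and its methods are hypothetical.

```python
class NameAnalysisDebouncer:
    """Decide when name analysis may run: quiet period elapsed and enough entries."""

    def __init__(self, quiet_seconds=8.0, min_entries=5):
        self.quiet_seconds = quiet_seconds
        self.min_entries = min_entries
        self.last_update = None
        self.entry_count = 0
        self.pending = False

    def on_transcript_update(self, now, entry_count):
        """Record a transcript change; this restarts the quiet period."""
        self.last_update = now
        self.entry_count = entry_count
        self.pending = True

    def should_run(self, now):
        """Return True at most once per quiet period, when both conditions hold."""
        if not self.pending or self.entry_count < self.min_entries:
            return False
        if now - self.last_update < self.quiet_seconds:
            return False
        self.pending = False  # avoid re-running until the transcript changes again
        return True
```

Injecting `now` instead of reading a clock keeps the logic deterministic and easy to test; a caller would typically pass `time.monotonic()`.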


Chunk Duration Tuning & Speaker Profile Persistence (✅ Implemented)

Optimized audio chunking for diarization and optional cross-session speaker recognition.

  • Audio chunk duration auto-enforced to 6s+ when diarization is ON
  • Chunk duration setting locked in UI with explanatory hint
  • "Remember Speakers" toggle — saves voice profiles to disk
  • Cross-session speaker recognition from saved embeddings
  • Clear speaker data also clears saved profiles

Status: Implemented (speaker diarization, phase 4)
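
One plausible reading of "auto-enforced to 6s+" is a lower bound on the configured chunk duration whenever diarization is on. The sketch below assumes that reading; the function and constant names are hypothetical.

```python
MIN_DIARIZATION_CHUNK_SECONDS = 6.0  # minimum stated in this roadmap


def effective_chunk_seconds(configured_seconds, diarization_enabled):
    """Raise the chunk duration to the diarization minimum when diarization is on."""
    if diarization_enabled:
        return max(configured_seconds, MIN_DIARIZATION_CHUNK_SECONDS)
    return configured_seconds
```

Under this reading, a 3-second setting becomes 6 seconds with diarization on, while a 10-second setting is left unchanged.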


Interview Timer (✅ Implemented)

Track interview duration and speaking time.

  • Elapsed time display in title bar with pulsing indicator when LIVE
  • Speaking ratio breakdown (You vs Them) on hover
  • Configurable time warnings at custom intervals
  • Session-based timer that pauses when LIVE is off
  • Clear timer option in clear menu

Status: Implemented
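
The "You vs Them" breakdown can be computed from per-segment speaking durations. This sketch assumes each segment is a dictionary with `speaker` and `seconds` keys and that every speaker other than `you` counts as "Them". How Intervu actually measures speaking time is not stated here.

```python
def speaking_ratio(segments):
    """Return (you_share, them_share) of total speaking time as fractions."""
    you = sum(s["seconds"] for s in segments if s["speaker"] == "you")
    them = sum(s["seconds"] for s in segments if s["speaker"] != "you")
    total = you + them
    if total == 0:
        return 0.0, 0.0  # nothing spoken yet
    return you / total, them / total
```

For instance, 30 seconds from `you` against 90 seconds from two other speakers yields `(0.25, 0.75)`.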


Phase 4: Nice-to-Have

Features for future consideration.

Code Signing

Sign all application binaries for Windows and macOS.

  • Apple Developer certificate for macOS
  • Windows code signing certificate
  • Eliminates quarantine workaround on macOS
  • Reduces SmartScreen warnings on Windows

Status: Planned


Sentiment Analysis

Detect interviewer interest/confusion.

  • Real-time sentiment indicators
  • Visual cues in transcript
  • Suggestion to elaborate

Status: Planned


Practice Mode

Pre-interview simulation with random questions.

  • Question bank by category
  • Record and compare answers
  • Self-rating system

Status: Planned


Cloud Sync

Sync settings and history across devices.

  • Google Drive integration
  • Dropbox integration
  • Local file sync

Status: Planned


Feature Requests

Have an idea? We'd love to hear it!

  1. Check existing issues on GitHub
  2. Open a new issue with the "feature request" label
  3. Describe your use case and expected behavior

Implementation Priority

Features are prioritized based on:

  1. User Impact — How many users benefit
  2. Implementation Effort — Development time required
  3. Maintenance Cost — Ongoing support needed
  4. Dependencies — Blocking requirements

Recently Completed

| Feature | Version | Description |
| --- | --- | --- |
| PGVector Context Retrieval | v1.2.0 | Semantic search over indexed resume, Q&A bank, and company context |
| Chat Window | v1.2.0 | Dedicated chat window for freeform questions with live context |
| Session Export | v1.2.0 | Export transcript and answers as Markdown or copy to clipboard |
| Resizable Settings | v1.2.0 | Settings dialog can be resized and persists size across sessions |
| Chunk Tuning & Profile Persistence | v1.1.0 | Auto 6s chunks for diarization, cross-session speaker recognition |
| AI Speaker Naming | v1.1.0 | LLM detects names from transcript cues, suggests for unnamed speakers |
| LLM Speaker Prompting | v1.1.0 | Multi-speaker context in LLM prompts, speaker-aware auto-trigger and question attribution |
| Speaker Monitor | v1.1.0 | Popover panel for speaker management with metadata, rename, and clear |
| Speaker Diarization | v1.1.0 | Multi-speaker identification with dual-backend support and cross-chunk voice tracking |
| macOS Support | v1.0.0 | AVFoundation audio, BlackHole, multi-output device |
| App Branding | v1.0.0 | Icons and logo for all platforms |
| Windows Build | v1.0.0 | NSIS installer + portable executable |
| Advanced Mode | v1.0.0 | Dual-LLM question extraction |
| Contextual Tips | v1.0.0 | Key points and confidence |
| Answer Ratings | v1.0.0 | Thumbs up/down feedback |
| Question Bank | v1.0.2 | Q&A bank and company/role context fields |
| LLM Queue Mode | v1.0.0 | Process questions in order |
| Streaming Answers | v1.0.0 | Real-time answer generation |

Platform Support

| Platform | Status | Notes |
| --- | --- | --- |
| Windows | ✅ Supported | DirectShow audio, bundled FFmpeg |
| macOS | ✅ Supported | AVFoundation audio, manual FFmpeg install required |
| Linux | 🔜 Planned | Not yet tested |

Last updated: see the GitHub commit history for current status.

Made with ❤️ by Aldrick Bonaobra