Voice Input

Speak your tasks. Whisper-powered transcription with Web Speech API fallback for instant, hands-free task capture.

01

Groq Whisper Transcription

Powered by Groq's Whisper large-v3-turbo model, voice input is transcribed with exceptional accuracy and speed. The cloud-based engine handles accents, background noise, and domain-specific vocabulary with ease — turning your spoken words into structured data.

Processing happens in seconds, not minutes. Groq's inference speed makes voice input feel instant.

Groq Whisper v3-turbo

Listening...

Speak naturally

Latency: 340ms Accuracy: 98.2%
02

Web Speech API Fallback

When cloud transcription isn't available — whether due to network issues, privacy preferences, or cost considerations — the browser's built-in Web Speech API takes over seamlessly. No configuration required, no degraded experience.

Automatic fallback means voice input always works, regardless of connectivity.

Transcription Provider

Groq Whisper Primary

Connected and ready

Web Speech API Fallback

Standby — activates automatically

03

Real-Time Waveform

Watch your voice come alive as you speak. A real-time audio waveform visualization gives immediate feedback that the system is listening. The visual response helps you gauge volume, pacing, and when to pause for natural sentence breaks.

Motion-safe animations respect user preferences for reduced motion.

Audio Waveform

Duration: 00:04
Recording
04

Voice-to-Task Conversion

Speak a task, get a task. The transcribed text flows directly into the AI parsing engine, which extracts titles, priorities, deadlines, and categories. The entire pipeline — from speech to structured task — happens in a single, fluid interaction.

"Remind me to review the proposal by Friday, it's urgent" becomes a high-priority task due Friday.

Voice Input

"Remind me to review the proposal by Friday, it's urgent"

Parsed Task

Review the proposal

Urgent Due Friday Reminder
05

Multi-Language Support

Whisper supports over 90 languages natively. Speak in English, Spanish, Japanese, Arabic, or any other supported language and the system transcribes accurately. Language detection is automatic — just start speaking.

The Web Speech API fallback supports the languages available in your browser.

Language Detection

EN English
99%
ES Español
97%
JP 日本語
96%
AR العربية
95%
FR Français
98%
DE Deutsch
97%

90+ languages supported natively

Ready to get started?

Try mAiTasks free during the beta and experience AI-powered productivity.